DE eng

Search in the Catalogues and Directories

Hits 1 – 11 of 11

1
MAGIC DUST FOR CROSS-LINGUAL ADAPTATION OF MONOLINGUAL WAV2VEC-2.0
In: ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03544515 ; ICASSP 2022, May 2022, Singapour, Singapore (2022)
BASE
Show details
2
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0 ...
Abstract: We propose a simple and effective cross-lingual transfer learning method to adapt monolingual wav2vec-2.0 models for Automatic Speech Recognition (ASR) in resource-scarce languages. We show that a monolingual wav2vec-2.0 is a good few-shot ASR learner in several languages. We improve its performance further via several iterations of Dropout Uncertainty-Driven Self-Training (DUST) by using a moderate-sized unlabeled speech dataset in the target language. A key finding of this work is that the adapted monolingual wav2vec-2.0 achieves similar performance as the topline multilingual XLSR model, which is trained on fifty-three languages, on the target language ASR task. ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://dx.doi.org/10.48550/arxiv.2110.03560
https://arxiv.org/abs/2110.03560
BASE
Hide details
3
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning
In: Interspeech 2020 ; https://hal.archives-ouvertes.fr/hal-02912029 ; Interspeech 2020, Oct 2020, Shanghai, China (2020)
BASE
Show details
4
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning ...
BASE
Show details
5
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning ...
BASE
Show details
6
DARTS: Dialectal Arabic Transcription System ...
BASE
Show details
7
The Summa Platform Prototype ...
BASE
Show details
8
The Summa Platform Prototype ...
BASE
Show details
9
The SUMMA Platform Prototype
In: http://infoscience.epfl.ch/record/233575 (2017)
BASE
Show details
10
Multi-view Dimensionality Reduction for Dialect Identification of Arabic Broadcast Speech ...
BASE
Show details
11
Automatic Dialect Detection in Arabic Broadcast Speech ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
11
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern