1 |
Language Recognition for Dialects and Closely Related Languages
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
2 |
Language Model Data Augmentation for Keyword Spotting
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association , Jan 2016, San Francisco, United States (2016)
Abstract:
This research extends our earlier work on using machine translation (MT) and word-based recurrent neural networks to augment language model training data for keyword search in conversational Cantonese speech. MT-based data augmentation is applied to two language pairs: English-Lithuanian and English-Amharic. Using filtered N-best MT hypotheses for language modeling is found to perform better than just using the 1-best translation. Target language texts collected from the Web and filtered to select conversational-like data are used in several manners. In addition to using Web data for training the language model of the speech recognizer, we further investigate using this data to improve the language model and phrase table of the MT system to get better translations of the English data. Finally, generating text data with a character-based recurrent neural network is investigated. This approach allows new word forms to be produced, providing a way to reduce the out-of-vocabulary rate and thereby improve keyword spotting performance. We study how these different methods of language model data augmentation impact speech-to-text and keyword spotting performance for the Lithuanian and Amharic languages. The best results are obtained by combining all of the explored methods.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; language modeling; low-resourced languages; machine translation; speech recognition; text augmentation
URL: https://hal.archives-ouvertes.fr/hal-01837186
3 |
Investigating techniques for low resource conversational speech recognition
In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016) ; https://hal-univ-lemans.archives-ouvertes.fr/hal-01515254 ; 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Mar 2016, Shanghai, China. pp.5975-5979, ⟨10.1109/ICASSP.2016.7472824⟩ ; www.icassp2016.org (2016)
4 |
Multimodal Emotion Recognition for AVEC 2016 Challenge
In: Audio/Visual Emotion Challenge ; https://hal.archives-ouvertes.fr/hal-01837203 ; Audio/Visual Emotion Challenge, ACM, Oct 2016, Amsterdam, Netherlands (2016)
5 |
Marginal Contrast Among Romanian Vowels: Evidence from ASR and Functional Load
In: Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01453014 ; Interspeech 2016, ISCA, Sep 2016, San Francisco, United States. pp.2433 - 2437, ⟨10.21437/Interspeech.2016-762⟩ ; http://www.interspeech2016.org/ (2016)
6 |
Réalisation phonétique et contraste phonologique marginal : une étude automatique des voyelles du roumain [Phonetic realization and marginal phonological contrast: an automatic study of Romanian vowels]
In: JEP 2016 ; https://hal.archives-ouvertes.fr/hal-01452974 ; JEP 2016, Aug 2016, Paris, France (2016)
7 |
BULB: Breaking the Unwritten Language Barrier
In: Procedia Computer Science ; Computational Methods for Endangered Language Documentation and Description ; https://hal.archives-ouvertes.fr/hal-01836496 ; Computational Methods for Endangered Language Documentation and Description, May 2016, Yogyakarta, Indonesia. pp.8-14, ⟨10.1016/j.procs.2016.04.023⟩ (2016)
8 |
A phonologically weak contrast can induce phonetic overlap
In: Laboratory Phonology Conference ; https://hal.archives-ouvertes.fr/hal-01837204 ; Laboratory Phonology Conference, Jul 2016, Ithaca, United States (2016)
9 |
Breaking the unwritten language barrier: the BULB project
In: SLTU-2016 5th Workshop on Spoken Language Technologies for Under-resourced languages ; https://halshs.archives-ouvertes.fr/halshs-01428027 ; SLTU-2016 5th Workshop on Spoken Language Technologies for Under-resourced languages, May 2016, Yogyakarta, Indonesia. ⟨10.1016/j.procs.2016.04.023⟩ (2016)
10 |
Innovative technologies for under-resourced language documentation: The BULB Project
In: CCURL proceedings ; Workshop CCURL 2016 - Collaboration and Computing for Under-Resourced Languages - LREC ; https://hal.archives-ouvertes.fr/hal-01350124 ; Workshop CCURL 2016 - Collaboration and Computing for Under-Resourced Languages - LREC, May 2016, Portoroz, Slovenia (2016)
12 |
Machine Translation Based Data Augmentation for Cantonese Keyword Spotting (Author's Manuscript)
13 |
Investigating Techniques for Low Resource Conversational Speech Recognition