1 |
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System
|
|
|
|
In: http://infoscience.epfl.ch/record/284989 (2021)
|
|
BASE
|
|
Show details
|
|
2 |
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
|
|
|
|
In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
|
|
BASE
|
|
Show details
|
|
3 |
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Approaches to automatic lexicon learning with limited training examples
|
|
|
|
In: http://infoscience.epfl.ch/record/203451 (2014)
|
|
BASE
|
|
Show details
|
|
5 |
Subspace Gaussian Mixture Models for speech recognition
|
|
|
|
In: http://infoscience.epfl.ch/record/203448 (2014)
|
|
BASE
|
|
Show details
|
|
6 |
Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models
|
|
|
|
In: http://infoscience.epfl.ch/record/203450 (2014)
|
|
BASE
|
|
Show details
|
|
7 |
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation
|
|
|
|
In: http://infoscience.epfl.ch/record/198446 (2014)
|
|
Abstract:
This paper presents a study on multilingual deep neural network (DNN) based acoustic modeling and its application to new languages. We investigate the effect of phone merging on multilingual DNN in context of rapid language adaptation. Moreover, the combination of multilingual DNNs with Kullback--Leibler divergence based acoustic modeling (KL-HMM) is explored. Using ten different languages from the Globalphone database, our studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training. We also found that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios. Furthermore, the experiments indicate that multilingual DNN training equally benefits from simple phoneset concatenation and manually derived universal phonesets.
|
|
URL: https://doi.org/10.1109/ICASSP.2014.6855086 http://infoscience.epfl.ch/record/198446 https://infoscience.epfl.ch/record/198446/files/Vu_ICASSP_2014.pdf
|
|
BASE
|
|
Hide details
|
|
8 |
The Kaldi Speech Recognition Toolkit
|
|
|
|
In: http://infoscience.epfl.ch/record/192584 (2013)
|
|
BASE
|
|
Show details
|
|
9 |
The Kaldi Speech Recognition Toolkit
|
|
|
|
In: http://infoscience.epfl.ch/record/192761 (2013)
|
|
BASE
|
|
Show details
|
|
|
|