DE eng

Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System
In: http://infoscience.epfl.ch/record/284989 (2021)
BASE
Show details
2
CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
BASE
Show details
3
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework ...
BASE
Show details
4
Approaches to automatic lexicon learning with limited training examples
In: http://infoscience.epfl.ch/record/203451 (2014)
BASE
Show details
5
Subspace Gaussian Mixture Models for speech recognition
In: http://infoscience.epfl.ch/record/203448 (2014)
BASE
Show details
6
Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models
In: http://infoscience.epfl.ch/record/203450 (2014)
BASE
Show details
7
Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation
In: http://infoscience.epfl.ch/record/198446 (2014)
Abstract: This paper presents a study on multilingual deep neural network (DNN) based acoustic modeling and its application to new languages. We investigate the effect of phone merging on multilingual DNN in context of rapid language adaptation. Moreover, the combination of multilingual DNNs with Kullback--Leibler divergence based acoustic modeling (KL-HMM) is explored. Using ten different languages from the Globalphone database, our studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training. We also found that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios. Furthermore, the experiments indicate that multilingual DNN training equally benefits from simple phoneset concatenation and manually derived universal phonesets.
URL: https://doi.org/10.1109/ICASSP.2014.6855086
http://infoscience.epfl.ch/record/198446
https://infoscience.epfl.ch/record/198446/files/Vu_ICASSP_2014.pdf
BASE
Hide details
8
The Kaldi Speech Recognition Toolkit
In: http://infoscience.epfl.ch/record/192584 (2013)
BASE
Show details
9
The Kaldi Speech Recognition Toolkit
In: http://infoscience.epfl.ch/record/192761 (2013)
BASE
Show details
10
A basis representation of constrained MLLR transforms for robust adaptation
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 26 (2012) 1, 35-51
BLLDB
OLC Linguistik
Show details
11
The subspace Gaussian mixture model - a structured model for speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 25 (2011) 2, 404-439
BLLDB
OLC Linguistik
Show details
12
Minimum Bayes risk decoding and system combination based on a recursion for edit distance
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 25 (2011) 4, 802-828
BLLDB
OLC Linguistik
Show details
13
Advances in Arabic speech transcription at IBM under the DARPA GALE program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 5, 884-894
BLLDB
OLC Linguistik
Show details
14
Advances in speech transcription at IBM under the DARPA EARS program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1596-1608
BLLDB
OLC Linguistik
Show details

Catalogues
0
0
5
0
0
0
0
Bibliographies
5
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
9
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern