Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2021 (1)
  - 2020 (1)
  - 2017 (1)
  - 2014 (4)
  - 2013 (2)
  - 2012 (1)
  - 2011 (2)
  - 2009 (1)
  - 2006 (1)
- Medium
- Type:
  - Article (13)
  - Miscellaneous (1)
- BLLDB-Access:
  - free (14)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 14 of 14

1	Lattice-Free Maximum Mutual Information Training of Multilingual Speech Recognition System
	Madikeri, Srikanth; Khonglah, Banriskhem; Tong, Sibo...
	In: http://infoscience.epfl.ch/record/284989 (2021)
	BASE
	Show details

2	CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
	Watanabe, Shinji; Mandel, Michael; Barker, Jon...
	In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
	BASE
	Show details

3	Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework ...
	Zhang, Xiaohui; Manohar, Vimal; Povey, Daniel. - : arXiv, 2017
	BASE
	Show details

4	Approaches to automatic lexicon learning with limited training examples
	Goel, Nagendra; Thomas, Samuel; Agarwal, Mohit...
	In: http://infoscience.epfl.ch/record/203451 (2014)
	BASE
	Show details

5	Subspace Gaussian Mixture Models for speech recognition
	Povey, Daniel; Burget, Lukas; Agarwal, Mohit...
	In: http://infoscience.epfl.ch/record/203448 (2014)
	BASE
	Show details

6	Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models
	Burget, Lukas; Schwarz, Petr; Agarwal, Mohit...
	In: http://infoscience.epfl.ch/record/203450 (2014)
	BASE
	Show details

7	Multilingual Deep Neural Network based Acoustic Modeling For Rapid Language Adaptation
	Vu, Ngoc Thang; Imseng, David; Povey, Daniel; Motlicek, Petr; Schultz, Tanja; Bourlard, Hervé
	In: http://infoscience.epfl.ch/record/198446 (2014)
	Abstract: This paper presents a study on multilingual deep neural network (DNN) based acoustic modeling and its application to new languages. We investigate the effect of phone merging on multilingual DNN in context of rapid language adaptation. Moreover, the combination of multilingual DNNs with Kullback--Leibler divergence based acoustic modeling (KL-HMM) is explored. Using ten different languages from the Globalphone database, our studies reveal that crosslingual acoustic model transfer through multilingual DNNs is superior to unsupervised RBM pre-training and greedy layer-wise supervised training. We also found that KL-HMM based decoding consistently outperforms conventional hybrid decoding, especially in low-resource scenarios. Furthermore, the experiments indicate that multilingual DNN training equally benefits from simple phoneset concatenation and manually derived universal phonesets.
	URL: https://doi.org/10.1109/ICASSP.2014.6855086 http://infoscience.epfl.ch/record/198446 https://infoscience.epfl.ch/record/198446/files/Vu_ICASSP_2014.pdf
	BASE
	Hide details

8	The Kaldi Speech Recognition Toolkit
	Povey, Daniel; Ghoshal, Arnab; Boulianne, Gilles...
	In: http://infoscience.epfl.ch/record/192584 (2013)
	BASE
	Show details

9	The Kaldi Speech Recognition Toolkit
	Povey, Daniel; Ghoshal, Arnab; Boulianne, Gilles...
	In: http://infoscience.epfl.ch/record/192761 (2013)
	BASE
	Show details

10	A basis representation of constrained MLLR transforms for robust adaptation
	Povey, Daniel; Yao, Kaisheng
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 26 (2012) 1, 35-51
	BLLDB
	OLC Linguistik
	Show details

11	The subspace Gaussian mixture model - a structured model for speech recognition
	Akyazı, Pınar; Thomas, Samuel; Schwarz, Petr...
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 25 (2011) 2, 404-439
	BLLDB
	OLC Linguistik
	Show details

12	Minimum Bayes risk decoding and system combination based on a recursion for edit distance
	Xu, Haihua; Zhu, Jie; Povey, Daniel...
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 25 (2011) 4, 802-828
	BLLDB
	OLC Linguistik
	Show details

13	Advances in Arabic speech transcription at IBM under the DARPA GALE program
	Kuo, Hong-Kwang Jeff; Emami, Ahmad; Povey, Daniel...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 5, 884-894
	BLLDB
	OLC Linguistik
	Show details

14	Advances in speech transcription at IBM under the DARPA EARS program
	Kingsbury, Brian; Mangu, Lidia; Povey, Daniel...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1596-1608
	BLLDB
	OLC Linguistik
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern