Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 19 of 19

1	Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction ...
	Soltau, Hagen; Wang, Mingqiu; Shafran, Izhak. - : arXiv, 2021
	BASE
	Show details

2	Joint Speech Recognition and Speaker Diarization via Sequence Transduction ...
	Shafey, Laurent El; Soltau, Hagen; Shafran, Izhak. - : arXiv, 2019
	BASE
	Show details

3	Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition ...
	Soltau, Hagen; Liao, Hank; Sak, Hasim. - : arXiv, 2016
	BASE
	Show details

4	Boosting systems for large vocabulary continuous speech recognition
	Saon, George; Soltau, Hagen
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 2, 212-218
	BLLDB
	OLC Linguistik
	Show details

5	Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers: Presentation Slides
	Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia. - : Odyssey 2010, The Speaker and Language Recognition Workshop, 2010
	BASE
	Show details

6	Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers
	Soltau, Hagen; Hirschberg, Julia Bell; Biadsy, Fadi. - : Odyssey 2010, The Speaker and Language Recognition Workshop, 2010
	BASE
	Show details

7	Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers ...
	Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia; Navratil, Jiri; Hirschberg, Julia Bell. - : Columbia University, 2010
	Abstract: In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (CD) phonetic differences between dialects as well as phonotactics. Given a speech utterance, we obtain the phone sequence using a CD-phone recognizer. We then identify the most likely dialect of these CD-phones using SVM classifiers. Augmenting these phones with the output of these classifiers, we extract augmented phonotactic features which are subsequently given to a logistic regression classifier to obtain a dialect detection score. We test our approach on the task of detecting four Arabic dialects from 30s utterances. We compare our performance to two baselines, PRLM and GMM-UBM, as well as to our own improved version of GMM-UBM which employs fMLLR adaptation. Our approach performs significantly better than all three baselines at 5% absolute Equal Error Rate (EER). The overall EER of our system is 6%. ...
	Keyword: Computer science; FOS Languages and literature; Information technology; Linguistics
	URL: https://dx.doi.org/10.7916/d8cr62vb https://academiccommons.columbia.edu/doi/10.7916/D8CR62VB
	BASE
	Hide details

8	Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers: Presentation Slides ...
	Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia. - : Columbia University, 2010
	BASE
	Show details

9	Advances in Arabic speech transcription at IBM under the DARPA GALE program
	Kuo, Hong-Kwang Jeff; Emami, Ahmad; Povey, Daniel...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 5, 884-894
	BLLDB
	OLC Linguistik
	Show details

10	A one-pass decoder based on polymorphic linguistic context assignment
	Soltau, Hagen; Metze, Florian; Fuegen, Christian. - 2008
	BASE
	Show details

11	Efficient language model lookahead through polymorphic linguistic context assignment
	Soltau, Hagen; Metze, Florian; Fuegen, Christian. - 2008
	BASE
	Show details

12	Efficient Handling of Multilingual Language Models
	Fügen, Christian; Stüker, Sebastian; Soltau, Hagen. - 2008
	BASE
	Show details

13	Advances in speech transcription at IBM under the DARPA EARS program
	Kingsbury, Brian; Mangu, Lidia; Povey, Daniel...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1596-1608
	BLLDB
	OLC Linguistik
	Show details

14	Efficient Handling of Multilingual Language Models ...
	Fügen, Christian; Stüker, Sebastian; Soltau, Hagen. - : Karlsruhe, 2003
	BASE
	Show details

15	A Multi-Perspective evaluation of the NESPOLE! Speech-to-Speech Translation System
	Lavie, Alon; Metze, Florian; Cattoni, Roldano...
	In: ACL'02 workshop on Speech-to-Speech Translation: Algorithms and Systems ; https://hal.inria.fr/inria-00326403 ; ACL'02 workshop on Speech-to-Speech Translation: Algorithms and Systems, ACL, Jun 2002, Philadelphia - Pennsylvania, United States. 9 p (2002)
	BASE
	Show details

16	Enhancing the Usability and Performance of Nespole! - a Real-World Speech-to-Speech Translation System
	Lavie, Alon; Metze, Florian; Pianesi, Fabio...
	In: Human Language Technologies 2002 ; https://hal.inria.fr/inria-00326412 ; Human Language Technologies 2002, Mar 2002, San Diego - California, United States. 6 p (2002)
	BASE
	Show details

17	Multilingual speech recognition ...
	Waibel, Alex; Soltau, Hagen; Schultz, Tanja. - : Karlsruhe, 2000
	BASE
	Show details

18	Multilingual speech recognition
	Waibel, Alex; Soltau, Hagen; Schultz, Tanja. - : Springer Verlag, 2000
	BASE
	Show details

19	Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen
	Schultz, Tanja; Soltau, Hagen
	In: Natural language processing and speech technology. - Berlin [u.a.] : Mouton de Gruyter (1996), 102-110
	BLLDB
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern