DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
Understanding Medical Conversations: Rich Transcription, Confidence Scores & Information Extraction ...
BASE
Show details
2
Joint Speech Recognition and Speaker Diarization via Sequence Transduction ...
BASE
Show details
3
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition ...
Soltau, Hagen; Liao, Hank; Sak, Hasim. - : arXiv, 2016
BASE
Show details
4
Boosting systems for large vocabulary continuous speech recognition
In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 2, 212-218
BLLDB
OLC Linguistik
Show details
5
Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers: Presentation Slides
Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia. - : Odyssey 2010, The Speaker and Language Recognition Workshop, 2010
BASE
Show details
6
Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers
Soltau, Hagen; Hirschberg, Julia Bell; Biadsy, Fadi. - : Odyssey 2010, The Speaker and Language Recognition Workshop, 2010
BASE
Show details
7
Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers ...
Abstract: In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (CD) phonetic differences between dialects as well as phonotactics. Given a speech utterance, we obtain the phone sequence using a CD-phone recognizer. We then identify the most likely dialect of these CD-phones using SVM classifiers. Augmenting these phones with the output of these classifiers, we extract augmented phonotactic features which are subsequently given to a logistic regression classifier to obtain a dialect detection score. We test our approach on the task of detecting four Arabic dialects from 30s utterances. We compare our performance to two baselines, PRLM and GMM-UBM, as well as to our own improved version of GMM-UBM which employs fMLLR adaptation. Our approach performs significantly better than all three baselines at 5% absolute Equal Error Rate (EER). The overall EER of our system is 6%. ...
Keyword: Computer science; FOS Languages and literature; Information technology; Linguistics
URL: https://dx.doi.org/10.7916/d8cr62vb
https://academiccommons.columbia.edu/doi/10.7916/D8CR62VB
BASE
Hide details
8
Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers: Presentation Slides ...
Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia. - : Columbia University, 2010
BASE
Show details
9
Advances in Arabic speech transcription at IBM under the DARPA GALE program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 5, 884-894
BLLDB
OLC Linguistik
Show details
10
A one-pass decoder based on polymorphic linguistic context assignment
BASE
Show details
11
Efficient language model lookahead through polymorphic linguistic context assignment
BASE
Show details
12
Efficient Handling of Multilingual Language Models
BASE
Show details
13
Advances in speech transcription at IBM under the DARPA EARS program
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 5, 1596-1608
BLLDB
OLC Linguistik
Show details
14
Efficient Handling of Multilingual Language Models ...
BASE
Show details
15
A Multi-Perspective evaluation of the NESPOLE! Speech-to-Speech Translation System
In: ACL'02 workshop on Speech-to-Speech Translation: Algorithms and Systems ; https://hal.inria.fr/inria-00326403 ; ACL'02 workshop on Speech-to-Speech Translation: Algorithms and Systems, ACL, Jun 2002, Philadelphia - Pennsylvania, United States. 9 p (2002)
BASE
Show details
16
Enhancing the Usability and Performance of Nespole! - a Real-World Speech-to-Speech Translation System
In: Human Language Technologies 2002 ; https://hal.inria.fr/inria-00326412 ; Human Language Technologies 2002, Mar 2002, San Diego - California, United States. 6 p (2002)
BASE
Show details
17
Multilingual speech recognition ...
Waibel, Alex; Soltau, Hagen; Schultz, Tanja. - : Karlsruhe, 2000
BASE
Show details
18
Multilingual speech recognition
Waibel, Alex; Soltau, Hagen; Schultz, Tanja. - : Springer Verlag, 2000
BASE
Show details
19
Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen
In: Natural language processing and speech technology. - Berlin [u.a.] : Mouton de Gruyter (1996), 102-110
BLLDB
Show details

Catalogues
0
0
3
0
0
0
0
Bibliographies
4
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
15
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern