DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 21

1
Cascaded Multilingual Audio-Visual Learning from Videos ...
BASE
Show details
2
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2019. : https://www.ldc.upenn.edu, 2019
BASE
Show details
3
Challenging the Boundaries of Speech Recognition: The MALACH Corpus ...
Abstract: There has been huge progress in speech recognition over the last several years. Tasks once thought extremely difficult, such as SWITCHBOARD, now approach levels of human performance. The MALACH corpus (LDC catalog LDC2012S05), a 375-Hour subset of a large archive of Holocaust testimonies collected by the Survivors of the Shoah Visual History Foundation, presents significant challenges to the speech community. The collection consists of unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulations, un-cued speaker and language switching, and emotional speech - all still open problems for speech recognition systems. Transcription is challenging even for skilled human annotators. This paper proposes that the community place focus on the MALACH corpus to develop speech recognition systems that are more robust with respect to accents, disfluencies and emotional speech. To reduce the barrier for entry, a lexicon and training and testing setups have been created and baseline ... : Accepted for publication at INTERSPEECH 2019 ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/1908.03455
https://dx.doi.org/10.48550/arxiv.1908.03455
BASE
Hide details
4
Identifying Mood Episodes Using Dialogue Features from Clinical Interviews ...
BASE
Show details
5
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition ...
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2019
BASE
Show details
6
Building competitive direct acoustics-to-word models for English conversational speech recognition ...
BASE
Show details
7
Matching criteria for vocabulary-independent search
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 5, 1633-1643
BLLDB
OLC Linguistik
Show details
8
USC-SFI MALACH Interviews and Transcripts English
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
BASE
Show details
9
USC-SFI MALACH Interviews and Transcripts English ...
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2012
BASE
Show details
10
Exemplar-based sparse representation features: from TIMIT to LVCSR
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 8, 2598-2613
BLLDB
OLC Linguistik
Show details
11
The IBM expressive text-to-speech synthesis system for American English
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 4, 1099-1108
BLLDB
Show details
12
Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 2, 377-392
BLLDB
OLC Linguistik
Show details
13
Using semantic analysis to improve speech recognition performance
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 19 (2005) 3, 321-343
BLLDB
Show details
14
Using semantic analysis to improve speech recognition performance
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 19 (2005) 3, 321-344
OLC Linguistik
Show details
15
Semantic confidence measurement for spoken dialog systems
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 13 (2005) 4, 534-545
BLLDB
OLC Linguistik
Show details
16
Applications of Language Modeling in Speech-To-Speech Translation
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 7 (2004) 2-3, 221-230
OLC Linguistik
Show details
17
Spontaneous speech processing
Furui, Sadaoki (Hrsg.); Beckman, Mary E. (Hrsg.); Hirschberg, Julia (Hrsg.)...
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
BLLDB
Show details
18
Embedded MT systems, part 2 : speech MT
Voss, Clare (Hrsg.); Ess-Dykema, Carol van (Hrsg.); Bangalore, Srinivas (Mitarb.)...
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 17 (2002) 3, 165-243
BLLDB
Show details
19
MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 17 (2002) 3, 185-212
OLC Linguistik
Show details
20
Automatic speech and speaker recognition : advanced topics
Gopalakrishnan, P.S. (Mitarb.); Alleva, Fileno A. (Mitarb.); Lee, Chin-Hui (Hrsg.). - Boston [u.a.] : Kluwer, 1996
BLLDB
UB Frankfurt Linguistik
Show details

Page: 1 2

Catalogues
1
0
7
0
0
0
0
Bibliographies
10
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
8
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern