
Search in the Catalogues and Directories

Hits 1 – 20 of 21

1
Cascaded Multilingual Audio-Visual Learning from Videos ...
BASE
2
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2019. : https://www.ldc.upenn.edu, 2019
BASE
3
Challenging the Boundaries of Speech Recognition: The MALACH Corpus ...
BASE
4
Identifying Mood Episodes Using Dialogue Features from Clinical Interviews ...
BASE
5
USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition ...
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2019
BASE
6
Building competitive direct acoustics-to-word models for English conversational speech recognition ...
BASE
7
Matching criteria for vocabulary-independent search
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 5, 1633-1643
BLLDB
OLC Linguistik
8
USC-SFI MALACH Interviews and Transcripts English
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William; Hajič, Jan; Oard, Douglas; Olsson, J. Scott; Picheny, Michael; Psutka, Josef. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
Abstract: *Introduction* USC-SFI MALACH Interviews and Transcripts English, LDC Catalog Number LDC2012S05 and ISBN 1-58563-602-9, was developed by The University of Southern California Shoah Foundation Institute (USC-SFI), the University of Maryland, IBM and Johns Hopkins University as part of the MALACH (Multilingual Access to Large Spoken ArCHives) Project. It contains approximately 375 hours of interviews from 784 interviewees along with transcripts and other documentation.
Inspired by his experience making Schindler's List, Steven Spielberg established the Survivors of the Shoah Visual History Foundation in 1994 to gather video testimonies from survivors and other witnesses of the Holocaust. While most of those who gave testimony were Jewish survivors, the Foundation also interviewed homosexual survivors, Jehovah's Witness survivors, liberators and liberation witnesses, political prisoners, rescuers and aid providers, Roma and Sinti (Gypsy) survivors, survivors of eugenics policies, and war crimes trials participants. The Foundation’s Visual History Archive holds nearly 55,000 video testimonies in 43 languages, representing 65 countries; it is the largest archive of its kind in the world. In 2006, the Foundation became part of the Dana and David Dornsife College of Letters, Arts and Sciences at the University of Southern California in Los Angeles and was renamed the USC Shoah Foundation Institute for Visual History and Education.
The goal of the MALACH project was to develop methods for improved access to large multinational spoken archives. The focus was on advancing the state of the art in automatic speech recognition (ASR) and information retrieval. The characteristics of the USC-SFI collection -- unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulations, un-cued speaker and language switching and emotional speech -- were considered well-suited for that task. The work centered on five languages: English, Czech, Russian, Polish and Slovak. USC-SFI MALACH Interviews and Transcripts English was developed for the English speech recognition experiments. LDC has also released USC-SFI MALACH Interviews and Transcripts Czech (LDC2014S04) and USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition (LDC2019S11).
*Data* The speech data in this release was collected beginning in 1994 under a wide variety of conditions ranging from quiet to noisy (e.g., airplane overflights, wind noise, background conversations and highway noise). Original interviews were recorded on Sony Beta SP tapes, then digitized into a 3 MB/s MPEG-1 stream with 128 kb/s (44 kHz) stereo audio. The sound files in this release are compressed in MP2 format at a sampling frequency of 44.1 kHz.
Approximately 25,000 of all USC-SFI collected interviews are in English and average approximately 2.5 hours each. The 784 interviews included in this release are each a 30-minute section of the corresponding larger interview. Due to the way the original interviews were arranged on the tapes, some interviews are clipped and have a duration of less than 30 minutes. Certain interviews include speech from family members in addition to that of the subject and the interviewer. Accordingly, the corpus contains speech from more than 784 speakers, who are more or less equally distributed between males and females. The interviews also include accented speech over a wide range (e.g., Hungarian, Italian, Yiddish, German and Polish).
This release includes transcripts in .trs format of the first 15 minutes of each interview. The transcripts were created using Transcriber 1.5.1 and later modified.
*Updates* None at this time.
URL: https://catalog.ldc.upenn.edu/LDC2012S05
BASE
9
USC-SFI MALACH Interviews and Transcripts English ...
Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2012
BASE
10
Exemplar-based sparse representation features: from TIMIT to LVCSR
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 8, 2598-2613
BLLDB
OLC Linguistik
11
The IBM expressive text-to-speech synthesis system for American English
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 4, 1099-1108
BLLDB
12
Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 2, 377-392
BLLDB
OLC Linguistik
13
Using semantic analysis to improve speech recognition performance
In: Computer speech and language. - Amsterdam [etc.] : Elsevier 19 (2005) 3, 321-343
BLLDB
14
Using semantic analysis to improve speech recognition performance
In: Computer speech and language. - Amsterdam [etc.] : Elsevier 19 (2005) 3, 321-344
OLC Linguistik
15
Semantic confidence measurement for spoken dialog systems
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 13 (2005) 4, 534-545
BLLDB
OLC Linguistik
16
Applications of Language Modeling in Speech-To-Speech Translation
In: International journal of speech technology. - Boston, Mass. [etc.] : Kluwer Acad. Publ. 7 (2004) 2-3, 221-230
OLC Linguistik
17
Spontaneous speech processing
Furui, Sadaoki (ed.); Beckman, Mary E. (ed.); Hirschberg, Julia (ed.)...
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
BLLDB
18
Embedded MT systems, part 2 : speech MT
Voss, Clare (ed.); Ess-Dykema, Carol van (ed.); Bangalore, Srinivas (contrib.)...
In: Machine translation. - Dordrecht [etc.] : Springer Science + Business Media 17 (2002) 3, 165-243
BLLDB
19
MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System
In: Machine translation. - Dordrecht [etc.] : Springer Science + Business Media 17 (2002) 3, 185-212
OLC Linguistik
20
Automatic speech and speaker recognition : advanced topics
Gopalakrishnan, P.S. (contrib.); Alleva, Fileno A. (contrib.); Lee, Chin-Hui (ed.). - Boston [etc.] : Kluwer, 1996
BLLDB
UB Frankfurt Linguistik
