
Hits 1 – 20 of 21

1	Cascaded Multilingual Audio-Visual Learning from Videos ...
	Rouditchenko, Andrew; Boggust, Angie; Harwath, David. - : arXiv, 2021
	BASE

2	USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition
	Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William; Hajič, Jan; Oard, Douglas; Olsson, J. Scott; Picheny, Michael; Psutka, Josef. - : Linguistic Data Consortium, 2019. : https://www.ldc.upenn.edu, 2019
	Abstract:
	Introduction: USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition, LDC Catalog Number LDC2019S11 and ISBN 1-58563-889-7, was developed by IBM as part of the MALACH (Multilingual Access to Large Spoken ArCHives) Project. This edition augments USC-SFI MALACH Interviews and Transcripts English (LDC2012S05) by modifying and updating a subset of the original corpus for use with the Kaldi toolkit in speech recognition work, and is easily portable for use by other speech recognition systems as well. It contains approximately 168 hours of interviews from 682 Holocaust witnesses along with transcripts, a lexicon, Kaldi-specific files, and other documentation.
	Inspired by his experience making Schindler's List, Steven Spielberg established the Survivors of the Shoah Visual History Foundation in 1994 to gather video testimonies from survivors and other witnesses of the Holocaust. While most of those who gave testimony were Jewish survivors, the Foundation also interviewed homosexual survivors, Jehovah's Witness survivors, liberators and liberation witnesses, political prisoners, rescuers and aid providers, Roma and Sinti (Gypsy) survivors, survivors of eugenics policies, and war crimes trials participants. The Foundation's Visual History Archive holds nearly 55,000 video testimonies in 43 languages, representing 65 countries; it is the largest archive of its kind in the world. In 2006, the Foundation became part of the Dana and David Dornsife College of Letters, Arts and Sciences at the University of Southern California in Los Angeles and was renamed the USC Shoah Foundation Institute for Visual History and Education.
	The goal of the MALACH project was to develop methods for improved access to large multinational spoken archives; the focus was advancing the state of the art of automatic speech recognition and information retrieval. The characteristics of the USC-SFI collection -- unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulations, un-cued speaker and language switching and emotional speech -- were considered well-suited for that task. The work centered on five languages: English, Czech, Russian, Polish and Slovak. LDC has also released USC-SFI MALACH Interviews and Transcripts Czech (LDC2014S04).
	Data: The original MALACH English data set (LDC2012S05) consists of unsegmented audio interviews in mp2 format and speaker-turn, time-marked transcripts in Transcriber (.trs) format presented in a single flat file. In this release, the speech files are segmented and converted to flac format, and the transcripts are updated to an utterance-by-utterance format. Additionally, a lexicon mapping words to phonemes is provided, and the data is divided into development and training sets. See the included documentation for more details on these changes, and the documentation and catalog entry for LDC2012S05 for further information about the source files.
	Samples: Speech, segments, and transcript samples are available from the catalog entry. Approximately 40 seconds of silence was left at the start of the speech file to preserve the time stamps' accuracy.
	Updates: None at this time.
	URL: https://catalog.ldc.upenn.edu/LDC2019S11
	BASE

3	Challenging the Boundaries of Speech Recognition: The MALACH Corpus ...
	Picheny, Michael; Tüske, Zoltán; Kingsbury, Brian. - : arXiv, 2019
	BASE

4	Identifying Mood Episodes Using Dialogue Features from Clinical Interviews ...
	Aldeneh, Zakaria; Jaiswal, Mimansa; Picheny, Michael. - : arXiv, 2019
	BASE

5	USC-SFI MALACH Interviews and Transcripts English – Speech Recognition Edition ...
	Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2019
	BASE

6	Building competitive direct acoustics-to-word models for English conversational speech recognition ...
	Audhkhasi, Kartik; Kingsbury, Brian; Ramabhadran, Bhuvana. - : arXiv, 2017
	BASE

7	Matching criteria for vocabulary-independent search
	Picheny, Michael; Chaudhari, Upendra V.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 5, 1633-1643
	BLLDB
	OLC Linguistik

8	USC-SFI MALACH Interviews and Transcripts English
	Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2012. : https://www.ldc.upenn.edu, 2012
	BASE

9	USC-SFI MALACH Interviews and Transcripts English ...
	Ramabhadran, Bhuvana; Gustman, Samuel; Byrne, William. - : Linguistic Data Consortium, 2012
	BASE

10	Exemplar-based sparse representation features: from TIMIT to LVCSR
	Sainath, Tara N.; Kanevsky, Dimitri; Ramabhadran, Bhuvana...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 8, 2598-2613
	BLLDB
	OLC Linguistik

11	The IBM expressive text-to-speech synthesis system for American English
	Pitrelli, John F.; Bakis, Raimo; Eide, Ellen M....
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 4, 1099-1108
	BLLDB

12	Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation
	Gu, Liang; Gao, Yuqing; Picheny, Michael...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 2, 377-392
	BLLDB
	OLC Linguistik

13	Using semantic analysis to improve speech recognition performance
	Erdogan, Hakan; Sarikaya, Ruhi; Chen, Stanley F....
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 19 (2005) 3, 321-343
	BLLDB

14	Using semantic analysis to improve speech recognition performance
	Erdogan, Hakan; Sarikaya, Ruhi; Chen, Stanley F....
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 19 (2005) 3, 321-344
	OLC Linguistik

15	Semantic confidence measurement for spoken dialog systems
	Sarikaya, Ruhi; Picheny, Michael; Gao, Yuqing...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 13 (2005) 4, 534-545
	BLLDB
	OLC Linguistik

16	Applications of Language Modeling in Speech-To-Speech Translation
	Liu, Fu-Hua; Gu, Liang; Gao, Yuqing...
	In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 7 (2004) 2-3, 221-230
	OLC Linguistik

17	Spontaneous speech processing
	Furui, Sadaoki (Hrsg.); Beckman, Mary E. (Hrsg.); Hirschberg, Julia (Hrsg.)...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
	BLLDB

18	Embedded MT systems, part 2 : speech MT
	Voss, Clare (Hrsg.); Ess-Dykema, Carol van (Hrsg.); Bangalore, Srinivas (Mitarb.)...
	In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 17 (2002) 3, 165-243
	BLLDB

19	MARS: A Statistical Semantic Parsing and Generation-Based Multilingual Automatic tRanslation System
	Gao, Yuqing; Zhou, Bowen; Diao, Zijian...
	In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 17 (2002) 3, 185-212
	OLC Linguistik

20	Automatic speech and speaker recognition : advanced topics
	Gopalakrishnan, P.S. (Mitarb.); Alleva, Fileno A. (Mitarb.); Lee, Chin-Hui (Hrsg.). - Boston [u.a.] : Kluwer, 1996
	BLLDB
	UB Frankfurt Linguistik
