4 |
Fisher Spanish Speech ...
Abstract:
Fisher Spanish - Speech was developed by the Linguistic Data Consortium (LDC) and consists of audio files covering roughly 163 hours of telephone speech from 136 native Caribbean Spanish and non-Caribbean Spanish speakers. Full orthographic transcripts of these audio files are available in Fisher Spanish - Transcripts (LDC2010T04). The Fisher telephone conversation collection protocol was created at LDC to address a critical need of developers trying to build robust automatic speech recognition (ASR) systems. Previous collection protocols, such as CALLFRIEND and Switchboard-II, and the resulting corpora have been adapted for ASR research but were in fact developed for language and speaker identification, respectively. Although the CALLHOME protocol and corpora were developed to support ASR technology, they feature small numbers of speakers making telephone calls of relatively long duration with ...
URL: https://catalog.ldc.upenn.edu/LDC2010S01 https://dx.doi.org/10.35111/skrw-t863
BASE
5 |
The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
In: DTIC (2006)
BASE
8 |
Chinese <-> English Name Entity Lists v 1.0
Huang, Shudong. Linguistic Data Consortium, 2005. https://www.ldc.upenn.edu
BASE