1.
Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization
In: Interspeech 2011, Aug 2011, Florence, Italy ; https://hal.archives-ouvertes.fr/hal-01690265
BASE
2.
Towards Exploring Linguistic Variation in ASR Errors: Paradigm & Tool for Perceptual experiments
In: Proceedings of the New tools and methods for very-large-scale phonetics research workshop (VLSP'11), Jan 2011, Philadelphia, United States ; https://hal.archives-ouvertes.fr/hal-01135133
Abstract:
International audience ; It is well known that human listeners significantly outperform machines when it comes to transcribing speech. This paper presents a paradigm for perceptual experiments that aims to increase our understanding of automatic speech recognition errors. The paradigm asks human listeners to transcribe speech segments containing words that are frequently misrecognized by the system. In particular, we sought to gain information about how increased context helps humans disambiguate problematic lexical items. The long-term aim of this research is to improve the modeling of ambiguous items so as to reduce automatic transcription errors. To this end, we have been developing a tool, the Q-ERROR graphical interface, to facilitate the analysis of automatic speech recognition errors. As previous research has shown, speech recognition errors are often modulated by a number of factors, and it can be difficult to assess the impact of each. By enabling a user to filter data in large corpora, the proposed interface can also be used to help select the relevant stimuli for human perceptual tests.
Keywords:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; acoustic-phonetic studies; ASR errors; automatic speech recognition; error analysis; linguistic variation; perceptual paradigm; perceptual test
URL: https://hal.archives-ouvertes.fr/hal-01135133
BASE
3.
Cross-lingual study of ASR errors: on the role of the context in human perception of near homophones
In: Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech'11), International Speech Communication Association (ISCA), Aug 2011, Florence, Italy, pp. 1949-1952 ; https://hal.archives-ouvertes.fr/hal-01135150
BASE
4.
Studying Luxembourgish phonetics via multilingual forced alignments
In: Proceedings of the 17th International Congress of Phonetic Sciences (ICPhS'11), Aug 2011, Hong Kong, China, pp. 196-199 ; https://hal.archives-ouvertes.fr/hal-01135124
BASE