DE eng

Search in the Catalogues and Directories

Hits 1 – 4 of 4

1
Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization
In: Interspeech 2011 ; https://hal.archives-ouvertes.fr/hal-01690265 ; Interspeech 2011, Aug 2011, Florence, Italy (2011)
BASE
Show details
2
Towards Exploring Linguistic Variation in ASR Errors: Paradigm & Tool for Perceptual experiments
In: Proceedings of the New tools and methods for very-large-scale phonetics research workshop (VLSP'11) ; New tools and methods for very-large-scale phonetics research workshop (VLSP'11) ; https://hal.archives-ouvertes.fr/hal-01135133 ; New tools and methods for very-large-scale phonetics research workshop (VLSP'11), Jan 2011, Philadelphie, United States (2011)
Abstract: International audience ; It is well-known that human listeners significantly outperform machines when it comes to transcribing speech. This paper presents a paradigm for perceptual experiments that aims to increase our understanding of automatic speech recognition errors. The paradigm asks human listeners to transcribe speech segments containing words that are frequently misrecognized by the system. In particular, we sought to gain information about the impact of increased con text to help humans disambiguate problematic lexical items. The long-term aim of the this research is to improve the modeling of ambiguous items so as to reduce automatic transcription errors. To this extent we have been developing a tool, the Q-ERROR graphical interface, to facilitate the analysis of automatic speech recognition errors. As previous research has shown, speech recognition errors are often modulated by a number of factors, and it can be difficult to assess the impact of each. By enabling a user to filter data in large corpora, the proposed interface can also be used to help select the relevant stimuli for human perceptual tests.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; acoustic-phonetic studies; ASR errors; automatic speech recognition; error analysis; linguistic variation; perceptual paradigm; perceptual test
URL: https://hal.archives-ouvertes.fr/hal-01135133
BASE
Hide details
3
Cross-lingual study of ASR errors: on the role of the context in human perception of near homophones
In: Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech'11) ; 12th Annual Conference of the International Speech Communication Association (Interspeech'11) ; https://hal.archives-ouvertes.fr/hal-01135150 ; 12th Annual Conference of the International Speech Communication Association (Interspeech'11), International Speech Communication Association (ISCA), Aug 2011, Florence, Italy. pp.1949--1952 (2011)
BASE
Show details
4
Studying Luxembourgish phonetics via multilingual forced alignments
In: Proceedings of the 17th International Congress of Phonetic Sciences (ICPhS'11) ; 17th International Congress of Phonetic Sciences (ICPhS'11) ; https://hal.archives-ouvertes.fr/hal-01135124 ; 17th International Congress of Phonetic Sciences (ICPhS'11), Aug 2011, Hong Kong, China. pp.196-199 (2011)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
4
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern