1 |
Morpho-Syntactic Study of Errors from Speech Recognition System
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01831243 ; International Conference on Language Resources and Evaluation, Jan 2014, Reykjavik, Iceland (2014)
|
|
Abstract:
International audience ; The study provides an original standpoint of the speech transcription errors by focusing on the morpho-syntactic features of the erroneous chunks and of the surrounding left and right context. The typology concerns the forms, the lemmas and the POS involved in erroneous chunks, and in the surrounding contexts. Comparison with error free contexts are also provided. The study is conducted on French. Morpho-syntactic analysis underlines that three main classes are particularly represented in the erroneous chunks: (i) grammatical words (to, of, the), (ii) auxiliary verbs (has, is), and (iii) modal verbs (should, must). Such items are widely encountered in the ASR outputs as frequent candidates to transcription errors. The analysis of the context points out that some left 3-grams contexts (e.g., repetitions, that is disfluencies, bracketing formulas such as “c’est”, etc.) may be better predictors than others. Finally, the surface analysis conducted through a Levensthein distance analysis, highlighted that the most common distance is of 2 characters and mainly involves differences between inflected forms of a unique item.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Automatic Speech Recognition; Error Analysis; Morpho-Syntactic Analysis
|
|
URL: https://hal.archives-ouvertes.fr/hal-01831243
|
|
BASE
|
|
Hide details
|
|
2 |
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
|
|
|
|
In: Ninth International Conference on Language Resources and Evaluation (LREC'14) ; https://hal.archives-ouvertes.fr/hal-01134802 ; Ninth International Conference on Language Resources and Evaluation (LREC'14), May 2014, Reykjavik, Iceland. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.3050-3056, 2014 ; http://lrec2014.lrec-conf.org/en/ (2014)
|
|
BASE
|
|
Show details
|
|
3 |
Automatic named entity pre-annotation for out-of-domain human annotation
|
|
|
|
In: Linguistic Annotation Workshop ; https://hal.archives-ouvertes.fr/hal-01831229 ; Linguistic Annotation Workshop, ACL, Jan 2013, Sofia, Bulgaria (2013)
|
|
BASE
|
|
Show details
|
|
4 |
Human annotation of asr error regions: Is ”gravity” a sharable concept for human annotators?
|
|
|
|
In: Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing (ERRARE 2013) ; https://halshs.archives-ouvertes.fr/halshs-01424915 ; Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing (ERRARE 2013), Nov 2013, Ermenonville, France (2013)
|
|
BASE
|
|
Show details
|
|
5 |
Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case-study
|
|
|
|
In: Biomedical Informatics Insights ; https://hal.archives-ouvertes.fr/hal-01972779 ; Biomedical Informatics Insights, 2013, 13p (2013)
|
|
BASE
|
|
Show details
|
|
6 |
Extended named entities annotation on OCRed documents: from corpus constitution to evaluation campaign
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01831254 ; International Conference on Language Resources and Evaluation, Jan 2012, Istanbul, Turkey (2012)
|
|
BASE
|
|
Show details
|
|
7 |
Manual Corpus Annotation: Giving Meaning to the Evaluation Metrics
|
|
|
|
In: Proceedings of the International Conference on Computational Linguistics (COLING 2012) ; International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-00769639 ; International Conference on Computational Linguistics, Dec 2012, Mumbaï, India. pp.809--818 (2012)
|
|
BASE
|
|
Show details
|
|
8 |
Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers
|
|
|
|
In: Proceedings of the Sixth ACL Linguistic Annotation Workshop ; 6th Linguistics Annotation Workshop (The LAW VI) ; https://hal.archives-ouvertes.fr/hal-00709193 ; 6th Linguistics Annotation Workshop (The LAW VI), Jul 2012, Jeju, South Korea. pp.40-48 (2012)
|
|
BASE
|
|
Show details
|
|
9 |
Proposal for an Extension of Traditional Named Entitites: from Guidelines to Evaluation, an Overview
|
|
|
|
In: Proceedings of the Fifth ACL Linguistic Annotation Workshop ; 5th Linguistics Annotation Workshop (The LAW V) ; https://hal.archives-ouvertes.fr/hal-00604369 ; 5th Linguistics Annotation Workshop (The LAW V), Jun 2011, Portland, United States. pp.92--100 (2011)
|
|
BASE
|
|
Show details
|
|
|
|