
Search in the Catalogues and Directories

Hits 1 – 11 of 11

1
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01843401 ; International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland (2014)
BASE
2
Automatic Language Identity Tagging on Word and Sentence-Level in Multilingual Text Sources: a Case-Study on Luxembourgish
In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) ; Ninth International Conference on Language Resources and Evaluation (LREC'14) ; https://hal.archives-ouvertes.fr/hal-01134776 ; Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), May 2014, Reykjavik, Iceland. pp.3300-3304 ; http://lrec2014.lrec-conf.org/en/ (2014)
Abstract: Luxembourgish, embedded in a multilingual context on the divide between Romance and Germanic cultures, remains one of Europe's under-described languages. This is because written production remains relatively low, and linguistic knowledge and resources, such as lexica and pronunciation dictionaries, are sparse. Speakers and writers frequently switch between Luxembourgish, German, and French, both on a per-sentence basis and at the sub-sentence level. To build resources such as lexicons (especially pronunciation lexicons) or the language models needed for natural language processing tasks such as automatic speech recognition, the language used in text corpora must be identified. In this paper, we present the design of a manually annotated corpus of mixed-language sentences as well as the tools used to select these sentences. This corpus of difficult sentences was used to test a word-based language identification system. The system was then used to select textual data extracted from the web in order to build a lexicon and language models, which were used in an automatic speech recognition system for Luxembourgish that obtains a 25% WER on the Quaero development data.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; corpus of Luxembourgish; language identification; under-resourced language
URL: https://hal.archives-ouvertes.fr/hal-01134776
BASE
3
Modélisation acoustico-phonétique de langues peu dotées : Études phonétiques et travaux de reconnaissance automatique en luxembourgois
In: Journées d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843399 ; Journées d'Etude sur la Parole, Jan 2014, Le Mans, France (2014)
BASE
4
Traitements automatiques de l’oral et de l’écrit
In: https://halshs.archives-ouvertes.fr/halshs-01420863 ; France. 141, pp.4-8, 2014 (2014)
BASE
5
How to assess the quality of automatic transcriptions for the extraction of named entities? ; Comment évaluer la qualité des transcriptions automatiques pour la détection d’entités nommées ?
In: Actes des XXXe Journées d'Études sur la Parole (JEP'14) ; XXXe Journées d'Études sur la Parole (JEP'14) ; https://hal.archives-ouvertes.fr/hal-01134868 ; XXXe Journées d'Études sur la Parole (JEP'14), Jun 2014, Le Mans, France. pp.430-437 ; http://www-lium.univ-lemans.fr/jep2014/ (2014)
BASE
6
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
In: Ninth International Conference on Language Resources and Evaluation (LREC'14) ; https://hal.archives-ouvertes.fr/hal-01134802 ; Ninth International Conference on Language Resources and Evaluation (LREC'14), May 2014, Reykjavik, Iceland. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.3050-3056, 2014 ; http://lrec2014.lrec-conf.org/en/ (2014)
BASE
7
A CRF-Based Approach to Automatic Disfluency Detection in a French Call-Centre Corpus
In: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech'14) ; 15th Annual Conference of the International Speech Communication Association (Interspeech'14) ; https://hal.archives-ouvertes.fr/hal-01134812 ; 15th Annual Conference of the International Speech Communication Association (Interspeech'14), International Speech Communication Association (ISCA), Sep 2014, Singapore, Singapore. pp.2897-2901 ; http://www.interspeech2014.org/public.php?page=home.html (2014)
BASE
8
ETER: a New Metric for the Evaluation of Hierarchical Named Entity Recognition
In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) ; Ninth International Conference on Language Resources and Evaluation (LREC'14) ; https://hal.archives-ouvertes.fr/hal-01134713 ; Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), May 2014, Reykjavik, Iceland. pp.3987-3994 ; http://lrec2014.lrec-conf.org/en/ (2014)
BASE
9
Reconnaissance automatique de la parole
In: ISSN: 0222-9838 ; EISSN: 1783-1601 ; L'information grammaticale ; https://hal.archives-ouvertes.fr/hal-01135037 ; L'information grammaticale, Peeters Publishers, 2014, TRAITEMENTS AUTOMATIQUES DE L’ORAL ET DE L’ÉCRIT (1) Panorama des recherches et des technologies actuelles, 141, pp.10 (2014)
BASE
10
Speech Alignment and Recognition Experiments for Luxembourgish
In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Underresourced Languages ; 4th International Workshop on Spoken Language Technologies for Underresourced Languages ; https://hal.archives-ouvertes.fr/hal-01134824 ; 4th International Workshop on Spoken Language Technologies for Underresourced Languages, May 2014, Saint Petersburg, Russia. pp.53-60 ; http://www.mica.edu.vn/sltu2014/ (2014)
BASE
11
A First LVCSR System for Luxembourgish, a Low-Resourced European Language
In: Human Language Technology Challenges for Computer Science and Linguistics ; https://hal.archives-ouvertes.fr/hal-01135103 ; Zygmunt Vetulani; Joseph Mariani. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.479-490, 2014, 5th Language and Technology Conference, LTC 2011, Poznań, Poland, November 25--27, 2011, Revised Selected Papers, 978-3-319-08957-7. ⟨10.1007/978-3-319-08958-4_39⟩ (2014)
BASE

Open access documents: 11