
Search in the Catalogues and Directories

Hits 1 – 10 of 10

1
Adaptor Grammars for the Linguist: Word Segmentation Experiments for Very Low-Resource Languages
In: Workshop on Computational Research in Phonetics, Phonology, and Morphology ; https://hal.archives-ouvertes.fr/hal-01910757 ; Workshop on Computational Research in Phonetics, Phonology, and Morphology, Oct 2018, Bruxelles, Belgium. pp.32 - 42, ⟨10.18653/v1/P17⟩ (2018)
BASE
2
A corpus based study of morpheme deletion in a low resourced language: A case study for Embosi
In: Annual Meeting of the Linguistic Society of America ; https://hal.archives-ouvertes.fr/hal-01837164 ; Annual Meeting of the Linguistic Society of America, Jan 2018, Salt Lake City, United States (2018)
BASE
3
Developing an Embosi (Bantu C25) Speech Variant Dictionary to Model Vowel Elision and Morpheme Deletion
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837178 ; Annual Conference of the International Speech Communication Association, ISCA, Aug 2017, Stockholm, Sweden (2017)
BASE
4
Corpus based linguistic exploration via forced alignments with a ‘light-weight’ ASR tool
In: Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics ; https://hal.archives-ouvertes.fr/hal-01837174 ; Language & Technology Conference : Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznań, Poland (2017)
BASE
5
BULB: Breaking the Unwritten Language Barrier
In: Procedia Computer Science ; Computational Methods for Endangered Language Documentation and Description ; https://hal.archives-ouvertes.fr/hal-01836496 ; Computational Methods for Endangered Language Documentation and Description, May 2016, Yogyakarta, Indonesia. pp.8-14, ⟨10.1016/j.procs.2016.04.023⟩ (2016)
Abstract: The project Breaking the Unwritten Language Barrier (BULB), which brings together linguists and computer scientists, aims at supporting linguists in documenting unwritten languages. In order to achieve this we develop tools tailored to the needs of documentary linguists by building upon technology and expertise from the area of natural language processing, most prominently automatic speech recognition and machine translation. As a development and test bed for this we have chosen three less-resourced African languages from the Bantu family: Basaa, Myene and Embosi. Work within the project is divided into three main steps:
1) Collection of a large corpus of speech (100h per language) at a reasonable cost. For this we use standard mobile devices and a dedicated software, Lig-Aikuma. After initial recording, the data is re-spoken by a reference speaker to enhance the signal quality and orally translated into French.
2) Automatic transcription of the Bantu languages at phoneme level and the French translation at word level. The recognized Bantu phonemes and French words will then be automatically aligned.
3) Tool development. In close cooperation and discussion with the linguists, the speech and language technologists will design and implement tools that will support the linguists in their work, taking into account the linguists’ needs and technology’s capabilities.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Endangered Languages; Low Resource Language
URL: https://hal.archives-ouvertes.fr/hal-01836496
https://doi.org/10.1016/j.procs.2016.04.023
BASE
7
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01843401 ; International Conference on Language Resources and Evaluation, May 2014, Reykjavik, Iceland (2014)
BASE
8
Modélisation acoustico-phonétique de langues peu dotées : Études phonétiques et travaux de reconnaissance automatique en luxembourgois
In: Journées d'Etude sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843399 ; Journées d'Etude sur la Parole, Jan 2014, Le Mans, France (2014)
BASE
9
What we can learn from ASR errors about low-resourced languages: a case-study of Luxembourgish and Austrian
In: Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing ; https://hal.archives-ouvertes.fr/hal-01843440 ; Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing, Jan 2013, Ermenonville, France (2013)
BASE
10
Annotation and analysis of overlapping speech in political interviews
In: LREC 2008 ; https://hal.archives-ouvertes.fr/hal-01690328 ; LREC 2008, May 2008, Marrakech, Morocco (2008)
BASE
