DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
In: Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-01807093 ; Language Resources and Evaluation Conference (LREC), Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Pi, May 2018, Miyazaki, Japan (2018)
Abstract: International audience ; Most speech and language technologies are trained with massive amounts of speech and text information. However, most of the world languages do not have such resources and some even lack a stable orthography. Building systems under these almost zero resource conditions is not only promising for speech technology but also for computational language documentation. The goal of computational language documentation is to help field linguists to (semi-)automatically analyze and annotate audio recordings of endangered, unwritten languages. Example tasks are automatic phoneme discovery or lexicon discovery from the speech signal. This paper presents a speech corpus collected during a realistic language documentation process. It is made up of 5k speech utterances in Mboshi (Bantu C25) aligned to French text translations. Speech transcriptions are also made available: they correspond to a non-standard graphemic form close to the language phonology. We detail how the data was collected, cleaned and processed and we illustrate its use through a zero-resource task: spoken term discovery. The dataset is made available to the community for reproducible computational language documentation experiments and their evaluation.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; field linguistics; language documentation; spoken term discovery; unwritten languages; word segmentation; zero resource technologies
URL: https://hal.archives-ouvertes.fr/hal-01807093/document
https://hal.archives-ouvertes.fr/hal-01807093/file/lrec2018_mboshi_final-3.pdf
https://hal.archives-ouvertes.fr/hal-01807093
BASE
Hide details
2
Neural language codes for multilingual acoustic models
In: ISSN: 2308-457X (2018)
BASE
Show details
3
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments ...
Godard, P.; Adda, G.; Adda-Decker, M.. - : arXiv, 2017
BASE
Show details
4
Lexicometric analysis: a methodological prelude
In: Perceptions of the EU in Eastern Europe and sub-Saharan Africa: looking from the outside in, pp. 69-76 (2015)
BASE
Show details
5
Multilingual shifting deep bottleneck features for low-resource ASR
Nguyen, Q. B.; Gehring, J.; Mueller, M.. - : Institute of Electrical and Electronics Engineers, 2014
BASE
Show details
6
Audio Analysis and Synthesis - Extracting Predominant Local Pulse Information From Music Recordings
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 6, 1688-1701
OLC Linguistik
Show details
7
Towards Timbre-Invariant Audio Features for Harmony-Based Music
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 3, 649-662
OLC Linguistik
Show details
8
Efficient Index-Based Audio Matching
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 16 (2008) 2, 382-395
OLC Linguistik
Show details
9
Recombinative generalization of within-syllable units in prereading children.
BASE
Show details
10
V.K Trediakovskijs "Gespräch zwischen einem Fremden und einem Russen über die alte Orthographie und alles, was zu dieser Materie gehört" Regensburg 1994
In: Russian linguistics. - Dordrecht [u.a.] : Springer 21 (1997) 3, 327-330
OLC Linguistik
Show details
11
Translating by factors
In: Language. - Washington, DC : Linguistic Society of America 73 (1997) 3, 681
OLC Linguistik
Show details
12
Rules for parallel processing networks with adaptive structure
In: Mathematical psychology in progress (Berlin, 1989), P.215-228
MPI für Psycholinguistik
Show details
13
Zur Verbindbarkeit der Determinantien und Quantoren
In: Zur Syntax der Determination (Tuebingen, 1986), P.33-56
MPI für Psycholinguistik
Show details

Catalogues
0
0
5
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
2
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern