1 |
“. to grasp the native's point of view.” - A Plea for a Holistic Documentation of the Trobriand Islanders' Language, Culture and Cognition
|
|
|
|
In: Russian Journal of Linguistics, Vol 24, Iss 1, Pp 7-30 (2020) (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Productivity, influence, and evolution: The complex language shift of Modern Ladino
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments
|
|
Godard, P.; Adda, G; Adda-Decker, Martine; Benjumea, J; Besacier, Laurent; Cooper-Leavitt, J; Kouarata, G-N; Lamel, L; Maynard, H; Müller, M.; Rialland, A; Stüker, S.; Yvon, F.; Zanon-Boito, M
|
|
In: Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-01807093 ; Language Resources and Evaluation Conference (LREC), Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Pi, May 2018, Miyazaki, Japan (2018)
|
|
Abstract:
International audience ; Most speech and language technologies are trained with massive amounts of speech and text information. However, most of the world languages do not have such resources and some even lack a stable orthography. Building systems under these almost zero resource conditions is not only promising for speech technology but also for computational language documentation. The goal of computational language documentation is to help field linguists to (semi-)automatically analyze and annotate audio recordings of endangered, unwritten languages. Example tasks are automatic phoneme discovery or lexicon discovery from the speech signal. This paper presents a speech corpus collected during a realistic language documentation process. It is made up of 5k speech utterances in Mboshi (Bantu C25) aligned to French text translations. Speech transcriptions are also made available: they correspond to a non-standard graphemic form close to the language phonology. We detail how the data was collected, cleaned and processed and we illustrate its use through a zero-resource task: spoken term discovery. The dataset is made available to the community for reproducible computational language documentation experiments and their evaluation.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; field linguistics; language documentation; spoken term discovery; unwritten languages; word segmentation; zero resource technologies
|
|
URL: https://hal.archives-ouvertes.fr/hal-01807093/document https://hal.archives-ouvertes.fr/hal-01807093/file/lrec2018_mboshi_final-3.pdf https://hal.archives-ouvertes.fr/hal-01807093
|
|
BASE
|
|
Hide details
|
|
4 |
ДИАЛЕКТ СЕЛА СТАРОШВЕДСКОЕ: ОПЫТ СОСТАВЛЕНИЯ СЛОВАРЯ НЕИЗУЧЕННОГО ЯЗЫКА (GRāVNIŋ - Gǖ) MAN’KOV A
|
|
МАНЬКОВ АЛЕКСАНДР ЕВГЕНЬЕВИЧ. - : Негосударственное образовательное учреждение высшего профессионального образования «Православный Свято-Тихоновский гуманитарный университет», 2016
|
|
BASE
|
|
Show details
|
|
5 |
ДИАЛЕКТ СЕЛА СТАРОШВЕДСКОЕ: ОПЫТ СОСТАВЛЕНИЯ СЛОВАРЯ ИСЧЕЗАЮЩЕГО ЯЗЫКА (DRǟŋ FINN)
|
|
Маньков, Александр. - : Негосударственное образовательное учреждение высшего профессионального образования "Православный Свято-Тихоновский гуманитарный университет", 2015
|
|
BASE
|
|
Show details
|
|
6 |
On The Status Of The Interdental Fricatives /Ṯ/, /Ḏ/, And /Ḍ/ In Gaza City ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Verbparadigmen von Grüsch, Luzein, Safien, Obertschappina, Urmein, Mutten, Splügen und Zillis : Aufnahmen aus dem Jahre 1988 [Online resource]
|
|
|
|
Linguistik-Repository
|
|
Show details
|
|
9 |
Extraordinary claims require extraordinary evidence (and ordinary ones require ordinary evidence) : on experimental linguistics for less well studied languages [Online resource]
|
|
|
|
In: Revista da Abralin 13 (2014) 2, 121-149
|
|
Linguistik-Repository
|
|
Show details
|
|
10 |
Wántwint Inmí Tiináwit: A Reflection of What I Have Learned
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Lenguas de Guinea Ecuatorial: de la documentación a la implementación ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Lenguas de Guinea Ecuatorial: de la documentación a la implementación ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Ormuri Interlinear Texts: texts 1-51 from Logar, Afghanistan, recorded by V.A. Efimov
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Terminology Management at the National Language Service
|
|
|
|
In: Lexikos, Vol 10 (2011) (2011)
|
|
BASE
|
|
Show details
|
|
15 |
Appendix : relative clause questionnaire [Online resource]
|
|
In: Papers from the Workshop on Bantu Relative Clauses : [held in Paris on 8 - 9 January 2010] / Laura Downing, ... (eds.), Zentrum für Allgemeine Sprachwissenschaft, Berlin; ZASPil Vol. 53, S. 243-250 53 (2010), 243-250
|
|
Linguistik-Repository
|
|
Show details
|
|
18 |
语言类型学 : 功能语言学派视野下的语言学田野调查 : Yu yan lei xing xue : gong neng yu yan xue pai shi ye xia de yu yan xue tian ye diao cha [Online resource]
|
|
|
|
In: 語言學論叢 ; Yu yan xue lun cong 36 (2007), 42-56
|
|
Linguistik-Repository
|
|
Show details
|
|
19 |
Managing fieldwork data with Toolbox and the Natural Language Toolkit
|
|
|
|
BASE
|
|
Show details
|
|
20 |
A new mass elicitation technique: the dictionary development program
|
|
|
|
BASE
|
|
Show details
|
|
|
|