1 |
MasakhaNER: Named entity recognition for African languages
|
|
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03350962 ; Transactions of the Association for Computational Linguistics, The MIT Press, 2021, ⟨10.1162/tacl⟩ (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Speech technology for unwritten languages
|
|
|
|
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
|
|
BASE
|
|
Show details
|
|
3 |
AlloVera: a multilingual allophone database
|
|
|
|
In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
|
|
BASE
|
|
Show details
|
|
4 |
AlloVera: a multilingual allophone database
|
|
|
|
In: LREC 2020: 12th Language Resources and Evaluation Conference ; https://halshs.archives-ouvertes.fr/halshs-02527046 ; LREC 2020: 12th Language Resources and Evaluation Conference, European Language Resources Association, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking rosetta” JSALT 2017 workshop
|
|
|
|
In: ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01709578 ; ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada (2018)
|
|
BASE
|
|
Show details
|
|
6 |
Evaluating phonemic transcription of low-resource tonal languages for language documentation
|
|
|
|
In: LREC 2018 (Language Resources and Evaluation Conference) ; https://halshs.archives-ouvertes.fr/halshs-01709648 ; LREC 2018 (Language Resources and Evaluation Conference), May 2018, Miyazaki, Japan. pp.3356-3365 (2018)
|
|
BASE
|
|
Show details
|
|
7 |
Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone toolkit
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://halshs.archives-ouvertes.fr/halshs-01841979 ; Language Documentation & Conservation, University of Hawaiʻi Press 2018, 12, pp.393-429 ; hdl.handle.net/10125/24793 (2018)
|
|
BASE
|
|
Show details
|
|
8 |
Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone toolkit
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://halshs.archives-ouvertes.fr/halshs-01841979 ; Language Documentation & Conservation, University of Hawaiʻi Press 2018, 12, pp.393-429 ; hdl.handle.net/10125/24793 (2018)
|
|
BASE
|
|
Show details
|
|
9 |
Evaluating phonemic transcription of low-resource tonal languages for language documentation
|
|
|
|
In: LREC 2018 (Language Resources and Evaluation Conference) ; https://halshs.archives-ouvertes.fr/halshs-01709648 ; LREC 2018 (Language Resources and Evaluation Conference), May 2018, Miyazaki, Japan. pp.3356-3365 (2018)
|
|
Abstract:
International audience ; Transcribing speech is an important part of language documentation, yet speech recognition technology has not been widely harnessed to aid linguists. We explore the use of a neural network architecture with the connectionist temporal classification loss function for phonemic and tonal transcription in a language documentation setting. In this framework, we explore jointly modelling phonemes and tones versus modelling them separately, and assess the importance of pitch information versus phonemic context for tonal prediction. Experiments on two tonal languages, Yongning Na and Eastern Chatino, show the changes in recognition performance as training data is scaled from 10 minutes up to 50 minutes for Chatino, and up to 224 minutes for Na. We discuss the findings from incorporating this technology into the linguistic workflow for documenting Yongning Na, which show the method's promise in improving efficiency, minimizing typographical errors, and maintaining the transcription's faithfulness to the acoustic signal, while highlighting phonetic and phonemic facts for linguistic consideration.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; Asian languages; language documentation; low-resource languages; Mesoamerican languages; speech recognition
|
|
URL: https://halshs.archives-ouvertes.fr/halshs-01709648v4/document https://halshs.archives-ouvertes.fr/halshs-01709648 https://halshs.archives-ouvertes.fr/halshs-01709648v4/file/Adams_et_al2018_LREC.pdf
|
|
BASE
|
|
Hide details
|
|
10 |
Phonemic transcription of low-resource tonal languages
|
|
|
|
In: ISSN: 1834-7037 ; Australasian Language Technology Association Workshop 2017 ; https://halshs.archives-ouvertes.fr/halshs-01656683 ; Australasian Language Technology Association Workshop 2017, Dec 2017, Brisbane, Australia. pp.53-60 (2017)
|
|
BASE
|
|
Show details
|
|
11 |
Phonemic transcription of low-resource tonal languages
|
|
|
|
In: ISSN: 1834-7037 ; Australasian Language Technology Association Workshop 2017 ; https://halshs.archives-ouvertes.fr/halshs-01656683 ; Australasian Language Technology Association Workshop 2017, Dec 2017, Brisbane, Australia. pp.53-60 (2017)
|
|
BASE
|
|
Show details
|
|
|
|