1 |
Automatic Extraction of Verb Paradigms in Regional Languages: the case of the Linguistic Crescent varieties
|
|
|
|
In: STLU (Spoken Language Technologies for Under-resourced languages) ; https://halshs.archives-ouvertes.fr/halshs-02508210 ; STLU (Spoken Language Technologies for Under-resourced languages), European Language Resources Association (ELRA), Jan 2020, Marseille, France. pp.245-249 ; https://lrec2020.lrec-conf.org/en/ (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Automatic Extraction of Verb Paradigms in Regional Languages: the case of the Linguistic Crescent varieties
|
|
|
|
In: STLU (Spoken Language Technologies for Under-resourced languages) ; https://halshs.archives-ouvertes.fr/halshs-02508210 ; STLU (Spoken Language Technologies for Under-resourced languages), European Language Resources Association (ELRA), Jan 2020, Marseille, France. pp.245-249 ; https://lrec2020.lrec-conf.org/en/ (2020)
|
|
Abstract:
International audience ; An important and costly step in the process of language documentation is the transcription (total or partial transcripts) of speech data collected in the field. Several projects adopt a methodology involving the use of speech transcription systems (Adda et al. 2016; Michaud et al. 2018); in such an approach, it is necessary to adapt the systems so that they can transcribe (at least phonetically) speech collected during fieldwork. However, within the data gathered, some have either an approximate transcription (e.g. in the case of reading), or more or less precise information on its content, for example in the case of verb conjugations: the linguist proposes a verb, and the informant must give all the possible inflections, most often in a fixed order for tenses and persons. The question addressed in this paper is to explore whether it is possible to use a transcription system developed for a given language (here French) without precise adaptation of acoustic models, in order to produce both segmentation and transcription of verbal paradigms of a closely related language (here several Romance varieties spoken in central France), and the conditions under which the system will or will not require post-processing.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS]Humanities and Social Sciences; Dialectology; Language documentation; Linguistic Crescent; Romance languages; Speech processing
|
|
URL: https://halshs.archives-ouvertes.fr/halshs-02508210/document https://halshs.archives-ouvertes.fr/halshs-02508210 https://halshs.archives-ouvertes.fr/halshs-02508210/file/sltu-article-publie-2020.pdf
|
|
BASE
|
|
Hide details
|
|
3 |
LIMSI@WMT16: Machine Translation of News
|
|
|
|
In: First Conference on Machine Translation ; https://hal.archives-ouvertes.fr/hal-01388659 ; First Conference on Machine Translation, Aug 2016, Berlin, Germany. pp.239--245, ⟨10.18653/v1/W16-2304⟩ ; https://statmt.org/wmt16 (2016)
|
|
BASE
|
|
Show details
|
|
4 |
Two-Step MT: Predicting Target Morphology
|
|
|
|
In: International Workshop on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-01592337 ; International Workshop on Spoken Language Translation, 2016, Seattle, WA, United States (2016)
|
|
BASE
|
|
Show details
|
|
5 |
LIMSI$@$WMT'15 : Translation Task
|
|
|
|
In: Proceedings of the Tenth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-02912383 ; Proceedings of the Tenth Workshop on Statistical Machine Translation, Sep 2015, Lisbon, Portugal. pp.145-151, ⟨10.18653/v1/W15-3016⟩ (2015)
|
|
BASE
|
|
Show details
|
|
6 |
Apprentissage partiellement supervisé d'un étiqueteur morpho-syntaxique par transfert cross-lingue
|
|
|
|
In: Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01908360 ; Conférence sur le Traitement Automatique des Langues Naturelles, Jan 2014, Marseille, France (2014)
|
|
BASE
|
|
Show details
|
|
|
|