1 |
Fine-tuning pre-trained models for Automatic Speech Recognition: experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03647315 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Fine-tuning pre-trained models for Automatic Speech Recognition: experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03647315 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Are Neural Networks Extracting Linguistic Properties or Memorizing Training Data? An Observation with a Multilingual Probe for Predicting Tense
|
|
|
|
In: EACL 2021 ; https://halshs.archives-ouvertes.fr/halshs-03197072 ; EACL 2021, Apr 2021, Kiev (on line), Ukraine (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Gender Bias in Neural Translation: a preliminary study ; Biais de genre dans un système de traduction automatique neuronale : une étude préliminaire
|
|
|
|
In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265895 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.11-25 ; https://talnrecital2021.inria.fr/ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Screening Gender Transfer in Neural Machine Translation
|
|
|
|
In: Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, ; Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP ; https://hal.archives-ouvertes.fr/hal-03424174 ; Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, Association for computational linguistics, Nov 2021, Punta Cana, Dominica ; https://blackboxnlp.github.io/ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
The SPECTRANS System Description for the WMT21 Terminology Task
|
|
Ballier, Nicolas; Cho, Dahn; Faye, Bilal; Ke, Zong-You; Martikainen, Hanna; Pecman, Mojca; Yunès, Jean-Baptiste; Wisniewski, Guillaume; Zhu, Lichao; Zimina-Poirot, Maria
|
|
In: Proceedings of the Sixth Conference on Machine Translation ; EMNLP 2021 SIXTH CONFERENCE ON MACHINE TRANSLATION (WMT21) ; https://hal.archives-ouvertes.fr/hal-03574680 ; EMNLP 2021 SIXTH CONFERENCE ON MACHINE TRANSLATION (WMT21), ACL, Nov 2021, Punta Cana, Dominican Republic. pp.815-820 ; https://aclanthology.org/events/wmt-2021/ (2021)
|
|
Abstract:
International audience ; This paper discusses the WMT 2021 terminology shared task from a "meta" perspective. We present the results of our experiments using the terminology dataset and the OpenNMT (Klein et al., 2017) and JoeyNMT (Kreutzer et al., 2019) toolkits for the language direction English to French. Our experiment 1 compares the predictions of the two toolkits. Experiment 2 uses OpenNMT to fine-tune the model. We report our results for the task with the evaluation script but mostly discuss the linguistic properties of the terminology dataset provided for the task. We provide evidence of the importance of text genres across scores, having replicated the evaluation scripts.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
|
|
URL: https://hal.archives-ouvertes.fr/hal-03574680 https://hal.archives-ouvertes.fr/hal-03574680/file/WMT2021_Shared_task_EMNLP2021_SPECTRANS_submission%20%281%29.pdf https://hal.archives-ouvertes.fr/hal-03574680/document
|
|
BASE
|
|
Hide details
|
|
7 |
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
|
|
|
|
In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Noisy UGC Translation at the Character Level: Revisiting Open-Vocabulary Capabilities and Robustness of Char-Based Models
|
|
|
|
In: W-NUT 2021 - 7th Workshop on Noisy User-generated Text (colocated with EMNLP 2021) ; https://hal.inria.fr/hal-03540174 ; W-NUT 2021 - 7th Workshop on Noisy User-generated Text (colocated with EMNLP 2021), Association for computational linguistics, Nov 2021, Punta Cana, Dominican Republic (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Understanding the Impact of UGC Specificities on Translation Quality
|
|
|
|
In: W-NUT 2021 - Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021) ; https://hal.inria.fr/hal-03540175 ; W-NUT 2021 - Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021), association for computational linguistics, Nov 2021, Punta Cana, Dominican Republic (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Are Transformers a Modern Version of ELIZA? Observations on French Object Verb Agreement ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
|
|
|
|
In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
|
|
BASE
|
|
Show details
|
|
12 |
La transcription du linguiste au miroir de l’intelligence artificielle : réflexions à partir de la transcription phonémique automatique
|
|
|
|
In: ISSN: 0037-9069 ; EISSN: 1783-1385 ; Bulletin de la Société de Linguistique de Paris ; https://halshs.archives-ouvertes.fr/halshs-02881731 ; Bulletin de la Société de Linguistique de Paris, Peeters Publishers, 2020, 116 (1) (2020)
|
|
BASE
|
|
Show details
|
|
13 |
Ouvrir aux linguistes « de terrain » un accès à la transcription automatique
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-03047148 ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), 2020, Montrouge, France. pp.83-94 (2020)
|
|
BASE
|
|
Show details
|
|
14 |
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
|
|
|
|
In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; 2020 ; https://computel-workshop.org/ (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Phonemic transcription of low-resource languages: To what extent can preprocessing be automated?
|
|
|
|
In: 1st Joint SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) Workshop ; https://halshs.archives-ouvertes.fr/hal-02513914 ; 1st Joint SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) Workshop, 2020, Marseille, France. pp.306-315 ; https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/SLTUCCURLbook.pdf (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Ouvrir aux linguistes « de terrain » un accès à la transcription automatique
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-03047148 ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), 2020, Montrouge, France. pp.83-94 (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Phonemic transcription of low-resource languages: To what extent can preprocessing be automated?
|
|
|
|
In: 1st Joint SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) Workshop ; https://halshs.archives-ouvertes.fr/hal-02513914 ; 1st Joint SLTU (Spoken Language Technologies for Under-resourced languages) and CCURL (Collaboration and Computing for Under-Resourced Languages) Workshop, 2020, Marseille, France. pp.306-315 ; https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/SLTUCCURLbook.pdf (2020)
|
|
BASE
|
|
Show details
|
|
18 |
La transcription du linguiste au miroir de l’intelligence artificielle : réflexions à partir de la transcription phonémique automatique
|
|
|
|
In: ISSN: 0037-9069 ; EISSN: 1783-1385 ; Bulletin de la Société de Linguistique de Paris ; https://halshs.archives-ouvertes.fr/halshs-02881731 ; Bulletin de la Société de Linguistique de Paris, Peeters Publishers, 2020, 116 (1) (2020)
|
|
BASE
|
|
Show details
|
|
19 |
How Bad are PoS Tagger in Cross-Corpora Settings? Evaluating Annotation Divergence in the UD Project
|
|
|
|
In: Proceedings of NAACL-HLT 2019, ; 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-02055137 ; 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Jun 2019, Minneapolis, Minnesota, United States. pp.218 - 227 (2019)
|
|
BASE
|
|
Show details
|
|
20 |
A Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content
|
|
|
|
In: The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19) ; https://hal.archives-ouvertes.fr/hal-02270524 ; The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19), Sep 2019, Turku, Finland ; https://nodalida2019.org/index.html (2019)
|
|
BASE
|
|
Show details
|
|
|
|