Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 28

1	Investigating alignment interpretability for low-resource NMT
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-03139744 ; Machine Translation, Springer Verlag, 2021, ⟨10.1007/s10590-020-09254-w⟩ (2021)
	BASE
	Show details

2	Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation
	Nguyen, Ha; Estève, Yannick; Besacier, Laurent
	In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03372487 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

3	Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
	Stephenson, Brooke; Hueber, Thomas; Girin, Laurent...
	In: Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03372802 ; Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ (2021)
	BASE
	Show details

4	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

5	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang; Zanon Boito, Marcely; Mdhaffar, Salima; Alisamir, Sina; Tong, Ziyi; Tomashenko, Natalia; Dinarelli, Marco; Parcollet, Titouan; Allauzen, Alexandre; Estève, Yannick; Lecouteux, Benjamin; Portet, François; Rossato, Solange; Ringeval, Fabien; Schwab, Didier; Besacier, Laurent
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	Abstract: International audience ; Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image and natural language processing. Recent works also investigated SSL from speech. They were notably successful to improve performance on downstream tasks such as automatic speech recognition (ASR). While these works suggest it is possible to reduce dependence on labeled data for building efficient speech systems, their evaluation was mostly made on ASR and using multiple and heterogeneous experimental settings (most of them for English). This questions the objective comparison of SSL approaches and the evaluation of their impact on building speech systems. In this paper, we propose LeBenchmark: a reproducible framework for assessing SSL from speech. It not only includes ASR (high and low resource) tasks but also spoken language understanding, speech translation and emotion recognition. We also focus on speech technologies in a language different than English: French. SSL models of different sizes are trained from carefully sourced and documented datasets. Experiments show that SSL is beneficial for most but not all tasks which confirms the need for exhaustive and reliable benchmarks to evaluate its real impact. LeBenchmark is shared with the scientific community for reproducible research in SSL from speech.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; ASR; Automatic Emotion Recognition; Self-Supervised Representation Learning; SLU; Speech Translation
	URL: https://hal.archives-ouvertes.fr/hal-03317730v3/document https://hal.archives-ouvertes.fr/hal-03317730v3/file/FLOWBERT_IS2021%282%29.pdf https://hal.archives-ouvertes.fr/hal-03317730
	BASE
	Hide details

6	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

7	Contribution d'informations syntaxiques aux capacités de généralisation compositionelle des modèles seq2seq convolutifs
	Popa, Diana Nicoleta; Havard, William,; Coavoux, Maximin...
	In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265890 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.134-141 (2021)
	BASE
	Show details

8	Lightweight Adapter Tuning for Multilingual Speech Translation
	Le, Hang; Pino, Juan; Wang, Changhan...
	In: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021) ; https://hal.archives-ouvertes.fr/hal-03294912 ; The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), Aug 2021, Bangkok (Virtual), Thailand (2021)
	BASE
	Show details

9	Visualizing Cross-Lingual Discourse Relations in Multilingual TED Corpora
	Kim, Zae; Nikoulina, Vassilina; Kang, Dongyeop...
	In: Proceedings of the 2nd Workshop on Computational Approaches to Discourse ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse ; https://hal.archives-ouvertes.fr/hal-03642341 ; CODI 2021: 2nd Workshop on Computational Approaches to Discourse, Nov 2021, Punta Cana, Dominican Republic. ⟨10.18653/v1/2021.codi-main.16⟩ (2021)
	BASE
	Show details

10	Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads?
	Kim, Zae,; Besacier, Laurent; Nikoulina, Vassilina...
	In: Findings of ACL 2021 ; https://hal.archives-ouvertes.fr/hal-03299010 ; Findings of ACL 2021, Aug 2021, Bangkok (virtual), Thailand (2021)
	BASE
	Show details

11	User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
	Adams, Oliver; Galliot, Benjamin; Wisniewski, Guillaume...
	In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
	BASE
	Show details

12	Investigating the Impact of Gender Representation in ASR Training Data: a Case Study on Librispeech
	Garnerin, Mahault; Rossato, Solange; Besacier, Laurent
	In: Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing ; 3rd Workshop on Gender Bias in Natural Language Processing ; https://hal.univ-grenoble-alpes.fr/hal-03472117 ; 3rd Workshop on Gender Bias in Natural Language Processing, Aug 2021, Online, France. pp.86-92, ⟨10.18653/v1/2021.gebnlp-1.10⟩ (2021)
	BASE
	Show details

13	User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis
	Adams, Oliver; Galliot, Benjamin; Wisniewski, Guillaume...
	In: ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages ; https://halshs.archives-ouvertes.fr/halshs-03030529 ; ComputEL-4: Fourth Workshop on the Use of Computational Methods in the Study of Endangered Languages, Mar 2021, Hawai‘i, United States (2021)
	BASE
	Show details

14	FlauBERT: Unsupervised Language Model Pre-training for French
	Le, Hang; Vial, Loïc; Frej, Jibril...
	In: Proceedings of the 12th Language Resources and Evaluation Conference ; LREC ; https://hal.archives-ouvertes.fr/hal-02890258 ; LREC, 2020, Marseille, France (2020)
	BASE
	Show details

15	FlauBERT : Unsupervised Language Model Pre-training for French ; FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français
	Le, Hang; Vial, Loïc; Frej, Jibril...
	In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-02784776 ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles, Jun 2020, Nancy, France. pp.268-278 (2020)
	BASE
	Show details

16	Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech
	Havard, William,; Besacier, Laurent; Chevrot, Jean-Pierre
	In: Conference on Natural Language Learning (CoNLL) ; https://hal.archives-ouvertes.fr/hal-02962275 ; Conference on Natural Language Learning (CoNLL), Nov 2020, Virtual, France (2020)
	BASE
	Show details

17	Investigating Language Impact in Bilingual Approaches for Computational Language Documentation
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), ; SLTU-CCURL workshop, LREC 2020 ; https://hal.archives-ouvertes.fr/hal-02895907 ; SLTU-CCURL workshop, LREC 2020, May 2020, Marseille, France (2020)
	BASE
	Show details

18	The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
	Dunbar, Ewan; Karadayi, Julien; Bernard, Mathieu...
	In: Interspeech 2020 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-02962224 ; Interspeech 2020 - Conference of the International Speech Communication Association, Oct 2020, Shangai / Virtual, China (2020)
	BASE
	Show details

19	Speech technology for unwritten languages
	Scharenborg, Odette; Besacier, Laurent; Black, Alan...
	In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-02480675 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩ (2020)
	BASE
	Show details

20	MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
	Zanon Boito, Marcely; Havard, William,; Garnerin, Mahault...
	In: Proceedings of The 12th Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-02611059 ; Proceedings of The 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. pp.6486 - 6493 (2020)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern