Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 18 of 18

1	Automatic Speech Recognition and Query By Example for Creole Languages Documentation
	MACAIRE, Cécile; Schwab, Didier; Lecouteux, Benjamin...
	In: Findings of the Association for Computational Linguistics: ACL 2022 ; https://hal.archives-ouvertes.fr/hal-03625303 ; Findings of the Association for Computational Linguistics: ACL 2022, May 2022, Dublin, Ireland (2022)
	BASE
	Show details

2	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

3	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

4	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: ; INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

5	ON-TRAC' systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks
	Le, Hang; Barbier, Florentin; Nguyen, Ha...
	In: Proceedings of the 18th International Conference on Spoken Language Translation, ; International Conference on Spoken Language Translation (IWSLT) ; https://hal.archives-ouvertes.fr/hal-03298854 ; International Conference on Spoken Language Translation (IWSLT), Aug 2021, Bangkok (virtual), Thailand. ⟨10.18653/v1/2021.iwslt-1.20⟩ (2021)
	BASE
	Show details

6	FlauBERT: Unsupervised Language Model Pre-training for French
	Le, Hang; Vial, Loïc; Frej, Jibril...
	In: Proceedings of the 12th Language Resources and Evaluation Conference ; LREC ; https://hal.archives-ouvertes.fr/hal-02890258 ; LREC, 2020, Marseille, France (2020)
	BASE
	Show details

7	FlauBERT : Unsupervised Language Model Pre-training for French ; FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français
	Le, Hang; Vial, Loïc; Frej, Jibril; Segonne, Vincent; Coavoux, Maximin; Lecouteux, Benjamin; Allauzen, Alexandre; Crabbé, Benoît; Besacier, Laurent; Schwab, Didier
	In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-02784776 ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 2 : Traitement Automatique des Langues Naturelles, Jun 2020, Nancy, France. pp.268-278 (2020)
	Abstract: Language models have become a key step to achieve state-of-the art results in many NLP tasks. Leveraging the huge amount of unlabeled texts available, they provide an efficient way to pretrain continuous word representations that can be fine-tuned for downstream tasks, along with theircontextualization at the sentence level. This has been widely demonstrated for English. In this paper, we introduce and share FlauBERT, a model learned on a very large and heterogeneous French corpus. We train models of different sizes using the new CNRS Jean Zay supercomputer. We apply our French language models to several NLP tasks (text classification, paraphrasing, natural language inference, parsing, word sense disambiguation) and show that they often outperform other pre-training approaches on the FLUE benchmark also presented in this article. ; Les modèles de langue pré-entraînés sont désormais indispensables pour obtenir des résultats à l’état-de-l’art dans de nombreuses tâches du TALN. Tirant avantage de l’énorme quantité de textes bruts disponibles, ils permettent d’extraire des représentations continues des mots, contextualisées au niveau de la phrase. L’efficacité de ces représentations pour résoudre plusieurs tâches de TALN a été démontrée récemment pour l’anglais. Dans cet article, nous présentons et partageons FlauBERT, un ensemble de modèles appris sur un corpus français hétérogène et de taille importante. Des modèles de complexité différente sont entraînés à l’aide du nouveau supercalculateur Jean Zay du CNRS. Nous évaluons nos modèles de langue sur diverses tâches en français (classification de textes, paraphrase, inférence en langage naturel, analyse syntaxique, désambiguïsation automatique) et montrons qu’ils surpassent souvent les autres approches sur le référentiel d’évaluation FLUE également présenté ici.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; analyse syntaxique; BERT; classification de textes; désambiguïsation lexicale; évaluation; FlauBERT; FLUE; français; French; inférence en langue naturelle; language model; modèles de langue; natural language inference; NLP benchmark; paraphrase; parsing; text classification; word sense disambiguation
	URL: https://hal.archives-ouvertes.fr/hal-02784776v3/file/66.pdf https://hal.archives-ouvertes.fr/hal-02784776 https://hal.archives-ouvertes.fr/hal-02784776v3/document
	BASE
	Hide details

8	Providing semantic knowledge to a set of pictograms for people with disabilities: a set of links between WordNet and Arasaac: Arasaac-WN
	Schwab, D; Trial, P; Vaschalde, C...
	In: LREC ; https://hal.archives-ouvertes.fr/hal-02888279 ; LREC, 2020, Marseille, France (2020)
	BASE
	Show details

9	ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020
	Elbayad, Maha; Nguyen, Ha; Bougares, Fethi...
	In: Proceedings of the 17th International Conference on Spoken Language Translation ; https://hal.archives-ouvertes.fr/hal-02895893 ; Proceedings of the 17th International Conference on Spoken Language Translation, Jul 2020, Seattle, WA, United States. pp.35-43, ⟨10.18653/v1/2020.iwslt-1.2⟩ (2020)
	BASE
	Show details

10	ASR performance prediction on unseen broadcast programs using convolutional neurol networks
	Elloumi, Zied; Besacier, Laurent; Galibert, Olivier...
	In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-01709779 ; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2018, Calgary, Alberta, Canada (2018)
	BASE
	Show details

11	Improving the Coverage and the Generalization Ability of Neural Word Sense Disambiguation through Hypernymy and Hyponymy Relationships ...
	Vial, Loïc; Lecouteux, Benjamin; Schwab, Didier. - : arXiv, 2018
	BASE
	Show details

12	An Open Source Toolkit for Word-level Confidence Estimation in Machine Translation
	Servan, Christophe; Le, Ngoc-Tien; Luong, Ngoc Quang...
	In: The 12th International Workshop on Spoken Language Translation (IWSLT'15) ; https://hal.archives-ouvertes.fr/hal-01244477 ; The 12th International Workshop on Spoken Language Translation (IWSLT'15), Dec 2015, Da Nang, Vietnam ; http://workshop2015.iwslt.org/ (2015)
	BASE
	Show details

13	Dynamic combination of automatic speech recognition systems by driven decoding
	Lecouteux, Benjamin
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 21 (2013) 6, 1251-1260
	OLC Linguistik
	Show details

14	Quand nasal est plus que nasal : l’articulation orale des voyelles nasales en français ; (When nasal is more than nasal : oral articulation of French nasal vowels )
	Carignan, Christopher (R18263). - : U.S., Association for Computational Linguistics, 2012
	BASE
	Show details

15	Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech
	Rouvier, Mickael; Linarès, Georges; Lecouteux, Benjamin
	In: ISSN: 1687-4714 ; EISSN: 1687-4722 ; EURASIP Journal on Audio, Speech, and Music Processing ; https://hal.archives-ouvertes.fr/hal-01320220 ; EURASIP Journal on Audio, Speech, and Music Processing, SpringerOpen, 2010, ⟨10.1155/2010/326578⟩ (2010)
	BASE
	Show details

16	Combined low level and high level features for Out-Of- Vocabulary Word detection
	Lecouteux, Benjamin; Linarès, Georges; Favre, Benoit
	In: Interspeech ; https://hal.archives-ouvertes.fr/hal-02088875 ; Interspeech, 2009, Brighton, United Kingdom (2009)
	BASE
	Show details

17	Using prompts to produce quality corpus for training automatic speech recognition systems
	Lecouteux, Benjamin; Linarès, Georges
	In: MELECON 2008 - The 14th IEEE Mediterranean Electrotechnical Conference ; https://hal.archives-ouvertes.fr/hal-01318050 ; MELECON 2008 - The 14th IEEE Mediterranean Electrotechnical Conference , May 2008, Ajaccio, France. ⟨10.1109/MELCON.2008.4618540⟩ (2008)
	BASE
	Show details

18	Imperfect transcript driven speech recognition
	Lecouteux, Benjamin; Linarès, Georges; Nocera, Pascal...
	In: INTERSPEECH ; https://hal.archives-ouvertes.fr/hal-01318085 ; INTERSPEECH, Sep 2006, Pittsburgh, United States (2006)
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern