Hits 1 – 18 of 18

1	First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
	Muller, Benjamin; Elazar, Yanai; Sagot, Benoît...
	In: https://hal.inria.fr/hal-03161685 ; 2021 (2021)
	BASE

2	Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi
	Muller, Benjamin; Sagot, Benoît; Seddah, Djamé
	In: https://hal.inria.fr/hal-03161677 ; 2021 (2021)
	BASE

3	First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
	Muller, Benjamin; Elazar, Yanai; Sagot, Benoît...
	In: EACL 2021 - The 16th Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03239087 ; EACL 2021 - The 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kyiv / Virtual, Ukraine ; https://2021.eacl.org/ (2021)
	BASE

4	When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
	Muller, Benjamin; Anastasopoulos, Antonios; Sagot, Benoît...
	In: NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.inria.fr/hal-03251105 ; NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2021, Mexico City, Mexico (2021)
	BASE

5	Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
	Seddah, Djamé; Essaidi, Farah; Fethi, Amal...
	In: ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-02889804 ; ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle / Virtual, United States. ⟨10.18653/v1/2020.acl-main.107⟩ (2020)
	BASE

6	CamemBERT: a Tasty French Language Model
	Martin, Louis; Muller, Benjamin; Ortiz Suárez, Pedro Javier; Dupont, Yoann; Romary, Laurent; Villemonte de La Clergerie, Éric; Seddah, Djamé; Sagot, Benoît
	In: ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-02889805 ; ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle / Virtual, United States. ⟨10.18653/v1/2020.acl-main.645⟩ (2020)
	Abstract: Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models, in all languages except English, very limited. In this paper, we investigate the feasibility of training monolingual Transformer-based language models for other languages, taking French as an example and evaluating our language models on part-of-speech tagging, dependency parsing, named entity recognition and natural language inference tasks. We show that the use of web crawled data is preferable to the use of Wikipedia data. More surprisingly, we show that a relatively small web crawled dataset (4GB) leads to results that are as good as those obtained using larger datasets (130+GB). Our best performing model CamemBERT reaches or improves the state of the art in all four downstream tasks.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
	URL: https://hal.inria.fr/hal-02889805/file/ACL_2020___CamemBERT__a_Tasty_French_Language_Model-6.pdf https://hal.inria.fr/hal-02889805 https://hal.inria.fr/hal-02889805/document https://doi.org/10.18653/v1/2020.acl-main.645
	BASE

7	Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
	Gonen, Hila; Jawahar, Ganesh; Seddah, Djamé...
	In: ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03161637 ; ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle / Virtual, United States. pp.538-555, ⟨10.18653/v1/2020.acl-main.51⟩ (2020)
	BASE

8	When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
	Muller, Benjamin; Anastasopoulos, Antonios; Sagot, Benoît...
	In: https://hal.inria.fr/hal-03109106 ; 2020 (2020)
	BASE

9	CamemBERT: a Tasty French Language Model
	Martin, Louis; Muller, Benjamin; Ortiz Suárez, Pedro Javier...
	In: https://hal.inria.fr/hal-02445946 ; 2019 (2019)
	BASE

10	What does BERT learn about the structure of language?
	Jawahar, Ganesh; Sagot, Benoît; Seddah, Djamé
	In: ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-02131630 ; ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy (2019)
	BASE

11	Contextualized Diachronic Word Representations
	Jawahar, Ganesh; Seddah, Djamé
	In: 1st International Workshop on Computational Approaches to Historical Language Change 2019 (colocated with ACL 2019) ; https://hal.archives-ouvertes.fr/hal-02194763 ; 1st International Workshop on Computational Approaches to Historical Language Change 2019 (colocated with ACL 2019), Aug 2019, Florence, Italy (2019)
	BASE

12	Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer
	Seddah, Djamé; Villemonte de La Clergerie, Éric; Sagot, Benoît...
	In: Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ; https://hal.inria.fr/hal-01798801 ; Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018, Miyazaki, Japan (2018)
	BASE

13	CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing
	More, Amir; Çetinoğlu, Özlem; Çöltekin, Çağrı...
	In: 11th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01786125 ; 11th Language Resources and Evaluation Conference, May 2018, Miyazaki, Japan ; http://lrec2018.lrec-conf.org (2018)
	BASE

14	The ParisNLP entry at the ConLL UD Shared Task 2017: A Tale of a #ParsingTragedy
	Villemonte de La Clergerie, Éric; Sagot, Benoît; Seddah, Djamé
	In: Conference on Computational Natural Language Learning ; https://hal.inria.fr/hal-01584168 ; Conference on Computational Natural Language Learning, Aug 2017, Vancouver, Canada. pp.243-252, ⟨10.18653/v1/K17-3026⟩ ; http://universaldependencies.org/conll17/ (2017)
	BASE

15	Paris and Stanford at EPE 2017: Downstream Evaluation of Graph-based Dependency Representations
	Schuster, Sebastian; Villemonte de La Clergerie, Éric; Candito, Marie...
	In: EPE 2017 - The First Shared Task on Extrinsic Parser Evaluation ; https://hal.inria.fr/hal-01592051 ; EPE 2017 - The First Shared Task on Extrinsic Parser Evaluation, Sep 2017, Pisa, Italy. pp.47-59 ; http://epe.nlpl.eu (2017)
	BASE

16	Building a Question Treebank for French : The French QuestionBank ; Tour d'Horizon du French QuestionBank : Construire un Corpus Arboré de Questions pour le Français
	Seddah, Djamé; Candito, Marie
	In: ACor4French - Les corpus annotés du français ; https://hal.inria.fr/hal-01682869 ; ACor4French - Les corpus annotés du français, Jun 2017, Orléans, France (2017)
	BASE

17	From Noisy Questions to Minecraft Texts: Annotation Challenges in Extreme Syntax Scenarios
	Martínez Alonso, Héctor; Seddah, Djamé; Sagot, Benoît
	In: 2nd Workshop on Noisy User-generated Text (W-NUT) at CoLing 2016 ; https://hal.inria.fr/hal-01584054 ; 2nd Workshop on Noisy User-generated Text (W-NUT) at CoLing 2016, Dec 2016, Osaka, Japan (2016)
	BASE

18	Hard Time Parsing Questions: Building a QuestionBank for French
	Seddah, Djamé; Candito, Marie
	In: Tenth International Conference on Language Resources and Evaluation (LREC 2016) ; https://hal.archives-ouvertes.fr/hal-01457184 ; Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016, Portorož, Slovenia (2016)
	BASE
