Home
Catalogue search
Refine your search:
Keyword:
[INFO.INFO-CL]Computer Science [cs] / Computation and Language [cs.CL] (1)
Creator / Publisher:
ANR-15-CE38-0011,SoSweet,Une sociolinguistique de Twitter : liens sociaux et variations linguistiques(2015) (1)
ANR-16-CE33-0021,PARSITI,Analyser l'impossible, Traduire l'improbable(2016) (1)
ANR-18-CE38-0003,BASNUM,Numérisation et analyse du Dictionnaire universel de Basnage de Beauval: lexicographie et réseaux scientifiques(2018) (1)
Automatic Language Modelling and ANAlysis & Computational Humanities (ALMAnaCH) (1)
Dupont, Yoann (1)
Facebook (1)
Facebook AI Research Paris (FAIR) (1)
Inria de Paris (1)
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria) (1)
Martin, Louis (1)
more
Year:
2019 (1)
Medium
Type:
Miscellaneous (1)
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 1 of 1
1
CamemBERT: a Tasty French Language Model
Martin, Louis
;
Muller, Benjamin
;
Ortiz Suárez, Pedro Javier
;
Dupont, Yoann
;
Romary, Laurent
;
Villemonte de La Clergerie, Éric
;
Seddah, Djamé
;
Sagot, Benoît
In: https://hal.inria.fr/hal-02445946 ; 2019 (2019)
Abstract:
Web site: https://camembert-model.fr ; Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models—in all languages except English—very limited. Aiming to address this issue for French, we release CamemBERT, a French version of the Bi-directional Encoders for Transformers (BERT). We measure the performance of CamemBERT compared to multilingual models in multiple downstream tasks, namely part-of-speech tagging, dependency parsing, named-entity recognition, and natural language inference. CamemBERT improves the state of the art for most of the tasks considered. We release the pretrained model for CamemBERT hoping to foster research and downstream applications for French NLP.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
URL:
https://hal.inria.fr/hal-02445946
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
1
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern