
Search in the Catalogues and Directories

Hits 1 – 20 of 21

1
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
In: https://hal.inria.fr/hal-03536361 ; 2022 (2022)
BASE
2
Lexicographic Data Seal of Compliance
In: https://hal.archives-ouvertes.fr/hal-03344267 ; [Research Report] ELEXIS; DARIAH. 2021 (2021)
BASE
3
Building, Encoding, and Annotating a Corpus of Parliamentary Debates in XML-TEI: A Cross-Linguistic Account
In: https://halshs.archives-ouvertes.fr/halshs-03097333 ; 2020 (2020)
BASE
4
CamemBERT: a Tasty French Language Model
In: https://hal.inria.fr/hal-02445946 ; 2019 (2019)
Abstract: Web site: https://camembert-model.fr ; Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models—in all languages except English—very limited. Aiming to address this issue for French, we release CamemBERT, a French version of the Bi-directional Encoders for Transformers (BERT). We measure the performance of CamemBERT compared to multilingual models in multiple downstream tasks, namely part-of-speech tagging, dependency parsing, named-entity recognition, and natural language inference. CamemBERT improves the state of the art for most of the tasks considered. We release the pretrained model for CamemBERT hoping to foster research and downstream applications for French NLP.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
URL: https://hal.inria.fr/hal-02445946
BASE
5
From disparate disciplines to unity in diversity. How the PARTHENOS project brings Humanities Research Infrastructures together ...
BASE
6
From disparate disciplines to unity in diversity. How the PARTHENOS project brings Humanities Research Infrastructures together ...
BASE
7
LMF Reloaded ...
BASE
8
Automatic TEI encoding of manuscripts catalogues with GROBID-Dictionaries ...
BASE
9
Automatic TEI encoding of manuscripts catalogues with GROBID-Dictionaries ...
BASE
10
Open Access in Japan – a multi-institutional perspective
In: https://hal.archives-ouvertes.fr/hal-01290936 ; [Research Report] Ambassade de France au Japon. 2016 (2016)
BASE
11
Deep encoding of etymological information in TEI ...
Bowers, Jack; Romary, Laurent. - : arXiv, 2016
BASE
12
IPERION CH Data Management Plan
In: https://hal.archives-ouvertes.fr/hal-02139658 ; [Research Report] D 2.1, Inria. 2015 (2015)
BASE
13
Data formats for phonological corpora ...
Romary, Laurent; Witt, Andreas. - : arXiv, 2011
BASE
14
Pepper: Handling A Multiverse Of Formats ...
BASE
15
[Tiger2/] Documentation
In: https://hal.inria.fr/inria-00593903 ; [Technical Report] 2010 (2010)
BASE
16
HANDLING MULTILINGUAL CONTENT IN DIGITAL MEDIA: A CRITICAL ANALYSIS
In: https://hal.inria.fr/inria-00001120 ; [Research Report] 2006, pp.60 (2006)
BASE
17
Unification of multi-lingual scientific terminological resources using the ISO 16642 standard. The TermSciences initiative ...
BASE
18
Towards Multimodal Content Representation
In: https://hal.archives-ouvertes.fr/hal-00323338 ; 2002 (2002)
BASE
19
The ELAN Architecture ; The ELAN Architecture: ELAN Deliverables WP3
In: https://hal.inria.fr/hal-01875371 ; [Contract] Deliverables D3.1-1 and D3.2-1, Inria. 1999 (1999)
BASE
20
A cognitive model for the representation of time in a man-machine dialogue.
In: https://hal.inria.fr/hal-00721871 ; 1989 (1989)
BASE


Hits by source type: Open access documents: 21