DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 21

1
Catalan Government Crawling ...
BASE
Show details
2
Catalan General Crawling ...
BASE
Show details
3
Catalan General Crawling ...
BASE
Show details
4
Catalan Government Crawling ...
BASE
Show details
5
Catalan General Crawling ...
BASE
Show details
6
Catalan Government Crawling ...
BASE
Show details
7
Catalan Government Crawling ...
BASE
Show details
8
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan ...
Abstract: Multilingual language models have been a crucial breakthrough as they considerably reduce the need of data for under-resourced languages. Nevertheless, the superiority of language-specific models has already been proven for languages having access to large amounts of data. In this work, we focus on Catalan with the aim to explore to what extent a medium-sized monolingual language model is competitive with state-of-the-art large multilingual models. For this, we: (1) build a clean, high-quality textual Catalan corpus (CaText), the largest to date (but only a fraction of the usual size of the previous work in monolingual language models), (2) train a Transformer-based language model for Catalan (BERTa), and (3) devise a thorough evaluation in a diversity of settings, comprising a complete array of downstream tasks, namely, Part of Speech Tagging, Named Entity Recognition and Classification, Text Classification, Question Answering, and Semantic Textual Similarity, with most of the corresponding datasets being ... : Accepted into Findings of ACL-IJCNLP 2021 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2107.07903
https://arxiv.org/abs/2107.07903
BASE
Hide details
9
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? {A} Comprehensive Assessment for {C}atalan ...
BASE
Show details
10
BioASQ at CLEF2020: Large-Scale Biomedical Semantic Indexing and Question Answering
BASE
Show details
11
MeSpEn_Parallel-Corpora ...
BASE
Show details
12
MeSpEn_Parallel-Corpora ...
BASE
Show details
13
Designing a new platform for AD analyses : the VIW project
BASE
Show details
14
Building an audio description multilingual multimodal corpus : the VIW project
BASE
Show details
15
One ontology to bind them all: The META-SHARE OWL ontology for the interoperability of linguistic datasets on the Web ...
BASE
Show details
16
LMF experiments on format conversions for resource merging : converters and problems
In: LMF — Lexical Markup Framework (London, 2013), p. 187-200
MPI für Psycholinguistik
Show details
17
Dictionaries, thesauri and lexical-semantic relations
Fontenelle, Thierry (Hrsg.); Lenci, Alessandro (Mitarb.); Bel, Núria (Mitarb.)...
In: International journal of lexicography. - Oxford : Oxford Univ. Press 13 (2000) 4, 229-312
BLLDB
Show details
18
SIMPLE: A GENERAL FRAMEWORK FOR THE DEVELOPMENT OF MULTILINGUAL LEXICONS
Lenci, Alessandro; Bel, Nuria; Busa, Federica. - : Oxford University Press, 2000
BASE
Show details
19
El fenómeno del control y la complementación verbal en HPSG para el español
In: Panorama de la investigació lingüística a l'estat espanyol ; 2. Comunicacions. - València : Univ., Dep. de Teoria dels Llenguatges (1994), 244-252
BLLDB
Show details
20
Cross-lingual text categorization
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
2
0
0
0
0
0
0
0
1
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
18
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern