1 |
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus ...
Abstract:
The development of automated approaches to linguistic acceptability has been greatly fostered by the availability of the English CoLA corpus, which has also been included in the widely used GLUE benchmark. However, this kind of research for languages other than English, as well as the analysis of cross-lingual approaches, has been hindered by the lack of resources with a comparable size in other languages. We have therefore developed the ItaCoLA corpus, containing almost 10,000 sentences with acceptability judgments, which has been created following the same approach and the same steps as the English one. In this paper we describe the corpus creation, we detail its content, and we present the first experiments on this new resource. We compare in-domain and out-of-domain classification, and perform a specific evaluation of nine linguistic phenomena. We also present the first cross-lingual experiments, aimed at assessing whether multilingual transformer-based approaches can benefit from using sentences in two ... : Findings of EMNLP 2021. Dataset available at https://github.com/dhfbk/ItaCoLA-dataset ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2109.12053 https://arxiv.org/abs/2109.12053
2 |
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus ...
3 |
Lexicon-Grammar based open information extraction from natural language sentences in Italian
In: ISSN: 0957-4174 ; Expert Systems with Applications ; https://hal.archives-ouvertes.fr/hal-02291746 ; Expert Systems with Applications, Elsevier, 2020, pp.112954. ⟨10.1016/j.eswa.2019.112954⟩ (2020)
4 |
Developing an annotator for Latin texts using Wikipedia
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-01279853 ; Journal of Data Mining and Digital Humanities, Episciences.org, In press (2017)
5 |
A hybrid method for the extraction and classification of product features from user-generated contents ...
6 |
A hybrid method for the extraction and classification of product features from user-generated contents
In: Lingue e Linguaggi; Volume 22 (2017) - Special Issue; 137-168 (2017)