1 |
The corpora they are a-changing: a case study in Italian newspapers
|
|
|
|
In: Basile, Pierpaolo orcid:0000-0002-0545-1105, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2021) The corpora they are a-changing: a case study in Italian newspapers. In: 2nd International Workshop on Computational Approaches to Historical Language Change 2021, Online. (2021)
|
|
Abstract:
The use of automatic methods for the study of lexical semantic change (LSC) has led to the creation of evaluation benchmarks. Benchmark datasets, however, are intimately tied to the corpus used for their creation, which calls into question their reliability as well as the robustness of automatic methods. This contribution investigates these aspects, showing the impact of unforeseen social and cultural dimensions. We also identify a set of additional issues (OCR quality, named entities) that affect the performance of the automatic methods, especially when they are used to discover LSC.
|
|
Keyword:
Computational linguistics
|
|
URL: http://doras.dcu.ie/26587/
|
|
BASE
|
|
|
|
2 |
Extracting Relations from Italian Wikipedia using Self-Training ...
|
|
|
|
BASE
|
|
|
|
3 |
DUKweb: Diachronic word representations from the UK Web Archive corpus ...
|
|
|
|
BASE
|
|
|
|
4 |
Extracting Relations from Italian Wikipedia using Self-Training ...
|
|
|
|
BASE
|
|
|
|
5 |
Extracting Relations from Italian Wikipedia using Self-Training ...
|
|
|
|
BASE
|
|
|
|
6 |
DUKweb: Diachronic word representations from the UK Web Archive corpus ...
|
|
|
|
BASE
|
|
|
|
7 |
DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 Diachronic Lexical semantics (DIACR-Ita) task
|
|
|
|
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 Diachronic Lexical semantics (DIACR-Ita) task. In: Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, 17 Dec 2020, Online. (2020)
|
|
BASE
|
|
|
|
8 |
A diachronic Italian corpus based on “L’Unità”
|
|
|
|
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) A diachronic Italian corpus based on “L’Unità”. In: Seventh Italian Conference on Computational Linguistics, 1-3 Mar 2021, Bologna (Online). (2020)
|
|
BASE
|
|
|
|
9 |
GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering
|
|
|
|
In: Cassotti, Pierluigi, Caputo, Annalina orcid:0000-0002-7144-8545, Polignano, Marco orcid:0000-0002-3939-0136 and Basile, Pierpaolo orcid:0000-0002-0545-1105 (2020) GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering. In: Fourteenth Workshop on Semantic Evaluation, Dec 2020, Barcelona (Online). (2020)
|
|
BASE
|
|
|
|
10 |
GM-CTSC at SemEval-2020 Task 1: Gaussian Mixtures Cross Temporal Similarity Clustering ...
|
|
|
|
BASE
|
|
|
|
12 |
Diachronic Analysis of Entities by Exploiting Wikipedia Page revisions ...
|
|
|
|
BASE
|
|
|
|
13 |
Diachronic Analysis of Entities by Exploiting Wikipedia Page revisions ...
|
|
|
|
BASE
|
|
|
|
16 |
Exploiting the Web for Semantic Change Detection
|
|
McGillivray, Barbara; Basile, Pierpaolo. Springer Link, 2018. https://link.springer.com/chapter/10.1007/978-3-030-01771-2_13#aboutcontent. In: 21st International Conference, DS 2018, Limassol, Cyprus, October 29–31, 2018, Proceedings.
|
|
BASE
|
|
|
|
17 |
Entity linking for Tweets
|
|
|
|
In: Basile, Pierpaolo orcid:0000-0002-0545-1105 and Caputo, Annalina orcid:0000-0002-7144-8545 (2017) Entity linking for Tweets. Encyclopedia with Semantic Computing and Robotic Intelligence, 1 (1). pp. 1-9. ISSN 2529-7376 (2017)
|
|
BASE
|
|
|
|
18 |
EVALITA Goes Social: Tasks, Data, and Community at the 2016 Edition
|
|
|
|
BASE
|
|
|
|
19 |
Argument Mining on Italian News Blogs
|
|
|
|
In: Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016), Dec 2016, Naples, Italy (2016) ; https://hal.inria.fr/hal-01414698
|
|
BASE
|
|
|
|
20 |
EVALITA 2016: Overview of the 5th evaluation campaign of natural language processing and speech tools for Italian
|
|
|
|
BASE
|
|
|
|
|
|