1. The corpora they are a-changing: a case study in Italian newspapers
In: Basile, Pierpaolo orcid:0000-0002-0545-1105, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2021) The corpora they are a-changing: a case study in Italian newspapers. In: 2nd International Workshop on Computational Approaches to Historical Language Change 2021, Online. (2021)
2. Extracting Relations from Italian Wikipedia using Self-Training ...
3. Extracting Relations from Italian Wikipedia using Self-Training ...
4. Extracting Relations from Italian Wikipedia using Self-Training ...
Abstract:
This dataset contains relations extracted from the Italian Wikipedia by the WikiOIE framework. WikiOIE relies on UDPipe and the Universal Dependencies project for text processing, and makes it easy to customize the information extraction (IE) approach used to automatically extract (subject, predicate, object) triples. The relations in this dataset were extracted by a supervised approach based on self-training. The output of the extraction process is provided in JSON format. Version 2 of the dataset was extracted using an improved version of the learning algorithm; its files are identified by the suffix "_reg" in the file name. More information and the Java code are available here: https://github.com/pippokill/WikiOIE
Self-training approach: Lucia Siciliani, Pierluigi Cassotti, Pierpaolo Basile, Marco de Gemmis, Pasquale Lops, and Giovanni Semeraro. 2021. Extracting Relations from Italian Wikipedia using Self-Training. In Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021). CEUR-WS. ...
Keywords:
Italian Wikipedia; open information extraction; self-training; Wikipedia
URL: https://dx.doi.org/10.5281/zenodo.5655028 https://zenodo.org/record/5655028
5. DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 Diachronic Lexical Semantics (DIACR-Ita) task
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 Diachronic Lexical Semantics (DIACR-Ita) task. In: Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, 17 Dec 2020, Online. (2020)
6. A diachronic Italian corpus based on “L’Unità”
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545, Caselli, Tommaso orcid:0000-0003-2936-0256, Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) A diachronic Italian corpus based on “L’Unità”. In: Seventh Italian Conference on Computational Linguistics, 1-3 Mar 2021, Bologna (Online). (2020)
7. GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering
In: Cassotti, Pierluigi, Caputo, Annalina orcid:0000-0002-7144-8545, Polignano, Marco orcid:0000-0002-3939-0136 and Basile, Pierpaolo orcid:0000-0002-0545-1105 (2020) GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering. In: Fourteenth Workshop on Semantic Evaluation, Dec 2020, Barcelona (Online). (2020)
8. GM-CTSC at SemEval-2020 Task 1: Gaussian Mixtures Cross Temporal Similarity Clustering ...