7 |
Treebanking user-generated content: a proposal for a unified representation in universal dependencies
|
|
Sanguinetti, Manuela; Bosco, Cristina; Cassidy, Lauren; Çetinoglu, Özlem; Cignarella, Alessandra Teresa; Lynn, Teresa; Rehbein, Ines; Ruppenhofer, Josef; Seddah, Djamé; Zeldes, Amir
|
|
In: Sanguinetti, Manuela orcid:0000-0002-0147-2208 , Bosco, Cristina, Cassidy, Lauren, Çetinoglu, Özlem, Cignarella, Alessandra Teresa orcid:0000-0002-4409-6679 , Lynn, Teresa, Rehbein, Ines, Ruppenhofer, Josef, Seddah, Djamé and Zeldes, Amir orcid:0000-0001-8016-6753 (2020) Treebanking user-generated content: a proposal for a unified representation in universal dependencies. In: 12th Language Resources and Evaluation Conference. (LREC 2020), 11-16 May 2020, Marseille, France. (2020)
|
|
Abstract:
The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework. Given on the one hand the increasing number of treebanks featuring user-generated content, and its somewhat inconsistent treatment in these resources on the other, the aim of this paper is twofold: (1) to provide a short, though comprehensive, overview of such treebanks - based on available literature - along with their main features and a comparative analysis of their annotation criteria, and (2) to propose a set of tentative UD-based annotation guidelines, to promote consistent treatment of the particular phenomena found in these types of texts. The main goal of this paper is to provide a common framework for those teams interested in developing similar resources in UD, thus enabling cross-linguistic consistency, which is a principle that has always been in the spirit of UD.
|
|
Keyword:
Artificial intelligence; Irish language; Linguistics; Machine learning
|
|
URL: http://doras.dcu.ie/24477/
|
|
BASE
|
|
Hide details
|
|
8 |
Treebanking user-generated content: a proposal for a unified representation in universal dependencies
|
|
|
|
In: Sanguinetti, Manuela orcid:0000-0002-0147-2208 , Bosco, Cristina, Cassidy, Lauren, Çetinoglu, Özlem, Cignarella, Alessandra Teresa orcid:0000-0002-4409-6679 , Lynn, Teresa, Rehbein, Ines, Ruppenhofer, Josef, Seddah, Djamé and Zeldes, Amir orcid:0000-0001-8016-6753 (2020) Treebanking user-generated content: a proposal for a unified representation in universal dependencies. In: 12th Language Resources and Evaluation Conference. (LREC 2020), 11-16 May 2020, Marseille, France. (Virtual). (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Treebanking User-Generated Content: a UD Based Overview of Guidelines, Corpora and Unified Recommendations ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Treebanking user-generated content: A proposal for a unified representation in universal dependencies
|
|
|
|
BASE
|
|
Show details
|
|
11 |
“Annexation or Reunification?” Linguistic Appraisal of German and Russian news reporting on Crimea
|
|
Cassidy, Lauren. - : Department of Germanic Languages & Literatures, University of Kansas, 2018
|
|
BASE
|
|
Show details
|
|
12 |
Tapadoir: developing a statistical machine translation engine and associated resources for Irish
|
|
|
|
In: Dowling, Meghan orcid:0000-0003-1637-4923 , Cassidy, Lauren, Maguire, Eimear, Lynn, Teresa, Srivastava, Ankit and Judge, John (2015) Tapadoir: developing a statistical machine translation engine and associated resources for Irish. In: 4th Biennial Workshop on Less-Resourced Languages (LRC 2015), 28 Nov 2015, Poznan, Poland. (2015)
|
|
BASE
|
|
Show details
|
|
13 |
Collaborative Writing Across Distances: An Ethnographic Study of Workplace Writing Across Coasts and Cultures
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies [Online resource]
|
|
|
|
IDS-Repository
|
|
Show details
|
|
16 |
Treebanking user-generated content: a UD based overview of guidelines, corpora and unified recommendations [Online resource]
|
|
|
|
IDS-Repository
|
|
Show details
|
|
|
|