DE eng

Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
BOLT Chinese Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
BASE
Show details
2
BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle; Micciulla, Linnea; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2021. : https://www.ldc.upenn.edu, 2021
Abstract: *Introduction* BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech was developed by Raytheon BBN Technologies and consists of co-reference annotation on Egyptian Arabic discussion forum (DF), SMS/Chat and conversational telephone speech (CTS). The DARPA BOLT (Broad Operational Language Translation) program developed machine translation and information retrieval for less formal genres, focusing particularly on user-generated content. The Linguistic Data Consortium (LDC) supported the BOLT program by collecting informal data sources -- discussion forums, text messaging and chat -- in Chinese, Egyptian Arabic and English. The collected data was translated and annotated for various tasks including word alignment, treebanking, propbanking and co-reference. *Data* DF data was collected from the web using a combination of manual and automatic processes. SMS/Chat material was donated or collected via live platforms. CTS data was taken from LDC's Egyptian Arabic CALLHOME and CALLFRIEND telephone collections. Co-reference annotation aims to fill in all of the connections between specific mentions in the text that refer to the same entities and events in the discourse context. BOLT co-reference annotation was performed on BOLT treebank annotation. It covers noun phrases (including proper nouns, nominals, pronouns and null arguments), possessives, proper noun pre-modifiers and verbs. Annotation files are presented in UTF-8 encoded XML format. *Sponsorship* This material is based upon work supported by the Defense Advanced Research Projects Agency (DARPA) under Contract No. HR0011-11-C-0145. The content does not necessarily reflect the position or the policy of the Government, and no official endorsement should be inferred. *Samples* Please view the following samples: * CTS Sample (TXT) * DF Sample (TXT) * SMS/Chat Sample (TXT) *Updates* None at this time.
URL: https://catalog.ldc.upenn.edu/LDC2021T14
BASE
Hide details
3
BOLT Egyptian Arabic Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech ...
Agarwal, Nitin; Francini, Michelle; Kappler, Michelle. - : Linguistic Data Consortium, 2021
BASE
Show details
4
BOLT Chinese Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech ...
Francini, Michelle; Agarwal, Nitin; Kappler, Michelle. - : Linguistic Data Consortium, 2021
BASE
Show details
5
BOLT English Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech
Agarwal, Nitin; Franchini, Michelle; Kappler, Michelle. - : Linguistic Data Consortium, 2020. : https://www.ldc.upenn.edu, 2020
BASE
Show details
6
BOLT English Co-reference -- Discussion Forum, SMS/Chat, and Conversational Telephone Speech ...
Agarwal, Nitin; Franchini, Michelle; Kappler, Michelle. - : Linguistic Data Consortium, 2020
BASE
Show details
7
OntoNotes Release 5.0
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2013. : https://www.ldc.upenn.edu, 2013
BASE
Show details
8
OntoNotes Release 5.0 ...
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2013
BASE
Show details
9
OntoNotes Release 4.0
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2011. : https://www.ldc.upenn.edu, 2011
BASE
Show details
10
OntoNotes Release 4.0 ...
Weischedel, Ralph; Palmer, Martha; Marcus, Mitchell. - : Linguistic Data Consortium, 2011
BASE
Show details
11
OntoNotes Release 3.0
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2009. : https://www.ldc.upenn.edu, 2009
BASE
Show details
12
OntoNotes Release 3.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2009
BASE
Show details
13
OntoNotes Release 2.0
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2008. : https://www.ldc.upenn.edu, 2008
BASE
Show details
14
OntoNotes Release 2.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2008
BASE
Show details
15
OntoNotes Release 1.0
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2007. : https://www.ldc.upenn.edu, 2007
BASE
Show details
16
OntoNotes Release 1.0 ...
Weischedel, Ralph; Pradhan, Sameer; Ramshaw, Lance. - : Linguistic Data Consortium, 2007
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
16
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern