5 |
HAVIC MED Novel 1 Test -- Videos, Metadata and Annotation ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010
|
|
|
|
BASE
|
|
Show details
|
|
13 |
TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014
|
|
|
|
BASE
|
|
Show details
|
|
14 |
TAC KBP English Sentiment Slot Filling -- Comprehensive Training and Evaluation Data 2013-2014 ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
HAVIC MED Training Data -- Videos, Metadata and Annotation ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
TAC KBP English Surprise Slot Filling -- Comprehensive Training and Evaluation Data 2010 ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
BOLT Egyptian Arabic SMS/Chat Parallel Training Data ...
|
|
|
|
Abstract:
Introduction BOLT Egyptian Arabic SMS/Chat Parallel Training Data was developed by LDC and consists of approximately 723,000 tokens of Egyptian Arabic SMS/Chat data collected for the DARPA BOLT program along with their corresponding English translations. The DARPA BOLT (Broad Operational Language Translation) program developed machine translation and information retrieval for less formal genres, focusing particularly on user-generated content. LDC supported the BOLT program by collecting informal data sources -- discussion forums, text messaging and chat -- in Chinese, Egyptian Arabic and English. The collected data was translated and annotated for various tasks including word alignment, treebanking, propbanking and co-reference. Data The source date in this release was collected using two methods: new collection via LDC's collection platform, and donation of SMS or chat ...
|
|
URL: https://dx.doi.org/10.35111/k4bf-hh16 https://catalog.ldc.upenn.edu/LDC2021T15
|
|
BASE
|
|
Hide details
|
|
|
|