2 |
BOLT Egyptian Arabic Treebank - Conversational Telephone Speech
|
|
|
|
BASE
|
|
Show details
|
|
3 |
BOLT Egyptian Arabic Treebank - SMS/Chat ...
|
|
|
|
Abstract:
Introduction BOLT Egyptian Arabic Treebank - SMS/Chat, Linguistic Data Consortium (LDC) catalog number LDC2021T17 and ISBN 1-58563-976-1, was developed by LDC and consists of Egyptian Arabic SMS/Chat data with part-of-speech annotation, morphology, and syntactic tree annotation. The DARPA BOLT (Broad Operational Language Translation) program developed machine translation and information retrieval for less formal genres, focusing particularly on user-generated content. LDC supported the BOLT program by collecting informal data sources -- discussion forums, text messaging and chat -- in Chinese, Egyptian Arabic and English. The collected data was translated and annotated for various tasks including word alignment, treebanking, propbanking and co-reference. The unannotated Egyptian Arabic source data is released as BOLT Egyptian Arabic SMS/Chat and Transliteration ...
|
|
URL: https://dx.doi.org/10.35111/wksr-ks46 https://catalog.ldc.upenn.edu/LDC2021T17
|
|
BASE
|
|
Hide details
|
|
4 |
BOLT Egyptian Arabic Treebank - Conversational Telephone Speech ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1
|
|
|
|
BASE
|
|
Show details
|
|
19 |
LDC Standard Arabic Morphological Analyzer (SAMA) Version 3.1 ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|