3 |
YouDACC: the Youtube Dialectal Arabic Commentary Corpus ...
|
|
|
|
Abstract:
In the Arab world, while Modern Standard Arabic is commonly used in formal written context, on sites like Youtube, people are increasingly using Dialectal Arabic, the language for everyday use to comment on a video and interact with the community. These user-contributed comments along with the video and user attributes, offer a rich source of multi-dialectal Arabic sentences and expressions from different countries in the Arab world. This paper presents YOUDACC, an automatically annotated large-scale multi-dialectal Arabic corpus collected from user comments on Youtube videos. Our corpus covers different groups of dialects: Egyptian (EG), Gulf (GU), Iraqi (IQ), Maghrebi (MG) and Levantine (LV). We perform an empirical analysis on the crawled corpus and demonstrate that our location-based proposed method is effective for the task of dialect labeling. ...
|
|
Keyword:
80107 Natural Language Processing; FOS Computer and information sciences
|
|
URL: https://kilthub.cmu.edu/articles/YouDACC_the_Youtube_Dialectal_Arabic_Commentary_Corpus/6373124 https://dx.doi.org/10.1184/r1/6373124
|
|
BASE
|
|
Hide details
|
|
6 |
MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Dependency Parsing with an Extended Finite-State Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Dependency Parsing with an Extended Finite-State Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
A Pilot Study on Arabic Multi-Genre Corpus Diacritization Annotation ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
A Pilot Study on Arabic Multi-Genre Corpus Diacritization Annotation ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
QCMUQ@QALB-2015 Shared Task: Combining Character level MT and Error-tolerant Finite-State Recognition for Arabic Spelling Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
QCMUQ@QALB-2015 Shared Task: Combining Character level MT and Error-tolerant Finite-State Recognition for Arabic Spelling Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Domain and Dialect Adaptation for Machine Translation into Egyptian Arabic ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
A Human Judgment Corpus and a Metric for Arabic MT Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
A Human Judgment Corpus and a Metric for Arabic MT Evaluation ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|