11 |
GALE Phase 4 Arabic Broadcast Conversation Speech ...
|
|
|
|
Abstract:
Introduction GALE Phase 4 Arabic Broadcast Conversation Speech was developed by the Linguistic Data Consortium (LDC) and is comprised of approximately 75 hours of Arabic broadcast conversation speech collected in 2008 and 2009 by LDC, MediaNet, Tunis, Tunisia and MTC, Rabat, Morocco during Phase 4 of the DARPA GALE (Global Autonomous Language Exploitation) Program. Corresponding transcripts are released as GALE Phase 4 Arabic Broadcast Conversation Transcripts (LDC2017T12). Broadcast audio for the GALE program was collected at LDC’s Philadelphia, PA USA facilities and at three remote collection sites: Hong Kong University of Science and Technology (HKUST), Hong Kong (Chinese), Medianet (Tunis, Tunisia) (Arabic), and MTC (Rabat, Morocco) (Arabic). The combined local and outsourced broadcast collection supported GALE at a rate of approximately 300 hours per week of programming from more than 50 broadcast sources ...
|
|
URL: https://dx.doi.org/10.35111/mr99-m774 https://catalog.ldc.upenn.edu/LDC2017S15
|
|
BASE
|
|
Hide details
|
|
17 |
GALE Phase 3 Arabic Broadcast Conversation Transcripts Part 2
|
|
|
|
BASE
|
|
Show details
|
|
|
|