17 |
GALE Phase 4 Chinese Broadcast News Transcripts ...
|
|
|
|
Abstract:
Introduction GALE Phase 4 Chinese Broadcast News Transcripts was developed by the Linguistic Data Consortium (LDC) and contains transcriptions of approximately 134 hours of Chinese broadcast news speech collected in 2008 by LDC and Hong University of Science and Technology (HKUST), Hong Kong, during Phase 4 of the DARPA GALE (Global Autonomous Language Exploitation) Program. Corresponding audio data is released as GALE Phase 4 Chinese Broadcast News Speech (LDC2017S25). The broadcast news recordings feature news broadcasts focusing principally on current events from the following sources: China Central TV (CCTV), a national and international broadcaster in Mainland China; Phoenix TV, a Hong Kong-based satellite television station; and Voice of America (VOA), a U.S. government-funded broadcast programmer. Data The transcript files are in plain-text, tab-delimited format (TDF) with UTF-8 encoding, and the transcribed data ...
|
|
URL: https://dx.doi.org/10.35111/20f8-x526 https://catalog.ldc.upenn.edu/LDC2017T18
|
|
BASE
|
|
Hide details
|
|
19 |
GALE Phase 3 Arabic Broadcast Conversation Transcripts Part 2
|
|
|
|
BASE
|
|
Show details
|
|
|
|