Korean Telephone Conversations Lexicon
Abstract:
*Introduction*
Korean Telephone Conversations Lexicon was produced by the Linguistic Data Consortium (LDC), catalog number LDC2003L02, ISBN 1-58563-265-1. The lexicon consists of 25,251 words and contains separate fields with phonological, morphological, and frequency information for each word. It covers the tokens occurring in 100 telephone conversations transcribed and published as Korean Telephone Conversations Transcripts, with 100% token coverage. The corresponding speech is published as Korean Telephone Conversations Speech.

*Data*
The lexicon contains five tab-separated information fields:
* orthographic form in Hangul (head-word), encoded in the KSC-5601 (Wansung) system
* orthographic form in Yale romanization
* pronunciation
* frequency of the word in Korean Telephone Conversations Transcripts
* morphological analysis of the word

A sample page from the lexicon is available in txt and gif form.

*Updates*
There are no updates available at this time.
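The five tab-separated fields listed under *Data* could be read with a short Python sketch like the one below. It is an illustration under stated assumptions, not the LDC's own tooling: it assumes the fields appear in the order listed, that the frequency field is a plain integer, and that Python's `euc_kr` codec is an adequate decoder for the KSC-5601 (Wansung) encoding. The `LexiconEntry` and `parse_lexicon` names are hypothetical, and the two sample rows are invented placeholders, not entries from the lexicon.

```python
from dataclasses import dataclass
from typing import Iterator


@dataclass
class LexiconEntry:
    """One lexicon row; field order is assumed to match the catalog description."""
    hangul: str          # head-word in Hangul
    yale: str            # Yale romanization
    pronunciation: str   # pronunciation field, kept as an opaque string
    frequency: int       # count in the transcripts (assumed to be an integer)
    morphology: str      # morphological analysis, kept as an opaque string


def parse_lexicon(raw: bytes, encoding: str = "euc_kr") -> Iterator[LexiconEntry]:
    """Decode raw lexicon bytes and yield one entry per non-empty line."""
    text = raw.decode(encoding)
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # skip blank lines
        fields = line.split("\t")
        if len(fields) != 5:
            raise ValueError(f"line {line_no}: expected 5 fields, got {len(fields)}")
        hangul, yale, pronunciation, frequency, morphology = fields
        yield LexiconEntry(hangul, yale, pronunciation, int(frequency), morphology)


# Invented placeholder rows (NOT real lexicon data), encoded the way the
# lexicon file is assumed to be stored on disk.
sample = "가\tka\tka\t3\tka\n나\tna\tna\t5\tna\n".encode("euc_kr")
entries = list(parse_lexicon(sample))
```

For a real file, the bytes would come from something like `open(path, "rb").read()`, where `path` points at the licensed lexicon file. The pronunciation and morphology fields are left as raw strings because their internal notation is not described here.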
Keyword:
Korean language
URL: https://catalog.ldc.upenn.edu/LDC2003L02