DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Toward a unified approach to statistical language modeling for Chinese
In: http://research.microsoft.com/~jfgao/paper/icassp00.pdf (2002)
BASE
Show details
2
Toward a Unified Approach to Statistical Language Modeling for Chinese
In: http://www.research.microsoft.com/~joshuago/talip01.pdf (2002)
BASE
Show details
3
Toward a unified approach to statistical language modeling for Chinese
In: http://research.microsoft.com/~jfgao/paper/cslm.taplip01.pdf (2001)
Abstract: This article presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) there is no standard definition of words in Chinese; (2) word boundaries are not marked by spaces; and (3) there is a dearth of training data. Our unified approach automatically and consistently gathers a high-quality training data set from the Web, creates a high-quality lexicon, segments the training data using this lexicon, and compresses the language model, all by using the maximum likelihood principle, which is consistent with trigram model training. We show that each of the methods leads to improvements over standard SLM, and that the combined method yields the best pinyin conversion result reported.
Keyword: backoff; character; Chinese language; Chinese pinyin-to-character conversion; domain adaptation; Experimentation; General Terms; Human Factors; Languages; lexicon; Measurement Additional Key Words and Phrases; n-gram model; perplexity; pruning; smoothing; Statistical language modeling; word segmentation
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.146.2560
http://research.microsoft.com/~jfgao/paper/cslm.taplip01.pdf
BASE
Hide details
4
Recent progress in robust vocabulary-independent speech recognition
In: http://acl.ldc.upenn.edu/H/H91/H91-1050.pdf (1991)
BASE
Show details
5
Automatic new word acquisition: Spelling from acoustics
In: http://acl.ldc.upenn.edu/H/H89/H89-2036.pdf (1989)
BASE
Show details
6
Discriminative Training on language model
In: http://www.microsoft.com/china/research/group/./downloads/g-nlps/NLPSP/n11.pdf
BASE
Show details
7
Toward a Unified Approach to Statistical Language Modeling for Chinese
In: http://research.microsoft.com/~jfgao/paper/talip01.pdf
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
7
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern