
Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Quantifying the value of pronunciation lexicons for keyword search in low resource languages
In: http://www.clsp.jhu.edu/%7Eguoguo/papers/icassp2013_lexicon_value.pdf (2013)
Abstract: This paper quantifies the value of pronunciation lexicons in large vocabulary continuous speech recognition (LVCSR) systems that support keyword search (KWS) in low resource languages. State-of-the-art LVCSR and KWS systems are developed for conversational telephone speech in Tagalog, and the baseline lexicon is augmented via three different grapheme-to-phoneme models that yield increasing coverage of a large Tagalog word-list. It is demonstrated that while the increased lexical coverage, or reduced out-of-vocabulary (OOV) rate, leads to only modest (ca. 1%-4%) improvements in word error rate, the concomitant improvements in actual term weighted value are as much as 60%. It is also shown that incorporating the augmented lexicons into the LVCSR system before indexing speech is superior to using them post facto, e.g., for approximate phonetic matching of OOV keywords in pre-indexed lattices. These results underscore the disproportionate importance of automatic lexicon augmentation for KWS in morphologically rich languages, and advocate for using them early in the LVCSR stage.
Keyword: Speech Recognition; Information Retrieval; Keyword Search; large; Morphology; Speech Synthesis
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.674.401
http://www.clsp.jhu.edu/%7Eguoguo/papers/icassp2013_lexicon_value.pdf
BASE
2
Sequence-discriminative training of deep neural networks
In: http://www.cstr.ed.ac.uk/downloads/publications/2013/is13-dnn_seq.pdf (2013)
BASE
3
The Kaldi speech recognition toolkit
In: http://publications.idiap.ch/downloads/papers/2012/Povey_ASRU2011_2011.pdf (2011)
BASE
4
The Kaldi speech recognition toolkit
In: http://danielpovey.com/files/2011_asru_kaldi.pdf (2011)
BASE
5
Multilingual Acoustic Modeling for Speech Recognition based on Subspace Gaussian Mixture Models
In: http://www.clsp.jhu.edu/%7Esamuel/pdfs/sgmm_multiling.pdf (2010)
BASE
6
Using proxies for OOV keywords in the keyword search task
In: http://www.clsp.jhu.edu/%7Eguoguo/papers/asru2013_proxy_keyword.pdf
BASE
7
Approaches to automatic lexicon learning with limited training examples
In: http://www.clsp.jhu.edu/%7Esamuel/pdfs/sgmm_lexicon.pdf
BASE
8
Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
In: http://www.lsv.uni-saarland.de/personalPages/aghoshal/pubs/icassp10-multiling.pdf
BASE
9
Some insights from translating conversational telephone speech
In: http://cs.jhu.edu/~post/papers/kumar2013some.pdf
BASE

Catalogues: 0
Bibliographies: 0
Linked Open Data catalogues: 0
Online resources: 0
Open access documents: 9
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Change privacy settings