DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Quantifying the value of pronunciation lexicons for keyword search in low resource languages
In: http://www.clsp.jhu.edu/%7Eguoguo/papers/icassp2013_lexicon_value.pdf (2013)
BASE
Show details
2
USING PROXIES FOR OOV KEYWORDS IN THE KEYWORD SEARCH TASK
In: http://www.clsp.jhu.edu/%7Eguoguo/papers/asru2013_proxy_keyword.pdf
Abstract: We propose a simple but effective weighted finite state transducer (WFST) based framework for handling out-of-vocabulary (OOV) keywords in a speech search task. State-of-the-art large vocabulary continuous speech recognition (LVCSR) and keyword search (KWS) systems are developed for conversational telephone speech in Tagalog. Word-based and phone-based indexes are created from word lattices, the latter by using the LVCSR system’s pronunciation lexicon. Pronunciations of OOV keywords are hypothesized via a standard grapheme-to-phoneme method. In-vocabulary prox-ies (word or phone sequences) are generated for each OOV keyword using WFST techniques that permit incorporation of a phone confusion matrix. Empirical results when searching for the Babel/NIST evaluation keywords in the Babel 10 hour development-test speech collection show that (i) searching for word proxies in the word index significantly outperforms searching for phonetic representations of OOV words in a phone index, and (ii) while phone confusion information yields minor improvement when searching a phone index, it yields up to 40 % improvement in actual term weighted value when searching a word index with word proxies.
Keyword: Index Terms — Speech Recognition; Keyword Search; Low Resource LVCSR. 1. SEARCHING FOR OOVWORDS IN SPEECH Keyword search (KWS) for spoken documents ha; OOV Keywords; Proxy Keywords
URL: http://www.clsp.jhu.edu/%7Eguoguo/papers/asru2013_proxy_keyword.pdf
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.676.4085
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern