1 |
Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Lexical speaker identification in TV shows
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
|
|
BASE
|
|
Show details
|
|
4 |
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
|
|
|
|
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
|
|
Abstract:
International audience ; For languages with limited training resources, out-of-vocabulary (OOV) words are a significant problem, both fortranscription and keyword spotting. This paper investigates theuse of subword lexical units for keyword spotting. Three strate-gies for using the sub-word units are explored: 1) convertingword-based lattices to subword lattices after decoding, 2) per-forming a separate decoding for each subword type, and 3) asingle decoding using all possible subword units. In these ex-periments, the best performance is achieved by carrying out aseparate decoding for each subword type. Further gains are at-tained through system combination. We also find that ignor-ing word boundaries improves the detection of OOV keywordswithout significantly impacting in-vocabulary keyword detec-tion. Results are presented on four languages from the IARPABabel Program (Haitian Creole, Assamese, Bengali, and Zulu).
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; keyword search; low resource LVCSR; OOV; spoken term detection; sub-word lexical units
|
|
URL: https://hal.archives-ouvertes.fr/hal-01843408
|
|
BASE
|
|
Hide details
|
|
5 |
Efficient Rule Scoring for Improved Grapheme-Based Lexicons
|
|
|
|
In: European Signal Processing Conference ; https://hal.archives-ouvertes.fr/hal-01843411 ; European Signal Processing Conference, Jan 2014, Lisbon, Portugal (2014)
|
|
BASE
|
|
Show details
|
|
6 |
Cross-Word Sub-Word Units for Low-Resource Keyword Spotting
|
|
|
|
In: International Workshop on Spoken Languages Technologies for Under-resourced languages ; https://hal.archives-ouvertes.fr/hal-01843415 ; International Workshop on Spoken Languages Technologies for Under-resourced languages, May 2014, St. Petersburg, Russia (2014)
|
|
BASE
|
|
Show details
|
|
7 |
Efficient Rule Scoring For Improved Grapheme-Based Lexicons ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Acoustic unit discovery and pronunciation generation from a grapheme-based lexicon
|
|
|
|
In: IEEE Automatic Speech Recognition and Understanding Workshop ; https://hal.archives-ouvertes.fr/hal-01843433 ; IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2013, Olomouc, Czech Republic (2013)
|
|
BASE
|
|
Show details
|
|
|
|