1 |
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR ...
|
|
|
|
Abstract:
Acoustic-to-word (A2W) end-to-end automatic speech recognition (ASR) systems have attracted attention because of an extremely simplified architecture and fast decoding. To alleviate data sparseness issues due to infrequent words, the combination with an acoustic-to-character (A2C) model is investigated. Moreover, the A2C model can be used to recover out-of-vocabulary (OOV) words that are not covered by the A2W model, but this requires accurate detection of OOV words. A2W models learn contexts with both acoustic and transcripts; therefore they tend to falsely recognize OOV words as words in the vocabulary. In this paper, we tackle this problem by using external language models (LM), which are trained only with transcriptions and have better linguistic information to detect OOV words. The A2C model is used to resolve these OOV words. Experimental evaluations show that external LMs have the effects of not only reducing errors but also increasing the number of detected OOV words, and the proposed method ... : SLT2018 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/1909.09993 https://dx.doi.org/10.48550/arxiv.1909.09993
|
|
BASE
|
|
Hide details
|
|
6 |
Transfer learning of language-independent end-to-end ASR with language model fusion ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|