DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 23

1
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
BASE
Show details
2
ASR Rescoring and Confidence Estimation with ELECTRA ...
BASE
Show details
3
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language ...
BASE
Show details
4
Multilingual End-to-End Speech Translation ...
BASE
Show details
5
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR ...
Abstract: Acoustic-to-word (A2W) end-to-end automatic speech recognition (ASR) systems have attracted attention because of an extremely simplified architecture and fast decoding. To alleviate data sparseness issues due to infrequent words, the combination with an acoustic-to-character (A2C) model is investigated. Moreover, the A2C model can be used to recover out-of-vocabulary (OOV) words that are not covered by the A2W model, but this requires accurate detection of OOV words. A2W models learn contexts with both acoustic and transcripts; therefore they tend to falsely recognize OOV words as words in the vocabulary. In this paper, we tackle this problem by using external language models (LM), which are trained only with transcriptions and have better linguistic information to detect OOV words. The A2C model is used to resolve these OOV words. Experimental evaluations show that external LMs have the effects of not only reducing errors but also increasing the number of detected OOV words, and the proposed method ... : SLT2018 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/1909.09993
https://dx.doi.org/10.48550/arxiv.1909.09993
BASE
Hide details
6
Transfer learning of language-independent end-to-end ASR with language model fusion ...
BASE
Show details
7
Lexicon optimization based on discriminative learning for automatic speech recognition of agglutinative language
In: Speech communication. - Amsterdam [u.a.] : Elsevier 60 (2014), 78-87
OLC Linguistik
Show details
8
Substring-based machine translation
In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 27 (2013) 2, 139-166
OLC Linguistik
Show details
9
A monotonic statistical machine translation approach to speaking style transformation
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 26 (2012) 5, 349-370
BLLDB
OLC Linguistik
Show details
10
Robust speech recognition based on dereverberation: parameter optimization using acoustic model likelihood
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 7, 1708-1716
BLLDB
OLC Linguistik
Show details
11
Statistical transformation of language and pronunciation models for spontaneous speech recognition
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 6, 1539-1549
BLLDB
Show details
12
Bayes risk-based dialogue management for document retrieval system with speech interface
In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 1, 61-71
BLLDB
OLC Linguistik
Show details
13
Bayes risk-based dialogue management for document retrieval system with speech interface
In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 1, 61-71
OLC Linguistik
Show details
14
Computer assisted language learning system based on dynamic question generation and error prediction for automatic speech recognition
In: Speech communication. - Amsterdam [u.a.] : Elsevier 51 (2009) 10, 995-1005
BLLDB
OLC Linguistik
Show details
15
Out-of-domain utterance detection using classification confidences of multiple topics
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 1, 150-161
BLLDB
Show details
16
Dialogue strategy to clarify user's queries for document retrieval system with speech interface
In: Speech communication. - Amsterdam [u.a.] : Elsevier 48 (2006) 9, 1137-1150
BLLDB
Show details
17
User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance [<Journal>]
Komatani, Kazunori [Verfasser]; Ueno, Shinichi [Verfasser]; Kawahara, Tatsuya [Verfasser].
DNB Subject Category Language
Show details
18
Speaker model selection based on the Bayesian Information Criterion applied to unsupervised speaker indexing
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 13 (2005) 4, 583-592
BLLDB
OLC Linguistik
Show details
19
Spoken language systems
Kawahara, Tatsuya (Hrsg.); Nakagawa, Seiichi (Hrsg.); Okada, Michio (Hrsg.). - Tokyo [u.a.] : Ohmsha [u.a.], 2005
BLLDB
UB Frankfurt Linguistik
Show details
20
Spontaneous speech processing
Furui, Sadaoki (Hrsg.); Beckman, Mary E. (Hrsg.); Hirschberg, Julia (Hrsg.)...
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
BLLDB
Show details

Page: 1 2

Catalogues
1
0
8
0
1
0
0
Bibliographies
13
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern