Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 23

1	Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
	Inaguma, Hirofumi; Kawahara, Tatsuya; Watanabe, Shinji. - : arXiv, 2021
	BASE
	Show details

2	ASR Rescoring and Confidence Estimation with ELECTRA ...
	Futami, Hayato; Inaguma, Hirofumi; Mimura, Masato. - : arXiv, 2021
	BASE
	Show details

3	Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language ...
	Matsuura, Kohei; Ueno, Sei; Mimura, Masato. - : arXiv, 2020
	BASE
	Show details

4	Multilingual End-to-End Speech Translation ...
	Inaguma, Hirofumi; Duh, Kevin; Kawahara, Tatsuya. - : arXiv, 2019
	BASE
	Show details

5	Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR ...
	Inaguma, Hirofumi; Mimura, Masato; Sakai, Shinsuke; Kawahara, Tatsuya. - : arXiv, 2019
	Abstract: Acoustic-to-word (A2W) end-to-end automatic speech recognition (ASR) systems have attracted attention because of an extremely simplified architecture and fast decoding. To alleviate data sparseness issues due to infrequent words, the combination with an acoustic-to-character (A2C) model is investigated. Moreover, the A2C model can be used to recover out-of-vocabulary (OOV) words that are not covered by the A2W model, but this requires accurate detection of OOV words. A2W models learn contexts with both acoustic and transcripts; therefore they tend to falsely recognize OOV words as words in the vocabulary. In this paper, we tackle this problem by using external language models (LM), which are trained only with transcriptions and have better linguistic information to detect OOV words. The A2C model is used to resolve these OOV words. Experimental evaluations show that external LMs have the effects of not only reducing errors but also increasing the number of detected OOV words, and the proposed method ... : SLT2018 ...
	Keyword: Computation and Language cs.CL; FOS Computer and information sciences
	URL: https://arxiv.org/abs/1909.09993 https://dx.doi.org/10.48550/arxiv.1909.09993
	BASE
	Hide details

6	Transfer learning of language-independent end-to-end ASR with language model fusion ...
	Inaguma, Hirofumi; Cho, Jaejin; Baskar, Murali Karthick. - : arXiv, 2018
	BASE
	Show details

7	Lexicon optimization based on discriminative learning for automatic speech recognition of agglutinative language
	Ablimit, Mijit; Kawahara, Tatsuya; Hamdulla, Askar
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 60 (2014), 78-87
	OLC Linguistik
	Show details

8	Substring-based machine translation
	Neubig, Graham; Watanabe, Taro; Mori, Shinsuke...
	In: Machine translation. - Dordrecht [u.a.] : Springer Science + Business Media 27 (2013) 2, 139-166
	OLC Linguistik
	Show details

9	A monotonic statistical machine translation approach to speaking style transformation
	Kawahara, Tatsuya; Neubig, Graham; Akita, Yuya...
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 26 (2012) 5, 349-370
	BLLDB
	OLC Linguistik
	Show details

10	Robust speech recognition based on dereverberation: parameter optimization using acoustic model likelihood
	Kawahara, Tatsuya; Gomez, Randy
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 7, 1708-1716
	BLLDB
	OLC Linguistik
	Show details

11	Statistical transformation of language and pronunciation models for spontaneous speech recognition
	Akita, Yuya; Kawahara, Tatsuya
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 6, 1539-1549
	BLLDB
	Show details

12	Bayes risk-based dialogue management for document retrieval system with speech interface
	Kawahara, Tatsuya; Misu, Teruhisa
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 1, 61-71
	BLLDB
	OLC Linguistik
	Show details

13	Bayes risk-based dialogue management for document retrieval system with speech interface
	Misu, Teruhisa; Kawahara, Tatsuya
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 52 (2010) 1, 61-71
	OLC Linguistik
	Show details

14	Computer assisted language learning system based on dynamic question generation and error prediction for automatic speech recognition
	Waple, Christopher J.; Kawahara, Tatsuya; Wang, Hongcui
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 51 (2009) 10, 995-1005
	BLLDB
	OLC Linguistik
	Show details

15	Out-of-domain utterance detection using classification confidences of multiple topics
	Lane, Ian; Kawahara, Tatsuya; Matsui, Tomoko...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 1, 150-161
	BLLDB
	Show details

16	Dialogue strategy to clarify user's queries for document retrieval system with speech interface
	Misu, Teruhisa; Kawahara, Tatsuya
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 48 (2006) 9, 1137-1150
	BLLDB
	Show details

17	User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance [<Journal>]
	Komatani, Kazunori [Verfasser]; Ueno, Shinichi [Verfasser]; Kawahara, Tatsuya [Verfasser].
	DNB Subject Category Language
	Show details

18	Speaker model selection based on the Bayesian Information Criterion applied to unsupervised speaker indexing
	Nishida, Masafumi; Kawahara, Tatsuya
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 13 (2005) 4, 583-592
	BLLDB
	OLC Linguistik
	Show details

19	Spoken language systems
	Kawahara, Tatsuya (Hrsg.); Nakagawa, Seiichi (Hrsg.); Okada, Michio (Hrsg.). - Tokyo [u.a.] : Ohmsha [u.a.], 2005
	BLLDB
	UB Frankfurt Linguistik
	Show details

20	Spontaneous speech processing
	Furui, Sadaoki (Hrsg.); Beckman, Mary E. (Hrsg.); Hirschberg, Julia (Hrsg.)...
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 12 (2004) 4, 349-445
	BLLDB
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern