Hits 1 – 20 of 52

1	Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization ...
	Yan, Brian; Zhang, Chunlei; Yu, Meng. - : arXiv, 2021
	BASE

2	Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
	Inaguma, Hirofumi; Kawahara, Tatsuya; Watanabe, Shinji. - : arXiv, 2021
	BASE

3	Self-Guided Curriculum Learning for Neural Machine Translation ...
	Zhou, Lei; Ding, Liang; Duh, Kevin. - : arXiv, 2021
	BASE

4	Arabic Speech Recognition by End-to-End, Modular Systems and Human ...
	Hussein, Amir; Watanabe, Shinji; Ali, Ahmed. - : arXiv, 2021
	BASE

5	Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec ...
	Shi, Jiatong; Amith, Jonathan D.; García, Rey Castillo. - : arXiv, 2021
	BASE

6	On Prosody Modeling for ASR+TTS based Voice Conversion ...
	Huang, Wen-Chin; Hayashi, Tomoki; Li, Xinjian. - : arXiv, 2021
	BASE

7	Leveraging Pre-trained Language Model for Speech Sentiment Analysis ...
	Shon, Suwon; Brusco, Pablo; Pan, Jing. - : arXiv, 2021
	BASE

8	End-to-end ASR to jointly predict transcriptions and linguistic annotations ...
	NAACL 2021; Fujita, Yuya; Omachi, Motoi. - : Underline Science Inc., 2021
	BASE

9	Differentiable Allophone Graphs for Language-Universal Speech Recognition ...
	Yan, Brian; Dalmia, Siddharth; Mortensen, David R.. - : arXiv, 2021
	BASE

10	Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021 ...
	Maekaku, Takashi; Chang, Xuankai; Fujita, Yuya; Chen, Li-Wei; Watanabe, Shinji; Rudnicky, Alexander. - : arXiv, 2021
	Abstract: We present a system for the Zero Resource Speech Challenge 2021 that combines Contrastive Predictive Coding (CPC) with deep cluster. In deep cluster, we first prepare pseudo-labels by clustering the outputs of a CPC network with k-means. We then train an additional autoregressive model to classify these pseudo-labels in a supervised manner. A phoneme-discriminative representation is obtained by running a second round of clustering on the outputs of the final layer of the autoregressive model. We show that replacing a Transformer layer with a Conformer layer leads to a further gain in a lexical metric. Experimental results show relative improvements of 35% in a phonetic metric, 1.5% in the lexical metric, and 2.3% in a syntactic metric over a CPC-small baseline trained on LibriSpeech 460h data. We achieve top results in this challenge with the syntactic metric. ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.2107.05899 https://arxiv.org/abs/2107.05899
	BASE
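	Illustrative sketch (not the authors' code): the two-round clustering pipeline described in the abstract can be mimicked on toy data as below. Everything here is an assumption made for illustration: the 2-D points stand in for CPC outputs, a nearest-class-mean classifier stands in for the autoregressive model, its negative squared distances stand in for final-layer outputs, and all function names are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))
            for p in points]


def kmeans(points, k, iters=10):
    """Plain Lloyd's k-means; returns (centroids, labels)."""
    # Farthest-point initialisation: a choice made here only so the
    # sketch is deterministic, not something stated in the abstract.
    centroids = [list(points[0])]
    while len(centroids) < k:
        far = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(far))
    for _ in range(iters):
        labels = assign(points, centroids)
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, assign(points, centroids)


def fit_classifier(features, pseudo_labels, k):
    """Stand-in for the supervised model: one mean vector per pseudo-label.

    Assumes every pseudo-label has at least one member.
    """
    means = []
    for c in range(k):
        members = [f for f, lab in zip(features, pseudo_labels) if lab == c]
        means.append([sum(col) / len(members) for col in zip(*members)])
    return means


def final_layer(means, feature):
    """Stand-in for final-layer outputs: negative squared distance per class."""
    return [-sq_dist(feature, m) for m in means]


def two_round_clustering(features, k):
    # Round 1: k-means on the (stand-in) CPC outputs gives pseudo-labels.
    _, pseudo = kmeans(features, k)
    # Supervised step: fit the stand-in classifier on the pseudo-labels.
    means = fit_classifier(features, pseudo, k)
    # Round 2: cluster the classifier's outputs to get the final units.
    outputs = [final_layer(means, f) for f in features]
    _, units = kmeans(outputs, k)
    return pseudo, units


def toy_features(n_per_blob=20, seed=0):
    """Two well-separated 2-D Gaussian blobs (hypothetical data)."""
    rng = random.Random(seed)
    blobs = [(0.0, 0.0), (10.0, 10.0)]
    return [[rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)]
            for cx, cy in blobs for _ in range(n_per_blob)]


if __name__ == "__main__":
    pseudo, units = two_round_clustering(toy_features(), k=2)
    print("round-1 pseudo-labels:", pseudo)
    print("round-2 units:        ", units)
```

	On this toy data both rounds should recover the same two groups; the sketch only shows the order of the steps (cluster, train on pseudo-labels, re-cluster on model outputs), not the gains reported in the abstract.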

11	CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
	Watanabe, Shinji; Mandel, Michael; Barker, Jon...
	In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments ; https://hal.inria.fr/hal-02546993 ; CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020)
	BASE

12	Learning Speaker Embedding from Text-to-Speech ...
	Cho, Jaejin; Zelasko, Piotr; Villalba, Jesus. - : arXiv, 2020
	BASE

13	Massively Multilingual Adversarial Speech Recognition ...
	Adams, Oliver; Wiesner, Matthew; Watanabe, Shinji. - : arXiv, 2019
	BASE

14	A Comparative Study on Transformer vs RNN in Speech Applications ...
	Karita, Shigeki; Chen, Nanxin; Hayashi, Tomoki. - : arXiv, 2019
	BASE

15	Multilingual End-to-End Speech Translation ...
	Inaguma, Hirofumi; Duh, Kevin; Kawahara, Tatsuya. - : arXiv, 2019
	BASE

16	Towards Online End-to-end Transformer Automatic Speech Recognition ...
	Tsunoo, Emiru; Kashiwagi, Yosuke; Kumakura, Toshiyuki. - : arXiv, 2019
	BASE

17	Transformer ASR with Contextual Block Processing ...
	Tsunoo, Emiru; Kashiwagi, Yosuke; Kumakura, Toshiyuki. - : arXiv, 2019
	BASE

18	The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
	Barker, Jon; Watanabe, Shinji; Vincent, Emmanuel...
	In: Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association ; https://hal.inria.fr/hal-01744021 ; Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India (2018)
	BASE

19	Analysis of Multilingual Sequence-to-Sequence speech recognition systems ...
	Karafiát, Martin; Baskar, Murali Karthick; Watanabe, Shinji. - : arXiv, 2018
	BASE

20	Language model integration based on memory control for sequence to sequence speech recognition ...
	Cho, Jaejin; Watanabe, Shinji; Hori, Takaaki. - : arXiv, 2018
	BASE
