
Hits 1 – 20 of 52

1	Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization ...
	Yan, Brian; Zhang, Chunlei; Yu, Meng. - : arXiv, 2021
	BASE

2	Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
	Inaguma, Hirofumi; Kawahara, Tatsuya; Watanabe, Shinji. - : arXiv, 2021
	BASE

3	Self-Guided Curriculum Learning for Neural Machine Translation ...
	Zhou, Lei; Ding, Liang; Duh, Kevin. - : arXiv, 2021
	BASE

4	Arabic Speech Recognition by End-to-End, Modular Systems and Human ...
	Hussein, Amir; Watanabe, Shinji; Ali, Ahmed. - : arXiv, 2021
	BASE

5	Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec ...
	Shi, Jiatong; Amith, Jonathan D.; García, Rey Castillo. - : arXiv, 2021
	BASE

6	On Prosody Modeling for ASR+TTS based Voice Conversion ...
	Huang, Wen-Chin; Hayashi, Tomoki; Li, Xinjian. - : arXiv, 2021
	BASE

7	Leveraging Pre-trained Language Model for Speech Sentiment Analysis ...
	Shon, Suwon; Brusco, Pablo; Pan, Jing. - : arXiv, 2021
	BASE

8	End-to-end ASR to jointly predict transcriptions and linguistic annotations ...
	Fujita, Yuya; Omachi, Motoi. - : Underline Science Inc., 2021 (NAACL 2021)
	BASE

9	Differentiable Allophone Graphs for Language-Universal Speech Recognition ...
	Yan, Brian; Dalmia, Siddharth; Mortensen, David R.; Metze, Florian; Watanabe, Shinji. - : arXiv, 2021
	Abstract: Building language-universal speech recognition systems entails producing phonological units of spoken sound that can be shared across languages. While speech annotations at the language-specific phoneme or surface levels are readily available, annotations at a universal phone level are relatively rare and difficult to produce. In this work, we present a general framework to derive phone-level supervision from only phonemic transcriptions and phone-to-phoneme mappings with learnable weights represented using weighted finite-state transducers, which we call differentiable allophone graphs. By training multilingually, we build a universal phone-based speech recognition model with interpretable probabilistic phone-to-phoneme mappings for each language. These phone-based systems with learned allophone graphs can be used by linguists to document new languages, build phone-based lexicons that capture rich pronunciation variations, and re-evaluate the allophone mappings of seen languages. We demonstrate the ... : INTERSPEECH 2021. Contains additional studies on phone recognition for unseen languages ...
	Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://arxiv.org/abs/2107.11628 https://dx.doi.org/10.48550/arxiv.2107.11628
	BASE

10	Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021 ...
	Maekaku, Takashi; Chang, Xuankai; Fujita, Yuya. - : arXiv, 2021
	BASE

11	CHiME-6 Challenge: Tackling multispeaker speech recognition for unsegmented recordings
	Watanabe, Shinji; Mandel, Michael; Barker, Jon...
	In: CHiME 2020 - 6th International Workshop on Speech Processing in Everyday Environments, May 2020, Barcelona / Virtual, Spain (2020) ; https://hal.inria.fr/hal-02546993
	BASE

12	Learning Speaker Embedding from Text-to-Speech ...
	Cho, Jaejin; Zelasko, Piotr; Villalba, Jesus. - : arXiv, 2020
	BASE

13	Massively Multilingual Adversarial Speech Recognition ...
	Adams, Oliver; Wiesner, Matthew; Watanabe, Shinji. - : arXiv, 2019
	BASE

14	A Comparative Study on Transformer vs RNN in Speech Applications ...
	Karita, Shigeki; Chen, Nanxin; Hayashi, Tomoki. - : arXiv, 2019
	BASE

15	Multilingual End-to-End Speech Translation ...
	Inaguma, Hirofumi; Duh, Kevin; Kawahara, Tatsuya. - : arXiv, 2019
	BASE

16	Towards Online End-to-end Transformer Automatic Speech Recognition ...
	Tsunoo, Emiru; Kashiwagi, Yosuke; Kumakura, Toshiyuki. - : arXiv, 2019
	BASE

17	Transformer ASR with Contextual Block Processing ...
	Tsunoo, Emiru; Kashiwagi, Yosuke; Kumakura, Toshiyuki. - : arXiv, 2019
	BASE

18	The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
	Barker, Jon; Watanabe, Shinji; Vincent, Emmanuel...
	In: Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India (2018) ; https://hal.inria.fr/hal-01744021
	BASE

19	Analysis of Multilingual Sequence-to-Sequence speech recognition systems ...
	Karafiát, Martin; Baskar, Murali Karthick; Watanabe, Shinji. - : arXiv, 2018
	BASE

20	Language model integration based on memory control for sequence to sequence speech recognition ...
	Cho, Jaejin; Watanabe, Shinji; Hori, Takaaki. - : arXiv, 2018
	BASE
