Hits 1 – 20 of 25

1	Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models ...
	Tseng, Liang-Hsuan; Fu, Yu-Kuan; Chang, Heng-Jui. - : arXiv, 2021

2	Improving Cross-Lingual Reading Comprehension with Self-Training ...
	Huang, Wei-Cheng; Huang, Chien-yu; Lee, Hung-yi. - : arXiv, 2021

3	Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Chang, Chih-Chiang; Chuang, Yung-Sung. - : Underline Science Inc., 2021

4	S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations ...
	Lin, Jheng-hao; Lin, Yist Y.; Chien, Chung-Ming. - : arXiv, 2021

5	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	Chuang, Yung-Sung; Gao, Mingye; Luo, Hongyin. - : arXiv, 2021

6	Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Chen, Yun-Nung; Chuang, Yung-Sung. - : Underline Science Inc., 2021

7	Looking for Clues of Language in Multilingual BERT to Improve Cross-lingual Generalization ...
	Liu, Chi-Liang; Hsu, Tsung-Yuan; Chuang, Yung-Sung. - : arXiv, 2020

8	DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation ...
	Chen, Yi-Chen; Hsu, Jui-Yang; Lee, Cheng-Kuang. - : arXiv, 2020

9	What makes multilingual BERT multilingual? ...
	Liu, Chi-Liang; Hsu, Tsung-Yuan; Chuang, Yung-Sung. - : arXiv, 2020

10	Pretrained Language Model Embryology: The Birth of ALBERT ...
	Chiang, Cheng-Han; Huang, Sung-Feng; Lee, Hung-yi. - : arXiv, 2020

11	AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization ...
	Chen, Yen-Hao; Wu, Da-Yi; Wu, Tsung-Han; Lee, Hung-yi. - : arXiv, 2020
	Abstract: Recently, voice conversion (VC) has been widely studied. Many VC systems use disentangle-based learning techniques to separate the speaker and the linguistic content information from a speech signal. Subsequently, they convert the voice by changing the speaker information to that of the target speaker. To prevent the speaker information from leaking into the content embeddings, previous works either reduce the dimension or quantize the content embedding as a strong information bottleneck. These mechanisms somehow hurt the synthesis quality. In this work, we propose AGAIN-VC, an innovative VC system using Activation Guidance and Adaptive Instance Normalization. AGAIN-VC is an auto-encoder-based model, comprising of a single encoder and a decoder. With a proper activation as an information bottleneck on content embeddings, the trade-off between the synthesis quality and the speaker similarity of the converted speech is improved drastically. This one-shot VC system obtains the best performance regardless of the ... : Submitted to ICASSP 2021 ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.2011.00316 https://arxiv.org/abs/2011.00316

12	Defending Your Voice: Adversarial Attack on Voice Conversion ...
	Huang, Chien-yu; Lin, Yist Y.; Lee, Hung-yi. - : arXiv, 2020

13	FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention ...
	Lin, Yist Y.; Chien, Chung-Ming; Lin, Jheng-Hao. - : arXiv, 2020

14	Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model ...
	Hsu, Tsung-yuan; Liu, Chi-liang; Lee, Hung-yi. - : arXiv, 2019

15	Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning ...
	Liu, Alexander H.; Tu, Tao; Lee, Hung-yi. - : arXiv, 2019

16	From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings ...
	Chen, Yi-Chen; Huang, Sung-Feng; Lee, Hung-yi. - : arXiv, 2019

17	Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations ...
	Chou, Ju-chieh; Yeh, Cheng-chieh; Lee, Hung-yi. - : arXiv, 2018

18	Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data ...
	Chen, Yi-Chen; Shen, Chia-Hao; Huang, Sung-Feng. - : arXiv, 2018

19	Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection ...
	Wang, Yu-Hsuan; Lee, Hung-yi; Lee, Lin-shan. - : arXiv, 2018

20	Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval ...
	Chen, Yi-Chen; Huang, Sung-Feng; Shen, Chia-Hao. - : arXiv, 2018
