Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 26

1	Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training ...
	Yang, J.; He, Lei. - : arXiv, 2022
	BASE
	Show details

2	BEA-Base: A Benchmark for ASR of Spontaneous Hungarian ...
	Mihajlik, P.; Balog, A.; Gráczi, T. E.. - : arXiv, 2022
	BASE
	Show details

3	Improving the fusion of acoustic and text representations in RNN-T ...
	Zhang, Chao; Li, Bo; Lu, Zhiyun. - : arXiv, 2022
	BASE
	Show details

4	Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model ...
	Shen, Ying; Yang, Huiyu; Lin, Lin. - : arXiv, 2022
	BASE
	Show details

5	Separate What You Describe: Language-Queried Audio Source Separation ...
	Liu, Xubo; Liu, Haohe; Kong, Qiuqiang. - : arXiv, 2022
	BASE
	Show details

6	Chain-based Discriminative Autoencoders for Speech Recognition ...
	Lee, Hung-Shin; Huang, Pin-Tuan; Cheng, Yao-Fei. - : arXiv, 2022
	BASE
	Show details

7	Unsupervised word-level prosody tagging for controllable speech synthesis ...
	Guo, Yiwei; Du, Chenpeng; Yu, Kai. - : arXiv, 2022
	BASE
	Show details

8	Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales ...
	Andreas, Jacob; Beguš, Gašper; Bronstein, Michael M.. - : arXiv, 2021
	BASE
	Show details

9	Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms ...
	Lo, Tien-Hong; Sung, Yao-Ting; Chen, Berlin. - : arXiv, 2021
	BASE
	Show details

10	An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation ...
	He, Xiangheng; Chen, Junjie; Rizos, Georgios. - : arXiv, 2021
	BASE
	Show details

11	NVC-Net: End-to-End Adversarial Voice Conversion ...
	Nguyen, Bac; Cardinaux, Fabien. - : arXiv, 2021
	BASE
	Show details

12	Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech ...
	Wang, Pengwei; Ye, Xin; Zhou, Xiaohuan. - : arXiv, 2021
	BASE
	Show details

13	NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition ...
	Sadjadi, Seyed Omid. - : arXiv, 2021
	BASE
	Show details

14	Interpreting intermediate convolutional layers of CNNs trained on raw speech ...
	Beguš, Gašper; Zhou, Alan. - : arXiv, 2021
	BASE
	Show details

15	LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading ...
	Qu, Leyuan; Weber, Cornelius; Wermter, Stefan. - : arXiv, 2021
	BASE
	Show details

16	SERAB: A multi-lingual benchmark for speech emotion recognition ...
	Scheidwasser-Clow, Neil; Kegler, Mikolaj; Beckmann, Pierre. - : arXiv, 2021
	BASE
	Show details

17	Detecting Emotion Carriers by Combining Acoustic and Lexical Representations ...
	Bayerl, Sebastian P.; Tammewar, Aniruddha; Riedhammer, Korbinian. - : arXiv, 2021
	BASE
	Show details

18	Textless Speech Emotion Conversion using Discrete and Decomposed Representations ...
	Kreuk, Felix; Polyak, Adam; Copet, Jade. - : arXiv, 2021
	BASE
	Show details

19	Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? -- A computational investigation ...
	Khorrami, Khazar; Räsänen, Okko. - : arXiv, 2021
	BASE
	Show details

20	Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Zhang, Mingyang. - : arXiv, 2020
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern