
Hits 41 – 60 of 5,836

41	Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages ...
	Madhavaraj, A.; Ganesan, Ramakrishnan Angarai. - : arXiv, 2022
	BASE

42	Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques ...
	Dinh, Tu Anh; Liu, Danni; Niehues, Jan. - : arXiv, 2022
	BASE

43	AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning ...
	Tang, Huaizhen; Zhang, Xulong; Wang, Jianzong. - : arXiv, 2022
	BASE

44	Multimodal Clustering with Role Induced Constraints for Speaker Diarization ...
	Flemotomos, Nikolaos; Narayanan, Shrikanth. - : arXiv, 2022
	BASE

45	Freeform Body Motion Generation from Speech ...
	Xu, Jing; Zhang, Wei; Bai, Yalong. - : arXiv, 2022
	BASE

46	Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition ...
	Shao, Qijie; Yan, Jinghao; Kang, Jian. - : arXiv, 2022
	BASE

47	Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset ...
	Yu, Tiezheng; Frieske, Rita; Xu, Peng. - : arXiv, 2022
	BASE

48	WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ...
	Siuzdak, Hubert; Dura, Piotr; van Rijn, Pol. - : arXiv, 2022
	BASE

49	Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers ...
	Kubo, Yotaro; Karita, Shigeki; Bacchiani, Michiel. - : arXiv, 2022
	BASE

50	A Character-level Span-based Model for Mandarin Prosodic Structure Prediction ...
	Chen, Xueyuan; Song, Changhe; Zhou, Yixuan. - : arXiv, 2022
	BASE

51	CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
	Chen, Chengxin; Zhang, Pengyuan. - : arXiv, 2022
	BASE

52	Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model ...
	Shen, Ying; Yang, Huiyu; Lin, Lin. - : arXiv, 2022
	BASE

53	Fine-grained Noise Control for Multispeaker Speech Synthesis ...
	Nikitaras, Karolos; Vamvoukakis, Georgios; Ellinas, Nikolaos. - : arXiv, 2022
	BASE

54	Emotion Intensity and its Control for Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
	BASE

55	Automatic Speech recognition for Speech Assessment of Preschool Children ...
	Abaskohi, Amirhossein; Mortazavi, Fatemeh; Moradi, Hadi. - : arXiv, 2022
	BASE

56	The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge ...
	Chen, Ziyi; Hua, Hua; Zhang, Yuxiang. - : arXiv, 2022
	BASE

57	Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
	Wagner, Johannes; Triantafyllopoulos, Andreas; Wierstorf, Hagen. - : arXiv, 2022
	BASE

58	Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents ...
	Dubey, Priyank; Shah, Bilal. - : arXiv, 2022
	BASE

59	KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics ...
	Mussakhojayeva, Saida; Khassanov, Yerbolat; Varol, Huseyin Atakan. - : arXiv, 2022
	BASE

60	Automated speech tools for helping communities process restricted-access corpora for language revival efforts ...
	San, Nay; Bartelds, Martijn; Ògúnrèmí, Tolúlopé. - : arXiv, 2022
	BASE

