Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2022 (134)
  - 2021 (305)
  - 2020 (305)
  - 2019 (164)
  - 2018 (136)
  - 2017 (97)
  - 2016 (48)
  - 2015 (59)
  - 2014 (57)
  - 2013 (34)
  - more
- Medium
- Type
- BLLDB-Access

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6 7 8...91

Hits 61 – 80 of 1.811

61	CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
	Chen, Chengxin; Zhang, Pengyuan. - : arXiv, 2022
	BASE
	Show details

62	Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model ...
	Shen, Ying; Yang, Huiyu; Lin, Lin. - : arXiv, 2022
	BASE
	Show details

63	Fine-grained Noise Control for Multispeaker Speech Synthesis ...
	Nikitaras, Karolos; Vamvoukakis, Georgios; Ellinas, Nikolaos. - : arXiv, 2022
	BASE
	Show details

64	Emotion Intensity and its Control for Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
	BASE
	Show details

65	Automatic Speech recognition for Speech Assessment of Preschool Children ...
	Abaskohi, Amirhossein; Mortazavi, Fatemeh; Moradi, Hadi. - : arXiv, 2022
	BASE
	Show details

66	The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge ...
	Chen, Ziyi; Hua, Hua; Zhang, Yuxiang. - : arXiv, 2022
	BASE
	Show details

67	Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
	Wagner, Johannes; Triantafyllopoulos, Andreas; Wierstorf, Hagen. - : arXiv, 2022
	BASE
	Show details

68	Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents ...
	Dubey, Priyank; Shah, Bilal. - : arXiv, 2022
	BASE
	Show details

69	KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics ...
	Mussakhojayeva, Saida; Khassanov, Yerbolat; Varol, Huseyin Atakan. - : arXiv, 2022
	BASE
	Show details

70	Automated speech tools for helping communities process restricted-access corpora for language revival efforts ...
	San, Nay; Bartelds, Martijn; Ògúnrèmí, Tolúlopé. - : arXiv, 2022
	BASE
	Show details

71	Classifying Autism from Crowdsourced Semi-Structured Speech Recordings: A Machine Learning Approach ...
	Chi, Nathan A.; Washington, Peter; Kline, Aaron. - : arXiv, 2022
	BASE
	Show details

72	Learning English with Peppa Pig ...
	Nikolaus, Mitja; Alishahi, Afra; Chrupała, Grzegorz. - : arXiv, 2022
	BASE
	Show details

73	Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models ...
	Miao, Xiaoxiao; Wang, Xin; Cooper, Erica. - : arXiv, 2022
	BASE
	Show details

74	Separate What You Describe: Language-Queried Audio Source Separation ...
	Liu, Xubo; Liu, Haohe; Kong, Qiuqiang. - : arXiv, 2022
	BASE
	Show details

75	A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition ...
	Du, Ye-Qian; Zhang, Jie; Zhu, Qiu-Shi. - : arXiv, 2022
	BASE
	Show details

76	DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning ...
	Saeki, Takaaki; Tachibana, Kentaro; Yamamoto, Ryuichi. - : arXiv, 2022
	BASE
	Show details

77	Arabic Text-To-Speech (TTS) Data Preparation ...
	Masri, Hala Al; Za'ter, Muhy Eddin. - : arXiv, 2022
	Abstract: People may be puzzled by the fact that voice over recordings data sets exist in addition to Text-to-Speech (TTS), Synthesis system advancements, albeit this is not the case. The goal of this study is to explain the relevance of TTS as well as the data preparation procedures. TTS relies heavily on recorded data since it can have a substantial influence on the outcomes of TTS modules. Furthermore, whether the domain is specialized or general, appropriate data should be developed to address all predicted language variants and domains. Different recording methodologies, taking into account quality and behavior, may also be advantageous in the development of the module. In light of the lack of Arabic language in present synthesizing systems, numerous variables that impact the flow of recorded utterances are being considered in order to manipulate an Arabic TTS module. In this study, two viewpoints will be discussed: linguistics and the creation of high-quality recordings for TTS. The purpose of this work is to ...
	Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.2204.03255 https://arxiv.org/abs/2204.03255
	BASE
	Hide details

78	Hedy Lamarr and Frequency Hopping ...
	Đurić D., Miloš. - : Zenodo, 2022
	BASE
	Show details

79	Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System ...
	Wang, Zhenyu; Hansen, John H. L.. - : arXiv, 2022
	BASE
	Show details

80	Inferring Pitch from Coarse Spectral Features ...
	Ma, Danni; Ryant, Neville; Liberman, Mark. - : arXiv, 2022
	BASE
	Show details

Page: 1 2 3 4 5 6 7 8...91

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern