Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...14

Hits 1 – 20 of 276

1	A comparative study of several parameterizations for speaker recognition ...
	Faundez-Zanuy, Marcos. - : arXiv, 2022
	Abstract: This paper presents an exhaustive study about the robustness of several parameterizations, in speaker verification and identification tasks. We have studied several mismatch conditions: different recording sessions, microphones, and different languages (it has been obtained from a bilingual set of speakers). This study reveals that the combination of several parameterizations can improve the robustness in all the scenarios for both tasks, identification and verification. In addition, two different methods have been evaluated: vector quantization, and covariance matrices with an arithmetic-harmonic sphericity measure. ... : 4 pages ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Machine Learning cs.LG; Sound cs.SD
	URL: https://arxiv.org/abs/2203.00513 https://dx.doi.org/10.48550/arxiv.2203.00513
	BASE
	Hide details

2	Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition ...
	Lee, Hung-Shin; Tsao, Yu; Jeng, Shyh-Kang. - : arXiv, 2022
	BASE
	Show details

3	Learning and controlling the source-filter representation of speech with a variational autoencoder ...
	Sadok, Samir; Leglaive, Simon; Girin, Laurent. - : arXiv, 2022
	BASE
	Show details

4	Correcting Misproducted Speech using Spectrogram Inpainting ...
	Ben-Simon, Talia; Kreuk, Felix; Awwad, Faten. - : arXiv, 2022
	BASE
	Show details

5	The 2021 NIST Speaker Recognition Evaluation ...
	Sadjadi, Seyed Omid; Greenberg, Craig; Singer, Elliot. - : arXiv, 2022
	BASE
	Show details

6	Multilingual and Multimodal Abuse Detection ...
	Sharon, Rini; Shah, Heet; Mukherjee, Debdoot. - : arXiv, 2022
	BASE
	Show details

7	WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ...
	Siuzdak, Hubert; Dura, Piotr; van Rijn, Pol. - : arXiv, 2022
	BASE
	Show details

8	CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
	Chen, Chengxin; Zhang, Pengyuan. - : arXiv, 2022
	BASE
	Show details

9	Fine-grained Noise Control for Multispeaker Speech Synthesis ...
	Nikitaras, Karolos; Vamvoukakis, Georgios; Ellinas, Nikolaos. - : arXiv, 2022
	BASE
	Show details

10	Emotion Intensity and its Control for Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
	BASE
	Show details

11	Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
	Wagner, Johannes; Triantafyllopoulos, Andreas; Wierstorf, Hagen. - : arXiv, 2022
	BASE
	Show details

12	Classifying Autism from Crowdsourced Semi-Structured Speech Recordings: A Machine Learning Approach ...
	Chi, Nathan A.; Washington, Peter; Kline, Aaron. - : arXiv, 2022
	BASE
	Show details

13	Common Phone: A Multilingual Dataset for Robust Acoustic Modelling ...
	Klumpp, Philipp; Arias-Vergara, Tomás; Pérez-Toro, Paula Andrea. - : arXiv, 2022
	BASE
	Show details

14	Low-dimensional representation of infant and adult vocalization acoustics ...
	Pagliarini, Silvia; Schneider, Sara; Kello, Christopher T.. - : arXiv, 2022
	BASE
	Show details

15	Chain-based Discriminative Autoencoders for Speech Recognition ...
	Lee, Hung-Shin; Huang, Pin-Tuan; Cheng, Yao-Fei. - : arXiv, 2022
	BASE
	Show details

16	Speech segmentation using multilevel hybrid filters ...
	Faundez-Zanuy, Marcos; Vallverdu-Bayes, Francesc. - : arXiv, 2022
	BASE
	Show details

17	On the relevance of language in speaker recognition ...
	Satue-Villar, Antonio; Faundez-Zanuy, Marcos. - : arXiv, 2022
	BASE
	Show details

18	Unsupervised word-level prosody tagging for controllable speech synthesis ...
	Guo, Yiwei; Du, Chenpeng; Yu, Kai. - : arXiv, 2022
	BASE
	Show details

19	Filter-based Discriminative Autoencoders for Children Speech Recognition ...
	Tai, Chiang-Lin; Lee, Hung-Shin; Tsao, Yu. - : arXiv, 2022
	BASE
	Show details

20	Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling ...
	Peng, Puyuan; Harwath, David. - : arXiv, 2022
	BASE
	Show details

Page: 1 2 3 4 5...14

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern