DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...14
Hits 1 – 20 of 276

1
A comparative study of several parameterizations for speaker recognition ...
Faundez-Zanuy, Marcos. - : arXiv, 2022
Abstract: This paper presents an exhaustive study about the robustness of several parameterizations, in speaker verification and identification tasks. We have studied several mismatch conditions: different recording sessions, microphones, and different languages (it has been obtained from a bilingual set of speakers). This study reveals that the combination of several parameterizations can improve the robustness in all the scenarios for both tasks, identification and verification. In addition, two different methods have been evaluated: vector quantization, and covariance matrices with an arithmetic-harmonic sphericity measure. ... : 4 pages ...
Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Machine Learning cs.LG; Sound cs.SD
URL: https://arxiv.org/abs/2203.00513
https://dx.doi.org/10.48550/arxiv.2203.00513
BASE
Hide details
2
Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition ...
BASE
Show details
3
Learning and controlling the source-filter representation of speech with a variational autoencoder ...
BASE
Show details
4
Correcting Misproducted Speech using Spectrogram Inpainting ...
BASE
Show details
5
The 2021 NIST Speaker Recognition Evaluation ...
BASE
Show details
6
Multilingual and Multimodal Abuse Detection ...
BASE
Show details
7
WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ...
BASE
Show details
8
CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
Chen, Chengxin; Zhang, Pengyuan. - : arXiv, 2022
BASE
Show details
9
Fine-grained Noise Control for Multispeaker Speech Synthesis ...
BASE
Show details
10
Emotion Intensity and its Control for Emotional Voice Conversion ...
Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
BASE
Show details
11
Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
BASE
Show details
12
Classifying Autism from Crowdsourced Semi-Structured Speech Recordings: A Machine Learning Approach ...
BASE
Show details
13
Common Phone: A Multilingual Dataset for Robust Acoustic Modelling ...
BASE
Show details
14
Low-dimensional representation of infant and adult vocalization acoustics ...
BASE
Show details
15
Chain-based Discriminative Autoencoders for Speech Recognition ...
BASE
Show details
16
Speech segmentation using multilevel hybrid filters ...
BASE
Show details
17
On the relevance of language in speaker recognition ...
BASE
Show details
18
Unsupervised word-level prosody tagging for controllable speech synthesis ...
Guo, Yiwei; Du, Chenpeng; Yu, Kai. - : arXiv, 2022
BASE
Show details
19
Filter-based Discriminative Autoencoders for Children Speech Recognition ...
BASE
Show details
20
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling ...
Peng, Puyuan; Harwath, David. - : arXiv, 2022
BASE
Show details

Page: 1 2 3 4 5...14

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
276
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern