DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4
Hits 1 – 20 of 66

1
Emotion Intensity and its Control for Emotional Voice Conversion ...
Abstract: Emotional voice conversion (EVC) seeks to convert the emotional state of an utterance while preserving the linguistic content and speaker identity. In EVC, emotions are usually treated as discrete categories overlooking the fact that speech also conveys emotions with various intensity levels that the listener can perceive. In this paper, we aim to explicitly characterize and control the intensity of emotion. We propose to disentangle the speaker style from linguistic content and encode the speaker style into a style embedding in a continuous space that forms the prototype of emotion embedding. We further learn the actual emotion encoder from an emotion-labelled database and study the use of relative attributes to represent fine-grained emotion intensity. To ensure emotional intelligibility, we incorporate emotion classification loss and emotion embedding similarity loss into the training of the EVC network. As desired, the proposed network controls the fine-grained emotion intensity in the output speech. ... : Submitted to IEEE Transactions on Affective Computing ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Machine Learning cs.LG; Sound cs.SD
URL: https://arxiv.org/abs/2201.03967
https://dx.doi.org/10.48550/arxiv.2201.03967
BASE
Hide details
2
Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
BASE
Show details
3
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge ...
BASE
Show details
4
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation ...
BASE
Show details
5
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates ...
BASE
Show details
6
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era ...
BASE
Show details
7
Multistage linguistic conditioning of convolutional layers for speech emotion recognition ...
BASE
Show details
8
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition
In: http://infoscience.epfl.ch/record/284990 (2021)
BASE
Show details
9
The voice of COVID-19: Acoustic correlates of infection in sustained vowels
In: J Acoust Soc Am (2021)
BASE
Show details
10
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis
In: Front Digit Health (2021)
BASE
Show details
11
AI-based Human Audio Processing for COVID-19: A Comprehensive Overview
In: Pattern Recognit (2021)
BASE
Show details
12
Face Mask Recognition from Audio: The MASC Database and an Overview on the Mask Challenge
In: Pattern Recognit (2021)
BASE
Show details
13
Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview ...
BASE
Show details
14
Speaker trait characterization in web videos: Uniting speech, language, and facial features
In: Proceedings of the 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 3647-3651 ; International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 38 (2020)
BASE
Show details
15
On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
BASE
Show details
16
Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
BASE
Show details
17
A comparison of acoustic and linguistics methodologies for Alzheimer's dementia recognition
BASE
Show details
18
"The Godfather" vs. "Chaos": comparing linguistic analysis based on on-line knowledge sources and Bags-of-N-Grams for movie review valence estimation
BASE
Show details
19
Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
BASE
Show details
20
On the influence of phonetic content variation for acoustic emotion recognition
BASE
Show details

Page: 1 2 3 4

Catalogues
2
0
13
0
6
0
0
Bibliographies
8
0
0
0
0
0
0
0
1
Linked Open Data catalogues
0
Online resources
1
0
0
0
Open access documents
42
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern