Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 66

1	Emotion Intensity and its Control for Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
	BASE
	Show details

2	Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
	Wagner, Johannes; Triantafyllopoulos, Andreas; Wierstorf, Hagen; Schmitt, Maximilian; Burkhardt, Felix; Eyben, Florian; Schuller, Björn W.. - : arXiv, 2022
	Abstract: Recent advances in transformer-based architectures which are pre-trained in self-supervised manner have shown great promise in several machine learning tasks. In the audio domain, such architectures have also been successfully utilised in the field of speech emotion recognition (SER). However, existing works have not evaluated the influence of model size and pre-training data on downstream performance, and have shown limited attention to generalisation, robustness, fairness, and efficiency. The present contribution conducts a thorough analysis of these aspects on several pre-trained variants of wav2vec 2.0 and HuBERT that we fine-tuned on the dimensions arousal, dominance, and valence of MSP-Podcast, while additionally using IEMOCAP and MOSI to test cross-corpus generalisation. To the best of our knowledge, we obtain the top performance for valence prediction without use of explicit linguistic information, with a concordance correlation coefficient (CCC) of .638 on MSP-Podcast. Furthermore, our ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Machine Learning cs.LG; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.2203.07378 https://arxiv.org/abs/2203.07378
	BASE
	Hide details

3	Probing Speech Emotion Recognition Transformers for Linguistic Knowledge ...
	Triantafyllopoulos, Andreas; Wagner, Johannes; Wierstorf, Hagen. - : arXiv, 2022
	BASE
	Show details

4	An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation ...
	He, Xiangheng; Chen, Junjie; Rizos, Georgios. - : arXiv, 2021
	BASE
	Show details

5	The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates ...
	Schuller, Björn W.; Batliner, Anton; Bergler, Christian. - : arXiv, 2021
	BASE
	Show details

6	On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era ...
	Amiriparian, Shahin; Sokolov, Artem; Aslan, Ilhan. - : arXiv, 2021
	BASE
	Show details

7	Multistage linguistic conditioning of convolutional layers for speech emotion recognition ...
	Triantafyllopoulos, Andreas; Reichel, Uwe; Liu, Shuo. - : arXiv, 2021
	BASE
	Show details

8	A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition
	Cummins, Nicholas; Pan, Yilin; Ren, Zhao...
	In: http://infoscience.epfl.ch/record/284990 (2021)
	BASE
	Show details

9	The voice of COVID-19: Acoustic correlates of infection in sustained vowels
	Bartl-Pokorny, Katrin D.; Pokorny, Florian B.; Batliner, Anton...
	In: J Acoust Soc Am (2021)
	BASE
	Show details

10	COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis
	Schuller, Björn W.; Schuller, Dagmar M.; Qian, Kun...
	In: Front Digit Health (2021)
	BASE
	Show details

11	AI-based Human Audio Processing for COVID-19: A Comprehensive Overview
	Deshpande, Gauri; Batliner, Anton; Schuller, Björn W.
	In: Pattern Recognit (2021)
	BASE
	Show details

12	Face Mask Recognition from Audio: The MASC Database and an Overview on the Mask Challenge
	Mohamed, Mostafa M.; Nessiem, Mina A.; Batliner, Anton...
	In: Pattern Recognit (2021)
	BASE
	Show details

13	Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview ...
	Deshpande, Gauri; Schuller, Björn W.. - : arXiv, 2020
	BASE
	Show details

14	Speaker trait characterization in web videos: Uniting speech, language, and facial features
	Weninger, Felix; Wagner, Claudia; Wöllmer, Martin...
	In: Proceedings of the 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 3647-3651 ; International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 38 (2020)
	BASE
	Show details

15	On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
	Eyben, Florian; Wöllmer, Martin; Graves, Alex. - 2020
	BASE
	Show details

16	Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
	Schuller, Björn; Weninger, Felix. - 2020
	BASE
	Show details

17	A comparison of acoustic and linguistics methodologies for Alzheimer's dementia recognition
	Cummins, Nicholas; Pan, Yilin; Ren, Zhao. - 2020
	BASE
	Show details

18	"The Godfather" vs. "Chaos": comparing linguistic analysis based on on-line knowledge sources and Bags-of-N-Grams for movie review valence estimation
	Schuller, Björn; Schenk, Joachim; Rigoll, Gerhard. - 2020
	BASE
	Show details

19	Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
	Schuller, Björn; Müller, Ronald; Lang, Manfred. - 2020
	BASE
	Show details

20	On the influence of phonetic content variation for acoustic emotion recognition
	Vlasenko, Bogdan; Schuller, Björn; Wendemuth, Andreas. - 2020
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern