Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year
- Medium:
  - Online (43)
  - Print (23)
- Type
- BLLDB-Access:
  - free (66)
  - subject to license (6)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 66

1	Emotion Intensity and its Control for Emotional Voice Conversion ...
	Zhou, Kun; Sisman, Berrak; Rana, Rajib. - : arXiv, 2022
	BASE
	Show details

2	Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
	Wagner, Johannes; Triantafyllopoulos, Andreas; Wierstorf, Hagen. - : arXiv, 2022
	BASE
	Show details

3	Probing Speech Emotion Recognition Transformers for Linguistic Knowledge ...
	Triantafyllopoulos, Andreas; Wagner, Johannes; Wierstorf, Hagen; Schmitt, Maximilian; Reichel, Uwe; Eyben, Florian; Burkhardt, Felix; Schuller, Björn W.. - : arXiv, 2022
	Abstract: Large, pre-trained neural networks consisting of self-attention layers (transformers) have recently achieved state-of-the-art results on several speech emotion recognition (SER) datasets. These models are typically pre-trained in self-supervised manner with the goal to improve automatic speech recognition performance -- and thus, to understand linguistic information. In this work, we investigate the extent in which this information is exploited during SER fine-tuning. Using a reproducible methodology based on open-source tools, we synthesise prosodically neutral speech utterances while varying the sentiment of the text. Valence predictions of the transformer model are very reactive to positive and negative sentiment content, as well as negations, but not to intensifiers or reducers, while none of those linguistic features impact arousal or dominance. These findings show that transformers can successfully leverage linguistic information to improve their valence predictions, and that linguistic analysis should ... : This work has been submitted for publication to Interspeech 2022 ...
	Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
	URL: https://dx.doi.org/10.48550/arxiv.2204.00400 https://arxiv.org/abs/2204.00400
	BASE
	Hide details

4	An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation ...
	He, Xiangheng; Chen, Junjie; Rizos, Georgios. - : arXiv, 2021
	BASE
	Show details

5	The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates ...
	Schuller, Björn W.; Batliner, Anton; Bergler, Christian. - : arXiv, 2021
	BASE
	Show details

6	On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era ...
	Amiriparian, Shahin; Sokolov, Artem; Aslan, Ilhan. - : arXiv, 2021
	BASE
	Show details

7	Multistage linguistic conditioning of convolutional layers for speech emotion recognition ...
	Triantafyllopoulos, Andreas; Reichel, Uwe; Liu, Shuo. - : arXiv, 2021
	BASE
	Show details

8	A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition
	Cummins, Nicholas; Pan, Yilin; Ren, Zhao...
	In: http://infoscience.epfl.ch/record/284990 (2021)
	BASE
	Show details

9	The voice of COVID-19: Acoustic correlates of infection in sustained vowels
	Bartl-Pokorny, Katrin D.; Pokorny, Florian B.; Batliner, Anton...
	In: J Acoust Soc Am (2021)
	BASE
	Show details

10	COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis
	Schuller, Björn W.; Schuller, Dagmar M.; Qian, Kun...
	In: Front Digit Health (2021)
	BASE
	Show details

11	AI-based Human Audio Processing for COVID-19: A Comprehensive Overview
	Deshpande, Gauri; Batliner, Anton; Schuller, Björn W.
	In: Pattern Recognit (2021)
	BASE
	Show details

12	Face Mask Recognition from Audio: The MASC Database and an Overview on the Mask Challenge
	Mohamed, Mostafa M.; Nessiem, Mina A.; Batliner, Anton...
	In: Pattern Recognit (2021)
	BASE
	Show details

13	Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview ...
	Deshpande, Gauri; Schuller, Björn W.. - : arXiv, 2020
	BASE
	Show details

14	Speaker trait characterization in web videos: Uniting speech, language, and facial features
	Weninger, Felix; Wagner, Claudia; Wöllmer, Martin...
	In: Proceedings of the 38th International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 3647-3651 ; International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013) ; 38 (2020)
	BASE
	Show details

15	On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
	Eyben, Florian; Wöllmer, Martin; Graves, Alex. - 2020
	BASE
	Show details

16	Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
	Schuller, Björn; Weninger, Felix. - 2020
	BASE
	Show details

17	A comparison of acoustic and linguistics methodologies for Alzheimer's dementia recognition
	Cummins, Nicholas; Pan, Yilin; Ren, Zhao. - 2020
	BASE
	Show details

18	"The Godfather" vs. "Chaos": comparing linguistic analysis based on on-line knowledge sources and Bags-of-N-Grams for movie review valence estimation
	Schuller, Björn; Schenk, Joachim; Rigoll, Gerhard. - 2020
	BASE
	Show details

19	Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
	Schuller, Björn; Müller, Ronald; Lang, Manfred. - 2020
	BASE
	Show details

20	On the influence of phonetic content variation for acoustic emotion recognition
	Vlasenko, Bogdan; Schuller, Björn; Wendemuth, Andreas. - 2020
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern