
Search in the Catalogues and Directories

Hits 1 – 20 of 1,603

1
Learning and controlling the source-filter representation of speech with a variational autoencoder
In: https://hal.archives-ouvertes.fr/hal-03650569 ; 2022 (2022)
Abstract: 17 pages, 4 figures, companion website: https://samsad35.github.io/site-sfvae/ ; Understanding and controlling latent representations in deep generative models is a challenging yet important problem for analyzing, transforming and generating various types of data. In speech processing, inspired by the anatomical mechanisms of phonation, the source-filter model considers that speech signals are produced from a few independent and physically meaningful continuous latent factors, among which the fundamental frequency f0 and the formants are of primary importance. In this work, we show that the source-filter model of speech production naturally arises in the latent space of a variational autoencoder (VAE) trained in an unsupervised manner on a dataset of natural speech signals. Using only a few seconds of labeled speech signals generated with an artificial speech synthesizer, we experimentally illustrate that f0 and the formant frequencies are encoded in orthogonal subspaces of the VAE latent space, and we develop a weakly-supervised method to accurately and independently control these speech factors of variation within the learned latent subspaces. Without requiring additional information such as text or human-labeled data, this results in a deep generative model of speech spectrograms that is conditioned on f0 and the formant frequencies, and which is applied to the transformation of speech signals.
Keyword: [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; Deep generative models; Representation learning; Source-filter model; Variational autoencoder
URL: https://hal.archives-ouvertes.fr/hal-03650569
https://hal.archives-ouvertes.fr/hal-03650569/document
https://hal.archives-ouvertes.fr/hal-03650569/file/sadok2022learning.pdf
BASE
2
Genetic Neural Architecture Search for automatic assessment of human sperm images
In: ISSN: 0957-4174 ; Expert Systems with Applications ; https://hal.archives-ouvertes.fr/hal-03585035 ; Expert Systems with Applications, Elsevier, 2022 (2022)
BASE
3
Unsupervised quantification of entity consistency between photos and text in real-world news ...
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
BASE
4
Multi language Email Classification Using Transfer learning
BASE
5
Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse ...
BASE
6
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach ...
BASE
7
Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian
In: Electronics; Volume 11; Issue 3; Pages: 374 (2022)
BASE
8
COVID-19 Vaccination-Related Sentiments Analysis: A Case Study Using Worldwide Twitter Dataset
In: Healthcare; Volume 10; Issue 3; Pages: 411 (2022)
BASE
9
A Novel Pathological Voice Identification Technique through Simulated Cochlear Implant Processing Systems
In: Applied Sciences; Volume 12; Issue 5; Pages: 2398 (2022)
BASE
10
Considering Commonsense in Solving QA: Reading Comprehension with Semantic Search and Continual Learning
In: Applied Sciences; Volume 12; Issue 9; Pages: 4099 (2022)
BASE
11
Multimodal Lip-Reading for Tracheostomy Patients in the Greek Language
In: Computers; Volume 11; Issue 3; Pages: 34 (2022)
BASE
12
Identifying Learners’ Interaction Patterns in an Online Learning Community
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 4; Pages: 2245 (2022)
BASE
13
Analysis of the Full-Size Russian Corpus of Internet Drug Reviews with Complex NER Labeling Using Deep Learning Neural Networks and Language Models
In: Applied Sciences; Volume 12; Issue 1; Pages: 491 (2022)
BASE
14
An Evolution Gaining Momentum—The Growing Role of Artificial Intelligence in the Diagnosis and Treatment of Spinal Diseases
In: Diagnostics; Volume 12; Issue 4; Pages: 836 (2022)
BASE
15
Artificial Intelligence in Digestive Endoscopy—Where Are We and Where Are We Going?
In: Diagnostics; Volume 12; Issue 4; Pages: 927 (2022)
BASE
16
Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model
In: Applied Sciences; Volume 12; Issue 7; Pages: 3338 (2022)
BASE
17
Transformer-Based Abstractive Summarization for Reddit and Twitter: Single Posts vs. Comment Pools in Three Languages
In: Future Internet; Volume 14; Issue 3; Pages: 69 (2022)
BASE
18
The Sustainable Development of Intangible Cultural Heritage with AI: Cantonese Opera Singing Genre Classification Based on CoGCNet Model in China
In: Sustainability; Volume 14; Issue 5; Pages: 2923 (2022)
BASE
19
Artificial Intelligence and Machine Learning in the Diagnosis and Management of Gastroenteropancreatic Neuroendocrine Neoplasms—A Scoping Review
In: Diagnostics; Volume 12; Issue 4; Pages: 874 (2022)
BASE
20
Automatic Classification Framework of Tongue Feature Based on Convolutional Neural Networks
In: Micromachines; Volume 13; Issue 4; Pages: 501 (2022)
BASE


Catalogues: 2
Bibliographies: 3
Linked Open Data catalogues: 0
Online resources: 0
Open access documents: 1,598
© 2013 - 2024 Lin|gu|is|tik