1 | A Character-level Span-based Model for Mandarin Prosodic Structure Prediction ...
2 | Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis ...
3 | VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion ...
4 | Advancing Technological Equity in Speech and Language Processing ...
5 | DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion ...
6 | FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation ...
7 | Emotional Voice Conversion With Cycle-consistent Adversarial Network ...
8 | Transferring Source Style in Non-Parallel Voice Conversion ...
9 | Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams ...
10 | Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech ...
11 | Multi-Target Emotional Voice Conversion With Neural Vocoders ...
12 | Audio-visual Recognition of Overlapped speech for the LRS2 dataset ...
14 | Aero-tactile integration in fricatives: converting audio to air flow information for speech perception enhancement
15 | Listen with your skin: Aerotak speech perception enhancement system