2 |
Chain-based Discriminative Autoencoders for Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Unsupervised word-level prosody tagging for controllable speech synthesis ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Cetacean Translation Initiative: a roadmap to deciphering the communication of sperm whales ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
SERAB: A multi-lingual benchmark for speech emotion recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Learning Efficient Representations for Keyword Spotting with Triplet Loss ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
What BERT Based Language Models Learn in Spoken Transcripts: An Empirical Study ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Textless Speech Emotion Conversion using Discrete and Decomposed Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? -- A computational investigation ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Chance-Constrained Control with Lexicographic Deep Reinforcement Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|