21 | Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition ... | BASE
22 | Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training ... | BASE
23 | Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition ... | BASE
24 | VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge ... | BASE
25 | Code-Switching Text Augmentation for Multilingual Speech Processing ... | BASE
28 | Unsupervised Data Selection via Discrete Speech Representation for ASR ... | BASE
29 | CVSS Corpus and Massively Multilingual Speech-to-Speech Translation ... | BASE
30 | Improving the fusion of acoustic and text representations in RNN-T ... | BASE
31 | Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages ... | BASE
32 | Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques ... | BASE
33 | Multimodal Clustering with Role Induced Constraints for Speaker Diarization ... | BASE
34 | Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition ... | BASE
35 | Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset ... | BASE
36 | WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ... | BASE
37 | A Character-level Span-based Model for Mandarin Prosodic Structure Prediction ... | BASE
38 | CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ... | BASE
39 | Fine-grained Noise Control for Multispeaker Speech Synthesis ... | BASE
40 | Emotion Intensity and its Control for Emotional Voice Conversion ... | BASE