Page: 1 2 3 4 5 6 7... 292
41. Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages ...
42. Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques ...
43. AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning ...
44. Multimodal Clustering with Role Induced Constraints for Speaker Diarization ...
46. Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition ...
47. Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset ...
48. WavThruVec: Latent speech representation as intermediate features for neural speech synthesis ...
49. Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers ...
50. A Character-level Span-based Model for Mandarin Prosodic Structure Prediction ...
51. CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition ...
52. Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model ...
53. Fine-grained Noise Control for Multispeaker Speech Synthesis ...
54. Emotion Intensity and its Control for Emotional Voice Conversion ...
55. Automatic Speech recognition for Speech Assessment of Preschool Children ...
56. The HCCL-DKU system for fake audio generation task of the 2022 ICASSP ADD Challenge ...
57. Dawn of the transformer era in speech emotion recognition: closing the valence gap ...
58. Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents ...
59. KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics ...
60. Automated speech tools for helping communities process restricted-access corpora for language revival efforts ...