
Search in the Catalogues and Directories

Hits 1 – 20 of 34

1
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models ...
BASE
2
Improving Cross-Lingual Reading Comprehension with Self-Training ...
BASE
3
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation ...
Read paper: https://www.aclanthology.org/2021.findings-acl.92
Abstract: We study the possibility of building a non-autoregressive speech-to-text translation model using connectionist temporal classification (CTC), and use CTC-based automatic speech recognition as an auxiliary task to improve performance. CTC's success on translation is counter-intuitive given its monotonicity assumption, so we analyze its reordering capability. Kendall's tau distance is introduced as the quantitative metric, and gradient-based visualization provides an intuitive way to take a closer look at the model. Our analysis shows that transformer encoders are able to change the word order, and it points to future research directions worth exploring further in non-autoregressive speech translation. ...
Keyword: Computational Linguistics; Condensed Matter Physics; Deep Learning; Electromagnetism; FOS Physical sciences; Information and Knowledge Engineering; Neural Network; Semantics
URL: https://dx.doi.org/10.48448/htvt-6185
https://underline.io/lecture/26183-investigating-the-reordering-capability-in-ctc-based-non-autoregressive-end-to-end-speech-translation
BASE
4
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations ...
BASE
5
Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
BASE
6
Mitigating Biases in Toxic Language Detection through Invariant Rationalization ...
BASE
7
Looking for Clues of Language in Multilingual BERT to Improve Cross-lingual Generalization ...
BASE
8
DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation ...
BASE
9
What makes multilingual BERT multilingual? ...
BASE
10
A Study of Cross-Lingual Ability and Language-specific Information in Multilingual BERT ...
BASE
11
Pretrained Language Model Embryology: The Birth of ALBERT ...
BASE
12
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization ...
BASE
13
VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture ...
Wu, Da-Yi; Chen, Yen-Hao; Lee, Hung-Yi. - : arXiv, 2020
BASE
14
Defending Your Voice: Adversarial Attack on Voice Conversion ...
BASE
15
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention ...
BASE
16
Training a code-switching language model with monolingual data ...
BASE
17
Zero-shot Reading Comprehension by Cross-lingual Transfer Learning with Multi-lingual Language Representation Model ...
BASE
18
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning ...
BASE
19
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings ...
BASE
20
Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering ...
BASE

