DE eng

Search in the Catalogues and Directories

Hits 1 – 20 of 20

1
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization ...
Yan, Brian; Zhang, Chunlei; Yu, Meng. - : arXiv, 2021
BASE
Show details
2
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation ...
BASE
Show details
3
Self-Guided Curriculum Learning for Neural Machine Translation ...
Zhou, Lei; Ding, Liang; Duh, Kevin. - : arXiv, 2021
BASE
Show details
4
Arabic Speech Recognition by End-to-End, Modular Systems and Human ...
BASE
Show details
5
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec ...
BASE
Show details
6
On Prosody Modeling for ASR+TTS based Voice Conversion ...
BASE
Show details
7
Leveraging Pre-trained Language Model for Speech Sentiment Analysis ...
Shon, Suwon; Brusco, Pablo; Pan, Jing. - : arXiv, 2021
BASE
Show details
8
Differentiable Allophone Graphs for Language-Universal Speech Recognition ...
BASE
Show details
9
Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021 ...
Abstract: We present a system for the Zero Resource Speech Challenge 2021, which combines a Contrastive Predictive Coding (CPC) with deep cluster. In deep cluster, we first prepare pseudo-labels obtained by clustering the outputs of a CPC network with k-means. Then, we train an additional autoregressive model to classify the previously obtained pseudo-labels in a supervised manner. Phoneme discriminative representation is achieved by executing the second-round clustering with the outputs of the final layer of the autoregressive model. We show that replacing a Transformer layer with a Conformer layer leads to a further gain in a lexical metric. Experimental results show that a relative improvement of 35% in a phonetic metric, 1.5% in the lexical metric, and 2.3% in a syntactic metric are achieved compared to a baseline method of CPC-small which is trained on LibriSpeech 460h data. We achieve top results in this challenge with the syntactic metric. ...
Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://dx.doi.org/10.48550/arxiv.2107.05899
https://arxiv.org/abs/2107.05899
BASE
Hide details
10
Learning Speaker Embedding from Text-to-Speech ...
BASE
Show details
11
Massively Multilingual Adversarial Speech Recognition ...
BASE
Show details
12
Multilingual End-to-End Speech Translation ...
BASE
Show details
13
Towards Online End-to-end Transformer Automatic Speech Recognition ...
BASE
Show details
14
Transformer ASR with Contextual Block Processing ...
BASE
Show details
15
Analysis of Multilingual Sequence-to-Sequence speech recognition systems ...
BASE
Show details
16
Language model integration based on memory control for sequence to sequence speech recognition ...
BASE
Show details
17
Transfer learning of language-independent end-to-end ASR with language model fusion ...
BASE
Show details
18
Multi-Head Decoder for End-to-End Speech Recognition ...
BASE
Show details
19
Low-Resource Contextual Topic Identification on Speech ...
BASE
Show details
20
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
20
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern