DE eng

Search in the Catalogues and Directories

Hits 1 – 6 of 6

1
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation ...
Abstract: Speech segmentation, which splits long speech into short segments, is essential for speech translation (ST). Popular VAD tools like WebRTC VAD have generally relied on pause-based segmentation. Unfortunately, pauses in speech do not necessarily match sentence boundaries, and sentences can be connected by a very short pause that is difficult to detect by VAD. In this study, we propose a speech segmentation method using a binary classification model trained using a segmented bilingual speech corpus. We also propose a hybrid method that combines VAD and the above speech segmentation method. Experimental results revealed that the proposed method is more suitable for cascade and end-to-end ST systems than conventional segmentation methods. The hybrid approach further improved the translation performance. ... : Submitted to INTERSPEECH 2022 ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2203.15479
https://dx.doi.org/10.48550/arxiv.2203.15479
BASE
Hide details
2
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation ...
BASE
Show details
3
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation ...
BASE
Show details
4
Simultaneous Neural Machine Translation with Constituent Label Prediction ...
BASE
Show details
5
Towards Machine Speech-to-speech Translation
BASE
Show details
6
Multi-Source Neural Machine Translation with Missing Data ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern