DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation ...
Abstract: Speech segmentation, which splits long speech into short segments, is essential for speech translation (ST). Popular VAD tools like WebRTC VAD have generally relied on pause-based segmentation. Unfortunately, pauses in speech do not necessarily match sentence boundaries, and sentences can be connected by a very short pause that is difficult to detect by VAD. In this study, we propose a speech segmentation method using a binary classification model trained using a segmented bilingual speech corpus. We also propose a hybrid method that combines VAD and the above speech segmentation method. Experimental results revealed that the proposed method is more suitable for cascade and end-to-end ST systems than conventional segmentation methods. The hybrid approach further improved the translation performance. ... : Submitted to INTERSPEECH 2022 ...
Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://arxiv.org/abs/2203.15479
https://dx.doi.org/10.48550/arxiv.2203.15479
BASE
Hide details
2
Representing `how you say' with `what you say': English corpus of focused speech and text reflecting corresponding implications ...
Suzuki, Naoaki; Nakamura, Satoshi. - : arXiv, 2022
BASE
Show details
3
Applying Syntax$\unicode{x2013}$Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis ...
BASE
Show details
4
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation ...
BASE
Show details
5
Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation ...
BASE
Show details
6
Simultaneous Neural Machine Translation with Constituent Label Prediction ...
BASE
Show details
7
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge ...
BASE
Show details
8
Speech-to-speech Translation between Untranscribed Unknown Languages ...
BASE
Show details
9
Towards Machine Speech-to-speech Translation
BASE
Show details
10
Multi-Source Neural Machine Translation with Missing Data ...
BASE
Show details
11
Local Monotonic Attention Mechanism for End-to-End Speech and Language Processing ...
BASE
Show details
12
Listening while Speaking: Speech Chain by Deep Learning ...
BASE
Show details
13
Incorporating Discrete Translation Lexicons into Neural Machine Translation ...
BASE
Show details
14
Context Awareness and Priority Control for ITS based on Automatic Speech Recognition
In: International conference on ITS Telecommunications ; https://hal.inria.fr/hal-01225312 ; International conference on ITS Telecommunications, Dec 2015, Copenhagen, Denmark ; http://www.itst-conf.org/ (2015)
BASE
Show details
15
Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015 ...
BASE
Show details
16
Spoken Dialogue Systems for Ambient Environments : Second International Workshop, IWSDS 2010, Gotemba, Shizuoka, Japan, October 1-2, 2010. Proceedings
Lee, Gary Geunbae; Mariani, Joseph; Minker, Wolfgang. - Berlin, Heidelberg : Springer Berlin Heidelberg, 2010
UB Frankfurt Linguistik
Show details
17
Validation of a training method for L2 continuous-speech segmentation
Cutler, Anne (R12329); Shanley, Janise. - : Japan, ISCA, 2010
BASE
Show details
18
Cultural communication idiosyncrasies in human-computer interaction
Miehle, Juliana; Ultes, Stefan; Minker, Wolfgang. - : ACL (Association for Computational Linguistics)
BASE
Show details
19ALAGIN - Advanced LAnGuage INformation Forum
http://www.alagin.jp/
Topic: Computational linguistics; Corpus linguistics; Pragmalinguistics / Communication research; ...
Language: Chinese, Mandarin; English; Japanese
Source type: Corpora; Linguistic associations; Software / Tools
Access: free access

Catalogues
1
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
0
Open access documents
17
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern