DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Assessing the quality of TTS audio in the LARA learning-by-reading platform
In: ISBN: 9782490057979 ; CALL and professionalisation: short papers from EUROCALL 2021 pp. 1-5 (2021)
BASE
Show details
2
PROMIS: a statistical-parametric speech synthesis system with prominence control via a prominence network
Malisz, Zofia; Berthelsen, Harald; Beskow, Jonas; Gustafson, Joakim. - : KTH, Tal, musik och hörsel, TMH, 2019. : KTH, Tal-kommunikation, 2019. : STTS – Södermalms talteknologiservice AB, 2019. : Vienna, 2019
Abstract: We implement an architecture with explicit prominence learning via a prominence network in Merlin, a statistical-parametric DNN-based text-to-speech system. We build on our previous results that successfully evaluated the inclusion of an automatically extracted, speech-based prominence feature into the training and its control at synthesis time. In this work, we expand the PROMIS system by implementing the prominence network that predicts prominence values from text. We test the network predictions as well as the effects of a prominence control module based on SSML-like tags. Listening tests for the complete PROMIS system, combining a prominence feature, a prominence network and prominence control, show that it effectively controls prominence in a diagnostic set of target words. The tests also show a minor negative impact on perceived naturalness, relative to baseline, exerted by the two prominence tagging methods implemented in the control module. ; QC 20201020
Keyword: Computer Systems; Datorsystem
URL: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-283137
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern