1 |
The effect of phoneme distribution on perceptual similarity in English
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Emotional response language education: a first ‘in-the-wild’ evaluation
|
|
|
|
BASE
|
|
Show details
|
|
3 |
The effect of soft, modal and loud voice levels on entrainment in noisy conditions
|
|
|
|
BASE
|
|
Show details
|
|
6 |
The Effect of Soft, Modal and Loud Voice Levels on Entrainment in Noisy Conditions
|
|
|
|
BASE
|
|
Show details
|
|
11 |
A system for facial expression-based affective speech translation
|
|
|
|
BASE
|
|
Show details
|
|
13 |
A Multi-Agent Computational Linguistic Approach to Speech Recognition
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Multi-level exemplar-based duration generation for expressive speech synthesis
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Evaluating expressive speech synthesis from audiobooks in conversational phrases
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Synthesizing expressive speech from amateur audiobook recordings
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz.
|
|
|
|
BASE
|
|
Show details
|
|
18 |
WinkTalk: a multimodal speech synthesis interface linking facial expressions to expressive synthetic voices
|
|
|
|
BASE
|
|
Show details
|
|
19 |
WinkTalk : a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters.
|
|
|
|
Abstract:
A great challenge for text-to-speech synthesis is to produce ex- pressive speech. The main problem is that it is difficult to syn- thesise high-quality speech using expressive corpora. With the increasing interest in audiobook corpora for speech synthesis, there is a demand to synthesise speech which is rich in prosody, emotions and voice styles. In this work, Self-Organising Fea- ture Maps (SOFM) are used for clustering the speech data using voice quality parameters of the glottal source, in order to map out the variety of voice styles in the corpus. Subjective evalu- ation showed that this clustering method successfully separated the speech data into groups of utterances associated with dif- ferent voice characteristics. This work can be applied in unit- selection synthesis by selecting appropriate data sets to synthe- sise utterances with specific voice styles. It can also be used in parametric speech synthesis to model different voice styles separately. ; QC 20160426
|
|
Keyword:
Engineering and Technology; Teknik och teknologier
|
|
URL: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-185519
|
|
BASE
|
|
Hide details
|
|
|
|