1 |
The effect of phoneme distribution on perceptual similarity in English
Abstract:
The 20th Annual Conference of the International Speech Communication Association (Interspeech 2019), Graz, Austria, 15-19 September 2019.
This paper investigates the extent to which native speaker perceptions of the similarity between phonemes of English are influenced by the phonemes' distributional properties. A similarity hierarchy model based on the distribution of consonantal phonemes in English was generated by creating phoneme embeddings from contextual information. We compare this to similarity models based on phonological feature theory and on native speaker perception. Characteristics of the perception-based model are shown to appear in the distribution-based model whilst not being captured by the feature-based model. This not only provides evidence that similarity perceptions are influenced by distributional properties but is also an argument for incorporating distributional information alongside phonological features when modelling perceptual similarity.
Keyword:
Distribution; Perception; Phoneme similarity; Phonology
URL: https://doi.org/10.21437/Interspeech.2019-3042 http://hdl.handle.net/10197/11508
2 |
Emotional response language education: a first ‘in-the-wild’ evaluation
3 |
The effect of soft, modal and loud voice levels on entrainment in noisy conditions
6 |
The Effect of Soft, Modal and Loud Voice Levels on Entrainment in Noisy Conditions
11 |
A system for facial expression-based affective speech translation
13 |
A Multi-Agent Computational Linguistic Approach to Speech Recognition
14 |
Multi-level exemplar-based duration generation for expressive speech synthesis
15 |
Evaluating expressive speech synthesis from audiobooks in conversational phrases
16 |
Synthesizing expressive speech from amateur audiobook recordings
17 |
Rapidly Testing the Interaction Model of a Pronunciation Training System via Wizard-of-Oz
18 |
WinkTalk: a multimodal speech synthesis interface linking facial expressions to expressive synthetic voices
19 |
WinkTalk: a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices
20 |
Clustering Expressive Speech Styles in Audiobooks Using Glottal Source Parameters