22. Incorporating Lexical and Prosodic Information at Different Levels for Meeting Summarization ...
BASE

24. Recording speech articulation in dialogue: Evaluating a synchronized double Electromagnetic Articulography setup

25. Toward summarization of communicative activities in spoken conversation

26. Cross-lingual automatic speech recognition using tandem features

27. Ageing voices: The effect of changes in voice parameters on ASR performance

28. Evaluating speech synthesis intelligibility using Amazon Mechanical Turk
Abstract:
Microtask platforms such as Amazon Mechanical Turk (AMT) are increasingly used to create speech and language resources. AMT in particular allows researchers to quickly recruit a large number of fairly demographically diverse participants. In this study, we investigated whether AMT can be used for comparing the intelligibility of speech synthesis systems. We conducted two experiments in the lab and via AMT, one comparing US English diphone to US English speaker-adaptive HTS synthesis and one comparing UK English unit selection to UK English speaker-dependent HTS synthesis. While AMT word error rates were worse than lab error rates, AMT results were more sensitive to relative differences between systems. This is mainly due to the larger number of listeners. Boxplots and multilevel modelling allowed us to identify listeners who performed particularly badly, while thresholding was sufficient to eliminate rogue workers. We conclude that AMT is a viable platform for synthetic speech intelligibility comparisons.
URL: http://hdl.handle.net/1842/4660

31. Transforming Voice Source Parameters in a HMM-based Speech Synthesiser with Glottal Post-Filtering

32. HMM-based speech synthesis using an acoustic glottal source model

33. Investigating Non-Uniqueness in the Acoustic-Articulatory Inversion Mapping

37. Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects
In: http://infoscience.epfl.ch/record/145946 (2010)

38. Recognition and Understanding of Meetings Overview of the European AMI and AMIDA Projects
In: http://infoscience.epfl.ch/record/146377 (2010)

39. Combining Spectral Representations for Large Vocabulary Continuous Speech Recognition

40. Recognition of Dialogue Acts in Multiparty Meetings using a Switching DBN