1 |
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition ...
BASE
2 |
Introducing Phonetic Information to Speaker Embedding for Speaker Verification
In: Electrical and Computer Engineering Faculty Publications (2019)
BASE
4 |
Advanced Recurrent Network-Based Hybrid Acoustic Models for Low Resource Speech Recognition
In: Electrical and Computer Engineering Faculty Publications (2018)
BASE
5 |
Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification ...
BASE
6 |
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification ...
BASE
7 |
Jaw Rotation in Dysarthria Measured With a Single Electromagnetic Articulography Sensor
In: Speech Pathology and Audiology Faculty Research and Publications (2017)
BASE
8 |
Development of Kinematic Templates for Automatic Pronunciation Assessment Using Acoustic-to-Articulatory Inversion
In: Master's Theses (2009 -) (2017)
BASE
9 |
Acoustic sequences in non-human animals: a tutorial review and prospectus
BASE
10 |
Analysis of Interference between Electromagnetic Articulography and Electroglottograph Systems
In: Master's Theses (2009 -) (2016)
BASE
11 |
Parallel Reference Speaker Weighting for Kinematic-Independent Acoustic-to-Articulatory Inversion
In: Speech Pathology and Audiology Faculty Research and Publications (2016)
Abstract:
Acoustic-to-articulatory inversion, the estimation of articulatory kinematics from an acoustic waveform, is a challenging but important problem. Accurate estimation of articulatory movements has the potential for significant impact on our understanding of speech production, on our capacity to assess and treat pathologies in a clinical setting, and on speech technologies such as computer aided pronunciation assessment and audio-video synthesis. However, because of the complex and speaker-specific relationship between articulation and acoustics, existing approaches for inversion do not generalize well across speakers. As acquiring speaker-specific kinematic data for training is not feasible in many practical applications, this remains an important and open problem. This paper proposes a novel approach to acoustic-to-articulatory inversion, Parallel Reference Speaker Weighting (PRSW), which requires no kinematic data for the target speaker and a small amount of acoustic adaptation data. PRSW hypothesizes that acoustic and kinematic similarities are correlated and uses speaker-adapted articulatory models derived from acoustically derived weights. The system was assessed using a 20-speaker data set of synchronous acoustic and Electromagnetic Articulography (EMA) kinematic data. Results demonstrate that by restricting the reference group to a subset consisting of speakers with strong individual speaker-dependent inversion performance, the PRSW method is able to attain kinematic-independent acoustic-to-articulatory inversion performance nearly matching that of the speaker-dependent model, with an average correlation of 0.62 versus 0.63. This indicates that given a sufficiently complete and appropriately selected reference speaker set for adaptation, it is possible to create effective articulatory models without kinematic training data.
Keyword:
acoustic-to-articulatory inversion; acoustics; adaptation models; Electrical and Computer Engineering; electromagnetic articulography; hidden Markov models; kinematics; maximum likelihood estimation; speech; Speech Pathology and Audiology; speech processing
URL: https://epublications.marquette.edu/cgi/viewcontent.cgi?article=1038&context=spaud_fac https://epublications.marquette.edu/spaud_fac/39
BASE
12 |
Acoustic Sequences in Non-human Animals: A Tutorial Review and Prospectus
In: Electrical and Computer Engineering Faculty Research and Publications (2016)
BASE
13 |
Acoustic sequences in non-human animals: a tutorial review and prospectus
BASE
14 |
Acoustic sequences in non-human animals: a tutorial review and prospectus.
BASE
16 |
Embodied cognition, Latin pedagogy, and the rhetorical foundations of medieval vernacular poetry
BASE
18 |
The Electromagnetic Articulography Mandarin Accented English (EMA-MAE) Corpus of Acoustic and 3D Articulatory Kinematic Data
In: Speech Pathology and Audiology Faculty Research and Publications (2014)
BASE
19 |
Sensorimotor Adaptation of Speech Using Real-time Articulatory Resynthesis
In: Speech Pathology and Audiology Faculty Research and Publications (2014)
BASE
20 |
Physiologically-motivated Feature Extraction for Speaker Identification
In: Electrical and Computer Engineering Faculty Research and Publications (2014)
BASE