21 |
Advanced Audio Interface for Phonetic Speech Recognition in a High Noise Environment
|
|
In: DTIC AND NTIS (2000)
|
|
BASE
|
|
Show details
|
|
22 |
The effects of symmetrical and asymmetrical sensorineural hearing loss on speech perception in noise
|
|
|
|
BASE
|
|
Show details
|
|
23 |
Spatial Audio Displays for Speech Communications: A Comparison of Free Field and Virtual Acoustic Environments
|
|
|
|
In: DTIC AND NTIS (1999)
|
|
BASE
|
|
Show details
|
|
24 |
Sound Basics: A Primer in Psychoacoustics
|
|
|
|
In: DTIC AND NTIS (1998)
|
|
BASE
|
|
Show details
|
|
25 |
The Acoustic-to-Articulatory Mapping of Voiced and Fricated Speech
|
|
|
|
In: DTIC AND NTIS (1997)
|
|
Abstract:
Acoustic-to-articulatory mapping is the estimation of a time-varying vocal-tract shape from an acoustic waveform. While most research in acoustic-to-articulatory mapping considers only purely voiced speech, this dissertation investigates the problem for speech that includes fricatives. Aspects of fricative production and perception challenge many of the assumptions and techniques used in existing acoustic-to-articulatory mapping algorithms. This work investigates these issues and extends existing techniques for the acoustic-to-articulatory mapping of purely voiced speech to unvoiced and voiced fricatives in isolation and in continuous speech. Linked-codebooks are used to examine the acoustic-to-articulatory mapping of voiced and unvoiced static fricatives. Acoustic-to-articulatory mapping performance is evaluated by analyzing articulatory estimation error for a number of synthetic fricatives and phonetic class clustering for a collection of real fricatives. Scatter plots of acoustic-to-articulatory mapping results on unvoiced fricatives demonstrate good phonetic class clustering and inter-class separability. For equivalent performance on voiced fricatives, the acoustic features had to be modified to deemphasize frequencies below 1 kHz. Linked-codebook lookup, along with dynamic programming, is used to perform acoustic-to-articulatory mapping of continuous, purely voiced speech. Direct application of the algorithm to speech containing fricatives suggests that purely voiced acoustic-to-articulatory mapping provides contextual information that can improve fricative acoustic-to-articulatory mapping. A five step procedure is developed for the dynamic acoustic-to-articulatory mapping of continuous, voiced speech containing intervocalic fricatives. A collection of vowel-fricative-vowel tokens is used for development and testing. In most cases, the estimated articulatory ; Doctoral thesis
|
|
Keyword:
*ACOUSTIC WAVES; *SPEECH ANALYSIS; *SPEECH ARTICULATION; ACOUSTIC DATA; ACOUSTIC MAPPING; ACOUSTIC SCATTERING; ACOUSTIC SIGNALS; Acoustics; ALGORITHMS; AUDITORY PERCEPTION; DYNAMIC PROGRAMMING; Linguistics; PHONETICS; THESES; Voice Communications; WAVEFORMS
|
|
URL: http://www.dtic.mil/docs/citations/ADA334781 http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA334781
|
|
BASE
|
|
Hide details
|
|
26 |
Perception of Auditory Events: Attentional Limitations
|
|
|
|
In: DTIC AND NTIS (1997)
|
|
BASE
|
|
Show details
|
|
27 |
Fuzzy Controller for Acoustic Vehicle Target Intercept Guidance
|
|
|
|
In: DTIC AND NTIS (1997)
|
|
BASE
|
|
Show details
|
|
28 |
Methods and Metrics of Voice Communications.
|
|
|
|
In: DTIC AND NTIS (1996)
|
|
BASE
|
|
Show details
|
|
29 |
A Two-Phase Damped-Exponential Model for Speech Synthesis
|
|
|
|
In: DTIC (1996)
|
|
BASE
|
|
Show details
|
|
30 |
Diphone-Based Speech Recognition Using Neural Networks.
|
|
|
|
In: DTIC AND NTIS (1996)
|
|
BASE
|
|
Show details
|
|
31 |
A Self-Organizing Neural Network Architecture for Auditory and Speech Perception with Applications to Acoustic and other Temporal Prediction Problems.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
32 |
The Unsupervised Acquisition of a Lexicon from Continuous Speech.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
33 |
A Robust Loose Coupling for Speech Recognition and Natural Understanding.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
34 |
Language Identification Through Parallel Phone Recognition.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
35 |
MMN And P3 Auditory Evoked Potentials within and Across Phonetic Categories.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
36 |
Training of Homoscedastic Hidden Markov Models For Automatic Speech Recognition.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
37 |
External/Internal Data Fusion Testbed: History, Components and Experimental Analysis.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
38 |
High-Order Modeling Techniques for Continuous Speech Recognition.
|
|
|
|
In: DTIC AND NTIS (1995)
|
|
BASE
|
|
Show details
|
|
39 |
A Self-Organizing Neural Network Architecture for Auditory and Speech Perception with Applications to Acoustic and Other Temporal Prediction Problems.
|
|
|
|
In: DTIC AND NTIS (1994)
|
|
BASE
|
|
Show details
|
|
40 |
A Self-Organizing Neural Network Architecture for Auditory and Speech Perception with Applications to Acoustic and Other Temporal Prediction Problems
|
|
|
|
In: DTIC AND NTIS (1994)
|
|
BASE
|
|
Show details
|
|
|
|