Catalogue search • Linguistik portal • Fachinformationsdienst (FID)

1	Automatic Prediction of Intelligible Speaking Rate for Individuals with ALS from Speech Acoustic and Articulatory Samples
	Wang, Jun; Kothalkar, Prasanna V.; Kim, Myungjong. - 2018
	BASE
	Show details

2	Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data
	Cao, Beiming; Kim, Myungjong; Mau, Ted; Wang, Jun. - 2016
	Abstract: Individuals with larynx (vocal folds) impaired have problems in controlling their glottal vibration, producing whispered speech with extreme hoarseness. Standard automatic speech recognition using only acoustic cues is typically ineffective for whispered speech because the corresponding spectral characteristics are distorted. Articulatory cues such as the tongue and lip motion may help in recognizing whispered speech since articulatory motion patterns are generally not affected. In this paper, we investigated whispered speech recognition for patients with reconstructed larynx using articulatory movement data. A data set with both acoustic and articulatory motion data was collected from a patient with surgically reconstructed larynx using an electromagnetic articulograph. Two speech recognition systems, Gaussian mixture model-hidden Markov model (GMM-HMM) and deep neural network-HMM (DNN-HMM), were used in the experiments. Experimental results showed adding either tongue or lip motion data to acoustic features such as mel-frequency cepstral coefficient (MFCC) significantly reduced the phone error rates on both speech recognition systems. Adding both tongue and lip data achieved the best performance.
	Keyword: Article
	URL: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5800526/ https://doi.org/10.21437/SLPAT.2016-14
	BASE
	Hide details

Search in the Catalogues and Directories