Catalogue search • Linguistik portal • Fachinformationsdienst (FID)

1	An empirical analysis of information encoded in disentangled neural speaker representations ...
	Peri, Raghuveer; Li, Haoqi; Somandepalli, Krishna. - : arXiv, 2020
	BASE
	Show details

2	Predicting Behavior in Cancer-Afflicted Patient and Spouse Interactions using Speech and Language ...
	Chakravarthula, Sandeep Nallan; Li, Haoqi; Tseng, Shao-Yen. - : arXiv, 2019
	BASE
	Show details

3	Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling ...
	Shivakumar, Prashanth Gurunath; Li, Haoqi; Knight, Kevin; Georgiou, Panayiotis. - : arXiv, 2018
	Abstract: Automatic speech recognition (ASR) systems often make unrecoverable errors due to subsystem pruning (acoustic, language and pronunciation models); for example pruning words due to acoustics using short-term context, prior to rescoring with long-term context based on linguistics. In this work we model ASR as a phrase-based noisy transformation channel and propose an error correction system that can learn from the aggregate errors of all the independent modules constituting the ASR and attempt to invert those. The proposed system can exploit long-term context using a neural network language model and can better choose between existing ASR output possibilities as well as re-introduce previously pruned or unseen (out-of-vocabulary) phrases. It provides corrections under poorly performing ASR conditions without degrading any accurate transcriptions; such corrections are greater on top of out-of-domain and mismatched data ASR. Our system consistently provides improvements over the baseline ASR, even when baseline ...
	Keyword: Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.1802.02607 https://arxiv.org/abs/1802.02607
	BASE
	Hide details

Search in the Catalogues and Directories