DE eng

Search in the Catalogues and Directories

Hits 1 – 4 of 4

1
Audiovisual speech recognition with articulator positions as hidden variables
In: http://www.sls.csail.mit.edu/sls/publications/2007/1719.pdf (2007)
BASE
Show details
2
Audiovisual speech recognition with articulator positions as hidden variables
In: http://people.csail.mit.edu/klivescu/papers/hasegawa-johnson_icphs07.pdf (2007)
Abstract: Speech recognition, by both humans and machines, benefits from visual observation of the face, especially at low signal-to-noise ratios (SNRs). It has often been noticed, however, that the audible and visible correlates of a phoneme may be asynchronous; perhaps for this reason, automatic speech recognition structures that allow asynchrony between the audible phoneme and the visible viseme outperform recognizers that allow no such asynchrony. This paper proposes, and tests using experimental speech recognition systems, a new explanation for audio-visual asynchrony. Specifically, we propose that audio-visual asynchrony may be the result of asynchrony between the gestures implemented by different articulators, such that the most visibly salient articulator (e.g., the lips) and the most audibly salient articulator (e.g., the glottis) may, at any given time, be dominated by gestures associated with different phonemes. The proposed model of audio-visual asynchrony is tested by implementing an “articulatory-feature model ” audiovisual speech recognizer: a system with multiple hidden state variables, each representing the gestures of one articulator. The proposed system performs as well as a standard audiovisual recognizer on a digit recognition task; the best results are achieved by combining the outputs of the two systems.
Keyword: Articulatory Phonology; Audiovisual Speech; Automatic Speech Recognition (ASR; Dynamic Bayesian Network (DBN
URL: http://people.csail.mit.edu/klivescu/papers/hasegawa-johnson_icphs07.pdf
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.118.2363
BASE
Hide details
3
Audiovisual speech recognition with articulator positions as hidden variables
In: http://www.ifp.uiuc.edu/speech/pubs/2007/hasegawa-johnson07icphs.pdf (2007)
BASE
Show details
4
Audiovisual speech recognition with articulator positions as hidden variables
In: http://www.icphs2007.de/conference/Papers/1719/1719.pdf
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
4
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern