Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Patch-based representation of visual speech
In: http://crpit.com/confpapers/CRPITV56Lucey2.pdf (2006)
BASE
2
Patch-Based Representation of Visual Speech
In: http://eprints.qut.edu.au/12844/1/12844.pdf
Abstract: Visual information from a speaker’s mouth region is known to improve automatic speech recognition robustness, especially in the presence of acoustic noise. To date, the vast majority of work in this field has viewed these visual features in a holistic manner, which may not take into account the various changes that occur within articulation (the process of changing the shape of the vocal tract using the articulators, i.e., the lips and jaw). Motivated by work in the fields of audio-visual automatic speech recognition (AVASR) and face recognition, using articulatory features (AFs) and patches respectively, we present a proof-of-concept paper which represents the mouth region as an ensemble of image patches. Our experiments show that by dealing with the mouth region in this manner, we are able to extract more speech information from the visual domain. For the task of visual-only, speaker-independent isolated digit recognition, we achieved a relative improvement in word error rate of more than 23% on the CUAVE audio-visual corpus.
Keyword: Articulatory Features (AFs); Patches; Visual Speech Recognition (VSR)
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.154.9408
http://eprints.qut.edu.au/12844/1/12844.pdf
BASE
