Home
Catalogue search
Refine your search:
Keyword:
Electrical and Computer Engineering (3)
Creator / Publisher:
Zhang, Xiaozheng (4)
Clements, Mark A. (3)
Mersereau, Russell M. (3)
Broun, Charles C. (2)
Brown, Douglas (1)
Cao, Jie (1)
Gao, Yongsheng (1)
Li, Hanxi (1)
Wang, Bin (1)
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 4 of 4
1
Polygonal approximation using integer particle swarm optimization
Wang, Bin
;
Brown, Douglas
;
Zhang, Xiaozheng
...
In:
Information sciences. - New York, NY : Elsevier Science Inc.
278 (2014), 311-326
OLC Linguistik
Show details
2
Audio-Visual Speech Recognition by Speechreading
Zhang, Xiaozheng
;
Clements, Mark A.
;
Mersereau, Russell M.
In: Electrical Engineering (2002)
BASE
Show details
3
Visual Speech Feature Extraction for Improved Speech Recognition
Zhang, Xiaozheng
;
Mersereau, Russell M.
;
Broun, Charles C.
;
Clements, Mark A.
In: Electrical Engineering (2002)
Abstract:
Mainstream automatic speech recognition has focused almost exclusively on the acoustic signal. The performance of these systems degrades considerably in the real world in the presence of noise. On the other hand, most human listeners, both hearing-impaired and normal hearing, make use of visual information to improve speech perception in acoustically hostile environments. Motivated by humans' ability to lipread, the visual component is considered to yield information that is not always present in the acoustic signal and enables improved accuracy over totally acoustic systems, especially in noisy environments. In this paper, we investigate the usefulness of visual information in speech recognition. We first present a method for automatically locating and extracting visual speech features from a talking person in color video sequences. We then develop a recognition engine to train and recognize sequences of visual parameters for the purpose of speech recognition. We particularly explore the impact of various combinations of visual features on the recognition accuracy. We conclude that the inner lip contour features together with the information about the visibility of the tongue and teeth significantly improve the performance over using outer contour only features in both speaker dependent and speaker independent recognition tasks.
Keyword:
Electrical and Computer Engineering
URL:
https://digitalcommons.calpoly.edu/eeng_fac/264
https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=1264&context=eeng_fac
BASE
Hide details
4
Automatic Speechreading with Application to Speaker Verification
Zhang, Xiaozheng
;
Mersereau, Russell M.
;
Broun, Charles C.
...
In: Electrical Engineering (2002)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
1
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
3
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern