2 |
Automatic Assessment of Speech Capability Loss in Disordered Speech
|
|
|
|
In: ACM Transactions on Accessible Computing, ACM New York, NY, USA 2015, 6 (3), pp. 1-14. ⟨10.1145/2739051⟩ ; ISSN: 1936-7228 ; EISSN: 1936-7236 ; https://hal.archives-ouvertes.fr/hal-01371812 (2015)
|
|
3 |
Context Awareness and Priority Control for ITS based on Automatic Speech Recognition
|
|
|
|
In: International Conference on ITS Telecommunications, Dec 2015, Copenhagen, Denmark ; https://hal.inria.fr/hal-01225312 ; http://www.itst-conf.org/ (2015)
|
|
4 |
A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion
|
|
|
|
In: SpringerPlus, SpringerOpen, 2015, ⟨10.1186/s40064-015-1428-2⟩ ; ISSN: 2193-1801 ; https://hal.inria.fr/hal-01221503 (2015)
|
|
11 |
IR-Depth Face Detection and Lip Localization Using Kinect V2
|
|
|
|
In: Master's Theses (2015)
|
|
Abstract:
Face recognition and lip localization are two main building blocks in the development of audio-visual automatic speech recognition (AV-ASR) systems. In many earlier works, face recognition and lip localization were conducted under uniform lighting conditions with simple backgrounds. However, such conditions are seldom found in real-world applications. In this paper, we present an approach to face recognition and lip localization that is invariant to lighting conditions. This is done by employing infrared and depth images captured by the Kinect V2 device. First, we present the use of infrared images for face detection. Second, we use the face's inherent depth information to reduce the search area for the lips by developing a nose point detection method. Third, we further reduce the search area by using a depth segmentation algorithm to separate the face from its background. Finally, with the reduced search range, we present a method for lip localization based on depth gradients. Experimental results demonstrated an accuracy of 100% for face detection and 96% for lip localization.
|
|
Keywords:
Audio-visual automatic speech recognition; depth information; face detection; infrared (IR); lip localization; Microsoft Kinect; signal processing
|
|
URL: https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=2586&context=theses https://digitalcommons.calpoly.edu/theses/1425
|
|
12 |
Automatic intelligibility measures applied to speech signals simulating age-related hearing loss
|
|
|
|
In: Proceedings of INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Sep 2015, Dresden, Germany, pp. 663-667 ; https://hal.archives-ouvertes.fr/hal-01343047 (2015)
|
|
13 |
The analysis and applications of Subglottal Resonances in height estimation and speaker identification and normalization
|
|
Guo, Jinxi. - : eScholarship, University of California, 2015
|
|
In: Guo, Jinxi. (2015). The analysis and applications of Subglottal Resonances in height estimation and speaker identification and normalization. UCLA: Electrical Engineering 0303. Retrieved from: http://www.escholarship.org/uc/item/5fq3p577 (2015)
|
|
14 |
Automatic-Type Calibration of Traditionally Derived Likelihood Ratios: Forensic Analysis of Australian English /o/ Formant Trajectories
|
|
|
|
In: Proceedings of Interspeech 2008 incorporating SST 2008 (2015)
|
|
19 |
Possibilities, Challenges And The State Of The Art Of Automatic Speech Recognition In Air Traffic Control ...
|
|
|
|
20 |
Possibilities, Challenges And The State Of The Art Of Automatic Speech Recognition In Air Traffic Control ...
|
|
|
|