Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
CASA-based robust speaker identification
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 5, 1608-1616
BLLDB
OLC Linguistik
2
A computational auditory scene analysis system for speech segregation and robust speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 24 (2010) 1, 77-93
OLC Linguistik
3
A computational auditory scene analysis system for speech segregation and robust speech recognition
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 24 (2010) 1, 77-93
BLLDB
OLC Linguistik
4
Sequential organization of speech in computational auditory scene analysis
In: Speech communication. - Amsterdam [u.a.] : Elsevier 51 (2009) 8, 657-667
BLLDB
OLC Linguistik
5
Sequential organization of speech in computational auditory scene analysis
In: Speech communication. - Amsterdam [u.a.] : Elsevier 51 (2009) 8, 657-667
OLC Linguistik
6
Monaural Speech Segregation by Integrating Primitive and Schema-Based Analysis
In: DTIC (2008)
BASE
7
A Computational Auditory Scene Analysis System for Speech Segregation and Robust Speech Recognition
Abstract: A conventional automatic speech recognizer does not perform well in the presence of multiple sound sources, while human listeners are able to segregate and recognize a signal of interest through auditory scene analysis. We present a computational auditory scene analysis system for separating and recognizing target speech in the presence of competing speech or noise. We estimate, in two stages, the ideal binary time-frequency (T-F) mask, which retains the mixture in a local T-F unit if and only if the target is stronger than the interference within the unit. In the first stage, we use harmonicity to segregate the voiced portions of individual sources in each time frame based on multipitch tracking. Additionally, unvoiced portions are segmented based on an onset/offset analysis. In the second stage, speaker characteristics are used to group the T-F units across time frames. The resulting masks are used in an uncertainty decoding framework for automatic speech recognition. We evaluate our system on a speech separation challenge and show that our system yields substantial improvement over the baseline performance.
Keyword: auditory signals; binary time-frequency mask; CASA (computational auditory scene analysis); noise; robust speech recognition; speech; speech analysis; speech recognition; speech segregation; uncertainty decoding frameworks; voice communications
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=AD1001212
http://www.dtic.mil/docs/citations/AD1001212
BASE
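The abstract of hit 7 defines the ideal binary T-F mask as keeping a unit of the mixture if and only if the target is stronger than the interference in that unit. The following is a minimal sketch of that definition only, not of the two-stage estimation the authors describe. It assumes the per-unit target and interference energies are already known, which holds only when the premixed signals are available. The function names `ideal_binary_mask` and `apply_mask` and the small energy matrices are hypothetical, chosen for illustration.

```python
from typing import List

Matrix = List[List[float]]


def ideal_binary_mask(target_energy: Matrix, interference_energy: Matrix) -> List[List[int]]:
    """Return 1 for each T-F unit where the target is strictly stronger, else 0.

    Rows are time frames and columns are frequency channels (an assumed layout).
    """
    if len(target_energy) != len(interference_energy):
        raise ValueError("inputs must have the same number of frames")
    mask = []
    for target_row, interference_row in zip(target_energy, interference_energy):
        if len(target_row) != len(interference_row):
            raise ValueError("inputs must have the same number of channels")
        mask.append([1 if t > i else 0 for t, i in zip(target_row, interference_row)])
    return mask


def apply_mask(mixture: Matrix, mask: List[List[int]]) -> Matrix:
    """Keep mixture units where the mask is 1 and zero out the rest."""
    return [[m * k for m, k in zip(m_row, k_row)] for m_row, k_row in zip(mixture, mask)]


# Hypothetical energies for two frames and two channels.
target = [[4.0, 0.5], [1.0, 3.0]]
interference = [[1.0, 2.0], [1.0, 0.5]]
mixture = [[5.0, 2.5], [2.0, 3.5]]

mask = ideal_binary_mask(target, interference)
masked = apply_mask(mixture, mask)
```

With these values the unit where target and interference are equal is discarded, because the definition requires the target to be strictly stronger.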
8
Model-based sequential organization in cochannel speech
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 14 (2006) 1, 289-298
BLLDB
OLC Linguistik
