2 |
Document Image Parsing and Understanding using Neuromorphic Architecture
|
|
|
|
In: DTIC (2015)
|
|
BASE
|
|
Show details
|
|
3 |
Machine Recognition vs Human Recognition of Voices
|
|
|
|
In: DTIC (2012)
|
|
Abstract:
While automated speaker recognition by machines can be quite good as demonstrated in NIST Speaker Recognition Evaluations, performance can still suffer when environmental conditions, emotions, or recording quality change. This research examines how robust humans are compared with machines at speaker recognition in changing environments. Several data conditions, including short sentences, frequency selective noise, and time-reversed speech were used to test the robustness of human listeners versus machine algorithms. Statistical significance tests were completed on the results and, for under conditions, human speaker recognition was more robust. The strength of the human listeners was especially evident for the challenging case of noise in the 2000-3000 Hz frequency range. Additional analysis was performed to identify factors that may impact a listener's ability to identify a person's identity. For example, the amount of voiced (or unvoiced) speech was examined to see if there was a correlation with how easily a speaker's voice was recognized. Unfortunately, the amount of voiced (or unvoiced) speech did not correlate strongly with how easily a speaker's voice was recognized. Other factors such as fundamental pitch, formant locations, pitch shimmer, pitch jitter, and other modulation measures also are being examined. The original goal of this effort was to discover which frequency bands are most important for the familiar speaker recognition task. This research was a cursory look at what frequency information is important for speaker identification. More listening experiments with better randomization of stimuli and phonetic consideration are required. ; See also ADA561051. Presented at the International Conference on Acoustics, Speech and Signal Processing (37th) (ICASSP 2012) held in Kyoto, Japan, on March 25-30, 2012. Published in the Proceedings of the 37th International Conference on Acoustics, Speech and Signal Processing, p4245-4248, 2012. U.S. Government or Federal Purpose Rights License. The original document contains color images.
|
|
Keyword:
*AUDIO FREQUENCY; *FAMILIAR SPEAKERS; *FREQUENCY BANDS; *HUMAN PERFORMANCE; *LEARNING MACHINES; *MACHINE PERFORMANCE; *PERFORMANCE(ENGINEERING); *PERFORMANCE(HUMAN); *SIGNAL TO NOISE RATIO; *SPEAKER IDENTIFICATION; *SPEAKER RECOGNITION; *SPEECH RECOGNITION; Acoustics; ALGORITHMS; AUDITORY SIGNALS; BACKGROUND NOISE; CO-WORKER IDENTIFICATION; CUES(STIMULI); Cybernetics; HEARING DEFICIT GROUP; HUMAN LISTENERS; IDENTIFICATION; NORMAL HEARING GROUP; Psychology; SIGNAL PROCESSING; SPEAKER CUES; SPEAKER FAMILIARITY; SPEECH SIGNAL DEGRADATION; SPEECH SIGNALS; SPEECH-SHAPED ADDITIVE NOISE; STATISTICAL ANALYSIS; SYMPOSIA; TRAINING; Voice Communications
|
|
URL: http://www.dtic.mil/docs/citations/ADA568903 http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA568903
|
|
BASE
|
|
Hide details
|
|
4 |
Performance and Power Optimization for Cognitive Processor Design Using Deep-Submicron Very Large Scale Integration (VLSI) Technology
|
|
|
|
In: DTIC (2010)
|
|
BASE
|
|
Show details
|
|
5 |
A Hybrid Approach for QA Track Definitional Questions
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
6 |
Spoken Dialogue for Simulation Control and Conversational Tutoring
|
|
|
|
In: DTIC (2004)
|
|
BASE
|
|
Show details
|
|
7 |
Automated Tutoring Dialogues for Training in Shipboard Damage Control
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
8 |
Translating English and Mandarin Verbs with Argument Structure (Mis)matches Using LCS Representation
|
|
|
|
In: DTIC (1998)
|
|
BASE
|
|
Show details
|
|
9 |
Beyond Word Processing: Using an Interactive Learning Environment to Teach Writing.
|
|
|
|
In: DTIC AND NTIS (1996)
|
|
BASE
|
|
Show details
|
|
10 |
High-Performance Speech Recognition Using Consistency Modeling.
|
|
|
|
In: DTIC AND NTIS (1994)
|
|
BASE
|
|
Show details
|
|
11 |
GE-CMU: Description of the Shogun System Used for MUC-5
|
|
|
|
In: DTIC (1993)
|
|
BASE
|
|
Show details
|
|
13 |
Tipster Shogun System (Joint GE-CMU): MUC-4 Test Results and Analysis
|
|
|
|
In: DTIC (1992)
|
|
BASE
|
|
Show details
|
|
14 |
The Cognitive, Perceptual, and Neural Bases of Skilled Performance.
|
|
|
|
In: DTIC AND NTIS (1992)
|
|
BASE
|
|
Show details
|
|
15 |
Performance Analysis of Large Adaptive Sidelobe Canceller Arrays with Reused Elements
|
|
|
|
In: DTIC AND NTIS (1989)
|
|
BASE
|
|
Show details
|
|
|
|