81 | Prosodic Stress, Information, and Intelligibility of Speech in Noise
In: DTIC (2009)
BASE

82 | Speech Processing in Realistic Battlefield Environments (Le Traitement de la Parole en Environnement de Combat Realiste)
In: DTIC (2009)
BASE

83 | Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE

84 | 2-D Processing of Speech for Multi-Pitch Analysis
In: DTIC (2009)
BASE

85 | Towards Co-Channel Speaker Separation by 2-D Demodulation of Spectrograms
In: DTIC (2009)
Abstract: This paper explores a two-dimensional (2-D) processing approach for co-channel speaker separation of voiced speech. We analyze localized time-frequency regions of a narrowband spectrogram using 2-D Fourier transforms and propose a 2-D amplitude modulation model based on pitch information for single and multi-speaker content in each region. Our model maps harmonically-related speech content to concentrated entities in a transformed 2-D space, thereby motivating 2-D demodulation of the spectrogram for analysis/synthesis and speaker separation. Using a priori pitch estimates of individual speakers, we show through a quantitative evaluation (1) the utility of the model for representing speech content of a single speaker and (2) its feasibility for speaker separation. For the separation task, we also illustrate benefits of the model's representation of pitch dynamics relative to a sinusoidal-based separation system. Presented at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics held in New Paltz, NY on Oct 18-21, 2009. Published in Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, v65-68, Oct 2009.
Keyword: *SPEECH ANALYSIS; FOURIER TRANSFORMATION; GRATING COMPRESSION TRANSFORM; SEPARATION; SPEAKERS; SPECTROGRAMS; SPECTROGRAPHY; TWO DIMENSIONAL; Voice Communications; WORKSHOPS
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA519581 http://www.dtic.mil/docs/citations/ADA519581
BASE

86 | Perturbation and Pitch Normalization as Enhancements to Speaker Recognition
In: DTIC (2009)
BASE

87 | Long Term Examination of Intra-Session and Inter-Session Speaker Variability
In: DTIC (2009)
BASE

88 | Spoken Word Recognition by Humans: A Single- or a Multi-Layer Process
In: DTIC (2008)
BASE

89 | Automated Speech Intelligibility System for Head-Borne Personal Protective Equipment: Proof of Concept
In: DTIC (2008)
BASE

90 | Automating Convoy Training Assessment to Improve Soldier Performance
In: DTIC (2008)
BASE

91 | Odds of Successful Transfer of Low-level Concepts: A Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA's TRANSTAC Program
In: DTIC (2008)
BASE

92 | A Review of Contributions by Australian Research Institutions into Speech Processing
In: DTIC (2008)
BASE

93 | Pilot English Language Proficiency and the Prevalence of Communication Problems at Five U.S. Air Route Traffic Control Centers
In: DTIC (2008)
BASE

94 | Iterated Class-Specific Subspaces for Speaker-Dependent Phoneme Classification
In: DTIC (2008)
BASE

95 | Listener Detection of Talker Stress in Low-Rate Coded Speech
In: DTIC (2008)
BASE

96 | Spatial Hearing, Attention and Informational Masking in Speech Identification
In: DTIC (2008)
BASE

97 | Entropy Based Classifier Combination for Sentence Segmentation
In: DTIC (2007)
BASE

98 | Experimental Tuning of the AIFSN Parameter to Prioritize Voice Over Data Transmission in 802.11E WLAN Networks
In: Conference papers (2007)
BASE

99 | A Model for Predicting Intelligibility of Binaurally Perceived Speech
In: DTIC (2007)
BASE

100 | Linking Semantic and Knowledge Representations in a Multi-Domain Dialogue System
In: DTIC (2007)
BASE