1 |
Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment
|
|
|
|
In: DTIC (2015)
|
|
BASE
|
|
Show details
|
|
3 |
Distributed Estimation in Sensor Networks with Imperfect Model Information: An Adaptive Learning-Based Approach
|
|
|
|
In: DTIC (2012)
|
|
Abstract:
The paper considers the problem of distributed estimation of an unknown deterministic scalar parameter (the target signal) in wireless sensor networks (WSNs), in which each sensor receives a single snapshot of the field. The observation or sensing mode is only partially known at the corresponding nodes, perhaps, due to their limited sensing capabilities or other unpredictable physical factors. Specifically, it is assumed that the observation process at a node switches stochastically between two modes, with mode one corresponding to the desired signal plus noise observation mode (a valid observation), and mode two corresponding to pure noise with no signal information (an invalid observation). With no prior information on the local sensing modes (valid or invalid), the paper introduces a learning-based distributed estimation procedure, the mixed detection-estimation (MDE) algorithm, based on closed-loop interactions between the iterative distributed mode learning and estimation. The online learning (or sensing mode detection) step re-assesses the validity of the local observations at each iteration, thus refining the ongoing estimation update process. The convergence of the MDE algorithm is established analytically. Simulation studies show that, in the high signal-to-noise ratio (SNR) regime, the MDE estimation error converges to that of an ideal (centralized) estimator with perfect information about the node sensing modes. This is in contrast with the estimation performance of a naive average consensus based distributed estimator (with no mode learning), whose estimation error blows up with an increasing SNR. ; See also ADA561051. AOARD-CSP-111007 International Conference on Acoustics, Speech and Signal Processing (37th) (ICASSP 2012) Held in Kyoto, Japan on March 25-30, 2012. U.S. Government or Federal Purpose Rights License., The original document contains color images.
|
|
Keyword:
*ADAPTIVE SYSTEMS; *COMMUNICATIONS NETWORKS; *LEARNING; ALGORITHMS; CONVERGENCE; DETECTION; DETERMINANTS(MATHEMATICS); ERRORS; ESTIMATES; ITERATIONS; Linguistics; NODES; NOISE; OBSERVATION; ONLINE SYSTEMS; PHYSICAL PROPERTIES; PURITY; RADIO LINKS; RADIOTELEPHONES; RATIOS; SCALAR FUNCTIONS; Voice Communications; WSN(WIRELESS SENSOR NETWORKS)
|
|
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA570274 http://www.dtic.mil/docs/citations/ADA570274
|
|
BASE
|
|
Hide details
|
|
4 |
Machine Recognition vs Human Recognition of Voices
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
Show details
|
|
5 |
Speaker Clustering for a Mixture of Singing and Reading (Preprint)
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
Show details
|
|
6 |
Open-Source Multi-Language Audio Database for Spoken Language Processing Applications
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
Show details
|
|
7 |
2-D Processing of Speech for Multi-Pitch Analysis
|
|
|
|
In: DTIC (2009)
|
|
BASE
|
|
Show details
|
|
8 |
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings
|
|
|
|
In: DTIC (2006)
|
|
BASE
|
|
Show details
|
|
9 |
Text Detection and Translation from Natural Scenes
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
10 |
From Word-Spotting to OOV Modeling
|
|
|
|
In: DTIC AND NTIS (2001)
|
|
BASE
|
|
Show details
|
|
11 |
Isolated Speech Recognition Using Artificial Neural Networks
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
12 |
Preserving Spectral Contrast in Amplitude Compression for Hearing Aids
|
|
|
|
In: DTIC AND NTIS (2001)
|
|
BASE
|
|
Show details
|
|
13 |
Clustering of Context Dependent Speech Units for Multilingual Speech Recognition
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
Show details
|
|
14 |
Vowel System Modeling: A Complement to Phonetic Modeling in Language Identification
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
Show details
|
|
15 |
Comparing Three Methods to Create Multilingual Phone Models for Vocabulary Independent Speech Recognition Tasks
|
|
|
|
In: DTIC (2000)
|
|
BASE
|
|
Show details
|
|
17 |
The Acoustic-to-Articulatory Mapping of Voiced and Fricated Speech
|
|
|
|
In: DTIC AND NTIS (1997)
|
|
BASE
|
|
Show details
|
|
18 |
A Transducer/Equipment System for Capturing Speech and Telemedicine Information for Subsequent Processing by Computer Systems
|
|
|
|
In: DTIC (1997)
|
|
BASE
|
|
Show details
|
|
19 |
Smoothing Disjoint Formant Track Boundaries Caused by Waveform Substitution in Packet Voice Communication.
|
|
|
|
In: DTIC AND NTIS (1996)
|
|
BASE
|
|
Show details
|
|
20 |
Automatic Language Identification with Sequences of Language-Independent Phoneme Clusters.
|
|
|
|
In: DTIC AND NTIS (1996)
|
|
BASE
|
|
Show details
|
|
|
|