
Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Enhancing target speech based on nonlinear soft masking using a single acoustic vector sensor
In: Faculty of Engineering and Information Sciences - Papers: Part B (2018)
Abstract: Enhancing speech captured by distant microphones is a challenging task. In this study, we investigate the multichannel signal properties of the single acoustic vector sensor (AVS) to obtain the inter-sensor data ratio (ISDR) model in the time-frequency (TF) domain. Then, the monotone functions describing the relationship between the ISDRs and the direction of arrival (DOA) of the target speaker are derived. For the target speech enhancement (SE) task, the DOA of the target speaker is given, and the ISDRs are calculated. Hence, the TF components dominated by the target speech are extracted with high probability using the established monotone functions, and then, a nonlinear soft mask of the target speech is generated. As a result, a masking-based speech enhancement method is developed, which is termed the AVS-SMASK method. Extensive experiments with simulated data and recorded data have been carried out to validate the effectiveness of our proposed AVS-SMASK method in terms of suppressing spatial speech interferences and reducing the adverse impact of the additive background noise while maintaining less speech distortion. Moreover, our AVS-SMASK method is computationally inexpensive, and the AVS is of a small physical size. These merits are favorable to many applications, such as robot auditory systems.
Keyword: Engineering; Science and Technology Studies
URL: https://ro.uow.edu.au/eispapers1/1754
https://ro.uow.edu.au/cgi/viewcontent.cgi?article=2756&context=eispapers1
BASE
2
Multizone Soundfield Reproduction With Privacy- and Quality-Based Speech Masking Filters
In: Faculty of Engineering and Information Sciences - Papers: Part B (2018)
BASE
3
Encoding and communicating navigable speech soundfields
In: Faculty of Engineering and Information Sciences - Papers: Part A (2016)
BASE
4
An effective target speech enhancement with single acoustic vector sensor based on the speech time-frequency sparsity
In: Faculty of Engineering and Information Sciences - Papers: Part A (2014)
BASE
5
Packet loss protection for interactive audio object rendering: A multiple description approach
In: Faculty of Engineering and Information Sciences - Papers: Part A (2012)
BASE
6
Encoding navigable speech sources: an analysis by synthesis approach
In: Faculty of Informatics - Papers (Archive) (2012)
BASE
7
Linear predictive perceptual filtering for acoustic vector sensors: exploiting directional recordings for high quality speech enhancement
In: Faculty of Engineering - Papers (Archive) (2011)
BASE
8
A novel voicing cut-off determination for low bit-rate harmonic speech coding
In: Faculty of Informatics - Papers (Archive) (2005)
BASE
9
Transcoding of Narrowband to Wideband Speech
In: Faculty of Informatics - Papers (Archive) (2005)
BASE
