DE eng

Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
Detection and Evaluation of human and machine generated speech in spoofing attacks on automatic speaker verification systems ...
Gao, Yang; Lian, Jiachen; Raj, Bhiksha. - : arXiv, 2020
BASE
Show details
2
The phonetic bases of vocal expressed emotion: natural versus acted ...
BASE
Show details
3
Voice Impersonation using Generative Adversarial Networks ...
Gao, Yang; Singh, Rita; Raj, Bhiksha. - : arXiv, 2018
Abstract: Voice impersonation is not the same as voice transformation, although the latter is an essential element of it. In voice impersonation, the resultant voice must convincingly convey the impression of having been naturally produced by the target speaker, mimicking not only the pitch and other perceivable signal qualities, but also the style of the target speaker. In this paper, we propose a novel neural network based speech quality- and style- mimicry framework for the synthesis of impersonated voices. The framework is built upon a fast and accurate generative adversarial network model. Given spectrographic representations of source and target speakers' voices, the model learns to mimic the target speaker's voice quality and style, regardless of the linguistic content of either's voice, generating a synthetic spectrogram from which the time domain signal is reconstructed using the Griffin-Lim method. In effect, this model reframes the well-known problem of style-transfer for images as the problem of ... : Accepted by 2018 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018) ...
Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
URL: https://dx.doi.org/10.48550/arxiv.1802.06840
https://arxiv.org/abs/1802.06840
BASE
Hide details
4
AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis ...
BASE
Show details
5
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization
In: http://infoscience.epfl.ch/record/211543 (2015)
BASE
Show details
6
Privacy-preserving speaker verification and identification using gaussian mixture models
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 21 (2013) 2, 397-406
OLC Linguistik
Show details
7
Techniques for noise robustness in automatic speech recognition
Singh, Rita (Hrsg.); Raj, Bhiksha (Hrsg.); Virtanen, Tuomas (Hrsg.). - Chichester : Wiley, 2013
BLLDB
UB Frankfurt Linguistik
Show details
8
Learning-based auditory encoding for robust speech recognition
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 3, 900-914
BLLDB
OLC Linguistik
Show details
9
Preface
In: Speech communication. - Amsterdam [u.a.] : Elsevier 53 (2011) 5, 591
OLC Linguistik
Show details
10
Soft mask methods for single-channel speaker separation
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 6, 1766-1776
BLLDB
OLC Linguistik
Show details
11
The recognition and organization of real-world sound
Ellis, D. P. W. (Hrsg.); Cooke, M.P. (Hrsg.); Raj, Bhiksha (Mitarb.)...
In: Speech communication. - Amsterdam [u.a.] : Elsevier 43 (2004) 4, 273-393
BLLDB
Show details
12
Classifier-based non-linear projection for adaptive endpointing of continuous speech
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 17 (2003) 1, 5-26
OLC Linguistik
Show details
13
Multi-channel Source Separation by Beamforming Trained with Factorial HMMs
BASE
Show details
14
Automatic generation of subword units for speech recognition systems
In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 10 (2002) 2, 89-99
BLLDB
Show details
15
Data-driven environmental compensation for speech recognition: Aunified approach
In: Speech communication. - Amsterdam [u.a.] : Elsevier 24 (1998) 4, 267-286
OLC Linguistik
Show details
16
Data-driven environmental compensation for speech recognition : a unified approach
In: Speech communication. - Amsterdam [u.a.] : Elsevier 24 (1998) 4, 267-285
BLLDB
Show details

Catalogues
1
0
6
0
0
0
0
Bibliographies
6
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern