Hits 1 – 16 of 16

1	Detection and Evaluation of human and machine generated speech in spoofing attacks on automatic speaker verification systems ...
	Gao, Yang; Lian, Jiachen; Raj, Bhiksha. - : arXiv, 2020
	BASE

2	The phonetic bases of vocal expressed emotion: natural versus acted ...
	Dhamyal, Hira; Memon, Shahan Ali; Raj, Bhiksha. - : arXiv, 2019
	BASE

3	Voice Impersonation using Generative Adversarial Networks ...
	Gao, Yang; Singh, Rita; Raj, Bhiksha. - : arXiv, 2018
	Abstract: Voice impersonation is not the same as voice transformation, although the latter is an essential element of it. In voice impersonation, the resultant voice must convincingly convey the impression of having been naturally produced by the target speaker, mimicking not only the pitch and other perceivable signal qualities, but also the style of the target speaker. In this paper, we propose a novel neural network based speech quality- and style- mimicry framework for the synthesis of impersonated voices. The framework is built upon a fast and accurate generative adversarial network model. Given spectrographic representations of source and target speakers' voices, the model learns to mimic the target speaker's voice quality and style, regardless of the linguistic content of either's voice, generating a synthetic spectrogram from which the time domain signal is reconstructed using the Griffin-Lim method. In effect, this model reframes the well-known problem of style-transfer for images as the problem of ... : Accepted by 2018 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018) ...
	Keyword: Audio and Speech Processing eess.AS; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
	URL: https://dx.doi.org/10.48550/arxiv.1802.06840 https://arxiv.org/abs/1802.06840
	BASE

4	AudioPairBank: Towards A Large-Scale Tag-Pair-Based Audio Content Analysis ...
	Sager, Sebastian; Elizalde, Benjamin; Borth, Damian. - : arXiv, 2016
	BASE

5	Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization
	Asaei, Afsaneh; Taghizadeh, Mohammadjavad; Haghighatshoar, Saeid...
	In: http://infoscience.epfl.ch/record/211543 (2015)
	BASE

6	Privacy-preserving speaker verification and identification using gaussian mixture models
	Pathak, Manas A.; Raj, Bhiksha
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 21 (2013) 2, 397-406
	OLC Linguistik

7	Techniques for noise robustness in automatic speech recognition
	Singh, Rita (ed.); Raj, Bhiksha (ed.); Virtanen, Tuomas (ed.). - Chichester : Wiley, 2013
	BLLDB
	UB Frankfurt Linguistik

8	Learning-based auditory encoding for robust speech recognition
	Stern, Richard M.; Bosco Chiu, Yu-Hsiang; Raj, Bhiksha
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 20 (2012) 3, 900-914
	BLLDB
	OLC Linguistik

9	Preface
	Heckmann, Martin; Raj, Bhiksha; Smaragdis, Paris
	In: Speech communication. - Amsterdam [etc.] : Elsevier 53 (2011) 5, 591
	OLC Linguistik

10	Soft mask methods for single-channel speaker separation
	Raj, Bhiksha; Reddy, Aarthi M.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 6, 1766-1776
	BLLDB
	OLC Linguistik

11	The recognition and organization of real-world sound
	Ellis, D. P. W. (ed.); Cooke, M.P. (ed.); Raj, Bhiksha (contrib.)...
	In: Speech communication. - Amsterdam [etc.] : Elsevier 43 (2004) 4, 273-393
	BLLDB

12	Classifier-based non-linear projection for adaptive endpointing of continuous speech
	Raj, Bhiksha; Singh, Rita
	In: Computer speech and language. - Amsterdam [etc.] : Elsevier 17 (2003) 1, 5-26
	OLC Linguistik

13	Multi-channel Source Separation by Beamforming Trained with Factorial HMMs
	Reyes-Gomez, Manuel; Raj, Bhiksha; Ellis, Daniel P. W.. - : IEEE, 2003
	BASE

14	Automatic generation of subword units for speech recognition systems
	Singh, Rita; Raj, Bhiksha; Stern, Richard M.
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on speech and audio processing. - New York, NY : Inst. 10 (2002) 2, 89-99
	BLLDB

15	Data-driven environmental compensation for speech recognition: A unified approach
	Moreno, Pedro J.; Raj, Bhiksha; Stern, Richard M.
	In: Speech communication. - Amsterdam [etc.] : Elsevier 24 (1998) 4, 267-286
	OLC Linguistik

16	Data-driven environmental compensation for speech recognition : a unified approach
	Moreno, Pedro J.; Raj, Bhiksha; Stern, Richard M.
	In: Speech communication. - Amsterdam [etc.] : Elsevier 24 (1998) 4, 267-285
	BLLDB
