Hits 1 – 9 of 9

1	An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
	Dey, Spandan; Sahidullah, Md; Saha, Goutam
	In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
	BASE

2	Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework
	Sen, Nirmalya; Sahidullah, Md; Patil, Hemant; Das Mandal, Shyamal Kumar; Rao, Sreenivasa Krothapalli; Basu, Tapan Kumar
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.archives-ouvertes.fr/hal-03232723 ; International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩ (2021)
	Abstract: The performance of a speaker recognition system is highly dependent on the amount of speech used in enrollment and test. This work presents a detailed experimental review and analysis of the GMM-SVM based speaker recognition system in the presence of duration variability. This article also reports a comparison of the performance of the GMM-SVM classifier with its precursor technique, the Gaussian mixture model-universal background model (GMM-UBM) classifier, in the presence of duration variability. The goal of this research work is not to propose a new algorithm for improving speaker recognition performance in the presence of duration variability. Rather, the main focus of this work is on utterance partitioning (UP), a commonly used strategy to compensate for the duration variability issue. We have analysed in detail the impact of training utterance partitioning on speaker recognition performance under the GMM-SVM framework. We further investigate the reason why utterance partitioning is important for boosting speaker recognition performance. We have also shown in which cases utterance partitioning could be useful and where not. Our study has revealed that utterance partitioning does not reduce the data imbalance problem of the GMM-SVM classifier as claimed in an earlier study. Apart from these, we also discuss issues related to the impact of parameters such as the number of Gaussians, supervector length, and amount of splitting required for obtaining better performance in short and long duration test conditions from a speech duration perspective. We have performed the experiments with telephone speech from the POLYCOST corpus consisting of 130 speakers.
	Keyword: [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; Duration Variability; GMM-SVM Classifier; GMM-UBM Classifier; Short Test Utterance; Speaker Recognition; Utterance Partitioning
	URL: https://hal.archives-ouvertes.fr/hal-03232723/file/Manuscript_UtterancePartition.pdf https://hal.archives-ouvertes.fr/hal-03232723/document https://hal.archives-ouvertes.fr/hal-03232723 https://doi.org/10.1007/s10772-021-09862-8
	BASE

3	Privacy and utility of x-vector based speaker anonymization
	Srivastava, Brij Mohan Lal; Maouche, Mohamed; Sahidullah, Md...
	In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
	BASE

4	Benchmarking and challenges in security and privacy for voice biometrics
	Bonastre, Jean-Francois; Delgado, Hector; Evans, Nicholas...
	In: SPSC 2021, 1st ISCA Symposium on Security and Privacy in Speech Communication ; https://hal.archives-ouvertes.fr/hal-03346196 ; SPSC 2021, 1st ISCA Symposium on Security and Privacy in Speech Communication, ISCA, Nov 2021, Magdeburg, Germany. ⟨10.21437/SPSC.2021-11⟩ ; https://spsc-symposium2021.de/#home (2021)
	BASE

5	Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
	Duroselle, Raphaël; Sahidullah, Md; Jouvet, Denis...
	In: Interspeech ; https://hal.archives-ouvertes.fr/hal-03228823 ; Interspeech, Aug 2021, Brno, Czech Republic (2021)
	BASE

6	Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge
	Duroselle, Raphaël; Sahidullah, Md; Jouvet, Denis...
	In: https://hal.archives-ouvertes.fr/hal-03228823 ; 2021 (2021)
	BASE

7	Privacy and utility of x-vector based speaker anonymization
	Srivastava, Brij Mohan Lal; Maouche, Mohamed; Sahidullah, Md...
	In: https://hal.inria.fr/hal-03197376 ; 2021 (2021)
	BASE

8	Evaluating Voice Conversion-based Privacy Protection against Informed Attackers
	Srivastava, Brij Mohan Lal; Vauquier, Nathalie; Sahidullah, Md...
	In: ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing ; https://hal.inria.fr/hal-02355115 ; ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2020, Barcelona, Spain. pp.2802-2806 (2020)
	BASE

9	Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
	Saha, Goutam; Sahidullah, Md.
	In: Speech communication. - Amsterdam [u.a.] : Elsevier 54 (2012) 4, 543-565
	BLLDB
	OLC Linguistik