DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...23
Hits 1 – 20 of 460

1
Machine learning for speaker recognition
Mak, M. W.; Chien, Jen-tzung. - Cambridge : Cambridge University Press, 2020
BLLDB
UB Frankfurt Linguistik
Show details
2
Sprachanalyse : does forensic phonetics reveal the criminal? = Voice analysis
Braun, Stefan K.. - Frankfurt am Main : neowiss - Europäischer Wissenschaftsverlag; MCDP International UG, 2020
BLLDB
Institut für Empirische Sprachwissenschaft
UB Frankfurt Linguistik
Show details
3
Towards Understanding Voice Discrimination Abilities of Humans and Machines
Park, Soo Jin. - : eScholarship, University of California, 2019
In: Park, Soo Jin. (2019). Towards Understanding Voice Discrimination Abilities of Humans and Machines. UCLA: Electrical and Computer Engineering 0333. Retrieved from: http://www.escholarship.org/uc/item/22d942x3 (2019)
Abstract: An individual's voice can vary dramatically depending on word choice, affect, and other factors. Such intrinsic within-talker variability causes considerable difficulties when distinguishing talkers by their voices, both for humans and machines. For machines, phonetic content variability substantially degrades performance when utterances are short (e.g., < 10 sec). Humans, on the contrary, are less influenced by content variability, and they perform better than machines in such conditions. Hence, understanding which and how acoustic features are related to human responses might provide insights to improve machine performance. Yet, little is known about human and machine voice discrimination ability under various kinds of intrinsic within-talker variabilities.This dissertation presents studies of voice discrimination abilities of humans and machines under text, affect, and speaking-style variabilities. The main focus is in developing a feature set, based on a psychoacoustic model of voice quality, that can be used to improve machine performance and to find acoustic correlates with human responses. In order to systematically investigate the effects of within- and between-talker variability, a database was developed at UCLA. More than a hundred females and a hundred males were recorded with various speech styles, including sustained vowels, read sentences, affective speech, and pet-directed speech.Preliminary experiments indicated that the voice quality feature set (VQual1) was promising for predicting human responses, and for improving automatic speaker verification (ASV) performance which degraded significantly under text, affect and/or speaking-style variabilities. VQual1 was modified to another set (VQual2) to better differentiate talkers, leading to further improvements in short-utterance text-independent ASV tasks. Voice discrimination abilities of humans and machines for very short utterances (~ 2 sec) under high text and style variability were analyzed using read sentences and pet-directed speech. Humans were more accurate than machines for read sentence pairs, but the performance difference became small for style-mismatched pairs and for perceptually marked talkers. Humans' and machines' decision spaces were weakly correlated, indicating a weak or non-linear relationship between talker representations by humans and machines. However, for different-talker pairs, the VQual2-based system responses were highly correlated with human responses. Results also suggested that machines could supplement human decisions for perceptually marked talkers. Additionally, VQual2 was effective in perceived affect recognition, suggesting another application where voice quality features can contribute to predict human decisions.
Keyword: Automatic speaker recognition; Engineering; Linguistics; Psychology; Speaker perception; Voice discrimination; Voice quality
URL: http://www.escholarship.org/uc/item/22d942x3
BASE
Hide details
4
Der VokalJäger : eine phonetisch-algorithmische Methode zur Vokaluntersuchung : exemplarisch angewendet auf historische Tondokumente der Frankfurter Stadtmundart
Keil, Carsten. - New York : Georg Olms Verlag, 2017
BLLDB
UB Frankfurt Linguistik
Show details
5
Linguistically-constrained formant-based i-vectors for automatic speaker recognition
BASE
Show details
6
Elektronische Sprachsignalverarbeitung 2015 : Tagungsband der 26. Konferenz, Eichstätt, 25. - 27. März 2015
Wirsching, Günther (Hrsg.). - Dresden : TUDpress, 2015
BLLDB
UB Frankfurt Linguistik
Show details
7
Improving the self-adaptive voice activity detector for speaker verification using map adaptation and asymmetric tapers
In: International journal of speech technology. - Boston, Mass. [u.a.] : Kluwer Acad. Publ. 18 (2015) 2, 195-203
BLLDB
Show details
8
Automatic-Type Calibration of Traditionally Derived Likelihood Ratios: Forensic Analysis of Australian English/o/Formant Trajectories
In: Proceedings of Interspeech 2008 incorporating SST 2008 (2015)
BASE
Show details
9
Cepstral trajectories in linguistic units for text-independent speaker recognition
BASE
Show details
10
Severe apnoea detection using speaker recognition techniques
Fernández Pozo, Rubén; Blanco, José Luis; Hernández, Luis Alberto. - : Institute for Systems and Technologies of Information, Control and Communication, 2015
BASE
Show details
11
Implementation of forensic voice comparison within the new paradigm for the evaluation of forensic evidence
Enzinger, Ewald, Electrical Engineering & Telecommunications, Faculty of Engineering, UNSW. - : University of New South Wales. Electrical Engineering & Telecommunications, 2015
BASE
Show details
12
Statistical language and speech processing : second International Conference, SLSP 2014, Grenoble, France, October 14-16, 2014 ; proceedings
Besacier, Laurent (Hrsg.). - Cham [u.a.] : Springer, 2014
BLLDB
UB Frankfurt Linguistik
Show details
13
Text, speech, and dialogue : 17th international conference, TSD 2014, Brno, Czech Republic, September 8 - 12, 2014. ; proceedings
Sojka, Petr (Hrsg.). - Cham [u.a.] : Springer, 2014
BLLDB
UB Frankfurt Linguistik
Show details
14
Evaluating Automatic Speaker Recognition systems: An overview of the NIST Speaker Recognition Evaluations (1996-2014)
In: http://atvs.ii.uam.es/files/loquens_jgr_published.pdf (2014)
BASE
Show details
15
Evaluating automatic speaker recognition systems: an overview of the nist speaker recognition evaluations (1996-2014)
BASE
Show details
16
Advances in Nonlinear Speech Processing : 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013, Proceedings
Drugman, Thomas; Dutoit, Thierry. - Berlin, Heidelberg : Springer Berlin Heidelberg, 2013
UB Frankfurt Linguistik
Show details
17
Advances in nonlinear speech processing : 6th international conference ; proceedings
Solé-Casals, Jordi; Carson-Berndsen, Julie; Daoudi, Khalid. - Heidelberg [u.a.] : Springer, 2013
BLLDB
UB Frankfurt Linguistik
Show details
18
Elektronische Sprachsignalverarbeitung 2013 : Tagungsband der 24. Konferenz Bielefeld, 26. - 28.3.2013
Wagner, Petra (Hrsg.). - Dresden : TUDpress, 2013
BLLDB
UB Frankfurt Linguistik
Show details
19
Eigenvoice modelling for cross likelihood ratio based speaker clustering: a Bayesian approach
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 27 (2013) 4, 1011-1027
BLLDB
Show details
20
Will smart surveillance systems listen, understand and speak Slovene?
In: Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, Vol 1, Iss 2, Pp 165-180 (2013) (2013)
BASE
Show details

Page: 1 2 3 4 5...23

Catalogues
52
0
213
0
0
8
0
Bibliographies
441
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
17
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern