DE eng

Search in the Catalogues and Directories

Page: 1 2 3
Hits 1 – 20 of 47

1
First DIHARD Challenge -- System Submissions and Scores ...
BASE
Show details
2
First DIHARD Challenge -- System Submissions and Scores ...
BASE
Show details
3
Adaptations in Speech Processing
Xu, Jue. - : Humboldt-Universität zu Berlin, 2021
BASE
Show details
4
Learning speech embeddings for speaker adaptation and speech understanding
Sari, Leda. - 2021
BASE
Show details
5
Towards unsupervised learning of speech features in the wild
In: SLT 2020 : IEEE Spoken Language Technology Workshop ; https://hal.archives-ouvertes.fr/hal-03070411 ; SLT 2020 : IEEE Spoken Language Technology Workshop, Dec 2020, Shenzhen / Virtual, China (2020)
BASE
Show details
6
Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation
In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-02907929 ; INTERSPEECH 2020, Oct 2020, Shanghai, China (2020)
BASE
Show details
7
Preschoolers' Attention to Emotional Prosody as a Function of Speaker Conventionality ...
Wieczorek, Karolina Marta. - : Arts, 2020
BASE
Show details
8
Learning to adapt: meta-learning approaches for speaker adaptation
Klejch, Ondrej. - : The University of Edinburgh, 2020
BASE
Show details
9
Introducing Phonetic Information to Speaker Embedding for Speaker Verification
In: Electrical and Computer Engineering Faculty Publications (2019)
Abstract: Phonetic information is one of the most essential components of a speech signal, playing an important role for many speech processing tasks. However, it is difficult to integrate phonetic information into speaker verification systems since it occurs primarily at the frame level while speaker characteristics typically reside at the segment level. In deep neural network-based speaker verification, existing methods only apply phonetic information to the frame-wise trained speaker embeddings. To improve this weakness, this paper proposes phonetic adaptation and hybrid multi-task learning and further combines these into c-vector and simplified c-vector architectures. Experiments on National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) 2010 show that the four proposed speaker embeddings achieve better performance than the baseline. The c-vector system performs the best, providing over 30% and 15% relative improvements in equal error rate (EER) for the core-extended and 10 s–10 s conditions, respectively. On the NIST SRE 2016, 2018, and VoxCeleb datasets, the proposed c-vector approach improves the performance even when there is a language mismatch within the training sets or between the training and evaluation sets. Extensive experimental results demonstrate the effectiveness and robustness of the proposed methods.
Keyword: C-vector; Deep neural networks; Electrical and Computer Engineering; Multi-task learning; Phonetic adaptation; Speaker embedding; Speaker verification
URL: https://uknowledge.uky.edu/ece_facpub/37
https://uknowledge.uky.edu/cgi/viewcontent.cgi?article=1037&context=ece_facpub
BASE
Hide details
10
Speaker-Adapted Confidence Measures for ASR using Deep Bidirectional Recurrent Neural Networks
Del Agua Teba, Miguel Angel; Giménez Pastor, Adrián; Sanchis Navarro, José Alberto. - : Institute of Electrical and Electronics Engineers, 2018
BASE
Show details
11
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping
In: ISSN: 2329-9290 ; EISSN: 2329-9304 ; IEEE/ACM Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01485540 ; IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩ (2017)
BASE
Show details
12
Articulatory representations to address acoustic variability in speech ...
Sivaraman, Ganesh. - : Digital Repository at the University of Maryland, 2017
BASE
Show details
13
Articulatory representations to address acoustic variability in speech
BASE
Show details
14
Adaptation au locuteur pour la séparation de la parole par NMF
In: https://hal.sorbonne-universite.fr/hal-01482183 ; [Stage] STMS - Sciences et Technologies de la Musique et du Son UMR 9912 IRCAM-CNRS-UPMC. 2016 (2016)
BASE
Show details
15
Iterative PLDA Adaptation for Speaker Diarization
In: Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01433172 ; Interspeech 2016, Sep 2016, San Francisco, United States. pp.2175 - 2179, ⟨10.21437/Interspeech.2016-572⟩ (2016)
BASE
Show details
16
Speaker-dependent Multipitch Tracking Using Deep Neural Networks
BASE
Show details
17
Phonetic reduction in spontaneous speech by children aged 9-14 years
In: Presented at: 18th International Congress of Phonetic Sciences, Glasgow, UK. (2015) (2015)
BASE
Show details
18
The Acquisition of Vowel Normalization during Early Infancy: Theory and Computational Framework
In: http://rave.ohiolink.edu/etdc/view?acc_num=osu1388689249 (2014)
BASE
Show details
19
'All the better for not seeing you': effects of communicative context on the speech of an individual with acquired communication difficulties.
In: J Commun Disord , 46 (5-6) 475 - 483. (2013) (2013)
BASE
Show details
20
Contributions to Adaptation on Automatic Speech Recognition and Multilingual Handwritten Text Recognition
Del Agua Teba, Miguel Angel. - : Universitat Politècnica de València, 2013
BASE
Show details

Page: 1 2 3

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
47
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern