Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems ...
BASE
2
A Configurable Multilingual Model is All You Need to Recognize All Languages ...
Zhou, Long; Li, Jinyu; Sun, Eric. - : arXiv, 2021
BASE
3
Self-Supervised Learning for speech recognition with Intermediate layer supervision ...
Wang, Chengyi; Wu, Yu; Chen, Sanyuan. - : arXiv, 2021
BASE
4
Factorized Neural Transducer for Efficient Language Model Adaptation ...
BASE
5
Production de la parole en réponse à de multiples perturbations du feedback auditif
In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-02798560 ; Jun 2020, Nancy, France. pp.370-378 (2020)
BASE
6
Complexity patterns underlying speech production activity
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100430 ; ISSP 2020, Dec 2020, Online, United States (2020)
BASE
7
Speech production in response to multiple perturbations of auditory feedback
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100466 ; ISSP 2020, Dec 2020, Online, United States (2020)
BASE
8
Manipulating verbal interaction via artificial agents to study inter-speaker coordination
In: Social cognition in humans and robots ; https://hal.archives-ouvertes.fr/hal-01874505 ; Social cognition in humans and robots, Sep 2018, Hamburg, Germany ; https://www.socsmcs.eu/conference2018 (2018)
BASE
9
End-to-End Attention based Text-Dependent Speaker Verification ...
Abstract: A new type of End-to-End system for text-dependent speaker verification is presented in this paper. Previously, using the phonetically discriminative/speaker discriminative DNNs as feature extractors for speaker verification has shown promising results. The extracted frame-level (DNN bottleneck, posterior or d-vector) features are equally weighted and aggregated to compute an utterance-level speaker representation (d-vector or i-vector). In this work we use speaker discriminative CNNs to extract the noise-robust frame-level features. These features are smartly combined to form an utterance-level speaker vector through an attention mechanism. The proposed attention model takes the speaker discriminative information and the phonetic information to learn the weights. The whole system, including the CNN and attention model, is jointly optimized using an end-to-end criterion. The training algorithm imitates exactly the evaluation process, directly mapping a test utterance and a few target speaker utterances into ...
@article{zhang2016End2End, title={End-to-End Attention based Text-Dependent Speaker Verification}, author={Shi-Xiong Zhang, Zhuo Chen$^{\dag}$, Yong Zhao, Jinyu Li and Yifan Gong}, journal={IEEE Workshop on Spoken Language Technology}, pages={171--178}, year={2016}, publisher={IEEE} } ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning stat.ML
URL: https://arxiv.org/abs/1701.00562
https://dx.doi.org/10.48550/arxiv.1701.00562
BASE
10
Improved training for online end-to-end speech recognition systems ...
BASE
11
Calibration of confidence measures in speech recognition
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 19 (2011) 8, 2461-2473
BLLDB
OLC Linguistik
12
A study on the generalization capability of acoustic models for robust speech recognition
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 18 (2010) 6, 1158-1169
BLLDB
13
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 23 (2009) 3, 389-405
BLLDB
OLC Linguistik
14
Approximate test risk bound minimization through soft margin estimation
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 15 (2007) 8, 2393-2404
BLLDB
OLC Linguistik

Catalogues: 3
Bibliographies: 4
Linked Open Data catalogues: 0
Online resources: 0
Open access documents: 10
© 2013 - 2024 Lin|gu|is|tik