1 |
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
A Configurable Multilingual Model is All You Need to Recognize All Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Self-Supervised Learning for speech recognition with Intermediate layer supervision ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Factorized Neural Transducer for Efficient Language Model Adaptation ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Production de la parole en réponse à de multiples perturbations du feedback auditif
|
|
|
|
In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-02798560 ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole, Jun 2020, Nancy, France. pp.370-378 (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Complexity patterns underlying speech production activity
|
|
|
|
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100430 ; ISSP 2020, Dec 2020, Online, United States (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Speech production in response to multiple perturbations of auditory feedback
|
|
|
|
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100466 ; ISSP 2020, Dec 2020, Online, United States (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Manipulating verbal interaction via artificial agents to study inter-speaker coordination
|
|
|
|
In: Social cognition in humans and robots ; https://hal.archives-ouvertes.fr/hal-01874505 ; Social cognition in humans and robots, Sep 2018, Hamburg, Germany ; https://www.socsmcs.eu/conference2018 (2018)
|
|
BASE
|
|
Show details
|
|
9 |
End-to-End Attention based Text-Dependent Speaker Verification ...
|
|
|
|
Abstract:
A new type of End-to-End system for text-dependent speaker verification is presented in this paper. Previously, using the phonetically discriminative/speaker discriminative DNNs as feature extractors for speaker verification has shown promising results. The extracted frame-level (DNN bottleneck, posterior or d-vector) features are equally weighted and aggregated to compute an utterance-level speaker representation (d-vector or i-vector). In this work we use speaker discriminative CNNs to extract the noise-robust frame-level features. These features are smartly combined to form an utterance-level speaker vector through an attention mechanism. The proposed attention model takes the speaker discriminative information and the phonetic information to learn the weights. The whole system, including the CNN and attention model, is joint optimized using an end-to-end criterion. The training algorithm imitates exactly the evaluation process --- directly mapping a test utterance and a few target speaker utterances into ... : @article{zhang2016End2End, title={End-to-End Attention based Text-Dependent Speaker Verification}, author={Shi-Xiong Zhang, Zhuo Chen$^{\dag}$, Yong Zhao, Jinyu Li and Yifan Gong}, journal={IEEE Workshop on Spoken Language Technology}, pages={171--178}, year={2016}, publisher={IEEE} } ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning stat.ML
|
|
URL: https://arxiv.org/abs/1701.00562 https://dx.doi.org/10.48550/arxiv.1701.00562
|
|
BASE
|
|
Hide details
|
|
10 |
Improved training for online end-to-end speech recognition systems ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|