1 |
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems ...
|
|
|
|
Abstract:
Contextual biasing is an important and challenging task for end-to-end automatic speech recognition (ASR) systems, which aims to achieve better recognition performance by biasing the ASR system to particular context phrases such as person names, music list, proper nouns, etc. Existing methods mainly include contextual LM biasing and adding bias encoder into end-to-end ASR models. In this work, we introduce a novel approach to do contextual biasing by adding a contextual spelling correction model on top of the end-to-end ASR system. We incorporate contextual information into a sequence-to-sequence spelling correction model with a shared context encoder. Our proposed model includes two different mechanisms: autoregressive (AR) and non-autoregressive (NAR). We propose filtering algorithms to handle large-size context lists, and performance balancing mechanisms to control the biasing degree of the model. We demonstrate the proposed model is a general biasing solution which is domain-insensitive and can be ... : This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible ...
|
|
Keyword:
Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
|
|
URL: https://dx.doi.org/10.48550/arxiv.2203.00888 https://arxiv.org/abs/2203.00888
|
|
BASE
|
|
Hide details
|
|
2 |
A Configurable Multilingual Model is All You Need to Recognize All Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Self-Supervised Learning for speech recognition with Intermediate layer supervision ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Factorized Neural Transducer for Efficient Language Model Adaptation ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Production de la parole en réponse à de multiples perturbations du feedback auditif
|
|
|
|
In: Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-02798560 ; 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole, Jun 2020, Nancy, France. pp.370-378 (2020)
|
|
BASE
|
|
Show details
|
|
6 |
Complexity patterns underlying speech production activity
|
|
|
|
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100430 ; ISSP 2020, Dec 2020, Online, United States (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Speech production in response to multiple perturbations of auditory feedback
|
|
|
|
In: ISSP 2020 ; https://hal.archives-ouvertes.fr/hal-03100466 ; ISSP 2020, Dec 2020, Online, United States (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Manipulating verbal interaction via artificial agents to study inter-speaker coordination
|
|
|
|
In: Social cognition in humans and robots ; https://hal.archives-ouvertes.fr/hal-01874505 ; Social cognition in humans and robots, Sep 2018, Hamburg, Germany ; https://www.socsmcs.eu/conference2018 (2018)
|
|
BASE
|
|
Show details
|
|
9 |
End-to-End Attention based Text-Dependent Speaker Verification ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Improved training for online end-to-end speech recognition systems ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|