1 |
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Challenging the Boundaries of Speech Recognition: The MALACH Corpus ...
|
|
|
|
Abstract:
There has been huge progress in speech recognition over the last several years. Tasks once thought extremely difficult, such as SWITCHBOARD, now approach levels of human performance. The MALACH corpus (LDC catalog LDC2012S05), a 375-Hour subset of a large archive of Holocaust testimonies collected by the Survivors of the Shoah Visual History Foundation, presents significant challenges to the speech community. The collection consists of unconstrained, natural speech filled with disfluencies, heavy accents, age-related coarticulations, un-cued speaker and language switching, and emotional speech - all still open problems for speech recognition systems. Transcription is challenging even for skilled human annotators. This paper proposes that the community place focus on the MALACH corpus to develop speech recognition systems that are more robust with respect to accents, disfluencies and emotional speech. To reduce the barrier for entry, a lexicon and training and testing setups have been created and baseline ... : Accepted for publication at INTERSPEECH 2019 ...
|
|
Keyword:
Audio and Speech Processing eess.AS; Computation and Language cs.CL; FOS Computer and information sciences; FOS Electrical engineering, electronic engineering, information engineering; Sound cs.SD
|
|
URL: https://arxiv.org/abs/1908.03455 https://dx.doi.org/10.48550/arxiv.1908.03455
|
|
BASE
|
|
Hide details
|
|
4 |
Building competitive direct acoustics-to-word models for English conversational speech recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Binary Pattern Recognition Using Markov Random Fields and HMMs
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP 1997 ; https://hal.inria.fr/inria-00537357 ; IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP 1997, Apr 1997, Munich, Germany. pp.3725 - 3728, ⟨10.1109/ICASSP.1997.604678⟩ ; http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=604678 (1997)
|
|
BASE
|
|
Show details
|
|
17 |
Off-line Handwritten Word Recognition Using a Mixed HMM-MRF Approach
|
|
|
|
In: 4th International Conference on Document Analysis and Recognition - ICDAR'97 ; https://hal.inria.fr/inria-00537568 ; 4th International Conference on Document Analysis and Recognition - ICDAR'97, Aug 1997, Ulm, Germany. pp.118 - 122, ⟨10.1109/ICDAR.1997.619825⟩ ; http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=619825 (1997)
|
|
BASE
|
|
Show details
|
|
18 |
One and two-dimensional Markov models for off-line handwriting recognition ; Modèles markoviens uni- et bidimensionnels pour la reconnaissance de l'écriture manuscrite hors-ligne
|
|
|
|
In: https://hal.univ-lorraine.fr/tel-01747325 ; Autre [cs.OH]. Université Henri Poincaré - Nancy 1, 1997. Français. ⟨NNT : 1997NAN10299⟩ (1997)
|
|
BASE
|
|
Show details
|
|
19 |
Modèles markoviens uni- et bidimensionnels pour la reconnaissance de l'écriture manuscrite hors-ligne ; One and two-dimensional Markov models for off-line handwriting recognition
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Off-Line Handwriting Recognition by Statistical Correlation
|
|
|
|
In: IAPR Workshop on Machine Vision Applications - MVA'94 ; https://hal.inria.fr/inria-00533959 ; IAPR Workshop on Machine Vision Applications - MVA'94, IAPR, Dec 1994, Kawasaki, Japan. pp.371-374 ; http://b2.cvl.iis.u-tokyo.ac.jp/mva/proceedings/CommemorativeDVD/1994/papers/1994371.pdf (1994)
|
|
BASE
|
|
Show details
|
|
|
|