4
Decoupling recognition and transcription in Mandarin ASR ...
BASE
5
Automatic recognition of suprasegmentals in speech ...
Abstract:
This study reports our efforts to improve automatic recognition of suprasegmentals by fine-tuning wav2vec 2.0 with CTC, a method that has been successful in automatic speech recognition. We demonstrate that the method can improve the state-of-the-art on automatic recognition of syllables, tones, and pitch accents. Utilizing segmental information, by employing tonal finals or tonal syllables as recognition units, can significantly improve Mandarin tone recognition. Language models are helpful when tonal syllables are used as recognition units, but not helpful when tones are recognition units. Finally, Mandarin tone recognition can benefit from English phoneme recognition by combining the two tasks in fine-tuning wav2vec 2.0. ... : submitted to ASRU 2021 ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2108.01122 https://arxiv.org/abs/2108.01122
BASE
6
The Role of Phonetic Units in Speech Emotion Recognition ...
BASE
7
Data Collection vs. Knowledge Graph Completion: What is Needed to Improve Coverage? ...
BASE
8
The Future of Computational Linguistics: On Beyond Alchemy
In: Front Artif Intell (2021)
BASE
9
On Finite State Parsing
In: University of Massachusetts Occasional Papers in Linguistics (2020)
BASE
10
The Second DIHARD Diarization Challenge: Dataset, task, and baselines ...
BASE
11
Enhancement and Analysis of Conversational Speech: JSALT 2017
BASE
14
A Summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition
BASE
15
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition ...
BASE
16
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition ...
BASE