1 |
Low-cost electronic circuitry for photoacoustic gas sensing ...
|
|
|
|
BASE
|
|
3 |
Eine agentenbasierte Architektur für Programmierung mit gesprochener Sprache ... : An agent-based architecture for programming with spoken language ...
|
|
|
|
BASE
|
|
5 |
Об истории речевых исследований в России ... : About the history of speech research in Russia ...
|
|
|
|
BASE
|
|
6 |
Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer
|
|
|
|
In: Sensors; Volume 22; Issue 9; Pages: 3188 (2022)
|
|
BASE
|
|
7 |
Improved Durability of Wood Treated with Nano Metal Fluorides against Brown-Rot and White-Rot Fungi
|
|
|
|
In: Applied Sciences; Volume 12; Issue 3; Pages: 1727 (2022)
|
|
BASE
|
|
8 |
Evaluation of Tacotron Based Synthesizers for Spanish and Basque
|
|
|
|
In: Applied Sciences; Volume 12; Issue 3; Pages: 1686 (2022)
|
|
BASE
|
|
9 |
Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels
|
|
|
|
In: Applied Sciences; Volume 12; Issue 4; Pages: 2055 (2022)
|
|
BASE
|
|
10 |
Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
|
|
|
|
In: Information; Volume 13; Issue 3; Pages: 103 (2022)
|
|
BASE
|
|
11 |
La pulsión contrasexual: microficciones de la desintegración. A propósito de Paul-B. Preciado ... : The countersexual drive: microfictions of disintegration. On Paul-B. Preciado ...
|
|
|
|
BASE
|
|
13 |
Technology-mediated task-based language teaching : a qualitative research synthesis
|
|
|
|
BASE
|
|
14 |
Affect Expression: Global and Local Control of Voice Source Parameters ; Speech Prosody
|
|
|
|
BASE
|
|
15 |
Eine agentenbasierte Architektur für Programmierung mit gesprochener Sprache : An agent-based architecture for programming with spoken language
|
|
|
|
BASE
|
|
16 |
Health practitioners’ perceptions of structural barriers to the identification of intimate partner abuse: a qualitative meta-synthesis
|
|
|
|
In: Test Series for Scopus Harvesting 2021 (2022)
|
|
BASE
|
|
18 |
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
|
|
|
|
In: Proc. Interspeech 2021, Aug 2021, Brno, Czech Republic, pp. 3885-3889, ⟨10.21437/interspeech.2021-125⟩ ; https://hal.archives-ouvertes.fr/hal-03329116 (2021)
|
|
Abstract:
This research aims to build a prosodic boundary prediction model for improving the naturalness of Vietnamese speech synthesis. The model can be used directly to predict prosodic boundaries in the synthesis phase of statistical parametric or end-to-end speech systems. Besides conventional features related to part-of-speech (POS), this paper proposes two efficient features for predicting prosodic boundaries, syntactic blocks and syntactic links, based on a thorough analysis of a Vietnamese dataset. Syntactic blocks are syntactic phrases whose sizes are bounded in their constituent syntactic tree. A syntactic link between two adjacent words is calculated from the distance between them in the syntax tree. The experimental results show that the two proposed predictors improve the quality of a boundary prediction model using a decision tree classification algorithm by about 36.4% (F1 score) over the model with only POS features. The final boundary prediction model, with POS, syntactic block, and syntactic link features and the LightGBM algorithm, gives the best F1 score, 87.0% on test data. The proposed model helps TTS systems built with HMM-based, DNN-based, or end-to-end speech synthesis techniques improve by about 0.3 MOS points (i.e. 6 to 10%) compared to the same systems without the proposed model.
|
|
Keyword:
[INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; pause prediction; prosodic boundary; Prosody modeling; speech synthesis; Text-To-Speech; Vietnamese
|
|
URL: https://hal.archives-ouvertes.fr/hal-03329116/file/trang21_interspeech.pdf https://hal.archives-ouvertes.fr/hal-03329116 https://hal.archives-ouvertes.fr/hal-03329116/document https://doi.org/10.21437/interspeech.2021-125
|
|
BASE
|
|
19 |
Supplementary material to the paper The VoicePrivacy 2020 Challenge: Results and findings
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03335126 ; 2021 (2021)
|
|
BASE
|
|
|
|