Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year
- Medium:
- Type:
  - Article (1.664)
- BLLDB-Access:
  - free (1.664)
  - subject to license (58)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...84

Hits 1 – 20 of 1.664

1	Об истории речевых исследований в России ... : About the history of speech research in Russia ...
	Потапова, Р.К.; Потапов, В.В.. - : Издательство ГЕОС, 2022
	BASE
	Show details

2	Implementing a Statistical Parametric Speech Synthesis System for a Patient with Laryngeal Cancer
	Krzysztof Szklanny; Jakub Lachowicz
	In: Sensors; Volume 22; Issue 9; Pages: 3188 (2022)
	BASE
	Show details

3	Evaluation of Tacotron Based Synthesizers for Spanish and Basque
	Víctor García; Inma Hernáez; Eva Navas
	In: Applied Sciences; Volume 12; Issue 3; Pages: 1686 (2022)
	BASE
	Show details

4	Contribution of Vocal Tract and Glottal Source Spectral Cues in the Generation of Acted Happy and Aggressive Spanish Vowels
	Marc Freixes; Joan Claudi Socoró; Francesc Alías
	In: Applied Sciences; Volume 12; Issue 4; Pages: 2055 (2022)
	BASE
	Show details

5	Neural Vocoding for Singing and Speaking Voices with the Multi-Band Excited WaveNet
	Axel Roebel; Frederik Bous
	In: Information; Volume 13; Issue 3; Pages: 103 (2022)
	BASE
	Show details

6	Affect Expression: Global and Local Control of Voice Source Parameters ; Speech Prosody
	Yanushevskaya, Irena; Gobl, Christer; Murphy, Andrew. - 2022
	BASE
	Show details

7	Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
	Trang, Nguyen Thi Thu; Ky, Nguyen,; Rilliard, Albert; D'Alessandro, Christophe
	In: Proc. Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03329116 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ (2021)
	Abstract: International audience ; This research aims to build a prosodic boundary prediction model for improving the naturalness of Vietnamese speech synthesis. This model can be used directly to predict prosodic boundaries in the synthesis phase of the statistical parametric or end-to-end speech systems. Beside conventional features related to Part-Of-Speech (POS), this paper proposes two efficient features to predict prosodic boundaries: syntactic blocks and syntactic links, based on a thorough analysis of a Vietnamese dataset. Syntactic blocks are syntactic phrases whose sizes are bounded in their constituent syntactic tree. A syntactic link of two adjacent words is calculated based on the distance between them in the syntax tree. The experimental results show that the two proposed predictors improve the quality of the boundary prediction model using a decision tree classification algorithm, about 36.4% (F1 score) higher than the model with only POS features. The final boundary prediction model with POS, syntactic block, and syntactic link features using the LightGBM algorithm gives the best F1-score results at 87.0% in test data. The proposed model helps the TTS systems, developed by either HMM-based, DNN-based, or End-to-end speech synthesis techniques, improve about 0.3 MOS points (i.e. 6 to 10%) compared to the ones without the proposed model.
	Keyword: [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; pause prediction; prosodic boundary; Prosody modeling; speech synthesis; Text-To-Speech; Vietnamese
	URL: https://hal.archives-ouvertes.fr/hal-03329116/file/trang21_interspeech.pdf https://hal.archives-ouvertes.fr/hal-03329116 https://hal.archives-ouvertes.fr/hal-03329116/document https://doi.org/10.21437/interspeech.2021-125
	BASE
	Hide details

8	Impact of Segmentation and Annotation in French end-to-end Synthesis
	Lenglet, Martin; Perrotin, Olivier; Bailly, Gérard
	In: Proc. 11th ISCA Speech Synthesis Workshop (SSW 11) ; SSW 11th ISCA Speech Synthesis Workshop ; https://hal.archives-ouvertes.fr/hal-03362000 ; SSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩ ; https://ssw11.hte.hu/ (2021)
	BASE
	Show details

9	Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
	O'Brien, Benjamin; Tomashenko, Natalia; Chanclu, Anaïs...
	In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

10	Learning emotions latent representation with CVAE for Text-Driven Expressive AudioVisual Speech Synthesis
	Dahmani, Sara; Colotte, Vincent; Girard, Valérian...
	In: ISSN: 0893-6080 ; Neural Networks ; https://hal.inria.fr/hal-03204193 ; Neural Networks, Elsevier, 2021, 141, pp.315-329. ⟨10.1016/j.neunet.2021.04.021⟩ (2021)
	BASE
	Show details

11	Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
	O'Brien, Benjamin; Tomashenko, Natalia; Chanclu, Anaïs...
	In: INTERSPEECH 2021 ; https://hal.archives-ouvertes.fr/hal-03267084 ; INTERSPEECH 2021, Aug 2021, Brno, Czech Republic (2021)
	BASE
	Show details

12	Sequence-to-Sequence Acoustic Modeling with Semi-Stepwise Monotonic Attention for Speech Synthesis
	Xiao Zhou; Zhenhua Ling; Yajun Hu...
	In: Applied Sciences ; Volume 11 ; Issue 21 (2021)
	BASE
	Show details

13	Acoustic Word Embeddings for End-to-End Speech Synthesis
	Feiyu Shen; Chenpeng Du; Kai Yu
	In: Applied Sciences ; Volume 11 ; Issue 19 (2021)
	BASE
	Show details

14	Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis
	Marvin Coto-Jiménez
	In: Biomimetics ; Volume 6 ; Issue 1 (2021)
	BASE
	Show details

15	Korean Prosody Phrase Boundary Prediction Model for Speech Synthesis Service in Smart Healthcare
	Minho Kim; Youngim Jung; Hyuk-Chul Kwon
	In: Electronics ; Volume 10 ; Issue 19 (2021)
	BASE
	Show details

16	Integrating a voice analysis-synthesis system with a TTS framework for controlling affect and speaker identity ; 2021 32nd Irish Signals and Systems Conference (ISSC)
	Yanushevskaya, Irena; Gobl, Christer; Ni Chasaide, Ailbhe. - 2021
	BASE
	Show details

17	Acoustic analysis and measurements of distorted speech in the NZ population
	Erfanian Sabaee, Maryam; Sharifzadeh, Hamid; Ardekani, Iman. - 2021
	BASE
	Show details

18	Acoustic analysis and measurements of distorted speech in the NZ population
	Erfanian Sabaee, Maryam; Sharifzadeh, Hamid; Ardekani, Iman. - 2021
	BASE
	Show details

19	Acoustic analysis and measurements of distorted speech in the NZ population
	Erfanian Sabaee, Maryam; Sharifzadeh, Hamid; Ardekani, Iman. - 2021
	BASE
	Show details

20	Using Silence MR Image to Synthesise Dynamic MRI Vocal Tract Data of CV
	Douros, Ioannis,; Kulkarni, Ajinkya; Dourou, Chrysanthi...
	In: INTERSPEECH 2020 ; https://hal.inria.fr/hal-03090808 ; INTERSPEECH 2020, Oct 2020, Shangaï / Virtual, China ; http://www.interspeech2020.org/ (2020)
	BASE
	Show details

Page: 1 2 3 4 5...84

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern