1 |
Neural-Driven Search-Based Paraphrase Generation
In: 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021) ; https://hal.archives-ouvertes.fr/hal-03540926 ; 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), Apr 2021, Kiev, Ukraine (2021)
2 |
Corpus design for expressive speech: impact of the utterance length
In: 10th International Conference on Speech Prosody 2020 ; https://hal.archives-ouvertes.fr/hal-02874005 ; 10th International Conference on Speech Prosody 2020, May 2020, Tokyo, Japan. pp.955-959, ⟨10.21437/SpeechProsody.2020-195⟩ (2020)
3 |
Can we Generate Emotional Pronunciations for Expressive Speech Synthesis?
In: ISSN: 1949-3045 ; IEEE Transactions on Affective Computing ; https://hal.archives-ouvertes.fr/hal-01802463 ; IEEE Transactions on Affective Computing, Institute of Electrical and Electronics Engineers, 2020, 11 (4), pp.684-695. ⟨10.1109/TAFFC.2018.2828429⟩ (2020)
4 |
Style versus Content: A distinction without a (learnable) difference?
In: International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03112354 ; International Conference on Computational Linguistics, Dec 2020, Virtual, Spain (2020)
5 |
Introducing Prosodic Speaker Identity for a Better Expressive Speech Synthesis Control
In: 10th International Conference on Speech Prosody 2020 ; https://hal.archives-ouvertes.fr/hal-03000148 ; 10th International Conference on Speech Prosody 2020, May 2020, Tokyo, Japan. pp.935-939, ⟨10.21437/speechprosody.2020-191⟩ (2020)
7 |
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus Dedicated to Expressive Speech Synthesis
In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ; https://hal.archives-ouvertes.fr/hal-01826690 ; Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018, Miyazaki, Japan ; http://lrec2018.lrec-conf.org/en/ (2018)
8 |
Discourse phrases classification: direct vs. narrative audio speech
In: Speech Prosody ; https://hal.archives-ouvertes.fr/hal-01790910 ; Speech Prosody, Jun 2018, Poznan, Poland (2018)
9 |
Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept
In: SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing ; https://hal.inria.fr/hal-01840798 ; SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing, Oct 2018, Mons, Belgium. pp.1-12, ⟨10.1007/978-3-030-00810-9_4⟩ (2018)
10 |
Big Five vs. Prosodic Features as Cues to Detect Abnormality in SSPNET-Personality Corpus
In: Interspeech ; https://hal.inria.fr/hal-01583510 ; Interspeech, Aug 2017, Stockholm, Sweden ; http://www.interspeech2017.org/ (2017)
11 |
First Experiments to Detect Anomaly Using Personality Traits vs. Prosodic Features
In: 19th International Conference on Speech and Computer (SPECOM) ; https://hal.inria.fr/hal-01583539 ; 19th International Conference on Speech and Computer (SPECOM), Sep 2017, Hatfield, Hertfordshire, United Kingdom ; https://link.springer.com/chapter/10.1007/978-3-319-66429-3_37 (2017)
12 |
The IRISA Text-To-Speech System for the Blizzard Challenge 2017
In: Blizzard Challenge ; https://hal.inria.fr/hal-01662361 ; Blizzard Challenge, Aug 2017, Stockholm, Sweden (2017)
13 |
Perception of expressivity in TTS: linguistics, phonetics or prosody?
In: Statistical Language and Speech Processing ; https://hal-univ-lemans.archives-ouvertes.fr/hal-01623916 ; Statistical Language and Speech Processing, Oct 2017, Le Mans, France. pp.262-274, ⟨10.1007/978-3-319-68456-7_22⟩ ; http://grammars.grlmc.com/SLSP2017/index.php (2017)
Abstract:
A lot of work on expressive speech focuses on acoustic models and prosody variations. However, in expressive Text-to-Speech (TTS) systems, prosody generation strongly relies on the sequence of phonemes to be expressed and also on the words underlying these phonemes. Consequently, linguistic and phonetic cues play a significant role in the perception of expressivity. In previous work, we proposed a statistical corpus-specific framework which adapts phonemes derived from an automatic phonetizer to the phonemes as labelled in the TTS speech corpus. This framework makes it possible to synthesize good-quality but neutral speech samples. The present study goes further in the generation of expressive speech by predicting not only corpus-specific but also expressive pronunciation. It also investigates the shared impacts of linguistics, phonetics and prosody, these impacts being evaluated on different French neutral and expressive speech data collected with different speaking styles and linguistic content and expressed under diverse emotional states. Perception tests show that expressivity is more easily perceived when linguistics, phonetics and prosody are consistent. Linguistics seems to be the strongest cue in the perception of expressivity, but phonetics greatly improves expressiveness when combined with an adequate prosody.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing/I.2.7.5: Speech recognition and synthesis; Expressive speech synthesis; Linguistics; Perception; Phonetics-phonology; Pronunciation adaptation; Prosody
URL: https://hal-univ-lemans.archives-ouvertes.fr/hal-01623916v3/document https://hal-univ-lemans.archives-ouvertes.fr/hal-01623916v3/file/SLSP2017_Tahon_final.pdf https://doi.org/10.1007/978-3-319-68456-7_22 https://hal-univ-lemans.archives-ouvertes.fr/hal-01623916
14 |
Statistical Pronunciation Adaptation for Spontaneous Speech Synthesis
In: Text, Speech and Dialogue (TSD) ; https://hal.inria.fr/hal-01532035 ; Text, Speech and Dialogue (TSD), Aug 2017, Prague, Czech Republic (2017)
15 |
De l'utilisation de descripteurs issus de la linguistique computationnelle dans le cadre de la synthèse par HMM
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01338953 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
16 |
Patrons Rythmiques et Genres Littéraires en Synthèse de la Parole
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01338959 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
17 |
Adaptation de la prononciation pour la synthèse de la parole spontanée en utilisant des informations linguistiques
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01321361 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
18 |
Rhythmic Patterns and Literary Genres in Synthesized Speech
In: Speech Prosody ; https://hal.inria.fr/hal-01338873 ; Speech Prosody, 2016, Boston, United States (2016)
19 |
Une pénalité floue fondée phonologiquement pour améliorer la Sélection d'Unité
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01338948 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
20 |
The IRISA Text-To-Speech System for the Blizzard Challenge 2016
In: Blizzard Challenge 2016 workshop ; https://hal.inria.fr/hal-01375897 ; Blizzard Challenge 2016 workshop, Sep 2016, Cupertino, United States (2016)