1 |
Phonetic Accommodation of L2 German Speakers to the Virtual Language Learning Tutor Mirabella ...
BASE
2 |
A Wizard-of-Oz Experiment to Study Phonetic Accommodation in Human-Computer Interaction ...
3 |
Comparing phonetic changes in computer-directed and human-directed speech ...
4 |
Phonetic Accommodation in a Wizard-of-Oz Experiment: Intonation and Segments ...
5 |
Registration and statistical analysis of the tongue shape during speech production
Abstract:
This thesis analyzes the shape of the human tongue during speech production. First, a semi-supervised approach is derived for estimating the tongue shape from volumetric magnetic resonance imaging data of the human vocal tract. The results of this extraction are used to derive parametric tongue models. Next, a framework is presented for registering sparse motion capture data of the tongue by means of such a model. This method makes it possible to generate full three-dimensional animations of the tongue. Finally, a multimodal statistical text-to-speech system is developed that can synthesize audio and synchronized tongue motion from text. ; German Research Foundation
Keyword:
ddc:004
URL: https://doi.org/10.22028/D291-31368 http://nbn-resolving.org/urn:nbn:de:bsz:291--ds-313685
6 |
A Multilinear Tongue Model Derived from Speech Related MRI Data of the Human Vocal Tract
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01418460 ; Computer Speech and Language, Elsevier, 2018, 51, pp.68-92. ⟨10.1016/j.csl.2018.02.001⟩ (2018)
7 |
Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System ...
8 |
Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System ...
9 |
Shadowing Synthesized Speech — Segmental Analysis of Phonetic Convergence ...
10 |
A Computational Model for Phonetically Responsive Spoken Dialogue Systems ...
12 |
A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks ...
13 |
Investigating Phonetic Convergence in a Shadowing Experiment with Synthetic Stimuli ...
14 |
Computation of L2 speech rhythm based on duration and fundamental frequency
In: Pellegrino, Elisa; He, Lei; Dellwo, Volker (2017). Computation of L2 speech rhythm based on duration and fundamental frequency. In: Trouvain, Jürgen; Steiner, Ingmar; Möbius, Bernd. Elektronische Sprachsignalverarbeitung 2017. Dresden: TUDpress, 246-253. (2017)
15 |
Amplitude envelope kinematics of speech signal: parameter extraction and applications
In: He, Lei; Dellwo, Volker (2017). Amplitude envelope kinematics of speech signal: parameter extraction and applications. In: Trouvain, Jürgen; Steiner, Ingmar; Möbius, Bernd. Elektronische Sprachsignalverarbeitung 2017. Dresden: TUDpress, 1-8. (2017)
16 |
Enhancing the objectivity of interactive formant estimation: introducing Euclidean distance measure and numerical conditions for numbers and frequency ranges of formants
In: Kathiresan, Thayabaran; Maurer, Dieter; Dellwo, Volker (2017). Enhancing the objectivity of interactive formant estimation: introducing Euclidean distance measure and numerical conditions for numbers and frequency ranges of formants. In: Trouvain, Jürgen; Steiner, Ingmar; Möbius, Bernd. Elektronische Sprachsignalverarbeitung 2017. Dresden: TUDpress, 130-137. (2017)
17 |
De l'utilisation de descripteurs issus de la linguistique computationnelle dans le cadre de la synthèse par HMM [On the use of descriptors from computational linguistics in HMM-based synthesis]
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01338953 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
18 |
A real-time framework for visual feedback of articulatory data using statistical shape models
In: 17th Annual Conference of the International Speech Communication Association (Interspeech) ; https://hal.archives-ouvertes.fr/hal-01377360 ; 17th Annual Conference of the International Speech Communication Association (Interspeech), Oct 2016, San Francisco, United States. pp.1569-1570 ; http://www.interspeech2016.org/ (2016)
19 |
A statistical shape space model of the palate surface trained on 3D MRI scans of the vocal tract ...
20 |
A statistical shape space model of the palate surface trained on 3D MRI scans of the vocal tract
In: 18th International Congress of Phonetic Sciences ; https://hal.inria.fr/hal-01192790 ; 18th International Congress of Phonetic Sciences, Aug 2015, Glasgow, United Kingdom ; http://www.icphs2015.info/ (2015)