Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 33

1	Breathing and Speech Planning in Spontaneous Speech Synthesis
	Székely, Éva; Beskow, Jonas; Henter, Gustav Eje; Gustafsson, Joakim. - : KTH, Tal, musik och hörsel, TMH, 2020
	Abstract: Breathing and speech planning in spontaneous speech are coordinated processes, often exhibiting disfluent patterns. While synthetic speech is not subject to respiratory needs, integrating breath into synthesis has advantages for naturalness and recall. At the same time, a synthetic voice reproducing disfluent breathing patterns learned from the data can be problematic. To address this, we first propose training stochastic TTS on a corpus of overlapping breath-group bigrams, to take context into account. Next, we introduce an unsupervised automatic annotation of likely-disfluent breath events, through a product-of-experts model that combines the output of two breath-event predictors, each using complementary information and operating in opposite directions. This annotation enables creating an automatically-breathing spontaneous speech synthesiser with a more fluent breathing style. A subjective evaluation on two spoken genres (impromptu and rehearsed) found the proposed system to be preferred over the baseline approach treating all breath events the same. ; QC 20210414
	Keyword: breathing; ensemble method; General Language Studies and Linguistics; Jämförande språkvetenskap och allmän lingvistik; speech planning; Speech synthesis; spontaneous speech
	URL: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-283731 https://doi.org/10.1109/ICASSP40776.2020.9054107
	BASE
	Hide details

2	Style-Controllable Speech-Driven Gesture Synthesis Using Normalising Flows
	Kucherenko, Taras; Henter, Gustav Eje; Beskow, Jonas. - : KTH, Tal, musik och hörsel, TMH, 2020. : KTH, Robotik, perception och lärande, RPL, 2020. : Wiley, 2020
	BASE
	Show details

3	Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition
	Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero. - : KTH, Tal, musik och hörsel, TMH, 2020. : Institute for Creative Technologies, University of Southern California, Los Angeles, CA 90089, United States, 2020. : NTNU Norwegian University of Science and Technology, Trondheim, Norway, 2020. : Institute of Electrical and Electronics Engineers (IEEE), 2020
	BASE
	Show details

4	The speech synthesis phoneticians need is both realistic and controllable ...
	Malisz, Zofia; Henter, Gustav Eje; Valentini-Botinhao, Cassia. - : Zenodo, 2019
	BASE
	Show details

5	The speech synthesis phoneticians need is both realistic and controllable ...
	Malisz, Zofia; Henter, Gustav Eje; Valentini-Botinhao, Cassia. - : Zenodo, 2019
	BASE
	Show details

6	PROMIS: a statistical-parametric speech synthesis system with prominence control via a prominence network
	Malisz, Zofia; Berthelsen, Harald; Beskow, Jonas. - : KTH, Tal, musik och hörsel, TMH, 2019. : KTH, Tal-kommunikation, 2019. : STTS – Södermalms talteknologiservice AB, 2019. : Vienna, 2019
	BASE
	Show details

7	Modern speech synthesis for phonetic sciences : A discussion and an evaluation
	Henter, Gustav Eje; Malisz, Zofia; Gustafson, Joakim. - : KTH, Tal, musik och hörsel, TMH, 2019. : The University of Edinburgh, UK, 2019
	BASE
	Show details

8	Off the cuff: Exploring extemporaneous speech delivery with TTS
	Székely, Éva; Henter, Gustav Eje; Beskow, Jonas. - : KTH, Tal, musik och hörsel, TMH, 2019
	BASE
	Show details

9	The speech synthesis phoneticians need is both realistic and controllable
	Malisz, Zofia; Henter, Gustav Eje; Valentini-Botinhao, Cassia. - : KTH, Tal, musik och hörsel, TMH, 2019. : KTH, Tal-kommunikation, 2019. : The Centre for Speech Technology, The University of Edinburgh, UK, 2019. : Stockholm, 2019
	BASE
	Show details

10	The visual prominence of whispered speech in Swedish
	Jonell, Patrik; Beskow, Jonas; Malisz, Zofia. - : KTH, Tal-kommunikation, 2019
	BASE
	Show details

11	A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction
	Kontogiorgos, Dimosthenis; Avramova, Vanya; Alexanderson, Simon. - : KTH, Tal, musik och hörsel, TMH, 2018. : KTH, 2018. : Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland, 2018. : Paris, 2018
	BASE
	Show details

12	The proceedings of the 14th International Conference on Auditory-Visual Speech Processing
	Ouni, Slim; Davis, Chris; Jesse, Alexandra. - : HAL CCSD, 2017
	In: The 14th International Conference on Auditory-Visual Speech Processing (AVSP2017) ; https://hal.inria.fr/hal-01596625 ; The 14th International Conference on Auditory-Visual Speech Processing (AVSP2017), Aug 2017, Stockholm, Sweden. 2017 ; http://avsp2017.loria.fr (2017)
	BASE
	Show details

13	Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition ...
	Stefanov, Kalin; Beskow, Jonas; Salvi, Giampiero. - : arXiv, 2017
	BASE
	Show details

14	Using deep neural networks to estimate tongue movements from speech face motion
	Kroos, Christian; Bundgaard-Nielsen, Rikke L. (R14172); Best, Catherine T. (R11322). - : Sweden, KTH Royal Institute of Technology, 2017
	BASE
	Show details

15	Animated Lombard speech: Motion capture, facial animation and visual intelligibility of speech produced in adverse conditions
	Alexanderson, Simon; Beskow, Jonas
	In: Computer speech and language. - Amsterdam [u.a.] : Elsevier 28 (2014) 2, 607-618
	OLC Linguistik
	Show details

16	Tutoring Robots
	Al Moubayed, Samer; Beskow, Jonas; Bollepalli, Bajibabu...
	In: IFIP Advances in Information and Communication Technology ; 9th International Summer Workshop on Multimodal Interfaces (eNTERFACE) ; https://hal.inria.fr/hal-01350740 ; 9th International Summer Workshop on Multimodal Interfaces (eNTERFACE), Jul 2013, Lisbon, Portugal. pp.80-113, ⟨10.1007/978-3-642-55143-7_4⟩ (2013)
	BASE
	Show details

17	Advances in nonlinear speech processing : 6th international conference ; proceedings
	Solé-Casals, Jordi; Carson-Berndsen, Julie; Daoudi, Khalid. - Heidelberg [u.a.] : Springer, 2013
	BLLDB
	UB Frankfurt Linguistik
	Show details

18	Visual Recognition of Isolated Swedish Sign Language Signs ...
	Akram, Saad; Beskow, Jonas; Kjellstrom, Hedvig. - : arXiv, 2012
	BASE
	Show details

19	Synthesising intonational varieties of Swedish
	Schötz, Susanne; Beskow, Jonas; Bruce, Gösta...
	In: Institutionen för Lingvistik <Lund>. Working papers. - Lund : Univ. (2010) 54, 85-90
	BLLDB
	Show details

20	Research focus: interactional aspects of spoken face-to-face communication
	Beskow, Jonas; Edlund, Jens; Gustafson, Joakim...
	In: Institutionen för Lingvistik <Lund>. Working papers. - Lund : Univ. (2010) 54, 7-10
	BLLDB
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern