Hits 41 – 60 of 290

41	History and Theory: “Return to the Real ...
	Савельева Ирина Максимовна. - : Диалог со временем, 2018
	BASE

42	Extending Situated Language Comprehension (Accounts) with Speaker and Comprehender Characteristics: Toward Socially Situated Interpretation
	Münster, Katja; Knoeferle, Pia. - : Humboldt-Universität zu Berlin, 2018
	BASE

43	Conversational Time Travel: Evidence of a Retrospective Bias in Real Life Conversations
	Demiray, Burcu; Mehl, Matthias R.; Martin, Mike
	In: Frontiers in Psychology, 9 (2018)
	BASE

44	Extending Situated Language Comprehension (Accounts) with Speaker and Comprehender Characteristics: Toward Socially Situated Interpretation ...
	Münster, Katja; Knoeferle, Pia. - : Humboldt-Universität zu Berlin, 2018
	BASE

45	The prosodic substrate of consonant and tone dynamics ...
	Lee, Yoonjeong. - : University of Southern California Digital Library (USC.DL), 2018
	BASE

46	Towards Real-Time Simulation of a Finite Element Generic Lumbar Spine Model
	Maeda, Nathanial. - : University of Alberta. Department of Mechanical Engineering., 2018
	BASE

47	Dialectal coherence in the bilingual community of Flores da Cunha: alternation of the nasal diphthong and variation of the trill
	Azeredo Velho, Priscila Silvano. - 2018
	BASE

48	Dialectal coherence in the bilingual community of Flores da Cunha: alternation of the nasal diphthong and variation of the trill
	Azeredo Velho, Priscila Silvano. - 2018
	BASE

49	Architectures for Real-Time Automatic Sign Language Recognition on Resource-Constrained Device
	Blair, James M
	In: UNF Graduate Theses and Dissertations (2018)
	BASE

50	ImproteK: introducing scenarios into human-computer music improvisation
	Nika, Jérôme; Chemillier, Marc; Assayag, Gérard
	In: ACM Computers in Entertainment ; https://hal.archives-ouvertes.fr/hal-01380163 ; ACM Computers in Entertainment, 2017, ⟨10.1145/3022635⟩ (2017)
	BASE

51	Facial Expression Emotion Detection for Real-time Embedded Systems
	Meng, H; Swash, M; Pleva, M. - 2017
	BASE

52	Typology of Concessive Constructions (Based on Media Texts of Contemporary Journalism)
	ВАЙРАХ ЮЛИЯ ВИКТОРОВНА; КАЗОРИНА АННА ВЛАДИМИРОВНА; ЧУГУНОВА НАТАЛЬЯ ЮРЬЕВНА. - : Федеральное государственное образовательное учреждение высшего профессионального образования Кубанский государственный аграрный университет, 2017
	BASE

53	JACPoL: A Simple but Expressive JSON-based Access Control Policy Language
	Jiang, Hao; Bouabdallah, Ahmed
	In: Lecture Notes in Computer Science ; WISTP 2017 : 11th IFIP International Conference on Information Security Theory and Practice ; https://hal.archives-ouvertes.fr/hal-01802720 ; WISTP 2017 : 11th IFIP International Conference on Information Security Theory and Practice, Sep 2017, Heraklion, Crete, Greece. pp.56-72, ⟨10.1007/978-3-319-93524-9_4⟩ (2017)
	BASE

54	Real-Time Systems Development Exploiting Aspect-Oriented Approach ; Développement de Systèmes Temps Réels Exploitant l'Approche Orientée Aspects
	Machta, Naoufel. - : HAL CCSD, 2017
	In: https://hal.archives-ouvertes.fr/tel-01543395 ; Génie logiciel [cs.SE]. Faculté des Sciences de Tunis, 2017. Français (2017)
	BASE

55	Incremental text-to-speech synthesis ; Synthèse incrémentale de la parole à partir du texte
	Pouget, Maël. - : HAL CCSD, 2017
	In: https://tel.archives-ouvertes.fr/tel-01636327 ; Traitement du signal et de l'image [eess.SP]. Université Grenoble Alpes, 2017. Français. ⟨NNT : 2017GREAT008⟩ (2017)
	Abstract: In this thesis, we investigate a new paradigm for text-to-speech synthesis (TTS) that delivers synthetic speech while the text is being entered: incremental text-to-speech synthesis. Unlike conventional TTS systems, which trigger synthesis only after a whole sentence has been typed, incremental TTS devices deliver speech in a "piecemeal" fashion (i.e. word after word) while aiming to preserve the speech quality achievable by conventional TTS systems. By reducing the waiting time between two speech outputs while maintaining good speech quality, such a system should improve the quality of interaction for speech-impaired people who use TTS devices to express themselves. The main challenge of incremental TTS is synthesizing a word, or a group of words, with the same segmental and supra-segmental quality as conventional TTS, but without knowing the end of the sentence to be synthesized. In this thesis, we propose to adapt the two main modules of a TTS system (natural language processing and speech synthesis) to the incremental paradigm. For the natural language processing module, we focused on part-of-speech tagging, which is a key step for phonetization and prosody generation. We propose an "adaptive latency algorithm" for part-of-speech tagging that estimates whether the part of speech inferred for a given word (based on the n-gram approach) is likely to change when one or several words are added. If the part of speech is considered likely to change, the synthesis of the word is delayed. Otherwise, the word may be synthesized without risk of altering the segmental or supra-segmental quality of the synthetic speech. The proposed method is based on a set of binary decision trees trained on a large text corpus. We achieve 92.5% precision on the incremental part-of-speech tagging task with a mean delay of 1.4 words. For the speech synthesis module, in the context of HMM-based speech synthesis, we propose a training method that takes into account the uncertainty about contextual features that cannot be computed at synthesis time (namely, contextual features related to the following words). We compare the proposed method to other strategies (baselines) described in the literature. Objective and subjective evaluations show that the proposed method outperforms the baselines for French. Finally, we describe a prototype developed during this thesis that implements the proposed solution for incremental part-of-speech tagging and speech synthesis. A perceptual evaluation of the word grouping derived from the proposed adaptive latency algorithm, as well as of the segmental quality of the synthetic speech, suggests that our system reaches a good trade-off between reactivity (minimizing the waiting time between the input and the synthesis of a word) and speech quality (at both segmental and supra-segmental levels). ;
This thesis addresses a new paradigm for text-to-speech synthesis, namely incremental synthesis. The goal is to deliver synthetic speech as the user types the text, unlike conventional systems, in which synthesis is triggered after one or more sentences have been entered. The main target application is assistance for people with a severe oral communication disorder who communicate mainly by means of a speech synthesizer. An incremental speech synthesizer would make conversation more fluid by limiting the time the interlocutor spends waiting for the sentence to be fully typed. One of the challenges of this paradigm is synthesizing a word or group of words with acceptable segmental and prosodic quality when the sentence containing it is only partially known at synthesis time. To this end, we propose several adaptations of the two main modules of a text-to-speech system: the natural language processing (NLP) module and the sound synthesis module. For NLP in incremental synthesis, we focused on part-of-speech tagging, a decisive step for phonetization and for determining the target prosody. We describe a part-of-speech tagging algorithm with "adaptive latency". It estimates online whether a lexical class (estimated with a standard part-of-speech tagger based on the n-gram approach) is likely to change after the user adds one or more words. If the class is judged unstable, sound synthesis is delayed; otherwise, it can proceed without an a priori risk of degrading segmental and suprasegmental quality. The algorithm relies on a set of binary decision trees whose parameters are estimated by machine learning on a large text corpus. This method achieves part-of-speech tagging in an incremental context with a precision of 92.5% for a mean latency of 1.4 words. For sound synthesis, we work within the framework of statistical parametric synthesis based on hidden Markov models (HMMs). We propose a method for building the synthetic voice (estimating the HMM parameters) that accounts for possible uncertainty about the value of certain contextual descriptors that cannot be computed in incremental synthesis (that is, those concerning words not yet entered at synthesis time). We compare the proposed method with two other strategies described in the literature. The results of objective and perceptual evaluations show the value of the proposed method for French. Finally, we describe a complete prototype that combines the two proposed methods for NLP and incremental HMM-based synthesis. A perceptual evaluation of the relevance and quality of the word groups synthesized as the text is entered shows that our system achieves an acceptable trade-off between reactivity (minimizing the time between typing a word and its synthesis) and quality (segmental and prosodic) of the synthetic speech.
	Keyword: [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing; Apprentissage automatique; Machine learning; Parole; Prosodie; Prosody; Real-Time; Speech; Synthèse; Synthesis (TTS); Temps-Réel
	URL: https://tel.archives-ouvertes.fr/tel-01636327/file/POUGET_2017_diffusion.pdf https://tel.archives-ouvertes.fr/tel-01636327/document https://tel.archives-ouvertes.fr/tel-01636327
	BASE

56	Reading between the lines: Using media to improve German inflation forecasts
	Beckers, Benjamin; Kholodilin, Konstantin A.; Ulbricht, Dirk. - : Berlin: Deutsches Institut für Wirtschaftsforschung (DIW), 2017
	BASE

57	Task Parameters Managing And System Accuracy In Fuzzy Realtime Scheduling ...
	Blej, Mohammed. - : Zenodo, 2016
	BASE

58	Comparing the impact of subtitles on learning: automatically generated vs. corrected subtitles
	Chan, Wing Shan. - : Sydney, Australia : Macquarie University, 2016
	BASE

59	On the Tail of the Scottish Vowel Length Rule in Glasgow
	Stuart-Smith, Jane H.; Rathcke, Tamara V.
	In: Language and Speech ; 59 (2016), 3. - pp. 404-430. - Sage. - ISSN 0023-8309. - eISSN 1756-6053 (2016)
	BASE

60	Sound change in progress in contact Portuguese: palatalization of /t/ and /d/ and vocalization of /l/ in an Italo-Brazilian community
	Battisti, Elisa; Dornelles Filho, Adalberto Ayjara. - 2016
	BASE
