1. Disfluency Insertion for Spontaneous TTS: Formalization and Proof of Concept
In: SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing ; https://hal.inria.fr/hal-01840798 ; SLSP 2018 - 6th International Conference on Statistical Language and Speech Processing, Oct 2018, Mons, Belgium. pp.1-12, ⟨10.1007/978-3-030-00810-9_4⟩ (2018)
2. Statistical Pronunciation Adaptation for Spontaneous Speech Synthesis
In: Text, Speech and Dialogue (TSD) ; https://hal.inria.fr/hal-01532035 ; Text, Speech and Dialogue (TSD), Aug 2017, Prague, Czech Republic (2017)
3. Évaluation d'une nouvelle structuration thématique hiérarchique des textes dans un cadre de résumé automatique et de détection d'ancres au sein de vidéos
In: Actes de la conférence TALN ; Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01399670 ; Conférence sur le Traitement Automatique des Langues Naturelles, 2016, Paris, France. pp.139-152 (2016)
4. Adaptation de la prononciation pour la synthèse de la parole spontanée en utilisant des informations linguistiques
In: Journées d'Études sur la Parole ; https://hal.inria.fr/hal-01321361 ; Journées d'Études sur la Parole, Jul 2016, Paris, France (2016)
5. Hierarchical topic structuring: from dense segmentation to topically focused fragments via burst analysis
In: Recent Advances on Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01186443 ; Recent Advances on Natural Language Processing, 2015, Hissar, Bulgaria (2015)
6. Comparing corpora to identify learner-specific features of English: The case of this, that and it
In: Book of Abstracts LCR 2015 ; Learner Corpus Research Conference (LCR 2015) ; https://hal-univ-paris.archives-ouvertes.fr/hal-01239837 ; Learner Corpus Research Conference (LCR 2015), Radboud University, Sep 2015, Radboud, Netherlands. pp.68-69 (2015)
7. Probabilistic Speaker Pronunciation Adaptation for Spontaneous Speech Synthesis Using Linguistic Features
In: Proceedings of Statistical Language and Speech Processing ; International Conference on Statistical Language and Speech Processing (SLSP) ; https://hal.inria.fr/hal-01181192 ; International Conference on Statistical Language and Speech Processing (SLSP), Nov 2015, Budapest, Hungary. pp.229-241 (2015)
9. Automated classification of unexpected uses of this and that in a learner corpus of English
In: Recent Advances in Corpus Linguistics: Developing and Exploiting Corpora ; https://hal.archives-ouvertes.fr/hal-01058760 ; Lieven Vandelanotte; Kristin Davidse; Caroline Gentens. Recent Advances in Corpus Linguistics: Developing and Exploiting Corpora, 78, Brill, pp.309-324, 2014, Rodopi Language and Linguistics Special E-Book, ⟨10.1163/9789401211130_015⟩ (2014)
10. Text Recognition in Multimedia Documents: A Study of two Neural-based OCRs Using and Avoiding Character Segmentation
In: ISSN: 1433-2833 ; EISSN: 1433-2825 ; International Journal on Document Analysis and Recognition ; https://hal.archives-ouvertes.fr/hal-00867225 ; International Journal on Document Analysis and Recognition, Springer Verlag, 2014, 17 (1), pp.19-31. ⟨10.1007/s10032-013-0202-7⟩ (2014)
11. Phonology Modelling for Expressive Speech Synthesis: a Review
In: https://hal.inria.fr/hal-01021911 ; [Research Report] PI-2020, IRISA, équipe EXPRESSION. 2014, 18 p., 1 column (2014)
12. Leveraging lexical cohesion and disruption for topic segmentation
In: Proceedings of International Conference on Empirical Methods in Natural Language Processing, EMNLP 2013 ; International Conference on Empirical Methods in Natural Language Processing, EMNLP 2013 ; https://hal.archives-ouvertes.fr/hal-00867011 ; International Conference on Empirical Methods in Natural Language Processing, EMNLP 2013, Oct 2013, Seattle, United States. pp.1314-1324 (2013)
13. Un modèle segmental probabiliste combinant cohésion lexicale et rupture lexicale pour la segmentation thématique
In: TALN - Conférence sur le traitement automatique des langues naturelles ; https://hal.inria.fr/hal-00844112 ; TALN - Conférence sur le traitement automatique des langues naturelles, ATALA, Jun 2013, Les Sables d'Olonne, France ; http://www.taln2013.org/actes/www/TALN-2013/actes/taln-2013-long-015.pdf (2013)
Abstract:
Identifying topical structure in any text-like data is a challenging task. Most existing techniques rely either on maximizing a measure of lexical cohesion within a segment or on detecting lexical disruptions. This paper proposes a novel method that combines the two criteria to obtain the best trade-off between cohesion and disruption. A new statistical model is defined, based on the work of Utiyama and Isahara (2001), that preserves the latter's domain independence and weak prior assumptions. Evaluations are performed both on written texts and on automatic transcripts of TV shows; the transcripts do not follow the norms of written text, which makes the task harder. Experimental results demonstrate the relevance of combining lexical cohesion and disruption.
Keywords:
[INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing/I.2.7.3: Language parsing and understanding; lexical cohesion; television news; cohesion disruption; topic segmentation
URL: https://hal.inria.fr/hal-00844112
14. Automatic Acquisition of GL Resources, Using an Explanatory, Symbolic Technique
In: Advances in Generative Lexicon Theory ; https://hal.archives-ouvertes.fr/hal-00760258 ; Advances in Generative Lexicon Theory, Springer, pp.ch19, 2013 (2013)
15. Enhancing lexical cohesion measure with confidence measures, semantic relations and language model interpolation for multimedia spoken content topic segmentation
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-00645705 ; Computer Speech and Language, Elsevier, 2012, 26 (2), pp.90-104 (2012)
16. Proper Noun Semantic Clustering using Bag-Of-Vectors
In: Proceedings of the Applied Natural Language Processing (ANLP) conference. Special track at the 25th International FLAIRS Conference ; ANLP - Applied Natural Language Processing conference. Special track at the 25th International FLAIRS Conference. ; https://hal.archives-ouvertes.fr/hal-00760105 ; ANLP - Applied Natural Language Processing conference. Special track at the 25th International FLAIRS Conference., May 2012, Marco Island, FL, United States (2012)
17. Automated processing of an English learner corpus: the case of this and that
In: ICAME332012 : Corpora at the centre and crossroads of English linguistics ; https://hal-univ-paris.archives-ouvertes.fr/hal-01239864 ; ICAME332012 : Corpora at the centre and crossroads of English linguistics, University of Leuven, May 2012, Louvain, Belgium ; http://s3.amazonaws.com/academia.edu.documents/30265742/icame33abstracts.pdf?AWSAccessKeyId=AKIAJ56TQJRTWSMTNPEA&Expires=1449576100&Signature=v5bh0ROXvar4AQ5uq1SzXGl14Ak%3D&response-content-disposition=inline%3B%20filename%3DCohesive_conjunctions_across_languages_a.pdf#page=274 (2012)
18. Combining Multi-Scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR
In: 10th IAPR International Workshop on Document Analysis Systems, DAS ; https://hal.archives-ouvertes.fr/hal-00753908 ; 10th IAPR International Workshop on Document Analysis Systems, DAS, Mar 2012, Gold Coast, Queensland, Australia. pp.120-124 (2012)
19. Automatically Finding Semantically Consistent N-grams to Add New Words in LVCSR Systems
In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP ; https://hal.archives-ouvertes.fr/hal-00645223 ; IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, May 2011, Prague, Czech Republic. 4 p., 2 columns (2011)
20. Using shallow linguistic features for relation extraction in bio-medical texts
In: Actes de la conférence TALN ; Traitement Automatique des Langues Naturelles, TALN ; https://hal.archives-ouvertes.fr/hal-00644070 ; Traitement Automatique des Langues Naturelles, TALN, 2011, Montpellier, France. 125-130, short paper (2011)