1 |
Language Modelling for Sound Event Detection with Teacher Forcing and Scheduled Sampling
|
|
|
|
In: IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events Workshops (DCASE 2019) ; https://hal.inria.fr/hal-03132165 ; IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events Workshops (DCASE 2019), Oct 2019, New York, United States ; http://dcase.community/challenge2019/index (2019)
|
|
BASE
|
|
Show details
|
|
2 |
ICDAR 2019 Competition on Post-OCR Text Correction
|
|
|
|
In: 15th International Conference on Document Analysis and Recognition ; https://hal.archives-ouvertes.fr/hal-02304334 ; 15th International Conference on Document Analysis and Recognition, Sep 2019, Sydney, Australia. pp.1588-1593 (2019)
|
|
BASE
|
|
Show details
|
|
3 |
Impact and Detection of Facial Beautification in Face Recognition: An Overview
|
|
|
|
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.inria.fr/hal-02378939 ; IEEE Access, IEEE, 2019, ⟨10.1109/ACCESS.2019.DOI⟩ (2019)
|
|
BASE
|
|
Show details
|
|
4 |
A novel voice conversion approach using cascaded powerful cepstrum predictors with excitation and phase extracted from the target training space encoded as a KD-tree
|
|
|
|
In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.inria.fr/hal-02315052 ; International Journal of Speech Technology, Springer Verlag, 2019, pp.1-13. ⟨10.1007/s10772-019-09643-4⟩ (2019)
|
|
BASE
|
|
Show details
|
|
5 |
FACIAL RECOGNITION AND PREVENTIVE CONTROLS ON THE PUBLIC HIGHWAY, THE CHALLENGE OF ACCEPTABILITY
|
|
|
|
In: Les Notes du CREOGN ; https://hal.archives-ouvertes.fr/hal-03358470 ; Les Notes du CREOGN, Centre de Recherche de l'Ecole des Officiers de la Gendarmerie Nationale, In press, 43 ; https://www.gendarmerie.interieur.gouv.fr/crgn/content/download/1388/document/Note_43_Facial_recognition.pdf?inLanguage=fre-FR&version=1 (2019)
|
|
BASE
|
|
Show details
|
|
6 |
F0 modeling using DNN for Arabic parametric speech synthesis
|
|
|
|
In: INNSBDDL 2019 - INNS Big Data and Deep Learning ; https://hal.inria.fr/hal-02177496 ; INNSBDDL 2019 - INNS Big Data and Deep Learning, Apr 2019, Sestri Levante, Italy (2019)
|
|
BASE
|
|
Show details
|
|
7 |
VoiceHome-2, an extended corpus for multichannel speech processing in real homes
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.inria.fr/hal-01923108 ; Speech Communication, Elsevier : North-Holland, 2019, 106, pp.68-78. ⟨10.1016/j.specom.2018.11.002⟩ (2019)
|
|
BASE
|
|
Show details
|
|
8 |
Extracting Data From Line Charts in Scanned Medical Documents
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Sign Language Video Analysis For Automatic Recognition and Detection
|
|
|
|
In: 14th IEEE International Conference on Automatic Face and Gesture Recognition ; https://hal.archives-ouvertes.fr/hal-02146366 ; 14th IEEE International Conference on Automatic Face and Gesture Recognition, May 2019, Lille, France (2019)
|
|
BASE
|
|
Show details
|
|
10 |
Multimodal deep networks for text and image-based document classification ; Réseau de neurones multimodal pour la classification de documents image/texte
|
|
|
|
In: Conférence Nationale sur les Applications Pratiques de l'Intelligence Artificielle (APIA) ; https://hal.archives-ouvertes.fr/hal-02163257 ; Conférence Nationale sur les Applications Pratiques de l'Intelligence Artificielle (APIA), Jul 2019, Toulouse, France (2019)
|
|
BASE
|
|
Show details
|
|
11 |
Automatic recognition of Sign Language structures in RGB videos: the detection of pointing and lexical signs
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02146368 ; 2019 (2019)
|
|
BASE
|
|
Show details
|
|
12 |
Optimization of a gesture representation network for Sign Language analysis
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02146369 ; 2019 (2019)
|
|
BASE
|
|
Show details
|
|
13 |
Aphasia outcome: the interactions between initial severity, lesion size and location
|
|
|
|
In: ISSN: 0340-5354 ; EISSN: 1432-1459 ; Journal of Neurology ; https://hal.inria.fr/hal-02418331 ; Journal of Neurology, Springer Verlag, 2019, 266 (6), pp.1303-1309. ⟨10.1007/s00415-019-09259-3⟩ (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Comparaison de systèmes automatiques de reconnaissance grand vocabulaire appliqué à de la parole pathologique
|
|
|
|
In: Actes des 8e Journees de Phonetique Clinique ; 8e Journees de Phonetique Clinique (JPC 2019) ; https://hal.archives-ouvertes.fr/hal-02421557 ; 8e Journees de Phonetique Clinique (JPC 2019), May 2019, Mons, Belgique. pp.53-54 (2019)
|
|
BASE
|
|
Show details
|
|
15 |
ImageCLEF 2019: multimedia retrieval in medicine, lifelogging, security and nature
|
|
|
|
In: Ionescu, Bogdan, Müller, Henning, Péteri, Renaud, Dicente Cid, Yashin, Liauchuk, Vitali, Kovalev, Vassili, Vasillopoulos, Nikos, Karampidis, Konstantinos, Chamberlain, John, Clark, Adrian, Campello, Antonio, Dang-Nguyen, Duc-Tien orcid:0000-0002-2761-2213 , Gurrin, Cathal orcid:0000-0003-4395-7702 , Garcia, Narciso, Kavallieratou, Ergina, del Blanco, Roberto and Cuevas, Carlos (2019) ImageCLEF 2019: multimedia retrieval in medicine, lifelogging, security and nature. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction, 9 - 12 Sept 2019, Lugano, Switzerland. ISBN 978-3-030-28576-0 (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Les instruments chanteurs
|
|
|
|
In: ISSN: 1263-8072 ; Acoustique et Techniques : trimestriel d'information des professionnels de l'acoustique ; https://hal.archives-ouvertes.fr/hal-02025861 ; Acoustique et Techniques : trimestriel d'information des professionnels de l'acoustique, Neuilly-sur-Seine : Centre d'information et de documentation sur le bruit, 2019, 89, pp.36-43 (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Analyse de la qualité des phrases pour un bilan objectif de la parole
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02389764 ; [Rapport de recherche] INSA Toulouse. 2019 (2019)
|
|
BASE
|
|
Show details
|
|
18 |
T-Voks: the Singing and Speaking Theremin
|
|
|
|
In: NIME 2019 International Conference on New Interfaces for Musical Expression ; https://hal.archives-ouvertes.fr/hal-02197063 ; NIME 2019 International Conference on New Interfaces for Musical Expression, UFRGS, Jun 2019, Porto Alegre, Brazil. pp.110-115 ; http://www.nime.org/proceedings/2019/nime2019_022.pdf (2019)
|
|
BASE
|
|
Show details
|
|
19 |
SEQUENCE-TO-SEQUENCE MODELLING OF F0 FOR SPEECH EMOTION CONVERSION
|
|
|
|
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.sorbonne-universite.fr/hal-02018439 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom (2019)
|
|
BASE
|
|
Show details
|
|
20 |
Visual Disambiguation of Preprositional Phrase Attachments : Multimodal Machine Learning for Syntactic Analysis Correction
|
|
|
|
In: Advances in Computational Intelligence 15th International Work-Conference on Artificial Neural Networks, IWANN 2019, Gran Canaria, Spain, June 12-14, 2019 ; IWANN: International Work-Conference on Artificial Neural Networks ; https://hal.archives-ouvertes.fr/hal-02465051 ; IWANN: International Work-Conference on Artificial Neural Networks, Jun 2019, Gran Canaria, Spain. ⟨10.1007/978-3-030-20521-8_52⟩ (2019)
|
|
Abstract:
International audience ; Prepositional phrase attachments are known to be an important source of errors in parsing natural language. In some cases, pure syntactic features cannot be used for prepositional phrase attachment disambiguation while visual features could help. In this work, we are interested in the impact of the integration of such features in a parsing system. We propose a correction strategy pipeline for prepositional attachments using visual information, trained on a multimodal corpus of images and captions. The evaluation of the system shows us that using visual features allows, in certain cases, to correct the errors of a parser. It also helps to identify the most difficult aspects of such integration.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
|
|
URL: https://hal.archives-ouvertes.fr/hal-02465051/file/IWANN_2019.pdf https://doi.org/10.1007/978-3-030-20521-8_52 https://hal.archives-ouvertes.fr/hal-02465051/document https://hal.archives-ouvertes.fr/hal-02465051
|
|
BASE
|
|
Hide details
|
|
|
|