Hits 1 – 20 of 91

1	The impact of the Lombard effect on audio and visual speech recognition systems
	Marxer, Ricard; Barker, Jon; Alghamdi, Najwa...
	In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.archives-ouvertes.fr/hal-01779704 ; Speech Communication, Elsevier : North-Holland, 2018, 100, pp.58-68. ⟨10.1016/j.specom.2018.04.006⟩ (2018)
	BASE

2	Cracking the social code of speech prosody using reverse correlation
	Ponsot, Emmanuel; Burred, Juan; Belin, Pascal...
	In: ISSN: 0027-8424 ; EISSN: 1091-6490 ; Proceedings of the National Academy of Sciences of the United States of America ; https://hal.archives-ouvertes.fr/hal-02004519 ; Proceedings of the National Academy of Sciences of the United States of America , National Academy of Sciences, 2018, 115 (15), pp.3972-3977. ⟨10.1073/pnas.1716090115⟩ (2018)
	BASE

3	Centered and Averaged Fuzzy Entropy to Improve Fuzzy Entropy Precision
	Girault, Jean-Marc; Humeau-Heurtier, Anne
	In: ISSN: 1099-4300 ; Entropy ; https://hal.archives-ouvertes.fr/hal-01773465 ; Entropy, MDPI, 2018, 20 (4), pp.287. ⟨10.3390/e20040287⟩ ; https://www.mdpi.com/1099-4300/20/4/287/htm (2018)
	BASE

4	Cultural Differences in Pattern Matching: Multisensory Recognition of Socio-affective Prosody
	Shochi, Takaaki; Rouas, Jean-Luc; Guerry, Marine...
	In: Interspeech 2018 ; https://hal.archives-ouvertes.fr/hal-01913705 ; Interspeech 2018, Sep 2018, Hyderabad, India. ⟨10.21437/interspeech.2018-1795⟩ (2018)
	BASE

5	Enhancement of esophageal speech obtained by a voice conversion technique using time dilated Fourier cepstra
	Ben Othmane, Imen; Di Martino, Joseph; Ouni, Kaïs
	In: ISSN: 1381-2416 ; EISSN: 1572-8110 ; International Journal of Speech Technology ; https://hal.inria.fr/hal-01954096 ; International Journal of Speech Technology, Springer Verlag, 2018, 22 (1), pp.99-110. ⟨10.1007/s10772-018-09579-1⟩ (2018)
	BASE

6	DEEP-SEE FACE: a mobile face recognition system dedicated to visually impaired people
	Mocanu, Bogdan; Tapu, Ruxandra; Zaharia, Titus
	In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-01993883 ; IEEE Access, IEEE, 2018, 6, pp.51975 - 51985. ⟨10.1109/ACCESS.2018.2870334⟩ (2018)
	BASE

7	Enhancement of esophageal speech using statistical and neuromimetic voice conversion techniques
	Ben Othmane, Imen; Di Martino, Joseph; Ouni, Kais
	In: ISSN: 2351-8715 ; Journal of International Science and General Applications ; https://hal.inria.fr/hal-01724375 ; Journal of International Science and General Applications, ISGA, 2018, 1 (1), pp.10 (2018)
	Abstract: This paper presents a novel approach for enhancing esophageal speech using voice conversion techniques. Esophageal speech (ES) is an alternative voice that allows a patient with no vocal cords to produce sounds after total laryngectomy. Although it needs no external device, this voice sounds unnatural compared to laryngeal speech. ES is frequently described as harsh speech with low pitch frequency and loudness. Consequently, ES has poor intelligibility and poor quality. To improve the naturalness and intelligibility of esophageal speech, we propose a speaking-aid system that enhances ES to make it clearer and more natural. Given the specificity of ES, in this study we propose to apply a new voice conversion technique that takes into account the particularity of the pathological vocal apparatus. The vocal tract and excitation cepstral coefficients are estimated separately. We trained deep neural networks (DNNs) and Gaussian mixture models (GMMs) to predict "laryngeal" vocal tract features from esophageal speech. The converted cepstral vectors are then used to estimate excitation and phase coefficients by a search in the target training space, previously encoded as a binary tree. The resynthesized voice sounds like a laryngeal voice, i.e., it is more natural than the original ES, with an effective reconstruction of the prosodic information while retaining (and this is the highlight of our study) the characteristics of the vocal tract inherent to the source speaker. The results of voice conversion, evaluated using objective and subjective experiments, validate the proposed approach.
	Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
	URL: https://hal.inria.fr/hal-01724375
	BASE

8	Duration modeling using DNN for Arabic speech synthesis
	Zangar, Imene; Mnasri, Zied; Colotte, Vincent...
	In: 9th International Conference on Speech Prosody ; https://hal.inria.fr/hal-01889917 ; 9th International Conference on Speech Prosody, Jun 2018, Poznań, Poland (2018)
	BASE

9	Multiple speaker localization and identification through multiple camera and visible light communication
	Lefevre, Florent; Seguel, Fabián; Bombardier, Vincent...
	In: 1st Global LIFI Congress ; https://hal.archives-ouvertes.fr/hal-01723387 ; 1st Global LIFI Congress, Feb 2018, Paris, France (2018)
	BASE

10	Linguistically-driven framework for computationally efficient and scalable sign recognition
	Metaxas, Dimitris N.; Dilsizian, Mark; Neidle, Carol. - : European Language Resources Association (ELRA), 2018
	BASE

11	Scalable ASL sign recognition using model-based machine learning and linguistically annotated corpora
	Dilsizian, Mark; Neidle, Carol; Metaxas, Dimitris. - : European Language Resources Association (ELRA), 2018
	BASE

12	NEW shared & interconnected ASL resources: SignStream® 3 Software; DAI 2 for web access to linguistically annotated video corpora; and a sign bank
	Neidle, Carol; Opoku, Augustine; Metaxas, Dimitris. - : European Language Resources Association (ELRA), 2018
	BASE

13	Sign Language Video Analysis For Automatic Recognition and Detection
	Belissen, Valentin
	In: 20th International ACM SIGACCESS Conference on Computers and Accessibility ; https://hal.archives-ouvertes.fr/hal-02146365 ; 20th International ACM SIGACCESS Conference on Computers and Accessibility, Oct 2018, Galway, Ireland (2018)
	BASE

14	Transfer Learning for a Letter-Ngrams to Word Decoder in the Context of Historical Handwriting Recognition with Scarce Resources
	Granet, Adeline; Morin, Emmanuel; Mouchère, Harold...
	In: 27th International Conference on Computational Linguistics (COLING) ; https://hal.archives-ouvertes.fr/hal-01868743 ; 27th International Conference on Computational Linguistics (COLING), Aug 2018, Santa Fe, NM, United States. pp.1474-1484 ; http://coling2018.org/ (2018)
	BASE

15	Correction automatique d'attachements prépositionnels par utilisation de traits visuels
	Delecraz, Sebastien; Becerra-Bonache, Leonor; Favre, Benoit...
	In: 25ème conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01832979 ; 25ème conférence sur le Traitement Automatique des Langues Naturelles, May 2018, Rennes, France (2018)
	BASE

16	The structure of the mental lexicon: what primary progressive aphasias reveal
	Sanches, Clara; Routier, Alexandre; Colliot, Olivier...
	In: ISSN: 0028-3932 ; EISSN: 1873-3514 ; Neuropsychologia ; https://hal.inria.fr/hal-01672932 ; Neuropsychologia, Elsevier, 2018, 109, pp.107-115. ⟨10.1016/j.neuropsychologia.2017.12.018⟩ (2018)
	BASE

17	Fast lexicographical order-based encoder for lattice vector quantization of Generalized Gaussian sources using pre-computed n-balls cardinalities
	Guillemot, Ludovic; Moureaux, Jean-Marie
	In: ISSN: 0923-5965 ; EISSN: 1879-2677 ; Signal Processing: Image Communication ; https://hal.archives-ouvertes.fr/hal-01670771 ; Signal Processing: Image Communication, Elsevier, 2018, 62, pp.42-50. ⟨10.1016/j.image.2017.12.004⟩ (2018)
	BASE

18	Audio-visual synchronization in reading while listening to texts: Effects on visual behavior and verbal learning
	Gerbier, Emilie; Bailly, Gérard; Bosse, Marie-Line
	In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01575227 ; Computer Speech and Language, Elsevier, 2018, 47 (January), pp.79-92. ⟨10.1016/j.csl.2017.07.003⟩ (2018)
	BASE

19	Particularities about brain asymmetry and lexicosemantic processing in giftedness: a transdisciplinary approach
	Magnié-Mauro, Marie-Noële; Meste, Olivier; Rix, Hervé...
	In: First European Asymmetry Symposium ; https://hal.archives-ouvertes.fr/hal-01742517 ; First European Asymmetry Symposium, Mar 2018, Nice, France (2018)
	BASE

20	Critical brain regions related to post-stroke aphasia severity identified by early diffusion imaging are not the same when predicting short- and long-term outcome
	Zavanone, Chiara; Samson, Yves; Arbizu, Celine...
	In: ISSN: 0093-934X ; EISSN: 1090-2155 ; Brain and Language ; https://hal.inria.fr/hal-01966081 ; Brain and Language, Elsevier, 2018, 186, pp.1-7. ⟨10.1016/j.bandl.2018.08.005⟩ (2018)
	BASE