1 |
Discriminant sparse label-sensitive embedding: Application to image-based face pose estimation
|
|
|
|
In: ISSN: 0952-1976 ; Engineering Applications of Artificial Intelligence ; https://hal-utt.archives-ouvertes.fr/hal-03320665 ; Engineering Applications of Artificial Intelligence, Elsevier, 2016, 50, pp.168-176. ⟨10.1016/j.engappai.2016.01.035⟩ (2016)
|
|
BASE
|
|
Show details
|
|
2 |
Visualization of single endogenous polysomes reveals the dynamics of translation in live human cells
|
|
|
|
In: ISSN: 0021-9525 ; EISSN: 1540-8140 ; Journal of Cell Biology ; https://hal-pasteur.archives-ouvertes.fr/pasteur-01622667 ; Journal of Cell Biology, Rockefeller University Press, 2016, 214 (6), pp.769 - 781. ⟨10.1083/jcb.201605024⟩ (2016)
|
|
BASE
|
|
Show details
|
|
3 |
Toponyms of Multicultural Environment as a Source of Information about the History of the Development of the Central Yakutia
|
|
|
|
In: 3rd International Multidisciplinary Scientific Conference on Social Sciences and Arts SGEM 2016 ; https://hal-amu.archives-ouvertes.fr/hal-01401146 ; 3rd International Multidisciplinary Scientific Conference on Social Sciences and Arts SGEM 2016, Aug 2016, Varna, Bulgaria. pp.587-592, ⟨10.5593/SGEMSOCIAL2016/B32/S10.076⟩ ; http://www.sgemsocial.org/ (2016)
|
|
BASE
|
|
Show details
|
|
4 |
Language Recognition for Dialects and Closely Related Languages
|
|
|
|
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
|
|
BASE
|
|
Show details
|
|
5 |
Vocal effort modification for singing synthesis
|
|
|
|
In: INTERSPEECH 2016 ; Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01712564 ; Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp.1235-1239, ⟨10.21437/Interspeech.2016-1096⟩ (2016)
|
|
BASE
|
|
Show details
|
|
6 |
A Source/Filter Model with Adaptive Constraints for NMF-based Speech Separation
|
|
|
|
In: International Conference on Acoustics, Speech, and Signal Processing ; https://hal.sorbonne-universite.fr/hal-01294681 ; International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shanghai, China (2016)
|
|
BASE
|
|
Show details
|
|
7 |
Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding
|
|
|
|
In: IWAENC 2016 - International Workshop on Acoustic Signal Enhancement (IWAENC) ; https://hal.inria.fr/hal-01349776 ; IWAENC 2016 - International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2016, Xi'an, China. pp.1-5, ⟨10.1109/IWAENC.2016.7602911⟩ (2016)
|
|
BASE
|
|
Show details
|
|
8 |
Text-informed speech inpainting via voice conversion
|
|
|
|
In: 24th European Signal Processing Conference (EUSIPCO 2016) ; https://hal.inria.fr/hal-01271257 ; 24th European Signal Processing Conference (EUSIPCO 2016), Aug 2016, Budapest, Hungary (2016)
|
|
BASE
|
|
Show details
|
|
9 |
Degree of Parkinson's Disease Severity Estimation Based on Speech Signal Processing
|
|
|
|
In: IEEE 39th International Conference on Telecommunications and Signal Processing ; https://hal.inria.fr/hal-01328198 ; IEEE 39th International Conference on Telecommunications and Signal Processing, Jun 2016, Vienna, Austria (2016)
|
|
BASE
|
|
Show details
|
|
10 |
Prosodic Parameters and Prosodic Structures of French Emotional Data
|
|
|
|
In: Speech Prosody 2016 ; https://hal.inria.fr/hal-01293516 ; Speech Prosody 2016, May 2016, Boston, United States (2016)
|
|
BASE
|
|
Show details
|
|
11 |
An Articulatory-Based Singing Voice Synthesis Using Tongue and Lips Imaging
|
|
|
|
In: Interspeech 2016 ; ISCA Interspeech 2016 ; https://hal.archives-ouvertes.fr/hal-01529630 ; ISCA Interspeech 2016, Sep 2016, San Francisco, United States. pp.1467 - 1471, ⟨10.21437/Interspeech.2016-385⟩ (2016)
|
|
BASE
|
|
Show details
|
|
12 |
Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces
|
|
|
|
In: ISSN: 1553-734X ; EISSN: 1553-7358 ; PLoS Computational Biology ; https://hal.archives-ouvertes.fr/hal-01459706 ; PLoS Computational Biology, Public Library of Science, 2016, 12 (11), pp.e1005119. ⟨10.1371/journal.pcbi.1005119⟩ (2016)
|
|
BASE
|
|
Show details
|
|
13 |
Adaptation au locuteur pour la séparation de la parole par NMF
|
|
|
|
In: https://hal.sorbonne-universite.fr/hal-01482183 ; [Stage] STMS - Sciences et Technologies de la Musique et du Son UMR 9912 IRCAM-CNRS-UPMC. 2016 (2016)
|
|
BASE
|
|
Show details
|
|
14 |
A French corpus for distant-microphone speech processing in real homes
|
|
|
|
In: Interspeech 2016 ; https://hal.inria.fr/hal-01343060 ; Interspeech 2016, Sep 2016, San Francisco, United States (2016)
|
|
BASE
|
|
Show details
|
|
15 |
SegChain: Towards a generic automatic video segmentation framework, based on lexical chains of audio transcriptions
|
|
|
|
In: Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics ; https://hal-amu.archives-ouvertes.fr/hal-01490115 ; Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, 2016, Unknown, Unknown Region. pp.21:1--21:8 (2016)
|
|
BASE
|
|
Show details
|
|
16 |
SegChainW2V: Towards a generic automatic video segmentationframework, based on lexical chains of audio transcriptions and word embeddings
|
|
|
|
In: Proceedings of KES 2016 ; https://hal-amu.archives-ouvertes.fr/hal-01490150 ; Proceedings of KES 2016, 2016, Unknown, Unknown Region (2016)
|
|
BASE
|
|
Show details
|
|
17 |
CNN-based phone segmentation experiments in a less-represented language
|
|
|
|
In: Proceedings of INTERSPEECH 2016 Volume 2 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016) ; https://hal.archives-ouvertes.fr/hal-01500519 ; 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Sep 2016, San Francisco, United States. pp. 3549-3553 (2016)
|
|
BASE
|
|
Show details
|
|
18 |
Direct current stimulation over the anterior temporal areas boosts semantic processing in primary progressive aphasia
|
|
|
|
In: ISSN: 0364-5134 ; EISSN: 1531-8249 ; Annals of Neurology ; https://hal.inria.fr/hal-01377876 ; Annals of Neurology, Wiley, 2016 (2016)
|
|
BASE
|
|
Show details
|
|
19 |
Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016
|
|
Azevedo, Lucas; Zhou, Jiang; Smeaton, Alan F.; Marsden, Mark; Davis, Brian; Daudert, Tobias; O'Connor, Noel E.; Ganguly, Debasis; Afli, Haithem; McGuinness, Kevin; Li, Wei B.; Hurlimann, Manuela; Calafell, Andrea; Giró-i-Nieto, Xavier; Way, Andy; Mohedano, Eva; Du, Jinhua
|
|
In: Marsden, Mark, Mohedano, Eva, McGuinness, Kevin orcid:0000-0003-1336-6477 , Calafell, Andrea, Giró-i-Nieto, Xavier orcid:0000-0002-9935-5332 , O'Connor, Noel E. orcid:0000-0002-4033-9135 , Zhou, Jiang orcid:0000-0002-3067-8512 , Azevedo, Lucas, Daudert, Tobias, Davis, Brian, Hurlimann, Manuela, Afli, Haithem orcid:0000-0002-7449-4707 , Du, Jinhua, Ganguly, Debasis orcid:0000-0003-0050-7138 , Li, Wei B. orcid:0000-0001-7347-3501 , Way, Andy orcid:0000-0001-5736-5930 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2016) Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016. In: TRECVid Conference, 14-16 Nov 2016, Gaithersburg, Md., USA. (2016)
|
|
Abstract:
Dublin City University participated with a consortium of colleagues from NUI Galway and Universitat Politecnica de Catalunya in two tasks in TRECVid 2016, Instance Search (INS) and Video to Text (VTT). For the INS task we developed a framework consisting of face detection and representation and place detection and representation, with a user annotation of top-ranked videos. For the VTT task we ran 1,000 concept detectors from the VGG-16 deep CNN on 10 keyframes per video and submitted 4 runs for caption re-ranking, based on BM25, Fusion, word2vec and a fusion of baseline BM25 and word2vec. With the same pre-processing for caption generation we used an open source image-to-caption CNN-RNN toolkit NeuralTalk2 to generate a caption for each keyframe and combine them.
|
|
Keyword:
Artificial intelligence; Computational linguistics; Digital video; Image processing; Imaging systems; Machine learning; Multimedia systems; Semantic Concept; Video Captions
|
|
URL: http://doras.dcu.ie/21484/
|
|
BASE
|
|
Hide details
|
|
20 |
РЕАЛИЗАЦИЯ ОСНОВНЫХ МЕТОДОВ ЦИФРОВОЙ ОБРАБОТКИ ИЗОБРАЖЕНИЙ НА ЯЗЫКЕ С#
|
|
|
|
BASE
|
|
Show details
|
|
|
|