1 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Unsupervised quantification of entity consistency between photos and text in real-world news ...
|
|
Müller-Budack, Eric. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
|
|
Abstract:
Das World Wide Web und die sozialen Medien übernehmen im heutigen Informationszeitalter eine wichtige Rolle für die Vermittlung von Nachrichten und Informationen. In der Regel werden verschiedene Modalitäten im Sinne der Informationskodierung wie beispielsweise Fotos und Text verwendet, um Nachrichten effektiver zu vermitteln oder Aufmerksamkeit zu erregen. Kommunikations- und Sprachwissenschaftler erforschen das komplexe Zusammenspiel zwischen Modalitäten seit Jahrzehnten und haben unter Anderem untersucht, wie durch die Kombination der Modalitäten zusätzliche Informationen oder eine neue Bedeutungsebene entstehen können. Die Anzahl gemeinsamer Konzepte oder Entitäten (beispielsweise Personen, Orte und Ereignisse) zwischen Fotos und Text stellen einen wichtigen Aspekt für die Bewertung der Gesamtaussage und Bedeutung eines multimodalen Artikels dar. Automatisierte Ansätze zur Quantifizierung von Bild-Text-Beziehungen können für zahlreiche Anwendungen eingesetzt werden. Sie ermöglichen beispielsweise eine ... : In today’s information age, the World Wide Web and social media are important sources for news and information. Different modalities (in the sense of information encoding) such as photos and text are typically used to communicate news more effectively or to attract attention. Communication scientists, linguists, and semioticians have studied the complex interplay between modalities for decades and investigated, e.g., how their combination can carry additional information or add a new level of meaning. The number of shared concepts or entities (e.g., persons, locations, and events) between photos and text is an important aspect to evaluate the overall message and meaning of an article. Computational models for the quantification of image-text relations can enable many applications. For example, they allow for more efficient exploration of news, facilitate semantic search and multimedia retrieval in large (web) archives, or assist human assessors in evaluating news for credibility. To date, only a few ...
|
|
Keyword:
Bild-Text-Beziehungen; Bildindexierung; Computer vision; Date estimation; Deep Learning; Deep learning; Dewey Decimal Classification000 | Allgemeines, Wissenschaft000 | Informatik, Wissen, Systeme004 | Informatik; Event classification; Eventklassifikation; Face recognition; Geolocation estimation; Image indexing; Image-text relations; Maschinelles Sehen; Multimedia retrieval; Multimedia Retrieval; Nachrichtenanalyse; Natürliche Sprachverarbeitung; Natural language processing; News analytics; Personenerkennung; Schätzung des Aufnahmejahres; Schätzung des Aufnahmeortes
|
|
URL: https://dx.doi.org/10.15488/11719 https://www.repo.uni-hannover.de/handle/123456789/11812
|
|
BASE
|
|
Hide details
|
|
3 |
Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University
|
|
|
|
In: Lee, Hyowon orcid:0000-0003-4395-7702 , Scriney, Michael orcid:0000-0001-6813-2630 , Dey-Plissonneau, Aparajita and Smeaton, Alan orcid:0000-0003-1028-8389 (2021) Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University. In: Virtual Exchange in Higher Education: Charting the Irish Experience, 17 Sept 2021, Online vs MS Teams. (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Sign and Search: Sign Search Functionality for Sign Language Lexica ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Unsupervised Cross-Modal Audio Representation Learning from Unstructured Multilingual Text ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Recommending Themes for Ad Creative Design via Visual-Linguistic Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Fuzzy Logic Based Integration of Web Contextual Linguistic Structures for Enriching Conceptual Visual Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
MusicTM-Dataset for Joint Representation Learning among Sheet Music, Lyrics, and Musical Audio ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Utilization of multimodal interaction signals for automatic summarisation of academic presentations
|
|
Curtis, Keith. - : Dublin City University. School of Computing, 2018
|
|
In: Curtis, Keith (2018) Utilization of multimodal interaction signals for automatic summarisation of academic presentations. PhD thesis, Dublin City University. (2018)
|
|
BASE
|
|
Show details
|
|
10 |
Multimodal Machine Translation with Reinforcement Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
ImproteK: introducing scenarios into human-computer music improvisation
|
|
|
|
In: ACM Computers in Entertainment ; https://hal.archives-ouvertes.fr/hal-01380163 ; ACM Computers in Entertainment, 2017, ⟨10.1145/3022635⟩ (2017)
|
|
BASE
|
|
Show details
|
|
12 |
Multimodal Person Discovery in Broadcast TV: lessons learned from MediaEval 2015
|
|
|
|
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690581 ; Multimedia Tools and Applications, Springer Verlag, 2017, 76 (21), pp.22547 - 22567. ⟨10.1007/s11042-017-4730-x⟩ (2017)
|
|
BASE
|
|
Show details
|
|
13 |
Enabling Embodied Analogies in Intelligent Music Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Narrative Smoothing: Dynamic Conversational Network for the Analysis of TV Series Plots
|
|
|
|
In: DyNo: 2nd International Workshop on Dynamics in Networks, in conjunction with the 2016 IEEE/ACM International Conference ASONAM ; https://hal.archives-ouvertes.fr/hal-01276708 ; DyNo: 2nd International Workshop on Dynamics in Networks, in conjunction with the 2016 IEEE/ACM International Conference ASONAM, Aug 2016, San Francisco, United States. pp.1111-1118, ⟨10.1109/ASONAM.2016.7752379⟩ (2016)
|
|
BASE
|
|
Show details
|
|
16 |
Hierarchical topic structuring: from dense segmentation to topically focused fragments via burst analysis
|
|
|
|
In: Recent Advances on Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01186443 ; Recent Advances on Natural Language Processing, 2015, Hissar, Bulgaria (2015)
|
|
BASE
|
|
Show details
|
|
17 |
Temporal re-scoring vs. temporal descriptors for semantic indexing of videos
|
|
|
|
In: 13th International Workshop on Content-Based Multimedia Indexing (CBMI) ; https://hal.archives-ouvertes.fr/hal-01230719 ; 13th International Workshop on Content-Based Multimedia Indexing (CBMI), Jun 2015, Prague, Czech Republic. pp.1-4, ⟨10.1109/CBMI.2015.7153626⟩ (2015)
|
|
BASE
|
|
Show details
|
|
18 |
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Novel perspectives and approaches to video summarization
|
|
Guan, Genliang. - : The University of Sydney, 2015. : Faculty of Engineering and Information Technologies, School of Information Technologies, 2015
|
|
BASE
|
|
Show details
|
|
20 |
Planning Human-Computer Improvisation
|
|
|
|
In: International Computer Music Conference ; https://hal.archives-ouvertes.fr/hal-01053834 ; International Computer Music Conference, Sep 2014, Athens, Greece ; http://icmc14-smc14.net (2014)
|
|
BASE
|
|
Show details
|
|
|
|