1 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
Abstract:
Whether to retrieve, answer, translate, or reason, multimodality opens up new challenges and perspectives. In this context, we are interested in answering questions about named entities grounded in a visual context using a Knowledge Base (KB). To benchmark this task, called KVQAE (Knowledge-based Visual Question Answering about named Entities), we provide ViQuAE, a dataset of 3.7K questions paired with images. This is the first KVQAE dataset to cover a wide range of entity types (e.g. persons, landmarks, and products). The dataset is annotated using a semi-automatic method. We also propose a KB composed of 1.5M Wikipedia articles paired with images. To set a baseline on the benchmark, we address KVQAE as a two-stage problem: Information Retrieval and Reading Comprehension, with both zero- and few-shot learning methods. The experiments empirically demonstrate the difficulty of the task, especially when questions are not about persons. This work paves the way for better multimodal entity representations and question answering. The dataset, KB, code, and semi-automatic annotation pipeline are freely available at https://github.com/PaulLerner/ViQuAE.
|
|
Keyword:
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM]; dataset; knowledge-based visual question answering; multimodal
|
|
URL: https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618/document https://doi.org/10.1145/3477495.3531753 https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618/file/lerner_sigir_2022_camera.pdf
|
|
BASE
|
|
|
|
2 |
ISSumSet: a tweet summarization dataset hidden in a TREC track
|
|
|
|
In: SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing ; ISBN: 978-1-4503-8104-8 ; 36th ACM/SIGAPP Symposium on Applied Computing (SAC 2021) ; https://hal-univ-tlse3.archives-ouvertes.fr/hal-03244354 ; 36th ACM/SIGAPP Symposium on Applied Computing (SAC 2021), Association for Computing Machinery - Special Interest Group on Applied Computing (SIGAPP), Mar 2021, Republic of Korea (virtual event), South Korea. pp.665-671, ⟨10.1145/3412841.3441946⟩ ; https://dl.acm.org/doi/10.1145/3412841.3441946 (2021)
|
|
BASE
|
|
|
|
3 |
Morphologically Annotated Amharic Text Corpora
|
|
|
|
In: SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ; https://hal-univ-tlse2.archives-ouvertes.fr/hal-03362977 ; SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2349-2355, ⟨10.1145/3404835.3463237⟩ (2021)
|
|
BASE
|
|
|
|
4 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtual), France. pp.1-7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
|
|
5 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
|
|
|
|
In: SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ; https://hal.archives-ouvertes.fr/hal-03418387 ; SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2328-2334, ⟨10.1145/3404835.3463255⟩ (2021)
|
|
BASE
|
|
|
|
6 |
Overview of SimpleText 2021 - CLEF Workshop on Text Simplification for Scientific Information Access
|
|
|
|
In: Experimental IR Meets Multilinguality, Multimodality, and Interaction: 12th International Conference of the CLEF Association, CLEF 2021, Virtual Event, September 21–24, 2021, Proceedings ; ISBN: 978-3-030-85251-1 ; 12th Conference and Labs of the Evaluation Forum (CLEF 2021) ; https://hal.archives-ouvertes.fr/hal-03637807 ; 12th Conference and Labs of the Evaluation Forum (CLEF 2021), Sep 2021, Bucharest, Romania. pp.432-449, ⟨10.1007/978-3-030-85251-1_27⟩ ; http://clef2021.clef-initiative.eu/ (2021)
|
|
BASE
|
|
|
|
7 |
Application-Oriented Approach for Detecting Cyberaggression in Social Media
|
|
|
|
In: International Conference on Applied Human Factors and Ergonomics ; https://hal.archives-ouvertes.fr/hal-02903422 ; International Conference on Applied Human Factors and Ergonomics, Jul 2020, San Diego, United States. pp.129-136, ⟨10.1007/978-3-030-51328-3_19⟩ ; https://link.springer.com/chapter/10.1007%2F978-3-030-51328-3_19 (2020)
|
|
BASE
|
|
|
|
8 |
Capitalizing on a TREC Track to Build a Tweet Summarization Dataset
|
|
|
|
In: CIRCLE 2020 ; Proceedings of the Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2020) ; Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2020) ; https://hal.archives-ouvertes.fr/hal-03095613 ; Joint Conference of the Information Retrieval Communities in Europe (CIRCLE 2020), Université de Toulouse, France, Jul 2020, Samatan, Gers, France. pp.1-9 ; http://ceur-ws.org/Vol-2621/CIRCLE20_20.pdf (2020)
|
|
BASE
|
|
|
|
9 |
Combining Bandits and Lexical Analysis for Document Retrieval in a Juridical Corpora
|
|
|
|
In: Artificial Intelligence XXXVII 40th SGAI International Conference on Artificial Intelligence, AI 2020, Cambridge, UK, December 15–17, 2020, Proceedings ; https://hal.archives-ouvertes.fr/hal-03108194 ; Artificial Intelligence XXXVII 40th SGAI International Conference on Artificial Intelligence, AI 2020, Cambridge, UK, December 15–17, 2020, Proceedings, 12498, pp.317-330, 2020, Lecture Notes in Computer Science book series (LNCS), ⟨10.1007/978-3-030-63799-6_24⟩ ; https://link.springer.com/chapter/10.1007%2F978-3-030-63799-6_24 (2020)
|
|
BASE
|
|
|
|
10 |
Neural models for the representation and matching of geotextual objects ; Modèles neuronaux pour la représentation et l'appariement d'objets géotextuels
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-02979573 ; Human-Computer Interaction [cs.HC]. Université Paul Sabatier - Toulouse III, 2020. French. ⟨NNT : 2020TOU30042⟩ (2020)
|
|
BASE
|
|
|
|
11 |
Entity Linking for Historical Documents: Challenges and Solutions
|
|
|
|
In: 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020 ; https://hal.archives-ouvertes.fr/hal-03034492 ; 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020, 12504, Springer, pp.215-231, 2020, Lecture Notes in Computer Science, 978-3-030-64452-9. ⟨10.1007/978-3-030-64452-9_19⟩ (2020)
|
|
BASE
|
|
|
|
12 |
Robust Named Entity Recognition and Linking on Historical Multilingual Documents
|
|
|
|
In: Conference and Labs of the Evaluation Forum (CLEF 2020) ; https://hal.archives-ouvertes.fr/hal-03026969 ; Conference and Labs of the Evaluation Forum (CLEF 2020), Sep 2020, Thessaloniki, Greece. pp.1-17, ⟨10.5281/zenodo.4068074⟩ ; http://ceur-ws.org/Vol-2696/paper_171.pdf (2020)
|
|
BASE
|
|
|
|
13 |
Linking Named Entities across Languages using Multilingual Word Embeddings
|
|
|
|
In: JCDL '20: The ACM/IEEE Joint Conference on Digital Libraries in 2020 ; ACM/IEEE Joint Conference on Digital Libraries - JCDL 2020 ; https://hal.archives-ouvertes.fr/hal-03026933 ; ACM/IEEE Joint Conference on Digital Libraries - JCDL 2020, Aug 2020, Wuhan, Hubei - Virtual event, China. pp.329-332, ⟨10.1145/3383583.3398597⟩ ; https://dl.acm.org/doi/10.1145/3383583.3398597 (2020)
|
|
BASE
|
|
|
|
14 |
Prediction and Visual Intelligence for Security Information: The PREVISION H2020 Project
|
|
|
|
In: CIRCLE 2020 ; https://hal.archives-ouvertes.fr/hal-02877780 ; CIRCLE 2020, Iván Cantador; Max Chevalier; Massimo Melucci; Josiane Mothe, Jul 2020, Samatan, France ; http://ceur-ws.org/Vol-2621/ (2020)
|
|
BASE
|
|
|
|
15 |
CONFERENCE REPORT - Report on CLEF 2018: Experimental IR Meets Multilinguality, Multimodality, and Interaction
|
|
|
|
In: ISSN: 0163-5840 ; Sigir Forum ; https://hal.archives-ouvertes.fr/hal-02442736 ; Sigir Forum, Association for Computing Machinery (ACM), 2019, 52 (2), pp.72-82. ⟨10.1145/3308774.3308785⟩ ; https://dl.acm.org/doi/10.1145/3308774.3308785 (2019)
|
|
BASE
|
|
|
|
16 |
Studying the Variability of System Setting Effectiveness by Data Analytics and Visualization
|
|
|
|
In: CLEF 2019: Experimental IR Meets Multilinguality, Multimodality, and Interaction ; Conference and Labs of the Evaluation Forum, Living Labs (CLEF 2019) ; https://hal.archives-ouvertes.fr/hal-02930098 ; Conference and Labs of the Evaluation Forum, Living Labs (CLEF 2019), Sep 2019, Lugano, Switzerland. pp.62-74, ⟨10.1007/978-3-030-28577-7_3⟩ (2019)
|
|
BASE
|
|
|
|
17 |
Using language models to improve opinion detection
|
|
|
|
In: ISSN: 0306-4573 ; Information Processing and Management ; https://hal.archives-ouvertes.fr/hal-02279437 ; Information Processing and Management, Elsevier, 2018, 54 (6), pp.958-968. ⟨10.1016/j.ipm.2018.07.001⟩ (2018)
|
|
BASE
|
|
|
|
18 |
Location extraction from tweets
|
|
|
|
In: ISSN: 0306-4573 ; Information Processing and Management ; https://hal.archives-ouvertes.fr/hal-02640811 ; Information Processing and Management, Elsevier, 2018, 54 (2), pp.129-144. ⟨10.1016/j.ipm.2017.11.001⟩ (2018)
|
|
BASE
|
|
|
|
19 |
Experimental IR Meets Multilinguality, Multimodality, and Interaction (CLEF 2018, Avignon, France)
|
|
|
|
In: ISSN: 0302-9743 ; Lecture Notes in Computer Science ; 9th International Conference of the CLEF Association (CLEF 2018) ; https://hal.archives-ouvertes.fr/hal-03044243 ; Bellot, Patrice; Trabelsi, Chiraz; Mothe, Josiane; Murtagh, Fionn; Nie, Jian-Yun; Soulier, Laure; Sanjuan, Eric; Cappellato, Linda; Ferro, Nicola. 9th International Conference of the CLEF Association (CLEF 2018), Sep 2018, Avignon, France. Lecture Notes in Computer Science, Springer Berlin / Heidelberg; Springer, 2018, Experimental IR Meets Multilinguality, Multimodality, and Interaction, 978-3-319-98931-0. ⟨10.1007/978-3-319-98932-7⟩ ; https://link.springer.com/book/10.1007%2F978-3-319-98932-7 (2018)
|
|
BASE
|
|
|
|
20 |
Automatic Detection of Depressive Users in Social Media
|
|
|
|
In: Actes de CORIA 2018 ; Conférence francophone en Recherche d'Information et Applications (CORIA) ; https://hal.archives-ouvertes.fr/hal-02942297 ; Conférence francophone en Recherche d'Information et Applications (CORIA), May 2018, Rennes, France. ⟨10.24348/coria.2018.paper4⟩ (2018)
|
|
BASE
|
|
|
|
|
|