Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 22

1	IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages ...
	Bugliarello, Emanuele; Liu, Fangyu; Pfeiffer, Jonas; Reddy, Siva; Elliott, Desmond; Ponti, Edoardo Maria; Vulić, Ivan. - : arXiv, 2022
	Abstract: Reliable evaluation benchmarks designed for replicability and comprehensiveness have driven progress in machine learning. Due to the lack of a multilingual benchmark, however, vision-and-language research has mostly focused on English language tasks. To fill this gap, we introduce the Image-Grounded Language Understanding Evaluation benchmark. IGLUE brings together - by both aggregating pre-existing datasets and creating new ones - visual question answering, cross-modal retrieval, grounded reasoning, and grounded entailment tasks across 20 diverse languages. Our benchmark enables the evaluation of multilingual multimodal models for transfer learning, not only in a zero-shot setting, but also in newly defined few-shot learning setups. Based on the evaluation of the available state-of-the-art models, we find that translate-test transfer is superior to zero-shot transfer and that few-shot learning is hard to harness for many tasks. Moreover, downstream performance is partially explained by the amount of ...
	Keyword: Computation and Language cs.CL; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
	URL: https://dx.doi.org/10.48550/arxiv.2201.11732 https://arxiv.org/abs/2201.11732
	BASE
	Hide details

2	MDAPT: Multilingual Domain Adaptive Pretraining in a Single Model ...
	Jørgensen, Rasmus Kær; Hartmann, Mareike; Dai, Xiang. - : arXiv, 2021
	BASE
	Show details

3	Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Bugliarello, Emanuele; Elliott, Desmond. - : Underline Science Inc., 2021
	BASE
	Show details

4	Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Bugliarello, Emanuele; Elliott, Desmond. - : Underline Science Inc., 2021
	BASE
	Show details

5	Multimodal pretraining unmasked: A meta-analysis and a unified framework of vision-and-language berts ...
	Bugliarello, Emanuele; Cotterell, Ryan; Okazaki, Naoaki. - : ETH Zurich, 2021
	BASE
	Show details

6	mDAPT: Multilingual Domain Adaptive Pretraining in a Single Model ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Dai, Xiang; Elliott, Desmond. - : Underline Science Inc., 2021
	BASE
	Show details

7	Visually Grounded Reasoning across Languages and Cultures ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Bugliarello, Emanuele; Collier, Nigel. - : Underline Science Inc., 2021
	BASE
	Show details

8	Visually Grounded Reasoning across Languages and Cultures ...
	Liu, Fangyu; Bugliarello, Emanuele; Ponti, Edoardo Maria. - : arXiv, 2021
	BASE
	Show details

9	Multimodal pretraining unmasked: A meta-analysis and a unified framework of vision-and-language berts
	Okazaki, Naoaki; Bugliarello, Emanuele; Cotterell, Ryan...
	In: Transactions of the Association for Computational Linguistics, 9 (2021)
	BASE
	Show details

10	The Role of Syntactic Planning in Compositional Image Captioning ...
	Bugliarello, Emanuele; Elliott, Desmond. - : arXiv, 2021
	BASE
	Show details

11	Visually Grounded Reasoning across Languages and Cultures ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; ., Nigel; Bugliarello, Emanuele. - : Underline Science Inc., 2021
	BASE
	Show details

12	CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning ...
	Suglia, Alessandro; Konstas, Ioannis; Vanzo, Andrea. - : arXiv, 2020
	BASE
	Show details

13	The Sensitivity of Language Models and Humans to Winograd Schema Perturbations ...
	Abdou, Mostafa; Ravishankar, Vinit; Barrett, Maria. - : arXiv, 2020
	BASE
	Show details

14	Bootstrapping Disjoint Datasets for Multilingual Multimodal Representation Learning ...
	Kádár, Ákos; Chrupała, Grzegorz; Alishahi, Afra. - : arXiv, 2019
	BASE
	Show details

15	Cross-lingual Visual Verb Sense Disambiguation ...
	Gella, Spandana; Elliott, Desmond; Keller, Frank. - : arXiv, 2019
	BASE
	Show details

16	Lessons learned in multilingual grounded language learning ...
	Kádár, Ákos; Elliott, Desmond; Côté, Marc-Alexandre. - : arXiv, 2018
	BASE
	Show details

17	Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description ...
	Elliott, Desmond; Frank, Stella; Barrault, Loïc. - : arXiv, 2017
	BASE
	Show details

18	Cross-linguistic differences and similarities in image descriptions ...
	van Miltenburg, Emiel; Elliott, Desmond; Vossen, Piek. - : arXiv, 2017
	BASE
	Show details

19	Multi30K: Multilingual English-German Image Descriptions
	Elliott, Desmond [Verfasser]; Frank, Stella [Verfasser]; Sima'an, Khalil [Verfasser]. - Aachen : Universitätsbibliothek der RWTH Aachen, 2016
	DNB Subject Category Language
	Show details

20	Multi30K: Multilingual English-German Image Descriptions ...
	Elliott, Desmond; Frank, Stella; Sima'an, Khalil. - : arXiv, 2016
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern