1. Delving Deeper into Cross-lingual Visual Question Answering ...
Abstract:
Visual question answering (VQA) is one of the crucial vision-and-language tasks. Yet, the bulk of research until recently has focused only on the English language due to the lack of appropriate evaluation resources. Previous work on cross-lingual VQA has reported poor zero-shot transfer performance of current multilingual multimodal Transformers and large gaps to monolingual performance, attributed mostly to misalignment of text embeddings between the source and target languages, without providing any additional deeper analyses. In this work, we delve deeper and address different aspects of cross-lingual VQA holistically, aiming to understand the impact of input data, fine-tuning and evaluation regimes, and interactions between the two modalities in cross-lingual setups. 1) We tackle low transfer performance via novel methods that substantially reduce the gap to monolingual English performance, yielding +10 accuracy points over existing transfer methods. 2) We study and dissect cross-lingual VQA across ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/2202.07630 https://dx.doi.org/10.48550/arxiv.2202.07630
BASE
4. A morpheme introducing degrees and its impact on argument structure: The Taiwanese Southern Min u-
In: Glossa: a journal of general linguistics; Vol 5, No 1 (2020); 37 ; 2397-1835 (2020)
8. Exploring Multilingual Syntactic Sentence Representations ...
10. Doctors, drugs, and druggists in Moliere's plays ...
Liu, Chen-Tzu. - : University of Southern California Digital Library (USC.DL), 2015
11. The Penetration of Virginia Woolf’s Life to Her Novel Mrs. Dalloway
In: Cross-Cultural Communication; Vol 10, No 5 (2014): Cross-Cultural Communication; 80-84 ; 1923-6700 ; 1712-8358 (2014)
12. Attitude and Motivation for English Learning
In: Studies in Literature and Language; Vol 9, No 1 (2014): Studies in Literature and Language; 51-56 ; 1923-1563 ; 1923-1555 (2014)
13. A Case Study: Exploring Video Deficit Effect in 2-Year-Old Children's Playing and Learning with an iPad
20. The Characteristics of Sino-Taiwanese Joint Ventures in the People's Republic of China