1 |
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
ANLIzing the Adversarial Natural Language Inference Dataset
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
FLAVA: A Foundational Language And Vision Alignment Model ...
|
|
|
|
Abstract:
State-of-the-art vision and vision-and-language models rely on large-scale visio-linguistic pretraining for obtaining good performance on a variety of downstream tasks. Generally, such models are often either cross-modal (contrastive) or multi-modal (with earlier fusion) but not both; and they often only target specific modalities or tasks. A promising direction would be to use a single holistic universal model, as a "foundation", that targets all modalities at once -- a true vision and language foundation model should be good at vision tasks, language tasks, and cross- and multi-modal vision and language tasks. We introduce FLAVA as such a model and demonstrate impressive performance on a wide range of 35 tasks spanning these target modalities. ... : 18 pages ...
|
|
Keyword:
Computation and Language cs.CL; Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2112.04482 https://arxiv.org/abs/2112.04482
|
|
BASE
|
|
Hide details
|
|
5 |
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Gradient-based Adversarial Attacks against Text Transformers ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Deep Artificial Neural Networks Reveal a Distributed Cortical Network Encoding Propositional Sentence-Level Meaning
|
|
|
|
In: J Neurosci (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Emergent Linguistic Phenomena in Multi-Agent Communication Games ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Inferring concept hierarchies from text corpora via hyperbolic embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Inferring concept hierarchies from text corpora via hyperbolic embeddings
|
|
|
|
In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019) (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Visually Grounded and Textual Semantic Models Differentially Decode Brain Activity Associated with Concrete and Abstract Nouns ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|