DE eng

Search in the Catalogues and Directories

Hits 1 – 6 of 6

1
Multimodal neural networks better explain multivoxel patterns in the hippocampus ...
BASE
Show details
2
Does language help generalization in vision models?
In: ViGIL workshop (Visually Grounded Interaction and Language), NAACL 2021 ; https://hal.archives-ouvertes.fr/hal-03311763 ; ViGIL workshop (Visually Grounded Interaction and Language), NAACL 2021, 2021, online, Mexico (2021)
BASE
Show details
3
Multimodal neural networks better explain multivoxel patterns in the hippocampus
In: Neural Information Processing Systems (NeurIPS) conference: 3rd Workshop on Shared Visual Representations in Human and Machine Intelligence (SVRHM 2021) ; https://hal.archives-ouvertes.fr/hal-03428635 ; Neural Information Processing Systems (NeurIPS) conference: 3rd Workshop on Shared Visual Representations in Human and Machine Intelligence (SVRHM 2021), Dec 2021, Virtual Conference, United States ; https://openreview.net/forum?id=6dymbuga7nL (2021)
BASE
Show details
4
Does language help generalization in vision models?
In: CoNLL 2021: Conference on Computational Natural Language Learning ; https://hal.archives-ouvertes.fr/hal-03452804 ; CoNLL 2021: Conference on Computational Natural Language Learning, 2021, Mexico, Mexico. pp.171-182 (2021)
BASE
Show details
5
Does language help generalization in vision models? ...
BASE
Show details
6
Does language help generalization in vision models? ...
Abstract: Vision models trained on multimodal datasets can benefit from the wide availability of large image-caption datasets. A recent model (CLIP) was found to generalize well in zero-shot and transfer learning settings. This could imply that linguistic or "semantic grounding" confers additional generalization abilities to the visual feature space. Here, we systematically evaluate various multimodal architectures and vision-only models in terms of unsupervised clustering, few-shot learning, transfer learning and adversarial robustness. In each setting, multimodal training produced no additional generalization capability compared to standard supervised visual training. We conclude that work is still required for semantic grounding to help improve vision models. ...
Keyword: Computational Linguistics; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
URL: https://dx.doi.org/10.48448/sn65-6g21
https://underline.io/lecture/39877-does-language-help-generalization-in-vision-modelsquestion
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern