1 | GEM: A General Evaluation Benchmark for Multimodal Tasks ... | BASE
3 | GEM: A General Evaluation Benchmark for Multimodal Tasks ... | BASE
4 | UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation ... | BASE