1 |
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
GTAE: Graph-Transformer based Auto-Encoders for Linguistic-Constrained Text Style Transfer ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Linguistically Driven Graph Capsule Network for Visual Question Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Heterogeneous Graph Learning for Visual Commonsense Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Recurrent Topic-Transition GAN for Visual Paragraph Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Deep Structured Scene Parsing by Learning with Image Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|