1 |
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
32 II-A2 TreeBERT: A Tree-Based Pre-Trained Model for Programming Language ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Using Artificial Intelligence for the Construction of University Physical Training and Teaching Systems
|
|
|
|
In: J Healthc Eng (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Common Brain Substrates Underlying Auditory Speech Priming and Perceived Spatial Separation
|
|
|
|
In: Front Neurosci (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Bundle-driven move analysis: Sentence initial lexical bundles in PhD abstracts
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Measurement of $W^{\pm}$-boson and $Z$-boson production cross-sections in $pp$ collisions at $\sqrt{s}=2.76$ TeV with the ATLAS detector
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ...
|
|
|
|
Abstract:
Weakly supervised referring expression grounding aims at localizing the referential object in an image according to the linguistic query, where the mapping between the referential object and query is unknown in the training stage. To address this problem, we propose a novel end-to-end adaptive reconstruction network (ARN). It builds the correspondence between image region proposal and query in an adaptive manner: adaptive grounding and collaborative reconstruction. Specifically, we first extract the subject, location and context features to represent the proposals and the query respectively. Then, we design the adaptive grounding module to compute the matching score between each proposal and query by a hierarchical attention model. Finally, based on attention score and proposal features, we reconstruct the input query with a collaborative loss of language reconstruction loss, adaptive reconstruction loss, and attribute classification loss. This adaptive mechanism helps our model to alleviate the variance of ... : Accepted by ICCV 2019 ...
|
|
Keyword:
Computer Vision and Pattern Recognition cs.CV; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/1908.10568 https://dx.doi.org/10.48550/arxiv.1908.10568
|
|
BASE
|
|
Hide details
|
|
11 |
Sentence initial bundles: A comparative study between Chinese master’s L2 theses and published writing
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Chinese postgraduates' explanation of the sources of sentence initial bundles in their thesis writing
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Logical Parsing from Natural Language Based on a Neural Translation Model ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Family involvement in preschoolers' bilingual heritage language development: a cultural-historical study of Chinese-Australian families' everyday practices ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Sentence initial bundles in L2 thesis writing: A comparative study of Chinese L2 and New Zealand L1 postgraduates’ writing
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Interaction between auditory and motor systems in speech perception
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Primitive Auditory Memory Is Correlated with Spatial Unmasking That Is Based on Direct-Reflection Integration
|
|
|
|
BASE
|
|
Show details
|
|
|
|