2 | Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent ... | BASE
3 | Softmax Tree: An Accurate, Fast Classifier When the Number of Classes Is Large ... | BASE
5 | GOLD: Improving Out-of-Scope Detection in Dialogues using Data Augmentation ... | BASE
6 | RuleBERT: Teaching Soft Rules to Pre-Trained Language Models ... | BASE
7 | Implicit Premise Generation with Discourse-aware Commonsense Knowledge Models ... | BASE
8 | On the Challenges of Evaluating Compositional Explanations in Multi-Hop Inference: Relevance, Completeness, and Expert Ratings ... | BASE
11 | Enhanced Language Representation with Label Knowledge for Span Extraction ... | BASE
12 | The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers ... | BASE
13 | VeeAlign: Multifaceted Context Representation Using Dual Attention for Ontology Alignment ... | BASE
14 | Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning ... | BASE
15 | On Classifying whether Two Texts are on the Same Side of an Argument ... | BASE
16 | Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP ... | BASE
17 | MTAdam: Automatic Balancing of Multiple Training Loss Terms ... | BASE
18 | Types of Out-of-Distribution Texts and How to Detect Them ... | BASE
19 | Asking It All: Generating Contextualized Questions for any Semantic Role ... | BASE
20 | Competency Problems: On Finding and Removing Artifacts in Language Data ... | BASE