1 |
StableMoE: Stable Routing Strategy for Mixture of Experts ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
On the Representation Collapse of Sparse Mixture of Experts ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Controllable Natural Language Generation with Contrastive Prefixes ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
s2s-ft: Fine-Tuning Pretrained Transformer Encoders for Sequence-to-Sequence Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Harvesting and Refining Question-Answer Pairs for Unsupervised QA ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Can Monolingual Pretrained Models Help Cross-Lingual Classification? ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Cross-Lingual Natural Language Generation via Pre-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Learning natural language interfaces with neural models
|
|
Dong, Li. - : The University of Edinburgh, 2019
|
|
BASE
|
|
Show details
|
|
|
|