1 |
Pathologies of Pre-trained Language Models in Few-shot Fine-tuning ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A Dataset and Baselines for Multilingual Reply Suggestion ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
A Conditional Generative Matching Model for Multi-lingual Reply Suggestion ...
|
|
|
|
Abstract:
We study the problem of multilingual automated reply suggestions (RS) model serving many languages simultaneously. Multilingual models are often challenged by model capacity and severe data distribution skew across languages. While prior works largely focus on monolingual models, we propose Conditional Generative Matching models (CGM), optimized within a Variational Autoencoder framework to address challenges arising from multi-lingual RS. CGM does so with expressive message conditional priors, mixture densities to enhance multi-lingual data representation, latent alignment for language discrimination, and effective variational optimization techniques for training multi-lingual RS. The enhancements result in performance that exceed competitive baselines in relevance (ROUGE score) by more than 10\% on average, and 16\% for low resource languages. CGM also shows remarkable improvements in diversity (80\%) illustrating its expressiveness in representation of multi-lingual data. ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2109.07046 https://dx.doi.org/10.48550/arxiv.2109.07046
|
|
BASE
|
|
Hide details
|
|
5 |
XtremeDistilTransformers: Task Transfer for Task-agnostic Distillation ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Say `YES' to Positivity: Detecting Toxic Language in Workplace Communications ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
A Conditional Generative Matching Model for Multi-lingual Reply Suggestion ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
XtremeDistil: Multi-stage Distillation for Massive Multilingual Models ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Smart To-Do : Automatic Generation of To-Do Items from Emails ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Distilling BERT into Simple Neural Networks with Unlabeled Transfer Data ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Multi-Source Cross-Lingual Model Transfer: Learning What to Share ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Identifying Roles in Social Networks using Linguistic Analysis.
|
|
|
|
BASE
|
|
Show details
|
|
|
|