1 |
Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation ...
|
|
|
|
Abstract:
Token-level adaptive training approaches can alleviate the token imbalance problem and thus improve neural machine translation, through re-weighting the losses of different target tokens based on specific statistical metrics (e.g., token frequency or mutual information). Given that standard translation models make predictions on the condition of previous target contexts, we argue that the above statistical metrics ignore target context information and may assign inappropriate weights to target tokens. While one possible solution is to directly take target contexts into these statistical metrics, the target-context-aware statistical computing is extremely expensive, and the corresponding storage overhead is unrealistic. To solve the above issues, we propose a target-context-aware metric, named conditional bilingual mutual information (CBMI), which makes it feasible to supplement target context information for statistical metrics. Particularly, our CBMI can be formalized as the log quotient of the translation ... : Accepted at ACL 2022 as a long paper of main conference. The code is available at: https://github.com/songmzhang/CBMI ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2203.02951 https://dx.doi.org/10.48550/arxiv.2203.02951
|
|
BASE
|
|
Hide details
|
|
4 |
ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
EAG: Extract and Generate Multi-way Aligned Corpus for Complete Multi-lingual Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Modeling Bilingual Conversational Characteristics for Neural Chat Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Sequence-Level Training for Non-Autoregressive Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Competence-based Curriculum Learning for Multilingual Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|