1 |
Integrating Vectorized Lexical Constraints for Neural Machine Translation ...
|
|
|
|
Abstract:
Lexically constrained neural machine translation (NMT), which controls the generation of NMT models with pre-specified constraints, is important in many practical scenarios. Due to the representation gap between discrete constraints and continuous vectors in NMT models, most existing works choose to construct synthetic data or modify the decoding algorithm to impose lexical constraints, treating the NMT model as a black box. In this work, we propose to open this black box by directly integrating the constraints into NMT models. Specifically, we vectorize source and target constraints into continuous keys and values, which can be utilized by the attention modules of NMT models. The proposed integration method is based on the assumption that the correspondence between keys and values in attention modules is naturally suitable for modeling constraint pairs. Experimental results show that our method consistently outperforms several representative baselines on four language pairs, demonstrating the superiority of ... : Accepted by ACL 2022 (main conference) ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2203.12210 https://arxiv.org/abs/2203.12210
|
|
BASE
|
|
Hide details
|
|
2 |
Contextual Semantic-Guided Entity-Centric GCN for Relation Extraction
|
|
|
|
In: Mathematics; Volume 10; Issue 8; Pages: 1344 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Virtual Reality-Integrated Immersion-Based Teaching to English Language Learning Outcome
|
|
|
|
In: Front Psychol (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Alternated Training with Synthetic and Authentic Data for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
CPM-2: Large-scale Cost-effective Pre-trained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Assessing Multilingual Fairness in Pre-trained Multimodal Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Dialog{S}um: {A} Real-Life Scenario Dialogue Summarization Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Transfer Learning for Sequence Generation: from Single-source to Multi-source ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Learning to Selectively Learn for Weakly-supervised Paraphrase Generation ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Analyzing the Limits of Self-Supervision in Handling Bias in Language ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Statistically significant detection of semantic shifts using contextual word embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Statistically Significant Detection of Semantic Shifts using Contextual Word Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|