DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7 8 9...42
Hits 81 – 100 of 830

81
Wikily Supervised Neural Translation Tailored to Cross-Lingual Tasks ...
BASE
Show details
82
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization ...
BASE
Show details
83
Sorting through the noise: Testing robustness of information processing in pre-trained language models ...
BASE
Show details
84
Building the Directed Semantic Graph for Coherent Long Text Generation ...
BASE
Show details
85
Detect and Classify – Joint Span Detection and Classification for Health Outcomes ...
BASE
Show details
86
Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation ...
Abstract: Anthology paper link: https://aclanthology.org/2021.emnlp-main.132/ Abstract: We study the power of cross-attention in the Transformer architecture within the context of transfer learning for machine translation, and extend the findings of studies into cross-attention when training from scratch. We conduct a series of experiments through fine-tuning a translation model on data where either the source or target language has changed. These experiments reveal that fine-tuning only the cross-attention parameters is nearly as effective as fine-tuning all parameters (i.e., the entire translation model). We provide insights into why this is the case and observe that limiting fine-tuning in this manner yields cross-lingually aligned embeddings. The implications of this finding for researchers and practitioners include a mitigation of catastrophic forgetting, the potential for zero-shot translation, and the ability to extend machine translation models to several new language pairs with reduced parameter storage ...
Keyword: Computational Linguistics; Machine Learning; Machine Learning and Data Mining; Machine translation; Natural Language Processing
URL: https://dx.doi.org/10.48448/pgv2-cn55
https://underline.io/lecture/37970-cross-attention-is-all-you-need-adapting-pretrained-transformers-for-machine-translation
BASE
Hide details
87
Evaluation of Summarization Systems across Gender, Age, and Race ...
BASE
Show details
88
A Language Model-based Generative Classifier for Sentence-level Discourse Parsing ...
BASE
Show details
89
Controllable Neural Dialogue Summarization with Personal Named Entity Planning ...
BASE
Show details
90
Foreseeing the Benefits of Incidental Supervision ...
BASE
Show details
91
Graphine: A Dataset for Graph-aware Terminology Definition Generation ...
BASE
Show details
92
CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization ...
BASE
Show details
93
Connecting Attributions and QA Model Behavior on Realistic Counterfactuals ...
BASE
Show details
94
Provable Limitations of Acquiring Meaning from Ungrounded Form: What will Future Language Models Understand? ...
BASE
Show details
95
Generation and Extraction Combined Dialogue State Tracking with Hierarchical Ontology Integration ...
BASE
Show details
96
Error-Sensitive Evaluation for Ordinal Target Variables ...
BASE
Show details
97
CDLM: Cross-Document Language Modeling ...
BASE
Show details
98
Data-to-text Generation by Splicing Together Nearest Neighbors ...
BASE
Show details
99
Natural Language Processing Meets Quantum Physics: A Survey and Categorization ...
BASE
Show details
100
End-to-end style-conditioned poetry generation: What does it take to learn from examples alone? ...
BASE
Show details

Page: 1 2 3 4 5 6 7 8 9...42

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
830
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern