2 | An Information-Theoretic Characterization of Morphological Fusion ... | BASE
3 | A Simple Geometric Method for Cross-Lingual Linguistic Transformations with Pre-trained Autoencoders ... | BASE
4 | Exploring Pre-Trained Transformers and Bilingual Transfer Learning for Arabic Coreference Resolution ... | BASE
5 | Evaluating the Morphosyntactic Well-formedness of Generated Texts ... | BASE
6 | Visually Grounded Reasoning across Languages and Cultures ... | BASE
7 | I Wish I Would Have Loved This One, But I Didn't -- A Multilingual Dataset for Counterfactual Detection in Product Review ... | BASE
8 | MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer ... | BASE
Abstract: We introduce MULTI-EURLEX, a new multilingual dataset for topic classification of legal documents. The dataset comprises 65k European Union (EU) laws, officially translated in 23 languages, annotated with multiple labels from the EUROVOC taxonomy. We highlight the effect of temporal concept drift and the importance of chronological, instead of random splits. We use the dataset as a testbed for zero-shot cross-lingual transfer, where we exploit annotated training documents in one language (source) to classify documents in another language (target). We find that fine-tuning a multilingually pre-trained model (XLM-ROBERTA, MT5) in a single source language leads to catastrophic forgetting of multilingual knowledge and, consequently, poor zero-shot transfer to other languages. Adaptation strategies, namely partial fine-tuning, adapters, BITFIT, LNFIT, originally proposed to accelerate fine-tuning for new end-tasks, help retain ...
Anthology paper link: https://aclanthology.org/2021.emnlp-main.559/
Keywords: Data Management System; Machine Learning; Machine translation; Natural Language Processing
URL: https://dx.doi.org/10.48448/r0sk-2844 ; https://underline.io/lecture/37536-multieurlex---a-multi-lingual-and-multi-label-legal-document-classification-dataset-for-zero-shot-cross-lingual-transfer
9 | IR like a SIR: Sense-enhanced Information Retrieval for Multiple Languages ... | BASE
10 | Chinese Opinion Role Labeling with Corpus Translation: A Pivot Study ... | BASE
12 | Genre as Weak Supervision for Cross-lingual Dependency Parsing ... | BASE
13 | Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering ... | BASE
14 | Recent Advances in Dialogue Machine Translation | In: Information ; Volume 12 ; Issue 11 (2021) | BASE
15 | Visually Grounded Reasoning across Languages and Cultures ... | BASE
16 | Students Who Study Together Learn Better: On the Importance of Collective Knowledge Distillation for Domain Transfer in Fact Verification ... | BASE
17 | On the Relation between Syntactic Divergence and Zero-Shot Performance ... | BASE
18 | Bridging the “gApp”: improving neural machine translation systems for multiword expression detection | In: 11 ; 1 ; 61 ; 80 (2020) | BASE
20 | PADIC: extension and new experiments | In: 7th International Conference on Advanced Technologies (ICAT), Apr 2018, Antalya, Turkey (2018) ; https://hal.archives-ouvertes.fr/hal-01718858 | BASE