21 |
XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages ...
|
|
|
|
Abstract:
We present XHate -999, a multi-domain and multilingual evaluation data set for abusive language detection. By aligning test instances across six typologically diverse languages, XHate-999 for the first time allows for disentanglement of the domain transfer and language transfer effects in abusive language detection. We conduct a series of domain- and language-transfer experiments with state-of-the-art monolingual and multilingual transformer models, setting strong baseline results and profiling XH ATE -999 as a comprehensive evaluation resource for abusive language detection. Finally, we show that domain- and language-adaptation, via intermediate masked language modeling on abusive corpora in the target language, can lead to substantially improved abusive language detection in the target language in the zero-shot transfer setups. ...
|
|
URL: https://www.repository.cam.ac.uk/handle/1810/315111 https://dx.doi.org/10.17863/cam.62218
|
|
BASE
|
|
Hide details
|
|
22 |
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers ...
|
|
|
|
BASE
|
|
Show details
|
|
23 |
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
24 |
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers ...
|
|
|
|
BASE
|
|
Show details
|
|
25 |
Verb Knowledge Injection for Multilingual Event Processing ...
|
|
|
|
BASE
|
|
Show details
|
|
26 |
Probing Pretrained Language Models for Lexical Semantics ...
|
|
|
|
BASE
|
|
Show details
|
|
27 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
28 |
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
|
|
|
|
BASE
|
|
Show details
|
|
29 |
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
|
|
|
|
BASE
|
|
Show details
|
|
30 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment ...
|
|
|
|
BASE
|
|
Show details
|
|
31 |
Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces ...
|
|
|
|
BASE
|
|
Show details
|
|
32 |
Probing Pretrained Language Models for Lexical Semantics ...
|
|
|
|
BASE
|
|
Show details
|
|
33 |
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
34 |
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces ...
|
|
|
|
BASE
|
|
Show details
|
|
35 |
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces
|
|
|
|
BASE
|
|
Show details
|
|
36 |
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity
|
|
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.118, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
38 |
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
|
|
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.2, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
|
|
BASE
|
|
Show details
|
|
39 |
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction
|
|
|
|
BASE
|
|
Show details
|
|
40 |
Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers
|
|
Glavas, Goran; Agic, Zeljko; Vulic, Ivan. - : International Committee on Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.coling-main.345, 2020. : Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020
|
|
BASE
|
|
Show details
|
|
|
|