1 | USCORE: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation ... | BASE
2 | Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations ... | BASE
3 | Towards Explainable Evaluation Metrics for Natural Language Generation ...
Abstract:
Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics (such as BERTScore or MoverScore) are based on black-box language models such as BERT or XLM-R. They often achieve strong correlations with human judgments, but recent research indicates that the lower-quality classical metrics remain dominant, one of the potential reasons being that their decision processes are transparent. To foster more widespread acceptance of the novel high-quality metrics, explainability thus becomes crucial. In this concept paper, we identify key properties and propose key goals of explainable machine translation evaluation metrics. We also provide a synthesizing overview over recent approaches for explainable machine translation metrics and discuss how they relate to those goals and properties. Further, we conduct own novel experiments, which (among others) find that current adversarial NLP techniques are unsuitable for automatically identifying limitations of high-quality black-box evaluation ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
URL: https://arxiv.org/abs/2203.11131 https://dx.doi.org/10.48550/arxiv.2203.11131
BASE
4 | End-to-end style-conditioned poetry generation: What does it take to learn from examples alone? ... | BASE
6 | Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset ... | BASE
7 | BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks ... | BASE
8 | Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors ... | BASE
9 | Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors ... | BASE
10 | Inducing Language-Agnostic Multilingual Representations ... | BASE
11 | Probing Multilingual BERT for Genetic and Typological Signals ... | BASE
12 | On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation ... | BASE
13 | How to Probe Sentence Embeddings in Low-Resource Languages: On Structural Design Choices for Probing Task Evaluation ... | BASE
14 | Vec2Sent: Probing Sentence Embeddings With Natural Language Generation ... | BASE
15 | From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks ... | BASE
16 | On the limitations of cross-lingual encoders as exposed by reference-free machine translation evaluation | BASE
17 | On aligning OpenIE extractions with Knowledge Bases: A case study | BASE
18 | Semantic Change and Emerging Tropes In a Large Corpus of New High German Poetry ... | BASE
19 | Cross-lingual Argumentation Mining: Machine Translation (and a bit of Projection) is All You Need! ... | BASE
20 | What is the Essence of a Claim? Cross-Domain Claim Identification ... | BASE