1 |
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? ...
|
|
|
|
Abstract:
Is it possible to use natural language to intervene in a model's behavior and alter its prediction in a desired way? We investigate the effectiveness of natural language interventions for reading-comprehension systems, studying this in the context of social stereotypes. Specifically, we propose a new language understanding task, Linguistic Ethical Interventions (LEI), where the goal is to amend a question-answering (QA) model's unethical behavior by communicating context-specific principles of ethics and equity to it. To this end, we build upon recent methods for quantifying a system's social stereotypes, augmenting them with different kinds of ethical interventions and the desired model behavior under such interventions. Our zero-shot evaluation finds that even today's powerful neural language models are extremely poor ethical-advice takers, that is, they respond surprisingly little to ethical interventions even though these interventions are stated as simple sentences. Few-shot learning improves model ... : 9 pages, Findings of ACL-IJCNLP 2021 ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://arxiv.org/abs/2106.01465 https://dx.doi.org/10.48550/arxiv.2106.01465
|
|
BASE
|
|
Hide details
|
|
2 |
"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Examining Gender Bias in Languages with Grammatical Gender ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|