21 |
Positive AI with Social Commonsense Models
|
|
|
|
Abstract:
Thesis (Ph.D.)--University of Washington, 2021 ; To effectively understand language and safely communicate with humans, machines must not only grasp the surface meanings of texts, but also their underlying social meaning. This requires understanding interpersonal social commonsense, such as knowing to thank someone for giving you a present, as well as accounting for harmful social biases and stereotypes. While understanding these implied social dynamics is easy for most humans, it remains an elusive goal for AI and NLP systems. Importantly, systems that fail to account for these social and power dynamics risk producing redundant, rude, or even harmful outputs. In this dissertation, we take several steps towards making NLP systems more human-centric, socially aware, and equity driven, motivated by the increased prowess and prevalence of AI and NLP technology. In the first part, we investigate methods for enabling NLP systems to reason about and revise the commonsense implications of text. We introduce ATOMIC, the first largescale social commonsense knowledge graph for machines to reason about the causes and effects of everyday situations, and POWERTRANSFORMER, a system to revise the social implications of text using connotation frames of power and agency. In the second part, we tackle the problem of detecting and representing social biases and toxicity in language with socially aware NLP models. We examine shortcomings of existing toxic language detection tools, uncovering strong racial biases which causes text written by African American authors to be flagged as toxic more often than by white authors. Then, we introduce SOCIAL BIAS FRAMES, a new structured linguistic representation for distilling the harmful or biased implications of text in free-text explanations. We conclude by the discussing the contributions of this dissertation as well as future directions towards improving the social awareness and equity of NLP systems.
|
|
Keyword:
Artificial intelligence; Commonsense reasoning; Computer science; Computer science and engineering; language connotations; Linguistics; social biases; toxic language
|
|
URL: http://hdl.handle.net/1773/47999
|
|
BASE
|
|
Hide details
|
|
22 |
Semantic Comparisons for Natural Language Processing Applications
|
|
|
|
BASE
|
|
Show details
|
|
23 |
Challenges in Automated Debiasing for Toxic Language Detection
|
|
|
|
BASE
|
|
Show details
|
|
24 |
Parsing with Multilingual BERT, a Small Corpus, and a Small Treebank ...
|
|
|
|
BASE
|
|
Show details
|
|
26 |
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
27 |
Evaluating Models' Local Decision Boundaries via Contrast Sets ...
|
|
|
|
BASE
|
|
Show details
|
|
28 |
Grounded Compositional Outputs for Adaptive Language Modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
29 |
Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction
|
|
|
|
In: Computational Linguistics, Vol 46, Iss 2, Pp 249-255 (2020) (2020)
|
|
BASE
|
|
Show details
|
|
30 |
Polyglot Contextual Representations Improve Crosslingual Transfer ...
|
|
|
|
BASE
|
|
Show details
|
|
31 |
Situating Sentence Embedders with Nearest Neighbor Overlap ...
|
|
|
|
BASE
|
|
Show details
|
|
32 |
Low-Resource Parsing with Crosslingual Contextualized Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
33 |
Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning ...
|
|
|
|
BASE
|
|
Show details
|
|
35 |
Measuring Online Debaters’ Persuasive Skill from Text over Time
|
|
|
|
In: Transactions of the Association for Computational Linguistics, Vol 7, Pp 537-550 (2019) (2019)
|
|
BASE
|
|
Show details
|
|
36 |
Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
37 |
Nonparametric Word Segmentation for Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
38 |
Unsupervised Bilingual POS Tagging with Markov Random Fields ...
|
|
|
|
BASE
|
|
Show details
|
|
39 |
Unsupervised Bilingual POS Tagging with Markov Random Fields ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|