1 |
COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
ParsiNLU: A Suite of Language Understanding Challenges for Persian ...
|
|
Khashabi, Daniel; Cohan, Arman; Shakeri, Siamak; Hosseini, Pedram; Pezeshkpour, Pouya; Alikhani, Malihe; Aminnaseri, Moin; Bitaab, Marzieh; Brahman, Faeze; Ghazarian, Sarik; Gheini, Mozhdeh; Kabiri, Arman; Mahabadi, Rabeeh Karimi; Memarrast, Omid; Mosallanezhad, Ahmadreza; Noury, Erfan; Raji, Shahab; Rasooli, Mohammad Sadegh; Sadeghi, Sepideh; Azer, Erfan Sadeqi; Samghabadi, Niloofar Safi; Shafaei, Mahsa; Sheybani, Saber; Tazarv, Ali; Yaghoobzadeh, Yadollah. - : arXiv, 2020
|
|
Abstract:
Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language, one of the widely spoken languages in the world, and yet there are few NLU datasets available for this rich language. The availability of high-quality evaluation datasets is a necessity for reliable assessment of the progress on different NLU tasks and domains. We introduce ParsiNLU, the first benchmark in Persian language that includes a range of high-level tasks -- Reading Comprehension, Textual Entailment, etc. These datasets are collected in a multitude of ways, often involving manual annotations by native speakers. This results in over 14.5$k$ new instances across 6 distinct NLU tasks. Besides, we present the first results on state-of-the-art monolingual and multi-lingual pre-trained language-models on this benchmark and compare them with human performance, which ... : To appear on Transactions of the Association for Computational Linguistics (TACL), 2021 ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2012.06154 https://arxiv.org/abs/2012.06154
|
|
BASE
|
|
Hide details
|
|
7 |
Evaluating Models' Local Decision Boundaries via Contrast Sets ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
TransOMCS: From Linguistic Graphs to Commonsense Knowledge ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Question Answering as Global Reasoning over Semantic Abstractions ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Distributed knowledge based clinical auto-coding system
|
|
Kaur, Rajvir (S33301). - : U.S., Association for Computational Linguistics, 2019
|
|
BASE
|
|
Show details
|
|
|
|