1 |
Improving Pre-trained Language Models with Syntactic Dependency Prediction Task for Chinese Semantic Error Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
ExpMRC: explainability evaluation for machine reading comprehension
|
|
|
|
In: Heliyon (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Multilingual multi-aspect explainability analyses on machine reading comprehension models
|
|
|
|
In: iScience (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models ...
|
|
|
|
Abstract:
Achieving human-level performance on some of the Machine Reading Comprehension (MRC) datasets is no longer challenging with the help of powerful Pre-trained Language Models (PLMs). However, the internal mechanism of these artifacts remains unclear, placing an obstacle for further understanding these models. This paper focuses on conducting a series of analytical experiments to examine the relations between the multi-head self-attention and the final MRC system performance, revealing the potential explainability in PLM-based MRC models. To ensure the robustness of the analyses, we perform our experiments in a multilingual way on top of various PLMs. We discover that passage-to-question and passage understanding attentions are the most important ones in the question answering process, showing strong correlations to the final performance than other parts. Through comprehensive visualizations and case studies, we also observe several general findings on the attention maps, which can be helpful to understand how ... : 15 pages ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2108.11574 https://arxiv.org/abs/2108.11574
|
|
BASE
|
|
Hide details
|
|
5 |
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial Examples ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Learning to Bridge Metric Spaces: Few-shot Joint Learning of Intent Detection and Slot Filling ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Neural Stylistic Response Generation with Disentangled Latent Variables ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Language learners' enjoyment and emotion regulation in online collaborative learning
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Canonicalizing Open Knowledge Bases with Multi-Layered Meta-Graph Neural Network ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
TableGPT: Few-shot Table-to-Text Generation with Table Structure Reconstruction and Content Matching ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
N-LTP: An Open-source Neural Language Technology Platform for Chinese ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Towards Better UD Parsing: Deep Contextualized Word Embeddings, Ensemble, and Treebank Concatenation ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|