1 |
EnCBP: A New Benchmark Dataset for Finer-Grained Cultural Background Prediction in English ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
B Cell-Restricted Depletion of Dnmt3a Activates Notch Signaling and Causes Chronic Lymphocytic Leukemia
|
|
|
|
In: Blood, vol 138, iss Supplement 1 (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks ...
|
|
|
|
Abstract:
This paper studies the relative importance of attention heads in Transformer-based models to aid their interpretability in cross-lingual and multi-lingual tasks. Prior research has found that only a few attention heads are important in each mono-lingual Natural Language Processing (NLP) task and pruning the remaining heads leads to comparable or improved performance of the model. However, the impact of pruning attention heads is not yet clear in cross-lingual and multi-lingual tasks. Through extensive experiments, we show that (1) pruning a number of attention heads in a multi-lingual Transformer-based model has, in general, positive effects on its performance in cross-lingual and multi-lingual tasks and (2) the attention heads to be pruned can be ranked using gradients and identified with a few trial experiments. Our experiments focus on sequence labeling tasks, with potential applicability on other cross-lingual and multi-lingual tasks. For comprehensiveness, we examine two pre-trained multi-lingual ... : In ACL 2021 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://arxiv.org/abs/2108.08375 https://dx.doi.org/10.48550/arxiv.2108.08375
|
|
BASE
|
|
Hide details
|
|
4 |
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Towards Improved Model Design for Authorship Identification: A Survey on Writing Style Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
When and Why Saying “Thank You” Is Better Than Saying “Sorry” in Redressing Service Failures: The Role of Self-Esteem ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
When and Why Saying “Thank You” Is Better Than Saying “Sorry” in Redressing Service Failures: The Role of Self-Esteem ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Empowering English Language Learners through Digital Literacies: Research, Complexities, and Implications
|
|
|
|
In: Media and Communication ; 7 ; 2 ; 128-136 ; Critical Perspectives on Digital Literacies: Creating a Path Forward (2019)
|
|
BASE
|
|
Show details
|
|
11 |
La influencia de dialectos chinos en el aprendizaje de la pronunciación de español
|
|
|
|
In: Foro de profesores de E/LE; Núm. 15 (2019): FORO DE PROFESORES DE E/LE ; 1886-337X (2019)
|
|
BASE
|
|
Show details
|
|
12 |
La influencia de dialectos chinos en el aprendizaje de la pronunciación de español ; The influence of Chinese dialects in learning Spanish pronunciation
|
|
|
|
In: Foro De Profesores De E-Le [ISSN 1886-337X],v. 15, p. 277-286, (2019) (2019)
|
|
BASE
|
|
Show details
|
|
13 |
Factores y estrategias del aprendiz preadolescente en chino como lengua extranjera. Estudio de caso en España
|
|
|
|
In: Onomázein: Revista de lingüística, filología y traducción de la Pontificia Universidad Católica de Chile, ISSN 0717-1285, Nº. 43, 2019, pags. 158-175 (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Dificultades de comprensión en el diseño del examen oficial de chino YCT2 (A1) ; Comprehension difficulties in the design of C hinese official examination YCT2 (A1)
|
|
|
|
In: Revista Nebrija de Lingüística aplicada a la enseñanza de Lenguas [ISSN 1699-6569], v. 11 (23), p. 146-162 (2017)
|
|
BASE
|
|
Show details
|
|
15 |
Dificultades de comprensión en el diseño del examen oficial de chino YCT2 (A1) ; Comprehension difficulties in the design of C hinese official examination YCT2 (A1)
|
|
|
|
In: Revista Nebrija de Lingüística aplicada a la enseñanza de Lenguas [ISSN 1699-6569], v. 11 (23), p. 146-162 (2017)
|
|
BASE
|
|
Show details
|
|
|
|