1 |
How Efficiency Shapes Human Language
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03552539 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
When classifying grammatical role, BERT doesn't care about word order... except when it matters ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Grammatical cues are largely, but not completely, redundant with word meanings in natural language ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
When Classifying Arguments, BERT Doesn't Care About Word Order. Except When It Matters
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Efficient communication and the organization of the lexicon
|
|
|
|
In: OUP volume on the Mental Lexicon ; https://hal.archives-ouvertes.fr/hal-03482414 ; OUP volume on the Mental Lexicon, In press, ⟨10.31234/osf.io/4an6v⟩ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space ...
|
|
|
|
Abstract:
In cross-lingual language models, representations for many different languages live in the same space. Here, we investigate the linguistic and non-linguistic factors affecting sentence-level alignment in cross-lingual pretrained language models for 101 languages and 5,050 language pairs. Using BERT-based LaBSE and BiLSTM-based LASER as our models, and the Bible as our corpus, we compute a task-based measure of cross-lingual alignment in the form of bitext retrieval performance, as well as four intrinsic measures of vector space alignment and isomorphism. We then examine a range of linguistic, quasi-linguistic, and training-related features as potential predictors of these alignment metrics. The results of our analyses show that word order agreement and agreement in morphological complexity are two of the strongest linguistic predictors of cross-linguality. We also note in-family training data as a stronger predictor than language-specific training data across the board. We verify some of our linguistic ... : 15 pages, 8 figures, EMNLP 2021 ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; I.2.7; Machine Learning cs.LG
|
|
URL: https://arxiv.org/abs/2109.06324 https://dx.doi.org/10.48550/arxiv.2109.06324
|
|
BASE
|
|
Hide details
|
|
8 |
Deep Subjecthood: Higher-Order Grammatical Features in Multilingual BERT ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Multilingual BERT, Ergativity, and Grammatical Subjecthood ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Decrypting cryptic crosswords: Semantically complex wordplay puzzles as a target for NLP ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
REPLICATING A FUNDAMENTAL FINDING IN PSYCHOLINGUISTICS: SYNTACTIC PRIMING ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
How (Non-)Optimal is the Lexicon?
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
|
|
BASE
|
|
Show details
|
|
|
|