DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 31

1
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
Ponti, Edoardo; Glavaš, Goran; Majewska, Olga. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
2
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment ...
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
3
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity ...
Vulic, Ivan; Baker, Simon; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
4
Probing Pretrained Language Models for Lexical Semantics ...
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
5
The Secret is in the Spectra: Predicting Cross-Lingual Task Performance with Spectral Similarity Measures ...
Dubossarsky, Haim; Vulic, Ivan; Reichart, Roi. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
6
Spatial multi-arrangement for clustering and multi-way similarity dataset construction ...
Majewska, Olga; McCarthy, D; Van Den Bosch, J. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
7
The Secret is in the Spectra: Predicting Cross-Lingual Task Performance with Spectral Similarity Measures
Dubossarsky, Haim; Vulic, Ivan; Reichart, Roi. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
BASE
Show details
8
Spatial multi-arrangement for clustering and multi-way similarity dataset construction
Majewska, Olga; McCarthy, D; van den Bosch, J. - : European Language Resources Association, 2020. : LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings, 2020
BASE
Show details
9
Probing Pretrained Language Models for Lexical Semantics
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert; Glavas, Goran; Korhonen, Anna-Leena. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
Abstract: The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on morphosyntactic, semantic, and world knowledge, it remains unclear to which extent LMs also derive lexical type-level knowledge from words in context. In this work, we present a systematic empirical analysis across six typologically diverse languages and five different lexical tasks, addressing the following questions: 1) How do different lexical knowledge extraction strategies (monolingual versus multilingual source LM, out-of-context versus in-context encoding, inclusion of special tokens, and layer-wise averaging) impact performance? How consistent are the observed effects across tasks and languages? 2) Is lexical knowledge stored in few parameters, or is it scattered throughout the network? 3) How do these representations fare against traditional static word vectors in lexical tasks? 4) Does the lexical information emerging from independently trained monolingual LMs display latent similarities? Our main results indicate patterns and best practices that hold universally, but also point to prominent variations across languages and tasks. Moreover, we validate the claim that lower Transformer layers carry more type-level lexical knowledge, but also show that this knowledge is distributed across multiple layers.
URL: https://www.repository.cam.ac.uk/handle/1810/315105
https://doi.org/10.17863/CAM.62212
BASE
Hide details
10
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
Glavas, Goran; Vulic, Ivan; Korhonen, Anna-Leena. - : International Committee for Computational Linguistics, 2020. : https://www.aclweb.org/anthology/2020.semeval-1.2, 2020. : Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval 2020), 2020
BASE
Show details
11
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Liu, Qianchu; Korhonen, Anna-Leena; Majewska, Olga. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
BASE
Show details
12
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing ...
Ponti, Edoardo; O'Horan, Helen; Berzak, Yevgeni. - : Apollo - University of Cambridge Repository, 2019
BASE
Show details
13
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines ...
Shareghi, Ehsan; Gerz, Daniela; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2019
BASE
Show details
14
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
Reichart, Roi; Shutova, Ekaterina; Korhonen, Anna-Leena. - : MIT Press - Journals, 2019. : COMPUTATIONAL LINGUISTICS, 2019
BASE
Show details
15
Bio-SimVerb ...
Chiu, Hon Wing; Pyysalo, Sampo; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
16
Isomorphic Transfer of Syntactic Structures in Cross-Lingual NLP ...
Ponti, Edoardo; Reichart, Roi; Korhonen, Anna-Leena. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
17
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction ...
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
18
Injecting Lexical Contrast into Word Vectors by Guiding Vector Space Specialisation ...
Vulic, Ivan; Korhonen, Anna-Leena; Linguist, Assoc Computat. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
19
Investigating the cross-lingual translatability of VerbNet-style classification. ...
Majewska, Olga; Vulić, Ivan; McCarthy, Diana. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
20
Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources ...
Vulic, Ivan; Glavaš, Goran; Mrkšić, Nikola. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
31
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern