
Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
Ponti, Edoardo; Glavaš, Goran; Majewska, Olga. - Apollo - University of Cambridge Repository, 2020
2
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity ...
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - Apollo - University of Cambridge Repository, 2020
3
Probing Pretrained Language Models for Lexical Semantics ...
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert. - Apollo - University of Cambridge Repository, 2020
4
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity
Lauscher, Anne; Vulic, Ivan; Ponti, Edoardo. - International Committee on Computational Linguistics: Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), 2020. https://www.aclweb.org/anthology/2020.coling-main.118
5
Probing Pretrained Language Models for Lexical Semantics
Vulic, Ivan; Ponti, Edoardo; Litschko, Robert; Glavas, Goran; Korhonen, Anna-Leena. - Association for Computational Linguistics: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
Abstract: The success of large pretrained language models (LMs) such as BERT and RoBERTa has sparked interest in probing their representations, in order to unveil what types of knowledge they implicitly capture. While prior research focused on morphosyntactic, semantic, and world knowledge, it remains unclear to what extent LMs also derive lexical type-level knowledge from words in context. In this work, we present a systematic empirical analysis across six typologically diverse languages and five different lexical tasks, addressing the following questions: 1) How do different lexical knowledge extraction strategies (monolingual versus multilingual source LM, out-of-context versus in-context encoding, inclusion of special tokens, and layer-wise averaging) impact performance? How consistent are the observed effects across tasks and languages? 2) Is lexical knowledge stored in a few parameters, or is it scattered throughout the network? 3) How do these representations fare against traditional static word vectors in lexical tasks? 4) Does the lexical information emerging from independently trained monolingual LMs display latent similarities? Our main results indicate patterns and best practices that hold universally, but also point to prominent variations across languages and tasks. Moreover, we validate the claim that lower Transformer layers carry more type-level lexical knowledge, but also show that this knowledge is distributed across multiple layers.
URL: https://www.repository.cam.ac.uk/handle/1810/315105
https://doi.org/10.17863/CAM.62212
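
One of the extraction strategies the abstract above enumerates, out-of-context encoding with layer-wise averaging over the lower Transformer layers, can be sketched in a few lines. The snippet below is a minimal illustration, not the paper's code; the model name, the choice of layers 1-6, and the cosine-similarity probe are assumptions for the example, using the Hugging Face transformers API.

# Minimal sketch (not the paper's implementation) of one probing strategy
# from the abstract: encode a word out of context with a pretrained LM,
# drop special tokens, and average subword vectors over the lower layers,
# which the paper finds carry more type-level lexical knowledge.
# Model name and the layer range 1-6 are illustrative assumptions.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModel.from_pretrained(
    "bert-base-multilingual-cased", output_hidden_states=True
)
model.eval()

def word_vector(word: str, layers=range(1, 7)) -> torch.Tensor:
    """Out-of-context type-level vector: feed the bare word, mask out
    the special tokens ([CLS]/[SEP]), then average subwords and layers."""
    inputs = tokenizer(word, return_tensors="pt")
    with torch.no_grad():
        # Tuple of hidden states: embeddings + one tensor per layer.
        hidden = model(**inputs).hidden_states
    # Stack the chosen layers: shape (n_layers, seq_len, dim).
    stacked = torch.stack([hidden[i][0] for i in layers])
    # Boolean mask that drops special-token positions before averaging.
    ids = inputs["input_ids"][0]
    keep = torch.tensor(
        [tok not in tokenizer.all_special_ids for tok in ids.tolist()]
    )
    return stacked[:, keep].mean(dim=(0, 1))  # average layers and subwords

# Example usage: cosine similarity as a crude lexical-similarity probe.
sim = torch.cosine_similarity(word_vector("car"), word_vector("automobile"), dim=0)
print(f"car ~ automobile: {sim.item():.3f}")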
6
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
Liu, Qianchu; Korhonen, Anna-Leena; Majewska, Olga. - Association for Computational Linguistics: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), 2020
7
Specialising Distributional Vectors of All Words for Lexical Entailment ...
Kamath, Aishwarya; Pfeiffer, Jonas; Ponti, Edoardo. - Apollo - University of Cambridge Repository, 2019
