Hits 1 – 20 of 67

1	Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
	Vulic, Ivan; Baker, Simon; Ponti, Edoardo Maria; Petti, Ulla; Leviant, Ira; Wing, Kelly; Majewska, Olga; Bar, Eden; Malone, Matt; Poibeau, Thierry; Reichart, Roi; Korhonen, Anna
	In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02975786 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2020, 46 (4), pp.847-897 ; https://direct.mit.edu/coli/article/46/4/847/97326/Multi-SimLex-A-Large-Scale-Evaluation-of (2020)
	Abstract: Data and information related to the publication: https://multisimlex.com/ ; International audience ; We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e.g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e.g., Welsh, Kiswahili). Each language dataset is annotated for the lexical relation of semantic similarity and contains 1,888 semantically aligned concept pairs, providing a representative coverage of word classes (nouns, verbs, adjectives, adverbs), frequency ranks, similarity intervals, lexical fields, and concreteness levels. Additionally, owing to the alignment of concepts across languages, we provide a suite of 66 cross-lingual semantic similarity datasets. Due to its extensive size and language coverage, Multi-SimLex provides entirely novel opportunities for experimental evaluation and analysis. On its monolingual and cross-lingual benchmarks, we evaluate and analyze a wide array of recent state-of-the-art monolingual and cross-lingual representation models, including static and contextualized word embeddings (such as fastText, M-BERT and XLM), externally informed lexical representations, as well as fully unsupervised and (weakly) supervised cross-lingual word embeddings. We also present a step-by-step dataset creation protocol for creating consistent, Multi-SimLex-style resources for additional languages. We make these contributions -- the public release of Multi-SimLex datasets, their creation protocol, strong baseline results, and in-depth analyses which can be helpful in guiding future developments in multilingual lexical semantics and representation learning -- available via a website which will encourage community effort in further expansion of Multi-SimLex to many more languages. Such a large-scale semantic resource could inspire significant further advances in NLP across languages.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SCCO.COMP]Cognitive science/Computer science; [SCCO.LING]Cognitive science/Linguistics; [SHS.INFO]Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS.STAT]Humanities and Social Sciences/Methods and statistics; Lexicon; Linguistic Resource; Multilinguality; Semantics; Typology
	URL: https://hal.archives-ouvertes.fr/hal-02975786/file/coli_a_00391.pdf https://hal.archives-ouvertes.fr/hal-02975786 https://hal.archives-ouvertes.fr/hal-02975786/document
	BASE

2	Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations ...
	Coope, Sam; Farghly, Tyler; Gerz, Daniela. - : Apollo - University of Cambridge Repository, 2020
	BASE

3	Efficient Intent Detection with Dual Sentence Encoders ...
	Casanueva, Inigo; Temcinas, Tadas; Gerz, Daniela. - : Apollo - University of Cambridge Repository, 2020
	BASE

4	Multidirectional Associative Optimization of Function-Specific Word Representations ...
	Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Apollo - University of Cambridge Repository, 2020
	BASE

5	XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
	Ponti, Edoardo Maria; Glavaš, Goran; Majewska, Olga. - : arXiv, 2020
	BASE

6	Emergent Communication Pretraining for Few-Shot Machine Translation ...
	Li, Yaoyiran; Ponti, Edoardo M.; Vulić, Ivan. - : arXiv, 2020
	BASE

7	Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer ...
	Vidoni, Marko; Vulić, Ivan; Glavaš, Goran. - : arXiv, 2020
	BASE

8	MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer ...
	Pfeiffer, Jonas; Vulić, Ivan; Gurevych, Iryna. - : arXiv, 2020
	BASE

9	How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models ...
	Rust, Phillip; Pfeiffer, Jonas; Vulić, Ivan. - : arXiv, 2020
	BASE

10	UNKs Everywhere: Adapting Multilingual Language Models to New Scripts ...
	Pfeiffer, Jonas; Vulić, Ivan; Gurevych, Iryna. - : arXiv, 2020
	BASE

11	SemEval-2020 Task 3: Graded Word Similarity in Context ...
	The 28th International Conference on Computational Linguistics 2020; Armendariz, Carlos; Ljubešić, Nikola. - : Underline Science Inc., 2020
	BASE

12	MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer ...
	Pfeiffer, Jonas; Vulic, Ivan; Gurevych, Iryna. - : Apollo - University of Cambridge Repository, 2020
	BASE

13	Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis ...
	Majewska, Olga; Vulic, Ivan; McCarthy, Diana. - : Apollo - University of Cambridge Repository, 2020
	BASE

14	XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages ...
	Glavas, Goran; Karan, Mladen; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2020
	BASE

15	Emergent Communication Pretraining for Few-Shot Machine Translation ...
	Li, Yaoyiran; Ponti, Edoardo; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2020
	BASE

16	From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers ...
	Lauscher, Anne; Ravishankar, Vinit; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2020
	BASE

17	XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning ...
	Ponti, Edoardo; Glavaš, Goran; Majewska, Olga. - : Apollo - University of Cambridge Repository, 2020
	BASE

18	Emergent Communication Pretraining for Few-Shot Machine Translation ...
	The 28th International Conference on Computational Linguistics 2020; Korhonen, Anna; Li, Yaoyiran. - : Underline Science Inc., 2020
	BASE

19	Manual Clustering and Spatial Arrangement of Verbs for Multilingual Evaluation and Typology Analysis ...
	The 28th International Conference on Computational Linguistics 2020; Korhonen, Anna; Majewska, Olga. - : Underline Science Inc., 2020
	BASE

20	A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters ...
	Zhao, Mengjie; Zhu, Yi; Shareghi, Ehsan. - : arXiv, 2020
	BASE
