3 |
Word Sense Disambiguation for 158 Languages using Word Embeddings Only ...
|
|
Logacheva, Varvara; Teslenko, Denis; Shelmanov, Artem; Remus, Steffen; Ustalov, Dmitry; Kutuzov, Andrey; Artemova, Ekaterina; Biemann, Chris; Ponzetto, Simone Paolo; Panchenko, Alexander. - : arXiv, 2020
|
|
Abstract:
Disambiguation of word senses in context is easy for humans, but is a major challenge for automatic approaches. Sophisticated supervised and knowledge-based models were developed to solve this task. However, (i) the inherent Zipfian distribution of supervised training instances for a given word and/or (ii) the quality of linguistic knowledge representations motivate the development of completely unsupervised and knowledge-free approaches to word sense disambiguation (WSD). They are particularly useful for under-resourced languages which do not have any resources for building either supervised and/or knowledge-based models. In this paper, we present a method that takes as input a standard pre-trained word embedding model and induces a fully-fledged word sense inventory, which can be used for disambiguation in context. We use this method to induce a collection of sense inventories for 158 languages on the basis of the original pre-trained fastText word embeddings by Grave et al. (2018), enabling WSD in these ... : 10 pages, 5 figures, 4 tables, accepted at LREC 2020 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2003.06651 https://arxiv.org/abs/2003.06651
|
|
BASE
|
|
Hide details
|
|
5 |
TextGraphs 2020 Shared Task on Multi-Hop Inference for Explanation Regeneration ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Datasets for Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Datasets for Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
HHMM at SemEval-2019 Task 2: Unsupervised frame induction using contextualized word embeddings
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Watset: Local-global graph clustering with applications in sense and frame induction
|
|
|
|
BASE
|
|
Show details
|
|
12 |
RUSSE'2018: A Shared Task on Word Sense Induction for the Russian Language ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
RUSSE: The First Workshop on Russian Semantic Similarity ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
RUSSE'2018: Human-Annotated Sense-Disambiguated Word Contexts for Russian ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
RUSSE'2018: Human-Annotated Sense-Disambiguated Word Contexts for Russian ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
An unsupervised word sense disambiguation system for under-resourced languages
|
|
|
|
BASE
|
|
Show details
|
|
20 |
RUSSE'2018 : a shared task on word sense induction for the Russian language
|
|
|
|
BASE
|
|
Show details
|
|
|
|