1 |
Deriving Disinformation Insights from Geolocalized Twitter Callouts ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
XLM-T: A Multilingual Language Model Toolkit for Twitter ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Distilling Relation Embeddings from Pre-trained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction ...
|
|
|
|
Abstract:
Term weighting schemes are widely used in Natural Language Processing and Information Retrieval. In particular, term weighting is the basis for keyword extraction. However, there are relatively few evaluation studies that shed light about the strengths and shortcomings of each weighting scheme. In fact, in most cases researchers and practitioners resort to the well-known tf-idf as default, despite the existence of other suitable alternatives, including graph-based models. In this paper, we perform an exhaustive and large-scale empirical comparison of both statistical and graph-based term weighting methods in the context of keyword extraction. Our analysis reveals some interesting findings such as the advantages of the less-known lexical specificity with respect to tf-idf, or the qualitative differences between statistical and graph-based methods. Finally, based on our findings we discuss and devise some suggestions for practitioners. Source code to reproduce our experimental results, including a keyword ... : Accepted by EMNLP 2021 main conference ...
|
|
Keyword:
Artificial Intelligence cs.AI; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://dx.doi.org/10.48550/arxiv.2104.08028 https://arxiv.org/abs/2104.08028
|
|
BASE
|
|
Hide details
|
|
5 |
Modelling general properties of nouns by selectively averaging contextualised embeddings
|
|
|
|
BASE
|
|
Show details
|
|
6 |
BERT is to NLP what AlexNet is to CV: can pre-trained language models identify analogies?
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Back to the basics: a quantitative analysis of statistical and graph-based term weighting schemes for keyword extraction
|
|
|
|
BASE
|
|
Show details
|
|
8 |
XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Learning Cross-Lingual Word Embeddings from Twitter via Distant Supervision
|
|
|
|
In: Proceedings of the International AAAI Conference on Web and Social Media; Vol. 14 (2020): Fourteenth International AAAI Conference on Web and Social Media; 72-82 ; 2334-0770 ; 2162-3449 (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Analysis and Evaluation of Language Models for Word Sense Disambiguation ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Understanding the source of semantic regularities in word embeddings
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Learning cross-lingual word embeddings from Twitter via distant supervision
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Meemi: A Simple Method for Post-processing and Integrating Cross-lingual Word Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
SenseDefs: a multilingual corpus of semantically annotated textual definitions : Exploiting multiple languages and resources jointly for high-quality Word Sense Disambiguation and Entity Linking [<Journal>]
|
|
|
|
DNB Subject Category Language
|
|
Show details
|
|
17 |
Improving Cross-Lingual Word Embeddings by Meeting in the Middle ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
The Interplay between Lexical Resources and Natural Language Processing ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
From Word to Sense Embeddings: A Survey on Vector Representations of Meaning ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|