2 |
Fairlex: A multilingual benchmark for evaluating fairness in legal text processing ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Fairlex: A multilingual benchmark for evaluating fairness in legal text processing ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Introducing Neural Bag of Whole-Words with ColBERTer: Contextualized Late Interactions using Enhanced Reduction ...
|
|
|
|
Abstract:
Recent progress in neural information retrieval has demonstrated large gains in effectiveness, while often sacrificing the efficiency and interpretability of the neural model compared to classical approaches. This paper proposes ColBERTer, a neural retrieval model using contextualized late interaction (ColBERT) with enhanced reduction. Along the effectiveness Pareto frontier, ColBERTer's reductions dramatically lower ColBERT's storage requirements while simultaneously improving the interpretability of its token-matching scores. To this end, ColBERTer fuses single-vector retrieval, multi-vector refinement, and optional lexical matching components into one model. For its multi-vector component, ColBERTer reduces the number of stored vectors per document by learning unique whole-word representations for the terms in each document and learning to identify and remove word representations that are not essential to effective scoring. We employ an explicit multi-task, multi-stage training to facilitate using very ...
|
|
Keyword:
Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences; Information Retrieval cs.IR; Machine Learning cs.LG
|
|
URL: https://arxiv.org/abs/2203.13088 https://dx.doi.org/10.48550/arxiv.2203.13088
|
|
BASE
|
|
Hide details
|
|
9 |
Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
YET LIE THEY DO: A LIST-EXPERIMENT FOR ESTIMATING ANTI-IMMIGRANT SENTIMENT AND SOCIAL DESIRABILITY BIAS
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Modified Gravity and Cosmology: An Update by the CANTATA Network
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03261155 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
The Dawn of the Human-Machine Era: A forecast of new and emerging language technologies.
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03230287 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
UNITEX 3.3 User Manual
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03589580 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
14 |
UNITEX 3.3 Manuel d'utilisation
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03589598 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Modified Gravity and Cosmology: An Update by the CANTATA Network ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Towards Human-Free Automatic Quality Evaluation of German Summarization ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Linguistically Informed Masking for Representation Learning in the Patent Domain ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Persuasive Natural Language Generation -- A Literature Review ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|