1 |
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
|
|
|
|
In: https://hal.inria.fr/hal-03540069 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Sub-Character Tokenization for Chinese Pretrained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
What's in a Name? Answer Equivalence For Open-Domain Question Answering ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|