Page: 1 2 3 4 5 6 7 8... 690
61 |
Statistical and Spatio-temporal Hand Gesture Features for Sign Language Recognition using the Leap Motion Sensor ...
|
|
|
|
BASE
|
|
Show details
|
|
64 |
Giant Pigeon and Small Person: Prompting Visually Grounded Models about the Size of Objects ...
|
|
Zhang, Yi. - : Purdue University Graduate School, 2022
|
|
BASE
|
|
Show details
|
|
65 |
Giant Pigeon and Small Person: Prompting Visually Grounded Models about the Size of Objects ...
|
|
Zhang, Yi. - : Purdue University Graduate School, 2022
|
|
BASE
|
|
Show details
|
|
66 |
pNLP-Mixer: an Efficient all-MLP Architecture for Language ...
|
|
|
|
BASE
|
|
Show details
|
|
67 |
Multilingual Abusiveness Identification on Code-Mixed Social Media Text ...
|
|
|
|
BASE
|
|
Show details
|
|
68 |
hate-alert@DravidianLangTech-ACL2022: Ensembling Multi-Modalities for Tamil TrollMeme Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
69 |
StableMoE: Stable Routing Strategy for Mixture of Experts ...
|
|
|
|
BASE
|
|
Show details
|
|
70 |
BERTuit: Understanding Spanish language in Twitter through a native transformer ...
|
|
|
|
Abstract:
The appearance of complex attention-based language models such as BERT, Roberta or GPT-3 has allowed to address highly complex tasks in a plethora of scenarios. However, when applied to specific domains, these models encounter considerable difficulties. This is the case of Social Networks such as Twitter, an ever-changing stream of information written with informal and complex language, where each message requires careful evaluation to be understood even by humans given the important role that context plays. Addressing tasks in this domain through Natural Language Processing involves severe challenges. When powerful state-of-the-art multilingual language models are applied to this scenario, language specific nuances use to get lost in translation. To face these challenges we present \textbf{BERTuit}, the larger transformer proposed so far for Spanish language, pre-trained on a massive dataset of 230M Spanish tweets using RoBERTa optimization. Our motivation is to provide a powerful resource to better ... : Support: 1) BBVA FOUNDATION - CIVIC, 2) Spanish Ministry of Science and Innovation - FightDIS (PID2020-117263GB-100) and XAI-Disinfodemics (PLEC2021-007681), 3) Comunidad Autonoma de Madrid - S2018/TCS-4566, 4) European Comission - IBERIFIER (2020-EU-IA-0252), 5) Digital Future Society (Mobile World Capital Barcelona) - DisTrack, 6) UPM - Programa de Excelencia para el Profesorado Universitario ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Machine Learning cs.LG
|
|
URL: https://dx.doi.org/10.48550/arxiv.2204.03465 https://arxiv.org/abs/2204.03465
|
|
BASE
|
|
Hide details
|
|
71 |
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification ...
|
|
|
|
BASE
|
|
Show details
|
|
73 |
Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
74 |
Data Bootstrapping Approaches to Improve Low Resource Abusive Language Detection for Indic Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
75 |
Out of Thin Air: Is Zero-Shot Cross-Lingual Keyword Detection Better Than Unsupervised? ...
|
|
|
|
BASE
|
|
Show details
|
|
76 |
Assessment of Massively Multilingual Sentiment Classifiers ...
|
|
|
|
BASE
|
|
Show details
|
|
78 |
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
79 |
DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8... 690
|
|