21 |
MasakhaNER: Named entity recognition for African languages
|
|
Adelani, David,; Abbott, Jade; Neubig, Graham; D'Souza, Daniel; Kreutzer, Julia; Lignos, Constantine; Palen-Michel, Chester; Buzaaba, Happy; Rijhwani, Shruti; Ruder, Sebastian; Mayhew, Stephen; Abebe Azime, Israel; Muhammad, Shamsuddeen,; Chinenye Emezue, Chris; Nakatumba-Nabende, Joyce; Ogayo, Perez; Aremu, Anuoluwapo; Gitau, Catherine; Mbaye, Derguene; Alabi, Jesujoba; Yimam, Seid,; Rabiu Gwadabe, Tajuddeen; Ezeani, Ignatius; Niyongabo, Rubungo,; Mukiibi, Jonathan; Otiende, Verrah; Orife, Iroro; David, Davis; Ngom, Samba; Adewumi, Tosin; Rayson, Paul; Adeyemi, Mofetoluwa; Muriuki, Gerald; Anebi, Emmanuel; Chukwuneke, Chiamaka; Odu, Nkiruka; Wairagala, Eric,; Oyerinde, Samuel; Siro, Clemencia; Saul Bateesa, Tobius; Oloyede, Temilola; Wambui, Yvonne; Akinode, Victor; Nabagereka, Deborah; Katusiime, Maurice; Awokoya, Ayodele; Mboup, Mouhamadane; Gebreyohannes, Dibora; Tilaye, Henok; Nwaike, Kelechi; Wolde, Degaga; Faye, Abdoulaye; Sibanda, Blessing; Ahia, Orevaoghene; Dossou, Bonaventure,; Ogueji, Kelechi; Thierno, Ibrahima; DIALLO, Abdoulaye; Akinfaderin, Adewale; Marengereke, Tendai; Osei, Salomey
|
|
In: EISSN: 2307-387X ; Transactions of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03350962 ; Transactions of the Association for Computational Linguistics, The MIT Press, 2021, ⟨10.1162/tacl⟩ (2021)
|
|
Abstract:
International audience ; We take a step towards addressing the underrepresentation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of stateof-the-art methods across both supervised and transfer learning settings. Finally, we release the data, code, and models to inspire future research on African NLP. 1
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
|
|
URL: https://hal.inria.fr/hal-03350962 https://doi.org/10.1162/tacl https://hal.inria.fr/hal-03350962/document https://hal.inria.fr/hal-03350962/file/adelani_TACL2021.pdf
|
|
BASE
|
|
Hide details
|
|
22 |
Modified Gravity and Cosmology: An Update by the CANTATA Network
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03261155 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
23 |
Good Scientific Practice in MEEG Research: Progress and Perspectives
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03494100 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
24 |
Learning from Leaders-Deaf Theatre Innovations in the Time of COVID-19”
|
|
|
|
In: Student Articles, Chapters, Presentations, Learning Objects (2021)
|
|
BASE
|
|
Show details
|
|
25 |
International Centre for Language and Communicative Development: Corpus and Experimental Study: Children's Acquisition of Wh-questions, 2019 ...
|
|
|
|
BASE
|
|
Show details
|
|
26 |
Charting the impact of bilingualism on social attentional preferences in children with and without autism. ...
|
|
|
|
BASE
|
|
Show details
|
|
27 |
Domain-Specific Multi-Level IR Rewriting for GPU: The Open Earth Compiler for GPU-accelerated Climate Simulation ...
|
|
|
|
BASE
|
|
Show details
|
|
28 |
Modified Gravity and Cosmology: An Update by the CANTATA Network ...
|
|
|
|
BASE
|
|
Show details
|
|
29 |
Associations between Cardiovascular Signal Entropy and Cognitive Performance over Eight Years
|
|
|
|
In: Entropy ; Volume 23 ; Issue 10 (2021)
|
|
BASE
|
|
Show details
|
|
30 |
“Go, Vote, and Tweet It”: Interactivity in Online Protest-Related Discussions About the 2014 Catalan Referendum for Independence
|
|
|
|
In: International Journal of Communication; Vol 15 (2021); 23 ; 1932-8036 (2021)
|
|
BASE
|
|
Show details
|
|
31 |
From fast food to a well-balanced diet: toward a programme focused approach to feedback in higher education
|
|
|
|
BASE
|
|
Show details
|
|
32 |
Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning ...
|
|
|
|
BASE
|
|
Show details
|
|
33 |
Age and the visual speech benefit in noise (Beadle et al., 2021) ...
|
|
|
|
BASE
|
|
Show details
|
|
34 |
Age and the visual speech benefit in noise (Beadle et al., 2021) ...
|
|
|
|
BASE
|
|
Show details
|
|
35 |
O5S5: Documenting the experiences of the ASL Communities in the time of COVID-19 ...
|
|
|
|
BASE
|
|
Show details
|
|
36 |
O5S5: Documenting the experiences of the ASL Communities in the time of COVID-19 ...
|
|
|
|
BASE
|
|
Show details
|
|
37 |
O5S5: Documenting the experiences of the ASL Communities in the time of COVID-19 ...
|
|
|
|
BASE
|
|
Show details
|
|
38 |
LUX (Linguistic aspects Under eXamination): Discourse Analysis for Automatic Fake News Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
39 |
Mapping probability word problems to executable representations ...
|
|
|
|
BASE
|
|
Show details
|
|
40 |
Multi-Class Grammatical Error Detection for Correction: A Tale of Two Systems ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|