Home
Catalogue search
Refine your search:
Keyword:
classification (336)
info:eu-repo / classification / ddc / 150 (301)
info:eu-repo / classification / ddc / 004 (266)
Tierbezeichnung (262)
historical linguistics (255)
Language classification (253)
info:eu-repo / classification / ddc / 410 (245)
ddc:004 (242)
DATA processing & computer science (230)
Classification (213)
more
Creator / Publisher:
The Pennsylvania State University CiteSeerX Archives (445)
Universitäts- und Landesbibliothek Münster (149)
Frauenfelder, Ulrich Hans (91)
Bouillon, Pierrette (87)
Scherrer, Yves (57)
Waibel, Alex (48)
Franck, Julie (44)
Bronckart, Jean-Paul (42)
Shlonsky, Ur (42)
Delage, Hélène (40)
more
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Page:
1
2
3
4
5
...
289
Hits 1 – 20 of 5.767
1
Cross-lingual few-shot hate speech and offensive language detection using meta learning
Mozafari, Marzieh
;
Farahbakhsh, Reza
;
Crespi, Noel
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-03559484 ; IEEE Access, IEEE, 2022, 10, pp.14880-14896. ⟨10.1109/ACCESS.2022.3147588⟩ (2022)
Abstract:
International audience ; Automatic detection of abusive online content such as hate speech, offensive language, threats, etc. has become prevalent in social media, with multiple efforts dedicated to detecting this phenomenon in English. However, detecting hatred and abuse in low-resource languages is a non-trivial challenge. The lack of sufficient labeled data in low-resource languages and inconsistent generalization ability of transformer-based multilingual pre-trained language models for typologically diverse languages make these models inefficient in some cases. We propose a meta learning-based approach to study the problem of few-shot hate speech and offensive language detection in low-resource languages that will allow hateful or offensive content to be predicted by only observing a few labeled data items in a specific target language. We investigate the feasibility of applying a meta learning approach in cross-lingual few-shot hate speech detection by leveraging two meta learning models based on optimization-based and metric-based (MAML and Proto-MAML) methods. To the best of our knowledge, this is the first effort of this kind. To evaluate the performance of our approach, we consider hate speech and offensive language detection as two separate tasks and make two diverse collections of different publicly available datasets comprising 15 datasets across 8 languages for hate speech and 6 datasets across 6 languages for offensive language. Our experiments show that meta learning-based models outperform transfer learning-based models in a majority of cases, and that Proto-MAML is the best performing model, as it can quickly generalize and adapt to new languages with only a few labeled data points (generally, 16 samples per class yields an effective performance) to identify hateful or offensive content.
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
;
[INFO.INFO-NI]Computer Science [cs]/Networking and Internet Architecture [cs.NI]
;
[INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI]
;
Cross-lingual classification
;
Few-shot learning
;
Hate speech
;
Meta learning
;
Offensive language
;
Transfer learning
;
XLMRoBERTa
URL:
https://doi.org/10.1109/ACCESS.2022.3147588
https://hal.archives-ouvertes.fr/hal-03559484
BASE
Hide details
2
FAIRsharing record for: General Ontology for Linguistic Description ... : GOLD ...
FAIRsharing Team
. - : FAIRsharing, 2022
BASE
Show details
3
Unsupervised quantification of entity consistency between photos and text in real-world news ...
Müller-Budack, Eric
. - : Hannover : Institutionelles Repositorium der Leibniz Universität Hannover, 2022
BASE
Show details
4
Danish Fungi 2020
Picek, Lukáš
;
Šulc, Milan
;
Matas, Jiří
. - : IEEE/CVF, 2022
BASE
Show details
5
EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
Freienthal, Linda
;
Pelicon, Andraž
;
Martinc, Matej
. - : Ekspress Meedia Group, 2022. : Styria Media Group, 2022
BASE
Show details
6
О ЛЕКСИКО-ГРАММАТИЧЕСКИХ РАЗРЯДАХ ИМЕН СУЩЕСТВИТЕЛЬНЫХ В ТАБАСАРАНСКОМ ЯЗЫКЕ ... : ABOUT LEXICAL AND GRAMMATICAL CATEGORIES OF NOUNS IN THE TABASARAN LANGUAGE ...
Н.Э. Сафаралиев
. - : Мир науки, культуры, образования, 2022
BASE
Show details
7
Multi language Email Classification Using Transfer learning
Sousa, Mário Jorge Carvalho de
. - 2022
BASE
Show details
8
Mining an English-Chinese parallel Dataset of Financial News
Turenne, Nicolas
;
Chen, Ziwei
;
Fan, Guitao
...
In: Journal of Open Humanities Data; Vol 8 (2022); 9 ; 2059-481X (2022)
BASE
Show details
9
Discriminating Bacterial Infection from Other Causes of Fever Using Body Temperature Entropy Analysis
Borja Vargas; David Cuesta-Frau; Paula González-López; María-José Fernández-Cotarelo; Óscar Vázquez-Gómez; Ana Colás; Manuel Varela
In: Entropy; Volume 24; Issue 4; Pages: 510 (2022)
BASE
Show details
10
Fokusgruppenkorpus "Personenreferenz im Dialekt"
Schweden, T.
(Theresa);
Dammel, A.
(Antje). - 2022
BASE
Show details
11
The Multilingual Pragmatics of New Englishes: An Analysis of Question Tags in Nigerian English
Westphal, M.
(Michael). - 2022
BASE
Show details
12
The phonetics and phonology of Hong Kong English: a study of fricatives
Ho, S.Y.B.
(Sin). - 2022
BASE
Show details
13
Code: Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks ...
, Dennler
. - : Zenodo, 2022
BASE
Show details
14
Code: Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks ...
, Dennler
. - : Zenodo, 2022
BASE
Show details
15
Addressing multilingualism in the GoTriple discovery platform ...
Dumouchel, Suzanne
. - : Zenodo, 2022
BASE
Show details
16
Addressing multilingualism in the GoTriple discovery platform ...
Dumouchel, Suzanne
. - : Zenodo, 2022
BASE
Show details
17
The Terms of “You(s)”: How the Term of Address Used by Conversational Agents Influences User Evaluations in French and German Linguaculture ...
Ollier, Joseph
;
Nißen, Marcia Katharina
;
von Wangenheim, Florian
. - : ETH Zurich, 2022
BASE
Show details
18
'Muscles of mussels' and 'hooks of bananas' - the (incipient) numeral classifier system of Ugare (Tivoid, Cameroon/Nigeria) ...
Angitso, Michael
. - : Open Science Framework, 2022
BASE
Show details
19
Towards reconstructing a Proto-Tivoid numeral classifier system ...
Angitso, Michael
. - : Open Science Framework, 2022
BASE
Show details
20
Measuring Semantic Similarity of Documents by Using Named Entity Recognition Methods
Muñoz Morales, David Efraín
. - : Technological University Dublin, 2022
In: Masters (2022)
BASE
Show details
Page:
1
2
3
4
5
...
289
Mobile view
All
Catalogues
UB Frankfurt Linguistik
39
IDS Mannheim
0
OLC Linguistik
93
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
3
Leibniz-Centre General Linguistics (ZAS)
5
Bibliographies
BLLDB
550
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
59
MPI for Psycholinguistics
28
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
5.124
Linguistik-Repository
1
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern