DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 37

1
Homepage2Vec: Language-Agnostic Website Embedding and Classification ...
Abstract: Currently, publicly available models for website classification do not offer an embedding method and have limited support for languages beyond English. We release a dataset of more than two million category-labeled websites in 92 languages collected from Curlie, the largest multilingual human-edited Web directory. The dataset contains 14 website categories aligned across languages. Alongside it, we introduce Homepage2Vec, a machine-learned pre-trained model for classifying and embedding websites based on their homepage in a language-agnostic way. Homepage2Vec, thanks to its feature set (textual content, metadata tags, and visual attributes) and recent progress in natural language representation, is language-independent by design and generates embedding-based representations. We show that Homepage2Vec correctly classifies websites with a macro-averaged F1-score of 0.90, with stable performance across low- as well as high-resource languages. Feature analysis shows that a small subset of efficiently computable ... : Published in Proc. of ICWSM 2022 ...
Keyword: Artificial Intelligence cs.AI; Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2201.03677
https://arxiv.org/abs/2201.03677
BASE
Hide details
2
Better than Average: Paired Evaluation of NLP systems ...
BASE
Show details
3
Classifying Dyads for Militarized Conflict Analysis ...
BASE
Show details
4
Classifying Dyads for Militarized Conflict Analysis
In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (2021)
BASE
Show details
5
Cognitive Network Topology and Optimization of the Mental Lexicon ...
Burlacu, Adrian; West, Robert. - : Unpublished, 2021
BASE
Show details
6
Linguistic effects on news headline success: Evidence from thousands of online field experiments (Registered Report Protocol)
In: PLoS One (2021)
BASE
Show details
7
On the limitations of cross-lingual encoders as exposed by reference-free machine translation evaluation
Zhao, Wei; Glavaš, Goran; Peyrard, Maxime. - : Association for Computational Linguistics, 2020
BASE
Show details
8
Crosslingual Document Embedding as Reduced-Rank Ridge Regression ...
BASE
Show details
9
Message Distortion in Information Cascades
In: http://infoscience.epfl.ch/record/270657 (2019)
BASE
Show details
10
Message Distortion in Information Cascades ...
BASE
Show details
11
Reverse-Engineering Satire, or "Paper on Computational Humor Accepted despite Making Serious Advances"
In: http://infoscience.epfl.ch/record/271147 (2019)
BASE
Show details
12
Why the World Reads Wikipedia: Beyond English Speakers
In: http://infoscience.epfl.ch/record/270302 (2019)
BASE
Show details
13
Crosslingual Document Embedding as Reduced-Rank Ridge Regression
In: http://infoscience.epfl.ch/record/263893 (2019)
BASE
Show details
14
How Constraints Affect Content: The Case of Twitter’s Switch from 140 to 280 Characters
In: Proceedings of the International AAAI Conference on Web and Social Media; Vol. 12 No. 1 (2018): Twelfth International AAAI Conference on Web and Social Media ; 2334-0770 ; 2162-3449 (2018)
BASE
Show details
15
Armed Conflicts in Online News: A Multilingual Study
In: Proceedings of the International AAAI Conference on Web and Social Media; Vol. 11 No. 1 (2017): Eleventh International AAAI Conference on Web and Social Media ; 2334-0770 ; 2162-3449 (2017)
BASE
Show details
16
Beyond the FN: A spatio-temporal analysis of the neural correlates of feedback processing in a virtual Blackjack game
In: Brain and cognition. - San Diego, Calif. [u.a.] : Elsevier Science 86 (2014), 104-115
OLC Linguistik
Show details
17
A randomised controlled trial of a theory-based interactive internet-based smoking cessation intervention ('StopAdvisor'): Study protocol
BASE
Show details
18
Conflict adaptation is reflected by response interference
In: Journal of cognitive psychology. - Abingdon : Routlegde, Taylor & Francis Group 24 (2012) 4, 457-467
OLC Linguistik
Show details
19
Cambridge Handbook of Computational Psychology - by Ron Sun [Editor] [Rezension]
In: The journal of mind and behavior. - New York, NY : Institute of Mind and Behavior, Inc. 30 (2009) 4, 337-344
BLLDB
OLC Linguistik
Show details
20
Differential effects of aging on processes underlying task switching
In: Brain and cognition. - San Diego, Calif. [u.a.] : Elsevier Science 68 (2008) 1, 67-80
BLLDB
OLC Linguistik
Show details

Page: 1 2

Catalogues
0
0
12
0
0
0
0
Bibliographies
10
0
0
0
0
0
0
0
2
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
17
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern