Search in the Catalogues and Directories

Hits 1 – 20 of 216

1
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
2
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
3
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
BASE
4
Curlie Dataset - Language-agnostic Website Embedding and Classification ...
Lugeon, Sylvain; Piccardi, Tiziano. - : figshare, 2022
Abstract: Full Curlie dataset: This dataset contains the URLs scraped from curlie.org along with their multilingual labels. Each label corresponds to the sub-category under which the URL was referenced in Curlie. We also provide a mapping between English labels and labels from other languages for alignment. The URLs have been filtered to contain only homepages. Each distinct URL is indexed with a unique identifier (uid). Files: curlie.csv.gz > [url, uid, label, lang] x 2,275,150 samples; mapping.json.gz > [english_label, matchings] x 35,946 labels. Processed Curlie dataset: This is the data used to train Homepage2vec. URLs have been filtered further: websites listed under the Regional top-category were dropped, as well as non-accessible websites. This filtering yields 1,018,207 valid URLs. The labels are aligned across languages and reduced to the 14 top-categories (classes). Because a URL can belong to several classes, a binary vector is used. The ...
Keyword: 170203 Knowledge Representation and Machine Learning; 80505 Web Technologies excl. Web Search; 80704 Information Retrieval and Web Search; Applied Computer Science; FOS Computer and information sciences; FOS Media and communications; FOS Psychology
URL: https://figshare.com/articles/dataset/Curlie_Dataset_-_Language-agnostic_Website_Embedding_and_Classification/19406693/3
https://dx.doi.org/10.6084/m9.figshare.19406693.v3
BASE
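The abstract of hit 4 says that a URL can belong to several classes and is therefore labelled with a binary vector. A minimal sketch of that encoding follows. The class names, the sample rows, and the helper `multi_hot` are hypothetical and for illustration only; the dataset's actual 14 class names and file layout are not given in this listing.

```python
from collections import defaultdict


def multi_hot(rows, classes):
    """Map each uid to a binary vector with one slot per class.

    rows: iterable of (uid, label) pairs; a uid may appear several times.
    classes: ordered list of class names (hypothetical here).
    """
    index = {name: i for i, name in enumerate(classes)}
    vectors = defaultdict(lambda: [0] * len(classes))
    for uid, label in rows:
        # Set the slot for this class; repeated labels are harmless.
        vectors[uid][index[label]] = 1
    return dict(vectors)


# Hypothetical class names and rows, not taken from the dataset.
classes = ["Arts", "Science", "Sports"]
rows = [(1, "Arts"), (1, "Science"), (2, "Sports")]
vectors = multi_hot(rows, classes)
```

With the real data, `classes` would hold the 14 top-categories and `rows` would come from the aligned labels, assuming they can be read as (uid, label) pairs.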
5
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
6
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
7
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
8
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
Rovetto, Robert J.. - : ESIP, 2022
BASE
9
3D Serious Game Modeling and Design: Contributions to Language Learning ; Modélisation et Conception de jeu sérieux tridimensionnel : Contributions à l’apprentissage des langues
Tazouti, Yassine. - : HAL CCSD, 2021
In: https://hal.archives-ouvertes.fr/tel-03315793 ; Environnements Informatiques pour l'Apprentissage Humain [Computer Environments for Human Learning]. Université Ibn Tofail, Kénitra (Morocco), 2021. French (2021)
BASE
10
Enabling Dataflow Optimization for Quantum Programs ...
BASE
11
КОМПЬЮТЕРНЫЕ ИГРЫ КАК СОВРЕМЕННАЯ ТЕХНОЛОГИЯ ОБУЧЕНИЯ ИНОСТРАННОМУ ЯЗЫКУ В НЕЯЗЫКОВОМ ВУЗЕ ... : COMPUTER GAMES AS A MODERN TECHNOLOGY OF TEACHING A FOREIGN LANGUAGE IN A NON-LANGUAGE UNIVERSITY ...
Табуева И.Н.; Кашицин И.А.. - : The Scientific Heritage, 2021
BASE
12
VocalGeo: Using Speech to Provide Geospatial Context in the Classroom ...
Gilbert, Thomas. - : figshare, 2021
BASE
13
VocalGeo: Using Speech to Provide Geospatial Context in the Classroom ...
Gilbert, Thomas. - : figshare, 2021
BASE
14
Open Letter to Springer Editors and Their Response ...
Hochgesang, Julie. - : figshare, 2021
BASE
15
Open Letter to Springer Editors ...
Hochgesang, Julie. - : figshare, 2021
BASE
16
Open Letter to Springer Editors and Their Response ...
Hochgesang, Julie. - : figshare, 2021
BASE
17
Open Letter to Springer Editors ...
Hochgesang, Julie. - : figshare, 2021
BASE
18
Open Letter to Springer Editors ...
Hochgesang, Julie. - : figshare, 2021
BASE
19
Language of fungi derived from electrical spiking activity ...
Adamatzky, Andrew. - : arXiv, 2021
BASE
20
Semantic (Orbital) Sweep - Knowledge modeling and Semantic technology to clean Earth orbit and make spaceflight safer ...
Rovetto, Robert J.. - : ESIP, 2021
BASE

© 2013 - 2024 Lin|gu|is|tik