
Search in the Catalogues and Directories

Hits 1–20 of 2,018

1
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
In: ISSN: 1751-5858 ; EISSN: 1751-5866 ; International Journal of Intelligent Information and Database Systems ; https://hal.inrae.fr/hal-03616243 ; International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ⟨10.1504/IJIIDS.2022.120146⟩ (2022)
BASE
2
Analyse automatique d’arguments et apprentissage multi-tâches : un cas d’étude
In: Revue Ouverte d'Intelligence Artificielle ; https://hal.mines-ales.fr/hal-03638222 ; Revue Ouverte d'Intelligence Artificielle, Association pour la diffusion de la recherche francophone en intelligence artificielle, 2022, 3 (3-4), pp.201-222. ⟨10.5802/roia.29⟩ (2022)
BASE
3
Assessing the impact of OCR noise on multilingual event detection over digitised documents
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
BASE
4
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
BASE
5
EMBEDDIA tools output example corpus of Estonian, Croatian and Latvian news articles 1.0
Abstract: This dataset contains articles from EMBEDDIA Media partners with various information added by the tools developed within the EMBEDDIA project:
- 12,390 Estonian articles from 2019 with tags given by Ekspress Meedia. The complete dataset without the output of EMBEDDIA tools is available at http://hdl.handle.net/11356/1408
- 5,000 Croatian articles from autumn of 2010 with tags given by 24sata. The complete dataset without the output of EMBEDDIA tools is available at http://hdl.handle.net/11356/1410
- 15,264 Latvian articles from 2019 with tags given by Ekspress Meedia. The complete dataset without the output of EMBEDDIA tools is available at http://hdl.handle.net/11356/1409
All the articles in the dataset have been analysed with the texta-mlp Python package (https://pypi.org/project/texta-mlp/) via the EMBEDDIA Media Assistant's Texta Toolkit (https://docs.texta.ee/). The following tools were used to analyse the articles:
- Latin1 and Latin2 Named Entity Recognition Tool modules (Cabrera-Diego et al., 2021, both described in https://aclanthology.org/2021.bsnlp-1.12/). The Latin 1 results can be found in the folders annotated_articles_ner_latin1/ and annotated_articles_all_tools/, while the Latin 2 results are in annotated_articles_nerlatin2/ or annotated_articles_all_tools/.
- RAKUN keyword extractor. RAKUN (Škrlj et al. 2019) is an unsupervised system for keyword extraction, so it can be used for any language. It detects keywords by turning the text into a graph; the most important nodes in the graph mostly turn out to be the keywords. It is described in https://link.springer.com/chapter/10.1007/978-3-030-31372-2_26. The keyword annotation results can be found in the folder annotated_articles_rakun/ or annotated_articles_all_tools/.
- TNT-KID keyword extractor. TNT-KID (Martinc et al. 2021) is a supervised system for automatic keyword extraction. It was trained on a corpus of articles with human-assigned keywords. For Croatian, the annotators were 24sata editors, for Estonian the Ekspress Meedia staff and for Latvian the Latvian Delfi staff. The system is further documented at https://doi.org/10.1017/S1351324921000127. For Croatian only TNT-KID was applied, while for Estonian and Latvian, TNT-KID with TF-IDF, an extension by Koloski et al. (https://aclanthology.org/2021.hackashop-1.4.pdf), was used. The results of applying this tool are found in the folder annotated_articles_tnt_kid/ or annotated_articles_all_tools/.
- Sentiment analysis. Our news sentiment analyser (Pelicon et al. 2020) labels a news article as being of positive, negative, or neutral sentiment, using a fine-tuned multilingual BERT model, which was trained on Slovene sentiment-annotated news articles. The system is further documented in https://doi.org/10.3390/app10175993. The results of this tool are found in the folder annotated_articles_sentiment/ or annotated_articles_all_tools/.
All the data is encoded in "JSON Lines" format. Each folder has its own README file which explains the structure of the files.
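Since the abstract states that all data is encoded in "JSON Lines" format, a file from the dataset could probably be read along the following lines. This is only a minimal sketch, assuming the usual JSON Lines convention of one JSON object per line in UTF-8; the helper name read_json_lines is hypothetical, and the actual fields of each record are documented in the README of each folder, not here.

```python
import json
from pathlib import Path
from typing import Iterator


def read_json_lines(path: Path) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of a JSON Lines file.

    Assumption: each line holds a single JSON object, encoded as UTF-8.
    """
    with path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)
```

Reading lazily, one line at a time, keeps memory use low even for folders with many thousands of articles.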
Keyword: keyword extraction; named entity recognition; sentiment classification
URL: http://hdl.handle.net/11356/1485
BASE
6
HIPE-2022 Shared Task Named Entity Datasets ...
BASE
7
HIPE-2022 Shared Task Named Entity Datasets ...
BASE
8
HIPE-2022 Shared Task Named Entity Datasets ...
BASE
9
HIPE-2022 Shared Task Named Entity Datasets ...
BASE
10
FiNER-139: A Financial Numeric Entity Recognition Dataset ...
BASE
11
FiNER-139 ...
BASE
12
FiNER-139 ...
BASE
13
FiNER-139: A Financial Numeric Entity Recognition Dataset ...
BASE
14
A Methodology to Transform Speech into Symbolic Gestures ...
Dr Atul Kumar; Dr Vinodani Katiyar. - : Zenodo, 2022
BASE
15
A Methodology to Transform Speech into Symbolic Gestures ...
Dr Atul Kumar; Dr Vinodani Katiyar. - : Zenodo, 2022
BASE
16
MEduKG: A Deep-Learning-Based Approach for Multi-Modal Educational Knowledge Graph Construction
In: Information; Volume 13; Issue 2; Pages: 91 (2022)
BASE
17
Semantic Feature Extraction Using SBERT for Dementia Detection
In: Brain Sciences; Volume 12; Issue 2; Pages: 270 (2022)
BASE
18
Text Mining from Free Unstructured Text: An Experiment of Time Series Retrieval for Volcano Monitoring
In: Applied Sciences; Volume 12; Issue 7; Pages: 3503 (2022)
BASE
19
Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
In: Sustainability; Volume 14; Issue 2; Pages: 614 (2022)
BASE
20
Sentence Boundary Extraction from Scientific Literature of Electric Double Layer Capacitor Domain: Tools and Techniques
In: Applied Sciences; Volume 12; Issue 3; Pages: 1352 (2022)
BASE
