
Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
Data for Cyrillic Reference Parsing ...
Abstract: We provide a synthetic reference data set covering over 100,000 labeled references (mostly Russian language) and a manually annotated set of real references (771 in number) gathered from multidisciplinary Cyrillic script publications. Background: Extracting structured data from bibliographic references is a crucial task for the creation of scholarly databases. While approaches, tools, and evaluation data sets for the task exist, there is a distinct lack of support for languages other than English and scripts other than the Latin alphabet. A significant portion of the scientific literature that is thereby excluded consists of publications written in Cyrillic script languages. To address this problem, we introduce a new multilingual and multidisciplinary data set of over 100,000 labeled reference strings. The data set covers multiple Cyrillic languages and contains over 700 manually labeled references, while the remainder are generated synthetically. With random samples of varying size of this data, we train ...
Keywords: citation data; citation field extraction; Cyrillic; digital libraries; NLP; references; scholarly data; SDU2022; sequence labeling
URL: https://zenodo.org/record/5801913
https://dx.doi.org/10.5281/zenodo.5801913
BASE
2
Data for Training and Evaluating Metadata Extraction Models based on 15 Thousand Cyrillic Script Publications ...
BASE
3
Token-level Multilingual Epidemic Dataset for Event Extraction ...
BASE
4
Token-level Multilingual Epidemic Dataset for Event Extraction ...
BASE
5
Data for Training and Evaluating Metadata Extraction Models based on 15 Thousand Cyrillic Script Publications ...
BASE
6
Data for Cyrillic Reference Parsing ...
BASE
7
HTLinker: A Head-to-Tail Linker for Nested Named Entity Recognition
In: Symmetry ; Volume 13 ; Issue 9 (2021)
BASE
8
A Deep Neural Network-Based Model for Named Entity Recognition for Hindi Language
In: ETSU Faculty Works (2020)
BASE
9
NAT: Noise-Aware Training for Robust Neural Sequence Labeling
In: Fraunhofer IAIS (2020)
BASE
10
Modeling a label global context for sequence tagging in recurrent neural networks ; Modélisation d'un contexte global d'étiquettes pour l'étiquetage de séquences dans les réseaux neuronaux récurrents
In: Journée commune AFIA-ATALA sur le Traitement Automatique des Langues et l’Intelligence Artificielle pendant la onzième édition de la plate-forme Intelligence Artificielle (PFIA 2018), Jul 2018, Nancy, France ; https://hal.archives-ouvertes.fr/hal-02002111 ; https://pfia2018.loria.fr/journee-tal/ (2018)
BASE
11
A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback
In: Clematide, Simon (2018). A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback. In: Barbaresi, Adrien; Biber, Hanno; Neubarth, Friedrich; Osswald, Rainer. 14th Conference on Natural Language Processing - KONVENS 2018. Vienna: Verlag der Österreichischen Akademie der Wissenschaften, 29-33. (2018)
BASE
12
Semi-Markov models for sequence segmentation
In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007) ; http://www.aclweb.org/anthology-new/D/D07/D07-1.pdf (2015)
BASE
13
Elephant: Sequence Labeling for Word and Sentence Segmentation
In: EMNLP 2013 ; https://hal.archives-ouvertes.fr/hal-01344500 ; EMNLP 2013, Oct 2013, Seattle, United States (2013)
BASE
14
Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling
In: Joint Conference on Human Language Technology / Empirical Methods in Natural Language Processing (HLT/EMNLP), 2005, Vancouver, British Columbia, Canada (2005)
BASE
15
Feature-Rich Information Extraction for the Technical Trend-Map Creation
In: http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings8/NTCIR/04-NTCIR8-PATMN-NishiyamaR.pdf
BASE
