1 |
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
|
|
|
|
In: ISSN: 1751-5858 ; EISSN: 1751-5866 ; International Journal of Intelligent Information and Database Systems ; https://hal.inrae.fr/hal-03616243 ; International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ⟨10.1504/IJIIDS.2022.120146⟩ (2022)
|
|
Abstract:
International audience ; This article presents an ontological and terminological resource guided process for targeted extraction of scientific experimental data. Our method relies on the scientific publication representation (SciPuRe) describing the extracted data through ontological, lexical and structural (using segments in the scientific documents) features. Relevance scores based on these features are computed to rank the results and filter out the numerous false positives. Linear and sequential combinations of these scores are presented and evaluated. Experiments were carried out on a corpus of 50 English language scientific papers in the food packaging field. They revealed that article segment are an effective criterion for filtering out a majority of the quantitative entity false positives using lexical scores. Moreover the best symbolic entity extraction results were obtained with a sequential combinations of semantic and lexical scores. These results enable the ranking of entities by relevance and the filtering of false positive results.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; data extraction; data relevance; data representation; information retrieval; ontological and terminological resource; OTR; web scientific documents
|
|
URL: https://hal.inrae.fr/hal-03616243 https://doi.org/10.1504/IJIIDS.2022.120146
|
|
BASE
|
|
Hide details
|
|
2 |
MEduKG: A Deep-Learning-Based Approach for Multi-Modal Educational Knowledge Graph Construction
|
|
|
|
In: Information; Volume 13; Issue 2; Pages: 91 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Prosodic Feature-Based Discriminatively Trained Low Resource Speech Recognition System
|
|
|
|
In: Sustainability; Volume 14; Issue 2; Pages: 614 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
Show details
|
|
5 |
LILLIE : information extraction and database integration using linguistics and learning-based algorithms ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Exploring Construction of a Company Domain-Specific Knowledge Graph from Financial Texts Using Hybrid Information Extraction
|
|
Jen, Chun-Heng. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021
|
|
BASE
|
|
Show details
|
|
8 |
Data for Training and Evaluating Metadata Extraction Models based on 15 Thousand Cyrillic Script Publications ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Data for Training and Evaluating Metadata Extraction Models based on 15 Thousand Cyrillic Script Publications ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
HittER: Hierarchical Transformers for Knowledge Graph Embeddings ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
AttentionRank: Unsupervised Keyphrase Extraction using Self and Cross Attentions ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Extracting Event Temporal Relations via Hyperbolic Geometry ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
An Empirical Study on Multiple Information Sources for Zero-Shot Fine-Grained Entity Typing ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|