DE eng

Search in the Catalogues and Directories

Hits 1 – 17 of 17

1
The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study
In: JMIR Med Inform (2020)
Abstract: BACKGROUND: Negation and speculation are critical elements in natural language processing (NLP)-related tasks, such as information extraction, as these phenomena change the truth value of a proposition. In the clinical narrative that is informal, these linguistic facts are used extensively with the objective of indicating hypotheses, impressions, or negative findings. Previous state-of-the-art approaches addressed negation and speculation detection tasks using rule-based methods, but in the last few years, models based on machine learning and deep learning exploiting morphological, syntactic, and semantic features represented as spare and dense vectors have emerged. However, although such methods of named entity recognition (NER) employ a broad set of features, they are limited to existing pretrained models for a specific domain or language. OBJECTIVE: As a fundamental subsystem of any information extraction pipeline, a system for cross-lingual and domain-independent negation and speculation detection was introduced with special focus on the biomedical scientific literature and clinical narrative. In this work, detection of negation and speculation was considered as a sequence-labeling task where cues and the scopes of both phenomena are recognized as a sequence of nested labels recognized in a single step. METHODS: We proposed the following two approaches for negation and speculation detection: (1) bidirectional long short-term memory (Bi-LSTM) and conditional random field using character, word, and sense embeddings to deal with the extraction of semantic, syntactic, and contextual patterns and (2) bidirectional encoder representations for transformers (BERT) with fine tuning for NER. RESULTS: The approach was evaluated for English and Spanish languages on biomedical and review text, particularly with the BioScope corpus, IULA corpus, and SFU Spanish Review corpus, with F-measures of 86.6%, 85.0%, and 88.1%, respectively, for NeuroNER and 86.4%, 80.8%, and 91.7%, respectively, for BERT. CONCLUSIONS: These results show that these architectures perform considerably better than the previous rule-based and conventional machine learning–based systems. Moreover, our analysis results show that pretrained word embedding and particularly contextualized embedding for biomedical corpora help to understand complexities inherent to biomedical text.
Keyword: Original Paper
URL: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7746498/
http://www.ncbi.nlm.nih.gov/pubmed/33270027
https://doi.org/10.2196/18953
BASE
Hide details
2
Simplifying drug package leaflets written in Spanish by using word embedding [<Journal>]
Segura-Bedmar, Isabel [Verfasser]; Martinez, Paloma [Sonstige]
DNB Subject Category Language
Show details
3
The CHEMDNER corpus of chemicals and drugs and its annotation principles
Krallinger, Martin; Rabal, Obdulia; Leitner, Florian. - : BioMed Central, 2015
BASE
Show details
4
A Proof-of-Concept for Orthographic Named Entity Correction in Spanish Voice Queries
BASE
Show details
5
A real time Named Entity Recognition system for Arabic text mining
BASE
Show details
6
A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents
BASE
Show details
7
An Illustrated Methodology for Evaluating ASR Systems
BASE
Show details
8
UC3M system: Determining the Extent, Type and Value of Time Expressions in TempEval-2
Moreno Schneider, Julián; Martínez, Paloma; Vicente-Díez, María Teresa. - : Association for Computational Linguistics, 2010
BASE
Show details
9
Automatic Identification of Biomedical Concepts in Spanish Language Unstructured Clinical Texts
BASE
Show details
10
Are Passages Enough? The MIRACLE Team Participation at QA@CLEF2009
BASE
Show details
11
Some Experiments in Evaluating ASR Systems Applied to Multimedia Retrieval
BASE
Show details
12
MOSTAS: un etiquetador morfo-semántico, anonimizador y corrector de historiales clínicos ; MOSTAS: A morpho-semantic tagger, anonymizer and spellchecker for clinical reports
Iglesias, Ana; Castro, Elena; Pérez Laínez, Rebeca. - : Sociedad española para el procesamiento del lenguaje natural, 2008
BASE
Show details
13
Combining Wikipedia and Newswire Texts for Question Answering in Spanish
BASE
Show details
14
Experiences in evaluating multilingual and text-image information retrieval
BASE
Show details
15
Natural language processing and information systems : revised papers
Euzenat, Jérôme (Mitarb.); Winkler, Christian (Mitarb.); Beuvron, François de (Mitarb.). - Berlin [u.a.] : Springer, 2001
BLLDB
UB Frankfurt Linguistik
Show details
16
El suplemento : problemas de caracterización y tipología
In: Real Academia Española. Boletín de la Real Academia Española. - Madrid : Aguirre 77 (1997) 270, 57-98
BLLDB
Show details
17
Observaciones sobre las características lexicográficas del 'Vocabulario científico y técnico'
In: Real Academia Española. Boletín de la Real Academia Española. - Madrid : Aguirre 72 (1992) 255, 173-196
BLLDB
Show details

Catalogues
1
0
0
0
1
0
0
Bibliographies
3
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
13
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern