DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Creation of a multilingual aligned corpus with Ukrainian as the target language and its exploitation
In: Computational Linguistics and Intelligent Systems ; https://hal.archives-ouvertes.fr/hal-01736363 ; Computational Linguistics and Intelligent Systems, Apr 2017, Kharkiv, Ukraine (2017)
BASE
Show details
2
Unsupervised acquisition of morphological resources for Ukrainian
In: Computational Linguistics and Intelligent Systems ; https://hal.archives-ouvertes.fr/hal-01736400 ; Computational Linguistics and Intelligent Systems, Apr 2017, Kharkiv, Ukraine (2017)
BASE
Show details
3
Understanding of unknown medical words
In: Biomedical NLP Workshop associated with RANLP 2017 ; https://hal.archives-ouvertes.fr/hal-01736408 ; Biomedical NLP Workshop associated with RANLP 2017, Sep 2017, Varna, Bulgaria (2017)
BASE
Show details
4
Querying biomedical Linked Data with natural language questions
In: ISSN: 1570-0844 ; EISSN: 2210-4968 ; Semantic Web – Interoperability, Usability, Applicability ; https://hal.archives-ouvertes.fr/hal-01426686 ; Semantic Web – Interoperability, Usability, Applicability, IOS Press, 2017, 8, pp.581-599. ⟨10.3233/SW-160244⟩ (2017)
BASE
Show details
5
Generating and executing complex natural language queries across linked data
In: International Congress on Medical Informatics ; https://hal.archives-ouvertes.fr/hal-01971222 ; International Congress on Medical Informatics, Jan 2015, Sao Paulo, Brazil (2015)
BASE
Show details
6
Tuning HeidelTime for identifying time expressions in clinical texts in English and French
In: International Workshop on Health Text Mining and Information Analysis ; https://hal.archives-ouvertes.fr/hal-01972761 ; International Workshop on Health Text Mining and Information Analysis, Jan 2014, Gothenburg, Sweden (2014)
BASE
Show details
7
Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case-study
In: Biomedical Informatics Insights ; https://hal.archives-ouvertes.fr/hal-01972779 ; Biomedical Informatics Insights, 2013, 13p (2013)
Abstract: International audience ; Medical entity recognition is currently generally performed by data-driven methods based on supervised machine learning. Expert-based systems, where linguistic and domain expertise are directly provided to the system, for instance in the form of lexicons and pattern-based rules, are often combined with data-driven systems. We present here a case study where an existing expert-based medical entity recognition system, Ogmios, is combined with a data-driven system, Caramba, based on a linear-chain Conditional Random Field (CRF) classifier. We examine different methods to combine two such systems and test the most relevant ones through experiments performed on the i2b2/VA 2012 challenge data. Our case study specifically highlights the risk of overfitting incurred by an expert-based system. We observe that it prevents the combination of the two systems from obtaining improvements in precision, recall, or F-measure, and analyse the underlying mechanisms through a post-hoc feature-level analysis. We also observe that wrapping the expert-based system alone as attributes input to a CRF classifier does boost its F-measure from 0.603 to 0.710 (strict matching of types and boundaries, as per the conlleval program), bringing it on par with the data-driven system. The generality of this method remains to be further investigated.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Hybrid Meth- ods; Information Extraction; Machine Learning; Medical records; Natural Language Processing; Overfitting
URL: https://hal.archives-ouvertes.fr/hal-01972779
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
7
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern