
Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT)
In: Language Resources and Evaluation, Springer Verlag, 2017, 52 (2), pp. 571-601 ; ISSN: 1574-020X ; EISSN: 1574-0218 ; https://hal.archives-ouvertes.fr/hal-01631743 ; ⟨10.1007/s10579-017-9382-y⟩ (2017)
Abstract: Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Clinical narrative; Inter-annotator agreement; Personal health information; Semantic annotations
URL: https://doi.org/10.1007/s10579-017-9382-y
https://hal.archives-ouvertes.fr/hal-01631743/file/lre.pdf
https://hal.archives-ouvertes.fr/hal-01631743
https://hal.archives-ouvertes.fr/hal-01631743/document
BASE
2
Automatic computation of CHA2DS2-VASc score: information extraction from clinical texts for thromboembolism risk assessment.
In: AMIA Annual Symposium Proceedings [electronic resource] / AMIA Symposium, 2011, pp. 501-10 ; https://hal.archives-ouvertes.fr/hal-00748588 (2011)
BASE
3
Automatic computation of CHA2DS2-VASc score: Information extraction from clinical texts for thromboembolism risk assessment
Grouin, Cyril; Deléger, Louise; Rosier, Arnaud. - : American Medical Informatics Association, 2011
BASE
