1 |
Inference Annotation of a Chinese Corpus for Opinion Mining
|
|
|
|
In: LREC ; https://hal-inalco.archives-ouvertes.fr/hal-02507170 ; LREC, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
2 |
Inference Annotation of a Chinese Corpus for Opinion Mining
|
|
|
|
In: LREC ; https://hal-inalco.archives-ouvertes.fr/hal-02507170 ; LREC, May 2020, Marseille, France (2020)
|
|
BASE
|
|
Show details
|
|
3 |
French Levothyrox® Crisis: Retrospective Analysis of Social Media
|
|
|
|
In: International Society of Pharmacovigilance ; https://hal.archives-ouvertes.fr/hal-02411632 ; International Society of Pharmacovigilance, Springer International Publishing, Oct 2019, Bogota, Colombia (2019)
|
|
BASE
|
|
Show details
|
|
4 |
Generating a training corpus for OCR post-correction using encoder-decoder model
|
|
|
|
In: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers) ; International Joint Conference on Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01831147 ; International Joint Conference on Natural Language Processing, Nov 2017, Taipei, Taiwan ; https://www.aclweb.org/anthology/I17-1101 (2017)
|
|
BASE
|
|
Show details
|
|
5 |
CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French.
|
|
|
|
In: Workshop of the Cross-Language Evaluation Forum ; https://hal.archives-ouvertes.fr/hal-01665374 ; Workshop of the Cross-Language Evaluation Forum, CEUR-WS, Jan 2017, Dublin, Ireland (2017)
|
|
BASE
|
|
Show details
|
|
6 |
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT)
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01631743 ; Language Resources and Evaluation, Springer Verlag, 2017, 52 (2), pp.571-601. ⟨10.1007/s10579-017-9382-y⟩ (2017)
|
|
Abstract:
International audience ; Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Clinical narrative; Inter-annotator agreement; Personal health information; Semantic annotations
|
|
URL: https://doi.org/10.1007/s10579-017-9382-y https://hal.archives-ouvertes.fr/hal-01631743/file/lre.pdf https://hal.archives-ouvertes.fr/hal-01631743 https://hal.archives-ouvertes.fr/hal-01631743/document
|
|
BASE
|
|
Hide details
|
|
7 |
Identification of mentions and relations between bacteria and biotope from PubMed abstracts
|
|
|
|
In: BioNLP Shared-Task Workshop ; https://hal.archives-ouvertes.fr/hal-01831226 ; BioNLP Shared-Task Workshop, ACL, Jan 2016, Berlin, Germany (2016)
|
|
BASE
|
|
Show details
|
|
8 |
Low-resource OCR error detection and correction in French Clinical Texts
|
|
|
|
In: International Workshop on Health Text Mining and Information Analysis ; https://hal.archives-ouvertes.fr/hal-01831225 ; International Workshop on Health Text Mining and Information Analysis, ACL, Nov 2016, Austin, United States (2016)
|
|
BASE
|
|
Show details
|
|
9 |
Analyse des émotions, sentiments et opinions exprimés dans les tweets : présentation et résultats de l'édition 2015 du défi fouille de texte (DEFT)
|
|
|
|
In: Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2015) ; https://hal.archives-ouvertes.fr/hal-01617180 ; Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles (TALN 2015), Jun 2015, Caen, France ; http://www.atala.org/taln_archives/ateliers/2015/DEFT/deft-2015-long-001.pdf (2015)
|
|
BASE
|
|
Show details
|
|
10 |
Morpho-Syntactic Study of Errors from Speech Recognition System
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01831243 ; International Conference on Language Resources and Evaluation, Jan 2014, Reykjavik, Iceland (2014)
|
|
BASE
|
|
Show details
|
|
11 |
Reformatting clinical records based on global layout statistics
|
|
|
|
In: International Symposium on Semantic Mining in Biomedicine ; https://hal.archives-ouvertes.fr/hal-01831245 ; International Symposium on Semantic Mining in Biomedicine, Jan 2014, Aveiro, Portugal (2014)
|
|
BASE
|
|
Show details
|
|
12 |
Human Annotation of ASR Error Regions: is "gravity" a Sharable Concept for Human Annotators?
|
|
|
|
In: Ninth International Conference on Language Resources and Evaluation (LREC'14) ; https://hal.archives-ouvertes.fr/hal-01134802 ; Ninth International Conference on Language Resources and Evaluation (LREC'14), May 2014, Reykjavik, Iceland. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.3050-3056, 2014 ; http://lrec2014.lrec-conf.org/en/ (2014)
|
|
BASE
|
|
Show details
|
|
13 |
Approches à base de fréquences pour la simplification lexicale
|
|
|
|
In: TALN-RECITAL 2013 ; https://hal.archives-ouvertes.fr/hal-00838354 ; TALN-RECITAL 2013, Jun 2013, Les Sables d'Olonne, France. pp.493-506 (2013)
|
|
BASE
|
|
Show details
|
|
14 |
Automatic named entity pre-annotation for out-of-domain human annotation
|
|
|
|
In: Linguistic Annotation Workshop ; https://hal.archives-ouvertes.fr/hal-01831229 ; Linguistic Annotation Workshop, ACL, Jan 2013, Sofia, Bulgaria (2013)
|
|
BASE
|
|
Show details
|
|
15 |
Human annotation of asr error regions: Is ”gravity” a sharable concept for human annotators?
|
|
|
|
In: Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing (ERRARE 2013) ; https://halshs.archives-ouvertes.fr/halshs-01424915 ; Errors by Humans and Machines in Multimedia, Multimodal, Multilingual Data Processing (ERRARE 2013), Nov 2013, Ermenonville, France (2013)
|
|
BASE
|
|
Show details
|
|
16 |
Combining an expert-based medical entity recognizer to a machine-learning system: methods and a case-study
|
|
|
|
In: Biomedical Informatics Insights ; https://hal.archives-ouvertes.fr/hal-01972779 ; Biomedical Informatics Insights, 2013, 13p (2013)
|
|
BASE
|
|
Show details
|
|
17 |
Extended named entities annotation on OCRed documents: from corpus constitution to evaluation campaign
|
|
|
|
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01831254 ; International Conference on Language Resources and Evaluation, Jan 2012, Istanbul, Turkey (2012)
|
|
BASE
|
|
Show details
|
|
18 |
ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking
|
|
|
|
In: Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM) ; First Joint Conference on Lexical and Computational Semantics (*SEM) ; https://hal.archives-ouvertes.fr/hal-00790866 ; First Joint Conference on Lexical and Computational Semantics (*SEM), Jun 2012, Montreal, Canada. pp.487-492 ; http://aclweb.org/anthology-new/S/S12/S12-1068.pdf (2012)
|
|
BASE
|
|
Show details
|
|
19 |
Manual Corpus Annotation: Giving Meaning to the Evaluation Metrics
|
|
|
|
In: Proceedings of the International Conference on Computational Linguistics (COLING 2012) ; International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-00769639 ; International Conference on Computational Linguistics, Dec 2012, Mumbaï, India. pp.809--818 (2012)
|
|
BASE
|
|
Show details
|
|
20 |
Automatic computation of CHA2DS2-VASc score: information extraction from clinical texts for thromboembolism risk assessment.
|
|
|
|
In: AMIA . Annual Symposium proceedings [electronic resource] / AMIA Symposium. AMIA Symposium. ; https://hal.archives-ouvertes.fr/hal-00748588 ; AMIA . Annual Symposium proceedings [electronic resource] / AMIA Symposium. AMIA Symposium., 2011, 2011, pp.501-10 (2011)
|
|
BASE
|
|
Show details
|
|
|
|