1 |
Assessing the impact of OCR noise on multilingual event detection over digitised documents
|
|
|
|
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Assessing the Impact of OCR Noise on Multilingual Event Detection over Digitised Documents ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Assessing the Impact of OCR Noise on Multilingual Event Detection over Digitised Documents ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
L3i_LBPAM at the FinSim-2 task: Learning Financial Semantic Similarities with Siamese Transformers
|
|
|
|
In: WWW '21: Companion Proceedings of the Web Conference 2021 ; WWW '21: The Web Conference 2021 ; https://hal.sorbonne-universite.fr/hal-03256324 ; WWW '21: The Web Conference 2021, Apr 2021, Ljubljana (virtual), Slovenia. pp.302-306, ⟨10.1145/3442442.3451384⟩ (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Discovering Spatial Relations in Litterature: what is the influence of OCR noise ?
|
|
|
|
In: NewsEye’s international conference ; https://hal.archives-ouvertes.fr/hal-03199729 ; NewsEye’s international conference, Mar 2021, Paris, France (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Multilingual Epidemic Event Extraction
|
|
|
|
In: Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings ; https://hal.archives-ouvertes.fr/hal-03480551 ; Hao-Ren Ke; Chei Sian Lee; Kazunari Sugiyama. Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings, 13133, Springer, pp.139-156, 2021, Lecture Notes in Computer Science, 978-3-030-91668-8. ⟨10.1007/978-3-030-91669-5_12⟩ (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie
|
|
|
|
In: COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference ; https://hal.archives-ouvertes.fr/hal-03320343 ; COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtuel), France (2021)
|
|
BASE
|
|
Show details
|
|
8 |
« Exploiter un corpus de données textuelles sans post-traitement : l’écriture burlesque de la Fronde »
|
|
|
|
In: ISSN: 2736-2337 ; Humanités numériques ; https://hal.archives-ouvertes.fr/hal-03500616 ; Humanités numériques, Bruxelles: Humanistica, 2021 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Impact Analysis of Document Digitization on Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Token-level Multilingual Epidemic Dataset for Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Impact Analysis of Document Digitization on Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Token-level Multilingual Epidemic Dataset for Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Multilingual Epidemic Event Extraction ...
|
|
|
|
Abstract:
In this paper, we focus on epidemic event extraction in multilingual and low-resource settings. The task of extracting epidemic events is defined as the detection of disease names and locations in a document. We experiment with a multilingual dataset comprising news articles from the medical domain with diverse morphological structures (Chinese, English, French, Greek, Polish, and Russian). We investigate various Transformer-based models, also adopting a two-stage strategy, first finding the documents that contain events and then performing event extraction. Our results show that error propagation to the downstream task was higher than expected. We also perform an in-depth analysis of the results, concluding that different entity characteristics can influence the performance. Moreover, we perform several preliminary experiments for the low-resourced languages present in the dataset using the mean teacher semi-supervised technique. Our findings show the potential of pre-trained language models benefiting from ...
|
|
Keyword:
Epidemiological surveillance, Multilingualism, Semi-supervised learning
|
|
URL: https://dx.doi.org/10.5281/zenodo.5779965 https://zenodo.org/record/5779965
|
|
BASE
|
|
Hide details
|
|
16 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
SinNer@Clef-Hipe2020 : Sinful adaptation of SotA models for Named Entity Recognition in French and German
|
|
|
|
In: CLEF 2020 Working Notes. Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum ; https://hal.inria.fr/hal-02984746 ; CLEF 2020 Working Notes. Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Sep 2020, Thessaloniki / Virtual, Greece ; https://impresso.github.io/CLEF-HIPE-2020/ (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Daniel@FinTOC’2 Shared Task: Title Detection and Structure Extraction
|
|
|
|
In: st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation @COLING’2020 ; 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation @COLING’2020 ; https://hal.archives-ouvertes.fr/hal-03024867 ; 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation @COLING’2020, Dec 2020, Barcelone, Spain (2020)
|
|
BASE
|
|
Show details
|
|
|
|