1 |
Data set for "Token-Level Multilingual Epidemic Dataset for Event Extraction" ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Data set for "Token-Level Multilingual Epidemic Dataset for Event Extraction" ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Multilingual Epidemic Event Extraction
|
|
|
|
In: Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings ; https://hal.archives-ouvertes.fr/hal-03480551 ; Hao-Ren Ke; Chei Sian Lee; Kazunari Sugiyama. Towards Open and Trustworthy Digital Societies. 23rd International Conference on Asia-Pacific Digital Libraries, ICADL 2021, Virtual Event, December 1–3, 2021, Proceedings, 13133, Springer, pp.139-156, 2021, Lecture Notes in Computer Science, 978-3-030-91668-8. ⟨10.1007/978-3-030-91669-5_12⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie
|
|
|
|
In: COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference ; https://hal.archives-ouvertes.fr/hal-03320343 ; COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtuel), France (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Token-level Multilingual Epidemic Dataset for Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Token-level Multilingual Epidemic Dataset for Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
A Dataset for Multi-lingual Epidemiological Event Extraction
|
|
|
|
In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) ; https://hal.archives-ouvertes.fr/hal-02732848 ; Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France. pp.4139-4144 (2020)
|
|
BASE
|
|
Show details
|
|
14 |
Multilingual Epidemiological Text Classification: A Comparative Study
|
|
|
|
In: Proceedings of the 28th International Conference on Computational Linguistics ; COLING, International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03089807 ; COLING, International Conference on Computational Linguistics, Dec 2020, Barcelone, Spain. pp.6172-6183, ⟨10.18653/v1/2020.coling-main.543⟩ (2020)
|
|
Abstract:
International audience ; In this paper, we approach the multilingual text classification task in the context of the epidemiological field. Multilingual text classification models tend to perform differently across different languages (low-or high-resource), more particularly when the dataset is highly imbalanced, which is the case for epidemiological datasets. We conduct a comparative study of different machine and deep learning text classification models using a dataset comprising news articles related to epidemic outbreaks from six languages, four low-resourced and two high-resourced, in order to analyze the influence of the nature of the language, the structure of the document, and the size of the data. Our findings indicate that the performance of the models based on fine-tuned language models exceeds by more than 50% the chosen baseline models that include a specialized epidemiological news surveillance system and several machine learning models. Also, low-resource languages are highly influenced not only by the typology of the languages on which the models have been pre-trained or/and fine-tuned but also by their size. Furthermore, we discover that the beginning and the end of documents provide the most salient features for this task and, as expected, the performance of the models was proportionate to the training data size.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
|
|
URL: https://hal.archives-ouvertes.fr/hal-03089807/file/2020.coling-main.543.pdf https://hal.archives-ouvertes.fr/hal-03089807 https://hal.archives-ouvertes.fr/hal-03089807/document https://doi.org/10.18653/v1/2020.coling-main.543
|
|
BASE
|
|
Hide details
|
|
15 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Multilingual Epidemiological Text Classification: A Comparative Study ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Data for "A Dataset for Multi-lingual Epidemiological Event Extraction" ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
A Dataset for Multi-lingual Epidemiological Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
A Dataset for Multi-lingual Epidemiological Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
A Dataset for Multi-lingual Epidemiological Event Extraction ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|