1 |
LivingNER corpus: Named entity recognition, normalization & classification of species, pathogens and food ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
LivingNER corpus: Named entity recognition, normalization & classification of species, pathogens and food ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
DisTEMIST corpus: detection and normalization of disease mentions in spanish clinical cases ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
DisTEMIST corpus: detection and normalization of disease mentions in spanish clinical cases ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Deep Learning with Word Embedding Improves Kazakh Named-Entity Recognition
|
|
|
|
In: Information; Volume 13; Issue 4; Pages: 180 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Analyzing COVID-19 Medical Papers Using Artificial Intelligence: Insights for Researchers and Medical Professionals
|
|
|
|
In: Big Data and Cognitive Computing; Volume 6; Issue 1; Pages: 4 (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Creating Biographical Networks from Chinese and English Wikipedia
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03217972 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Sentiment Analysis of Arabic Documents
|
|
|
|
In: Natural Language Processing for Global and Local Business ; https://hal.archives-ouvertes.fr/hal-03124729 ; Fatih Pinarbasi; M. Nurdan Taskiran. Natural Language Processing for Global and Local Business, pp.307-331, 2021, 9781799842408. ⟨10.4018/978-1-7998-4240-8.ch013⟩ ; https://www.igi-global.com/ (2021)
|
|
BASE
|
|
Show details
|
|
9 |
WEIR-P: An Information Extraction Pipeline for the Wastewater Domain
|
|
|
|
In: RCIS 2021 - 5th International Conference on Research Challenges in Information Science ; https://hal.archives-ouvertes.fr/hal-03211461 ; RCIS 2021 - 5th International Conference on Research Challenges in Information Science, May 2021, Virtual, Cyprus (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Event Study: Advanced Machine Learning and Statistical Technique for Analyzing Sustainability in Banking Stocks
|
|
|
|
In: Mathematics; Volume 9; Issue 24; Pages: 3319 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
Ensemble of Deep Masked Language Models for Effective Named Entity Recognition in Health and Life Science Corpora
|
|
|
|
In: ISSN: 2504-0537 ; Frontiers in research metrics and analytics, Vol. 6 (2021) P. 689803 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
The Comparison Between the Tools for Named Entity Recognition
|
|
|
|
BASE
|
|
Show details
|
|
14 |
French Contextualized Word-Embeddings with a sip of CaBeRnet: a New French Balanced Reference Corpus
|
|
|
|
In: CMLC-8 - 8th Workshop on the Challenges in the Management of Large Corpora ; https://hal.inria.fr/hal-02678358 ; CMLC-8 - 8th Workshop on the Challenges in the Management of Large Corpora, May 2020, Marseille, France ; https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/CMLC-8book.pdf (2020)
|
|
Abstract:
International audience ; This paper describes and compares the impact of different types and size of training corpora on language models like ELMO. By asking the fundamental question of quality versus quantity we evaluate four French corpora for training on parsing scores, POS-tagging and named-entities recognition downstream tasks. The paper studies the relevance of a new corpus, CaBeRnet, featuring a representative range of language usage, including a balanced variety of genres (oral transcriptions, newspapers, popular magazines, technical reports, fiction, academic texts), in oral and written styles. We hypothesize that a linguistically representative and balanced corpora will allow the language model to be more efficient and representative of a given language and therefore yield better evaluation scores on different evaluation sets and tasks.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Balanced French Corpus; BERT; ELMo; French; Language Models; NER; Parsing; Tagging
|
|
URL: https://hal.inria.fr/hal-02678358/file/LREC_Fabre_Ortiz.pdf https://hal.inria.fr/hal-02678358 https://hal.inria.fr/hal-02678358/document
|
|
BASE
|
|
Hide details
|
|
15 |
UNER: Universal Named-Entity Recognition Framework ...
|
|
Alves, Diego. - : Leibniz Universität Hannover (LUH),L3S Research Center,CLEOPATRA ITN, 2020
|
|
BASE
|
|
Show details
|
|
16 |
Cantemist guidelines: neoplasms morphology annotation and mapping to CIEO-3 ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Cantemist guidelines: neoplasms morphology annotation and mapping to CIEO-3 ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Cantemist corpus: gold standard of oncology clinical cases annotated with CIE-O 3 terminology ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Cantemist corpus: gold standard of oncology clinical cases annotated with CIE-O 3 terminology ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Cantemist guidelines: neoplasms morphology annotation and mapping to CIEO-3 ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|