1 |
Lessons Learned from the Usability Evaluation of a Simulated Patient Dialogue System
In: ISSN: 0148-5598 ; EISSN: 1573-689X ; Journal of Medical Systems ; https://hal.archives-ouvertes.fr/hal-03452553 ; Journal of Medical Systems, Springer Verlag (Germany), 2021, 45 (7), ⟨10.1007/s10916-021-01737-4⟩ (2021)
2 |
Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora
In: 13th Workshop on Building and Using Comparable Corpora (BUCC) ; https://hal.archives-ouvertes.fr/hal-03100822 ; 13th Workshop on Building and Using Comparable Corpora (BUCC), May 2020, Marseille, France. pp.6-13 (2020)
3 |
Automatic Removal of Identifying Information in Official EU Languages for Public Administrations: The MAPA Project
In: Legal Knowledge and Information Systems ; Frontiers in Artificial Intelligence and Applications ; International Conference on Legal Knowledge and Information Systems ; https://hal.archives-ouvertes.fr/hal-03058311 ; International Conference on Legal Knowledge and Information Systems, Dec 2020, Brno, Prague, Czech Republic. pp.223-226, ⟨10.3233/FAIA200869⟩ ; http://ebooks.iospress.nl/volume/legal-knowledge-and-information-systems-jurix-2020-the-thirty-third-annual-conference-brno-czech-republic-december-911-2020 (2020)
4 |
The Multilingual Anonymisation Toolkit for Public Administrations (MAPA) Project
In: Annual Conference of the European Association for Machine Translation ; https://hal.archives-ouvertes.fr/hal-03103205 ; Annual Conference of the European Association for Machine Translation, Nov 2020, Lisbon, Portugal. pp.471-472 (2020)
5 |
TL-Explorer: A Digital Humanities Tool for Mapping and Analyzing Translated Literature
In: Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature ; https://hal.archives-ouvertes.fr/hal-03090881 ; Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Dec 2020, Barcelona, Spain ; https://www.aclweb.org/anthology/2020.latechclfl-1.20/ (2020)
6 |
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
In: International Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03100665 ; International Conference on Computational Linguistics, Dec 2020, Barcelona (on line), Spain. pp.6903-6915 ; https://coling2020.org/ (2020)
Abstract:
Due to the compelling improvements brought by BERT, many recent representation models adopted the Transformer architecture as their main building block, consequently inheriting the wordpiece tokenization system despite it not being intrinsically linked to the notion of Transformers. While this system is thought to achieve a good balance between the flexibility of characters and the efficiency of full words, using predefined wordpiece vocabularies from the general domain is not always suitable, especially when building models for specialized domains (e.g., the medical domain). Moreover, adopting a wordpiece tokenization shifts the focus from the word level to the subword level, making the models conceptually more complex and arguably less convenient in practice. For these reasons, we propose CharacterBERT, a new variant of BERT that drops the wordpiece system altogether and uses a Character-CNN module instead to represent entire words by consulting their characters. We show that this new model improves the performance of BERT on a variety of medical domain tasks while at the same time producing robust, word-level, and open-vocabulary representations.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
URL: https://hal.archives-ouvertes.fr/hal-03100665/document https://hal.archives-ouvertes.fr/hal-03100665/file/ElBoukkouri_COLING2020.pdf https://hal.archives-ouvertes.fr/hal-03100665
7 |
French Levothyrox® Crisis: Retrospective Analysis of Social Media
In: International Society of Pharmacovigilance ; https://hal.archives-ouvertes.fr/hal-02411632 ; International Society of Pharmacovigilance, Springer International Publishing, Oct 2019, Bogota, Colombia (2019)
8 |
Apprentissage de plongements de mots dynamiques avec régularisation de la dérive
In: Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume I : Articles longs ; 26e Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-02566346 ; 26e Conférence sur le Traitement Automatique des Langues Naturelles, 2019, Toulouse, France. pp.13-26 (2019)
9 |
Cross-Lingual Contextual Word Embeddings Mapping with Multi-Sense Words in Mind
In: https://hal.archives-ouvertes.fr/hal-03100840 ; 2019 (2019)
10 |
A Sustainable and Open Access Knowledge Organization Model to Preserve Cultural Heritage and Language Diversity
In: ISSN: 2078-2489 ; Information ; https://hal.archives-ouvertes.fr/hal-02565134 ; Information, MDPI, 2019, 10 (10), pp.303. ⟨10.3390/info10100303⟩ (2019)
11 |
Embedding Strategies for Specialized Domains: Application to Clinical Entity Recognition
In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop ; https://hal.archives-ouvertes.fr/hal-02860947 ; Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, Jul 2019, Florence, France. pp.295-301, ⟨10.18653/v1/P19-2041⟩ (2019)
12 |
Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) - PFIA 2019
In: https://hal.archives-ouvertes.fr/hal-02566345 ; Morin, Emmanuel and Rosset, Sophie and Zweigenbaum, Pierre. Jul 2019, Toulouse, France. ATALA, 2019 (2019)
13 |
Designing a virtual patient dialogue system based on terminology-rich resources: challenges and evaluation
In: ISSN: 1351-3249 ; EISSN: 1469-8110 ; Natural Language Engineering ; https://hal.archives-ouvertes.fr/hal-02358021 ; Natural Language Engineering, Cambridge University Press (CUP), 2019, pp.1-38 (2019)
14 |
Détection des couples de termes translittérés à partir d'un corpus parallèle anglais-arabe
In: Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01899828 ; Conférence sur le Traitement Automatique des Langues Naturelles, May 2018, Rennes, France (2018)
15 |
Clinical Natural Language Processing in languages other than English: opportunities and challenges.
In: ISSN: 2041-1480 ; Journal of Biomedical Semantics ; https://hal.archives-ouvertes.fr/hal-01842518 ; Journal of Biomedical Semantics, BioMed Central, 2018, 9, 13p. ⟨10.1186/s13326-018-0179-8⟩ (2018)
16 |
CLEF eHealth 2018 Multilingual Information Extraction Task Overview: ICD10 Coding of Death Certificates in French, Hungarian and Italian
In: Conference and Labs of the Evaluation Forum, eHealth ; https://hal.archives-ouvertes.fr/hal-02276492 ; Conference and Labs of the Evaluation Forum, eHealth, CEUR, Sep 2018, Avignon, France (2018)
17 |
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01898362 ; International Conference on Language Resources and Evaluation, May 2018, Miyazaki, Japan (2018)
18 |
GNEG: Graph-Based Negative Sampling for word2vec
In: Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01899825 ; Annual Meeting of the Association for Computational Linguistics, Jul 2018, Melbourne, Australia (2018)
19 |
Expanding the Diversity of Texts and Applications: Findings from the Section on Clinical Natural Language Processing of the International Medical Informatics Association Yearbook.
In: ISSN: 0943-4747 ; EISSN: 2364-0502 ; IMIA Yearbook of Medical Informatics ; https://hal.archives-ouvertes.fr/hal-01990501 ; IMIA Yearbook of Medical Informatics, Schattauer, 2018, 27, pp.193-198 (2018)
20 |
CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French.
In: Workshop of the Cross-Language Evaluation Forum ; https://hal.archives-ouvertes.fr/hal-01665374 ; Workshop of the Cross-Language Evaluation Forum, CEUR-WS, Jan 2017, Dublin, Ireland (2017)