Page: 1 2 3 4 5 6 7 8 9... 192
81 |
Multiple Tasks Integration ; Multiple Tasks Integration: Tagging, Syntactic and Semantic Parsing as a Single Task
|
|
|
|
In: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics ; EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03601585 ; EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2021, Kyiv, Ukraine (2021)
|
|
BASE
|
|
Show details
|
|
82 |
Annotation manuelle des émotions dans des textes écrits avec la plateforme Glozz. ; Annotation manuelle des émotions dans des textes écrits avec la plateforme Glozz.: Guide d'annotation
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03263194 ; [Rapport de recherche] MoDyCo; Université Paris Nanterre. 2021 (2021)
|
|
BASE
|
|
Show details
|
|
83 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
Show details
|
|
84 |
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
|
|
|
|
In: Proc. Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03329116 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ (2021)
|
|
BASE
|
|
Show details
|
|
85 |
Texte électronique enrichi par lemmatisation et étiquetage morphosyntaxique, portion de La Mort du roi Arthur , http://www.atilf.fr/dmf/MortArthur/
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03426756 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
86 |
From text saliency to linguistic objects: learning linguistic interpretable markers with a multi-channels convolutional architecture
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03142170 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
87 |
Artificial Text Detection via Examining the Topology of Attention Maps
|
|
|
|
In: ACL Anthology ; Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-03456191 ; Empirical Methods in Natural Language Processing, ACL (Association for Computational Linguistics), Nov 2021, Punta Cana, Dominican Republic (2021)
|
|
BASE
|
|
Show details
|
|
88 |
On the Role of Low-Level Linguistic Tasks for Reading Time Prediction
|
|
|
|
In: Proceedings of the Annual Meeting of the Cognitive Science Society, 43(43) ; 43rd Annual Meeting of the Cognitive Science Society ; https://hal.archives-ouvertes.fr/hal-03303689 ; 43rd Annual Meeting of the Cognitive Science Society, Jul 2021, Vienna, Austria. pp.452 ; https://cognitivesciencesociety.org/cogsci-2021/ (2021)
|
|
BASE
|
|
Show details
|
|
89 |
Grapholinguistics in the 21st Century - 2020. Part II
|
|
|
|
In: G21C 2020 : Grapholinguistics in the 21st Century ; https://hal.archives-ouvertes.fr/hal-03161397 ; G21C 2020 : Grapholinguistics in the 21st Century, Jun 2020, Paris, France. 5, Fluxus Editions, 2021, Grapholinguistics and Its Applications, 9782957054978. ⟨10.36824/2020-graf2⟩ ; http://www.fluxus-editions.fr/gla5.php (2021)
|
|
BASE
|
|
Show details
|
|
90 |
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
|
|
|
|
In: https://hal.inria.fr/hal-03161685 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
91 |
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi
|
|
|
|
In: https://hal.inria.fr/hal-03161677 ; 2021 (2021)
|
|
Abstract:
Building natural language processing systems for non standardized and low resource languages is a difficult challenge. The recent success of large-scale multilingual pretrained language models provides new modeling tools to tackle this. In this work, we study the ability of multilingual language models to process an unseen dialect. We take user generated North-African Arabic as our case study, a resource-poor dialectal variety of Arabic with frequent code-mixing with French and written in Arabizi, a non-standardized transliteration of Arabic to Latin script. Focusing on two tasks, part-of-speech tagging and dependency parsing, we show in zero-shot and unsupervised adaptation scenarios that multilingual language models are able to transfer to such an unseen dialect, specifically in two extreme cases: (i) across scripts, using Modern Standard Arabic as a source language, and (ii) from a distantly related language, unseen during pretraining, namely Maltese. Our results constitute the first successful transfer experiments on this dialect, paving thus the way for the development of an NLP ecosystem for resource-scarce, non-standardized and highly variable vernacular languages.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
|
|
URL: https://hal.inria.fr/hal-03161677
|
|
BASE
|
|
Hide details
|
|
92 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
|
|
BASE
|
|
Show details
|
|
93 |
Grapholinguistics in the 21st Century - 2020 ; Grapholinguistics in the 21st Century - 2020: Part I
|
|
|
|
In: G21C 2020 : Grapholinguistics in the 21st Century ; https://hal.archives-ouvertes.fr/hal-03161395 ; Yannis Haralambous. G21C 2020 : Grapholinguistics in the 21st Century, Jun 2020, Paris, France. 4, Fluxus Editions, 2021, Grapholinguistics and Its Applications, 9782957054961. ⟨10.36824/2020-graf1⟩ ; http://www.fluxus-editions.fr/grafematik2020-proceedingsI.pdf (2021)
|
|
BASE
|
|
Show details
|
|
94 |
Playing With Unicorns: AI Dungeon and Citizen NLP
|
|
|
|
In: Digital Humanities Quarterly, vol 14, iss 4 (2021)
|
|
BASE
|
|
Show details
|
|
95 |
Contextualized, Metadata-Empowered, Coarse-to-Fine Weakly-Supervised Text Classification
|
|
|
|
BASE
|
|
Show details
|
|
96 |
Automatic simplification of technical and specialized texts ; Simplification automatique de textes techniques et spécialisés
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-03343769 ; Informatique et langage [cs.CL]. Université de Lille, 2021. Français. ⟨NNT : 2021LILUH007⟩ (2021)
|
|
BASE
|
|
Show details
|
|
97 |
Automatic text simplification of specialized and technical texts ; Simplification automatique de textes techniques et spécialisés
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-03343769 ; Informatique et langage [cs.CL]. Université de Lille, 2021. Français (2021)
|
|
BASE
|
|
Show details
|
|
98 |
Models of diachronic semantic change using word embeddings ; Modèles diachroniques à base de plongements de mot pour l'analyse du changement sémantique
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03199801 ; Document and Text Processing. Université Paris-Saclay, 2021. English. ⟨NNT : 2021UPASG006⟩ (2021)
|
|
BASE
|
|
Show details
|
|
99 |
Hate speech and offensive language detection using transfer learning approaches ; Détection du discours de haine et du langage offensant utilisant des approches de Transfer Learning
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03276023 ; Document and Text Processing. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAS007⟩ (2021)
|
|
BASE
|
|
Show details
|
|
100 |
Extraction and normalization of simple and structured entities in medical documents ; Extraction et normalisation d'entités simples et structurées dans les documents médicaux
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-03624928 ; Document and Text Processing. Sorbonne Université, 2021. English (2021)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 192
|
|