2 |
Does BERT really agree ? Fine-grained Analysis of Lexical Dependence on a Syntactic Task ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Automatic enjambment detection as a new source of evidence in Spanish versification
|
|
|
|
In: Plotting Poetry: On Mechanically Enhanced Reading ; https://hal.archives-ouvertes.fr/hal-03255481 ; Bories, Anne-Sophie; Purnelle, Gérald; Marchal, Hugues. Plotting Poetry: On Mechanically Enhanced Reading, Presses Universitaires de Liège, 2021, 978-2-87562-280-8 ; http://www.presses.uliege.be/ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
The Corpus for Idiolectal Research (CIDRE)
|
|
|
|
In: European Association of Digital Humanities Conference (EADH 2021) ; https://hal.archives-ouvertes.fr/hal-03353520 ; European Association of Digital Humanities Conference (EADH 2021), Sep 2021, Krasnoyarsk, Russia (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Evaluating Hierarchical Clustering Methods for Corpora with Chronological Order
|
|
|
|
In: EADH2021: Interdisciplinary Perspectives on Data. Second International Conference of the European Association for Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03341803 ; EADH2021: Interdisciplinary Perspectives on Data. Second International Conference of the European Association for Digital Humanities, EADH, Sep 2021, Krasnoyarsk, Russia ; https://eadh2020-2021.org/ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
The Corpus for Idiolectal Research (CIDRE)
|
|
|
|
In: EISSN: 2059-481X ; Journal of Open Humanities Data ; https://hal.archives-ouvertes.fr/hal-03310451 ; Journal of Open Humanities Data, Ubiquity Press, 2021, 7, pp.15. ⟨10.5334/johd.42⟩ (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Text Zoning of Theater Reviews: How Different are Journalistic from Blogger Reviews?
|
|
|
|
In: Workshop on Natural Language Processing for Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03498270 ; Workshop on Natural Language Processing for Digital Humanities, Dec 2021, Sichar, India ; https://rootroo.com/downloads/nlp4dh_proceedings_draft.pdf (2021)
|
|
BASE
|
|
Show details
|
|
11 |
The Corpus for Idiolectal Research (CIDRE)
|
|
|
|
In: Journal of Open Humanities Data; Vol 7 (2021); 15 ; 2059-481X (2021)
|
|
BASE
|
|
Show details
|
|
14 |
ACCOLÉ : Annotation Collaborative d'erreurs de traduction pour COrpus aLignÉs – Nouvelles fonctionnalités
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-03047150 ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), 2020, Montrouge, France. pp.1-8 (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Glossary: Introduction to the Digital Humanities ; Glossaire : Introduction aux humanités numériques
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02410396 ; 2020 (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity
|
|
Vulic, Ivan; Baker, Simon; Ponti, Edoardo Maria; Petti, Ulla; Leviant, Ira; Wing, Kelly; Majewska, Olga; Bar, Eden; Malone, Matt; Poibeau, Thierry; Reichart, Roi; Korhonen, Anna
|
|
In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02975786 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2020, 46 (4), pp.847-897 ; https://direct.mit.edu/coli/article/46/4/847/97326/Multi-SimLex-A-Large-Scale-Evaluation-of (2020)
|
|
Abstract:
Données et informations liées à la publication : https://multisimlex.com/ ; International audience ; We introduce Multi-SimLex, a large-scale lexical resource and evaluation benchmark covering datasets for 12 typologically diverse languages, including major languages (e.g., Mandarin Chinese, Spanish, Russian) as well as less-resourced ones (e.g., Welsh, Kiswahili). Each language dataset is annotated for the lexical relation of semantic similarity and contains 1,888 semantically aligned concept pairs, providing a representative coverage of word classes (nouns, verbs, adjectives, adverbs), frequency ranks, similarity intervals, lexical fields, and concreteness levels. Additionally, owing to the alignment of concepts across languages, we provide a suite of 66 cross-lingual semantic similarity datasets. Due to its extensive size and language coverage, Multi-SimLex provides entirely novel opportunities for experimental evaluation and analysis. On its monolingual and cross-lingual benchmarks, we evaluate and analyze a wide array of recent state-of-the-art monolingual and cross-lingual representation models, including static and contextualized word embeddings (such as fastText, M-BERT and XLM), externally informed lexical representations, as well as fully unsupervised and (weakly) supervised cross-lingual word embeddings. We also present a step-by-step dataset creation protocol for creating consistent, Multi-Simlex-style resources for additional languages. We make these contributions -- the public release of Multi-SimLex datasets, their creation protocol, strong baseline results, and in-depth analyses which can be be helpful in guiding future developments in multilingual lexical semantics and representation learning -- available via a website which will encourage community effort in further expansion of Multi-Simlex to many more languages. Such a large-scale semantic resource could inspire significant further advances in NLP across languages.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SCCO.COMP]Cognitive science/Computer science; [SCCO.LING]Cognitive science/Linguistics; [SHS.INFO]Humanities and Social Sciences/Library and information sciences; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [SHS.STAT]Humanities and Social Sciences/Methods and statistics; Lexicon; Linguistic Resource; Multilinguality; Semantics; Typology
|
|
URL: https://hal.archives-ouvertes.fr/hal-02975786/file/coli_a_00391.pdf https://hal.archives-ouvertes.fr/hal-02975786 https://hal.archives-ouvertes.fr/hal-02975786/document
|
|
BASE
|
|
Hide details
|
|
17 |
Semi-Supervised Learning on Meta Structure: Multi-Task Tagging and Parsing in Low-Resource Scenarios
|
|
|
|
In: Conference of the Association for the Advancement of Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-02895835 ; Conference of the Association for the Advancement of Artificial Intelligence, Association for the Advancement of Artificial Intelligence, Feb 2020, New York, United States ; https://aaai.org/Conferences/AAAI-20/ (2020)
|
|
BASE
|
|
Show details
|
|
18 |
Lexical encoding of multiword expressions in XMG
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-03047145 ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), Dec 2020, Montrouge, France. pp.60-63 (2020)
|
|
BASE
|
|
Show details
|
|
19 |
Classification des catégories grammaticales sur deux corpus longitudinaux d’enfants
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-03047149 ; 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT), 2020, Montrouge, France. pp.23-33 (2020)
|
|
BASE
|
|
Show details
|
|
20 |
Longform recordings : Opportunities and challenges ; Enregistrements de longue durée: Opportunités et défis
|
|
|
|
In: Actes des 2èmes journées scientifiques du Groupement de Recherche Linguistique Informatique Formelle et de Terrain (LIFT). ; LIFT 2020 - 2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain" ; https://hal.archives-ouvertes.fr/hal-03047153 ; LIFT 2020 - 2èmes journées scientifiques du Groupement de Recherche "Linguistique informatique, formelle et de terrain", Dec 2020, Montrouge / Virtual, France. pp.64-71 (2020)
|
|
BASE
|
|
Show details
|
|
|
|