DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...78
Hits 1 – 20 of 1.545

1
From FreEM to D'AlemBERT ; From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
In: Proceedings of the 13th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-03596653 ; Proceedings of the 13th Language Resources and Evaluation Conference, European Language Resources Association, Jun 2022, Marseille, France (2022)
Abstract: 8 pages, 2 figures, 4 tables ; International audience ; Language models for historical states of language are becoming increasingly important to allow the optimal digitisation and analysis of old textual sources. Because these historical states are at the same time more complex to process and more scarce in the corpora available, specific efforts are necessary to train natural language processing (NLP) tools adapted to the data. In this paper, we present our efforts to develop NLP tools for Early Modern French (historical French from the 16th to the 18th centuries). We present the FreEMmax corpus of Early Modern French and D'AlemBERT, a RoBERTa-based language model trained on FreEMmax. We evaluate the usefulness of D'AlemBERT by fine-tuning it on a part-of-speech tagging task, outperforming previous work on the test set. Importantly, we find evidence for the transfer learning capacity of the language model, since its performance on lesser-resourced time periods appears to have been boosted by the more resourced ones. We release D'AlemBERT and the open-sourced subpart of the FreEMmax corpus.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Corpus creation; Création de corpus; Digital humanities; Early Modern French; Français classique; Humanités Numériques; Language modelling; Langues peu dotées; Less-resourced languages; Modèle de langue neuronal; Modélisation linguistique; Neural language representation models; Partie du discours; POS tagging
URL: https://hal.inria.fr/hal-03596653
BASE
Hide details
2
Digital Mediations: A Report on Digital Transformations in Modern Languages ...
Spence, Paul; Brandão, Renata. - : Zenodo, 2022
BASE
Show details
3
Digital Mediations: A Report on Digital Transformations in Modern Languages ...
Spence, Paul; Brandão, Renata. - : Zenodo, 2022
BASE
Show details
4
Linguistic Mathematical Relationships Saved or Lost in Translating Texts: Extension of the Statistical Theory of Translation and Its Application to the New Testament
In: Information; Volume 13; Issue 1; Pages: 20 (2022)
BASE
Show details
5
Language Barriers in the U.S.: Exploring the protection of human trafficking victims whose native language is Spanish
In: Honors College Theses (2022)
BASE
Show details
6
Processing (transformed) idioms ...
Suijkerbuijk, Michelle. - : Open Science Framework, 2022
BASE
Show details
7
La Diversidad Lingüística Durante y Después del Franquismo en España
In: The Review: A Journal of Undergraduate Student Research (2022)
BASE
Show details
8
Anglicisation in the letters of Marie Stewart, Countess of Mar and her family: a sociolinguistic perspective
BASE
Show details
9
THIS BODY IS AN ABSTRACTION: A Bilingual Anthology of Contemporary Puerto Rican Poetry
In: Embargoed Honors Theses, University of Nebraska-Lincoln (2022)
BASE
Show details
10
I Am a Cat, No. II
In: Zea E-Books Collection (2022)
BASE
Show details
11
Student Centered Language Teaching: A Focus on Student Identity
In: All Graduate Plan B and other Reports (2022)
BASE
Show details
12
La Educación Bilingüe: Una breve historia de la educación bilingüe en los Estados Unidos y otros países y sus beneficios
In: World Languages and Cultures (2021)
BASE
Show details
13
Observación Participante de Clases Virtuales Bilingües en K-2 Durante Covid-19
In: World Languages and Cultures (2021)
BASE
Show details
14
Las Comunidades Asiáticas en Latinoamérica
In: World Languages and Cultures (2021)
BASE
Show details
15
Systemic Functional Linguistics and Its Application to the Study of Academic Conference Presentations
In: World Languages Faculty Publications and Presentations (2021)
BASE
Show details
16
Innovation From Above, Below, and Behind: The Linguistics of the Hebrew Revival
In: Senior Projects Spring 2021 (2021)
BASE
Show details
17
Theme-based Second Language Learning through Multimodal Experimental Animation
In: Chinese Language Teaching Methodology and Technology (2021)
BASE
Show details
18
Simultaneous readings under past tense in Modern Greek ...
Tsilia, Anastasia. - : Open Science Framework, 2021
BASE
Show details
19
International Bilingual Journal of Culture, Anthropology and Linguistics ...
Pal, Patitpaban. - : Open Science Framework, 2021
BASE
Show details
20
The VOT productions of absolute beginners of L3 French ...
Parrish, Kyle. - : Open Science Framework, 2021
BASE
Show details

Page: 1 2 3 4 5...78

Catalogues
30
0
7
0
0
2
0
Bibliographies
92
2
0
0
0
0
0
34
12
Linked Open Data catalogues
11
Online resources
25
0
0
2
Open access documents
1.366
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern