Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 4 of 4

1	La version française du Maqre-dardeqe (Paris, BNF Hébreu 1243)
	Kogel, Judith
	In: ISSN: 0484-8616 ; Revue des etudes juives ; https://halshs.archives-ouvertes.fr/halshs-03463723 ; Revue des etudes juives, Peeters Publishers, A paraître (2021)
	BASE
	Show details

2	Texte électronique enrichi par lemmatisation et étiquetage morphosyntaxique, portion de La Mort du roi Arthur , http://www.atilf.fr/dmf/MortArthur/
	Jézéquel, Jean-Michel; Bazin-Tacchella, Sylvie; Souvay, Gilles...
	In: https://hal.archives-ouvertes.fr/hal-03426756 ; 2021 (2021)
	BASE
	Show details

3	Corpus and Models for Lemmatisation and POS-tagging of Old French
	Camps, Jean-Baptiste; Clérice, Thibault; Duval, Frédéric...
	In: https://halshs.archives-ouvertes.fr/halshs-03353125 ; 2021 (2021)
	BASE
	Show details

4	Named Entity Recognition for French medieval charters
	Torres Aguilar, Sergio; Stutzmann, Dominique
	In: Workshop on Natural Language Processing for Digital Humanities ; https://hal.archives-ouvertes.fr/hal-03503055 ; Workshop on Natural Language Processing for Digital Humanities, Dec 2021, Helsinki, Finland (2021)
	Abstract: International audience ; This paper presents the process of annotating and modelling a corpus to automatically detect named entities in medieval charters in French. It introduces a new annotated corpus and a new system which outperforms state-of-the art libraries. Charters are legal documents and among the most important historical sources for medieval studies as they reflect economic and social dynamics as well as the evolution of literacy and writing practices. Automatic detection of named entities greatly improves the access to these unstructured texts and facilitates historical research. The experiments described here are based on a corpus encompassing about 500k words (1200 charters) coming from three charter collections ofthe 13th and 14th centuries. We annotated the corpus and then trained two state-of-the art NLP libraries for Named Entity Recognition (Spacy and Flair) and a custom neural model (Bi-LSTM-CRF). The evaluation shows that all three models achieve a high performance rate on the test set and a high generalization capacity against two external corpora unseen during training. This paper describes the corpus and the annotation model, and discusses the issues related to the linguistic processing of medieval French and formulaic discourse, so as to interpret the results within a larger historical perspective.
	Keyword: [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.HIST]Humanities and Social Sciences/History; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; ancien et moyen français; cultural heritage; named entity recognition; natural language processing; Old and Middle French; patrimoine culturel; reconnaissance des entités nommées; traitement automatique du langage naturel
	URL: https://hal.archives-ouvertes.fr/hal-03503055/file/ACL_2020_French_charters_NER_cameraready2.pdf https://hal.archives-ouvertes.fr/hal-03503055 https://hal.archives-ouvertes.fr/hal-03503055/document
	BASE
	Hide details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern