Home
Catalogue search
Refine your search:
Keyword:
17th c. French (1)
17th century French (1)
Classical theatre (1)
Lemmatisation (1)
POS tagging (1)
[INFO.INFO-CL]Computer Science [cs] / Computation and Language [cs.CL] (1)
[INFO.INFO-TT]Computer Science [cs] / Document and Text Processing (1)
[SHS.LANGUE]Humanities and Social Sciences / Linguistics (1)
[SHS.LITT]Humanities and Social Sciences / Literature (1)
Creator / Publisher:
Bibliothèque Nationale de France (1)
Cafiero, Florian (1)
Camps, Jean-Baptiste (1)
Centre Jean Mabillon (CJM) (1)
Clérice, Thibault (1)
DIM Science du texte et connaissances nouvelles (1)
Fièvre, Paul (1)
Gabay, Simon (1)
Laboratoire Interdisciplinaire des Energies de Demain (LIED (UMR_8236)) (1)
Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS) (1)
more
Year:
2021 (1)
Medium
Type
BLLDB-Access:
free (1)
subject to license (0)
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 1 of 1
1
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
Camps, Jean-Baptiste
;
Gabay, Simon
;
Fièvre, Paul
;
Clérice, Thibault
;
Cafiero, Florian
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-02591388 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, ⟨10.46298/jdmdh.6485⟩ (2021)
Abstract:
International audience ; This paper describes the process of building an annotated corpus and training models for classical French literature, with a focus on theatre, and particularly comedies in verse. It was originally developed as a preliminary step to the stylometric analyses presented in Cafiero and Camps [2019]. The use of a recent lemmatiser based on neural networks and a CRF tagger allows to achieve accuracies beyond the current state-of-the art on the in-domain test, and proves to be robust during out-of-domain tests, i.e.up to 20th c.novels.
Keyword:
17th c. French
;
17th century French
;
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
;
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
;
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
;
[SHS.LITT]Humanities and Social Sciences/Literature
;
Classical theatre
;
Lemmatisation
;
POS tagging
URL:
https://doi.org/10.46298/jdmdh.6485
https://halshs.archives-ouvertes.fr/halshs-02591388
https://halshs.archives-ouvertes.fr/halshs-02591388v2/file/Corpus_models_Classical_French_v2.pdf
https://halshs.archives-ouvertes.fr/halshs-02591388v2/document
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
1
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern