Home
Catalogue search
Refine your search:
Keyword:
17th c. French (1)
17th century French (1)
Classical theatre (1)
Lemmatisation (1)
POS tagging (1)
[INFO.INFO-CL]Computer Science [cs] / Computation and Language [cs.CL] (1)
[INFO.INFO-TT]Computer Science [cs] / Document and Text Processing (1)
[SHS.LANGUE]Humanities and Social Sciences / Linguistics (1)
[SHS.LITT]Humanities and Social Sciences / Literature (1)
Creator / Publisher
Year:
2021 (1)
Medium
Type:
Article (1)
BLLDB-Access:
free (1)
subject to license (0)
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 1 of 1
1
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
Camps, Jean-Baptiste
;
Gabay, Simon
;
Fièvre, Paul
;
Clérice, Thibault
;
Cafiero, Florian
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-02591388 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, ⟨10.46298/jdmdh.6485⟩ (2021)
Abstract:
International audience ; This paper describes the process of building an annotated corpus and training models for classical French literature, with a focus on theatre, and particularly comedies in verse. It was originally developed as a preliminary step to the stylometric analyses presented in Cafiero and Camps [2019]. The use of a recent lemmatiser based on neural networks and a CRF tagger allows to achieve accuracies beyond the current state-of-the art on the in-domain test, and proves to be robust during out-of-domain tests, i.e.up to 20th c.novels.
Keyword:
17th c. French
;
17th century French
;
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
;
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
;
[SHS.LANGUE]Humanities and Social Sciences/Linguistics
;
[SHS.LITT]Humanities and Social Sciences/Literature
;
Classical theatre
;
Lemmatisation
;
POS tagging
URL:
https://doi.org/10.46298/jdmdh.6485
https://halshs.archives-ouvertes.fr/halshs-02591388
https://halshs.archives-ouvertes.fr/halshs-02591388v2/file/Corpus_models_Classical_French_v2.pdf
https://halshs.archives-ouvertes.fr/halshs-02591388v2/document
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
1
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern