1 |
Building infrastructure for annotating medieval, classical and pre-orthographic languages: the Pyrrha ecosystem
|
|
|
|
In: Digital Humanities 2022 (DH2022) ; https://hal.archives-ouvertes.fr/hal-03606756 ; Digital Humanities 2022 (DH2022), Jul 2022, Tokyo, Japan ; https://dh2022.adho.org/ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Lemmatiser des textes et corriger l'annotation grâcè a l'apprentissage profond avec Pyrrha
|
|
|
|
In: Humanistica 2021 ; https://hal.archives-ouvertes.fr/hal-03224112 ; Humanistica 2021, May 2021, Rennes, France (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://halshs.archives-ouvertes.fr/halshs-02591388 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2021, ⟨10.46298/jdmdh.6485⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Corpus and Models for Lemmatisation and POS-tagging of Old French
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03353125 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
"Don't worry, it's just noise": quantifying the impact of files treated as single textual units when they are really collections
|
|
|
|
In: Proceedings of the Workshop on Natural Language Processing for Digital Humanities (NLP4DH) ; Workshop on Natural Language Processing for Digital Humanities (NLP4DH) ; https://hal.archives-ouvertes.fr/hal-03481620 ; Workshop on Natural Language Processing for Digital Humanities (NLP4DH), Dec 2021, Virtual, India (2021)
|
|
Abstract:
International audience ; Literature works may present many autonomous or semi-autonomous units, such as poems for the first or chapter for the second. We make the hypothesis that such cuts in the text's flow, if not taken care of in the way we process text, have an impact on the application of the distributional hypothesis. We test this hypothesis with a large 20M tokens corpus of Latin works, by using text files as a single unit or multiple "autonomous" units for the analysis of selected words. For groups of rare words and words specific to heavily segmented works, the results show that their semantic space is mostly different between both versions of the corpus. For the 1000 most frequent words of the corpus, variations are important as soon as the window for defining neighborhood is larger or equal to 10 words.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics
|
|
URL: https://hal.archives-ouvertes.fr/hal-03481620/file/dont_worry_just_noise_ACL.pdf https://hal.archives-ouvertes.fr/hal-03481620 https://hal.archives-ouvertes.fr/hal-03481620/document
|
|
BASE
|
|
Hide details
|
|
6 |
Corpus and Models for Lemmatisation and POS-tagging of Old French ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Guidelines for linguistic annotation of modern French (16th-18th c.) ; Manuel d'annotation linguistique pour le français moderne (XVIe -XVIIIe siècles)
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-02571190 ; 2020 (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Standardizing linguistic data: method and tools for annotating (pre-orthographic) French ; Standardiser les données linguistiques: méthodes et outils pour l'annotation du français (pré-orthographique)
|
|
|
|
In: Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20) ; https://hal.archives-ouvertes.fr/hal-03018381 ; Proceedings of the 2nd International Digital Tools & Uses Congress (DTUC '20), Oct 2020, Hammamet, Tunisia. ⟨10.1145/3423603.3423996⟩ (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Standardizing linguistic data: method and tools for annotating (pre-orthographic) French ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Standardizing linguistic data: method and tools for annotating(pre-orthographic) French ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Standardizing linguistic data: method and tools for annotating(pre-orthographic) French ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
APIs in Digital Humanities: The Infrastructural Turn
|
|
|
|
In: Digital Humanities 2016 ; https://hal.archives-ouvertes.fr/hal-01348706 ; Digital Humanities 2016, Jul 2016, Cracovie, Poland. pp.93-96 ; http://dh2016.adho.org/ (2016)
|
|
BASE
|
|
Show details
|
|
|
|