1 |
Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)
|
|
|
|
In: ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora ; https://hal.archives-ouvertes.fr/hal-03623351 ; ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora, Jun 2022, Marseille, France ; https://www.clarin.eu/ParlaCLARIN-III (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Collection of Slovenian paremiological units Pregovori 1.0
|
|
Babič, Saša; Miha, Peče; Erjavec, Tomaž; Ivančič Kutin, Barbara; Šrimpf Vendramin, Katarina; Kropej Telban, Monika; Jakop, Nataša; Stanonik, Marija. - : ZRC SAZU, 2022. : Jožef Stefan Institute, 2022
|
|
Abstract:
This corpus collects and annotates the extensive and highly valuable diachronic collection of Slovenian proverbs, 50 years and more in the making at the ZRC SAZU Institute of Slovenian Ethnology. It is composed of the structured 2,515 bibliographical items (1578-2010): printed books, journals, calendars, collecting campaigns in different journals, folklore collecting field-works, personal notes, etc. that served as the sources of the proverbs and the collection of the paremiological units. Each one is represented in two ways: as the diplomatic transcription from the source collection (due to the technical difficulties of the transcribers and human errors in transcription, the transcription of older texts is inconsistent) and as the critical transcription which normalizes the alphabet. The words of the critical transcriptions have also been automatically modernised to contemporary spelling, and these words further annotated with lemmas, MULTEXT-East MSDs and Universal dependencies with the CLASSLA toolchain. The canonical encoding of the corpus is TEI, but the corpus is also distributed in two derived encodings. One is the bibliography and sayings as two TSV files, and the other the vertical file, as used by CQP-type concordancers, such as Sketch Engine.
|
|
Keyword:
folk sayings; paremiology; proverbs; TEI
|
|
URL: http://hdl.handle.net/11356/1455
|
|
BASE
|
|
Hide details
|
|
6 |
Terminological Methods in Lexicography: Conceptualising, Organising, and Encoding Terms in General Language Dictionaries
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Giving Depth to TEI-Based Descriptions of Manuscripts: The Golden Gospel of Ham
|
|
|
|
In: Aethiopica; Bd. 24 (2021); 175–211 ; Aethiopica; Vol. 24 (2021); 175–211 ; 2194-4024 ; 1430-1938 ; 10.15460/aethiopica.24.0 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Towards an Online Database of Ancient Dramatic Meters
|
|
|
|
In: FuturoClassico FCl; N. 7 (2021); 143-164 ; 2465-0951 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Understanding and reading XML ; Comprendre et lire le XML
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03637142 ; École thématique. Comprendre et lire le XML, Bibliothèque du lab. CRISCO EA 4255, France. 2021, pp.72 ; Comprendre et lire le XML (2021)
|
|
BASE
|
|
Show details
|
|
10 |
XML and namespaces ; XML et espaces de nom
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-03637189 ; Doctorat. XML et espaces de nom, Bibliothèque du lab. CRISCO EA 4255, France. 2021, pp.44 ; XML et espaces de nom (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Language Processing in Digital Editions of Russian 18 th Century Texts ; Лингвистическая обработка цифровых изданий русских текстов XVIII века
|
|
|
|
In: Corpora 2021 International Conference ; https://halshs.archives-ouvertes.fr/halshs-03285725 ; Corpora 2021 International Conference, Saint-Petersburg State University, Jul 2021, Saint-Petersbourg, Russia ; https://events.spbu.ru/events/corpora-2021 (2021)
|
|
BASE
|
|
Show details
|
|
12 |
La Base de français médiéval et le consortium CAHIER : dix ans d'échanges et de collaborations
|
|
|
|
In: 10 ans avec CAHIER. Des corpus d'auteurs pour les humanités à leur exploitation numérique ; https://halshs.archives-ouvertes.fr/halshs-03363517 ; 10 ans avec CAHIER. Des corpus d'auteurs pour les humanités à leur exploitation numérique, Jun 2021, Bordeaux, France ; https://cahier10.sciencesconf.org/344494 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Expanding the content model of annotationBlock
|
|
|
|
In: Next Gen TEI, 2021 - TEI Conference and Members’ Meeting ; https://hal.archives-ouvertes.fr/hal-03380805 ; Next Gen TEI, 2021 - TEI Conference and Members’ Meeting, Oct 2021, Virtual, United States (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Multilingual comparable corpora of parliamentary debates ParlaMint 2.1
|
|
|
|
BASE
|
|
Show details
|
|
|
|