DE eng

Search in the Catalogues and Directories

Hits 1 – 9 of 9

1
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
BASE
Show details
2
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine; Ligozat, Anne-Laure; Bras, Myriam. - : University of Hawaii Press, 2021
BASE
Show details
3
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine; Ligozat, Anne-Laure; Bras, Myriam. - : University of Hawaii Press, 2021
BASE
Show details
4
Language Technologies for Regional Languages of France: The RESTAURE Project
In: International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide ; https://hal.archives-ouvertes.fr/hal-02418928 ; International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, Dec 2019, Paris, France. pp.272‑275 ; https://lt4all.elra.info/proceedings/lt4all2019/ (2019)
BASE
Show details
5
Annotated Corpus for the Alsatian Dialects ...
BASE
Show details
6
Annotated Corpus for the Alsatian Dialects ...
Abstract: This corpus contains a collection of texts in the Alsatian dialects which were manually annotated with parts-of-speech, lemmas, translations into French and location entities. The corpus was produced in the context of the RESTAURE project, funded by the French ANR. The current version of the corpus contains 21 documents and 12,570 tokens. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806 Information about version 2 Version 2 contains the same annotated documents as version 1, but some errors have been corrected and the annotated corpus is provided in the CoNLL-U format The untokenised and unannotated versions of the documents are found in the “txt” folder. The annotated versions of the documents are found in the "ud" folder (CoNLL-U format). In addition to the form, the lemma and the part-of-speechn additional information is also provided: translation of the lemma into French (Gloss field) annotation of location names (NamedType field) ...
Keyword: Alsatian; Corpus; FOS Languages and literature; Lemma; Linguistics; Natural Language Processing; Part-of-speech
URL: https://zenodo.org/record/2536041
https://dx.doi.org/10.5281/zenodo.2536041
BASE
Hide details
7
Part-Of-Speech Annotation Guidelines For The Alsatian Dialects ...
BASE
Show details
8
Annotated Corpus For The Alsatian Dialects ...
BASE
Show details
9
Part-Of-Speech Annotation Guidelines For The Alsatian Dialects ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
9
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern