7
Part Of Speech Annotation Guidelines For The Occitan Language ...
BASE
9
Part Of Speech Annotation Guidelines For The Occitan Language ...
BASE
11
Part-Of-Speech Annotation Guidelines For The Alsatian Dialects ...
BASE
12
Annotated Corpus For The Alsatian Dialects ...
Abstract:
This corpus contains a collection of texts in the Alsatian dialects which were manually annotated with parts-of-speech, lemmas, translations into French and location entities. The corpus was produced in the context of the RESTAURE project, funded by the French ANR. The current version of the corpus contains 21 documents and 12,570 tokens. The annotation process is detailed in the following article: http://hal.archives-ouvertes.fr/hal-01704806

The untokenised and unannotated versions of the documents are found in the “txt” folder. The annotated versions of the documents are found in the “annotated” folder. They are provided in a TSV format with the following columns:
id: token index in the document
form: word form
translation: translation into French
lemma: word lemma
pos: part-of-speech
location: Begin-Inside tags for location entities ...
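The six-column TSV layout named in the abstract could be read along the following lines. This is a minimal sketch, not the corpus authors' tooling: the function name `read_tokens`, the optional header row, the tag values (`NOUN`, `VERB`, `B`, `I`, `O`) and the placeholder tokens in `SAMPLE` are all assumptions made for illustration, and the real files may differ (for example in how sentence boundaries or missing values are encoded).

```python
import csv
import io
from collections import Counter

# Column names as listed in the abstract.
COLUMNS = ["id", "form", "translation", "lemma", "pos", "location"]


def read_tokens(handle):
    """Read tab-separated token rows into a list of dicts keyed by COLUMNS."""
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    tokens = []
    for fields in reader:
        if len(fields) != len(COLUMNS):
            continue  # skip blank or malformed lines
        if fields == COLUMNS:
            continue  # skip a header row, if the file has one (assumption)
        tokens.append(dict(zip(COLUMNS, fields)))
    return tokens


# Invented placeholder rows, NOT taken from the corpus.
SAMPLE = (
    "id\tform\ttranslation\tlemma\tpos\tlocation\n"
    "1\tw1\tt1\tl1\tNOUN\tB\n"
    "2\tw2\tt2\tl2\tNOUN\tI\n"
    "\n"
    "3\tw3\tt3\tl3\tVERB\tO\n"
)

tokens = read_tokens(io.StringIO(SAMPLE))
pos_counts = Counter(token["pos"] for token in tokens)
# Tokens whose location tag marks the beginning or inside of an entity.
location_forms = [t["form"] for t in tokens if t["location"] in ("B", "I")]
```

For a file from the “annotated” folder, the same function could be called on `open(path, encoding="utf-8", newline="")`, where `path` is whichever document is being inspected; the UTF-8 encoding is likewise an assumption.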
Keywords:
Alsatian; Corpus; FOS Languages and literature; Lemma; Linguistics; Natural Language Processing; Part-of-speech
URL: https://dx.doi.org/10.5281/zenodo.1170129 https://zenodo.org/record/1170129
BASE
14
Part-Of-Speech Annotation Guidelines For The Alsatian Dialects ...
BASE