1 |
Towards a part-of-speech tagger for Sranan Tongo ...
|
|
Nicolás, C.V.; Viktor, Z.. - : Фонд содействия развитию интернет-медиа, ИТ-образования, человеческого потенциала "Лига интернет-медиа", 2022
|
|
BASE
|
|
Show details
|
|
2 |
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.3
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Enhancing Communication Reliability from the Semantic Level under Low SNR
|
|
|
|
In: Electronics; Volume 11; Issue 9; Pages: 1358 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Leveraging Part-of-Speech Tagging Features and a Novel Regularization Strategy for Chinese Medical Named Entity Recognition
|
|
|
|
In: Mathematics; Volume 10; Issue 9; Pages: 1386 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Tagged Corpus of Early English Correspondence Extension Sampler (TCEECES) ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Tagged Corpus of Early English Correspondence Extension Sampler (TCEECES) ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Easy-to-use combination of POS and BERT model for domain-specific and misspelled terms
|
|
|
|
In: NL4IA Workshop Proceedings ; https://hal.archives-ouvertes.fr/hal-03474696 ; NL4IA Workshop Proceedings, Nov 2021, Milan, Italy (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
Bernhard, Delphine; Ligozat, Anne-Laure; Bras, Myriam; Martin, Fanny; Vergez-Couret, Marianne; Erhart, Pascale; Sibille, Jean; Todirascu, Amalia; Boula De Mareüil, Philippe; Huck, Dominique
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
|
|
Abstract:
International audience ; In contrast to French, the vast majority of regional languages of France can be considered as under-resourced. In this article, we present the results of a research project aiming to produce annotated resources for three regional languages of France: Alsatian, Occitan, and Picard. These languages cover three different language families (Germanic and two subfamilies of Romance, Oïl and Oc languages) and different sociolinguistic situations. Yet, they all face issues common to many under-resourced languages: lack of human and financial resources and presence of geolinguistic variation. The originality of this project is that it brought together researchers from different fields (sociolinguistics, descriptive linguistics, dialectology, natural language processing, digital humanities) to work together towards the common goal of developing annotated corpora for Alsatian, Occitan, and Picard. This created a favorable and stimulating working environment which could not have been achieved had different research groups worked independently, each on a single language. This article details the annotation process, with a special focus on the delimitation of the tokens and the definition of the part-of-speech tags.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; Alsatian; annotations; corpus; Occitan; part-of-speech; Picard; tokenization
|
|
URL: https://hal.archives-ouvertes.fr/hal-03273196/file/bernhard_et_al.pdf https://hal.archives-ouvertes.fr/hal-03273196/document https://hal.archives-ouvertes.fr/hal-03273196
|
|
BASE
|
|
Hide details
|
|
9 |
Hierarchical-Task Reservoir for Online Semantic Analysis from Continuous Speech
|
|
|
|
In: ISSN: 2162-237X ; IEEE Transactions on Neural Networks and Learning Systems ; https://hal.inria.fr/hal-03031413 ; IEEE Transactions on Neural Networks and Learning Systems, IEEE, 2021, ⟨10.1109/TNNLS.2021.3095140⟩ ; https://ieeexplore.ieee.org/abstract/document/9548713/metrics#metrics (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Annotated Corpus of Pre-Standardized Balkan Slavic Literature 1.1
|
|
Šimko, Ivan. - : Slavic Seminary, University of Zurich, 2021
|
|
BASE
|
|
Show details
|
|
13 |
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Slovenian 1.2
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Developing Core Technologies for Resource-Scarce Nguni Languages
|
|
|
|
In: Information; Volume 12; Issue 12; Pages: 520 (2021)
|
|
BASE
|
|
Show details
|
|
20 |
A Comparative Study of Arabic Part of Speech Taggers Using Literary Text Samples from Saudi Novels
|
|
|
|
In: Information; Volume 12; Issue 12; Pages: 523 (2021)
|
|
BASE
|
|
Show details
|
|
|
|