2 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Glottolog 4.4 Resources for Picard
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
4 |
Emphatic elements and the development of definite articles
|
|
|
|
In: Journal of Historical Syntax; Vol 5 No 16-25 (2021): Proceedings of the 21st Diachronic Generative Syntax (DiGS) Conference; 1-32 ; 2163-6001 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
Bernhard, Delphine; Ligozat, Anne-Laure; Bras, Myriam; Martin, Fanny; Vergez-Couret, Marianne; Erhart, Pascale; Sibille, Jean; Todirascu, Amalia; Boula de Mareüil, Philippe; Huck, Dominique. - : University of Hawaii Press, 2021
|
|
Abstract:
In contrast to French, the vast majority of regional languages of France can be considered as under-resourced. In this article, we present the results of a research project aiming to produce annotated resources for three regional languages of France: Alsatian, Occitan, and Picard. These languages cover three different language families (Germanic and two subfamilies of Romance, Oïl and Oc languages) and different sociolinguistic situations. Yet, they all face issues common to many under-resourced languages: lack of human and financial resources and presence of geolinguistic variation. The originality of this project is that it brought together researchers from different fields (sociolinguistics, descriptive linguistics, dialectology, natural language processing, digital humanities) to work together towards the common goal of developing annotated corpora for Alsatian, Occitan, and Picard. This created a favorable and stimulating working environment which could not have been achieved had different research groups worked independently, each on a single language. This article details the annotation process, with a special focus on the delimitation of the tokens and the definition of the part-of-speech tags. ; National Foreign Language Resource Center ; bernhard_et_al.pdf
|
|
Keyword:
Alsatian; annotations; corpus; Occitan; part-of-speech; Picard; tokenization
|
|
URL: http://hdl.handle.net/10125/74645
|
|
BASE
|
|
Hide details
|
|
6 |
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Étude comparative des particules interrogatives en picard et dans deux variétés de français parlées au Canada [TRADUCTIONS EXEMPLES PICARDS]
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Iterative methods for constructing an equations of non-closed shells solution
|
|
|
|
In: Structural Mechanics of Engineering Constructions and Buildings, Vol 17, Iss 6, Pp 588-607 (2021) (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Le parler picard hennuyer de Gommegnies (Nord)
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03270447 ; Éditions de linguistique et de philologie. 2020, 978-2-37276-038-6 (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Les parlers romans dans l'atlas sonore des langues et dialectes de Belgique
|
|
|
|
In: ISSN: 0220-665X ; Bien dire et bien aprandre - Revue de médiévistique ; https://hal.archives-ouvertes.fr/hal-03047333 ; Bien dire et bien aprandre - Revue de médiévistique, Centre d'études médiévales et dialectales, 2020, Les atlas linguistiques galloromans à l'heure du numérique : projets et enjeux, pp.85-108 ; https://www.septentrion.com/fr/livre/?GCOI=27574100945550 (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Enseigner le picard au XXIème siècle : pour qui, comment ?
|
|
|
|
In: Variation et enseignement des langues le cas des langues à faible diffusion ; https://hal.archives-ouvertes.fr/hal-03215161 ; Variation et enseignement des langues le cas des langues à faible diffusion, 2020 (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Variation dans le système pronominal gallo-roman : l’expression de la pluralité en français et en picard
|
|
Tremblay, Mireille. - : Département d'études françaises, Université de Toronto, 2020. : Érudit, 2020
|
|
BASE
|
|
Show details
|
|
14 |
Exploiting languages proximity for part-of-speech tagging of three French regional languages
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02358020 ; Language Resources and Evaluation, Springer Verlag, 2019, pp.1-26 (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Language Technologies for Regional Languages of France: The RESTAURE Project
|
|
|
|
In: International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide ; https://hal.archives-ouvertes.fr/hal-02418928 ; International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, Dec 2019, Paris, France. pp.272‑275 ; https://lt4all.elra.info/proceedings/lt4all2019/ (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Francisation des dialectes d’oïl : de l’usage des atlas linguistiques comme termes de comparaison
|
|
|
|
In: Langages, N 215, 3, 2019-09-30, pp.27-42 (2019)
|
|
BASE
|
|
Show details
|
|
|
|