Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Bernhard, Delphine (10)
Todirascu, Amalia (10)
Ligozat, Anne-Laure (8)
Linguistique, Langues et Parole (LILPA) (7)
Université de Strasbourg (UNISTRA) (7)
Bras, Myriam (4)
Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise (ENSIIE) (4)
François, Thomas (4)
Gala, Núria (4)
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) (4)
more
Year
Medium
Type:
Article (8)
Miscellaneous (2)
BLLDB-Access:
free (10)
subject to license (0)
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 10 of 10
1
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine
;
Ligozat, Anne-Laure
;
Bras, Myriam
;
Martin, Fanny
;
Vergez-Couret, Marianne
;
Erhart, Pascale
;
Sibille, Jean
;
Todirascu, Amalia
;
Boula De Mareüil, Philippe
;
Huck, Dominique
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
Abstract:
International audience ; In contrast to French, the vast majority of regional languages of France can be considered as under-resourced. In this article, we present the results of a research project aiming to produce annotated resources for three regional languages of France: Alsatian, Occitan, and Picard. These languages cover three different language families (Germanic and two subfamilies of Romance, Oïl and Oc languages) and different sociolinguistic situations. Yet, they all face issues common to many under-resourced languages: lack of human and financial resources and presence of geolinguistic variation. The originality of this project is that it brought together researchers from different fields (sociolinguistics, descriptive linguistics, dialectology, natural language processing, digital humanities) to work together towards the common goal of developing annotated corpora for Alsatian, Occitan, and Picard. This created a favorable and stimulating working environment which could not have been achieved had different research groups worked independently, each on a single language. This article details the annotation process, with a special focus on the delimitation of the tokens and the definition of the part-of-speech tags.
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
;
Alsatian
;
annotations
;
corpus
;
Occitan
;
part-of-speech
;
Picard
;
tokenization
URL:
https://hal.archives-ouvertes.fr/hal-03273196/file/bernhard_et_al.pdf
https://hal.archives-ouvertes.fr/hal-03273196/document
https://hal.archives-ouvertes.fr/hal-03273196
BASE
Hide details
2
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine
;
Ligozat, Anne-Laure
;
Bras, Myriam
. - : University of Hawaii Press, 2021
BASE
Show details
3
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine
;
Ligozat, Anne-Laure
;
Bras, Myriam
. - : University of Hawaii Press, 2021
BASE
Show details
4
Transformations syntaxiques pour une aide à l'apprentissage de la lecture : typologie, adéquation et corpus adaptés
Gala, Núria
;
Todirascu, Amalia
;
Bernhard, Delphine
...
In: ISSN: 2261-2424 ; SHS Web of Conferences ; https://hal.archives-ouvertes.fr/hal-02562205 ; SHS Web of Conferences, EDP Sciences, 2020, 7e Congrès Mondial de Linguistique Française 78, pp.14006. ⟨10.1051/shsconf/20207814006⟩ (2020)
BASE
Show details
5
L’avenir numérique des langues minoritaires : bilan du projet RESTAURE pour l’alsacien, l’occitan et le picard
Bernhard, Delphine
;
Bras, Myriam
;
Ligozat, Anne-Laure
...
In: ISSN: 2105-0368 ; Les Cahiers du GEPE ; Colloque « Langues minoritaires » : quels acteurs pour quel avenir ? ; https://hal.archives-ouvertes.fr/hal-02378172 ; Les Cahiers du GEPE, Université de Strasbourg, 2020, Langues minoritaires : Quels acteurs pour quel avenir ? ; http://cahiersdugepe.fr/index.php?id=3662 (2020)
BASE
Show details
6
Recommandations pour des transformations de textes français afin d'améliorer leur lisibilité et leur compréhension
Gala, Núria
;
Todirascu, Amalia
;
Javourey-Drevet, Ludivine
...
In: https://hal.archives-ouvertes.fr/hal-03198905 ; [Rapport de recherche] ANR. 2020 (2020)
BASE
Show details
7
Chaînes de référence et lisibilité des textes : Le projet ALLuSIF
Todirascu, Amalia
;
François, Thomas
;
Bernhard, Delphine
...
In: ISSN: 0023-8368 ; EISSN: 1957-7982 ; Langue française ; https://halshs.archives-ouvertes.fr/halshs-01665316 ; Langue française, Armand Colin, 2017, Les chaînes de référence en corpus (éds. Catherine Schnedecker, Julie Glikman, Frédéric Landragin), 195 (3), pp.35-52 ; http://www.revues.armand-colin.com/lettres-langues/langue-francaise/langue-francaise-ndeg-195-32017 (2017)
BASE
Show details
8
Chaînes de référence et lisibilité des textes : le projet ALLuSIF
Todirascu, Amalia
;
François, Thomas
;
Bernhard, Delphine
...
In: Langue française, N 195, 3, 2017-09-25, pp.35-52 (2017)
BASE
Show details
9
Are Cohesive Features Relevant for Text Readability Evaluation?
Todirascu, Amalia
;
François, Thomas
;
Bernhard, Delphine
...
In: 26th International Conference on Computational Linguistics (COLING 2016) ; https://hal.archives-ouvertes.fr/hal-01430554 ; 26th International Conference on Computational Linguistics (COLING 2016), Dec 2016, Osaka, Japan. pp.987 - 997 ; http://coling2016.anlp.jp/ (2016)
BASE
Show details
10
Coherence and Cohesion for the Assessment of Text Readability
Todirascu, Amalia
;
François, Thomas
;
Gala, Nuria
...
In: Proceedings of 10th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2013) ; https://hal.archives-ouvertes.fr/hal-00860796 ; Proceedings of 10th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2013), Oct 2013, Marseille, France. pp.11-19 (2013)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
10
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern