DE eng

Search in the Catalogues and Directories

Page: 1 2 3
Hits 1 – 20 of 42

1
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
In: ISSN: 1934-5275 ; EISSN: 1934-5275 ; Language Documentation & Conservation ; https://hal.archives-ouvertes.fr/hal-03273196 ; Language Documentation & Conservation, University of Hawaiʻi Press 2021, 15, pp.316-357 ; http://hdl.handle.net/10125/74645 (2021)
BASE
Show details
2
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Abstract: In contrast to French, the vast majority of regional languages of France can be considered as under-resourced. In this article, we present the results of a research project aiming to produce annotated resources for three regional languages of France: Alsatian, Occitan, and Picard. These languages cover three different language families (Germanic and two subfamilies of Romance, Oïl and Oc languages) and different sociolinguistic situations. Yet, they all face issues common to many under-resourced languages: lack of human and financial resources and presence of geolinguistic variation. The originality of this project is that it brought together researchers from different fields (sociolinguistics, descriptive linguistics, dialectology, natural language processing, digital humanities) to work together towards the common goal of developing annotated corpora for Alsatian, Occitan, and Picard. This created a favorable and stimulating working environment which could not have been achieved had different research groups worked independently, each on a single language. This article details the annotation process, with a special focus on the delimitation of the tokens and the definition of the part-of-speech tags. ; National Foreign Language Resource Center ; bernhard_et_al.pdf
Keyword: Alsatian; annotations; corpus; Occitan; part-of-speech; Picard; tokenization
URL: http://hdl.handle.net/10125/74645
BASE
Hide details
3
Collecting and annotating corpora for three under-resourced languages of France: Methodological issues
Bernhard, Delphine; Ligozat, Anne-Laure; Bras, Myriam. - : University of Hawaii Press, 2021
BASE
Show details
4
L’avenir numérique des langues minoritaires : bilan du projet RESTAURE pour l’alsacien, l’occitan et le picard
In: ISSN: 2105-0368 ; Les Cahiers du GEPE ; Colloque « Langues minoritaires » : quels acteurs pour quel avenir ? ; https://hal.archives-ouvertes.fr/hal-02378172 ; Les Cahiers du GEPE, Université de Strasbourg, 2020, Langues minoritaires : Quels acteurs pour quel avenir ? ; http://cahiersdugepe.fr/index.php?id=3662 (2020)
BASE
Show details
5
Exploiting languages proximity for part-of-speech tagging of three French regional languages [<Journal>]
Magistry, Pierre [Verfasser]; Ligozat, Anne-Laure [Verfasser]; Rosset, Sophie [Verfasser]
DNB Subject Category Language
Show details
6
Exploiting languages proximity for part-of-speech tagging of three French regional languages
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02358020 ; Language Resources and Evaluation, Springer Verlag, 2019, pp.1-26 (2019)
BASE
Show details
7
Language Technologies for Regional Languages of France: The RESTAURE Project
In: International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide ; https://hal.archives-ouvertes.fr/hal-02418928 ; International Conference Language Technologies for All (LT4All): Enabling Linguistic Diversity and Multilingualism Worldwide, Dec 2019, Paris, France. pp.272‑275 ; https://lt4all.elra.info/proceedings/lt4all2019/ (2019)
BASE
Show details
8
A Corpus for Hybrid Question Answering Systems
In: Proceeding WWW '18 Companion Proceedings of the The Web Conference 2018 ; Workshop on Hybrid Question Answering with Structured and Unstructured Knowledge ; https://hal.archives-ouvertes.fr/hal-02284465 ; Workshop on Hybrid Question Answering with Structured and Unstructured Knowledge, Apr 2018, Lyon - FR, France. pp.1081-1086, &#x27E8;10.1145/3184558.3191540&#x27E9; (2018)
BASE
Show details
9
Étiquetage en parties du discours de langues peu dotées par spécialisation des plongements lexicaux
In: Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01793092 ; Conférence sur le Traitement Automatique des Langues Naturelles, May 2018, Rennes, France (2018)
BASE
Show details
10
Resources and Methods for the Automatic Recognition of Place Names in Alsatian
In: Corpus-Based Research in the Humanities ; https://hal.archives-ouvertes.fr/hal-01702656 ; Corpus-Based Research in the Humanities, Jan 2018, Vienna, Austria. pp.35-44 ; https://www.oeaw.ac.at/ac/crh2/proceedings/ (2018)
BASE
Show details
11
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT) [<Journal>]
Campillos, Leonardo [Verfasser]; Deleger, Louise [Sonstige]; Grouin, Cyril [Sonstige].
DNB Subject Category Language
Show details
12
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT)
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-01631743 ; Language Resources and Evaluation, Springer Verlag, 2017, 52 (2), pp.571-601. &#x27E8;10.1007/s10579-017-9382-y&#x27E9; (2017)
BASE
Show details
13
Chaînes de référence et lisibilité des textes : Le projet ALLuSIF
In: ISSN: 0023-8368 ; EISSN: 1957-7982 ; Langue française ; https://halshs.archives-ouvertes.fr/halshs-01665316 ; Langue française, Armand Colin, 2017, Les chaînes de référence en corpus (éds. Catherine Schnedecker, Julie Glikman, Frédéric Landragin), 195 (3), pp.35-52 ; http://www.revues.armand-colin.com/lettres-langues/langue-francaise/langue-francaise-ndeg-195-32017 (2017)
BASE
Show details
14
Chaînes de référence et lisibilité des textes : le projet ALLuSIF
In: Langue française, N 195, 3, 2017-09-25, pp.35-52 (2017)
BASE
Show details
15
Evaluating Lexical Simplification and Vocabulary Knowledge for Learners of French: Possibilities of Using the FLELex Resource
In: LREC 2016 proceedings ; Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-02517616 ; Language Resources and Evaluation Conference (LREC), May 2016, Portorož, Slovenia (2016)
BASE
Show details
16
Modèles adaptatifs pour prédire automatiquement la compétence lexicale d'un apprenant de français langue étrangère
In: Actes de la conférence conjointe JEP-TALN-RECITAL 2016 ; JEP-TALN-RECITAL 2016 ; https://hal.archives-ouvertes.fr/hal-01631772 ; JEP-TALN-RECITAL 2016, Jan 2016, Paris, France (2016)
BASE
Show details
17
Are Cohesive Features Relevant for Text Readability Evaluation?
In: 26th International Conference on Computational Linguistics (COLING 2016) ; https://hal.archives-ouvertes.fr/hal-01430554 ; 26th International Conference on Computational Linguistics (COLING 2016), Dec 2016, Osaka, Japan. pp.987 - 997 ; http://coling2016.anlp.jp/ (2016)
BASE
Show details
18
Représentation sémantique de questions pour interroger le Web sémantique.
In: CORIA 2015 - Conférence en Recherche d'Informations et Applications - 12th French Information Retrieval Conference, Paris, France, March 18-20, 2015. ; CORIA ; https://hal.archives-ouvertes.fr/hal-02289244 ; CORIA, Mar 2015, Paris, France. pp.453--468, &#x27E8;10.24348/coria.2015.80&#x27E9; (2015)
BASE
Show details
19
Représentation sémantique de questions pour interroger le Web sémantique. ...
BASE
Show details
20
LIMSI-CNRS@ CLEF 2014: Invalidating Answers for Multiple Choice Question Answering.
In: Working Notes for CLEF 2014 Conference, Sheffield, UK, September 15-18, 2014 ; CLEF 2014 ; https://hal.archives-ouvertes.fr/hal-02290008 ; CLEF 2014, Sep 2014, Sheffield, United Kingdom. pp.1386--1394 (2014)
BASE
Show details

Page: 1 2 3

Catalogues
0
0
0
0
2
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
39
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern