DE eng

Search in the Catalogues and Directories

Page: 1 2 3
Hits 1 – 20 of 53

1
Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
In: SyntaxFest Quasy 2021 - Quantitative Syntax ; https://hal.inria.fr/hal-03501774 ; SyntaxFest Quasy 2021 - Quantitative Syntax, Mar 2022, Sofia, Bulgaria (2022)
BASE
Show details
2
Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
In: Quasy (Quantitative Syntax), SyntaxFest 2021 ; https://hal.inria.fr/hal-03501774 ; Quasy (Quantitative Syntax), SyntaxFest 2021, Mar 2022, Sofia, Bulgaria (2022)
BASE
Show details
3
French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English
In: ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03629677 ; ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, May 2022, Dublin, Ireland (2022)
Abstract: International audience ; Warning: This paper contains explicit statements of offensive stereotypes which may be upsetting. Much work on biases in natural language processing has addressed biases linked to the social and cultural experience of English speaking individuals in the United States. We seek to widen the scope of bias studies by creating material to measure social bias in language models (LMs) against specific demographic groups in France. We build on the US-centered CrowS-pairs dataset to create a multilingual stereotypes dataset that allows for comparability across languages while also characterizing biases that are specific to each country and language. We introduce 1,677 sentence pairs in French that cover stereotypes in ten types of bias like gender and age. 1,467 sentence pairs are translated from CrowS-pairs and 210 are newly crowdsourced and translated back into English. The sentence pairs contrast stereotypes concerning underadvantaged groups with the same sentence concerning advantaged groups. We find that four widely used language models (three French, one multilingual) favor sentences that express stereotypes in most bias categories. We report on the translation process, which led to a characterization of stereotypes in CrowS-pairs including the identification of US-centric cultural traits. We offer guidelines to further extend the dataset to other languages and cultural environments.
Keyword: [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
URL: https://hal.inria.fr/hal-03629677
https://hal.inria.fr/hal-03629677/file/ACLFinal.pdf
https://hal.inria.fr/hal-03629677/document
BASE
Hide details
4
Convertir le Trésor de la Langue Française en Ontolex-Lemon : un zeste de données liées
In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03463294 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, Dec 2021, Grenoble, France (2021)
BASE
Show details
5
Analyse orientée corpus d'universaux de Greenberg sur Universal Dependencies
In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03462112 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, GDR LIFT - Linguistique Informatique, Formelle et de Terrain, Dec 2021, Grenoble, France (2021)
BASE
Show details
6
The Dawn of the Human-Machine Era: A forecast of new and emerging language technologies.
In: https://hal.archives-ouvertes.fr/hal-03230287 ; 2021 (2021)
BASE
Show details
7
Deep Sequoia corpus - PARSEME-FR corpus - FrSemCor
BASE
Show details
8
The Dawn of the Human-Machine Era: A forecast of new and emerging language technologies
BASE
Show details
9
Language Teachers and Crowdsourcing: Insights from a Cross-European Survey
In: ISSN: 1331-6745 ; EISSN: 1849-0379 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje ; https://hal.inria.fr/hal-02974069 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje, 2020, 46 (1), pp.1-28. ⟨10.31724/rihjj.46.1.1⟩ (2020)
BASE
Show details
10
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02879883 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France (2020)
BASE
Show details
11
Text Corpora and the Challenge of Newly Written Languages
In: 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; https://hal.archives-ouvertes.fr/hal-02611209 ; 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), May 2020, Marseille, France (2020)
BASE
Show details
12
Unsupervised Data Augmentation for Less-Resourced Languages with no Standardized Spelling
In: RANLP ; https://hal.archives-ouvertes.fr/hal-02280002 ; RANLP, Sep 2019, Varna, Bulgaria. pp.776 - 784 (2019)
BASE
Show details
13
Représentations et transmission des connaissances à la lumière de l’innovation numérique. Actes du colloque Jeunes Chercheurs PRAXILING UMR 5267, 7-8 Novembre 2019
In: Jeunes Chercheurs PRAXILING UMR 5267 ; https://hal.archives-ouvertes.fr/hal-02369575 ; Jeunes Chercheurs PRAXILING UMR 5267, Nov 2019, Montpellier, France. 2019, ⟨10.18463/toubol.001⟩ (2019)
BASE
Show details
14
Représentations et transmission des connaissances à la lumière de l'innovation numérique. Actes du colloque Jeunes Chercheurs 2019 PRAXILING, 7-8 Novembre 2019. ...
Magnier, Julien; Biales, Anne-Laure; Bellet, Pierre. - : Praxiling UMR 5267, 2019
BASE
Show details
15
À l'écoute des locuteurs : production participative de ressources langagières pour des langues non standardisées
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01995758 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2018 (2018)
BASE
Show details
16
Cheap, Fast and Good! Voting Games with a Purpose
In: Games4NLP: Games and Gamification for Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01790614 ; Games4NLP: Games and Gamification for Natural Language Processing , May 2018, Miyazaki, Japan (2018)
BASE
Show details
17
Produire des données pour la recherche en jouant aux zombies
In: ISSN: 2270-6224 ; Interstices ; https://hal.inria.fr/hal-01827612 ; Interstices, INRIA, 2018 ; https://interstices.info/produire-des-donnees-pour-la-recherche-en-jouant-aux-zombies (2018)
BASE
Show details
18
"Fingers in the Nose": Evaluating Speakers' Identification of Multi-Word Expressions Using a Slightly Gamified Crowdsourcing Platform
In: Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) ; https://hal.archives-ouvertes.fr/hal-01912706 ; Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), Aug 2018, Santa Fe, United States. pp.207 - 213 ; https://aclanthology.coli.uni-saarland.de/events/ws-2018/#W18-49 (2018)
BASE
Show details
19
Introducing the European NETwork for Combining Language LEarning and Crowdsourcing Techniques (enetCollect)
In: EuroCALL ; https://hal.archives-ouvertes.fr/hal-01961788 ; EuroCALL, Aug 2018, Jyväskylä, Finland (2018)
BASE
Show details
20
Report on EMNLP Reviewer Survey
In: https://hal.archives-ouvertes.fr/hal-01660886 ; [Technical Report] Association for computational linguistics. 2017 (2017)
BASE
Show details

Page: 1 2 3

Catalogues
0
0
0
0
2
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
50
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern