Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 23

1	Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
	Choi, Hee-Soo; Guillaume, Bruno; Fort, Karën
	In: SyntaxFest Quasy 2021 - Quantitative Syntax ; https://hal.inria.fr/hal-03501774 ; SyntaxFest Quasy 2021 - Quantitative Syntax, Mar 2022, Sofia, Bulgaria (2022)
	BASE
	Show details

2	Corpus-based Language Universals Analysis using Universal Dependencies ; Analyse orientée corpus d'universaux linguistiques sur Universal Dependencies
	Choi, Hee-Soo; Guillaume, Bruno; Fort, Karën
	In: Quasy (Quantitative Syntax), SyntaxFest 2021 ; https://hal.inria.fr/hal-03501774 ; Quasy (Quantitative Syntax), SyntaxFest 2021, Mar 2022, Sofia, Bulgaria (2022)
	BASE
	Show details

3	French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English
	Névéol, Aurélie; Dupont, Yoann; Bezançon, Julien; Fort, Karën
	In: ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics ; https://hal.inria.fr/hal-03629677 ; ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, May 2022, Dublin, Ireland (2022)
	Abstract: International audience ; Warning: This paper contains explicit statements of offensive stereotypes which may be upsetting. Much work on biases in natural language processing has addressed biases linked to the social and cultural experience of English speaking individuals in the United States. We seek to widen the scope of bias studies by creating material to measure social bias in language models (LMs) against specific demographic groups in France. We build on the US-centered CrowS-pairs dataset to create a multilingual stereotypes dataset that allows for comparability across languages while also characterizing biases that are specific to each country and language. We introduce 1,677 sentence pairs in French that cover stereotypes in ten types of bias like gender and age. 1,467 sentence pairs are translated from CrowS-pairs and 210 are newly crowdsourced and translated back into English. The sentence pairs contrast stereotypes concerning underadvantaged groups with the same sentence concerning advantaged groups. We find that four widely used language models (three French, one multilingual) favor sentences that express stereotypes in most bias categories. We report on the translation process, which led to a characterization of stereotypes in CrowS-pairs including the identification of US-centric cultural traits. We offer guidelines to further extend the dataset to other languages and cultural environments.
	Keyword: [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
	URL: https://hal.inria.fr/hal-03629677 https://hal.inria.fr/hal-03629677/file/ACLFinal.pdf https://hal.inria.fr/hal-03629677/document
	BASE
	Hide details

4	Convertir le Trésor de la Langue Française en Ontolex-Lemon : un zeste de données liées
	Ahmadi, Sina; Constant, Mathieu; Fort, Karën...
	In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03463294 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, Dec 2021, Grenoble, France (2021)
	BASE
	Show details

5	Analyse orientée corpus d'universaux de Greenberg sur Universal Dependencies
	Choi, Hee-Soo; Guillaume, Bruno; Fort, Karën
	In: Journées LIFT 2021 - Linguistique informatique, formelle et de terrain ; https://hal.inria.fr/hal-03462112 ; Journées LIFT 2021 - Linguistique informatique, formelle et de terrain, GDR LIFT - Linguistique Informatique, Formelle et de Terrain, Dec 2021, Grenoble, France (2021)
	BASE
	Show details

6	The Dawn of the Human-Machine Era: A forecast of new and emerging language technologies.
	Sayers, Dave; Sousa-Silva, Rui; Höhn, Sviatlana...
	In: https://hal.archives-ouvertes.fr/hal-03230287 ; 2021 (2021)
	BASE
	Show details

7	Language Teachers and Crowdsourcing: Insights from a Cross-European Survey
	Arhar Holdt, Špela; Zviel-Girshin, Rina; Gajek, Elżbieta...
	In: ISSN: 1331-6745 ; EISSN: 1849-0379 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje ; https://hal.inria.fr/hal-02974069 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje, 2020, 46 (1), pp.1-28. ⟨10.31724/rihjj.46.1.1⟩ (2020)
	BASE
	Show details

8	Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
	Nicolas, Lionel; Lyding, Verena; Borg, Claudia...
	In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02879883 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France (2020)
	BASE
	Show details

9	Text Corpora and the Challenge of Newly Written Languages
	Millour, Alice; Fort, Karën
	In: 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; https://hal.archives-ouvertes.fr/hal-02611209 ; 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), May 2020, Marseille, France (2020)
	BASE
	Show details

10	Produire des données pour la recherche en jouant aux zombies
	Fort, Karën; Guillaume, Bruno
	In: ISSN: 2270-6224 ; Interstices ; https://hal.inria.fr/hal-01827612 ; Interstices, INRIA, 2018 ; https://interstices.info/produire-des-donnees-pour-la-recherche-en-jouant-aux-zombies (2018)
	BASE
	Show details

11	"Fingers in the Nose": Evaluating Speakers' Identification of Multi-Word Expressions Using a Slightly Gamified Crowdsourcing Platform
	Fort, Karën; Guillaume, Bruno; Constant, Mathieu...
	In: Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) ; https://hal.archives-ouvertes.fr/hal-01912706 ; Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), Aug 2018, Santa Fe, United States. pp.207 - 213 ; https://aclanthology.coli.uni-saarland.de/events/ws-2018/#W18-49 (2018)
	BASE
	Show details

12	Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax
	Guillaume, Bruno; Fort, Karën; Lefèbvre, Nicolas
	In: International Conference on Computational Linguistics (COLING) ; https://hal.inria.fr/hal-01378980 ; International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan ; http://coling2016.anlp.jp/ (2016)
	BASE
	Show details

13	Analyse lexicale outillée de la parole transcrite de patients schizophrènes
	Amblard, Maxime; Fort, Karën; Demily, Caroline...
	In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.inria.fr/hal-01188677 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2015, Natural Language Processing and Cognition, 55 (3), pp.91 - 115 (2015)
	BASE
	Show details

14	Mapping the Lexique des Verbes du Français (Lexicon of French Verbs) to a NLP Lexicon using Examples
	Guillaume, Bruno; Fort, Karen; Perrier, Guy...
	In: International Conference on Language Resources and Evaluation (LREC) ; https://hal.inria.fr/hal-00969184 ; International Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland (2014)
	BASE
	Show details

15	Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
	Lafourcade, Mathieu; Fort, Karen
	In: 9th International Conference on Language Resources and Evaluation ; LREC: Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-00969161 ; LREC: Language Resources and Evaluation Conference, May 2014, Reykjavik, Iceland. pp.1676-1681 ; http://www.lrec-conf.org/proceedings/lrec2014/index.html (2014)
	BASE
	Show details

16	Deep Syntax Annotation of the Sequoia French Treebank
	Candito, Marie; Perrier, Guy; Guillaume, Bruno...
	In: International Conference on Language Resources and Evaluation (LREC) ; https://hal.inria.fr/hal-00969191 ; International Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland (2014)
	BASE
	Show details

17	Evaluating Corpora Documentation with regards to the Ethics and Big Data Charter
	Couillault, Alain; Fort, Karen; Adda, Gilles...
	In: International Conference on Language Resources and Evaluation (LREC) ; https://hal.inria.fr/hal-00969180 ; International Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland (2014)
	BASE
	Show details

18	"Where the data are coming from?" Ethics, crowdsourcing and traceability for Big Data in Human Language Technology
	Adda, Gilles; Besacier, Laurent; Couillault, Alain...
	In: Crowdsourcing and human computation multidisciplinary workshop ; https://hal.archives-ouvertes.fr/hal-01078045 ; Crowdsourcing and human computation multidisciplinary workshop, CNRS, Sep 2014, Paris, France (2014)
	BASE
	Show details

19	Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use
	Fort, Karen; Adda, Gilles; Sagot, Benoît...
	In: Human Language Technology Challenges for Computer Science and Linguistics ; https://hal.inria.fr/hal-01053047 ; Vetulani, Zygmunt and Mariani, Joseph. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.303-314, 2014, Lecture Notes in Computer Science, 978-3-319-08957-7. ⟨10.1007/978-3-319-08958-4_25⟩ (2014)
	BASE
	Show details

20	Étude quantitative des disfluences dans le discours de schizophrènes : automatiser pour limiter les biais
	Amblard, Maxime; Fort, Karen
	In: TALN - Traitement Automatique des Langues Naturelles ; https://hal.inria.fr/hal-01054391 ; TALN - Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. pp.292-303 (2014)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern