DE eng

Search in the Catalogues and Directories

Hits 1 – 15 of 15

1
Language Teachers and Crowdsourcing: Insights from a Cross-European Survey
In: ISSN: 1331-6745 ; EISSN: 1849-0379 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje ; https://hal.inria.fr/hal-02974069 ; Rasprave Instituta za hrvatski jezik i jezikoslovlje, 2020, 46 (1), pp.1-28. ⟨10.31724/rihjj.46.1.1⟩ (2020)
BASE
Show details
2
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02879883 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France (2020)
BASE
Show details
3
Text Corpora and the Challenge of Newly Written Languages
In: 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; https://hal.archives-ouvertes.fr/hal-02611209 ; 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), May 2020, Marseille, France (2020)
BASE
Show details
4
Unsupervised Data Augmentation for Less-Resourced Languages with no Standardized Spelling
In: RANLP ; https://hal.archives-ouvertes.fr/hal-02280002 ; RANLP, Sep 2019, Varna, Bulgaria. pp.776 - 784 (2019)
Abstract: International audience ; Non-standardized languages are a challenge to the construction of representative linguistic resources and to the development of efficient natural language processing tools: when spelling is not determined by a consensual norm, a multiplicity of alternative written forms can be encountered for a given word, inducing a large proportion of out-of-vocabulary words. To embrace this diversity, we propose a methodology based on crowdsourcing alternative spellings from which variation rules are automatically extracted. The rules are further used to match out-of-vocabulary words with one of their spelling variants. This virtuous process enables the unsupervised augmentation of multi-variant lexicons without requiring manual rule definition by experts. We apply this multilingual methodology on Al-satian, a French regional language and provide (i) an intrinsic evaluation of the correctness of the obtained variants pairs, (ii) an extrinsic evaluation on a downstream task: part-of-speech tagging. We show that in a low-resource scenario, collecting spelling variants for only 145 words can lead to (i) the generation of 876 additional variant pairs, (ii) a diminution of out-of-vocabulary words improving the tagging performance by 1 to 4%.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]
URL: https://hal.archives-ouvertes.fr/hal-02280002/file/Proceedings_of_Recent_Advances_in_Natural_Language_Processing.pdf
https://hal.archives-ouvertes.fr/hal-02280002
https://hal.archives-ouvertes.fr/hal-02280002/document
BASE
Hide details
5
Représentations et transmission des connaissances à la lumière de l’innovation numérique. Actes du colloque Jeunes Chercheurs PRAXILING UMR 5267, 7-8 Novembre 2019
In: Jeunes Chercheurs PRAXILING UMR 5267 ; https://hal.archives-ouvertes.fr/hal-02369575 ; Jeunes Chercheurs PRAXILING UMR 5267, Nov 2019, Montpellier, France. 2019, ⟨10.18463/toubol.001⟩ (2019)
BASE
Show details
6
À l'écoute des locuteurs : production participative de ressources langagières pour des langues non standardisées
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01995758 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2018 (2018)
BASE
Show details
7
Cheap, Fast and Good! Voting Games with a Purpose
In: Games4NLP: Games and Gamification for Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01790614 ; Games4NLP: Games and Gamification for Natural Language Processing , May 2018, Miyazaki, Japan (2018)
BASE
Show details
8
Produire des données pour la recherche en jouant aux zombies
In: ISSN: 2270-6224 ; Interstices ; https://hal.inria.fr/hal-01827612 ; Interstices, INRIA, 2018 ; https://interstices.info/produire-des-donnees-pour-la-recherche-en-jouant-aux-zombies (2018)
BASE
Show details
9
"Fingers in the Nose": Evaluating Speakers' Identification of Multi-Word Expressions Using a Slightly Gamified Crowdsourcing Platform
In: Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018) ; https://hal.archives-ouvertes.fr/hal-01912706 ; Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (LAW-MWE-CxG-2018), Aug 2018, Santa Fe, United States. pp.207 - 213 ; https://aclanthology.coli.uni-saarland.de/events/ws-2018/#W18-49 (2018)
BASE
Show details
10
Introducing the European NETwork for Combining Language LEarning and Crowdsourcing Techniques (enetCollect)
In: EuroCALL ; https://hal.archives-ouvertes.fr/hal-01961788 ; EuroCALL, Aug 2018, Jyväskylä, Finland (2018)
BASE
Show details
11
Report on EMNLP Reviewer Survey
In: https://hal.archives-ouvertes.fr/hal-01660886 ; [Technical Report] Association for computational linguistics. 2017 (2017)
BASE
Show details
12
Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax
In: International Conference on Computational Linguistics (COLING) ; https://hal.inria.fr/hal-01378980 ; International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan ; http://coling2016.anlp.jp/ (2016)
BASE
Show details
13
Ethical Issues in Corpus Linguistics And Annotation: Pay Per Hit Does Not Affect Effective Hourly Rate For Linguistic Resource Development On Amazon Mechanical Turk
In: ETHics In Corpus collection, Annotation and Application workshop ; https://hal.inria.fr/hal-01324362 ; ETHics In Corpus collection, Annotation and Application workshop, May 2016, Portoroz, Slovenia (2016)
BASE
Show details
14
Éthique et traitement automatique des langues et de la parole : entre truismes et tabous
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01422254 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2016, TAL et éthique, 57 (2), pp.7-19 ; https://www.atala.org/-Revue-TAL- (2016)
BASE
Show details
15
Analyse lexicale outillée de la parole transcrite de patients schizophrènes
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.inria.fr/hal-01188677 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2015, Natural Language Processing and Cognition, 55 (3), pp.91 - 115 (2015)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
15
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern