3 |
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
|
|
Nicolas, Lionel; Lyding, Verena; Borg, Claudia; Forascu, Corina; Fort, Karën; Zdravkova, Katerina; Kosem, Iztok; Cibej, Jaka; Holdt, Špela,; Millour, Alice; König, Alexander; Rodosthenous, Christos; Sangati, Federico; Hassan, Umair ul; Katinskaia, Anisia; Barreiro, Anabela; Aparaschivei, Lavinia; Hacohen-Kerner, Yaakov
|
|
In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02879883 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France (2020)
|
|
Abstract:
International audience ; We introduce in this paper a generic approach to combine implicit crowdsourcing and language learning in order to mass-produce language resources (LRs) for any language for which a crowd of language learners can be involved. We present the approach by explaining its core paradigm that consists in pairing specific types of LRs with specific exercises, by detailing both its strengths and challenges, and by discussing how much these challenges have been addressed at present. Accordingly, we also report on ongoing proof-of-concept efforts aiming at developing the first prototypical implementation of the approach in order to correct and extend an LR called ConceptNet based on the input crowdsourced from language learners. We then present an international network called the European Network for Combining Language Learning with Crowdsourcing Techniques (enetCollect) that provides the context to accelerate the implementation of the generic approach. Finally, we exemplify how it can be used in several language learning scenarios to produce a multitude of NLP resources and how it can therefore alleviate the long-standing NLP issue of the lack of LRs.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Collaborative Resource Construction; Computer-Assisted Language Learning; COST Action; Crowdsourcing
|
|
URL: https://hal.inria.fr/hal-02879883/document https://hal.inria.fr/hal-02879883 https://hal.inria.fr/hal-02879883/file/EnetCollect___LREC_2020.pdf
|
|
BASE
|
|
Hide details
|
|
4 |
Substituto - A Synchronous Educational Language Game for Simultaneous Teaching and Crowdsourcing
|
|
|
|
In: 9th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2020) ; https://hal.inria.fr/hal-03114898 ; 9th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2020), Nov 2020, Gothenburg, Sweden. pp.1-9, ⟨10.3384/ecp201759⟩ ; https://www.aclweb.org/anthology/volumes/2020.nlp4call-1/ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
LEONIDE - Longitudinal Learner Corpus in Italiano, Deutsch and English 1.1
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Introducing the European NETwork for Combining Language LEarning and Crowdsourcing Techniques (enetCollect)
|
|
|
|
In: EuroCALL ; https://hal.archives-ouvertes.fr/hal-01961788 ; EuroCALL, Aug 2018, Jyväskylä, Finland (2018)
|
|
BASE
|
|
Show details
|
|
7 |
MERLIN Written Learner Corpus for Czech, German, Italian 1.0
|
|
|
|
BASE
|
|
Show details
|
|
8 |
MERLIN Written Learner Corpus for Czech, German, Italian 1.1
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Enriching Morphological Lexica through Unsupervised Derivational Rule Acquisition
|
|
|
|
In: Proceedings of WoLeR 2011, ESSLLI Int. Workshop on Lexical Ressources ; WoLeR 2011at ESSLLI : International Workshop on Lexical Resources ; https://hal.inria.fr/inria-00617064 ; WoLeR 2011at ESSLLI : International Workshop on Lexical Resources, Aug 2011, Ljubljana, Slovenia (2011)
|
|
BASE
|
|
Show details
|
|
10 |
Creating and maintaining language resources: the main guidelines of the Victoria project
|
|
|
|
In: Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop) ; https://hal.inria.fr/inria-00521241 ; Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop), May 2010, Valletta, Malta (2010)
|
|
BASE
|
|
Show details
|
|
11 |
A morphological and syntactic wide-coverage lexicon for Spanish: The Leffe
|
|
|
|
In: RANLP 2009 - Recent Advances in Natural Language Processing ; https://hal.inria.fr/inria-00616693 ; RANLP 2009 - Recent Advances in Natural Language Processing, Sep 2009, Borovets, Bulgaria ; http://aclweb.org/anthology//R/R09/ (2009)
|
|
BASE
|
|
Show details
|
|
12 |
Trouver et confondre les coupables : un processus sophistiqué de correction de lexique
|
|
|
|
In: 16ème conférence sur le Traitement Automatique des Langues Naturelles : TALN'09 ; https://hal.inria.fr/inria-00553257 ; 16ème conférence sur le Traitement Automatique des Langues Naturelles : TALN'09, ATALA ; LIPN, Jun 2009, Senlis, France (2009)
|
|
BASE
|
|
Show details
|
|
13 |
Building a morphological and syntactic lexicon by merging various linguistic resources
|
|
|
|
In: NODALIDA 2009 - the 17th Nordic Conference of Computational Linguistics ; https://hal.inria.fr/hal-00793048 ; NODALIDA 2009 - the 17th Nordic Conference of Computational Linguistics, May 2009, Odense, Denmark (2009)
|
|
BASE
|
|
Show details
|
|
14 |
FRMG: évolutions d'un analyseur syntaxique TAG du français
|
|
|
|
In: Journée de l'ATALA sur : Quels analyseurs syntaxiques pour le français ? ; https://hal.inria.fr/inria-00553260 ; Journée de l'ATALA sur : Quels analyseurs syntaxiques pour le français ?, ATALA, Oct 2009, Paris, France ; http://alpage.inria.fr/iwpt09/atala/frmg.pdf (2009)
|
|
BASE
|
|
Show details
|
|
15 |
Towards efficient production of linguistic resources: the Victoria Project
|
|
|
|
In: Proceedings of the International Conference RANLP-2009 ; https://hal.inria.fr/inria-00553259 ; Proceedings of the International Conference RANLP-2009, 2009, Borovets, Bulgaria, Bulgaria. pp.318--323 ; http://www.aclweb.org/anthology/R09-1058 (2009)
|
|
BASE
|
|
Show details
|
|
16 |
Construcciòn y extensiòn de un léxico morfológico y sintáctico para el Español: el Leffe
|
|
|
|
In: Proceedings of SEPLN 09 ; https://hal.inria.fr/inria-00553258 ; Proceedings of SEPLN 09, 2009, San Sebastian, Spain, España (2009)
|
|
BASE
|
|
Show details
|
|
17 |
Producción eficiente de recursos lingüísticos: el proyecto Victoria
|
|
|
|
In: SEPLN 09 - 25th edition of the Annual Conference of the Spanish Society for Natural Language Processing ; https://hal.inria.fr/hal-00793059 ; SEPLN 09 - 25th edition of the Annual Conference of the Spanish Society for Natural Language Processing, Sep 2009, Donostia, España (2009)
|
|
BASE
|
|
Show details
|
|
18 |
Producción eficiente de recursos lingüísticos: el proyecto Victoria ; Efficient production of linguistic resources: the Victoria Project
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Construcción y extensión de un léxico morfológico y sintáctico para el español: el Leffe ; Building and extending a morphological and syntactic lexicon for Spanish: the Leffe
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Extensión y corrección semi-automática de léxicos morfo-sintácticos
|
|
|
|
In: 24th edition of the conference of the Spanish Society for Natural Language Processing (SEPLN 2008) ; https://hal.inria.fr/inria-00553523 ; 24th edition of the conference of the Spanish Society for Natural Language Processing (SEPLN 2008), El Advanced Database research group, LaBDA, Sep 2008, Madrid, España (2008)
|
|
BASE
|
|
Show details
|
|
|
|