DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 25

1
Kolipsi-1 Corpus v1.0
Glaznieks, Aivars; Frey, Jennifer-Carmen; Abel, Andrea; Vettori, Chiara; Nicolas, Lionel. - : Institute for Applied Linguistics, Eurac Research, 2021
Abstract: The Kolipsi-1 L2 is a written learner corpus of German and Italian L2 speakers originating from South Tyrol (Italy). It has been developed as a by-product of the KOLIPSI project “South-Tyrolean pupils and the second language: a linguistic and socio-psychological investigation”. In addition, data from L1 pupils were collected exclusively for the creation of a native speaker reference corpus. The data collection took place in autumn 2007 and is based on two standardized tests for written productions. The two tasks consisted of (1) writing an e-mail to a friend retelling a given event at the supermarket based on a picture story (narrative text genre) and (2) in writing a letter to a friend discussing holiday plans (argumentative text genre). For both tasks a time limit of 30 minutes was fixed and no additional reference material was allowed. CEFR levesl have been assigned to all L2 learner texts, providing a holistic score as well as evaluations of coherence, lexis, grammar and sociolinguistic appropriateness. Person-related metadata provides information about: - the writer's language background, including L1(s), the L1(s) of mother and father, and a self-declared language group affiliation - the writer's age, gender and socio-economic status - the writer's district of residence and whether he lives in an urban or rural environment - the language, location and type of school the writer attended - whether the writer passed the local bilinguality exam or not - an anonymous identifier for the writer's school class and L2 teacher to account for class effects All texts have been transcribed manually adding transcription annotations that reflect surface features of the text, such as the graphical arrangement, and include error annotation on the orthographic level. In addition to that, all texts were automatically annotated, adding tokenisation, sentence splitting, POS-tagging and lemmatization using an orthographically corrected target version of the corpus. Kolipsi-1 L2 belongs to the Kolipsi Corpus Family, a series of related learner corpora collected in South Tyrolean upper secondary schools. The corpora of the Kolipsi Corpus Family contain Italian and German learner texts that were collected in the course of the KOLIPSI project in 2007/2008 (Kolipsi-1) and a follow-up study in 2014/2015 (Kolipsi-2). The aim of both corpus studies was to analyse the second language competences of South-Tyrolean pupils from upper secondary schools (between 16-18 years old), and to contextualize the results of such investigation by commenting on crucial sociolinguistic and psychosocial aspects that influence it. The results of the follow-up study should be compared to the results of the original KOLIPSI project.
Keyword: argumentative essay; high school; L2; Learner corpora; opinion text; picture story; South Tyrol; students; upper secondary school
URL: https://hdl.handle.net/20.500.12124/26
BASE
Hide details
2
Kolipsi-2 Corpus v1.0
Glaznieks, Aivars; Frey, Jennifer-Carmen; Nicolas, Lionel. - : Institute for Applied Linguistics, Eurac Research, 2021
BASE
Show details
3
Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning
In: LREC 2020 - Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02879883 ; LREC 2020 - Language Resources and Evaluation Conference, May 2020, Marseille, France (2020)
BASE
Show details
4
Substituto - A Synchronous Educational Language Game for Simultaneous Teaching and Crowdsourcing
In: 9th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2020) ; https://hal.inria.fr/hal-03114898 ; 9th Workshop on Natural Language Processing for Computer Assisted Language Learning (NLP4CALL 2020), Nov 2020, Gothenburg, Sweden. pp.1-9, ⟨10.3384/ecp201759⟩ ; https://www.aclweb.org/anthology/volumes/2020.nlp4call-1/ (2020)
BASE
Show details
5
LEONIDE - Longitudinal Learner Corpus in Italiano, Deutsch and English 1.1
Glaznieks, Aivars; Frey, Jennifer-Carmen; Stopfner, Maria. - : Institute for Applied Linguistics, Eurac Research, 2020
BASE
Show details
6
Introducing the European NETwork for Combining Language LEarning and Crowdsourcing Techniques (enetCollect)
In: EuroCALL ; https://hal.archives-ouvertes.fr/hal-01961788 ; EuroCALL, Aug 2018, Jyväskylä, Finland (2018)
BASE
Show details
7
MERLIN Written Learner Corpus for Czech, German, Italian 1.0
Wisniewski, Katrin; Abel, Andrea; Vodičková, Kateřina. - : Institute for Applied Linguistics, Eurac Research, 2018
BASE
Show details
8
MERLIN Written Learner Corpus for Czech, German, Italian 1.1
Wisniewski, Katrin; Abel, Andrea; Vodičková, Kateřina. - : Institute for Applied Linguistics, Eurac Research, 2018
BASE
Show details
9
Enriching Morphological Lexica through Unsupervised Derivational Rule Acquisition
In: Proceedings of WoLeR 2011, ESSLLI Int. Workshop on Lexical Ressources ; WoLeR 2011at ESSLLI : International Workshop on Lexical Resources ; https://hal.inria.fr/inria-00617064 ; WoLeR 2011at ESSLLI : International Workshop on Lexical Resources, Aug 2011, Ljubljana, Slovenia (2011)
BASE
Show details
10
Creating and maintaining language resources: the main guidelines of the Victoria project
In: Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop) ; https://hal.inria.fr/inria-00521241 ; Workshop on Language Resources: From Storyboard to Sustainability and LR Lifecycle Management (LREC 2010 workshop), May 2010, Valletta, Malta (2010)
BASE
Show details
11
A morphological and syntactic wide-coverage lexicon for Spanish: The Leffe
In: RANLP 2009 - Recent Advances in Natural Language Processing ; https://hal.inria.fr/inria-00616693 ; RANLP 2009 - Recent Advances in Natural Language Processing, Sep 2009, Borovets, Bulgaria ; http://aclweb.org/anthology//R/R09/ (2009)
BASE
Show details
12
Trouver et confondre les coupables : un processus sophistiqué de correction de lexique
In: 16ème conférence sur le Traitement Automatique des Langues Naturelles : TALN'09 ; https://hal.inria.fr/inria-00553257 ; 16ème conférence sur le Traitement Automatique des Langues Naturelles : TALN'09, ATALA ; LIPN, Jun 2009, Senlis, France (2009)
BASE
Show details
13
Building a morphological and syntactic lexicon by merging various linguistic resources
In: NODALIDA 2009 - the 17th Nordic Conference of Computational Linguistics ; https://hal.inria.fr/hal-00793048 ; NODALIDA 2009 - the 17th Nordic Conference of Computational Linguistics, May 2009, Odense, Denmark (2009)
BASE
Show details
14
FRMG: évolutions d'un analyseur syntaxique TAG du français
In: Journée de l'ATALA sur : Quels analyseurs syntaxiques pour le français ? ; https://hal.inria.fr/inria-00553260 ; Journée de l'ATALA sur : Quels analyseurs syntaxiques pour le français ?, ATALA, Oct 2009, Paris, France ; http://alpage.inria.fr/iwpt09/atala/frmg.pdf (2009)
BASE
Show details
15
Towards efficient production of linguistic resources: the Victoria Project
In: Proceedings of the International Conference RANLP-2009 ; https://hal.inria.fr/inria-00553259 ; Proceedings of the International Conference RANLP-2009, 2009, Borovets, Bulgaria, Bulgaria. pp.318--323 ; http://www.aclweb.org/anthology/R09-1058 (2009)
BASE
Show details
16
Construcciòn y extensiòn de un léxico morfológico y sintáctico para el Español: el Leffe
In: Proceedings of SEPLN 09 ; https://hal.inria.fr/inria-00553258 ; Proceedings of SEPLN 09, 2009, San Sebastian, Spain, España (2009)
BASE
Show details
17
Producción eficiente de recursos lingüísticos: el proyecto Victoria
In: SEPLN 09 - 25th edition of the Annual Conference of the Spanish Society for Natural Language Processing ; https://hal.inria.fr/hal-00793059 ; SEPLN 09 - 25th edition of the Annual Conference of the Spanish Society for Natural Language Processing, Sep 2009, Donostia, España (2009)
BASE
Show details
18
Producción eficiente de recursos lingüísticos: el proyecto Victoria ; Efficient production of linguistic resources: the Victoria Project
Nicolas, Lionel; Molinero Álvarez, Miguel Ángel; Sagot, Benoît. - : Sociedad Española para el Procesamiento del Lenguaje Natural, 2009
BASE
Show details
19
Construcción y extensión de un léxico morfológico y sintáctico para el español: el Leffe ; Building and extending a morphological and syntactic lexicon for Spanish: the Leffe
Molinero Álvarez, Miguel Ángel; Sagot, Benoît; Nicolas, Lionel. - : Sociedad Española para el Procesamiento del Lenguaje Natural, 2009
BASE
Show details
20
Extensión y corrección semi-automática de léxicos morfo-sintácticos
In: 24th edition of the conference of the Spanish Society for Natural Language Processing (SEPLN 2008) ; https://hal.inria.fr/inria-00553523 ; 24th edition of the conference of the Spanish Society for Natural Language Processing (SEPLN 2008), El Advanced Database research group, LaBDA, Sep 2008, Madrid, España (2008)
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
25
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern