8 |
EVALD – a Pioneer Application for Automated Essay Scoring in Czech
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics, Vol 113, Iss 1, Pp 9-30 (2019)
|
|
Abstract:
In this paper, we present the EVALD applications (Evaluator of Discourse) for automated essay scoring. EVALD is the first tool of this type for Czech. It evaluates texts written by both native and non-native speakers of Czech. We first describe the history and present state of automated essay scoring, illustrated by examples of systems for other languages, mainly English. We then focus on the methodology of creating the EVALD applications and describe the datasets used for testing as well as for the supervised training that EVALD builds on. Furthermore, we analyze in detail a sample of newly acquired language data (texts written by non-native speakers reaching the threshold level of Czech language acquisition required, e.g., for permanent residence in the Czech Republic) and focus on linguistic differences between the available text levels. We present the feature set used by EVALD and, based on the analysis, extend it with new spelling features. Finally, we evaluate the overall performance of several variants of EVALD and analyze the collected results.
|
|
Keyword:
Computational linguistics. Natural language processing; P98-98.5
|
|
URL: https://doaj.org/article/e1c0cb0354a84ea286e7fb354464c96a https://doi.org/10.2478/pralin-2019-0004
|
|
BASE
|
|
|
|
12 |
Primary and secondary discourse connectives: definitions and lexicons
|
|
|
|
In: Dialogue & Discourse; Vol 9 No 1 (2018); 50-78; 2152-9620
|
|
BASE
|
|
|
|
16 |
Studying text coherence in Czech – a corpus-based analysis
|
|
|
|
In: Topics in Linguistics, Vol 18, Iss 2, Pp 36-47 (2017)
|
|
BASE
|
|
|
|
17 |
CzeDLex – A Lexicon of Czech Discourse Connectives
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics, Vol 109, Iss 1, Pp 61-91 (2017)
|
|
BASE
|
|
|
|