1 |
GECCC Grammar Error Correction Corpus for Czech
Abstract:
The Grammar Error Correction Corpus for Czech (GECCC) consists of 83 058 sentences and covers four diverse domains: essays written by native students, informal website texts, essays written by Romani ethnic minority children and teenagers, and essays written by nonnative speakers. All domains are professionally annotated for GEC errors in a unified manner, and the errors were automatically categorized with a Czech-specific version of ERRANT released at https://github.com/ufal/errant_czech. The dataset was introduced in the paper "Czech Grammar Error Correction with a Large and Diverse Corpus", which was accepted to TACL. Until it is published in TACL, see the arXiv version: https://arxiv.org/pdf/2201.05590.pdf
Keyword:
dataset; gec; grammatical error correction
URL: http://hdl.handle.net/11234/1-4639
BASE
7 |
NameTag 2
Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2021
8 |
RobeCzech: Czech RoBERTa, a monolingual contextualized language representation model ...
9 |
Universal Dependencies 2.5 Models for UDPipe (2019-12-06)
Straka, Milan; Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2020
12 |
Universal Dependencies 2.4 Models for UDPipe (2019-05-31)
Straka, Milan; Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2019
13 |
UDPipe at SIGMORPHON 2019: Contextualized Embeddings, Regularization with Morphological Categories, Corpora Merging ...
14 |
ÚFAL MRPipe at MRP 2019: UDPipe Goes Semantic in the Meaning Representation Parsing Shared Task ...
15 |
Universal Dependencies 2.3 Models for UDPipe (2018-11-15)
Straka, Milan; Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2018
16 |
Universal Dependencies 2.0 Models for UDPipe (2017-08-01)
Straka, Milan; Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2017
18 |
Czech Models (MorfFlex CZ 160310 + PDT 3.0) for MorphoDiTa 160310
Straka, Milan; Straková, Jana. Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2016
19 |
WordSim353-cs: Evaluation Dataset for Lexical Similarity and Relatedness, based on WordSim353