
Search in the Catalogues and Directories

Hits 1 – 14 of 14

1
Predicting lexical complexity in English texts: the Complex 2.0 dataset
Abstract: Identifying words which may cause difficulty for a reader is an essential step in most lexical text simplification systems prior to lexical substitution and can also be used for assessing the readability of a text. This task is commonly referred to as complex word identification (CWI) and is often modelled as a supervised classification problem. For training such systems, annotated datasets in which words and sometimes multi-word expressions are labelled regarding complexity are required. In this paper we analyze previous work carried out in this task and investigate the properties of CWI datasets for English. We develop a protocol for the annotation of lexical complexity and use this to annotate a new dataset, CompLex 2.0. We present experiments using both new and old datasets to investigate the nature of lexical complexity. We found that a Likert-scale annotation protocol provides an objective setting that is superior for identifying the complexity of words compared to a binary annotation protocol. We release a new dataset using our new protocol to promote the task of Lexical Complexity Prediction.
Rights: © 2022 The Authors. Published by Springer. This is an open access article available under a Creative Commons licence. Published version: https://doi.org/10.1007/s10579-022-09588-2
Keyword: complex word identification; lexical complexity; text simplification
URL: https://doi.org/10.1007/s10579-022-09588-2
http://hdl.handle.net/2436/624697
BASE
2
Investigating Text Simplification Evaluation ...
BASE
3
Investigating Text Simplification Evaluation ...
BASE
4
Predicting Lexical Complexity in English Texts ...
BASE
5
SemEval-2021 Task 1: Lexical Complexity Prediction ...
BASE
6
SemEval-2021 Task 1: Lexical Complexity Prediction ...
BASE
7
SemEval-2021 Task 1: Lexical Complexity Prediction ...
BASE
8
SemEval-2021 Task 1: Lexical Complexity Prediction ...
BASE
9
SemEval-2021 task 1: Lexical complexity prediction
Evans, Richard; Zampieri, Marcos; Shardlow, Matthew. - : Association for Computational Linguistics, 2021
BASE
10
CompLex: A New Corpus for Lexical Complexity Prediction from Likert Scale Data ...
BASE
11
Detecting Multiword Expression Type Helps Lexical Complexity Assessment ...
BASE
12
Multi-Word Lexical Simplification ...
BASE
13
Identification of research hypotheses and new knowledge from scientific literature ...
BASE
14
Identification of research hypotheses and new knowledge from scientific literature ...
BASE

Open access documents: 14