1 |
Determining Tone of a Body of Text
|
|
|
|
In: Senior Projects Spring 2020 (2020)
|
|
BASE
|
|
Show details
|
|
3 |
Automatic Poetry Classification and Chronological Semantic Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
BanglaEmotion: A Benchmark Dataset for Bangla Textual Emotion Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
BanglaEmotion: A Benchmark Dataset for Bangla Textual Emotion Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes – the National Corpus of Contemporary Welsh ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Computer-Assisted Language Comparison: State of the Art
|
|
|
|
In: Journal of Open Humanities Data; Vol 6 (2020); 2 ; 2059-481X (2020)
|
|
BASE
|
|
Show details
|
|
10 |
LIWC and n-gram counts of English and Dutch novels ...
|
|
|
|
Abstract:
This dataset consists of CSV files with word counts in several corpora: - 694 English language novels from different genders and orientations - 401 bestselling Dutch language novels - 50 novels nominated for Dutch literary prizes Each corpus comes with: - LIWC counts; this file also includes the available metadata for each novel. The English data was created with LIWC 2015. The Dutch data was created with the validated translation of LIWC 2001. - Word counts (unigrams) and bigram counts per novel. All text has been converted to lowercase. Contractions are tokenized into separate tokens, e.g., can't => ca n't Two restrictions are applied: - only unigrams or bigrams that occur in at least 10 texts are retained - only the 100k most frequent are retained - Overall word counts and bigram counts; i.e., the sum across all novels. All files are encoded in UTF-8. ...
|
|
Keyword:
Computational Linguistics; Literature; Psycholinguistics
|
|
URL: https://dx.doi.org/10.17632/tmp32v54ss.1 https://data.mendeley.com/datasets/tmp32v54ss/1
|
|
BASE
|
|
Hide details
|
|
11 |
Data for: Is buttercup a kind of cup? Hyponymy and semantic transparency in compound words ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Psycholinguistic LIWC and n-gram counts in a corpus of 1145 English and Dutch novels ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Data for: Is buttercup a kind of cup? Hyponymy and semantic transparency in compound words ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Psycholinguistic LIWC and n-gram counts in a corpus of 1145 English and Dutch novels ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Das Runde muss ins Eckige – Ein Streifzug durch die Sprache des Fußballs ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Das Runde muss ins Eckige – Ein Streifzug durch die Sprache des Fußballs ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Das Runde muss ins Eckige – Ein Streifzug durch die Sprache des Fußballs ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Free Software Tools for Computational Linguistics: An Overview ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|