Home
Catalogue search
Refine your search:
Keyword:
004 Informatik (2)
020 Bibliotheks- und Informationswissenschaft (2)
400 Sprachwissenschaft (1)
400 Sprachwissenschaft, Linguistik (1)
Linguistik (1)
Sentiment Analysis, German, Lexicon-based Sentiment Analysis, Corpus, Evaluation (1)
ddc:004 (1)
ddc:020 (1)
ddc:400 (1)
Creator / Publisher
Year:
2021 (2)
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 2 of 2
1
Lexicon-based Sentiment Analysis in German: Systematic Evaluation of Resources and Preprocessing Techniques ...
Fehle, Jakob
;
Schmidt, Thomas
;
Wolff, Christian
. - : Universität Regensburg, 2021
BASE
Show details
2
Lexicon-based Sentiment Analysis in German: Systematic Evaluation of Resources and Preprocessing Techniques
Schmidt, Thomas
;
Wolff, Christian
;
Fehle, Jakob
. - : KONVENS 2021 Organizers, 2021
Abstract:
We present the results of an evaluation study in the context of lexicon-based sentiment analysis resources for German texts. We have set up a comprehensive compilation of 19 sentiment lexicon resources and 20 sentiment-annotated corpora available for German across multiple domains. In addition to the evaluation of the sentiment lexicons we also investigate the influence of the following preprocessing steps and modifiers: stemming and lemmatization, part-of-speech-tagging, usage of emoticons, stop words removal, usage of valence shifters, intensifiers, and diminishers. We report the best performing lexicons as well as the influence of preprocessing steps and other modifications on average performance across all corpora. We show that larger lexicons with continuous values like SentiWS and SentiMerge perform best across the domains. The best performing configuration of lexicon and modifications considering the f1-value and accuracy averages across all corpora achieves around 67%. Preprocessing, especially stemming or lemmatization increases the performance consistently on average around 6% and for certain lexicons and configurations up to 16.5% while methods like the usage of valence shifters, intensifiers or diminishers rarely influence overall performance. We discuss domain-specific differences and give recommendations for the selection of lexicons, preprocessing and modifications.
Keyword:
004 Informatik
;
020 Bibliotheks- und Informationswissenschaft
;
400 Sprachwissenschaft
;
ddc:004
;
ddc:020
;
ddc:400
;
Linguistik
URL:
https://epub.uni-regensburg.de/50833/1/2021.konvens-1.8%20%281%29.pdf
https://epub.uni-regensburg.de/50833/
https://aclanthology.org/2021.konvens-1.8
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
2
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern