
Search in the Catalogues and Directories

Hits 1 – 16 of 16

1
Generalized word shift graphs: a method for visualizing and explaining pairwise comparisons between texts
In: Springer Berlin Heidelberg (2021)
Abstract: A common task in computational text analyses is to quantify how two corpora differ according to a measurement like word frequency, sentiment, or information content. However, collapsing the texts’ rich stories into a single number is often conceptually perilous, and it is difficult to confidently interpret interesting or unexpected textual patterns without looming concerns about data artifacts or measurement validity. To better capture fine-grained differences between texts, we introduce generalized word shift graphs, visualizations which yield a meaningful and interpretable summary of how individual words contribute to the variation between two texts for any measure that can be formulated as a weighted average. We show that this framework naturally encompasses many of the most commonly used approaches for comparing texts, including relative frequencies, dictionary scores, and entropy-based measures like the Kullback–Leibler and Jensen–Shannon divergences. Through a diverse set of case studies ranging from presidential speeches to tweets posted in urban green spaces, we demonstrate how generalized word shift graphs can be flexibly applied across domains for diagnostic investigation, hypothesis generation, and substantive interpretation. By providing a detailed lens into textual shifts between corpora, generalized word shift graphs help computational social scientists, digital humanists, and other text analysis practitioners fashion more robust scientific narratives.
URL: https://hdl.handle.net/1721.1/131952
BASE
2
Storywrangler: A massive exploratorium for sociolinguistic, cultural, socioeconomic, and political timelines using Twitter
In: Sci Adv (2021)
BASE
3
How the world’s collective attention is being paid to a pandemic: COVID-19 related n-gram time series for 24 languages on Twitter
In: PLoS One (2021)
BASE
4
Storywrangler: A massive exploratorium for sociolinguistic, cultural, socioeconomic, and political timelines using Twitter ...
BASE
5
Fame and Ultrafame: Measuring and comparing daily levels of 'being talked about' for United States' presidents, their rivals, God, countries, and K-pop ...
BASE
6
English verb regularization in books and tweets
Gray, Tyler J.; Reagan, Andrew J.; Dodds, Peter Sheridan. - Public Library of Science, 2018
BASE
7
Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs [Journal]
Reagan, Andrew J. [author]; Danforth, Christopher M. [other]; Tivnan, Brian [other].
DNB Subject Category Language
8
Divergent discourse between protests and counter-protests: #BlackLivesMatter and #AllLivesMatter ...
BASE
9
Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs ...
BASE
10
Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs ...
BASE
11
Forecasting the onset and course of mental illness with Twitter data
Reece, Andrew G.; Reagan, Andrew J.; Lix, Katharina L. M. - Nature Publishing Group UK, 2017
BASE
12
The Lexicocalorimeter: Gauging public health through caloric input and output on social media
Alajajian, Sharon E.; Williams, Jake Ryland; Reagan, Andrew J. - Public Library of Science, 2017
BASE
13
Forecasting the onset and course of mental illness with Twitter data
Reece, Andrew G.; Reagan, Andrew J.; Lix, Katharina L. M. - Nature Publishing Group UK, 2017
BASE
14
Forecasting the onset and course of mental illness with Twitter data ...
BASE
15
The Lexicocalorimeter: Gauging public health through caloric input and output on social media
In: PLoS (2015)
BASE
16
Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift graphs ...
BASE