1 | Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs ...
BASE
2 | Representations of Language Varieties Are Reliable Given Corpus Similarity Measures ...
BASE
3 | Measuring Linguistic Diversity During COVID-19 ...
Abstract:
Computational measures of linguistic diversity help us understand the linguistic landscape using digital language data. The contribution of this paper is to calibrate measures of linguistic diversity using restrictions on international travel resulting from the COVID-19 pandemic. Previous work has mapped the distribution of languages using geo-referenced social media and web data. The goal, however, has been to describe these corpora themselves rather than to make inferences about underlying populations. This paper shows that a difference-in-differences method based on the Herfindahl-Hirschman Index can identify the bias in digital corpora that is introduced by non-local populations. These methods tell us where significant changes have taken place and whether this leads to increased or decreased diversity. This is an important step in aligning digital corpora like social media with the real-world populations that have produced them. ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
URL: https://dx.doi.org/10.48550/arxiv.2104.01290 https://arxiv.org/abs/2104.01290
BASE
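The abstract for record 3 names a difference-in-differences method based on the Herfindahl-Hirschman Index (HHI) but does not spell out the setup. As a rough illustration only, the sketch below computes the HHI as the sum of squared language shares and then takes a simple two-group, two-period difference-in-differences. The choice of comparison group, the function names, and the counts are all hypothetical and are not taken from the paper.

```python
def hhi(counts):
    """Herfindahl-Hirschman Index over language counts.

    Sum of squared shares: 1.0 means a single language accounts for all
    observations; lower values mean the observations are spread more evenly.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no observations")
    return sum((n / total) ** 2 for n in counts.values())


def diff_in_diff(treated_before, treated_after, control_before, control_after):
    """Change in HHI for the treated group minus the change for the control group.

    Assumption: a plain two-group, two-period design. The paper's actual
    specification may differ.
    """
    treated_change = hhi(treated_after) - hhi(treated_before)
    control_change = hhi(control_after) - hhi(control_before)
    return treated_change - control_change


# Hypothetical counts of posts per language (not real data).
treated_before = {"en": 50, "es": 50}
treated_after = {"en": 75, "es": 25}
control_before = {"en": 50, "es": 50}
control_after = {"en": 50, "es": 50}

effect = diff_in_diff(treated_before, treated_after, control_before, control_after)
print(f"difference-in-differences in HHI: {effect:.3f}")
```

A positive value here means the treated location became more concentrated (less diverse) relative to the comparison location; a negative value means it became more diverse.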
4 | Learned Construction Grammars Converge Across Registers Given Increased Exposure ...
BASE
5 | Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction ...
BASE
6 | Global Syntactic Variation in Seven Languages: Towards a Computational Dialectology ...
BASE
8 | Learned Construction Grammars Converge Across Registers Given Increased Exposure
BASE
9 | Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction
BASE
10 | Representations of Language Varieties Are Reliable Given Corpus Similarity Measures
BASE
12 | Modeling Global Syntactic Variation in English Using Dialect Classification ...
BASE
13 | Mapping Languages and Demographics with Georeferenced Corpora
BASE
14 | Global Syntactic Variation in Seven Languages: Toward a Computational Dialectology
In: Front Artif Intell (2019)
BASE
17 | Modeling the Complexity and Descriptive Adequacy of Construction Grammars
In: Proceedings of the Society for Computation in Linguistics (2018)
BASE
18 | Learnability and falsifiability of Construction Grammars
In: Proceedings of the Linguistic Society of America; Vol 2 (2017): Proceedings of the Linguistic Society of America; 1:1–15 ; 2473-8689 (2017)
BASE
19 | The Linguistic Status of Predictions and Feature Ranks from SVM Text Classifiers
In: LSA Annual Meeting Extended Abstracts; Vol 6: LSA Annual Meeting Extended Abstracts 2015; 5:1-5 ; 2377-3367 (2015)
BASE