1 |
Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures ...
|
|
|
|
Abstract:
This paper measures similarity both within and between 84 language varieties across nine languages. These corpora are drawn from digital sources (the web and tweets), allowing us to evaluate whether such geo-referenced corpora are reliable for modelling linguistic variation. The basic idea is that, if each source adequately represents a single underlying language variety, then the similarity between these sources should be stable across all languages and countries. The paper shows that there is a consistent agreement between these sources using frequency-based corpus similarity measures. This provides further evidence that digital geo-referenced corpora consistently represent local language varieties. ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2104.01294 https://arxiv.org/abs/2104.01294
|
|
BASE
|
|
Hide details
|
|
4 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Global Syntactic Variation in Seven Languages: Towards a Computational Dialectology ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Modeling Global Syntactic Variation in English Using Dialect Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Mapping Languages and Demographics with Georeferenced Corpora
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Global Syntactic Variation in Seven Languages: Toward a Computational Dialectology
|
|
|
|
In: Front Artif Intell (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Modeling the Complexity and Descriptive Adequacy of Construction Grammars
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Learnability and falsifiability of Construction Grammars
|
|
|
|
In: Proceedings of the Linguistic Society of America; Vol 2 (2017): Proceedings of the Linguistic Society of America; 1:1–15 ; 2473-8689 (2017)
|
|
BASE
|
|
Show details
|
|
19 |
The Linguistic Status of Predictions and Feature Ranks from SVM Text Classifiers
|
|
|
|
In: LSA Annual Meeting Extended Abstracts; Vol 6: LSA Annual Meeting Extended Abstracts 2015; 5:1-5 ; 2377-3367 (2015)
|
|
BASE
|
|
Show details
|
|
|
|