DE eng

Search in the Catalogues and Directories

Hits 1 – 7 of 7

1
Mapping Languages and Demographics with Georeferenced Corpora
Abstract: This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sources, against ground-truth population and language-census datasets. The goal is to determine (i) which dataset best represents population demographics; (ii) in what parts of the world the datasets are most representative of actual populations; and (iii) how to weight the datasets to provide more accurate representations of underlying populations. The paper finds that the two datasets represent very different populations and that they correlate with actual populations with values of r = 0.60 (social media) and r = 0.49 (web-crawled). Further, Twitter data makes better predictions about the inventory of languages used in each country.
Keyword: Communication and Culture::2004 - Linguistics::200402 - Computational Linguistics; communication and culture::4704 - Linguistics::470406 - Historical; comparative and typological linguistics; crowdsourcing; demographics; Field of Research::16 - Studies in Human Society::1603 - Demography::160399 - Demography not elsewhere classified; Field of Research::16 - Studies in Human Society::1604 - Human Geography::160403 - Social and Cultural Geography; Field of Research::20 - Language; Fields of Research::47 - Language; language; population; user-generated content
URL: http://hdl.handle.net/10092/17132
BASE
Hide details
2
Juxtaposing thematic regions derived from spatial and platial user-generated content
McKenzie, Grant; Adams, Ben. - : Schloss Dagstuhl -- Leibniz-Zentrum fur Informatik, 2017
BASE
Show details
3
The observational roots of reference of the semantic web
BASE
Show details
4
Semantic Referencing - Determining Context Weights for Similarity Measurement
BASE
Show details
5
On the Geo-Indicativeness of Non-Georeferenced Text
BASE
Show details
6
Semantic Signatures for Places of Interest
BASE
Show details
7
Effects of Pattern, Spatial Frequency, Number, and Rate of Stimulus Presentatton on the Accuracy of Detection
In: Perceptual & motor skills. - Thousand Oaks, CA : SAGE Publications 88 (1999) 2, 693-700
OLC Linguistik
Show details

Catalogues
0
0
1
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
6
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern