1. Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast
In: ISSN: 1430-0532 ; EISSN: 1613-415X ; Linguistic Typology ; https://hal.archives-ouvertes.fr/hal-03643365 ; Linguistic Typology, De Gruyter, In press, ⟨10.1515/lingty-2021-0025⟩ (2022)
2. Situative strategies and constructions in European languages ...
3. СОПОСТАВИТЕЛЬНЫЙ АНАЛИЗ ГЛАГОЛОВ ДВИЖЕНИЯ В ДАРГИНСКОМ И РУССКОМ ЯЗЫКАХ ... : COMPARATIVE ANALYSIS OF VERBS OF MOVEMENT IN DARGIN AND RUSSIAN LANGUAGES ...
5. Frequency, Informativity and Word Length: Insights from Typologically Diverse Corpora
In: Entropy; Volume 24; Issue 2; Pages: 280 (2022)
Abstract:
Zipf’s law of abbreviation, which posits a negative correlation between word frequency and length, is one of the most famous and robust cross-linguistic generalizations. At the same time, it has been shown that contextual informativity (average surprisal given previous context) is more strongly correlated with word length, although this tendency is not observed consistently and depends on several methodological choices. The present study examines a more diverse sample of languages than previous studies (Arabic, Finnish, Hungarian, Indonesian, Russian, Spanish and Turkish). I use large web-based corpora from the Leipzig Corpora Collection to estimate word lengths in UTF-8 characters and in phonemes (for some of the languages), as well as word frequency, informativity given the previous word and informativity given the next word, applying different methods of bigram processing. The results show different correlations between word length and the corpus-based measures for different languages. I argue that these differences can be explained by the properties of noun phrases in a language, most importantly by the order of heads and modifiers and their relative morphological complexity, as well as by orthographic conventions.
Keyword:
corpora; frequency; informativity; linguistic typology; n-grams; Zipf’s law of abbreviation
URL: https://doi.org/10.3390/e24020280
6. LEXICO-GRAMMATICAL RESOURCES OF FUNCTIONAL EQUIVALENCE IN THE TRANSLATION OF TEXTS FROM ENGLISH INTO UZBEK ...
8. Introduction to Armenian linguistics in an areal perspective ...
10. Un cas d’exemple dictionnairique ... : Les pragmatèmes de signalisation ...
14. Supplementary Materials for 'Measuring and assessing indeterminacy and variation in the morphology-syntax distinction' ...
16. Generating Samples of Diasporic Minority Populations: A Chilean Example
In: Targeting International Audiences: Current and Future Approaches to International Broadcasting Research ; 3 ; CIBAR Proceedings ; 138-149 ; Conference of International Broadcasters' Audience Research Services (CIBAR) ; XX (2022)
18. Per una sistemica diacronica delle lingue romanze: aspetti teorici, applicativi e ipotesi sulla memoria delle lingue
Begioni, L. Napoli: La scuola di Pitagora editrice, 2022.
19. Structural and semantic congruence of Bulgarian, Russian and English set expressions: Contrastive-typological research
In: Russian Journal of Linguistics, Vol 26, Iss 1, Pp 95-115 (2022)
20. Phylogenetic trees: Grammar versus vocabulary
In: Russian Journal of Linguistics, Vol 26, Iss 1, Pp 31-50 (2022)