1 |
Frequency, Informativity and Word Length: Insights from Typologically Diverse Corpora
|
|
|
|
In: Entropy; Volume 24; Issue 2; Pages: 280 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Multi-word units (and tokenization more generally): a multi-dimensional and largely information-theoretic approach
|
|
|
|
In: Lexis: Journal in English Lexicology, Vol 19 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Meta-Learner for Amharic Sentiment Classification
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 18 (2021)
|
|
BASE
|
|
Show details
|
|
5 |
You are kidding right? The English present progressive as a stance marker in film dialogue ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
An interactive visualization of Google Books Ngrams with R and Shiny : exploring a(n) historical increase in onset strength in a(n) huge database
|
|
|
|
BASE
|
|
Show details
|
|
7 |
An interactive visualization of Google Books Ngrams with R and Shiny : exploring a(n) historical increase in onset strength in a(n) huge database
|
|
|
|
BASE
|
|
Show details
|
|
8 |
DIGITAL TECHNOLOGIES FOR GRAMMATICAL ERROR CORRECTION: DEEP LEARNING METHODS & SYNTACTIC N-GRAMS
|
|
|
|
In: Мова; No. 35 (2021) ; Мова; № 35 (2021) ; 2414-9489 ; 2307-4558 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
You are kidding right? The English present progressive as a stance marker in film dialogue
|
|
|
|
In: Lingue e Linguaggi; Volume 44(2021); 183-202 (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Visualizing the development of prose styles in Horse Manuals from Early Modern English to Present-Day English
|
|
|
|
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-02283138 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2020, Special Issue Visualisations in Historical Linguistics, Special issue on Visualisations in Historical Linguistics, pp.1-33 (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Frequency lists of character-level n-grams from the GOS 1.0 corpus 1.1
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Frequency lists of word-level n-grams from the GOS 1.0 corpus 1.1
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Visualizing the development of prose styles in Horse Manuals from Early Modern English to Present-Day English
|
|
|
|
In: Journal of Data Mining and Digital Humanities, Vol Special issue on Visualisations in Historical Linguistics (2020) (2020)
|
|
BASE
|
|
Show details
|
|
16 |
An interactive visualization of Google Books Ngrams with R and Shiny: Exploring a(n) historical increase in onset strength in a(n) huge database
|
|
|
|
In: Journal of Data Mining and Digital Humanities, Vol Special issue on Visualisations in Historical Linguistics (2020) (2020)
|
|
Abstract:
International audience Using the re-emergence of the /h/ onset from Early Modern to Present-Day English as a case study, we illustrate the making and the functions of a purpose-built web application named (an:a) lyzer for the interactive visualization of the raw n-gram data provided by Google Books Ngrams (GBN). The database has been compiled from the full text of over 4.5 million books in English, totalling over 468 billion words and covering roughly five centuries. We focus on bigrams consisting of words beginning with graphic preceded by the indefinite article allomorphs a and an, which serve as a diagnostic of the consonantal strength of the initial /h/. The sheer size of this database affords us the possibility to attain a maximal diachronic resolution, to distinguish highly specific groups of -initial lexical items, and even to trace the diffusion of the observed changes across individual lexical units. The functions programmed into the app enable us to explore the data interactively by filtering, selecting and viewing them according to various parameters that were manually annotated into the data frame. We also discuss limitations of the database, of the app and of the explorative data analysis. The app is publicly accessible online at https://osf.io/ht8se/.
|
|
Keyword:
[shs.langue]humanities and social sciences/linguistics; AZ20-999; Bibliography. Library science. Information resources; corpus linguistics; data visualization; google books; google books ngrams; historical linguistics; historical phonology; History of scholarship and learning. The humanities; n-grams; r; shiny; Z
|
|
URL: https://doaj.org/article/b54c8cc339bb4b8a9e4ba8f849b398a2
|
|
BASE
|
|
Hide details
|
|
17 |
The necessity modals have to, must, need to and should: using n-grams to help identify common and distinct semantic and pragmatic aspects. 11.2: 220-243
|
|
|
|
In: ISSN: 1876-1933 ; EISSN: 1876-1941 ; Constructions and Frames ; https://hal.archives-ouvertes.fr/hal-02369306 ; Constructions and Frames, John Benjamins, 2019, 11, pp.220 - 243. ⟨10.1075/cf.00029.cap⟩ (2019)
|
|
BASE
|
|
Show details
|
|
18 |
The necessity modals have to, must, need to and should: using n-grams to help identify common and distinct semantic and pragmatic aspects
|
|
|
|
In: ISSN: 1876-1933 ; EISSN: 1876-1941 ; Constructions and Frames ; https://hal.archives-ouvertes.fr/hal-02501498 ; Constructions and Frames, John Benjamins, 2019, 11 (2), pp.220-243. ⟨10.1075/cf.00029.cap⟩ (2019)
|
|
BASE
|
|
Show details
|
|
20 |
Dependency tree extraction tool STARK 1.0
|
|
Krsnik, Luka; Dobrovoljc, Kaja; Robnik-Šikonja, Marko. - : Centre for Language Resources and Technologies, University of Ljubljana, 2019. : Faculty of Arts, University of Ljubljana, 2019. : Faculty of Computer and Information Science, University of Ljubljana, 2019
|
|
BASE
|
|
Show details
|
|
|
|