
Search in the Catalogues and Directories

Hits 1 – 20 of 74

1
Data and R codes for Exploring diachronic salience of emotion metaphors: A contrastive study of HAPPINESS metaphors in Classical Malay and Indonesian ...
Rajeg, Gede Primahadi Wijaya. - : Open Science Framework, 2021
BASE
2
Constructional equivalence in the Indonesian translations of ROB and STEAL ...
Rajeg, Gede Primahadi Wijaya. - : Open Science Framework, 2021
BASE
3
corplingr ...
Rajeg, Gede Primahadi Wijaya. - : Open Science Framework, 2021
BASE
4
paracorp ...
Rajeg, Gede Primahadi Wijaya. - : Open Science Framework, 2021
BASE
5
Corpora e storia della lingua ...
Cortelazzo, Michele A.. - : EUT Edizioni Università di Trieste, 2021
BASE
6
Le particelle razve e neuželi alla luce del Corpus parallelo russo-italiano
Noseda, Valentina (ORCID: 0000-0002-5148-1241). - : Aracne, Roma (ITA), 2021
BASE
7
Corpora e storia della lingua
Cortelazzo, Michele A.. - : EUT Edizioni Università di Trieste, 2021
BASE
8
ERRATAS database of editorial principles and practices in printed editions of historical correspondence ...
BASE
9
ERRATAS database of editorial principles and practices in printed editions of historical correspondence ...
BASE
10
Editions and other sources used in the Corpora of Early English Correspondence (CEEC-400) ...
Kaislaniemi, Samuli. - : Zenodo, 2020
BASE
11
Editions and other sources used in the Corpora of Early English Correspondence (CEEC-400) ...
Kaislaniemi, Samuli. - : Zenodo, 2020
BASE
12
ПРИМЕНЕНИЕ КОРПУСНЫХ МЕТОДОВ ДЛЯ ЛЕКСИЧЕСКОГО АНАЛИЗА СПЕЦИАЛИЗИРОВАННЫХ КОРПУСОВ ... : APPLICATION OF CORPUS METHODS FOR LEXICAL ANALYSIS OF SPECIALIZED CORPORA ...
BASE
13
The Creation and Complexity Analysis of a Corpus of Educational Materials in Irish (EduGA)
Ó MEACHAIR, MÍCHEÁL JOHN. - : Trinity College Dublin. School of Linguistic Speech & Comm Sci. C.L.C.S., 2020
BASE
14
The New Chinese Corpus of Literary Texts Litchi
In: Acta Linguistica Asiatica, Vol 10, Iss 2 (2020)
BASE
15
Size of corpora and collocations: The case of Russian
In: Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, Vol 8, Iss 2 (2020)
BASE
16
ТЕХНОЛОГИЯ ПАРАЛЛЕЛЬНЫХ КОРПУСОВ ТЕКСТОВ И ЕЕ ИСПОЛЬЗОВАНИЕ В ПРОЦЕССЕ ОБУЧЕНИЯ ПЕРЕВОДУ ... : THE TECHNOLOGY OF PARALLEL TEXT CORPORA AND ITS USE IN THE PROCESS OF TRANSLATION TRAINING ...
Авдеев А.А.. - : НАУЧНЫЙ ЖУРНАЛ СОВРЕМЕННЫЕ ЛИНГВИСТИЧЕСКИЕ И МЕТОДИКО-ДИДАКТИЧЕСКИЕ ИССЛЕДОВАНИЯ, 2019
BASE
17
From a Smoking Gun to Spent Fuel: Principled Subsampling Methods for Building Big Language Data Corpora from Monitor Corpora
In: Data ; Volume 4 ; Issue 2 (2019)
Abstract: With the influence of Big Data culture on qualitative data collection, acquisition, and processing, it is becoming increasingly important that social scientists understand the complexity underlying data collection and the resulting models and analyses. Systematic approaches for creating computationally tractable models need to be employed in order to create representative, specialized reference corpora subsampled from Big Language Data sources. Even more importantly, any such method must be tested and vetted for its reproducibility and consistency in generating a representative model of a particular population in question. This article considers and tests one such method for Big Language Data downsampling of digitally accessible language data to determine both how to operationalize this form of corpus model creation, as well as testing whether the method is reproducible. Using the U.S. Nuclear Regulatory Commission’s public documentation database as a test source, the sampling method’s procedure was evaluated to assess variation in the rate of which documents were deemed fit for inclusion or exclusion from the corpus across four iterations. After performing multiple sampling iterations, the approach pioneered by the Tobacco Documents Corpus creators was deemed to be reproducible and valid using a two-proportion z-test at a 99% confidence interval at each stage of the evaluation process – leading to a final mean rejection ratio of 23.5875 and variance of 0.891 for the documents sampled and evaluated for inclusion into the final text-based model. The findings of this study indicate that such a principled sampling method is viable, thus necessitating the need for an approach for creating language-based models that account for extralinguistic factors and linguistic characteristics of documents.
Keyword: big data; corpus linguistics; databases; documentary analysis; government regulation; language data; language modeling; monitor corpora; nuclear power; tobacco documents
URL: https://doi.org/10.3390/data4020048
BASE
18
To be or not to be .? Analysis of misuses with be in a learner corpus in an online environment ; ¿Ser o no ser.? Análisis de errores con to be en un corpus de estudiantes en un entorno online
Castillo Rodríguez, Cristina (1); López Pérez, Sidoni (1). - : Revista de Filologia de la Universidad de la Laguna, 2019
BASE
19
THE ROLE OF ELECTRONIC CORPORA IN TRANSLATION TRAINING
In: Studies in Linguistics, Culture, and FLT, Vol 7, Pp 48-58 (2019)
BASE
20
Eesti keele kui teise keele õpikute lausete analüüs ja selle rakendamine eri keeleoskustasemete sõnastike näitelausete automaatsel valikul
In: Eesti Rakenduslingvistika Ühingu Aastaraamat, Vol 15, Pp 99-119 (2019)
BASE


© 2013 - 2024 Lin|gu|is|tik