DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7 8 9
Hits 61 – 80 of 162

61
Effects of lexical properties on viewing time per word in autistic and neurotypical readers
In: 69 ; 1 ; 158 ; 167 (2017)
BASE
Show details
62
Corpus-driven description of multi-word patterns
Steyer, Kathrin [Verfasser]; Corpas Pastor, Gloria [Herausgeber]; Monti, Johanna [Herausgeber]. - Mannheim : Institut für Deutsche Sprache, Bibliothek, 2016
DNB Subject Category Language
Show details
63
Improving translation memory matching and retrieval using paraphrases
In: 30 ; 1 ; 19 ; 40 (2016)
BASE
Show details
64
Accessible texts for autism: an eye-tracking study
Yaneva, Victoria; Temnikova, Irina; Mitkov, Ruslan Prof.. - : Association of Computing Machinery, 2016
BASE
Show details
65
Computational Phraseology light: automatic translation of multiword expressions without translation resources
Mitkov, Ruslan. - : De Gruyter Mouton, 2016
Abstract: This paper describes the first phase of a project whose ultimate goal is the implementation of a practical tool to support the work of language learners and translators by automatically identifying multiword expressions (MWEs) and retrieving their translations for any pair of languages. The task of translating multiword expressions is viewed as a two-stage process. The first stage is the extraction of MWEs in each of the languages; the second stage is a matching procedure for the extracted MWEs in each language which proposes the translation equivalents. This project pursues the development of a knowledge-poor approach for any pair of languages which does not depend on translation resources such as dictionaries, translation memories or parallel corpora which can be time consuming to develop or difficult to acquire, being expensive or proprietary. In line with this philosophy, the methodology developed does not rely on any dictionaries or parallel corpora, nor does it use any (bilingual) grammars. The only information comes from comparable corpora, inexpensively compiled. The first proofof- concept stage of this project covers English and Spanish and focuses on a particular subclass of MWEs: verb-noun expressions (collocations) such as take advantage, make sense, prestar atención and tener derecho. The choice of genre was determined by the fact that newswire is a widespread genre and available in different languages. An additional motivation was the fact that the methodology was developed as language independent with the objective of applying it to and testing it for different languages. The ACCURAT toolkit (Pinnis et al. 2012; Skadina et al. 2012; Su and Babych 2012a) was employed to compile automatically the comparable corpora and documents only above a specific threshold were considered for inclusion. More specifically, only pairs of English and Spanish documents with comparability score (cosine similarity) higher 0.45 were extracted. Statistical association measures were employed to quantify the strength of the relationship between two words and to propose that a combination of a verb and a noun above a specific threshold would be a (candidate for) multiword expression. This study focused on and compared four popular and established measures along with frequency: Log-likelihood ratio, T-Score, Log Dice and Salience. This project follows the distributional similarity premise which stipulates that translation equivalents share common words in their contexts and this applies also to multiword expressions. The Vector Space Model is traditionally used to represent words with their co-occurrences and to measure similarity. The vector representation for any word is constructed from the statistics of the occurrences of that word with other specific/context words in a corpus of texts. In this study, the word2vec method (Mikolov et al. 2013) was employed. Mikolov et al.’s method utilises patterns of word co-occurrences within a small window to predict similarities among words. Evaluation results are reported for both extracting MWEs and their automatic translation. A finding of the evaluation worth mentioning is that the size of the comparable corpora is more important for the performance of automatic translation of MWEs than the similarity between them as long as the comparable corpora used are of minimal similarity.
Keyword: Computational Linguistics; Multiword Expressions; Phraseology
URL: http://hdl.handle.net/2436/620324
https://doi.org/10.1515/phras-2016-0008
BASE
Hide details
66
Language resources for Italian: Towards the development of a corpus of annotated Italian multiword expressions
BASE
Show details
67
Statistical Language and Speech Processing : Third International Conference, SLSP 2015, Budapest, Hungary, November 24-26, 2015, Proceedings
Dediu, Adrian-Horia [Herausgeber]; Martín-Vide, Carlos [Herausgeber]; Mitkov, Ruslan [Herausgeber]. - Cham : Springer International Publishing, 2015
DNB Subject Category Language
Show details
68
The Role of Corpus Pattern Analysis in Machine Translation Evaluation
BASE
Show details
69
A machine-learning approach to negation and speculation detection for sentiment analysis
In: 67 ; 9 ; 2118 ; 2136 (2015)
BASE
Show details
70
Simple or not simple? A readability question
Štajner, Sanja; Mitkov, Ruslan; Corpas Pastor, Gloria. - : Springer International Publishing, 2015
BASE
Show details
71
Anaphora Resolution
Mitkov, Ruslan. - Florence : Taylor and Francis, 2014
UB Frankfurt Linguistik
Show details
72
The Oxford handbook of computational linguistics
Mitkov, Ruslan (Herausgeber). - Oxford : Oxford University Press, 2014
UB Frankfurt Linguistik
Show details
73
The Oxford handbook of computational linguistics
Mitkov, Ruslan. - Oxford : Oxford University Press, 2014-
MPI für Psycholinguistik
Show details
74
The Oxford handbook of computational linguistics
Mitkov, Ruslan (Hrsg.). - 2nd edition. - Oxford [u.a.] : Oxford Univ. Press, 2014
IDS Mannheim
Show details
75
Simple or not simple? A readability question
In: 379 ; 398 (2014)
BASE
Show details
76
Natural Language Processing methodology for tracking diachronic changes in the 20th century English language
Štajner, Sanja; Mitkov, Ruslan; Leech, Geoffrey. - : Equinox Publishing, 2014
BASE
Show details
77
The fewer, the better? A contrastive ctudy about ways to simplify
Mitkov, Ruslan; Štajner, Sanja. - : Association for Computational Linguistics, 2014
BASE
Show details
78
Statistical language and speech processing : first international conference ; proceedings
Dediu, Adrian-Horia [Herausgeber]; Martín-Vide, Carlos [Herausgeber]; Mitkov, Ruslan [Herausgeber]. - 2013
DNB Subject Category Language
Show details
79
Statistical Language and Speech Processing : First International Conference, SLSP 2013, Tarragona, Spain, July 29-31, 2013, Proceedings
Dediu, Adrian-Horia; Martín-Vide, Carlos; Mitkov, Ruslan. - Berlin, Heidelberg : Springer Berlin Heidelberg, 2013
UB Frankfurt Linguistik
Show details
80
Statistical language and speech processing : first international conference ; proceedings
Dediu, Adrian-Horia (Hrsg.); Martín-Vide, Carlos (Hrsg.); Mitkov, Ruslan (Hrsg.). - Heidelberg [u.a.] : Springer, 2013
UB Frankfurt Linguistik
Show details

Page: 1 2 3 4 5 6 7 8 9

Catalogues
16
4
12
0
8
0
5
Bibliographies
23
0
1
0
0
0
0
3
8
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
91
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern