Home
Catalogue search
Refine your search:
Keyword:
Computational linguistics (3)
natural language processing (3)
Artificial intelligence (2)
Machine learning (2)
Language (1)
Linguistics (1)
ROC curve (1)
code switching (1)
decision tree learning (1)
error corpora (1)
more
Creator / Publisher:
Foster, Jennifer (3)
Wagner, Joachim (3)
Barman, Utsab (1)
Bryl, Anton (1)
Das, Amitava (1)
Le Roux, Joseph (1)
Samad Zadeh Kaljahi, Rasoul (1)
van Genabith, Josef (1)
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 3 of 3
1
Code mixing: a challenge for language identification in the language of social media
Barman, Utsab
;
Das, Amitava
;
Wagner, Joachim
...
In: Barman, Utsab, Das, Amitava orcid:0000-0003-3418-463X , Wagner, Joachim orcid:0000-0002-8290-3849 and Foster, Jennifer orcid:0000-0002-7789-4853 (2014) Code mixing: a challenge for language identification in the language of social media. In: First Workshop on Computational Approaches to Code Switching, 25 Oct 2014, Doha, Qatar. (2014)
BASE
Show details
2
DCU-Paris13 systems for the SANCL 2012 shared task
Le Roux, Joseph
;
Foster, Jennifer
;
Wagner, Joachim
...
In: Le Roux, Joseph, Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 , Samad Zadeh Kaljahi, Rasoul and Bryl, Anton (2012) DCU-Paris13 systems for the SANCL 2012 shared task. In: The NAACL 2012 First Workshop on Syntactic Analysis of Non-Canonical Language (SANCL), 7-8 Jun 2012, Montreal, Quebec, Canada. (2012)
BASE
Show details
3
Detecting grammatical errors with treebank-induced, probabilistic parsers
Wagner, Joachim
. - : Dublin City University. School of Computing, 2012
In: Wagner, Joachim orcid:0000-0002-8290-3849 (2012) Detecting grammatical errors with treebank-induced, probabilistic parsers. PhD thesis, Dublin City University. (2012)
Abstract:
Today's grammar checkers often use hand-crafted rule systems that define acceptable language. The development of such rule systems is labour-intensive and has to be repeated for each language. At the same time, grammars automatically induced from syntactically annotated corpora (treebanks) are successfully employed in other applications, for example text understanding and machine translation. At first glance, treebank-induced grammars seem to be unsuitable for grammar checking as they massively over-generate and fail to reject ungrammatical input due to their high robustness. We present three new methods for judging the grammaticality of a sentence with probabilistic, treebank-induced grammars, demonstrating that such grammars can be successfully applied to automatically judge the grammaticality of an input string. Our best-performing method exploits the differences between parse results for grammars trained on grammatical and ungrammatical treebanks. The second approach builds an estimator of the probability of the most likely parse using grammatical training data that has previously been parsed and annotated with parse probabilities. If the estimated probability of an input sentence (whose grammaticality is to be judged by the system) is higher by a certain amount than the actual parse probability, the sentence is flagged as ungrammatical. The third approach extracts discriminative parse tree fragments in the form of CFG rules from parsed grammatical and ungrammatical corpora and trains a binary classifier to distinguish grammatical from ungrammatical sentences. The three approaches are evaluated on a large test set of grammatical and ungrammatical sentences. The ungrammatical test set is generated automatically by inserting common grammatical errors into the British National Corpus. The results are compared to two traditional approaches, one that uses a hand-crafted, discriminative grammar, the XLE ParGram English LFG, and one based on part-of-speech n-grams. In addition, the baseline methods and the new methods are combined in a machine learning-based framework, yielding further improvements.
Keyword:
Artificial intelligence
;
Computational linguistics
;
decision tree learning
;
error corpora
;
error detection
;
grammar checker
;
Language
;
learner corpus
;
Linguistics
;
Machine learning
;
n-gram language models
;
natural language processing
;
precision grammar
;
probabilistic grammar
;
ROC curve
;
voting classifier
URL:
http://doras.dcu.ie/16776/
BASE
Hide details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
3
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern