1 |
Classifying Bias in Large Multilingual Corpora via Crowdsourcing and Topic Modeling
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
A random forest system combination approach for error detection in digital dictionaries ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling ...
|
|
|
|
Abstract:
Dictionaries are often developed using tools that save to Extensible Markup Language (XML)-based standards. These standards often allow high-level repeating elements to represent lexical entries, and utilize descendants of these repeating elements to represent the structure within each lexical entry, in the form of an XML tree. In many cases, dictionaries are published that have errors and inconsistencies that are expensive to find manually. This paper discusses a method for dictionary writers to quickly audit structural regularity across entries in a dictionary by using statistical language modeling. The approach learns the patterns of XML nodes that could occur within an XML tree, and then calculates the probability of each XML tree in the dictionary against these patterns to look for entries that diverge from the norm. ... : 6 pages, 2 figures, 11 tables; appeared in Proceedings of Electronic Lexicography in the 21st Century (eLex), November 2011 ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; I.2.7; I.2.6; I.5.1; I.5.4; Machine Learning cs.LG
|
|
URL: https://dx.doi.org/10.48550/arxiv.1410.8149 https://arxiv.org/abs/1410.8149
|
|
BASE
|
|
Hide details
|
|
5 |
A random forest system combination approach for error detection in digital dictionaries
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Citation Handling: Processing Citation Texts in Scientific Documents
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Citation Handling for Improved Summarization of Scientific Documents
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Error Correction for Arabic Dictionary Lookup
|
|
|
|
In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valetta, 17 - 23 May 2010 (2010), 263-268
|
|
IDS OBELEX meta
|
|
Show details
|
|
15 |
Multiple Alternative Sentene Compressions as a Tool for Automatic Summarization Tasks
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Headline Generation for Written and Broadcast News
|
|
|
|
In: DTIC (2005)
|
|
BASE
|
|
Show details
|
|
17 |
Hedge Trimmer: A Parse-and-Trim Approach to Headline Generation
|
|
|
|
In: DTIC (2003)
|
|
BASE
|
|
Show details
|
|
|
|