2 |
Using Global Constraints and Reranking to Improve Cognates Detection ...
BASE
3 |
Acquisition of Translation Lexicons for Historically Unwritten Languages via Bridging Loanwords ...
5 |
Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection ...
6 |
Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection
8 |
Statistical modality tagging from rule-based annotations and crowdsourcing ...
9 |
Use of Modality and Negation in Semantically-Informed Syntactic MT ...
10 |
Annotating Cognates and Etymological Origin in Turkic Languages ...
11 |
Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language ...
12 |
A random forest system combination approach for error detection in digital dictionaries ...
13 |
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation ...
15 |
Taking into Account the Differences between Actively and Passively Acquired Data: The Case of Active Learning with Support Vector Machines for Imbalanced Datasets ...
Abstract:
Actively sampled data can have very different characteristics than passively sampled data. Therefore, it's promising to investigate using different inference procedures during AL than are used during passive learning (PL). This general idea is explored in detail for the focused case of AL with cost-weighted SVMs for imbalanced data, a situation that arises for many HLT tasks. The key idea behind the proposed InitPA method for addressing imbalance is to base cost models during AL on an estimate of overall corpus imbalance computed via a small unbiased sample rather than the imbalance in the labeled training data, which is the leading method used during PL. ... : 4 pages, 5 figures; appeared in Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pages 137-140, Boulder, Colorado, June 2009. Association for Computational Linguistics ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; I.2.6; I.2.7; I.5.1; I.5.4; Machine Learning cs.LG; Machine Learning stat.ML
URL: https://dx.doi.org/10.48550/arxiv.1409.4835 https://arxiv.org/abs/1409.4835
16 |
Rapid Adaptation of POS Tagging for Domain Specific Uses ...
17 |
Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling ...
19 |
Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach ...