1. Neural MT and Human Post-editing: a Method to Improve Editorial Quality
In: ISSN: 1134-8941 ; Interlingüística ; https://halshs.archives-ouvertes.fr/halshs-03603590 ; Interlingüística, Alacant [Spain] : Universitat Autònoma de Barcelona, 2022, pp.15-36 (2022)
BASE

2. Human evaluation of three machine translation systems: from quality to attitudes by professional translators
BASE

3. Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
In: Articles (2018)
BASE

4. Human-Guided Evolutionary-Based Linguistics Approach For Automatic Story Generation ...
BASE

5. MSEE: Stochastic Cognitive Linguistic Behavior Models for Semantic Sensing
In: DTIC (2013)
BASE

6. English → Russian MT evaluation campaign
In: ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2013)
BASE

7. Human-Guided Evolutionary-Based Linguistics Approach For Automatic Story Generation
BASE

8. Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation
BASE

9. Task muddiness, intelligence metrics, and the necessity of autonomous mental development
In: http://www.cse.msu.edu/~cse841/papers/MuddyTasks.pdf (2009)
BASE

10. A Method for Stopping Active Learning Based on Stabilizing Predictions and the Need for User-Adjustable Stopping ...
BASE

11. A Method for Stopping Active Learning Based on Stabilizing Predictions and the Need for User-Adjustable Stopping
BASE

12. Automating Convoy Training Assessment to Improve Soldier Performance
In: DTIC (2008)
BASE

13. Differential Effect of Correct Name Translation on Human and Automated Judgments of Translation Acceptability: A Pilot Study
In: DTIC (2008)
Abstract:
This study proffers two important findings: (1) automated machine translation (MT) evaluation is insensitive to the cognitive gravitas of proper names, contributing to its weak modeling of human judgments of higher quality MT output, and (2) there is a "new" methodology that produces superior measurement of translation acceptability. Twenty Arabic sentences, each with average name density of 3.7 names in 22 words, were translated into English with a research-grade MT system, to produce a 20-output-sentence Control Stimulus Set. Manual correction of 25% of the name translations resulted in an Enhanced Stimulus Set. A Magnitude Estimation (ME) methodology task had each of two teams of five subjects judge Control and Enhanced Sets against human reference translations. As is customary in ME studies, subjects made direct numerical estimations of the magnitude of the stimuli, in this case the degree to which sentences in the Sets conveyed the meaning in the reference sentences. Average estimates for Control and Enhanced Sets were 4.57 and 6.16, respectively, a 34.8% difference. Automated evaluation with the Metric for Evaluation of Translation with Explicit word ORdering (METEOR) produced scores of .446 and .546, a 22% difference. ME detected a differential effect, a finding which suggests that weighting proper name rendering in automated evaluation systems may improve correlations with human judgments on higher quality output.
Keyword:
*COMPREHENSION; *HUMANS; *MACHINE TRANSLATION; *MAGNITUDE ESTIMATION; *METRICS; *NAME TRANSLATION; ARABIC LANGUAGE; AUTOMATED EVALUATION; ESTIMATES; FOREIGN LANGUAGES; HUMAN UNDERSTANDING; JUDGEMENT(PSYCHOLOGY); Linguistics; MACHINE TRANSLATION EVALUATION; PROPER NAMES; STIMULI
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA488000 http://www.dtic.mil/docs/citations/ADA488000
BASE
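The percentage differences quoted in the abstract of entry 13 can be checked with a few lines of arithmetic. The sketch below assumes that "difference" means the relative increase of the Enhanced Set over the Control Set; the abstract does not state the formula, and the function name `relative_increase` is hypothetical.

```python
def relative_increase(baseline: float, enhanced: float) -> float:
    """Return the increase of `enhanced` over `baseline` as a percentage of `baseline`.

    Assumption: this is the formula behind the abstract's "% difference" figures.
    """
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (enhanced - baseline) / baseline * 100.0


# Figures quoted in the abstract (Control vs. Enhanced Stimulus Set).
me_gain = relative_increase(4.57, 6.16)        # Magnitude Estimation averages
meteor_gain = relative_increase(0.446, 0.546)  # METEOR scores

print(f"ME: {me_gain:.1f}%  METEOR: {meteor_gain:.1f}%")
```

Under this assumption the Magnitude Estimation gain comes to about 34.8%, matching the abstract, and the METEOR gain comes to about 22.4%, which the abstract reports as 22%.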

14. Automatic Computation of . . .
In: http://www.informatik.uni-freiburg.de/~ksimon/papers/CIKM-06-Proximity.pdf (2006)
BASE

15. Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings
In: DTIC (2005)
BASE

16. Symposium on Speech Communication Metrics and Human Performance.
In: DTIC AND NTIS (1995)
BASE

17. Structural analysis of hypertexts: Identifying hierarchies and useful metrics
In: http://www.cs.technion.ac.il/~ehudr/publications/pdf/BotafogoRS92a.pdf (1992)
BASE

18. Natural Language Processing Systems Evaluation Workshop Held in Berkeley, California on 18 June 1991
In: DTIC AND NTIS (1991)
BASE

19. Metrics for MT evaluation: Evaluating reordering
In: http://homepages.inf.ed.ac.uk/miles/papers/mt09.pdf
BASE