Hits 1 – 9 of 9

1	WMT18 Quality Estimation Shared Task Test Data
	Specia, Lucia; Logacheva, Varvara; Blain, Frederic; Fernandez, Ramon; Martins, André. - : University of Sheffield, 2018
	Abstract: Test data for the WMT18 QE task. Train data can be downloaded from http://hdl.handle.net/11372/LRT-2619. This shared task will build on its previous six editions to further examine automatic methods for estimating the quality of machine translation output at run-time, without relying on reference translations. We include word-level, phrase-level and sentence-level estimation. All tasks make use of datasets produced from post-editions by professional translators. The datasets are domain-specific (IT and life sciences/pharma domains) and extend those used in previous years with more instances and more languages. One important addition is that this year we also include datasets with neural MT outputs. In addition to advancing the state of the art at all prediction levels, our specific goals are: To study the performance of quality estimation approaches on the output of neural MT systems. We will do so by providing datasets for two language pairs where the same source segments are translated by both a statistical phrase-based and a neural MT system. To study the predictability of deleted words, i.e. words that are missing in the MT output. To do so, for the first time we provide data annotated for such errors at training time. To study the effectiveness of explicitly assigned labels for phrases. We will do so by providing a dataset where each phrase in the output of a phrase-based statistical MT system was annotated by human translators. To study the effect of different language pairs. We will do so by providing datasets created in similar ways for four language pairs. To investigate the utility of detailed information logged during post-editing. We will do so by providing post-editing time, keystrokes, and actual edits. To measure progress over the years at all prediction levels. We will do so by using last year's test set for comparative experiments. In-house statistical and neural MT systems were built to produce translations for all tasks. MT system-dependent information can be made available upon request. The data is publicly available, but since it has been provided by our industry partners it is subject to specific terms and conditions. However, these have no practical implications for the use of this data for research purposes. Participants are allowed to explore any additional data and resources deemed relevant.
	Keyword: machine learning; machine translation; quality estimation
	URL: http://hdl.handle.net/11372/LRT-2805
	BASE

2	WMT18 Quality Estimation Shared Task Training and Development Data
	Specia, Lucia; Logacheva, Varvara; Blain, Frederic. - : University of Sheffield, 2018
	BASE

3	Text Simplification From Professionally Produced Corpora ...
	Scarton, Carolina; Paetzold, Gustavo Henrique; Specia, Lucia. - : Zenodo, 2018
	BASE

4	SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain ...
	Scarton, Carolina; Paetzold, Gustavo Henrique; Specia, Lucia. - : Zenodo, 2018
	BASE

5	Text Simplification From Professionally Produced Corpora ...
	Scarton, Carolina; Paetzold, Gustavo Henrique; Specia, Lucia. - : Zenodo, 2018
	BASE

6	SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain ...
	Scarton, Carolina; Paetzold, Gustavo Henrique; Specia, Lucia. - : Zenodo, 2018
	BASE

7	Findings of the WMT 2018 shared task on quality estimation
	Specia, Lucia; Logacheva, Varvara; Blain, Frederic. - : Association for Computational Linguistics, 2018
	BASE

8	deepQuest: a framework for neural-based quality estimation
	Ive, Julia; Blain, Frederic; Specia, Lucia. - : Association for Computational Linguistics, 2018
	BASE

9	Sheffield submissions for the WMT18 quality estimation shared task
	Ive, Julia; Scarton, Carolina; Specia, Lucia...
	In: 807 ; 813 (2018)
	BASE
