1 |
Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation
|
|
|
|
BASE
|
|
Show details
|
|
3 |
The Universitat d'Alacant submissions to the English-to-Kazakh news translation task at WMT 2019 ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
The Universitat d'Alacant submissions to the English-to-Kazakh news translation task at WMT 2019 ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Fine-grained human evaluation of neural versus phrase-based machine translation ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
A Multifaceted Evaluation of Neural versus Phrase-Based Machine Translation for 9 Language Directions ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Assisting non-expert speakers of under-resourced languages in assigning stems and inflectional paradigms to new word entries of morphological dictionaries
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Fine-Grained Human Evaluation of Neural Versus Phrase-Based Machine Translation
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics , Vol 108, Iss 1, Pp 121-132 (2017) (2017)
|
|
BASE
|
|
Show details
|
|
10 |
Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation
|
|
|
|
BASE
|
|
Show details
|
|
11 |
RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation
|
|
|
|
BASE
|
|
Show details
|
|
12 |
RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation
|
|
|
|
In: Prague Bulletin of Mathematical Linguistics , Vol 106, Iss 1, Pp 193-204 (2016) (2016)
|
|
BASE
|
|
Show details
|
|
13 |
A generalised alignment template formalism and its application to the inference of shallow-transfer machine translation rules from scarce bilingual corpora
|
|
|
|
Abstract:
Statistical and rule-based methods are complementary approaches to machine translation (MT) that have different strengths and weaknesses. This complementarity has, over the last few years, resulted in the consolidation of a growing interest in hybrid systems that combine both data-driven and linguistic approaches. In this paper, we address the situation in which the amount of bilingual resources that is available for a particular language pair is not sufficiently large to train a competitive statistical MT system, but the cost and slow development cycles of rule-based MT systems cannot be afforded either. In this context, we formalise a new method that uses scarce parallel corpora to automatically infer a set of shallow-transfer rules to be integrated into a rule-based MT system, thus avoiding the need for human experts to handcraft these rules. Our work is based on the alignment template approach to phrase-based statistical MT, but the definition of the alignment template is extended to encompass different generalisation levels. It is also greatly inspired by the work of Sánchez-Martínez and Forcada (2009) in which alignment templates were also considered for shallow-transfer rule inference. However, our approach overcomes many relevant limitations of that work, principally those related to the inability to find the correct generalisation level for the alignment templates, and to select the subset of alignment templates that ensures an adequate segmentation of the input sentences by the rules eventually obtained. Unlike previous approaches in literature, our formalism does not require linguistic knowledge about the languages involved in the translation. Moreover, it is the first time that conflicts between rules are resolved by choosing the most appropriate ones according to a global minimisation function rather than proceeding in a pairwise greedy fashion. Experiments conducted using five different language pairs with the free/open-source rule-based MT platform Apertium show that translation quality significantly improves when compared to the method proposed by Sánchez-Martínez and Forcada (2009), and is close to that obtained using handcrafted rules. For some language pairs, our approach is even able to outperform them. Moreover, the resulting number of rules is considerably smaller, which eases human revision and maintenance. ; Research funded by Universitat d’Alacant through project GRE11-20, by the Spanish Ministry of Economy and Competitiveness through projects TIN2009-14009-C02-01 and TIN2012-32615, by Generalitat Valenciana through grant ACIF/2010/174, and by the European Union Seventh Framework Programme FP7/2007-2013 under grant agreement PIAP-GA-2012-324414 (Abu-MaTran).
|
|
Keyword:
Hybrid machine translation; Lenguajes y Sistemas Informáticos; Machine translation; Transfer rule inference
|
|
URL: https://doi.org/10.1016/j.csl.2014.10.003 http://hdl.handle.net/10045/52497
|
|
BASE
|
|
Hide details
|
|
14 |
An open-source toolkit for integrating shallow-transfer rules into phrase-based statistical machine translation
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Choosing the correct paradigm for unknown words in rule-based machine translation systems
|
|
|
|
BASE
|
|
Show details
|
|
16 |
The Universitat d’Alacant hybrid machine translation system for WMT 2011
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Integrating shallow-transfer rules into phrase-based statistical machine translation
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Enriching a statistical machine translation system trained on small parallel corpora with rule-based bilingual phrases
|
|
|
|
BASE
|
|
Show details
|
|
19 |
A Widely Used Machine Translation Service and its Migration to a Free/Open-Source Solution : the Case of Softcatalà
|
|
|
|
BASE
|
|
Show details
|
|
20 |
ScaleMT: a free/open-source framework for building scalable machine translation web services
|
|
|
|
BASE
|
|
Show details
|
|
|
|