DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Incorporating source-language paraphrases into phrase-based SMT with confusion networks
In: Jiang, Jie , Du, Jinhua orcid:0000-0002-3267-4881 and Way, Andy orcid:0000-0001-5736-5930 (2011) Incorporating source-language paraphrases into phrase-based SMT with confusion networks. In: SSST-2011: The Fifth Workshop on Syntax and Structure in Statistical Translation , 23 June 2011, Portland, Oregon, USA. (2011)
Abstract: To increase the model coverage, sourcelanguage paraphrases have been utilized to boost SMT system performance. Previous work showed that word lattices constructed from paraphrases are able to reduce out-ofvocabulary words and to express inputs in different ways for better translation quality. However, such a word-lattice-based method suffers from two problems: 1) path duplications in word lattices decrease the capacities for potential paraphrases; 2) lattice decoding in SMT dramatically increases the search space and results in poor time efficiency. Therefore, in this paper, we adopt word confusion networks as the input structure to carry source-language paraphrase information. Similar to previous work, we use word lattices to build word confusion networks for merging of duplicated paths and faster decoding. Experiments are carried out on small-, medium- and large-scale English– Chinese translation tasks, and we show that compared with the word-lattice-based method, the decoding time on three tasks is reduced significantly (up to 79%) while comparable translation quality is obtained on the largescale task.
Keyword: chinese translation; Machine translating; SMT; statistical machine translation
URL: http://doras.dcu.ie/16434/
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern