1 |
The Impact of language models and loss functions on repair disfluency detection
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Detecting speech repairs incrementally using a noisy channel approach
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Choosing the right translation : a syntactically informed classification approach
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Morphosyntactic target language matching in statistical machine translation
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Syntax-based word reordering in phrase-based statistical machine translation : why does it work?
|
|
|
|
BASE
|
|
Show details
|
|
7 |
This phrase-based SMT system is out of order : generalised word reordering in machine translation
|
|
|
|
Abstract:
Many natural language processes have some degree of preprocessing of data: tokenisation, stemming and so on. In the domain of Statistical Machine Translation it has been shown that word reordering as a preprocessing step can help the translation process. Recently, hand-written rules for reordering in German–English translation have shown good results, but this is clearly a labour-intensive and language pair-specific approach. Two possible sources of the observed improvement are that (1) the reordering explicitly matches the syntax of the source language more closely to that of the target language, or that (2) it fits the data better to the mechanisms of phrasal SMT; but it is not clear which. In this paper, we apply a general principle based on dependency distance minimisation to produce reorderings. Our languageindependent approach achieves half of the improvement of a reimplementation of the handcrafted approach, and suggests that reason (2) is a possible explanation for why that reordering approach works. ; 8 page(s)
|
|
Keyword:
080100 Artificial Intelligence and Image Processing
|
|
URL: http://hdl.handle.net/1959.14/101350
|
|
BASE
|
|
Hide details
|
|
|
|