Information and Media Technologies
Online ISSN : 1881-0896
ISSN-L : 1881-0896
Media (processing) and Interaction
Multiple Translation-Engine-based Hypotheses and Edit-Distance-based Rescoring for a Greedy Decoder for Statistical Machine Translation
Michael Paul, Eiichiro Sumita, Seiichi Yamamoto

2006 Volume 1 Issue 1 Pages 446-460

Abstract
This paper extends a greedy decoder for statistical machine translation (SMT), which searches for an optimal translation by using SMT models starting from a decoder seed, i.e., the source language input paired with an initial translation hypothesis. First, the outputs generated by multiple translation engines are utilized as the initial translation hypotheses, whereby their variations reduce local optima problems inherent in the search. Second, a rescoring method based on the edit-distance between the initial translation hypothesis and the outputs of the decoder is used to compensate for problems of conventional greedy decoding solely based on statistical models. Our approach is evaluated for the translation of dialogues in the travel domain, and the results show that it drastically improves translation quality.
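The abstract does not state the rescoring formula, so the following is only an illustrative sketch of the general idea: each decoder output is penalized by its word-level edit distance from the seed hypothesis it started from, and the best-scoring output across all seeds is kept. The function names, the length normalization, and the linear `weight` are assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def word_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance over tokens (insert, delete, substitute each cost 1)."""
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(prev[j] + 1,          # delete tok_a
                            curr[j - 1] + 1,      # insert tok_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def rescore(seed: str, decoded: str, model_score: float, weight: float = 1.0) -> float:
    """Hypothetical rescoring: model score minus a weighted, length-normalized
    edit distance between the seed hypothesis and the decoder output."""
    seed_tokens = seed.split()
    dist = word_edit_distance(seed_tokens, decoded.split())
    return model_score - weight * dist / max(len(seed_tokens), 1)


def select_best(candidates: List[Tuple[str, str, float]], weight: float = 1.0) -> str:
    """candidates holds (seed, decoder_output, model_score) triples, one per
    translation engine; returns the decoder output with the highest rescored value."""
    best = max(candidates, key=lambda c: rescore(c[0], c[1], c[2], weight))
    return best[1]


# Made-up example: two seeds from two hypothetical engines, with made-up
# log-probability-like model scores.
candidates = [
    ("i want a room", "i want a room please", -10.0),
    ("i would like room", "room the like", -9.8),
]
best_output = select_best(candidates, weight=1.0)
```

With `weight=0.0` the selection falls back to the model score alone, which corresponds to the conventional setting that the abstract describes as relying solely on statistical models.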
© 2006 by Information Processing Society of Japan