ISCA Archive IWSLT 2009

A unified framework for phrase-based, hierarchical, and syntax-based statistical machine translation

Hieu Hoang, Philipp Koehn, Adam Lopez

Despite many differences between phrase-based, hierarchical, and syntax-based translation models, their training and testing pipelines are strikingly similar. Drawing on this fact, we extend the Moses toolkit to implement hierarchical and syntactic models, making it the first open-source toolkit with end-to-end support for all three of these popular models in a single package. This extension substantially lowers the barrier to entry for machine translation research across multiple models.