ISCA Archive IWSLT 2009
ISCA Archive IWSLT 2009

Enriching SCFG rules directly from efficient bilingual chart parsing

Martin Čmejrek, Bowen Zhou, Bing Xiang

In this paper, we propose a new method for training translation rules for a Synchronous Context-free Grammar. A bilingual chart parser is used to generate the parse forest, and EM algorithm to estimate expected counts for each rule of the ruleset. Additional rules are constructed as combinations of reliable rules occurring in the parse forest. The new method of proposing additional translation rules is independent of word alignments. We present the theoretical background for this method, and initial experimental results on German-English translations of Europarl data.


Cite as: Čmejrek, M., Zhou, B., Xiang, B. (2009) Enriching SCFG rules directly from efficient bilingual chart parsing. Proc. International Workshop on Spoken Language Translation (IWSLT 2009), 136-143

@inproceedings{cmejrek09_iwslt,
  author={Martin Čmejrek and Bowen Zhou and Bing Xiang},
  title={{Enriching SCFG rules directly from efficient bilingual chart parsing}},
  year=2009,
  booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2009)},
  pages={136--143}
}