ISCA Archive IWSLT 2009

Barcelona media SMT system description for the IWSLT 2009: introducing source context information

Marta R. Costa-jussà, Rafael E. Banchs

This paper describes the Barcelona Media SMT system used in the IWSLT 2009 evaluation campaign. The Barcelona Media system is a statistical phrase-based system enriched with source context information. Adding source context to an SMT system is intended to improve translation by reducing lexical and structural choice errors. The novel technique uses a similarity metric between each test sentence and each training sentence. First experimental results of this technique are reported on the Arabic and Chinese Basic Traveling Expression Corpus (BTEC) tasks. Although the system works in a single domain, SMT translation units remain ambiguous, and slight improvements in BLEU are shown in both tasks (Zh2En and Ar2En).
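
The abstract does not specify which similarity metric is used, so the following is only a minimal sketch of the general idea of scoring each training sentence against a test sentence. It assumes cosine similarity over whitespace-tokenized bag-of-words vectors; the function names (`bag_of_words`, `cosine_similarity`, `rank_training_sentences`) are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def bag_of_words(sentence):
    """Lowercase, whitespace-tokenize, and count tokens (an assumed representation)."""
    return Counter(sentence.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two token-count vectors; 0.0 if either is empty."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_training_sentences(test_sentence, training_sentences):
    """Return (score, sentence) pairs sorted from most to least similar."""
    test_vec = bag_of_words(test_sentence)
    scored = [
        (cosine_similarity(test_vec, bag_of_words(s)), s)
        for s in training_sentences
    ]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Toy source-side training sentences, invented for illustration only.
    training = [
        "where is the station",
        "i would like a coffee",
        "the station is closed",
    ]
    for score, sentence in rank_training_sentences(
        "where is the train station", training
    ):
        print(f"{score:.3f}  {sentence}")
```

In a context-aware phrase-based system, scores of this kind could in principle be used to favor translation units extracted from training sentences that resemble the input; how the scores are integrated into the actual system is described in the paper, not in this sketch.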


Cite as: Costa-jussà, M.R., Banchs, R.E. (2009) Barcelona media SMT system description for the IWSLT 2009: introducing source context information. Proc. International Workshop on Spoken Language Translation (IWSLT 2009), 24-28

@inproceedings{costajussa09_iwslt,
  author={Marta R. Costa-jussà and Rafael E. Banchs},
  title={{Barcelona media SMT system description for the IWSLT 2009: introducing source context information}},
  year=2009,
  booktitle={Proc. International Workshop on Spoken Language Translation (IWSLT 2009)},
  pages={24--28}
}