ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Improving n-gram modeling using distance-related unit association maximum entropy language modeling

Shuwu Zhang, Harald Singer, Dekai Wu, Yoshinori Sagisaka

In this paper, a distance-related unit association maximum entropy (DUAME) language modeling is proposed. This approach can model an event (unit subsequence) using the co-occurrence of full distance unit association (UA) features so that it is able to pursue a functional approximation to higher order N-gram with significantly less memory requirement. A smoothing strategy related to this modeling will also be discussed. Preliminary experimental results have shown that DUAME modeling is comparable to conventional N-gram modeling in perplexity with significantly small number of parameters.