ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

An alignment matching method to explore pseudosyllable properties across different corpora

Raymond W. M. Ng, Thomas Hain, Keikichi Hirose

A pseudosyllable unit was derived for English read speech recognition. It is a question whether the pseudosyllable unit can be extracted in a robust manner and how this unit could help in the speech recognition process by providing some indications to the error pattern. In this study, an evaluation method which maps every hypothesis phoneme to every reference is proposed. Analysis is done to the pseudosyllables extracted from two different sets of speech data. Mutual information is used to look at the relationship between different pseudosyllable aspects and error pattern of the hypothesis phoneme. It was shown that the pseudosyllable extraction algorithm is robust and gives units with consistent nature. Pseudosyllables which have a phone triplet structure tends to have lower insertion. Pseudosyllables which overlap with their neighbours are places where more insertion errors may occur.

Index Terms: pseudosyllable, error analysis, mutual information, speech recognition


doi: 10.21437/Interspeech.2012-262

Cite as: Ng, R.W.M., Hain, T., Hirose, K. (2012) An alignment matching method to explore pseudosyllable properties across different corpora. Proc. Interspeech 2012, 863-866, doi: 10.21437/Interspeech.2012-262

@inproceedings{ng12_interspeech,
  author={Raymond W. M. Ng and Thomas Hain and Keikichi Hirose},
  title={{An alignment matching method to explore pseudosyllable properties across different corpora}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={863--866},
  doi={10.21437/Interspeech.2012-262},
  issn={2958-1796}
}