ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Word-based probabilistic phonetic retrieval for low-resource spoken term detection

Di Xu, Florian Metze

Two problems make Spoken Term Detection (STD) particularly challenging under low-resource conditions: the low quality of speech recognition hypotheses, and a high number of out-of-vocabulary (OOV) words. In this paper, we propose an intuitive way to handle OOV terms for STD on word-based Confusion Networks using phonetic similarities, and generalize it into a probabilistic and vocabulary-independent retrieval framework. We then reflect on how several heuristics and Machine Learning based methods can be incorporated into this framework to improve retrieval performance. We present experimental results on several low-resource languages from IARPA's Babel program, such as Assamese, Bengali, Haitian, and Lao.


doi: 10.21437/Interspeech.2014-530

Cite as: Xu, D., Metze, F. (2014) Word-based probabilistic phonetic retrieval for low-resource spoken term detection. Proc. Interspeech 2014, 2774-2778, doi: 10.21437/Interspeech.2014-530

@inproceedings{xu14e_interspeech,
  author={Di Xu and Florian Metze},
  title={{Word-based probabilistic phonetic retrieval for low-resource spoken term detection}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={2774--2778},
  doi={10.21437/Interspeech.2014-530},
  issn={2308-457X}
}