ISCA Archive Interspeech 2014

Simple gesture-based error correction interface for smartphone speech recognition

Yuan Liang, Koji Iwano, Koichi Shinoda

Conventional error correction interfaces for speech recognition require a user to first mark an error region and then choose the correct word from a candidate list. Given the effort this demands from the user and the limited user interface available on a smartphone, this operation should be simpler. In this paper, we propose an interface in which the user marks the error region once, and the word is then replaced by another candidate. Assuming that the words preceding and succeeding the error region have been validated by the user, we search Web n-grams for long word sequences that match this context. The acoustic features of the error region are also used to rerank the candidate words. Experimental results demonstrate the effectiveness of our method: 30.2% of the error words were corrected by a single operation.
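The candidate search and reranking described above might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the toy n-gram table, the back-off rule that trims the context when no long match exists, the `acoustic_scores` log-likelihoods, and the log-linear combination with `weight` are all hypothetical stand-ins for details the abstract does not give.

```python
import math

# Toy counts standing in for Web n-grams (hypothetical data).
NGRAM_COUNTS = {
    ("i", "want", "to", "go", "home"): 120,
    ("i", "want", "to", "get", "home"): 200,
    ("want", "to", "go", "home"): 300,
    ("want", "to", "get", "home"): 80,
}


def find_candidates(left, right, ngram_counts, exclude):
    """Return {word: count} for words that fill the gap between the
    validated left and right contexts, preferring the longest match."""
    l, r = tuple(left), tuple(right)
    while l or r:
        found = {}
        for ngram, count in ngram_counts.items():
            if len(ngram) != len(l) + 1 + len(r):
                continue
            if ngram[:len(l)] == l and ngram[len(l) + 1:] == r:
                word = ngram[len(l)]
                if word != exclude:  # skip the word marked as an error
                    found[word] = found.get(word, 0) + count
        if found:
            return found
        # Assumed back-off: drop the outermost word of the longer side.
        if len(l) >= len(r):
            l = l[1:]
        else:
            r = r[:-1]
    return {}


def rerank(candidates, acoustic_scores, weight=1.0, floor=-10.0):
    """Order candidates by log n-gram count plus weighted acoustic score.
    Words without an acoustic score receive the assumed `floor` value."""
    def score(word):
        return math.log(candidates[word]) + weight * acoustic_scores.get(word, floor)
    return sorted(candidates, key=score, reverse=True)


# Recognized text "i want to no home"; the user marks "no" as the error.
candidates = find_candidates(("i", "want", "to"), ("home",), NGRAM_COUNTS, exclude="no")
# Hypothetical acoustic log-likelihoods of each word given the error region.
acoustic_scores = {"go": -2.0, "get": -6.0}
print(rerank(candidates, acoustic_scores))
```

In this toy setup, the n-gram counts alone would favor "get", while the assumed acoustic scores move "go" to the top, which illustrates why combining the two sources of evidence can change the replacement offered to the user.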


doi: 10.21437/Interspeech.2014-302

Cite as: Liang, Y., Iwano, K., Shinoda, K. (2014) Simple gesture-based error correction interface for smartphone speech recognition. Proc. Interspeech 2014, 1194-1198, doi: 10.21437/Interspeech.2014-302

@inproceedings{liang14_interspeech,
  author={Yuan Liang and Koji Iwano and Koichi Shinoda},
  title={{Simple gesture-based error correction interface for smartphone speech recognition}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={1194--1198},
  doi={10.21437/Interspeech.2014-302},
  issn={2308-457X}
}