ISCA Archive Interspeech 2011

TSAB - web interface for transcribed speech collections

Tanel Alumäe, Ahti Kitsik

This paper describes a new web interface for accessing large collections of transcribed speech. The system uses automatic or manual time-aligned transcriptions, together with speaker and topic segmentation information, to present speech data in a structured form and to speed up access to relevant passages. The system is independent of the underlying speech processing technology. The software is free and open-source.