ISCA Archive Eurospeech 1999
ISCA Archive Eurospeech 1999

Recognition, indexing and retrieval of british broadcast news with the THISL system

Tony Robinson, Dave Abberley, David Kirby, Steve Renals

This paper describes the THISL spoken document retrieval system for British and North American Broadcast News. The system is based on the Abbot large vocabulary speech recognizer and a probabilistic text retrieval system. We discuss the development of a realtime British English Broadcast News system, and its integration into a spoken document retrieval system. Detailed evaluation is performed using a similar North American Broadcast News system, to take advantage of the TREC SDR evaluation methodology. We report results on this evaluation, with particular reference to the effect of query expansion and of automatic segmentation algorithms.