ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

Discriminative named entity recognition of speech data using speech recognition confidence

Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki

This paper presents a method for the named entity recognition (NER) of speech data that uses automatic speech recognition (ASR) confidence as a feature that indicates whether each word is correctly recognized. An NER model is trained using ASR results with named entity (NE) labels to include an ASR confidence feature as well as corresponding transcriptions with NE labels. Experiments using support vector machines (SVMs) and speech data from Japanese newspaper articles show that the proposed method achieves higher F-measure in NER than a simple application of text-based NER to ASR results.