ISCA Archive Interspeech 2010
ISCA Archive Interspeech 2010

A fast one-pass-training feature selection technique for GMM-based acoustic event detection with audio-visual data

Taras Butko, Climent Nadeu

Acoustic event detection becomes a difficult task, even for a small number of events, in scenarios where events are produced rather spontaneously and often overlap in time. In this work, we aim to improve the detection rate by means of feature selection. Using a one-against-all detection approach, a new fast one-pass-training algorithm, and an associated highly-precise metric are developed. Choosing a different subset of multimodal features for each acoustic event class, the results obtained from audiovisual data collected in the UPC multimodal room show an improvement in average detection rate with respect to using the whole set of features.