ISCA Archive Interspeech 2015
ISCA Archive Interspeech 2015

Frequency map selection using a RBFN-based classifier in the MVDR beamformer for speaker localization in reverberant rooms

Daniele Salvati, Carlo Drioli, Gian Luca Foresti

We present the weighted minimum variance distortionless response (WMVDR), which is a steered response power (SRP) algorithm, for near-field speaker localization in a reverberant environment. The proposed WMVDR is based on a machine learning approach for computing the incoherent frequency fusion of narrowband power maps. We adopt a radial basis function network (RBFN) classifier for the estimation of the weighting coefficients, and a marginal distribution of narrowband power map as feature for the supervised training operation. Simulations demonstrate the effectiveness of the proposed approach in different conditions.