ISCA Archive Interspeech 2006

Max-Gabor analysis and synthesis of spectrograms

Tony Ezzat, Jake Bouvrie, Tomaso Poggio

We present a method that analyzes a two-dimensional magnitude spectrogram S(f, t) into its local constituent spectro-temporal amplitudes A(f, t), frequencies F(f, t), orientations Θ(f, t), and phases φ(f, t). The method operates by performing a two-dimensional local Gabor-like analysis of the spectrogram, retaining only the parameters of the 2D-Gabor filter with maximal amplitude response within the local region. We demonstrate the technique over a wide variety of speakers, and show how the spectrograms in each case may be adequately reconstructed using the parameters of the Max-Gabor analysis. Finally, we discuss the nature of the extracted Max-Gabor parameters.
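
To make the "retain the maximal response" step concrete, the following is a minimal sketch of the idea for a single local patch, not the authors' implementation. The abstract does not specify the filter definition, patch size, or parameter grid, so the isotropic Gaussian envelope, the `sigma` value, the candidate `freqs` and `thetas`, and the function names `gabor_response` and `max_gabor` are all assumptions made for illustration.

```python
import cmath
import math


def gabor_response(patch, freq, theta, sigma):
    """Project a square patch onto one complex 2D Gabor filter.

    Assumed filter form (not taken from the paper): an isotropic Gaussian
    envelope times a complex sinusoid of spatial frequency `freq`
    (cycles per sample) along direction `theta`.
    """
    n = len(patch)
    c = (n - 1) / 2.0  # filter is centered on the patch
    total = 0j
    env_sum = 0.0
    for i, row in enumerate(patch):        # i: frequency-bin index
        for j, value in enumerate(row):    # j: time-frame index
            y, x = i - c, j - c
            u = x * math.cos(theta) + y * math.sin(theta)
            env = math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
            total += value * env * cmath.exp(-2j * math.pi * freq * u)
            env_sum += env
    # Scale so that a unit-amplitude cosine gives a magnitude close to 1.
    return 2.0 * total / env_sum


def max_gabor(patch, freqs, thetas, sigma=3.0):
    """Return (amplitude, frequency, orientation, phase) of the filter
    with the largest amplitude response over the candidate grid."""
    best = None
    for f in freqs:
        for th in thetas:
            r = gabor_response(patch, f, th, sigma)
            if best is None or abs(r) > best[0]:
                best = (abs(r), f, th, cmath.phase(r))
    return best
```

Applied at every (f, t) location of a spectrogram, a routine like this would yield four parameter maps corresponding to A, F, Θ, and φ. In this sketch the candidate frequencies should exclude zero and the orientations should span only [0, π): a magnitude spectrogram is non-negative, so a zero-frequency filter would otherwise dominate, and for a real-valued patch the orientations θ and θ + π give responses of equal amplitude.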