ISCA Archive Interspeech 2023
ISCA Archive Interspeech 2023

FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization

Yabo Wang, Bing Yang, Xiaofei Li

Extracting direct-path spatial features is critical for sound source localization in adverse acoustic environments. This paper proposes a full-band and narrow-band fusion network for estimating direct-path inter-channel phase difference (DP-IPD) from microphone signals. The alternating full-band and narrow-band layers are responsible for learning the full-band correlation and narrow-band extraction of DP-IPD, respectively. Experiments show that the proposed network noticeably outperforms other advanced methods on both simulated and real-world data.