doi: 10.21437/AVSEC.2025
The Role of AI in Modern Hearing Aids
Peter Derleth
A Framework to Design Tailored Models based on the Optimum-Path Forest: Opportunities for Audio-Visual Speech Processing
João Paulo Papa
Restoring Degraded Multi-Speaker Speech Through Separation and Enhancement
Akam Rahimi
Tackling Reverberation and Binaural Data on Audio-Visual Speech Enhancement through RecognAVSE
João Renato Manesco, Leandro Passos, Rahma Fourati, João Paulo Papa, Amir Hussain
Efficient and Sustainable Audio-Visual Speech Enhancement through Latency-Aware Pruned Model
Rahma Fourati, Jihene Tmamna, João R. Manesco, Leandro A. Passos, João P. Papa, Amir Hussain
Temporal-Aware Graph Neural Network with Conformer Model for Audiovisual Speech Enhancement
Nasir Saleem, Kia Dashtipour, Arif Reza Anwary, Adeel Hussain, Khubaib Ahmed, Amir Hussain
MSF-AVSE: Multi-Stream Fusion Network for Binaural Audiovisual Speech Enhancement
Nasir Saleem, Kia Dashtipour, Arif Reza Anwary, Adeel Hussain, Khubaib Ahmed, Aysha Munawwara, Amir Hussain
Visual Speech Enhancement With Calibrated Features and Dual-Path Transformer
Nasir Saleem, Kia Dashtipour, Aysha Munawwara, Arif Reza Anwary, Mandar Gogate, Amir Hussain
Multimodal Speech Sensing for Next Generation Hearing Aids
Michaela Reay, Kia Dashtipour, Mandar Gogate, Nasir Saleem, Amir Hussain, Qammar Abbasi
ConformerAVSE: A Transformer-based Audio-Visual Speech Enhancement Model for Hearing Aids
Dongkun Xu, Xianpo Ni, Usman Anwar, Kia Dashtipour, Mandar Gogate, Nasir Saleem, Amir Hussain, Tughrul Arslan
AV-LocoFiLM: Audio-Visual Speech Enhancement Using FiLM-Based Fusion and Hybrid Local–Global Transformers
Shafique Ahmed, Jen-Cheng Hou, Yu Tsao
FPGA-Based LSTM Acceleration for Real-Time Speech Enhancement in Next Generation Hearing Aids
Xianpo Ni, Usman Anwar, Dongkun Xu, Tughrul Arslan, Amir Hussain
AV-TFLocoformer: A Locally Convolutional Transformer for Robust Audio-Visual Speech Enhancement
Aquib Raza, Pusuluri Sri Sai Aditya, Shafique Ahmed, Yu Tsao
Adaptive Gaze and Spatial Speaker Tracking: Enhancing Hearing Aid Performance in Dynamic Environments
Arif Reza Anwary, Nasir Saleem, Kia Dashtipour, Mandar Gogate, Amir Hussain
AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement
M. Sajid, Deepanshu Gupta, Yash Modi, Sanskriti Jain, Harshith Jai Surya Ganji, A. Rahaman, Harshvardhan Choudhary, Nasir Saleem, Amir Hussain, M. Tanveer
Efficient Audio-Visual Speech Enhancement via Neural Architecture Search
Khubaib Ahmed, Ahsan Adeel, Nasir Saleem, Kia Dashtipour, Amir Hussain, Ahsan Ulhaq
How LSTM is Integrated in the Functional Link Adaptive Filter
Alireza Nezamdoust, Danilo Comminiello, Amir Hussain, Kia Dashtipour, Mandar Gogate
Edge-Optimized Cognition and Context-Aware Speech Enhancement for Multimodal Hearing Aids
Usman Anwar, Xianpo Ni, Dongkun Xu, Tughrul Arslan, Kia Dashtipour, Mandar Gogate, Amir Hussain
A Mamba-Based Audio-Visual Speech Enhancement Model for the 4th COG-MHEAR AVSE Challenge
Chih Ning Chen, Jen-Cheng Hou, Jun-Cheng Chen, Yu Tsao, Shao-Yi Chien
Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
Rong Chao, Wenze Ren, You-Jin Li, Kuo-Hsuan Hung, Sung-Feng Huang, Sze-Wei Fu, Wen-Huang Cheng, Yu Tsao
BAV-MossFormer2: Enhanced MossFormer2 for Binaural Audio-Visual Speech Enhancement
Wenze Ren, Kai Li, Rong Chao, Junjie Li, Zilong Huang, Shafique Ahmed, You-Jin Li, Kuo-Hsuan Hung, Syu-Siang Wang, Hsin-Min Wang, Yu Tsao