ISCA Archive AVSEC 2025 Search Website
  ISCA Archive Search Website
×

Click on column names to sort.

Searching uses the 'and' of terms e.g. Smith Interspeech matches all papers by Smith in any Interspeech. The order of terms is not significant.

Use double quotes for exact phrasal matches e.g. "acoustic features".

Case is ignored.

Diacritics are optional e.g. lefevre also matches lefèvre (but not vice versa).

It can be useful to turn off spell-checking for the search box in your browser preferences.

If you prefer to scroll rather than page, increase the number in the show entries dropdown.

top

4th Cogmhear Audio-Visual Speech Enhancement Challenge

Rotterdam, Netherlands
16 August 2025

Chairs: Amir Hussain
doi: 10.21437/AVSEC.2025

4th COG-MHEAR International Audio-Visual Speech Enhancement Challenge (AVSEC-4)


The Role of AI in Modern Hearing Aids
Peter Derleth

A Framework to Design Tailored Models based on the Optimum-Path Forest: Opportunities for Audio-Visual Speech Processing
João Paulo Papa

Restoring Degraded Multi-Speaker Speech Through Separation and Enhancement
Akam Rahimi

Tackling Reverberation and Binaural Data on Audio-Visual Speech Enhancement through RecognAVSE
João Renato Manesco, Leandro Passos, Rahma Fourati, João Paulo Papa, Amir Hussain

Efficient and Sustainable Audio-Visual Speech Enhancement through Latency-Aware Pruned Model
Rahma Fourati, Jihene Tmamna, João R. Manesco, Leandro A. Passos, João P. Papa, Amir Hussain

Temporal-Aware Graph Neural Network with Conformer Model for Audiovisual Speech Enhancement
Nasir Saleem, Kia Dashtipour, Arif Reza Anwary, Adeel Hussain, Khubaib Ahmed, Amir Hussain

MSF-AVSE: Multi-Stream Fusion Network for Binaural Audiovisual Speech Enhancement
Nasir Saleem, Kia Dashtipour, Arif Reza Anwary, Adeel Hussain, Khubaib Ahmed, Aysha Munawwara, Amir Hussain

Visual Speech Enhancement With Calibrated Features and Dual-Path Transformer
Nasir Saleem, Kia Dashtipour, Aysha Munawwara, Arif Reza Anwary, Mandar Gogate, Amir Hussain

Multimodal Speech Sensing for Next Generation Hearing Aids
Michaela Reay, Kia Dashtipour, Mandar Gogate, Nasir Saleem, Amir Hussain, Qammar Abbasi

ConformerAVSE: A Transformer-based Audio-Visual Speech Enhancement Model for Hearing Aids
Dongkun Xu, Xianpo Ni, Usman Anwar, Kia Dashtipour, Mandar Gogate, Nasir Saleem, Amir Hussain, Tughrul Arslan

AV-LocoFiLM: Audio-Visual Speech Enhancement Using FiLM-Based Fusion and Hybrid Local–Global Transformers
Shafique Ahmed, Jen-Cheng Hou, Yu Tsao

FPGA-Based LSTM Acceleration for Real-Time Speech Enhancement in Next Generation Hearing Aids
Xianpo Ni, Usman Anwar, Dongkun Xu, Tughrul Arslan, Amir Hussain

AV-TFLocoformer: A Locally Convolutional Transformer for Robust Audio-Visual Speech Enhancement
Aquib Raza, Pusuluri Sri Sai Aditya, Shafique Ahmed, Yu Tsao

Adaptive Gaze and Spatial Speaker Tracking: Enhancing Hearing Aid Performance in Dynamic Environments
Arif Reza Anwary, Nasir Saleem, Kia Dashtipour, Mandar Gogate, Amir Hussain

AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement
M. Sajid, Deepanshu Gupta, Yash Modi, Sanskriti Jain, Harshith Jai Surya Ganji, A. Rahaman, Harshvardhan Choudhary, Nasir Saleem, Amir Hussain, M Tanveer

Efficient Audio-Visual Speech Enhancement via Neural Architecture Search
Khubaib Ahmed, Ahsan Adeel, Nasir Saleem, Kia Dashtipour, Amir Hussain, Ahsan Ulhaq

How LSTM is Integrated in the Functional Link Adaptive Filter
Alireza Nezamdoust, Danilo Comminiello, Amir Hussain, Kia Dashtipour, Mandar Gogate

Edge-Optimized Cognition and Context-Aware Speech Enhancement for Multimodal Hearing Aids
Usman Anwar, Xianpo Ni, Dongkun Xu, Tughrul Arslan, Kia Dashtipour, Mandar Gogate, Amir Hussain

A Mamba-Based Audio-Visual Speech Enhancement Model for the 4th COG-MHEARAVSEChallenge
Chih Ning Chen, Jen-Cheng Hou, Jun-Cheng Chen, Yu Tsao, Shao-Yi Chien

Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
Rong Chao, Wenze Ren, You-Jin Li, Kuo-Hsuan Hung, Sung-Feng Huang, Sze-Wei Fu, Wen-Huang Cheng, Yu Tsao

BAV-MossFormer2: Enhanced MossFormer2 for Binaural Audio-Visual Speech Enhancement
Wenze Ren, Kai Li, Rong Chao, Junjie Li, Zilong Huang, Shafique Ahmed, You-Jin Li, Kuo-Hsuan Hung, Syu-Siang Wang, Hsin-Min Wang, Yu Tsao


Search papers
Article
×

4th COG-MHEAR International Audio-Visual Speech Enhancement Challenge (AVSEC-4)