ISCA Archive AVSEC 2024



3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement

Kos, Greece
1 September 2024

Chairs: Amir Hussain and Peter Bell
doi: 10.21437/AVSEC.2024


3rd COG-MHEAR workshop on Audio-Visual Speech Enhancement (AVSEC)


Multi-Model Dual-Transformer Network for Audio-Visual Speech Enhancement
Fazale Wahab, Nasir Saleem, Amir Hussain, Muhammad Rizwan, Md Bipul Hossen

AI as the Articulator: Leveraging ChatGPT 3.5 for Audio-Visual Speech Enhancement
Shahab S Sohail, Mandar Gogate, Tassadaq Hussain, Kia K. Dashtipour, Muhammed Riaz, Zain Hussain, Usman Anwar, Adele Goman, Tughrul Arslan, Amir Hussain

RecognAVSE: An Audio-Visual Speech Enhancement Approach using Separable 3D convolutions and Deep Complex U-Net
João Renato Ribeiro Manesco, Leandro A Passos, Rahma Fourati, João Papa, Amir Hussain

A Target Speaker Extraction Method for the 3rd Audio-Visual Speech Enhancement Challenge
Zhan Jin, Bang Zeng, Zhuo Li, Xin Liu, Ming Li

A Lightweight Real-time Audio-Visual Speech Enhancement Framework
Mandar Gogate, Kia K. Dashtipour, Amir Hussain

AVSE-Pruner: Filter Pruning of Audio-Visual Speech Enhancement System using Multi-objective Binary Particle Swarm Optimization
Rahma Fourati, Jihene Tmamna, Najwa Kouka, Mandar Gogate, Kia K. Dashtipour, Leandro A Passos, João Papa, Tughrul Arslan, Amir Hussain

Towards Cross-Lingual Audio-Visual Speech Enhancement
Kia K. Dashtipour, Mandar Gogate, Shafique Ahmed, Adeel Hussain, Tassadaq Hussain, Jen-Cheng Hou, Tughrul Arslan, Yu Tsao, Amir Hussain

LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement
Arnav Jain, Jasmer S. Sanjotra, Harshvardhan Choudhary, Krish Agrawal, Rupal Shah, Rohan Jha, Md Sajid, Amir Hussain, M Tanveer

Real-Time Audio Visual Speech Enhancement: Integrating Visual Cues for Improved Performance
Utkarsh Tiwari, Mandar Gogate, Kia K. Dashtipour, Eamon Sheikh, Rimjhim Singh, Tughrul Arslan, Amir Hussain

Asynchronicity between Visual and Auditory Information in Audiovisual Speech: Evidence from Four Types of Consonant-words /b/, /t/, /k/ and /g/
Biao Zeng, Keira Evans, Mia Carne, Lauren Game, Erik Persson

Privacy Considerations for Wearable Audio-Visual AI in Hearing Aids
Poppy Welch, Jennifer Williams

Next-Generation Speech Recognition Using Radar Sensing
Michaela B. Reay, Balal Saleemi, Qammer H. Abbasi, Hira Hameed, Kia K. Dashtipour, Amir Hussain, Muhammad Ali Imran

Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Shafique Ahmed, Chia-Wei Chen, WenZe Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou

Mobile phone-based speech enhancement using cognitive load and fuzzy reasoning for normal and hearing-impaired users
Song Chen, Usman Anwar, Jasper Kirton-Wingate, Faiyaz Doctor, Adeel Hussain, Ting Zhou, Arif Anwary, Kia K. Dashtipour, Mandar Gogate, Jen-Cheng Hou, Yu Tsao, Michael Akeroyd, Tughrul Arslan, Amir Hussain

A Framework for Speech Enhancement based on Audio Signal and Speaker Embeddings
Azadeh Nazemi, Ashkan Sami, Mahsa Sami, Amir Hussain

Iterative Speech Enhancement with Transformers
Azadeh Nazemi, Ashkan Sami, Mahsa Sami, Amir Hussain

Towards Low-Energy Low-Latency Multimodal Open Master Hearing Aid
Adewale Adetomi, Xianpo Ni, Mandar Gogate, Kia K. Dashtipour, Tughrul Arslan, Amir Hussain

DAVSE: A Diffusion-Based Generative Approach for Audio-Visual Speech Enhancement
Chia-Wei Chen, Jen-Cheng Hou, Yu Tsao, Jun-Cheng Chen, Shao-Yi Chien

Target Speaker Direction Estimation using Eye Gaze and Head Movement for Hearing Aids
Arif Reza Anwary, Mandar Gogate, Kia K. Dashtipour, Jen-Cheng Hou, Tughrul Arslan, Yu Tsao, Michael Akeroyd, Amir Hussain

Evaluating the Audio-Visual Speech Enhancement Challenge (AVSEC) Baseline Model Using an Out-of-Domain Free-Flowing Corpus
Kia K. Dashtipour, Mandar Gogate, Adeel Hussain, Bryony Buck, Arif Reza Anwary, Tughrul Arslan, Amir Hussain

Towards cloud-based and federated A-Synchronous Speech enhancement using Deep Neuro-fuzzy Models: Review, Challenges & Future Directions
Riaz Ul Amin, Mandar Gogate, Kia K. Dashtipour, Adeel Hussain, Tughrul Arslan, Amjad Ullah, Faiyaz Doctor, Tharmalingam Ratnarajah, Mathini Sellathurai, Amir Hussain

Sign Assist: Real-Time Isolated Sign Language Recognition and Translator Model Connecting Sign Language Users with GPT Model
Shahzeen Ijaz Ahmad, Nabeel Sabir, Adnan Abid, Amir Hussain



Keynote: Dr Peter Derleth (Sonova AG)

Keynote: Prof Yu Tsao (Academia Sinica)
