doi: 10.21437/AVSEC.2024
AI / DNN based speech enhancement in hearing aids
Peter Derleth
Speech Enhancement and Its Application to Assistive Oral Communication Technologies
Yu Tsao
Multi-Model Dual-Transformer Network for Audio-Visual Speech Enhancement
Fazale Wahab, Nasir Saleem, Amir Hussain, Muhammad Rizwan, Md Bipul Hossen
AI as the Articulator: Leveraging ChatGPT 3.5 for Audio-Visual Speech Enhancement
Shahab S Sohail, Mandar Gogate, Tassadaq Hussain, Kia K. Dashtipour, Muhammed Riaz, Zain Hussain, Usman Anwar, Adele Goman, Tughrul Arslan, Amir Hussain
RecognAVSE: An Audio-Visual Speech Enhancement Approach using Separable 3D convolutions and Deep Complex U-Net
João Renato Ribeiro Manesco, Leandro A Passos, Rahma Fourati, João Papa, Amir Hussain
A Target Speaker Extraction Method for the 3rd Audio-Visual Speech Enhancement Challenge
Zhan Jin, Bang Zeng, Zhuo Li, Xin Liu, Ming Li
A Lightweight Real-time Audio-Visual Speech Enhancement Framework
Mandar Gogate, Kia K. Dashtipour, Amir Hussain
AVSE-Pruner: Filter Pruning of Audio-Visual Speech Enhancement System using Multi-objective Binary Particle Swarm Optimization
Rahma Fourati, Jihene Tmamna, Najwa Kouka, Mandar Gogate, Kia K. Dashtipour, Leandro A Passos, João Papa, Tughrul Arslan, Amir Hussain
Towards Cross-Lingual Audio-Visual Speech Enhancement
Kia K. Dashtipour, Mandar Gogate, Shafique Ahmed, Adeel Hussain, Tassadaq Hussain, Jen-Cheng Hou, Tughrul Arslan, Yu Tsao, Amir Hussain
LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement
Arnav Jain, Jasmer S. Sanjotra, Harshvardhan Choudhary, Krish Agrawal, Rupal Shah, Rohan Jha, Md Sajid, Amir Hussain, M Tanveer
Real-Time Audio Visual Speech Enhancement: Integrating Visual Cues for Improved Performance
Utkarsh Tiwari, Mandar Gogate, Kia K. Dashtipour, Eamon Sheikh, Rimjhim Singh, Tughrul Arslan, Amir Hussain
Asynchronicity between Visual and Auditory Information in Audiovisual Speech: Evidence from Four Types of Consonant-words /b/, /t/, /k/ and /g/
Biao Zeng, Keira Evans, Mia Carne, Lauren Game, Erik Persson
Privacy Considerations for Wearable Audio-Visual AI in Hearing Aids
Poppy Welch, Jennifer Williams
Next-Generation Speech Recognition Using Radar Sensing
Michaela B. Reay, Balal Saleemi, Qammer H. Abbasi, Hira Hameed, Kia K. Dashtipour, Amir Hussain, Muhammad Ali Imran
Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
Shafique Ahmed, Chia-Wei Chen, WenZe Ren, Chin-Jou Li, Ernie Chu, Jun-Cheng Chen, Amir Hussain, Hsin-Min Wang, Yu Tsao, Jen-Cheng Hou
Mobile phone-based speech enhancement using cognitive load and fuzzy reasoning for normal and hearing-impaired users
Song Chen, Usman Anwar, Jasper Kirton-Wingate, Faiyaz Doctor, Adeel Hussain, Ting Zhou, Arif Anwary, Kia K. Dashtipour, Mandar Gogate, Jen-Cheng Hou, Yu Tsao, Michael Akeroyd, Tughrul Arslan, Amir Hussain
A Framework for Speech Enhancement based on Audio Signal and Speaker Embeddings
Azadeh Nazemi, Ashkan Sami, Mahsa Sami, Amir Hussain
Iterative Speech Enhancement with Transformers
Azadeh Nazemi, Ashkan Sami, Mahsa Sami, Amir Hussain
Towards Low-Energy Low-Latency Multimodal Open Master Hearing Aid
Adewale Adetomi, Xianpo Ni, Mandar Gogate, Kia K. Dashtipour, Tughrul Arslan, Amir Hussain
DAVSE: A Diffusion-Based Generative Approach for Audio-Visual Speech Enhancement
Chia-Wei Chen, Jen-Cheng Hou, Yu Tsao, Jun-Cheng Chen, Shao-Yi Chien
Target Speaker Direction Estimation using Eye Gaze and Head Movement for Hearing Aids
Arif Reza Anwary, Mandar Gogate, Kia K. Dashtipour, Jen-Cheng Hou, Tughrul Arslan, Yu Tsao, Michael Akeroyd, Amir Hussain
Evaluating the Audio-Visual Speech Enhancement Challenge (AVSEC) Baseline Model Using an Out-of-Domain Free-Flowing Corpus
Kia K. Dashtipour, Mandar Gogate, Adeel Hussain, Bryony Buck, Arif Reza Anwary, Tughrul Arslan, Amir Hussain
Towards cloud-based and federated A-Synchronous Speech enhancement using Deep Neuro-fuzzy Models: Review, Challenges & Future Directions
Riaz Ul Amin, Mandar Gogate, Kia K. Dashtipour, Adeel Hussain, Tughrul Arslan, Amjad Ullah, Faiyaz Doctor, Tharmalingam Ratnarajah, Mathini Sellathurai, Amir Hussain
Sign Assist: Real-Time Isolated Sign Language Recognition and Translator Model Connecting Sign Language Users with GPT Model
Shahzeen Ijaz Ahmad, Nabeel Sabir, Adnan Abid, Amir Hussain