ISCA Archive Interspeech 2025
ISCA Archive Interspeech 2025

Towards Sentence Level Imagined Speech Generation from EEG signals

Sparsh Rastogi, Harsh Dadwal, Khushboo Modi, Jatin Bedi, Jasmeet Singh

Brain-Computer Interfaces (BCIs) have emerged as alternative means of communication for individuals with speech & motor impairments. These systems enable patients to express themselves without any articulation, by decoding speech from neural activity. However, most of the existing studies rely on invasive surgical procedures, with limited studies using non-invasive signals for phoneme or word level classification, thus covering a short vocabulary. To the best of our knowledge, this study presents the first demonstration of a framework for sentence level imagined speech synthesis from non-invasive electroencephalography (EEG) signals. Our model uses an Efficient-Net based masked auto-encoder approach for learning feature embeddings from EEG signals which are then used for fine-tuning BERT for next token generation. For this study, Large Spanish Speech EEG Dataset has been used with a mixed subject approach for both training & evaluation purposes, resulting into a 48.92% accuracy.