ISCA Archive Interspeech 2021
ISCA Archive Interspeech 2021

Handling Acoustic Variation in Dysarthric Speech Recognition Systems Through Model Combination

Enno Hermann, Mathew Magimai-Doss

Developing automatic speech recognition (ASR) systems that recognise dysarthric speech as well as control speech from unimpaired speakers remains challenging. Including more highly variable dysarthric speech during training can also negatively affect the performance on control speakers, which is not desirable when developing speech recognisers for a wider audience. In this work, we analyse how the acoustic variability of dysarthric speech affects ASR systems and propose the combination of multiple acoustic models trained on different subsets of speakers to mitigate this effect. This approach shows improvements for both dysarthric and control speakers on the Torgo and UA-Speech corpora.


doi: 10.21437/Interspeech.2021-2212

Cite as: Hermann, E., Magimai-Doss, M. (2021) Handling Acoustic Variation in Dysarthric Speech Recognition Systems Through Model Combination. Proc. Interspeech 2021, 4788-4792, doi: 10.21437/Interspeech.2021-2212

@inproceedings{hermann21_interspeech,
  author={Enno Hermann and Mathew Magimai-Doss},
  title={{Handling Acoustic Variation in Dysarthric Speech Recognition Systems Through Model Combination}},
  year=2021,
  booktitle={Proc. Interspeech 2021},
  pages={4788--4792},
  doi={10.21437/Interspeech.2021-2212},
  issn={2958-1796}
}