ISCA Archive IberSPEECH 2022

VoxCeleb-PT – a dataset for a speech processing course

John Mendonca, Isabel Trancoso

This paper introduces VoxCeleb-PT, a small dataset of voices of Portuguese celebrities that can be used as a language-specific extension of the widely used VoxCeleb corpus. We also describe three lab assignments in which it was used in a one-semester speech processing course (age regression, speaker verification, and speech recognition) to highlight the relevance of this dataset as a pedagogical tool. Additionally, this paper confirms the limitations of current systems when evaluated in different languages and acoustic conditions: we found a degradation of performance on all of the proposed tasks.