ISCA Archive Interspeech 2023
ISCA Archive Interspeech 2023

Mapping Phonemes to Acoustic Symbols and Codes Using Synchrony in Speech Modulation Vectors Estimated by the Travellingwave Filter Bank

Ashwin Rao

A hybrid vector representation for speech resonances is defined using the modulation model and the sum of sinusoids model. An adaptive filter bank, whose channels utilize resonance localized modulation tracking, to robustly estimate temporal variations in these vectors, is then presented. The synchrony in modulations, within and across resonance channels, is subsequently used to derive acoustic symbols and codes that map fundamental units of languages, phonemes. Such an acoustic-phonetic mapping has never been demonstrated before. It has potential applications in speech recognition and voice analytics.