ISCA Archive Interspeech 2010
ISCA Archive Interspeech 2010

Can conversational word usage be used to predict speaker demographics?

Dan Gillick

This work surveys the potential for predicting demographic traits of individual speakers (gender, age, education level, ethnicity, and geographic region) using only word usage features derived from the output of a speech recognition system on conversational American English. Significant differences in word usage patterns among the different classes allows for reasonably high classification accuracy (60%-82%), even without extensive training data.