ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

A pitch tracking corpus with evaluation on multipitch tracking scenario

Gregor Pirker, Michael Wohlmayr, Stefan Petrik, Franz Pernkopf

In this paper, we introduce a novel pitch tracking database (PTDB) including ground truth signals obtained from a laryngograph. The database, referenced as PTDB-TUG, consists of 2342 phonetically rich sentences taken from the TIMIT corpus. Each sentence was at least recorded once by a male and a female native speaker. In total, the database contains 4720 recordings from 10 male and 10 female speakers. Furthermore, we evaluated two multipitch tracking systems on a subset of speakers to provide a benchmark for further research activities. The database can be downloaded at http://www.spsc.tugraz.at/tools.