ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

The buckeye corpus of speech: updates and enhancements

Eric Fosler-Lussier, Laura Dilley, Na'im Tyson, Mark Pitt

This paper describes recent progress in the development of the Buckeye Corpus of Speech, a phonetically labeled corpus of conversational American English speech, first described in [1]. With the publication of the second phase of transcription, the corpus has nearly doubled in size from the first release. We briefly give an overview of the corpus, report on additional studies of inter-labeler agreement, and describe a new GUI designed to facilitate searching the annotated speech files.