ISCA Archive Eurospeech 2003
ISCA Archive Eurospeech 2003

Spanish broadcast news transcription

Gerhard Backfried, Roser Jaquemot Caldes

We describe the Sail Labs Media Mining System (MMS) aimed at the transcription of Castilian Spanish broadcast-news. In contrast to previous systems, the focus of this system is on Spanish as spoken on the Iberian Peninsula as opposed to the Americas. We discuss the development of a Castilian Spanish broadcast-news corpus suitable for training the various system components of the MMS and report on the development of the speech-recognition component using the newly established corpora.