ISCA Archive Eurospeech 1993
ISCA Archive Eurospeech 1993

Prolog tools for accessing the phondat database of spoken German

Christoph Draxler, Hans G. Tillmann, Barbara Eisen

The PhonDat project within the German VERBMOBIL research initiative aims at creating and making accessible a very large database of symbolic and signal data of spoken high German. Currently, the PhonDat database consists of one corpus of sentences containing all phoneme combinations of high German, and of one corpus of sentences from a train enquiries scenario. All symbolic data is held in a Prolog system with a powerful database management system extension; signal data is stored in external files. The database is accessed through queries over the symbolic data. The result of a query evaluation is either again symbolic data, or a reference to signal files and signal fragments within these files. Two access modes are supported: a toolbox of pre-defined high-level query predicates for standard, albeit complex, queries; and the full Prolog programming language for custom applications.