ISCA Archive Interspeech 2015

Implementation of a live dialectal media subtitling system

Michael Stadtschnitzer, Christoph Schmidt

Subtitling is a useful technique to fulfil the information needs of deaf and hearing-impaired people. Live subtitling is needed especially for live events and is not restricted to television: it can also be provided to people on site, e.g. to a deaf politician during a parliamentary debate. Live subtitling is demanding because the audio information has to be transformed into text within a few seconds. In this demonstration, we present the Fraunhofer IAIS audio and video live subtitling system for standard German and Bavarian. The system was developed in the “Live-Caption” project and consists of online speech recognition and speaker diarisation components. We train it on our entire large annotated broadcast corpus of standard German. In addition, we apply the system to the Bavarian dialect by adapting the acoustic models and the pronunciation lexicon using a Bavarian media corpus. Owing to the real-time restrictions and the spontaneous character of dialectal speech, the system performance is far from perfect, but encouraging.