ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Automatic detection of disturbing robot voice- and ping pong-effects in GSM transmitted speech

Martin Paping, Thomas Fahnle

This contribution reports about a method to automatically detect the disturbing Robot Voice and Ping Pong effect which occur in GSM transmitted speech. Both effects are caused by the frame substitution technique, recommended by the GSM standard: in these cases the transmitted speechmay be modulated by a disturbing 50 Hz component. These modulations can be detected very easily in the frequency domain. By a framewise comparision of the modulation amplitude of an undisturbed clean speech signal with a test signal it is possible to locate the occurrence of Robot Voice and Ping Pong very precisely. Comparing human perception to the outcome of the proposed algorithm shows a high degree of correspondence.


doi: 10.21437/Eurospeech.1997-466

Cite as: Paping, M., Fahnle, T. (1997) Automatic detection of disturbing robot voice- and ping pong-effects in GSM transmitted speech. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1631-1634, doi: 10.21437/Eurospeech.1997-466

@inproceedings{paping97_eurospeech,
  author={Martin Paping and Thomas Fahnle},
  title={{Automatic detection of disturbing robot voice- and ping pong-effects in GSM transmitted speech}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1631--1634},
  doi={10.21437/Eurospeech.1997-466},
  issn={1018-4074}
}