ISCA Archive Interspeech 2006
ISCA Archive Interspeech 2006

High-rate data embedding in unvoiced speech

Konrad Hofbauer, Gernot Kubin

We propose a blind speech watermarking algorithm which allows high-rate embedding of digital side information into speech signals. We exploit the fact that the well-known LPC vocoder works very well for unvoiced speech. Using an auto-correlation based pitch tracking algorithm, a voiced/unvoiced segmentation is carried out. In the unvoiced segments, the linear prediction residual is replaced by a data sequence. This substitution does not cause perceptual degradation as long as the residual’s power is matched. The signal is resynthesised using the unmodified LPC filter coefficients. The watermark is decoded by a linear prediction analysis of the received signal and the information is extracted from the sign of the residual. The watermark is nearly imperceptible and provides a channel capacity of up to 2000 bit/s in an 8 kHz-sampled speech signal.


doi: 10.21437/Interspeech.2006-67

Cite as: Hofbauer, K., Kubin, G. (2006) High-rate data embedding in unvoiced speech. Proc. Interspeech 2006, paper 1906-Mon1FoP.10, doi: 10.21437/Interspeech.2006-67

@inproceedings{hofbauer06_interspeech,
  author={Konrad Hofbauer and Gernot Kubin},
  title={{High-rate data embedding in unvoiced speech}},
  year=2006,
  booktitle={Proc. Interspeech 2006},
  pages={paper 1906-Mon1FoP.10},
  doi={10.21437/Interspeech.2006-67},
  issn={2958-1796}
}