ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

A 16-kbit/s wideband speech codec scalable with g.729

A. Kataoka, S. Kurihara, S. Sasaki, S. Hayashi

A wideband speech scalable codec is proposed for improving the flexibility in telecommunication networks. This coder is scalable with G.729 (ITU 8-kbit/s standard). Its decoder can process the incoming bitstream at three bit rates (8, 12, and 16 kbit/s) and provide a choice of speech types (wideband and telephone-band). The codec has a split-band structure, where both bands are coded by analysis-by-synthesis techniques. This paper proposes two types of scalable codec: a separate one and a composite one. It also proposes a new method (an additional adaptive codebook) for predicting pitch, while maintaining scalability with the G.729 codec. Subjective testing for wideband speech showed that the quality of the proposed codec at 16-kbit/s is equivalent to that of the 64-kbit/s G.722, and at 12-kbit/s is better than that of the 48-kbit/s G.722. Testing has further demonstrated that the 8-kbit/s coder provides high quality for telephone-band speech.


doi: 10.21437/Eurospeech.1997-431

Cite as: Kataoka, A., Kurihara, S., Sasaki, S., Hayashi, S. (1997) A 16-kbit/s wideband speech codec scalable with g.729. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 1491-1494, doi: 10.21437/Eurospeech.1997-431

@inproceedings{kataoka97_eurospeech,
  author={A. Kataoka and S. Kurihara and S. Sasaki and S. Hayashi},
  title={{A 16-kbit/s wideband speech codec scalable with g.729}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={1491--1494},
  doi={10.21437/Eurospeech.1997-431},
  issn={1018-4074}
}