ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Performance of speaker-dependent wideband speech coding

Ethan R. Duni, Bhaskar D. Rao

This paper examines the performance gains available in wideband speech coding using speaker-dependent systems. It is shown that a performance gain of 4 bits per frame, in the rate-distortion sense, is achievable in the LSF coding. While variations are evident in the pitch lag statistics during voiced frames, there is no gain to be had in unvoiced frames or in the adaptive gains; thus, there is little benefit to speaker-dependent coding of adaptive codebook parameters. Lastly, it was shown that gains of 40-50 bits per frame are available in the fixed excitation. These performance boosts can be exploited in a number of ways, most simply by reducing the operating rate. Alternatively, the complexity of the coding systems can be reduced while maintaining the same performance of speaker-independent coding. It was shown that a reduction in complexity by a factor of 4 is achievable using speaker-dependent LSF quantization.