ISCA Archive AVSP 1998

Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano

An audio-visual intelligibility score is generally used as an evaluation measure in visual speech synthesis. In particular, the intelligibility score of a talking head reflects the accuracy of its facial model [1][2]. Facial modeling has two stages: construction of real faces and realization of dynamic, human-like motion. We focus on the second stage, synthesizing lip movements from input acoustic speech. The goal of our research is to synthesize lip movements natural enough to support lip-reading. In previous research, we proposed a lip movement synthesis method using HMMs that can incorporate a forward coarticulation effect, and confirmed its effectiveness through objective evaluation tests. In this paper, we evaluate the method subjectively by conducting an intelligibility test and an acceptability test.

Cite as: Yamamoto, E., Nakamura, S., Shikano, K. (1998) Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis. Proc. Auditory-Visual Speech Processing, 227-232

  author={Eli Yamamoto and Satoshi Nakamura and Kiyohiro Shikano},
  title={{Subjective Evaluation for HMM-Based Speech-To-Lip Movement Synthesis}},
  booktitle={Proc. Auditory-Visual Speech Processing},