Recent developments in the field of speech technology leead us to assume that speech synthesis techniques could offer a quality sufficient for practical applications, such as information services or aids for handicapped people (Deliege 1989). To facilitate the diffusion of these applications it is necessary to develop reliable and valid performance measures to compare different systems for different applications. Some results on comparisons among different evaluation methodologies such as categorical estimation, magnitude estimation, paired comparison, and reaction time, are reported in the paper.