This paper discusses issues in evaluating spoken language dialogue systems in terms of technical performance and end-user acceptance. Recent efforts in this domain have been carried out in the framework of two major research initiatives: the European Esprit longterm project Spoken Language Dialogue Systems and Components - Best practice in development and evaluation (DISC) and the US American DARPA COMMUNICATOR project.