This paper presents a tool for computer-aided training of spoken language. The tool's environment uses both visual and auditory feedback to help a user learn pronunciation and intonation in L1 or L2. Learning is supported by displaying the user's speech and its relevant parameters (volume, F0 and spectrum) in parallel with multiple reference templates. The templates may belong to the same utterance or form a minimal pair that can be used for contrastive training. The time plots are accompanied by textual labels (phonemes, syllables or words) that are automatically aligned to the user's utterance, and by plots that identify the regions with major deviations from the reference templates. The tool has been tested in two tasks: a) speech training of a deaf person and b) learning pronunciation and intonation in a foreign language.
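
As an illustration of how regions with major deviations from a reference template might be identified, the following minimal sketch compares one parameter contour of the user (for example, per-frame F0 values) with the corresponding contour of a template. The abstract does not state which alignment or comparison method the tool uses. Dynamic time warping (DTW) is assumed here only as a common choice, and the function names `dtw_path` and `deviation_regions`, as well as the `threshold` parameter, are hypothetical.

```python
from typing import List, Tuple


def dtw_path(user: List[float], ref: List[float]) -> List[Tuple[int, int]]:
    """Align two contours with classic DTW and return the warping path.

    Assumption: DTW stands in for whatever alignment the tool really uses.
    """
    n, m = len(user), len(ref)
    inf = float("inf")
    # cost[i][j]: minimal accumulated distance aligning user[:i] with ref[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(user[i - 1] - ref[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end of both contours to recover the path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(steps)
    path.reverse()
    return path


def deviation_regions(
    user: List[float], ref: List[float], threshold: float
) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) user-frame ranges whose deviation
    from the aligned reference exceeds the (hypothetical) threshold."""
    # For each user frame, keep the smallest deviation among matched ref frames.
    dev = [float("inf")] * len(user)
    for i, j in dtw_path(user, ref):
        dev[i] = min(dev[i], abs(user[i] - ref[j]))
    regions = []
    start = None
    for i, d in enumerate(dev):
        if d > threshold and start is None:
            start = i
        elif d <= threshold and start is not None:
            regions.append((start, i - 1))
            start = None
    if start is not None:
        regions.append((start, len(user) - 1))
    return regions


# Usage: a flat reference contour and a user contour with a raised middle part.
user_f0 = [100.0, 100.0, 150.0, 150.0, 100.0, 100.0]
ref_f0 = [100.0, 100.0, 100.0, 100.0, 100.0, 100.0]
print(deviation_regions(user_f0, ref_f0, threshold=20.0))
```

In this sketch, differences in speaking rate are absorbed by the alignment, so a slower but otherwise matching utterance produces no flagged regions. Only frames whose parameter value differs from the aligned template by more than the threshold are reported.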