The system presented in this paper produces synthetic speech by concatenating three types of units to generate a syllable. While the initial unit is a classic demisyllable, the final demisyllable is split into a VC diphone and an optional suffix containing the final consonant cluster. Rules controlling the synthetisizing process are formulated in a language especially developed for text-to-speech synthesis. A set of rules for duration control and concatenation is presented.