In this paper, we propose a precise but simple inter-word diphone model (IDM) for ward-spotting based on SMQ/HMM. We have applied ordinary diphone models to a speaker-independent, large-vocabulary word recognition unit. However, because users are apt to add words and/or extraneous speech, accuracy degrades due to the mismatch of models at word-boundaries. The IDM represents transition from the preceding phonemes to a word or from a word to the succeeding phonemes. An experiment showed that the IDMa reduce error rates by about 5% for speech containing unknown words and extraneous speech. The experiment also showed that the proposed method ensured performance good enough for the practical use of a large-vocabulary, isolated-word recognition system.