ISCA Archive Eurospeech 2001
ISCA Archive Eurospeech 2001

Turkish word segmentation using morphological analyzer

M. Oguzhan Külekcý, Mehmed Özkan

This paper describes an algorithm to segment an input Turkish string without any spaces, which may be an output of a speech-to-text application, into words by using morphological analyser. It is quite possible to use the algorithm on other languages, which has a morphological analysis component, as well. Turkish morphological analyser is designed and implemented as the linguistic engine of the algorithm. The construction of the analyser proposes a technique that attempts to achieve group vise morpheme recognition instead of searching suffixes one by one in a word.