ISCA Archive SLaTE 2007
ISCA Archive SLaTE 2007

Using natural language parsers in plagiarism detection

Maxim Mozgovoy, Tuomo Kakkonen, Erkki Sutinen

The problem of plagiarism detection system design is a subject of numerous works of the last decades. Various advanced file-file comparison techniques were developed. However, most existing systems, aimed at natural language texts, do not perform any significant preprocessing of the input documents. So in many cases it is possible to hide the presence of plagiarism by utilizing some simple techniques. In this work we show how a natural language parser can be used to fight against basic plagiarism hiding methods.