Author:Butgereit, L; Botha, RADate:Oct 2013Mobile Instant Messaging (MIM) systems have produced a new convention in writing where vowels are often omitted, where new suffixes have appeared, where numerals and symbols often appear in the place of letters which have a similar shape or ...Read more
Author:Butgereit, LL; Botha, RADate:Sep 2011N-grams are used to quantify the similarity between two documents or the similarity between two collections of words. This paper shows how N-grams of length 3 and 4 both coupled with text processing (including stop word removal and stemming ...Read more