Gauging Similarity via N-Grams: Language-Independent Sorting, Categorization, and Retrieval of Text

Mark Damashek

January 31, 1995