Gauging similarity with n-grams: Language-independent categorization of text (1995)

by M Damashek
Venue: Science