Data analysis project: Leveraging massive textual corpora using n-gram statistics (2008)

by Andrew Carlson, Tom M Mitchell, Ian Fette