Mining E-mail Content for Author Identification Forensics (2001)
| Venue: | SIGMOD RECORD |
| Citations: | 59 - 1 self |
BibTeX
@ARTICLE{Vel01mininge-mail,
author = {Olivier de Vel and Alison Anderson and Malcolm Corney and George Mohay},
title = {Mining E-mail Content for Author Identification Forensics},
journal = {SIGMOD RECORD},
year = {2001},
volume = {30},
pages = {55--64}
}
Years of Citing Articles
OpenURL
Abstract
We describe an investigation into e-mail content mining for author identification, or authorship attribution, for the purpose of forensic investigation. We focus our discussion on the ability to discriminate between authors for the case of both aggregated e-mail topics as well as across different email topics. An extended set of e-mail document features including structural characteristics and linguistic patterns were derived and, together with a Support Vector Machine learning algorithm, were used for mining the e-mail content. Experiments using a number of e-mail documents generated by different authors on a set of topics gave promising results for both aggregated and multi-topic author categorisation.







