SpotSigs: robust and efficient near duplicate detection in large web collections (2008)

by Martin Theobald, Jonathan Siddharth, Andreas Paepcke
Venue: In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval