Detecting phrase-level duplication on the world wide web (2005)

by Dennis Fetterly
Venue: In Proceedings of the 28th Annual International ACM SIGIR Conference on Research & Development in Information Retrieval
Citations: 40 - 1 self

Active Bibliography

13 Spam, Damn Spam, and Statistics: Using statistical analysis to locate spam web pages – Dennis Fetterly, Mark Manasse, Marc Najork - 2004
1 Inter-document similarity in web searches – Bruno Martins - 2004
40 Thwarting the nigritude ultramarine: learning to identify link spam – Isabel Drost, Tobias Scheffer - 2005
6 Undue influence: Eliminating the impact of link plagiarism on web search rankings – Baoning Wu, Brian D. Davison - 2006
6 Survey on web spam detection: principles and algorithms – Nikita Spirin, Jiawei Han
2 Evolutionary Study of Phishing – Danesh Irani, Steve Webb, Jonathon Giffin, Calton Pu - 2008
5 The Co-Evolution of Systems and – Shaozhi Ye, Ji-rong Wen, Wei-ying Ma - 2004
14 Using Bloom filters to refine web search results – Navendu Jain - 2005
3 EFFICIENT ARCHIVAL DATA STORAGE – Lawrence You - 2006
54 On the Evolution of Clusters of Near-Duplicate Web Pages – Dennis Fetterly, Mark Manasse, Marc Najork - 2003
General Terms Experimentation – Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru
20 Detecting Semantic Cloaking on the Web – Baoning Wu, Brian D. Davison - 2006
67 SpamRank -- Fully Automatic Link Spam Detection – Andras A. Benczur, Karoly Csalogany, Tamas Sarlos, Máté Uher - 2005
49 The connectivity sonar: detecting site functionality by structural patterns – Einat Amitay, David Carmel, Adam Darlow, Ronny Lempel, Aya Soffer - 2003
Emerging Applications of Link Analysis for Ranking – Paul-Alexandru Chirita - 2007
15 Site Level Noise Removal for Search Engines – Andre Luiz da Costa Carvalho, Paul-Alexandru Chirita, Edleno Silva De Moura, Pavel Calado, Wolfgang Nejdl - 2006
83 Identifying Link Farm Spam Pages – Baoning Wu, Brian D. Davison - 2005
6 Lazy Preservation: Reconstructing Websites from the Web Infrastructure – Frank McCown - 2007
11 Introducing the Portuguese web archive initiative – Daniel Gomes, André Nogueira, João Miranda, Miguel Costa