UbiCrawler: a scalable fully distributed web crawler (2003)

by Paolo Boldi, Bruno Codenotti, Massimo Santini, Sebastiano Vigna
Venue: Software: Practice & Experience
Citations: 102 - 23 self

Documents Related by Co-Citation

2146 The PageRank Citation Ranking: Bringing Order to the Web – Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd - 1999
3262 The Anatomy of a Large-Scale Hypertextual Web Search Engine – Sergey Brin, Lawrence Page - 1998
161 The WebGraph Framework I: Compression Techniques – Paolo Boldi, Sebastiano Vigna - 2003
94 Ranking the Web Frontier – Nadav Eiron, Kevin S. McCurley, John A. Tomlin - 2004
2719 Authoritative Sources in a Hyperlinked Environment – Jon M. Kleinberg - 1999
96 WebBase: A repository of web pages – Jun Hirai, Sriram Raghavan, Hector Garcia-Molina, Andreas Paepcke - 1999
133 Mercator: A scalable, extensible web crawler – Allan Heydon, Marc Najork - 1999
101 Breadth-first search crawling yields high-quality pages – Marc Najork - 2001
35 The Link Database: Fast Access to Graphs of the Web – Keith H. Randall, Raymie Stata, Rajiv G. Wickremesinghe, Janet L. Wiener
55 Breadth-first crawling yields high-quality pages – M Najork, J L Wiener
68 SpamRank -- Fully Automatic Link Spam Detection – Andras A. Benczur, Karoly Csalogany, Tamas Sarlos, Máté Uher - 2005
27 Using Rank Propagation and Probabilistic Counting for Link-Based Spam Detection – Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates - 2006
63 Recognizing Nepotistic Links on the Web – Brian D. Davison - 2000
129 Exploiting the Block Structure of the Web for Computing PageRank – Sepandar Kamvar, Taher Haveliwala, Christopher Manning, Gene Golub - 2003
133 Extrapolation Methods for Accelerating PageRank Computations – Sepandar Kamvar, Taher Haveliwala, Christopher Manning, Gene Golub - 2003
170 Web Spam Taxonomy – Zoltan Gyöngyi, Hector Garcia-Molina - 2005
2369 Modern Information Retrieval – Ricardo Baeza-Yates, Berthier Ribeiro-Neto - 1999
73 Design and Implementation of a High-Performance Distributed Web Crawler – Vladislav Shkapenyuk, Torsten Suel - 2002
122 Topical Locality in the Web – Brian D. Davison - 2000