MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Extrapolation Methods for Accelerating PageRank Computations (2003) [98 citations — 11 self]

Abstract:

We present a novel algorithm for the fast computation of PageRank, a hyperlink-based estimate of the "importance" of Web pages. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Quadratic Extrapolation, accelerates the convergence of the Power Method by periodically subtracting off estimates of the nonprincipal eigenvectors from the current iterate of the Power Method. In Quadratic Extrapolation, we take advantage of the fact that the first eigenvalue of a Markov matrix is known to be 1 to compute the nonprincipal eigenvectors using successive iterates of the Power Method. Empirically, we show that using Quadratic Extrapolation speeds up PageRank computation by 25-- 300% on a Web graph of 80 million nodes, with minimal overhead. Our contribution is useful to the PageRank community and the numerical linear algebra community in general, as it is a fast method for determining the dominant eigenvector of a matrix that is too large for standard fast methods to be practical.

Citations

1669 Authoritative sources in a hyperlinked environment – Kleinberg - 1999
1064 The PageRank Citation Ranking: Bringing Order to the Web – Page, Brin, et al. - 1999
349 Improved algorithms for topic distillation in hyperlinked environments – Bharat, Henzinger - 1998
339 Focused crawling: a new approach to topic-specific (web) resource discovery – Chakrabarti, Berg, et al. - 1999
244 Automatic resource compilation by analyzing hyperlink structure and associated text – Chakrabarti, Dom, et al. - 1998
229 Topic-sensitive PageRank – Haveliwala - 2002
208 The web as a graph: Measurements, models and methods – Kleinberg, Kumar, et al. - 1999
158 Probability and random processes – Grimmett, Stirzaker - 2001
134 Scaling personalized web search – Jeh, Widom - 2003
102 Efficient computation of PageRank – Haveliwala - 1999
101 Numerical linear algebra – LN, Bau - 1997
98 The intelligent surfer: Probabilistic combination of link and content information in PageRank – Richardson, Domingos - 2002
81 Comparing top k lists – Fagin, Kumar, et al. - 2003
71 WebBase: A Repository of Web Pages – Hirai, Raghavan, et al. - 2000
48 What is this page known for? computing web page reputations – RAFIEI, MENDELZON
42 PageRank computation and the structure of the Web: experiments and algorithms – Arasu, Novak, et al. - 2002
42 The Second Eigenvalue of the Google Matrix – Haveliwala, Kamvar - 2003
36 The structure of broad topics on the Web – Chakrabarti, Joshi, et al. - 2002
17 On Bernoulli's numerical solution of algebraic equations – Aitken - 1926
8 On the convergence and stability of the epsilon algorithm – Wynn - 1966
7 Numerical solution of large finite markov chains by algebraic multigrid techniques – Krieger - 1995