We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DS

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Data Structures and Algorithms

Title: Approximating PageRank locally with sublinear query complexity

Abstract: Can we approximate the centrality score of a given node by exploring only a vanishing fraction of the graph? In this paper we develop a combination of techniques that we apply to the case of PageRank, where the centrality of a node depends on \emph{every} arc in the graph. We obtain an algorithm that, given any one node in any $n$-node graph, with probability $1-\delta$ returns a multiplicative $(1\pm\epsilon)$ approximation of its PageRank score using at most $O\big(n^{\frac{2}{3}}\sqrt[3]{\log(n/\delta)}\,\ln(1/\delta)^\frac{1}{3}\epsilon^{-\frac{2}{3}}\big) = \tilde{O}(n^{\frac{2}{3}})$ graph exploration queries -- where a query returns either a given node's list of neighbours, or a node chosen uniformly at random. Until now, every algorithm required in general $\Omega(n)$ queries. We also show that this upper bound is essentially optimal by proving an almost matching lower bound. Our techniques can be applied to other centrality measures, even though we leave the analysis of the resulting bounds for future work.
Comments: 29 pages, 3 figures
Subjects: Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
Cite as: arXiv:1404.1864 [cs.DS]
  (or arXiv:1404.1864v2 [cs.DS] for this version)

Submission history

From: Marco Bressan [view email]
[v1] Mon, 7 Apr 2014 17:57:33 GMT (32kb,D)
[v2] Mon, 5 Feb 2018 11:33:17 GMT (45kb)
[v3] Sat, 4 Aug 2018 09:58:58 GMT (43kb)

Link back to: arXiv, form interface, contact.