Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank. Hans De Sterck

Similar documents
Multigrid Methods for Markov Chains

SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND

Web search before Google. (Taken from Page et al. (1999), The PageRank Citation Ranking: Bringing Order to the Web.)

Multilevel Aggregation Methods for Small-World Graphs with Application to Random-Walk Ranking

MULTILEVEL AGGREGATION METHODS FOR SMALL-WORLD GRAPHS WITH APPLICATION TO RANDOM-WALK RANKING. Hans de Sterck. van Emden Henson, Geoffrey Sanders

Introduction to Multigrid and its Parallelization

Information Retrieval. Lecture 11 - Link analysis

PageRank and related algorithms

Agenda. Math Google PageRank algorithm. 2 Developing a formula for ranking web pages. 3 Interpretation. 4 Computing the score of each page

(Sparse) Linear Solvers

Link Analysis and Web Search


Motivation. Motivation

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page

Lecture 8: Linkage algorithms and web search

COMP 4601 Hubs and Authorities

A Multi-Level Method for the Steady State Solution of Markov Chains

COMP Page Rank

(Sparse) Linear Solvers

A Parallel Solver for Laplacian Matrices. Tristan Konolige (me) and Jed Brown

A Modified Algorithm to Handle Dangling Pages using Hypothetical Node

A brief history of Google

PageRank Algorithm Abstract: Keywords: I. Introduction II. Text Ranking Vs. Page Ranking

COMP5331: Knowledge Discovery and Data Mining

The PageRank Computation in Google, Randomized Algorithms and Consensus of Multi-Agent Systems

The PageRank Computation in Google, Randomized Algorithms and Consensus of Multi-Agent Systems

Smoothers. < interactive example > Partial Differential Equations Numerical Methods for PDEs Sparse Linear Systems

Lecture #3: PageRank Algorithm The Mathematics of Google Search

Linear Equation Systems Iterative Methods

smooth coefficients H. Köstler, U. Rüde

Reducing Communication Costs Associated with Parallel Algebraic Multigrid

Adaptive Smoothed Aggregation (αsa) Multigrid

CENTRALITIES. Carlo PICCARDI. DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy

Page rank computation HPC course project a.y Compute efficient and scalable Pagerank

Multigrid Pattern. I. Problem. II. Driving Forces. III. Solution

The PageRank Citation Ranking

Advanced Computer Architecture: A Google Search Engine

Information Networks: PageRank

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola

A Reordering for the PageRank problem

A Comparison of Algebraic Multigrid Preconditioners using Graphics Processing Units and Multi-Core Central Processing Units

Contents. I The Basic Framework for Stationary Problems 1

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

MAE 298, Lecture 9 April 30, Web search and decentralized search on small-worlds

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Numerical Algorithms

An efficient method for graph Laplacian systems

The PageRank Citation Ranking: Bringing Order to the Web

Parallel Implementations of Gaussian Elimination

COMPARATIVE ANALYSIS OF POWER METHOD AND GAUSS-SEIDEL METHOD IN PAGERANK COMPUTATION

Page Rank Algorithm. May 12, Abstract

Adaptive methods for the computation of PageRank

Mathematical Analysis of Google PageRank

Iterative Methods for Linear Systems

Parallel Hybrid Monte Carlo Algorithms for Matrix Computations

PhD Student. Associate Professor, Co-Director, Center for Computational Earth and Environmental Science. Abdulrahman Manea.

Collaborative filtering based on a random walk model on a graph

c 2006 Society for Industrial and Applied Mathematics

Comparison of different solvers for two-dimensional steady heat conduction equation ME 412 Project 2

1 2 (3 + x 3) x 2 = 1 3 (3 + x 1 2x 3 ) 1. 3 ( 1 x 2) (3 + x(0) 3 ) = 1 2 (3 + 0) = 3. 2 (3 + x(0) 1 2x (0) ( ) = 1 ( 1 x(0) 2 ) = 1 3 ) = 1 3

Algebraic Multigrid (AMG) for Ground Water Flow and Oil Reservoir Simulation

Large-Scale Networks. PageRank. Dr Vincent Gramoli Lecturer School of Information Technologies

PARALLEL METHODS FOR SOLVING PARTIAL DIFFERENTIAL EQUATIONS. Ioana Chiorean

CS6200 Information Retreival. The WebGraph. July 13, 2015

This is an author-deposited version published in: Eprints ID: 16478

Link Analysis from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer and other material.

Information Retrieval and Web Search Engines

Information Retrieval Lecture 4: Web Search. Challenges of Web Search 2. Natural Language and Information Processing (NLIP) Group

Lecture 15: More Iterative Ideas

On the regularizing power of multigrid-type algorithms

How Google Finds Your Needle in the Web's

ADAPTIVE SMOOTHED AGGREGATION (αsa)

Modeling web-crawlers on the Internet with random walksdecember on graphs11, / 15

1 Starting around 1996, researchers began to work on. 2 In Feb, 1997, Yanhong Li (Scotch Plains, NJ) filed a

Numerical Linear Algebra for Data and Link Analysis

Iterative Algorithms I: Elementary Iterative Methods and the Conjugate Gradient Algorithms

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

What is Multigrid? They have been extended to solve a wide variety of other problems, linear and nonlinear.

Link Structure Analysis

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Stochastic Multiscale Approaches to Consensus Problems

1.6 Case Study: Random Surfer

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

.. Spring 2009 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar..

INTRODUCTION TO DATA SCIENCE. Link Analysis (MMDS5)

CS 395T Lecture 12: Feature Matching and Bundle Adjustment. Qixing Huang October 10 st 2018

Introduction to Information Retrieval

Regularization by multigrid-type algorithms

CS-C Data Science Chapter 9: Searching for relevant pages on the Web: Random walks on the Web. Jaakko Hollmén, Department of Computer Science

c 2006 Society for Industrial and Applied Mathematics

An algebraic multigrid method for isotropic linear elasticity problems on anisotropic meshes

On the regularizing power of multigrid-type algorithms. Marco Donatelli. Stefano Serra Capizzano

1 Exercise: 1-D heat conduction with finite elements

Searching the Web [Arasu 01]

ETNA Kent State University

A PageRank Algorithm based on Asynchronous Gauss-Seidel Iterations

Algebraic Multigrid Methods Based on Generic Approximate Banded Inverse Matrix Techniques

Multiscale Methods. Introduction to Network Analysis 1

Solution of Ambrosio-Tortorelli Model for Image Segmentation by Generalized Relaxation Method

Distributed NVAMG. Design and Implementation of a Scalable Algebraic Multigrid Framework for a Cluster of GPUs

Transcription:

Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank Hans De Sterck Department of Applied Mathematics University of Waterloo, Ontario, Canada joint work with Steve McCormick, John Ruge, Tom Manteuffel, Geoff Sanders (University of Colorado at Boulder), Jamie Pearson (Waterloo) Departamento de Ingeniería Matemática Universidad de Chile, 25 April 2008

1. Ranking Pages in Web Search Google search: - keyword-based query results in list of web pages - pages are listed in order of importance : PageRank how does PageRank work? Markov chains

Ranking Pages in Web Search... how does PageRank work? a page has high rank if the sum of the ranks of its backlinks is high ( The PageRank Citation Ranking: Bringing Order to the Web (1998), Page, Brin, Motwani, Winograd)

Ranking Pages in Web Search... a page has high rank if the sum of the ranks of its backlinks is high ( The PageRank Citation Ranking: Bringing Order to the Web (1998), Page, Brin, Motwani, Winograd) PageRank = stationary vector of Markov chain random surfer, random walk on graph, robust against spam

2. Stationary Vector of Markov Chain B is column-stochastic if B is irreducible (every state can be reached from every other state in the directed graph) (no probability sinks!) largest eigenvalue of B:

3. Web Matrix Regularization PageRank (used by Google):

Web Matrix Regularization BackButton (add reverse of each link): 1- λ 2 O(1/n)

4. One-level Iterative Methods for Markov Chains largest eigenvalue of B: power method: convergence factor: convergence is very slow when (slow mixing)

Convergence of Power Method for PageRank convergence factor independent of problem size convergence is fast, linear in the number of web pages (15+ billion!) model is accurate (hence Google s $150B market cap...)

But How About Slowly Mixing Markov Chains? slow mixing: one-level methods (Power, Jacobi, Gauss Seidel,...) are way too slow need multi-level methods! (multigrid) applications: PageRank, BackButton, many other applications

5. Principle of Multigrid (for PDEs) 1D model problem: discretization: matrix form: # 2 "1 & % ( % "1 2 "1 ( A = 1 % "1 2 "1 ( h 2 % ( % O O O ( % "1 2 "1( % ( $ "1 2 '

Principle of Multigrid (for PDEs) use relaxation method (Gauss method) high-frequency error is removed by relaxation (Gauss- Seidel,...) low-frequency-error needs to be removed by coarse-grid correction

Principle of Multigrid (for PDEs) high-frequency error is removed by relaxation (weighted Jacobi, Gauss- Seidel,...) low-frequency-error needs to be removed by coarse-grid correction

Multigrid Hierarchy: V-cycle multigrid V-cycle: relax (=smooth) on successively coarser grids transfer error using restriction (R=P T ) and interpolation (P) W=O(n)

6. Aggregation for Markov Chains form three coarse, aggregated states

Aggregation in Matrix Form (Krieger, Horton,... 1990s)

Error Equation A c is a singular M-matrix

Multilevel Aggregation Algorithm (Krieger, Horton 1994, but no good way to build Q, convergence not good)

7. Choosing Aggregates Based on Strength (SIAM J. Scientific Computing, 2008) error equation: use strength of connection in define row-based strength (determine all states that strongly influence a row s state) state that has largest value in x i is seed point for new aggregate, and all unassigned states influenced by it join its aggregate repeat (similar to Algebraic Multigrid: Brandt, McCormick and Ruge, 1983)

8. Performance for PageRank

Performance for BackButton

9. Improved Algorithm: Smoothed Aggregation... (Vanek, Mandel, and Brezina, Computing, 1996) after relaxation: coarse grid correction with Q: coarse grid correction with Q s :

Non-smoothed Aggregation non-smoothed method: A c is an irreducible singular M-matrix

Smoothed Aggregation smooth the columns of P with weighted Jacobi: smooth the rows of R with weighted Jacobi:

Smoothed Aggregation smoothed coarse level operator: problem: A cs is not a singular M-matrix (signs wrong) solution: lumping approach on S in

Lumped Smoothed Aggregation is an irreducible singular M-matrix! (off-diagonal elements remain positive) theorem lumped operator is a singular M-matrix on all coarse levels, and thus allows for a unique strictly positive solution x c on all levels theorem exact solution x is a fixed point of the multigrid cycle (SIAM J. Scientific Computing, submitted)

10. Numerical Results: Uniform Chain

Numerical Results: Tandem Queueing Network

11. Influence of α: PageRank

BackButton Influence of α: BackButton

12. Conclusions multilevel method applied to Google s PageRank with α=0.15 is not faster than power method (SIAM J. Scientific Computing, 2008) smoothed multilevel method applied to slowly mixing Markov chains is much faster than power method (in fact, close to optimal O(n) complexity!) (SIAM J. Scientific Computing, submitted)

8. Performance for Web Ranking total efficiency factor of MAA relative to WJAC: : MAA 2 times more efficient than WJAC : MAA 2 times less efficient than WJAC

7. Well-posedness: Singular M-matrices singular M-matrix: our A=I-B is a singular M-matrix on all levels

Well-posedness: Unsmoothed Method

Smoothed Aggregation smoothed coarse level operator: problem: A cs is not a singular M-matrix (signs wrong) solution: lumping approach on S in

Smoothed Aggregation we want as little lumping as possible only lump offending elements (i,j): (we consider both off-diagonal signs and reducibility here!) for offending elements (i,j), add S {i,j} to S: conserves both row and column sums

9. Lumped Smoothed Method is Well-posed