Lecture 2: Geometric Graphs, Predictive Graphs and Spectral Analysis

Similar documents
Lecture 2: From Structured Data to Graphs and Spectral Analysis

REGULAR GRAPHS OF GIVEN GIRTH. Contents

Lecture 11: Clustering and the Spectral Partitioning Algorithm A note on randomized algorithm, Unbiased estimates

Testing Isomorphism of Strongly Regular Graphs

Algebraic Graph Theory- Adjacency Matrix and Spectrum

CSCI5070 Advanced Topics in Social Computing

Graph drawing in spectral layout

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1

Chapter 2. Spectral Graph Drawing

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Big Data Analytics. Special Topics for Computer Science CSE CSE Feb 11

Introduction to Graph Theory

Assessing and Safeguarding Network Resilience to Nodal Attacks

Spectral Clustering X I AO ZE N G + E L HA M TA BA S SI CS E CL A S S P R ESENTATION MA RCH 1 6,

Scalable Clustering of Signed Networks Using Balance Normalized Cut

Math 778S Spectral Graph Theory Handout #2: Basic graph theory

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G))

Mathematics of Networks II

Stanford University CS359G: Graph Partitioning and Expanders Handout 18 Luca Trevisan March 3, 2011

Basics of Graph Theory

MAT 280: Laplacian Eigenfunctions: Theory, Applications, and Computations Lecture 18: Introduction to Spectral Graph Theory I. Basics of Graph Theory

( ) =cov X Y = W PRINCIPAL COMPONENT ANALYSIS. Eigenvectors of the covariance matrix are the principal components

Application of Spectral Clustering Algorithm

Maximizing edge-ratio is NP-complete

Random Generation of the Social Network with Several Communities

Machine Learning for Data Science (CS4786) Lecture 11

Ma/CS 6b Class 26: Art Galleries and Politicians

ON THE STRONGLY REGULAR GRAPH OF PARAMETERS

Lecture 5: Graphs. Rajat Mittal. IIT Kanpur

Randomized rounding of semidefinite programs and primal-dual method for integer linear programming. Reza Moosavi Dr. Saeedeh Parsaeefard Dec.

Lecture 3: Descriptive Statistics

Graph similarity. Laura Zager and George Verghese EECS, MIT. March 2005

Algebraic Graph Theory

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization. Author: Martin Jaggi Presenter: Zhongxing Peng

Assignment 1 Introduction to Graph Theory CO342

Spectral Clustering on Handwritten Digits Database

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Spectral Clustering. Presented by Eldad Rubinstein Based on a Tutorial by Ulrike von Luxburg TAU Big Data Processing Seminar December 14, 2014

Conic Duality. yyye

Mining Social Network Graphs

CS388C: Combinatorics and Graph Theory

TELCOM2125: Network Science and Analysis

Local Fiedler Vector Centrality for Detection of Deep and Overlapping Communities in Networks

Notes for Lecture 24

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection.

CS 140: Sparse Matrix-Vector Multiplication and Graph Partitioning

Extremal Graph Theory: Turán s Theorem

Design of Orthogonal Graph Wavelet Filter Banks

Extremal Graph Theory. Ajit A. Diwan Department of Computer Science and Engineering, I. I. T. Bombay.

Extracting Information from Complex Networks

RELATIVELY OPTIMAL CONTROL: THE STATIC SOLUTION

Signal Processing for Big Data

Random Simplicial Complexes

AMS /672: Graph Theory Homework Problems - Week V. Problems to be handed in on Wednesday, March 2: 6, 8, 9, 11, 12.

8. The Postman Problems

My favorite application using eigenvalues: partitioning and community detection in social networks

Lecture 9. Semidefinite programming is linear programming where variables are entries in a positive semidefinite matrix.

Lecture 6: Unsupervised Machine Learning Dagmar Gromann International Center For Computational Logic

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search

Introduction aux Systèmes Collaboratifs Multi-Agents

ON INDEX EXPECTATION AND CURVATURE FOR NETWORKS

Clustering in Networks

Ma/CS 6b Class 13: Counting Spanning Trees

6.854J / J Advanced Algorithms Fall 2008

Bounds on graphs with high girth and high chromatic number

On the null space of a Colin de Verdière matrix

Graphs, Zeta Functions, Diameters, and Cospectrality

Adaptive Filters Algorithms (Part 2)

Graph Theory S 1 I 2 I 1 S 2 I 1 I 2

LOW-DENSITY PARITY-CHECK (LDPC) codes [1] can

Mathematical Programming and Research Methods (Part II)

Optimal Region for Binary Search Tree, Rotation and Polytope

6. Lecture notes on matroid intersection

Social-Network Graphs

Non-exhaustive, Overlapping k-means

Solving problems on graph algorithms

Algorithms on Minimizing the Maximum Sensor Movement for Barrier Coverage of a Linear Domain

Introduction to Engineering Systems, ESD.00. Networks. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow

The clustering in general is the task of grouping a set of objects in such a way that objects

On the Balanced Case of the Brualdi-Shen Conjecture on 4-Cycle Decompositions of Eulerian Bipartite Tournaments

Lecture 2 September 3

Modularity CMSC 858L

HW Graph Theory SOLUTIONS (hbovik) - Q

Introduc1on to Computa1onal Manifolds and Applica1ons

1. Lecture notes on bipartite matching

Cospectral digraphs from locally line digraphs

Kernighan/Lin - Preliminary Definitions. Comments on Kernighan/Lin Algorithm. Partitioning Without Nodal Coordinates Kernighan/Lin

On the relationship between the algebraic connectivity and graph s robustness to node and link failures

Counting the number of spanning tree. Pied Piper Department of Computer Science and Engineering Shanghai Jiao Tong University

Zhibin Huang 07. Juni Zufällige Graphen

04 - Normal Estimation, Curves

6 Randomized rounding of semidefinite programs

ON THE SUM OF THE SQUARES OF ALL DISTANCES IN SOME GRAPHS

Introduction to spectral clustering

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Clustering. SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic

Spectral Methods for Network Community Detection and Graph Partitioning

Lecture 4: Primal Dual Matching Algorithm and Non-Bipartite Matching. 1 Primal/Dual Algorithm for weighted matchings in Bipartite Graphs

CS256 Applied Theory of Computation

Transcription:

Lecture 2: Geometric Graphs, Predictive Graphs and Spectral Analysis Radu Balan February 8, 2018

Data Graphs Today we discuss construction of dynamical data graphs and spectral analysis. The overarching problem is the following: Main Problem Given a graph, discover if it can be explained as a structured data graph, or just as a random graph.

Data Graphs Today we discuss construction of dynamical data graphs and spectral analysis. The overarching problem is the following: Main Problem Given a graph, discover if it can be explained as a structured data graph, or just as a random graph. We shall discuss first how to construct a sequence of nested graphs from a data set. Two types of data: 1 Percolation models/geometric graphs 2 Weighted graphs

Percolation Models. Geometric graphs Fix a set of points in R d. Example, for d = 2: n = 10 2 = 100 Uniform (regular) lattice.

Percolation Models. Geometric graphs Fix a set of points in R d. Example, for d = 2: n = 10 2 = 100 Nonuniform (irregular) lattice.

Percolation Models. Geometric graphs Fix a set of points in R d. Example, for d = 2: n = 10 2 = 100 Nonuniform (irregular) lattice. Created by random perturbation of the regular lattice.

Percolation Models. Geometric graphs Construct the matrix of pairwise distances: ( ) V = r k r j, r k = (x k, y k ). 1 k,j n

Percolation Models. Geometric graphs Construct the matrix of pairwise distances: ( ) V = r k r j, r k = (x k, y k ). 1 k,j n Then sort the set of distances in an ascending order. This way we define an order on the set of pairs of points. Implicitly this defines an ascending order on the set of edges. We obtain a sequence of nested graphs (G t ) t 0 0 t m = n(n 1)/2 where t indicates the number of edges in the graph G t. Thus G t has n nodes and t edges.

Percolation Models. Geometric graphs Play Examples: n = 100, regular/irregular, different types of norms: r k r j 2 = (x k x j ) 2 + (y k y j ) 2 r k r j = max( x k x j, y k y j )

Weighted Graphs. Predictive Graphs The sequence of nested graphs is obtained by sort the edges according to their weights: start with the largest weight first, and then pick the next largest weight, and so on.

Weighted Graphs. Predictive Graphs The sequence of nested graphs is obtained by sort the edges according to their weights: start with the largest weight first, and then pick the next largest weight, and so on. On the Bernard& Killworth office data from: http://vlado.fmf.unilj.si/pub/networks/data/ucinet/ucidata Generate random 40 points in the unit square.

Weighted Graphs. Predictive Graphs The sequence of nested graphs is obtained by sort the edges according to their weights: start with the largest weight first, and then pick the next largest weight, and so on. On the Bernard& Killworth office data from: http://vlado.fmf.unilj.si/pub/networks/data/ucinet/ucidata Generate random 40 points in the unit square. See movie: bkoff movie.mp4

Data Graphs Data Size Size matters:

Dynamical Data Graphs Data Graphs Data Size Size matters:

Dynamical Data Graphs Data Graphs Data Size Size matters: By Paul Signac - Ophelia2, Public Domain, https://commons.wikimedia.org/w/index.php?curid=12570159 (L Hirondelle Steamer on the Seine)

Data Graphs Public Datasets On Canvas you can find links to several public databases: 1 Duke: https://dnac.ssri.duke.edu/datasets.php 2 Stanford: https://snap.stanford.edu/data/ 3 Uni. Koblenz: http://konect.uni-koblenz.de/ 4 M. Newman (U. Michigan): http://www-personal.umich.edu/ mejn/netdata/ 5 A.L. Barabasi (U. Notre Dame): http://www3.nd.edu/ networks/resources.htm 6 UCI: https://networkdata.ics.uci.edu/resources.php 7 Google/YouTube: https://research.google.com/youtube8m/

Spectral Analysis Basic Properties Last time we learned how to construct: the Adjacency matrix A, the Degree matrix D, the (unnormalized symmetric) graph Laplacian matrix = D A, the normalized Laplacian matrix = D 1/2 D 1/2, and the normalized asymmetric Laplacian matrix L = D 1.

Spectral Analysis Basic Properties Last time we learned how to construct: the Adjacency matrix A, the Degree matrix D, the (unnormalized symmetric) graph Laplacian matrix = D A, the normalized Laplacian matrix = D 1/2 D 1/2, and the normalized asymmetric Laplacian matrix L = D 1. We denote: n the number of vertices (also known as the size of the graph), m the number of edges, d(v) the degree of vertex v, d(i, j) the distance between vertex i and vertex j (length of the shortest path connecting i to j), and by D the diameter of the graph (the largest distance between two vertices = longest shortest path ).

Spectral Analysis Basic Properties In this section we summarize spectral properties of the Laplacian matrices. Theorem 1 = T 0, = T 0 are positive semidefinite matrices. 2 eigs( ) = eigs(l) [0, 2]. 3 0 is always an eigenvalue of,, L with same multiplicity. Its multiplicity is equal to the number of connected components of the graph. 4 λ max ( ) 2 max v d(v), i.e. the lagest eigenvalue of is bounded by twice the largest degree of the graph.

Spectral Analysis Basic Properties Theorem Let 0 = λ 0 λ 1 λ n 1 2 be the eigenvalues of (or L), that is eigs( ) = {λ 0, λ 1,, λ n 1 } = eigs(l). Then: n 1 1 i=0 λ i n. 2 n 1 i=0 λ i = n #isolated vertices. 3 λ 1 n n 1. 4 λ 1 = n n 1 if and only if the graph is complete (i.e. any two vertices are connected by an edge). 5 If the graph is not complete then λ 1 1. 6 If the graph is connected then λ 1 > 0. If λ i = 0 and λ i+1 0 then the graph has exactly i + 1 connected components. 7 If the graph is connected (no isolated vertices) then λ n 1 n n 1.

Spectral Analysis Smallest nonnegative eigenvalue Theorem Assume the graph is connected. Thus λ 1 > 0. Denote by D its diameter and by d max, d, d H the maximum, average, and harmonic avergae of the degrees (d 1,, d n ): d max = max d j, d = 1 j n n d j, j=1 1 d H = 1 n n j=1 1 d j. Then 1 λ 1 1 nd. 2 Let µ = max 1 j n 1 1 λ j. Then 1 + (n 1)µ 2 n d H (1 (1 + µ)( d d H 1)).

Spectral Analysis Smallest nonnegative eigenvalue Theorem [continued] 3 Assume D 4. Then dmax 1 λ 1 1 2 (1 2 d max D ) + 2 D.

Spectral Analysis Comments on the proof Ingredients and key relations: 1. Let f = (f 1, f 2,, f n ) R n be a n-vector. Then: f, f = (f x f y ) 2 x y where x y if there is an edge between vertex x and vertex y (i.e. A x,y = 1). This proves positivity of all operators. 2. Last time we showed eigs( ) = eigs(l) because and L are similar matrices. 3. 0 is an eigenvalue for with eigenvector 1 = (1, 1,, 1). If multiple connected components, define such a 1 vector for each component (and 0 on rest). 4. λ max ( ) = 1 λ min (D 1/2 AD 1/2 ) 1 + λ min (D 1/2 AD 1/2 ).

Spectral Analysis Comments on the proof - 2 λ max (D 1/2 AD 1/2 ) = max f =1 D 1/2 AD 1/2 f, f = max f =1 λ min (D 1/2 AD 1/2 ) = min f =1 D 1/2 AD 1/2 f, f λ min,max (D 1/2 AD 1/2 ) max D 1/2 AD 1/2 f, f = max f =1 f =1 Next use Cauchy-Schwartz to get f i f A i,j j di dj f 2 i A i,j = i,j d i i j i Thus λ max ( ) 2. Similarly λ max ( ) 2(max i d i ). i,j i,j f i A i,j di f i A i,j di f 2 i = f 2 = 1. f j dj f j dj

Spectral Analysis Comments on the proof - 2 λ max (D 1/2 AD 1/2 ) = max f =1 D 1/2 AD 1/2 f, f = max f =1 λ min (D 1/2 AD 1/2 ) = min f =1 D 1/2 AD 1/2 f, f λ min,max (D 1/2 AD 1/2 ) max D 1/2 AD 1/2 f, f = max f =1 f =1 Next use Cauchy-Schwartz to get f i f A i,j j di dj f 2 i A i,j = i,j d i i j i Thus λ max ( ) 2. Similarly λ max ( ) 2(max i d i ). i,j i,j f i A i,j di f i A i,j di f 2 i = f 2 = 1. 5. If the graph is connected, trace( ) = n = n 1 i=0 λ i. Since λ 0 = 0 we f j dj f j dj

Spectral Analysis Comments on the proof - 2 λ max (D 1/2 AD 1/2 ) = max f =1 D 1/2 AD 1/2 f, f = max f =1 λ min (D 1/2 AD 1/2 ) = min f =1 D 1/2 AD 1/2 f, f λ min,max (D 1/2 AD 1/2 ) max D 1/2 AD 1/2 f, f = max f =1 f =1 Next use Cauchy-Schwartz to get f i f A i,j j di dj f 2 i A i,j = i,j d i i j i Thus λ max ( ) 2. Similarly λ max ( ) 2(max i d i ). i,j i,j f i A i,j di f i A i,j di f 2 i = f 2 = 1. 5. If the graph is connected, trace( ) = n = n 1 i=0 λ i. Since λ 0 = 0 we f j dj f j dj

Spectral Analysis Special graphs: Cycles and Complete graphs Cycle graphs: like a regular polygon. Remark: Adjacency matrices are circulant, and so are, = L.

Spectral Analysis Special graphs: Cycles and Complete graphs Cycle graphs: like a regular polygon. Remark: Adjacency matrices are circulant, and so are, = L. Then argue the FFT forms a ONB of eigenvectors. Compute the eigenvalues as FFT of the generating sequence.

Spectral Analysis Special graphs: Cycles and Complete graphs Cycle graphs: like a regular polygon. Remark: Adjacency matrices are circulant, and so are, = L. Then argue the FFT forms a ONB of eigenvectors. Compute the eigenvalues as FFT of the generating sequence. Consequence: The normalized Laplacian has the following eigenvalues: 1 For cycle graph on n vertices: λ k = 1 cos 2πk n, 0 k n 1. 2 For the complete graph on n vertices: λ 0 = 0, λ 1 = = λ n 1 = n n 1.

References B. Bollobás, Graph Theory. An Introductory Course, Springer-Verlag 1979. 99(25), 15879 15882 (2002). F. Chung, Spectral Graph Theory, AMS 1997. F. Chung, L. Lu, The average distances in random graphs with given expected degrees, Proc. Nat.Acad.Sci. 2002. R. Diestel, Graph Theory, 3rd Edition, Springer-Verlag 2005. P. Erdös, A. Rényi, On The Evolution of Random Graphs G. Grimmett, Probability on Graphs. Random Processes on Graphs and Lattices, Cambridge Press 2010. J. Leskovec, J. Kleinberg, C. Faloutsos, Graph Evolution: Densification and Shrinking Diameters, ACM Trans. on Knowl.Disc.Data, 1(1) 2007.