Models and Algorithms for Complex Networks

Similar documents
ENGI 4421 Counting Techniques for Probability Page Example 3.01 [Navidi Section 2.2; Devore Section 2.3]

CHAPTER TWO LANGUAGES. Dr Zalmiyah Zakaria

Associative Operations on a Three-Element Set

Compiler Construction

On the Parikh-de-Bruijn grid

Sequential Algorithms for Generating Random Graphs. Amin Saberi Stanford University

Euclid. Father of Geometry Euclidean Geometry Euclid s Elements

!! " # $%! "! &' $ (!!

Lec-5-HW-1, TM basics

Tilings of the Sphere by Edge Congruent Pentagons

Definitions. You can represent a point by a dot and name it by a capital letter.

Chapter 4: Regular Expressions

On Realizations of a Joint Degree Matrix

Counting Faces of Polytopes

Automata Theory TEST 1 Answers Max points: 156 Grade basis: 150 Median grade: 81%

Data Compression Fundamentals

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Talen en Compilers. Johan Jeuring , period 2. January 17, Department of Information and Computing Sciences Utrecht University

Chapter 13, Sequence Data Mining

Coloring Random Graphs

Bayesian Networks and Decision Graphs

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

Regular Expressions. Lecture 10 Sections Robb T. Koether. Hampden-Sydney College. Wed, Sep 14, 2016

Overlay (and P2P) Networks

Theory of Computations III. WMU CS-6800 Spring -2014

ENGINEERS ACADEMY. 7. Given Boolean theorem. (a) A B A C B C A B A C. (b) AB AC BC AB BC. (c) AB AC BC A B A C B C.

Don t Let Uncle Sam Pick Your Pocket Again. Plan Now for next year. Call for a free half-hour consultation.

Peer-to-peer clustering

Lecture 2: From Structured Data to Graphs and Spectral Analysis

An Empirical Analysis of Communities in Real-World Networks

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

ITEC2620 Introduction to Data Structures

2.6 BOOLEAN FUNCTIONS

arxiv: v1 [math.co] 19 Nov 2017

Constraint Tightness and Looseness versus Local and Global Consistency

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Power Set of a set and Relations

Topics on Computing and Mathematical Sciences I Graph Theory (3) Trees and Matchings I

Cooperative Computing for Autonomous Data Centers

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Math 170- Graph Theory Notes

Lesson 18: Counting Problems

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research

Properties of Biological Networks

Matching Algorithms. Proof. If a bipartite graph has a perfect matching, then it is easy to see that the right hand side is a necessary condition.

CS388C: Combinatorics and Graph Theory

Architecture of Compilers, Interpreters. CMSC 330: Organization of Programming Languages. Front End Scanner and Parser. Implementing the Front End

csci 210: Data Structures Stacks and Queues in Solution Searching

Rumour spreading in the spatial preferential attachment model

CHAPTER-2 STRUCTURE OF BOOLEAN FUNCTION USING GATES, K-Map and Quine-McCluskey

Digital Credential Analysis (Mathematical Analysis) May 21 st 2017 Douglas Berdeaux,

Some Comments on Leaving Certificate Geometry Work in Progress

Section 1.7 Sequences, Summations Cardinality of Infinite Sets

Unit-IV Boolean Algebra

JNTUWORLD. Code No: R

CS402 Theory of Automata Solved Subjective From Midterm Papers. MIDTERM SPRING 2012 CS402 Theory of Automata

Gujarat Technological University Sankalchand Patel College of Engineering, Visnagar B.E. Semester VII (CE) July-Nov Compiler Design (170701)

Graph Algorithms. Many problems in networks can be modeled as graph problems.

Clustering: Classic Methods and Modern Views

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

Chapter Seven: Regular Expressions

X Y Z F=X+Y+Z

POINT SET TOPOLOGY. Introduction

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Counting Words Using Hashing

Section 2.4 Sequences and Summations

arxiv: v1 [physics.soc-ph] 22 Oct 2018

The Markov Chain Simulation Method for Generating Connected Power Law Random Graphs

QUESTION BANK. Unit 1. Introduction to Finite Automata

Chapter 18 out of 37 from Discrete Mathematics for Neophytes: Number Theory, Probability, Algorithms, and Other Stuff by J. M. Cargal.

Automated Conjecturing for Proof Discovery

CMSC 330: Organization of Programming Languages. Architecture of Compilers, Interpreters

Conway s Tiling Groups

arxiv: v1 [math.co] 30 Nov 2015

Similarity Ranking in Large- Scale Bipartite Graphs

Computer Science. Unit-4: Introduction to Boolean Algebra

Random Generation of the Social Network with Several Communities

A Particular Type of Non-associative Algebras and Graph Theory

Situation 1: Congruent Triangles vs. Similar Triangles

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling

R.M.D. ENGINEERING COLLEGE R.S.M. Nagar, Kavaraipettai

Regular Expressions. Chapter 6

no triangle can have more than one right angle or obtuse angle.

An Optimal Allocation Approach to Influence Maximization Problem on Modular Social Network. Tianyu Cao, Xindong Wu, Song Wang, Xiaohua Hu

Glynda, the good witch of the North

CIS-331 Exam 2 Fall 2015 Total of 105 Points Version 1

Graph Theory and Network Measurment

Characteristics of Preferentially Attached Network Grown from. Small World

Algorithm and Complexity of Disjointed Connected Dominating Set Problem on Trees

Discrete Mathematics Introduction

Sub-Graph Detection Theory

Lecture 2: Geometric Graphs, Predictive Graphs and Spectral Analysis

SEARCHING WITH MULTIPLE RANDOM WALK QUERIES

1. Which of the following regular expressions over {0, 1} denotes the set of all strings not containing 100 as a sub-string?

Understanding Disconnection and Stabilization of Chord

Dartmouth College Computer Science 10, Fall 2014 Solution to Final Exam A

CONSTRUCTIONS Introduction Division of a Line Segment

Networks and Algebraic Statistics

An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based

Transcription:

Models and Algorithms for Complex Networks with network with parametrization, elements maintaining typically characteristic local profiles with categorical attributes [C. Faloutsos MMDS8] Milena Mihail Georgia Tech. with Stephen Young, Gagan Goel, Giorgos Amanatidis, Bradley Green, Christos Gkantsidis, Amin Saberi, D. Bader, T. Feder, C. Papadimitriou, P. Tetali, E. Zegura

Talk Outline Flexible (further parametrized) Models Complex Networks in Web N.. Structural/Syntactic Flexible Models 2. Semantic Flexible Models Models & Algorithms Connection : Kleinberg s Model(s) for Navigation Distributed Searching Algorithms with Additional Local Info/Dynamics. On the Power of Local Replication 2. On the Power of Topology Awareness via Link Criticality Conclusion : Web N. Model & Algorithm characteristics: further parametrization, typically local, locality of info in algorithms & dynamics. Dynamics become especially important. 2

frequency, but no sharp concentration: Erdos-Renyi 2 4 degree Sparse graphs with large degree-variance. Power-law degree distributions. Small-world, i.e. small diameter, high clustering coefficients. scaling Web N. The Internet is constantly growing and evolving giving rise to new models and algorithmic questions. 3

frequency However, in practice, there are discrepancies Random Graph with same degrees, but no sharp concentration: Erdos-Renyi Global Routing Flickr Network (Autonomous Network Systems) Friendship Local Routing Network Network with same Degrees 2 4 degree Sparse graphs with large degree-variance. Power-law degree distributions. Patent Collaboration Network (in Boston) Small-world, i.e. small diameter, high clustering coefficients. A rich theory of power-law random graphs has been developed [ Evolutionary, Configurational Models, & e.g. see Rick Durrett s 7 book ]. 4

Flexible models for complex networks: exhibit a large increase in the properties of generated graphs by introducing a small extension in the parameters of the generating model. 5

Case : Structural/Syntactic Flexible Model Models with power law and arbitrary degree sequences Modifications and Generalizations with additional constraints, of Erdos-Gallai / Havel-Hakimi such as specified joint degree distributions (from random graphs, to graphs with very low entropy). Assortativity: small large The networking community proposed that [Sigcomm 4, CCR 6 and Sigcomm 6], beyond the degree sequence, models for networks of routers should capture how many nodes of degree are connected to nodes of degree. 6

Networking Proposition [CCR 6, Sigcomm 6]: A real highly optimized network G. A random graph with same average degree as G. A graph with same number of links A random graph with same between nodes of degree and as G. degree sequence as G. 7

The Joint-Degree Matrix Realization Problem is: connected, mincost, random Definitions The (well studied) Degree Sequence Realization Problem is: connected, mincost, random 8

The Joint-Degree Matrix Realization Problem is: connected, mincost, random Theorem [Amanatidis, Green, M 8]: The natural necessary conditions for an instance to have a realization are also sufficient (and have a short description). The natural necessary conditions for an instance to have a connected realization are also sufficient (no known short description). There are polynomial time algorithms to construct a realization and a connected realization of, or produce a certificate that such a realization does not exist. 9

Advantages: Degree Sequence Flexibility Realization in: Problem: enforcing Given arbitrary or precluding certain edges, connected, adding Is this costs degree on edges sequence and finding realizable mincost? realizations, mincost, close to matching close to sampling/random generation. random If so, construct a realization. Reduction to perfect matching: 2

Theorem [Erdos-Gallai]: A degree sequence iff the natural necessary condition holds: moreover, there is a connected realization iff the natural necessary condition holds: is realizable 2 3 4 2 3 2 2 [ Havel-Hakimi ] Construction: Greedy: any unsatisfied vertex is connected with the vertices of highest remaining degree requirements. Connectivity, if possible, attained with 2-switches. Note :all 2-switches are legal.

Theorem, Joint Degree Matrix Realization [Amanatidis, Green, M 8]: Proof [sketch]: 2

Balanced Degree Invariant: Example Case Maintaining Balanced Degree Invariant: add delete add delete add Note: This may NOT be a simple augmenting path.

Theorem, Joint Degree Matrix Connected Realization [Amanatidis, Green, M 8]: Proof [remarks]: Main Difficulty: Two connected components are amenable to rewiring by 2-switches, only using two vertices of the same degree. The algorithm explores vertices of the same degree in different components, in a recursive manner. connected component connected component 4

Open Problems for Joint Degree Matrix Realization Construct mincost and/or random realization, or connected realization. Satisfy constraints between arbitrary subsets of vertices. Is there a reduction to matchings or flow or some other well understood combinatorial problem? Is there evidence of hardness? Is there a simple generative model for graphs with low assortativity? ( explanatory or other ) 5

Talk Outline Flexible (further parametrized) Models Complex Networks in Web N.. Structural/Syntactic Flexible Models 2. Semantic Flexible Models Models & Algorithms Connection : Kleinberg s Model(s) for Navigation Distributed Searching Algorithms with Additional Local Info/Dynamics. On the Power of Local Replication 2. On the Power of Topology Awareness via Link Criticality Conclusion : Web N. Model & Algorithm characteristics: further parametrization, typically local, locality of info in algorithms & dynamics. Dynamics become especially important. ODD

Case 2: Semantic Flexible Model(s) Generalizations of Erdos-Renyi random graphs Flickr Network Friendship Network Patent Collaboration Network (in Boston) Models with semantics on nodes, and links among nodes with semantic proximity generated by very general probability distributions. Varying structural characteristics Also densification, RANDOM shrinking DOT diameter, PRODUCT C. GRAPHS Faloutsos, MMDS8 KRONECKER GRAPHS 6

RANDOM DOT PRODUCT GRAPHS Kratzl,Nickel,Scheinerman 5 Young,Scheinerman 7 Young,M 8 7

unlikely never somewhat likely very likely 8

SUMMARY OF RESULTS A semi-closed formula for degree distribution Model can generate graphs with a wide variety of densities average degrees up to. and wide varieties of degree distributions, including power-laws. Diameter characterization : Determined by Erdos-Renyi for similar average density, if all coordinates of X are in (will say more about this). Positive clustering coefficient, depending on the distance of the generating distribution from the uniform distribution. Remark: Power-laws and the small world phenomenon are the hallmark of complex networks. 9

A Semi-closed Formula for Degree Distribution Theorem [Young, M 8] Theorem ( removing error terms) [Young, M 8] 2

Example: (a wide range of degrees, except for very large degrees) This is in agreement with real data. 2

Diameter Characterization Theorem [Young, M 8] Remark: If the graph can become disconnected. It is important to obtain characterizations of connectivity as approaches.this would enhance model flexibility 22

Clustering Characterization Theorem [Young, M 8] Remarks on the proof 23

Open Problems for Random Dot Product Graphs Fit real data, and isolate benchmark distributions. Characterize connectivity (diameter and conductance) as approaches. Do/which further properties of X characterize further properties of? Evolution: X as a function of n? (including: two connected vertices with small similarity, either disconnect, or increase their similarity). Should/can? Similarity functions beyond inner product (e.g. Kernel functions). Algorithms: navigability, information/virus propagation, etc. 24

KRONECKER GRAPHS [Faloutsos, Kleinberg,Leskovec 6] Another semantic flexible model, introducing parametrization. -bit vertex characterization 2-bit vertex characterization 3-bit vertex characterization 25

STOCHASTIC KRONECKER GRAPHS [ Faloutsos, Kleinberg, Leskovec 6] a b aa ab ba bb aaa aab aba abb baa bab bba bbb b c ab ac bb bc aab aac abb abc bab bac bbb bbc ba bb bb bc ca cd cb cc aba abb abb abc aca acd acb acc bba bbb bbb bbc bca bcd bcb bcc baa bab bba bbb caa cab cba cbb bab bac bbb bbc cab cac cbb cbc bba bbb bca bcb cba cbb cca ccb bbb bbc bcd bcc cbb cbc ccd ccc Several properties characterized (e.g. multinomial degree distributions, densification, shrinking diameter, self-similarity). Large scale data set have been fit efficiently! 26

Talk Outline Flexible (further parametrized) Models Complex Networks in Web N.. Structural/Syntactic Flexible Models 2. Semantic Flexible Models Models & Algorithms Connection : Kleinberg s Model(s) for Navigation Distributed Searching Algorithms with Additional Local Info/Dynamics. On the Power of Local Replication 2. On the Power of Topology Awareness via Link Criticality Conclusion : Web N. Model & Algorithm characteristics: Further Parametrization, Locality of Info & Dynamics. 27

Where it all started: Kleinberg s navigability model Moral: Parametrization? is essential in the study of complex networks Theorem [Kleinberg]: The only value for which the network is navigable is r =2. 29

Strategic Network Formation Process [Sandberg 5]: Experimentally, the resulting network has structure and navigability similar to Kleinberg s small world network. 3

Strategic Network Formation Process [ Green & M 8 ] simplification of [Clauset & Moore 3]: Experimentally, the resulting network becomes navigable after steps but does not have structure similar to Kleinberg s small world network. 3

Talk Outline Flexible (further parametrized) Models Complex Networks in Web N.. Structural/Syntactic Flexible Models 2. Semantic Flexible Models Models & Algorithms Connection : Kleinberg s Model(s) for Navigation Distributed Searching Algorithms with Additional Local Info/Dynamics. On the Power of Local Replication 2. On the Power of Topology Awareness via Link Criticality Conclusion : Web N. Model & Algorithm characteristics: further parametrization, typically local, locality of info in algorithms & dynamics. Dynamics become especially important. 3

. On the Power of Local Replication How do networks search (propagate information) : [ Gkantidis, M, Saberi, 4 5 ] Cost = queried nodes / found information 32

. On the Power of Local Replication [ Gkantidis, M, Saberi, 4 5 ] [M, Saberi, Tetali 5 ] Equivalent to one-step local replication of information. Theorem (extends to power-law random graphs): Proof : 33

2. On the Power of Topology Awareness via Link Criticality 34

2. On the Power of Topology Awareness via Link Criticality Via Semidefinite Programming Fastest Mixing Markov Chain Boyd,Diaconis,Xiao 4 How do social networks compute link criticality? Via Computation of Principal Eigenvector(s), Distributed, Asynchronous Gkantsidis, Goel, Mihail, Saberi 7 35

Link Criticality via Distributed Asynchronous Computation of Principal Eigenvector(s) [Gkantsidis,Goel,M,Saberi 7] + - + + + + -+ - - - + - - Start with Step: For all nodes Step: For links, asynchronously Hardest part: Numerical Stability. 36

Talk Outline Flexible (further parametrized) Models Complex Networks in Web N.. Structural/Syntactic Flexible Models 2. Semantic Flexible Models Models & Algorithms Connection : Kleinberg s Model(s) for Navigation Distributed Searching Algorithms with Additional Local Info/Dynamics. On the Power of Local Replication 2. On the Power of Topology Awareness via Link Criticality Conclusion : Web N. Model & Algorithm characteristics: further parametrization, typically local, locality of info in algorithms & dynamics. Dynamics become especially important. 37

Topology Maintenance = Connectivity & Good Conductance Theorem [Feder,Guetz,M,Saberi 6]:The Markov chain corresponding to a local 2-link switch is rapidly mixing if the degree sequence enforces diameter at least 3, and for some. 39