Dynamic Graph Query Support for SDN Management. Ramya Raghavendra IBM TJ Watson Research Center

Similar documents
Dynamic Graph Query Primitives for SDN-based Cloud Network Management

A Highly Efficient Runtime and Graph Library for Large Scale Graph Analytics

E6885 Network Science Lecture 10: Graph Database (II)

E6895 Advanced Big Data Analytics Lecture 4:

Oracle IaaS, a modern felhő infrastruktúra

Cross-Site Virtual Network Provisioning in Cloud and Fog Computing

Evolution of Data Center Security Automated Security for Today s Dynamic Data Centers

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Exploring Cloud Security, Operational Visibility & Elastic Datacenters. Kiran Mohandas Consulting Engineer

JStorm Based Network Analytics Platform. Alibaba Cloud Senior Technical Manager, Biao Lyu

Cloud Native Security. OpenShift Commons Briefing

lecture 18: network virtualization platform (NVP) 5590: software defined networking anduo wang, Temple University TTLMAN 401B, R 17:30-20:00

BROCADE CLOUD-OPTIMIZED NETWORKING: THE BLUEPRINT FOR THE SOFTWARE-DEFINED NETWORK

Lecture 10.1 A real SDN implementation: the Google B4 case. Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

The Next Opportunity in the Data Centre

The Software Driven Datacenter

Workload Mobility and Disaster Recovery to VMware Cloud IaaS Providers

STRATEGIC WHITE PAPER. Securing cloud environments with Nuage Networks VSP: Policy-based security automation and microsegmentation overview

Understanding Cloud Migration. Ruth Wilson, Data Center Services Executive

Cisco Nexus 1000V Switch for Microsoft Hyper-V

Virtual Security Gateway Overview

Cloud 3.0 and Software Defined Networking October 28, Amin Vahdat on behalf of Google Technical Infratructure Google Fellow

Lecture 7: Data Center Networks

PSOACI Why ACI: An overview and a customer (BBVA) perspective. Technology Officer DC EMEAR Cisco

Kahu Compute. Make Private Cloud Painless. Kahu Compute: Make Private Cloud Painless

MAKING THE CLOUD A SECURE EXTENSION OF YOUR DATACENTER

21CTL Disaster Recovery, Workload Mobility and Infrastructure as a Service Proposal. By Adeyemi Ademola E. Cloud Engineer

How CloudEndure Works

Securing VMware NSX-T J U N E 2018

CogniFit Technical Security Details

VMware Cloud Provider Platform

DEFINING SECURITY FOR TODAY S CLOUD ENVIRONMENTS. Security Without Compromise

Minimizing the Risks of OpenStack Adoption

Title DC Automation: It s a MARVEL!

Amazon AWS-Solution-Architect-Associate Exam

How CloudEndure Disaster Recovery Works

OpenStack Networking: Where to Next?

SPARTA: Scalable Per-Address RouTing Architecture

Quantum, network services for Openstack. Salvatore Orlando Openstack Quantum core developer

Next-Generation Cloud Platform

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

Best Practices for Validating the Performance of Data Center Infrastructure. Henry He Ixia

Connect and Transform Your Digital Business with IBM

Internet Technology. 15. Things we didn t get to talk about. Paul Krzyzanowski. Rutgers University. Spring Paul Krzyzanowski

Extending Enterprise Security to Multicloud and Public Cloud

Challenges for Data Driven Systems

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. reserved. Insert Information Protection Policy Classification from Slide 8

Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Journey to the Cloud Next Generation Infrastructure for the future workforce.

Software-Defined Networking (SDN) Overview

Virtualized Network Services SDN solution for service providers

Managing and Securing Computer Networks. Guy Leduc. Chapter 2: Software-Defined Networks (SDN) Chapter 2. Chapter goals:

Exam Name: VMware Certified Associate Network Virtualization

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Big Data Hadoop Course Content

SpagoBI and Talend jointly support Big Data scenarios

IBM Cloud for VMware Solutions NSX Edge Services Gateway Solution Architecture

Lenovo ThinkSystem NE Release Notes. For Lenovo Cloud Network Operating System 10.6

Distributed Graph Storage. Veronika Molnár, UZH

Cisco Extensible Network Controller

Xen and CloudStack. Ewan Mellor. Director, Engineering, Open-source Cloud Platforms Citrix Systems

TEN ESSENTIAL NETWORK VIRTUALIZATION DEFINITIONS

Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing

How to Configure a Hybrid WAN in Parallel to An Existing Traditional Wan Infrastructure

IBM SYSTEM POWER7. PowerVM. Jan Kristian Nielsen Erik Rex IBM Corporation

VMWARE SOLUTIONS AND THE DATACENTER. Fredric Linder

Transformation Through Innovation

Developing, Deploying and Managing Applications on the Cloud

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Network Behavior Analysis

OpenNebula on VMware: Cloud Reference Architecture

Self-driving Datacenter: Analytics

How CloudEndure Works

Private Cloud Database Consolidation Name, Title

How CloudEndure Disaster Recovery Works

End to End SLA for Enterprise Multi-Tenant Applications

Automated Netezza Migration to Big Data Open Source

DELL EMC VSCALE FABRIC

Introduction to Cloud Computing

EBOOK: VMware Cloud on AWS: Optimized for the Next-Generation Hybrid Cloud

IT HealthCheck Feature List

Top 40 Cloud Computing Interview Questions

Oracle GoldenGate for Big Data

Evolving To The Big Data Warehouse

Solution Overview Gigamon Visibility Platform for AWS

AWS Solution Architect (AWS SA)

OPEN CONTRAIL ARCHITECTURE GEORGIA TECH SDN EVENT

Cisco Nexus 1000V InterCloud

NGF0502 AWS Student Slides

Nuage Networks Product Architecture. White Paper

IBM dashdb Local. Using a software-defined environment in a private cloud to enable hybrid data warehousing. Evolving the data warehouse

Deployments and Network Topologies

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

About Intellipaat. About the Course. Why Take This Course?

Database Consolidation onto Private Cloud. Piotr Kołodziej, Oracle Polska

Modernize Your Backup and DR Using Actifio in AWS

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Transcription:

Dynamic Graph Query Support for SDN Management Ramya Raghavendra IBM TJ Watson Research Center

Roadmap SDN scenario 1: Cloud provisioning Management/Analytics primitives

Current Cloud Offerings Limited control of the network Requires integration of third-party solutions Limits the opportunity to migrate production applications introduction of cloud networking functions Subnets and ACLs e.g., VPC enhancements Network monitoring e.g., CloudWatch VPN to the enterprise e.g., Virt Private Cloud Server load balancing e.g., Elastic Load Balancing persistent connectivity for services e.g., elastic IP base IP connectivity Third-party virtual appliances Examples of Missing Features No ability to create VLANs in the cloud No facility to manage bandwidth or QoS Limited ability to craft network segments No intelligence for dynamically structured networks reference: http://broadcast.oreilly.com/2010/12/ cloud-2011-the-year-of-the-network-in-the-cloud.html 3

Network as a Service Model High-level VM and network description virtual switches, hypervisors Network devices, switches, routers, firewalls, setup virtual switch on host Device state OpenFlow, SNMP, BladeOS etc Monitoring, notifications Cloud Controller Network controller network description Provide rich set of services isolation, custom addressing, service differentiation etc. Interact with variety of network devices support multiple cloud platforms, device management protocols Introduces a network controller network-aware VM placement, QoS support, real-time monitoring, diagnostics, management, security etc. * CloudNaaS: A Cloud Networking Platform for Enterprise Applications, Symposium on Cloud Computing, 2011

Cloud Networking-as-a-Service Cloud controller Provides base IaaS service for managing VM instances and images Self-service provisioning UI Connects VMs via host virtual switches self-service UI Network specification Cloud controller Network controller application Network controller Provides VM placement directives to cloud controller Generates virtual network between VMs Configures physical and virtual switches application middleware OS VM middleware OS VM virtual network application middleware OS VM 5

Prototype Cloud Controller: OpenNebula 1.4 Modified to accept user-specified network policies Modified to accept placement decisions from Network Controller Network Controller: NOX and OpenFlow-enabled switches Network controller implemented as a C++ NOX application (~2500 LOC) HP Procurve 5400 switches w/ OpenFlow 1.0 firmware VM2 VM4 Network Controller OpenNebula Cloud Controller HOST5 VM8 HOST1 VM1 VM5 SWITCH 2 SWITCH 3 SWITCH 5 VM3 HOST3 HOST2 SWITCH 1 SWITCH 4 HOST4

Roadmap SDN scenario 1: Cloud provisioning Management/Analytics primitives

Graph Queries for Network Management Support for efficient queries on data structure containing time-varying graph, e.g., shortest path between two nodes Utilized in various network management operations 9 24 14 15 5 20 30 18 44 2 11 16 6 19 6 Network graphs can represent: physical network elements such as routers, switches, and servers virtual elements such as VMs and virtual switches logical elements such as people, processes, web pages, etc. and links between them Algorithm support: shortest path spanning tree min flow

Cloud DCN Graphs Are Dynamic Cloud DCNs may experience frequent changes Deployment of new VMs, removal of old ones Migration of VMs Layer 2/3 network reconfiguration Graph updates edge/vertex insertion & deletion edge weight change (e.g. congestion level) Customer: this is what my workloads look like Monitoring, notifications High-level VM & network description Cloud controller Network controller High-level network description & commands OpenFlow 9

NetGraph Software Architecture Cloud Controller shortest path spanning tree reachability bipartite matching n/w aware placement Load balancing Broadcast SDN-enabled Network Controller Service Modules Isolation Routing Path computation OpenFlow SNMP QoS Monitoring Topology graph queries physical & virtual network graph, updates NetGraph shortest paths, spanning tree, transitiveclosure Physical and virtual network infrastructure * NetGraph: Dynamic Graph Query Primitives for SDN-based Cloud Network Management, HotSDN, Sigcomm 2012

System G Tools v1.0 Architecture Visualization Huge Network Visualization Network Propagation I2 3D Network Visualization Geo Network Visualization Graphical Model Visualization Analytics Communities Graph Search Network Info Flow Bayesian Networks Centralities Graph Query Shortest Paths Latent Net Inference Ego Net Features Graph Matching Graph Sampling Markov Networks Middleware Database System G Assets Open Source IBM Product Hardware BigInsights Hadoop Pthreads PERCS Coh. Clus. GBase (update, scan, operators, indexing)) HBase HDFS Shared Memory Run Time Library Graph Processing Interface Graphs RDMA Native Store Distr. Memory RT Library MPI Cluster (BladeCenter, BlueGene) Graph Data Interface Netezza DB2 RDF DB2 Graphs FPGA/ HMC Infosphere Streams (ISS) TinkerPop Compliant DBs

Graph Workload Types Type 1: Computations on graph structures / topologies Bayesian network to Junction tree Example converting Bayesian network into junction tree, graph traversal (BFS/DFS), etc. Characteristics Poor locality, irregular memory access, limited numeric operations 1 2 Type 2: Computations on graphs with rich properties Example Belief propagation: diffuse information through a graph using statistical models Characteristics 3 4 5 6 11 Locality and memory access pattern depend on vertex models Typically a lot of numeric operations Hybrid workload 7 10 8 9 1 2 3 4 5 6 11 7 10 8 9 1 2 3 4 5 6 11 7 10 8 9 6,4, 5 3,1, 2 5,3, 4 11,5,6 7,3, 5 10,7 9.8 8,7 7,3,5 9.8 3,1,2 5,3,4 8,7 6,4,5 10,7 11,5, 6 Type 3: Computations on dynamic graphs Example streaming graph clustering, incremental k-core, etc. Characteristics Poor locality, irregular memory access Operations to update a model (e.g., cluster, sub-graph) Hybrid workload 3-core subgraph

13 System G Analytics Overview System G is a comprehensive set of graph analytics libraries for IBM Big Data Analytics Communities Graph Search Network Info Flow Bayesian Networks Centralities Graph Query Shortest Paths Latent Net Inference Ego Net Features Graph Matching Graph Sampling Markov Networks Analytics target at big graphs Create HBase coprocessors to provide scalable graph analytics by distributing data and computation in a balanced way based on graph topology. They outperform MapReduce-based approaches by 2.8 times. Exploit RDMA and other hardware-based optimization for CPU intensive analytics to achieve high CPU utilization, low IO and good speed up Analytics target at dynamic graphs A major differentiator compared to competitors like GraphLab (dynamic graph clustering coefficients analytics on System G is up to 21 times faster than GraphLab) Include incremental K core, evolution aware clustering, and streaming graph clustering coefficients Exploit IBM Infosphere Stream framework

Analytics Characteristics Analytics Exact or Approx. Graph size and dynamics K-core Exact Tested against > 200M edge graph K-neighborhood Exact Scale free K shortest path Exact Scale free Connected component Exact Tested against > 70M edge graph Pagerank Exact Tested against > 70M edge graph Graph search & recommendation Exact Scale free Probabilistic Inference in Bayesian Networks Approx. Constrained by physical memory XRIME scalable community detection Approx. Tested against > 5M edge graph Snazzy community detection Exact Tested against > 1M edge graph Dynamic subgraph matching Exact Tested against > 150K edge graph. Change up to 30% within 10 sec Incremental streaming K-core Exact Tested against > 16M edge graph Streaming graph clustering Approx. Tested against 400K updates/sec Streaming graph clustering coefficient * Cloud Service Placement via Subgraph Matching, ICDE 2014 14 Exact Tested against > 1.8B edge graph, up to 24M updates/sec

System G Graph Data Interface API 15 Existing Implementations: GBase HBase implementation of Graph Native Store C++ Graph Store, in memory and on disc DB2RDF Tuned for SPARQL + RDF Non-IBM Graph DBs: TinkerGraph in memory graph Neo4J Titan (on HBase, BerkeleyDB, Cassandra) More

Sample queries What is the average hourly RTT at a router? When RTT is high, are the k hop links experiencing high RTT? What is the current shortest path between two network endpoints? Is the direct link more expensive than the overlay link?

Ongoing Work Graph + Time series Queries Graph store coupled with (distributed) time series support What was the shortest path at peak hour yesterday? How do routes vary on a daily basis? Graph + Location Queries RCA based on client locations, or service locations

Thank You! Ramya Raghavendra IBM Research rraghav@us.ibm.com