Cloud + Big Data Putting it all Together

Similar documents
VMware Cloud Application Platform

Transform to Your Cloud

VMWARE SOLUTIONS AND THE DATACENTER. Fredric Linder

The Latest EMC s announcements

Enabling Your Cloud with VMware. Rob Rowe Jason Kuipers

Orchestrating the Cloud Infrastructure using Cisco Intelligent Automation for Cloud

VMware vcloud Director for Service Providers

The Software Driven Datacenter

IT Infrastructure: Poised for Change

EMC Hybrid Cloud. Umair Riaz - vspecialist

Dedicated Hosted Cloud with vcloud Director

Redefine: Enterprise Hybrid Cloud

OPTIMIZING CLOUD DEPLOYMENT OF VIRTUALIZED APPLICATIONS ON EMC SYMMETRIX VMAX CLOUD EDITION

DEPLOYING A VMWARE VCLOUD DIRECTOR INFRASTRUCTURE-AS-A-SERVICE (IAAS) SOLUTION WITH VMWARE CLOUD FOUNDATION : ARCHITECTURAL GUIDELINES

IBM Cloud for VMware Solutions

The Virtualisation Security Journey: Beyond Endpoint Security with VMware and Symantec

AirSembly. vcloud Director Management Platform

Automating the Software-Defined Data Center with vcloud Automation Center

Remove complexity in protecting your virtual infrastructure with. IBM Spectrum Protect Plus. Data availability made easy. Overview

Cisco Cloud Strategy. Uwe Müller. Leader PreSales Cloud & Datacenter Germany

Design and Architecture. Derek Collison

Agenda. This Session: Azure Networking Basics, On-prem connectivity options DEMO Create VNET/Gateway Cost-estimation for VNET/Gateways

Data Center and Cloud Automation

Vmware.Test-inside.VCAC510.v by.Luger.97q

Soluzioni integrate con vsphere La virtualizzazione abilita il percorso evolutivo di innovazione dell'it

[MS10992]: Integrating On-Premises Core Infrastructure with Microsoft Azure

Application Provisioning

Copyright 2015 EMC Corporation. All rights reserved. Published in the USA.

Cisco Cloud Architecture with Microsoft Cloud Platform Peter Lackey Technical Solutions Architect PSOSPG-1002

vcloud Air - Virtual Private Cloud OnDemand User's Guide

VMware Cloud Provider Platform

What s New in VMware vcloud Automation Center 5.1

Developing Enterprise Cloud Solutions with Azure

VMworld 2013 Overview

VMware Virtual SAN Technology

Cisco Intelligent Automation for Cloud & Compute

Get ready to be what s next.

Cloud Computing Private Cloud

Title DC Automation: It s a MARVEL!

JOURNEY TO YOUR CLOUD. Mika Kotro Sales Development EMC Deutschland GmbH. Copyright 2012 EMC Corporation. All rights reserved.

Windows Azure Services - At Different Levels

Introducing VMware Validated Designs for Software-Defined Data Center

VMware vcloud Director Evaluator s Guide TECHNICAL WHITE PAPER

NetBackup as a Service

Tenant Onboarding. Tenant Onboarding Overview. Tenant Onboarding with Virtual Data Centers

Hybrid Cloud Solutions

What You Need to Know About OpenStack + VMware

Data Protection for Virtualized Environments

Data Management at Cloud Scale CommVault Simpana v10. VMware Partner Exchange Session SPO2308 February 2013

Modernize Your Backup and DR Using Actifio in AWS

Introducing VMware Validated Designs for Software-Defined Data Center

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0

Introducing VMware Validated Designs for Software-Defined Data Center

Customer Case Studies on Accelerating Their Path to Hybrid Cloud

EMC Strategy Overview: Journey To The Private Cloud

How Hybrid Cloud Accelerates IT Transformation

#techsummitch

Audience Data center administrators responsible for designing, installing and configuring a private cloud infrastructure.

MICROSOFT APPLICATIONS

VMware vcloud Air Network Program Product Usage Guide Q1 2015

Atos Canopy Orchestrated Hybrid Cloud. Mark Nouris - Atos Head of Cloud Michael Kollar Head of Cloud engineering & TIC

Automating the Software-Defined Data Center with vcloud Automation Center

Taming the Multi-Cloud With Simplicity and Openness. Minh Dang Cisco Systems Vietnam 2018 January

Managing Virtual Data Centers

70-532: Developing Microsoft Azure Solutions

LEAD YOUR CLOUD TRANSFORMATION. Copyright 2013 EMC Corporation. All rights reserved.

DC: Le Converged Infrastructure per Software Defined e Cloud Cisco NetApp - Softway. Luigi MARCOCCHIA SOFTWAY

VMware vfabric Data Director 2.5 EVALUATION GUIDE

The Future of Virtualization. Jeff Jennings Global Vice President Products & Solutions VMware

Introducing VMware Validated Designs for Software-Defined Data Center

Kako napraviti Cloud?

VMWARE SERVICE PROVIDER PROGRAM PRODUCT USAGE GUIDE Q3

VVD for Cloud Providers: Scale and Performance Guidelines. October 2018

The Future of Virtualization Desktop to the Datacentre. Raghu Raghuram Vice President Product and Solutions VMware

Copyright 2012 EMC Corporation. All rights reserved.

Transforming IT: From Silos To Services


The intelligence of hyper-converged infrastructure. Your Right Mix Solution

Data Center 3.0 Shift to IT as a Service with Your Own Private Cloud

Table of Contents HOL-SDC-1317

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

How to Keep UP Through Digital Transformation with Next-Generation App Development

Infrastructure modernization with Microsoft Azure

Monitoring and Operating a Private Cloud with System Center 2012 (70-246) Course Outline Module 1: Introduction to the Private Cloud

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

ReDefine Enterprise Storage

Understanding the latent value in all content

Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape

Dell EMC Enterprise Hybrid Cloud for Microsoft Azure Stack. Ahmed Iraqi Account Systems Engineer Dell EMC North & West Africa

Copyright 2012 EMC Corporation. All rights reserved.

VMware vsphere 4.0 The best platform for building cloud infrastructures

How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud. Santa Clara, California April 23th 25th, 2018

Cisco CloudCenter Solution with VMware

Microsoft Windows Embedded Server Overview

Driving Greater Business Outcomes with Next Gen IT. Prasad Radhakrishnan ASEAN Datacenter Computing Solutions 18 th Jan 2018

[TITLE] Virtualization 360: Microsoft Virtualization Strategy, Products, and Solutions for the New Economy

Introducing VMware Validated Designs for Software-Defined Data Center

Branch Office Desktop

Elmar Szych Cloud Solution Architekt

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Transcription:

Cloud + Big Data Putting it all Together Even Solberg 2009 VMware Inc. All rights reserved

2

Big, Fast and Flexible Data Big Big Data Processing Fast OLTP workloads Flexible Document Object Big Data Analytics Analytic workloads Key / Value OSS Relational Cloud Delivery Model Data as a service for private and public clouds 3

Big, Fast and Flexible Data Big Big Data Processing Fast OLTP workloads Flexible Document Serengeti Object GemFire Big Data Analytics Analytic workloads Key / Value OSS Relational vpostgres Cloud Delivery Model Data as a service for private and public clouds 4

Cloud Stack Neutral View SaaS PaaS IaaS 5

6 Big Data IaaS

but first, some Background. How to build an IaaS Cloud 7

https://customer.portal.org Generate Ticket 1:st Line Support Service Catalog Workflow engine SLA Descriptions Show back Billing Information Customer Portal ITSM Ticketing Change Mgmt Support Change & Release Mgmt Automated ITIL Process Including Approvals Service Renewal Service Owner Service Delivery Management Cost Models Usage Allocation Pay As You Go CB / SB Exported Billing Information Performance Mgmt Resource Mgmt Capacity Mgmt Compliance Mgmt Customer A Users Groups Service Catalog Custome r B Customer C Automated Provisioning Multi Tenancy IT Service Catalog Resource Distribution Resource Allocation Users Groups Service Catalog Users Groups Service Catalog Administrative Interface / Resource Allocation and Definition Customer D Users Groups Service Catalog Central Infrastructure Management Cust A Gold Network & Security Firewall VPN Load Balancer NAT Cust B Silve r Cust C Bronze Out Of The Box Integration Human Interaction Integration must be built

https://customer.portal.org Generate Ticket 1:st Line Support Service Service Catalog Workflow engine SLA Manager Descriptions Show back Billing Information -- Customer Portal DynamicOps ITSM Ticketing Change Mgmt Support Service Manager Change & Release Director Mgmt Application Automated ITIL Process Including Approvals Service Manager Renewal Service Owner Service Service Delivery Manager Management / ITBM Cost Models Usage Allocation vcenter Pay As You Go CB / SB Exported Billing Information Chargeback Performance Mgmt vcenter Resource Operations Mgmt Capacity Mgmt Compliance Suite Mgmt Customer A Users Groups Service Catalog Custome r B Customer C Automated Provisioning Multi Tenancy IT Service Catalog Resource Distribution Resource Allocation vcloud Director Users Groups Service Catalog Users Groups Service Catalog Administrative Interface / Resource Allocation and Definition vsphere Customer D Users Groups Service Catalog Cust A Gold Network & Security Firewall VPN vcns Load Balancer NAT Cust B Silve r Cust C Bronze Out Of The Box Integration Human Interaction Integration must be built

Organization: Finance Organization: Marketing Users & Policies Org VDC Catalogs Users & Policies Org VDC Catalogs Gold Provider Virtual Datacenters Silver Bronze VMware vcenter Server Resource Pools Datastores Port Groups VMware vsphere

Complete Cloud Suite Management Cloud Infrastructure Extensibility vfabric Application Director vcenter Operations Mgmt Suite vcenter Site Recovery Manager Software Defined Storage vcloud Director Software Defined Networking Virtualization vsphere Software Defined Security (server, storage, network) Software Defined Availability vcloud APIs vcloud Connector vcenter Orchestrator 11

Virtualizing Hadoop Project Serengeti 12

13 3 Big Reasons to Virtualize Hadoop

1. Virtualize Hardware Big SQL NoSQL Hadoop SQL NoSQL Unified Big Data Infrastructure Private Public Hadoop DSS 14

2. Rapid Provisioning I want my Hadoop cluster NOW! 15

3. Leverage Capabilities Increase Utilization No single points of failure VM Isolation Resource Management 16

What? Hadoop in a VM? Really? Actually, Hadoop performs well in a virtual machine 17

Performance of Hadoop for Several Workloads Ratio of time taken Lower is Better 1,2 1 Ratio to Native 0,8 0,6 0,4 1 VM 2 VMs 0,2 0 18

Fast Provisioning From a seed node to a cluster Thin Provisioning Linked Clone 60GB => 3.5GB ~6 second 19

Being Efficient through Resource over-commitment Memory over-commitment Hadoop JVMs hold onto memory even when not busy vsphere memory overcommit allows us to pack more hadoop nodes per host If you use EM4J, this can be optimized further Disk over-commitment Hadoop is designed for large dataset Thin-provisioning is wonderful in saving disk footprint 20

Performance Create more smaller VMs Makes Hadoop scale better Single large Hadoop node is limited by JVM scalability Allows for easier/faster adjustment of packing of VMs across hosts by vsphere (through DRS) Sizing/Configuration of storage is critical Plan on ~50Mbytes/sec of bandwidth per core SAN ports/switches will limit performance SANs are typically configured by default for IOPS, not Bandwidth Performance of the backend storage should be tested/sized Local disks will give ~100MBytes/sec per disk: pick correct controller 21

Summary Hadoop does work well in a virtual environment Plan a virtual cluster, enable other big-data solutions on the same infrastructure Leverage the recipes to automate your configuration and deployment 22

The big glaring hole [with cloud] is data handling. -Adrian Kunzle, MD Head of Engineering & Architecture, JPMorgan Chase

New Ways to Work with Data NoSQL In-memory Key/value pairs, simplicity, high productivity Different offerings, different data models: document, graph, big table, column NewSQL In-memory Scalability benefits of in-memory systems with standardized SQL Classic SQL Traditional RDBMS ACID (atomicity, consistency, isolation, durability) 24

How do you scale the data tier? 25

vfabric GemFire Application Data Lives Here Application Data Sleeps Here 26

Key Capabilities Low-latency, linearly-scalable, memory-based data fabric Data-aware execution Active/continuous querying and event notification 27

Primary Use Cases Web session cache, L2 cache App data cache, in-memory DB Grid data fabric: client compute Grid data fabric: fabric compute 28

Existing Applications New Applications vfabric Data Director DBA App Dev Automation Self-Service Provisioning Backup / Restore Clone One click HA DBA IT Admin Policy Based Control Resource Mgmt Security Mgmt Database Templates Monitor Private Cloud Hybrid Cloud Public Cloud

Big Data PaaS Cloud Foundry & vfabric 30

Cloud Stack Neutral View SaaS PaaS IaaS 31

Cloud Stack Classic Pyramid SaaS PaaS IaaS 32

Cloud Stack By Numbers SaaS PaaS IaaS 33

Cloud Stack By Value SaaS PaaS IaaS 34

Big Data PaaS Architecture Business Intelligence Applications UI Framework Data Integration Big Data API Data Process Analytics Workflow Scheduling Metadata Languages U-Data Store Coordination Other Application Services Graph Store Read / Write Access Application Lifecycle Management Security Systems Monitoring & Management Infrastructure as a Service (IaaS) 35

37 OSS community

vfabric Postgres Data Services vfabric RabbitMQ TM Msg Services Additional partners services Other Services 38

Data Services Private Clouds Msg Services Public Clouds Partners Other Services Micro Clouds.COM 39

VMware Cloud Application Platform Programming Model Rich Web Social and Mobile Data Access Integration Patterns Batch Framework Spring Tool Suite WaveMaker Cloud Foundry Java Runtime (tc Server) Web Runtime (ERS) Messaging (RabbitMQ) Global Data (GemFire) In-mem SQL (SQLFire) App Monitoring (Spring Insight) Performance Mgmt (Hyperic) Java Optimizations (EM4J, ) Virtual Datacenter Cloud Infrastructure and Management Automated App Provisioning (AppDirector) 40

Big Data SaaS Cetas 41

Data Sources 42

On-Premise Installation 43

Cloud-based Installation 44

45 Summary

Big, Fast and Flexible Data Big Big Data Processing Fast OLTP workloads Flexible Document Serengeti Object GemFire Big Data Analytics Analytic workloads Key / Value OSS Relational vpostgres Cloud Delivery Model Data as a service for private and public clouds 46

47