NEXT GENERATION DATACENTER XTREMIO and SCALEIO Aleksandr Shvadtshenko, Sr Systems Engineer, EMC Baltic States 1
EVERY BUSINESS WILL BE REDEFINED 2
NEW DIGITAL ERA APPS LOCATION IMAGES EACH APP HAS 1,000X MORE USERS DOCUMENTS VIDEO EACH USER = 1,000X MORE DATA Catch people or things in the act of doing something and affect the outcome 3
BUILT ON A NEW PLATFORM 3RD PLATFORM BILLIONS OF USERS Mobile Cloud Big Data Social Mobile Devices MILLIONS OF APPS HUNDREDS OF MILLIONS OF USERS 2ND PLATFORM LAN/Internet PC Client/Server TENS OF THOUSANDS OF APPS 1ST PLATFORM MILLIONS OF USERS Mainframe, Mini Computer Terminals THOUSANDS OF APPS 4
HYPERCONVERGED INFRASTRUCTURE 5
HYPERCONVERGED INFRASTRUCTURE VIRTUAL COMPUTE + MEMORY + STORAGE + NETWORK SOFTWARE DEFINED + SCALE OUT ARCHITECTURE + SIMPLE TO MANAGE 6
INDUSTRY TRENDS: BUILDING BLOCKS FOR NEXT GENERATION DATACENTER Next Generation Applications Trend PaaS, Mobile, Hadoop Which Will Enable Hybrid Clouds Which Will Be Used As Building Blocks Which Will Be Deployed In CI Hybrid Cloud Trend Software-Defined Data Center Trend Converged Infrastructure Trend Elastic, Agile, Data Center Running Next Generation Apps 2 Key Changes To The Underlying Storage Market Flash Trend New Levels of Performance COTS Trend New Levels of Efficiency 7
All-Flash Arrays (AFA) 8
Evolution of All-Flash Array Architectures This research shows that flash will become the lowest cost media for almost all storage from 2016 and beyond, and that a shared data philosophy is required to maximize the potential from both storage cost and application functionality perspectives. http://wikibon.org/wiki/v/evolution_of_all-flash_array_architectures 9
Why Flash? Challenge #1: Latency Bad user experience Application timeout Lost time = money 10
Why Flash? Latency cause: CPU-HDD performance gap CPU IS FAST AND GETS FASTER DISK IS SLOW AND DOES NOT GET ANY FASTER LATENCY gap: 100x (2000), 800x (2010), 10000x (2020) 11
Why Flash? Solution: Flash will close the performance gap CPU IS FAST AND GETS FASTER FLASH IS FAST AND GETS FASTER CPU-HDD gap: 100x (2000), 800x (2010), 10000x (2020) 12
XtremIO Sudden Impact - SQL Server 13
Business Value of next generation All-Flash-Array: business will run faster Faster Applications -> Happier users More Transactions -> More sales Faster reports -> Instant business decisions Able to trade for 40 minutes extra per day = 156,000 per day 14
Gen 1 Flash Systems Legacy Disk Arrays The Evolution of FLASH Arrays Flash Optimized Hybrids Dual Controller Limited Scale Single Workloads Some Data Services Scale Up All Flash Arrays Dual Controller Limited Scale Single Workloads Sometimes Data Services Compromise Between Performance & Efficiency - Data Svcs Scale Out All Flash Arrays Multi Controller (16-XtremIO) Petabyte Scale Mixed Workloads In-Line Data Services Lowest TCO Highest Performance All Flash Data Center Gen 1 Gen 2 Gen 3 Gen 4 Source: Wikibon December 2014 15
Avoiding Bad Design Choices 5 Common Things AFAs Borrow from Disk: Log Structuring (uneven SSD wear); Post-Process (performance inconsistency); RAID Parity (SSD wear, write amplification); Metadata De-stage (performance inconsistency); Asymmetric Controllers, A/P (uneven resource utilization). XTREMIO DOES NOT USE ANY OF THESE 16
XTREMIO DATA PROTECTION Designed for SSD No legacy RAID baggage Highly efficient- only 8% overhead Lowest write amplification SSDs may fail in place No configuration No hot spares Adapts to failures High Performance RAID 1 Good Capacity Utilization RAID 5 XDP Superior Protection RAID 6 Copyright 2014 EMC Corporation. All rights reserved. 17
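The 8% overhead figure on the slide above matches two SSDs' worth of redundancy in a 25-SSD X-Brick. The sketch below works through that arithmetic. The 23+2 layout for XDP and the stripe widths for the RAID levels are assumptions for illustration; the slide does not state them.

```python
def redundancy_overhead(data_drives: int, redundancy_drives: int) -> float:
    """Fraction of raw capacity consumed by redundancy (mirror or parity)."""
    return redundancy_drives / (data_drives + redundancy_drives)


# Stripe widths are illustrative assumptions, not figures from the slide.
layouts = {
    "RAID 1 (1+1 mirror)": (1, 1),
    "RAID 5 (4+1)": (4, 1),
    "RAID 6 (6+2)": (6, 2),
    "XDP (23+2, assumed)": (23, 2),
}

for name, (data, redundancy) in layouts.items():
    print(f"{name}: {redundancy_overhead(data, redundancy):.0%} overhead")
```

Under these assumptions the wider stripe is what keeps the overhead low while still tolerating two failures, which is the trade-off the slide positions XDP against RAID 1, 5 and 6.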
Typical AFA System Level Garbage Collection Array Controller Huge back-end I/O amplification initiated by array controller Cannot defer process when array must free up space Big tax on array controllers to manage; limits capacity 18
XtremIO No System Level Tax XtremIO Controller: No Garbage Collection Garbage collection performed by each SSD controller ASIC SSD controllers have ideal knowledge of the NAND Zero back-end I/O initiated by the array controllers No tax on array controllers 19
Why Customers Choose XtremIO Latency Latency Consistent High Performance & Low Latency Actual customer data Workload: OLTP with 8KB Block Size Latency Spikes 2-4.5ms = Gen 3 Flash Various latency spikes <1ms = XtremIO Consistent & Predictable (~0.5ms) System-Level Garbage Collection Impact Worse than Disk Latency 20-40ms = Gen 3 Flash Latency increases with capacity <1ms = XtremIO Always Consistent & Predictable XtremIO X-Brick Traditional Flash Array 20
Architecture Matters Part 2 Scale Up Dual Controller Shared Meta-Data Mid-Tier Architecture TYPE 1 Tightly Coupled Scale-Out Multi-Controller Grid Shared Meta-Data TYPE 2 Enterprise Architecture Loosely Coupled Scale Out TYPE 3 Why Scale-Out is Critical for AFAs Independent Controllers Distributed Meta-Data Large Content Repositories Distributed Share Nothing Controllers are the Bottleneck TYPE 4 Independent Controllers Distributed Meta-Data Non-Transactional Commits No front-end relief in Scale Up Limited CPU for Data Services Linear Scaling IOPS & Latency 7+ Year Asset Lifecycle No performance tech refreshes Xpect More Program VNX VMAX Isilon XtremIO Superior Isilon VSAN Resiliency & Performance N-Way Active/Active Architecture 21
Scale Up vs XtremIO Scale Out Scale Up: Active/Passive controllers in front of Flash XtremIO Scale Out: ACTIVE/ACTIVE, 16 ACTIVE CONTROLLERS (Controller 1 through Controller 16) SAN IOPS SCALE: 150K IOPS, 2M IOPS 22
Product Family, On-Demand Linear Scale-Out 150K mixed IOPS 250K read IOPS <1ms Latency From 2 to 16 N-way Active Controllers 1.2M mixed IOPS 2M read IOPS <1ms Latency PBs Capacity Starter 5TB 5 to 320TB physical, PBs effective capacity based on 10, 20, & 40 TB X-Bricks SCALE-OUT <1ms LATENCY RICH DATA SERVICES NO TUNING 23
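The per-brick and full-cluster figures on the slide above are consistent with linear scaling from one X-Brick (2 controllers) to eight (16 controllers). A minimal sketch of that arithmetic follows. The function name and the idea of treating scaling as a plain multiplier are assumptions for illustration; real clusters are sized with vendor tools.

```python
# Per-X-Brick figures taken from the slide.
MIXED_IOPS_PER_BRICK = 150_000
READ_IOPS_PER_BRICK = 250_000
CONTROLLERS_PER_BRICK = 2
LARGEST_BRICK_TB = 40  # the slide lists 10, 20 and 40 TB X-Bricks


def cluster_profile(x_bricks: int) -> dict:
    """Project cluster figures assuming perfectly linear scale-out."""
    return {
        "controllers": x_bricks * CONTROLLERS_PER_BRICK,
        "mixed_iops": x_bricks * MIXED_IOPS_PER_BRICK,
        "read_iops": x_bricks * READ_IOPS_PER_BRICK,
        "max_physical_tb": x_bricks * LARGEST_BRICK_TB,
    }


for bricks in (1, 2, 4, 8):
    print(bricks, cluster_profile(bricks))
```

With eight X-Bricks the projection gives 16 controllers, 1.2M mixed IOPS, 2M read IOPS and 320 TB physical, which are the upper figures quoted on the slide.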
XtremIO X-Brick Cluster Building Block 32 CPU Cores ACTIVE CONTROLLER 1 ACTIVE CONTROLLER 2 512 GB RAM Controller RAM InfiniBand RDMA Controller RAM 4 Host Ports 2 x FC 2 x iSCSI SAS 2.0 SAS 2.0 2 x FC 2 x iSCSI 25 eMLC SSDs 24
XtremIO In-MEMORY Metadata XtremIO scales-out. Granular metadata is distributed across controllers in a shared memory fabric. PERFORMANCE Consistent & predictable AMAZING Metadata heavy operations USER DATA USER DATA USER DATA USER DATA REMOTE DIRECT MEMORY ACCESS FABRIC 25
Efficient IN-MEMORY VM cloning with XtremIO Hypervisor Instantly provision 1000s of VMs Metadata only operation; no disk I/Os No storage sprawl X-COPY / ODX COMMAND IN-MEMORY METADATA BLOCK BLOCK BLOCK BLOCK UNIQUE, COMPRESSED USER DATA ON SSD 26
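The idea of a metadata-only clone on the slide above can be sketched with a toy content-addressed block store. This is an illustration under stated assumptions, not XtremIO's implementation: the `BlockStore` class, its methods, and the use of SHA-256 as a fingerprint are all hypothetical.

```python
import hashlib
from collections import Counter


class BlockStore:
    """Toy content-addressed store: volumes are lists of block fingerprints."""

    def __init__(self):
        self.blocks = {}            # fingerprint -> unique user data
        self.refcount = Counter()   # fingerprint -> number of references
        self.volumes = {}           # volume name -> list of fingerprints
        self.data_writes = 0        # counts writes of user data blocks

    def write_volume(self, name, chunks):
        fingerprints = []
        for chunk in chunks:
            fp = hashlib.sha256(chunk).hexdigest()
            if fp not in self.blocks:       # only unique data is stored
                self.blocks[fp] = chunk
                self.data_writes += 1
            self.refcount[fp] += 1
            fingerprints.append(fp)
        self.volumes[name] = fingerprints

    def clone_volume(self, source, target):
        # Metadata-only: copy the fingerprint list and bump reference counts.
        fingerprints = list(self.volumes[source])
        for fp in fingerprints:
            self.refcount[fp] += 1
        self.volumes[target] = fingerprints


store = BlockStore()
store.write_volume("vm-template", [b"os", b"app", b"os"])
writes_before = store.data_writes
for i in range(1000):
    store.clone_volume("vm-template", f"vm-{i}")

print(len(store.volumes), "volumes,", store.data_writes - writes_before,
      "new data blocks written")
```

In this sketch the thousand clones add only metadata entries; the number of stored data blocks does not change, which is the "no storage sprawl" property the slide describes.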
Today's Applications Environment JUST LIMITED TEST/DEV COPIES Brute Force Copy PRODUCTION Brute Force Copy DATAMART COPIES ONE APPLICATION 6 DATABASE COPIES 3 ARRAYS/POOLS 1 USE CASE FOR FLASH 27
Game-Changing Consolidation, Agility SCALE-OUT IOPS IN ABUNDANCE FREE, FAST DEV/TEST FREE, FAST ANALYTICS XTREMIO FLASH FOR ENTIRE APPLICATION XTREMIO CLUSTER: HIGH PERFORMANCE PRODUCTION; 10x TEST/DEV COPIES (DEV INST 1, DEV INST 2, DEV INST 3, ten of each); DATAMART COPIES (FIN COPY for REPORTS, OPS COPY for ANALYTICS, SALES COPY for TRENDS); 0 BRUTE FORCE COPIES MORE BUSINESS PRODUCTIVITY FASTER APPLICATION DEVELOPMENT TIMES 28
Business Value #2 of next generation All-Flash-Array: business will run faster Faster Development Copies -> Faster development More Testing Copies -> More testing and improved quality More & Faster Development Resources -> Better code and fewer bugs 29
XtremIO's Unique Architecture Consistent Predictable Performance + Efficiency SCALE OUT Linear Scale IOPS, Bandwidth & Capacity DATA CENTER SERVICES HA/BC, App Management, Converged Infrastructure Self Service Provisioning and Orchestration Storage Resource Management linear scale out sub 1ms latency INLINE AND UNSTOPPABLE DATA SERVICES Data Reduction Efficiencies, In-Memory Metadata Validated Reference Architectures Intelligent vSphere VAAI Integration Continuous Data Protection and Disaster Recovery Enterprise Multi Pathing Converged Infrastructure VSI, DB, VDI 2-3 Site Continuous Availability Thin Provisioning Flash Data Protection Database Consistent Snapshot Management Virtualization Management Integration (VMW & MS) Deduplication Compression Encryption Writeable Copies EMC Storage Analytics VMware vRealize Ops 30
XPECT MORE PROGRAM * EMC IS REDEFINING THE STORAGE LIFECYCLE MAINTENANCE PRICE PROTECTION 3-YEAR MONEY-BACK WARRANTY FLASH ENDURANCE PROTECTION 7 YEARS 2015-2018 7 YEARS * For qualifying customers through December 31, 2015. See EMC.com/XpectMore for details, terms and conditions. 31
The Rise of Server SAN 32
The Rise of Server SAN http://wikibon.org/wiki/v/the_rise_of_server_san 33
Introducing EMC ScaleIO Software Defined, Scale-Out SAN Hyper-converged, software-defined block storage for commodity hardware Software that creates a server-based SAN from local storage to deliver performance and capacity on demand Run storage and applications in the same server(s) Leverages any media (HDDs, SSDs, & PCIe flash) Runs on physical & virtual servers 34
ScaleIO Hyper-converged architecture Flash SSD HDD RAID Cache RAM Bare Metal KVM VMware Hyper-V Compute Network ETH/IB Storage 35
ScaleIO SDC & SDS ScaleIO Data Client (SDC): Lightweight device driver. Allows ScaleIO block devices to be presented to the application server. ScaleIO Data Server (SDS): Manages storage capacity on the application/storage server. Provides back-end access to devices for the SDCs. 36
ScaleIO architecture ScaleIO hyperconvergence: resource utilization Impact on server is below 10% CPU load Maximum IOPS load on SSD pool Maximum IOPS load on 10K pool 37
ScaleIO PERFORMANCE Scales Linearly ~31M IOPS is about 8X better than the latest high-end HDS storage (G1000) for a fraction of the cost. If we were to extrapolate the lines, we would get ~180M IOPS for 1024 Nodes (~46 G1000 systems) 38
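The two comparisons on the slide above can be cross-checked with simple arithmetic. The sketch below uses only the slide's own figures; the variable names are illustrative, and the implied per-G1000 figure is derived from the slide's 8X claim rather than taken from any HDS specification.

```python
# Figures quoted on the slide.
measured_scaleio_iops = 31e6        # ~31M IOPS measured
speedup_vs_g1000 = 8                # "about 8X better"
extrapolated_scaleio_iops = 180e6   # ~180M IOPS extrapolated to 1024 nodes
nodes = 1024

# Per-G1000 throughput implied by the 8X claim (derived, not a vendor spec).
implied_g1000_iops = measured_scaleio_iops / speedup_vs_g1000

# How many G1000 systems the extrapolated figure corresponds to.
equivalent_g1000_systems = extrapolated_scaleio_iops / implied_g1000_iops

# Average per-node contribution implied by the extrapolation.
iops_per_node = extrapolated_scaleio_iops / nodes

print(f"Implied G1000 throughput: {implied_g1000_iops / 1e6:.2f}M IOPS")
print(f"Equivalent G1000 systems: {equivalent_g1000_systems:.1f}")
print(f"Implied IOPS per node:    {iops_per_node:,.0f}")
```

The result is roughly 46 systems, so the slide's "~46 G1000 systems" follows from its own 8X and 180M figures. The extrapolation itself assumes the linear trend continues to 1024 nodes.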
Elastic architecture Add, remove, re-allocate, on the fly BALANCED BALANCED BALANCED Auto-balance of resources across nodes/clusters Auto-rebalance when resources are added Auto-rebuild when resources fail or removed BALANCED BALANCED Easier Capacity Planning No Migrations 39
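The auto-rebalance behaviour described on the slide above can be illustrated with a toy placement map. This is a sketch under simplifying assumptions, not ScaleIO's algorithm: the function names are hypothetical, chunks are plain integers, and a real system would rebuild lost data from redundant copies rather than move it from the removed node.

```python
def rebalance(placement: dict[str, list[int]]) -> int:
    """Move chunks from the fullest to the emptiest node until node
    chunk counts differ by at most one. Returns the number of moves."""
    moved = 0
    while True:
        fullest = max(placement, key=lambda n: len(placement[n]))
        emptiest = min(placement, key=lambda n: len(placement[n]))
        if len(placement[fullest]) - len(placement[emptiest]) <= 1:
            return moved
        placement[emptiest].append(placement[fullest].pop())
        moved += 1


def add_node(placement: dict[str, list[int]], node: str) -> int:
    """Add an empty node, then rebalance onto it."""
    placement[node] = []
    return rebalance(placement)


def remove_node(placement: dict[str, list[int]], node: str) -> int:
    """Drop a node, hand its chunks to a survivor, then rebalance."""
    orphaned = placement.pop(node)
    survivor = next(iter(placement))
    placement[survivor].extend(orphaned)
    return rebalance(placement)


cluster = {
    "node1": [0, 1, 2, 3],
    "node2": [4, 5, 6, 7],
    "node3": [8, 9, 10, 11],
}
moves_on_add = add_node(cluster, "node4")
print({n: len(c) for n, c in cluster.items()}, "moves:", moves_on_add)

moves_on_remove = remove_node(cluster, "node4")
print({n: len(c) for n, c in cluster.items()}, "moves:", moves_on_remove)
```

Only the chunks needed to even out the load are moved in either direction, which is why adding or removing capacity in such a design does not require a manual migration.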
ENTERPRISE GRADE PERFORMANCE INTEROPERABILITY MONITORING FAULT TOLERANCE RESILIENCY Flash Cache OpenStack Cinder RESTful API SNMP & Call Home Rack Level High Availability RecoverPoint APP 1 APP 2 APP 3 MULTI-TENANCY PROTECTION SECURITY ELASTICITY EFFICIENCY IOP and QoS Controls Writeable Snapshots Data masking MDM & SDC Automatic Rebalance Thin Provisioning 40
ScaleIO System wide visibility, cost effective management at scale Automated installation scripts and easy configuration Manage the entire data center stack from single UI Monitor HW/SW failure recovery: no administrative intervention required Options: CLI, UI, REST, ViPR Controller, ViPR SRM, vSphere, OpenStack Call-home and incident reporting VIEW AGGREGATE CAPACITY SCALEIO NODE COUNT PERFORMANCE MB/S & IOPS READS/ WRITES 41
3 SCALEIO CONSUMPTION MODELS ScaleIO SW ScaleIO Node VCE VxRack Software Defined Maximum flexibility Ultra Scale-Out SDS Software Only Complete flexibility End user supplies server End user supplies switch Scale-Out Block Storage Node HW + SW Bundle Package High degree of flexibility/ choice Primarily a SAN.Next Use Case Software Defined Controlled Environment Turnkey Fully Engineered Solution Fully integrated Application platform Primarily Infrastructure.Next Use Case 42
ScaleIO node ScaleIO SW + commodity Server bundle Software Proprietary EMC SW Fully supported by EMC EMC Commodity Server Node Fully tested by EMC Fully supported by EMC Available but optional: switches, mgmt. server, EMC rack and Network Interconnects ScaleIO Node Bundled ScaleIO SW + EMC commodity server Full EMC supported SW and HW 43
Benefits of Software-defined-storage: SUPREME ELASTICITY UNPARALLELED FLEXIBILITY MASSIVE SCALABILITY EXTREME PERFORMANCE COMPELLING ECONOMICS ATTAIN 8x better performance than TRADITIONAL SAN Scales To 1k+ NODES EFFICIENCY 50 % Over Traditional SAN Up To 30-60 % TCO Savings 44
Free Software Downloads www.emc.com/products-solutions/trial-software-download/ 45
HOW TO BRIDGE 2ND AND 3RD PLATFORM ARCHITECTURE? Blades, Virtualization, Arrays COTS, Storage Software 46
ViPR CONTROLLER CLOAKS & MANAGES ALL DIAMOND GOLD SILVER BRONZE Control Abstraction VMAX VNX VBLOCK XtremIO Isilon EMC ScaleIO 47
ViPR service catalog 48
CoPRHD @ emccode.github.io emccode@emc.com @EMCCode http://emccode.github.io {code} http://blog.emccode.com http://dasboard.emccode.com 50
TOP 3 STRATEGIC TRENDS 1. Converged and Hyperconverged Infrastructure 2. The Rise of Server SAN 3. All-Flash Arrays 51