Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

Similar documents
Tiered File System without Tiers. Laura Shepard, Isilon

Virtualization Practices: Providing a Complete Virtual Solution in a Box

Storage Virtualization II Effective Use of Virtualization - focusing on block virtualization -

The Role of WAN Optimization in Cloud Infrastructures. Josh Tseng, Riverbed

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

Planning For Persistent Memory In The Data Center. Sarah Jelinek/Intel Corporation

Virtualization Practices:

PRESENTATION TITLE GOES HERE. Understanding Architectural Trade-offs in Object Storage Technologies

Storage Virtualization II. - focusing on block virtualization -

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Multi-Cloud Storage: Addressing the Need for Portability and Interoperability

Storage Performance Management Overview. Brett Allison, IntelliMagic, Inc.

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

ADVANCED DATA REDUCTION CONCEPTS

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

Overview and Current Topics in Solid State Storage

Overview and Current Topics in Solid State Storage

Massively Scalable File Storage. Philippe Nicolas, KerStor

Data Deduplication Methods for Achieving Data Efficiency

Adrian Proctor Vice President, Marketing Viking Technology

Effective Storage Tiering for Databases

SAS: Today s Fast and Flexible Storage Fabric. Rick Kutcipal President, SCSI Trade Association Product Planning and Architecture, Broadcom Limited

pnfs, parallel storage for grid and enterprise computing Joshua Konkle, NetApp, Inc.

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM

Deploying Public, Private, and Hybrid. Storage Cloud Environments

Trends in Worldwide Media and Entertainment Storage

Architectural Principles for Networked Solid State Storage Access

The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

SRM: Can You Get What You Want? John Webster, Evaluator Group.

ASPECTS OF DEDUPLICATION. Dominic Kay, Oracle Mark Maybee, Oracle

The File Systems Evolution. Christian Bandulet, Sun Microsystems

Notes & Lessons Learned from a Field Engineer. Robert M. Smith, Microsoft

in Transition to the Cloud David A. Chapa, CTE EVault, a Seagate Company

SAS: Today s Fast and Flexible Storage Fabric

SNIA Tutorial 1 A CASE FOR FLASH STORAGE HOW TO CHOOSE FLASH STORAGE FOR YOUR APPLICATIONS

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

Create a Smarter and More Economic Cloud Storage Architecture. November 7, 2018

Restoration Technologies

Storage in combined service/product data infrastructures. Craig Dunwoody CTO, GraphStream Incorporated

Mark Rogov, Dell EMC Chris Conniff, Dell EMC. Feb 14, 2018

LTFS Bulk Transfer Standard PRESENTATION TITLE GOES HERE

Expert Insights: Expanding the Data Center with FCoE

Use Cases for iscsi and FCoE: Where Each Makes Sense

The Evolution of File Systems

Analysis and Optimization. Carl Waldspurger Irfan Ahmad CloudPhysics, Inc.

How to create a synthetic workload test. Eden Kim, CEO Calypso Systems, Inc.

Facing an SSS Decision? SNIA Efforts to Evaluate SSS Performance. Ray Lucchesi Silverton Consulting, Inc.

Enterprise Class Mass Storage: Innovative Approaches to Disk-based Active Archive Storage

Ron Emerick, Oracle Corporation

Mobile and Secure Healthcare: Encrypted Objects and Access Control Delegation

Felix Xavier CloudByte Inc.

WHAT HAPPENS WHEN THE FLASH INDUSTRY GOES TO TLC? Luanne M. Dauber, Pure Storage

Apples to Apples, Pears to Pears in SSS performance Benchmarking. Esther Spanjer, SMART Modular

OpenStack Manila An Overview of Manila Liberty & Mitaka

Evolution of Fibre Channel. Mark Jones FCIA / Emulex Corporation

An Introduction to Key Management for Secure Storage. Walt Hubis, LSI Corporation

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Application Recovery. Andreas Schwegmann / HP

Virtualization of the MS Exchange Server Environment

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Part Cyan Storage Management. September 28, :00 am PT

Felix Xavier CloudByte Inc.

Extending RDMA for Persistent Memory over Fabrics. Live Webcast October 25, 2018

Cloud Archive and Long Term Preservation Challenges and Best Practices

New Approach to Unstructured Data

Everything You Wanted To Know About Storage: Part Teal The Buffering Pod

I D C T E C H N O L O G Y S P O T L I G H T

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

The Intersection of Cloud & Solid State Storage

ORACLE FABRIC MANAGER

SRM: Can You Get What You Want? John Webster Principal IT Advisor, Illuminata

Tutorial. A New Standard for IP Based Drive Management. Mark Carlson SNIA Technical Council Co-Chair

IBM Spectrum NAS. Easy-to-manage software-defined file storage for the enterprise. Overview. Highlights

StorageCraft OneBlox and Veeam 9.5 Expert Deployment Guide

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

Trends in Data Protection CDP and VTL

Jeff Dodson / Avago Technologies

VMworld 2013 Overview

Understanding Your Virtual Workload. Irfan Ahmad CTO CloudPhysics

TUTORIAL: WHITE PAPER. VERITAS Indepth for the J2EE Platform PERFORMANCE MANAGEMENT FOR J2EE APPLICATIONS

Storage Clouds. Marty Stogsdill Oracle

StorageCraft OneXafe and Veeam 9.5

SNIA Tutorial 3 EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE: Part Teal Queues, Caches and Buffers

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

The HP 3PAR Get Virtual Guarantee Program

The Google File System

Everything You Wanted To Know About Storage (But Were Too Proud To Ask) The Basics

Surveillance Dell EMC Storage with Synectics Digital Recording System

Evolution of Rack Scale Architecture Storage

AUTOMATING IBM SPECTRUM SCALE CLUSTER BUILDS IN AWS PROOF OF CONCEPT

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Market Report. Scale-out 2.0: Simple, Scalable, Services- Oriented Storage. Scale-out Storage Meets the Enterprise. June 2010.

Storage and Virtualization: Five Best Practices to Ensure Success. Mike Matchett, Akorri

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0

Transcription:

Scaling Data Center Application Infrastructure Gary Orenstein, Gear6

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced without modification The SNIA must be acknowledged as source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the Author nor the Presenter is an attorney and nothing in this presentation is intended to be nor should be construed as legal advice or opinion. If you need legal advice or legal opinion please contact an attorney. The information presented herein represents the Author's personal opinion and current understanding of the issues involved. The Author, the Presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2008 Storage Networking Industry Association. All Rights Reserved. 2

Abstract Data center managers must support ever-increasing application workloads for up to tens of thousands of users. The demands placed upon the underlying infrastructure require proper planning and architecture in order to scale efficiently. Application managers can choose to deploy application infrastructure internally using readily available technology solutions. Additionally, there are options to extend application infrastructure with cloud computing offerings from Amazon Web Service and Google AppEngine. Even if application managers do not make use of the cloud computing offerings directly, the respective architectures provide an excellent reference model for private infrastructure deployment. In all cases, application managers need to know what tools and resources are available to help scale infrastructure to support an ever increasing user base. 3 2008 Storage Networking Industry Association. All Rights Reserved. 3

INTRODUCTION Systems and data center level view The File Explosion and Storage Impact Three Case Studies: Background Examining the I/O Bottleneck and Conventional Solutions Caching for Scale: Data Center Strategies Caching in Context: Case Study Review 2008 Storage Networking Industry Association. All Rights Reserved. 4

Huge File Counts Driving New Bottlenecks Old bottleneck Limited capacity Compound Growth, 2007-2011 88% New bottlenecks Huge file counts Deep directory requests Simultaneous users Unpredictable access patterns 59% All leading to Painful access times Source: IDC 2008 http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf 2008 Storage Networking Industry Association. All Rights Reserved. 5

File Explosion Issues Facing Individual Companies Music downloads in 5 years 5 billion 100 photos millionuploaded per week Files concurrently >100K accessed by 30,000 clients in under 1 millisecond 1 billion Searchable videos by 2009 http://money.cnn.com/news/newsfeeds/articles/djf500/200809091346dowjonesdjonline000554_fortune5.htm http://www.flowgram.com/p/2qi3k8eicrfgkv/ http://www.searchenginejournal.com/truveo-forecasts-1-billion-searchable-online-videos-by-2009/6203/ 2008 Storage Networking Industry Association. All Rights Reserved. 6

Data Load and Storage CPU Load 100% Storage Effectiveness Ability to efficiently use all system functionality without over provisioning resources Storage Effectiveness Threshold Data Load Warning Zone! Storage CPU Load 0% January December 2008 Storage Networking Industry Association. All Rights Reserved. 7

The Rise of Indexing Bottlenecks Common Index Overload! 2008 Storage Networking Industry Association. All Rights Reserved. 8

Walking the Directory Tree Global namespaces can add to performance concerns Sample NFS directory lookup /quick/brown/fox/jumped/over/the/lazy/dog.file /quick Additional NFS operation /brown /fox /jumped Requested content: dog.file 2008 Storage Networking Industry Association. All Rights Reserved. 9

The Impact of High File Counts Conventional Model Web/App Servers Numerous metadata requests Lengthy response times Inability to scale the number of users Storage System Impact High CPU utilization Slow response times Inability to use all functionality Snapshots Disk over provisioning System over provisioning Disk Storage 2008 Storage Networking Industry Association. All Rights Reserved. 10

Three Case Study Scenarios Data warehousing Software development Web scale 2008 Storage Networking Industry Association. All Rights Reserved. 11

Enterprise Data Warehouse Configuration Poor Current environment Many databases Large and small Highly active and less active Large number of concurrent users Access control and authentication mechanism in place Single storage repository streamlines management but is prone to bottlenecks Storage Health Good 2008 Storage Networking Industry Association. All Rights Reserved. 12

Enterprise Data Warehouse Configuration Database Split Pros Cons Reduce Single System Workload Pain to split database Excessive overhead / management Concurrency challenges Poor Storage Health Good 2008 Storage Networking Industry Association. All Rights Reserved. 13

Software Development Bottlenecks Heavy I/O Load Compiling Process Regressions Poor Storage Health 2008 Storage Networking Industry Association. All Rights Reserved. 14 Good

Software Development - Replicas Pros Cons Reduce storage CPU load Over-provisioned storage Excess manual administration Manually administered disk-based replicas Poor Good 2008 Storage Networking Industry Association. All Rights Reserved. Storage Health 15

Web Scale Applications Step 1 Index servers crawl database Index Servers 4 4 Step 4 Serve search requests 1 3 Database 2 Lengthy propagation cycle limits update rate to every 24 hours Step 2 Index servers generate index file NFS Storage Step 3 Manually propagate updated index file to local storage 2008 Storage Networking Industry Association. All Rights Reserved. 16

Current Trends Driving Increasing I/O Bottlenecks Application traffic trends Shared I/O applications File-content explosion Web-scale applications Server virtualization I/O Bottlenecks Current trends driving painful storage problems 2008 Storage Networking Industry Association. All Rights Reserved. 17

Current Ineffective Performance Approaches Client caching Subsystem caching Over provisioning Limited capacity Inefficient Isolated Limited capacity Difficult to scale Resources anchored to each subsystem Hot Spots No latency reduction High CAPEX and OPEX 2008 Storage Networking Industry Association. All Rights Reserved. 18

A Network-Centric Approach: Centralized Caching Increase performance Reduce total system costs Leverage existing infrastructure Network Cache Cached data served 10-50x faster from memory Scale easily 2008 Storage Networking Industry Association. All Rights Reserved. 19

Solutions Needed At All Layers Server Networking Storage 2008 Storage Networking Industry Association. All Rights Reserved. 20

Server Layer Application Scaling Server Virtualization Parallelization Clustering 2008 Storage Networking Industry Association. All Rights Reserved. 21

Networking Layer Bandwidth/ Latency Functionality Bandwidth Latency Networking Bandwidth/ Latency Functionality File Acceleration Load Balancing File Access Optimization 2008 Storage Networking Industry Association. All Rights Reserved. 22

Storage Layer Scalable File Systems Parallel / Clustered Global Namespaces Persistence Protection Storage 2008 Storage Networking Industry Association. All Rights Reserved. 23

Why Are We Here? Typical Data Center Lots of Servers with Lots of Processors and Cores Lots of Disk Drives with Rotating Mechanical Media SOMETHING HAS TO CHANGE! 2008 Storage Networking Industry Association. All Rights Reserved. 24

Data Center Memory Options Wide Range of Deployment Choices Servers Appliances Network Devices Storage Systems PCI Cards Processors Memory Modules SSDs 2008 Storage Networking Industry Association. All Rights Reserved. 25

Ways to Use Memory Memory as Disk Individual host-visible LUN Actively managed storage Manual or software-assisted active data extraction Memory as Cache Transparent view Passively managed Automatic caching of active data set Memory-based LUN Memory-based Cache Disk-based LUN Disk-based LUN 2008 Storage Networking Industry Association. All Rights Reserved. 26

Making Use of Near-Infinite Disk Capacity Memorybased LUN Memorybased Cache Near- Infinite Diskbased Capacity 2008 Storage Networking Industry Association. All Rights Reserved. 27

Where to Cache L1, L2, L3 Cache Servers Network Controllers Disks Server Cache Network Cache Controller Cache Disk Cache Many Caching Options All Likely To Stick Around 2008 Storage Networking Industry Association. All Rights Reserved. 28

Comparing Cache Locations Low Device-Count Configurations Server or storage caching provides comprehensive reach Multiple Device Configurations Server or storage caching provides limited reach 2008 Storage Networking Industry Association. All Rights Reserved. 29

Advantages of Network Caching Network Caching for Multi- Device Configurations Maximum effectiveness and efficiency Optimizing Cache Locations Disks Servers Coverage/ Utilization Network Cache Proximity to server Ideal Location 2008 Storage Networking Industry Association. All Rights Reserved. 30

Enterprise Data Warehouse Solution Pros Single system, streamlined management Network caching for peak load handling Single Storage Management Point Poor Storage Health Good 2008 Storage Networking Industry Association. All Rights Reserved. 31

Software Development Solution Pros Memory-based, network caching for handling of small file and metadata requests Poor Good 2008 Storage Networking Industry Association. All Rights Reserved. Storage Health 32

Web Scale Application Solution Step 1 Indexing servers crawl database Lucene Servers 3 3 Step 3 Serve search requests 1 2 No propagation Immediate updates Database Step 2 Index servers generate index files Immediate access available from network cache NFS Storage 2008 Storage Networking Industry Association. All Rights Reserved. 33

Controlling High File Counts Conventional Model Numerous metadata requests Lengthy response times Inability to scale the number of users Centralized Caching Model Cache frequent requests Immediate response times Accelerate existing infrastructure performance Web/App Servers Web/App Servers Caching Appliance Disk Storage Disk Storage 2008 Storage Networking Industry Association. All Rights Reserved. 34

Q&A / Feedback Please send any questions or comments on this presentation to SNIA: Application Track Josh Tseng, Track Lead Rob Peglar Many thanks to the following individuals for their contributions to this tutorial. - SNIA Education Committee 35 2008 Storage Networking Industry Association. All Rights Reserved. 35