Operational Experiences with Disk Imaging in a Multi-Tenant Datacenter. Kevin Atkinson, Gary Wong, and Robert Ricci

Size: px
Start display at page:

Download "Operational Experiences with Disk Imaging in a Multi-Tenant Datacenter. Kevin Atkinson, Gary Wong, and Robert Ricci"

Transcription

1 Operational Experiences with Disk Imaging in a Multi-Tenant Datacenter Kevin Atkinson, Gary Wong, and Robert Ricci

2 2

3 2

4 2

5 2

6 2

7 2

8 2

9 Properties of disk images and their usage have consequences for:! Storage Caching Pre-loading Distribution 3

10 4

11 What does the working set look like? 4

12 What does the working set look like? What do the images themselves look like? 4

13 What does the working set look like? What do the images themselves look like? What are the key factors in pre-loading? 4

14 The dataset Four years ( ): 279,972 requests Users: 1,301 individuals, 368 organizations Unique images: 714 Emulab ~600 PCs Facility / user image model 5

15 User Behavior

16 Emulab is a pretty odd beast and its users are even weirder. 7

17 Emulab is a pretty odd beast and its users are even weirder. Reviewer D 7

18 Emulab is a pretty odd beast and its users are even weirder. Reviewer D [Emulab user] 7

19 Facility vs. user images Facility User 55.6% 44.4% 8

20 Facility vs. user images Facility User 55.6% 44.4% 8

21 Facility vs. user images Facility User 55.6% 44.4% 1) Most users stick to facility or user images 2) Heaviest users use their own images 8

22 Image popularity 9

23 Image popularity 9

24 Image popularity 9

25 Image popularity 9

26 Image popularity Exponential 9

27 Image popularity Exponential Heavy-Tailed 9

28 Image popularity 1) Facility images have a smaller, lighter tail 2) Most popular image < 13% of requests Exponential Heavy-Tailed 9

29 Scaling: total images 10

30 Scaling: total images 10

31 Scaling: total images 10

32 Scaling: total images As userbase grows, user images dominate the totals 10

33 Daily working set 11

34 Daily working set Small image set each day good caching potential 11

35 Scaling: working set 12

36 Scaling: working set 12

37 Scaling: working set 12

38 Scaling: working set Facility will max out 12

39 Scaling: working set Facility will max out In the limit, highly popular facility images account for most requests 12

40 Image Contents

41 Block-level similarity Base 14

42 Block-level similarity Base Derived 14

43 Block-level similarity Base Derived 14

44 Block-level similarity Base Derived Percentage of blocks that need to be written to transform the base image into derived 14

45 Block-level similarity Derived: User image Base: Most similar facility image 15

46 Block-level similarity Derived: User image Base: Most similar facility image 15

47 Block-level similarity 1) De-duplicating Derived: User storage image an attractive option 2) Differential Base: loading Most similar has potential facility image 15

48 Pre-Loading

49 Pre-loading: Size 17

50 Pre-loading: Size Spare Capacity 17

51 Pre-loading: Size Spare Capacity Mostly Full 17

52 Pre-loading: Size Spare Capacity WSS for facility Mostly Full images maxes out on large facilities 17

53 Pre-loading: Size 1) Key: Ratio of WSS to idle capacity 2) Effective when ratio is high Spare Capacity WSS for facility Mostly Full images maxes out on large facilities 17

54 Pre-loading: Rate 18

55 Pre-loading: Rate 18

56 Pre-loading: Rate Invest in fast, scalable imaging 18

57 Conclusions

58 General conclusions Deduplicating, two-tier storage attractive Caching can be effective Image lifespan, idle periods Treat facility and user images differently Facility better targets for pre-loading Differential loading requires new strategies Potential savings, outline of optimization problem Images per organization, WSS per week 20

59 Explore the data, reproduce our results:! 21

60 No dominant images 22

61 No dominant images No image dominates long-term, popular images change frequently 22

62 Image lifespan 23

63 Image lifespan A few days 23

64 Image lifespan A few days Four Years 23

65 Image lifespan A few days Four Years Two-tiered storage system attractive 23

66 Savings from deltas 24

67 Images per organization 25

68 Idle images 26

69 WSS per week 27

70 Top images RHL90-STD [D] 21, % FEDORA10-STD 18, % UBUNTU10-STD 14, % RHL90-STD 13, % FC4-UPDATE 12, % u 715/10 11, % FBSD410-STD 8, % FEDORA8-STD 8, % u 237/69 7, % u 296/35 7, % u 787/24 6, % UBUNTU70-STD 6, % UBUNTU12-64-STD 5, % 28

71 Size considerations Small facilities with few idle disks Pre-loading not valuable Large facilities - focus on: Scalable reloading mechanisms Prediction and optimization for user requests 29

Edge Datacenter Placement BY ABHISHEK GUPTA FRIDAY GROUP MEETING JUNE 10, 2016

Edge Datacenter Placement BY ABHISHEK GUPTA FRIDAY GROUP MEETING JUNE 10, 2016 Edge Datacenter Placement BY ABHISHEK GUPTA FRIDAY GROUP MEETING JUNE 10, 2016 Why the need for a Edge Datacenter? For cloud services, performance at the user end is very important. In recent years, the

More information

Reducing Costs in the Data Center Comparing Costs and Benefits of Leading Data Protection Technologies

Reducing Costs in the Data Center Comparing Costs and Benefits of Leading Data Protection Technologies Reducing Costs in the Data Center Comparing Costs and Benefits of Leading Data Protection Technologies November 2007 Reducing Costs in the Data Center Table of Contents The Increasingly Costly Data Center...1

More information

Rhinoback Online Backup. In-File Delta

Rhinoback Online Backup. In-File Delta December 2006 Table of Content 1 Introduction... 3 1.1 Differential Delta Mode... 3 1.2 Incremental Delta Mode... 3 2 Delta Generation... 4 3 Block Size Setting... 4 4 During Backup... 5 5 During Restore...

More information

Deduplication Storage System

Deduplication Storage System Deduplication Storage System Kai Li Charles Fitzmorris Professor, Princeton University & Chief Scientist and Co-Founder, Data Domain, Inc. 03/11/09 The World Is Becoming Data-Centric CERN Tier 0 Business

More information

Scale-out Data Deduplication Architecture

Scale-out Data Deduplication Architecture Scale-out Data Deduplication Architecture Gideon Senderov Product Management & Technical Marketing NEC Corporation of America Outline Data Growth and Retention Deduplication Methods Legacy Architecture

More information

SSD Admission Control for Content Delivery Networks

SSD Admission Control for Content Delivery Networks Technical Disclosure Commons Defensive Publications Series September 20, 2017 SSD Admission Control for Content Delivery Networks Dragos Ionescu Richard Schooler Kenneth Barr Follow this and additional

More information

Automated Storage Tiering on Infortrend s ESVA Storage Systems

Automated Storage Tiering on Infortrend s ESVA Storage Systems Automated Storage Tiering on Infortrend s ESVA Storage Systems White paper Abstract This white paper introduces automated storage tiering on Infortrend s ESVA storage arrays. Storage tiering can generate

More information

- SLED: single large expensive disk - RAID: redundant array of (independent, inexpensive) disks

- SLED: single large expensive disk - RAID: redundant array of (independent, inexpensive) disks RAID and AutoRAID RAID background Problem: technology trends - computers getting larger, need more disk bandwidth - disk bandwidth not riding moore s law - faster CPU enables more computation to support

More information

Announcements. Reading. Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) CMSC 412 S14 (lect 5)

Announcements. Reading. Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) CMSC 412 S14 (lect 5) Announcements Reading Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) 1 Relationship between Kernel mod and User Mode User Process Kernel System Calls User Process

More information

NetApp SolidFire and Pure Storage Architectural Comparison A SOLIDFIRE COMPETITIVE COMPARISON

NetApp SolidFire and Pure Storage Architectural Comparison A SOLIDFIRE COMPETITIVE COMPARISON A SOLIDFIRE COMPETITIVE COMPARISON NetApp SolidFire and Pure Storage Architectural Comparison This document includes general information about Pure Storage architecture as it compares to NetApp SolidFire.

More information

2012 Enterprise Strategy Group. Enterprise Strategy Group Getting to the bigger truth. TM

2012 Enterprise Strategy Group. Enterprise Strategy Group Getting to the bigger truth. TM 2012 Enterprise Strategy Group Enterprise Strategy Group Getting to the bigger truth. TM Enterprise Strategy Group Getting to the bigger truth. TM Virtualization Evolution and Storage Requirements Kim

More information

A study of practical deduplication

A study of practical deduplication A study of practical deduplication Dutch T. Meyer University of British Columbia Microsoft Research Intern William Bolosky Microsoft Research Why Dutch is Not Here A study of practical deduplication Dutch

More information

CSC630/CSC730 Parallel & Distributed Computing

CSC630/CSC730 Parallel & Distributed Computing CSC630/CSC730 Parallel & Distributed Computing Analytical Modeling of Parallel Programs Chapter 5 1 Contents Sources of Parallel Overhead Performance Metrics Granularity and Data Mapping Scalability 2

More information

WAN Optimized Replication of Backup Datasets Using Stream-Informed Delta Compression

WAN Optimized Replication of Backup Datasets Using Stream-Informed Delta Compression WAN Optimized Replication of Backup Datasets Using Stream-Informed Delta Compression Philip Shilane, Mark Huang, Grant Wallace, & Windsor Hsu Backup Recovery Systems Division EMC Corporation Introduction

More information

Data Storage Revolution

Data Storage Revolution Data Storage Revolution Relational Databases Object Storage (put/get) Dynamo PNUTS CouchDB MemcacheDB Cassandra Speed Scalability Availability Throughput No Complexity Eventual Consistency Write Request

More information

YOUR CONDUIT TO THE CLOUD

YOUR CONDUIT TO THE CLOUD COLOCATION YOUR CONDUIT TO THE CLOUD MASSIVE NETWORKS Enterprise-Class Data Transport Solutions SUMMARY COLOCATION PROVIDERS ARE EVERYWHERE. With so many to choose from, how do you know which one is right

More information

Analyzing Compute vs. Storage Tradeoff for Videoaware Storage Efficiency

Analyzing Compute vs. Storage Tradeoff for Videoaware Storage Efficiency Analyzing Compute vs. Storage Tradeoff for Videoaware Storage Efficiency Atish Kathpal, Mandar Kulkarni Ajay Bakre Advanced Technology Group NetApp Inc. 1 Context and Overview Trend: Number of devices

More information

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0

Storage Considerations for VMware vcloud Director. VMware vcloud Director Version 1.0 Storage Considerations for VMware vcloud Director Version 1.0 T e c h n i c a l W H I T E P A P E R Introduction VMware vcloud Director is a new solution that addresses the challenge of rapidly provisioning

More information

Protect enterprise data, achieve long-term data retention

Protect enterprise data, achieve long-term data retention Technical white paper Protect enterprise data, achieve long-term data retention HP StoreOnce Catalyst and Symantec NetBackup OpenStorage Table of contents Introduction 2 Technology overview 3 HP StoreOnce

More information

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades Evaluation report prepared under contract with Dot Hill August 2015 Executive Summary Solid state

More information

Web Caching and Content Delivery

Web Caching and Content Delivery Web Caching and Content Delivery Caching for a Better Web Performance is a major concern in the Web Proxy caching is the most widely used method to improve Web performance Duplicate requests to the same

More information

Caching & Tiering BPG

Caching & Tiering BPG Intro: SSD Caching and SSD Tiering functionality in the StorTrends 3500i offers the most intelligent performance possible from a hybrid storage array at the most cost-effective prices in the industry.

More information

Scale-out Object Store for PB/hr Backups and Long Term Archive April 24, 2014

Scale-out Object Store for PB/hr Backups and Long Term Archive April 24, 2014 Scale-out Object Store for PB/hr Backups and Long Term Archive April 24, 2014 Gideon Senderov Director, Advanced Storage Products NEC Corporation of America Long-Term Data in the Data Center (EB) 140 120

More information

Benefits of Storage Capacity Optimization Methods (COMs) And. Performance Optimization Methods (POMs)

Benefits of Storage Capacity Optimization Methods (COMs) And. Performance Optimization Methods (POMs) Benefits of Storage Capacity Optimization Methods (COMs) And Performance Optimization Methods (POMs) Herb Tanzer & Chuck Paridon Storage Product & Storage Performance Architects Hewlett Packard Enterprise

More information

IBM Real-time Compression and ProtecTIER Deduplication

IBM Real-time Compression and ProtecTIER Deduplication Compression and ProtecTIER Deduplication Two technologies that work together to increase storage efficiency Highlights Reduce primary storage capacity requirements with Compression Decrease backup data

More information

Workspace & Storage Infrastructure for Service Providers

Workspace & Storage Infrastructure for Service Providers Workspace & Storage Infrastructure for Service Providers Garry Soriano Regional Technical Consultant Citrix Cloud Channel Summit 2015 @rhipecloud #RCCS15 The industry s most complete Mobile Workspace solution

More information

Daniel A. Menascé, Ph. D. Dept. of Computer Science George Mason University

Daniel A. Menascé, Ph. D. Dept. of Computer Science George Mason University Daniel A. Menascé, Ph. D. Dept. of Computer Science George Mason University menasce@cs.gmu.edu www.cs.gmu.edu/faculty/menasce.html D. Menascé. All Rights Reserved. 1 Benchmark System Under Test (SUT) SPEC

More information

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu Database Architecture 2 & Storage Instructor: Matei Zaharia cs245.stanford.edu Summary from Last Time System R mostly matched the architecture of a modern RDBMS» SQL» Many storage & access methods» Cost-based

More information

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December

IOmark- VDI. IBM IBM FlashSystem V9000 Test Report: VDI a Test Report Date: 5, December IOmark- VDI IBM IBM FlashSystem V9000 Test Report: VDI- 151205- a Test Report Date: 5, December 2015 Copyright 2010-2015 Evaluator Group, Inc. All rights reserved. IOmark- VDI, IOmark- VM, VDI- IOmark,

More information

Ambry: LinkedIn s Scalable Geo- Distributed Object Store

Ambry: LinkedIn s Scalable Geo- Distributed Object Store Ambry: LinkedIn s Scalable Geo- Distributed Object Store Shadi A. Noghabi *, Sriram Subramanian +, Priyesh Narayanan +, Sivabalan Narayanan +, Gopalakrishna Holla +, Mammad Zadeh +, Tianwei Li +, Indranil

More information

New HPE 3PAR StoreServ 8000 and series Optimized for Flash

New HPE 3PAR StoreServ 8000 and series Optimized for Flash New HPE 3PAR StoreServ 8000 and 20000 series Optimized for Flash AGENDA HPE 3PAR StoreServ architecture fundamentals HPE 3PAR Flash optimizations HPE 3PAR portfolio overview HPE 3PAR Flash example from

More information

Solution Guide. 10 Non-Negotiables of IT Infrastructure Performance Management

Solution Guide. 10 Non-Negotiables of IT Infrastructure Performance Management Solution Guide 10 Non-Negotiables of IT Infrastructure Performance Management Many IT optimization efforts fail because of management s inability to recognize the importance of an integrated infrastructure

More information

A DEDUPLICATION-INSPIRED FAST DELTA COMPRESSION APPROACH W EN XIA, HONG JIANG, DA N FENG, LEI T I A N, M I N FU, YUKUN Z HOU

A DEDUPLICATION-INSPIRED FAST DELTA COMPRESSION APPROACH W EN XIA, HONG JIANG, DA N FENG, LEI T I A N, M I N FU, YUKUN Z HOU A DEDUPLICATION-INSPIRED FAST DELTA COMPRESSION APPROACH W EN XIA, HONG JIANG, DA N FENG, LEI T I A N, M I N FU, YUKUN Z HOU PRESENTED BY ROMAN SHOR Overview Technics of data reduction in storage systems:

More information

Peter X. Gao, Andrew R. Curtis, Bernard Wong, S. Keshav. Cheriton School of Computer Science University of Waterloo

Peter X. Gao, Andrew R. Curtis, Bernard Wong, S. Keshav. Cheriton School of Computer Science University of Waterloo Peter X. Gao, Andrew R. Curtis, Bernard Wong, S. Keshav Cheriton School of Computer Science University of Waterloo August 15, 2012 1 = ~1M servers CO 2 of 280,000 cars 2 Datacenters and Request Routing

More information

Performance of relational database management

Performance of relational database management Building a 3-D DRAM Architecture for Optimum Cost/Performance By Gene Bowles and Duke Lambert As systems increase in performance and power, magnetic disk storage speeds have lagged behind. But using solidstate

More information

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee Advanced PRESENTATION Data Reduction TITLE GOES HERE Concepts Tom Sas HP Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee SNIA Legal Notice The material contained in this tutorial

More information

Chapter 11. SnapProtect Technology

Chapter 11. SnapProtect Technology Chapter 11 SnapProtect Technology Hardware based snapshot technology provides the ability to use optimized hardware and disk appliances to snap data on disk arrays providing quick recovery by reverting

More information

I/O Considerations for Server Blades, Backplanes, and the Datacenter

I/O Considerations for Server Blades, Backplanes, and the Datacenter I/O Considerations for Server Blades, Backplanes, and the Datacenter 1 1 Contents Abstract 3 Enterprise Modular Computing 3 The Vision 3 The Path to Achieving the Vision 4 Bladed Servers 7 Managing Datacenter

More information

10/29/2013. Program Agenda. The Database Trifecta: Simplified Management, Less Capacity, Better Performance

10/29/2013. Program Agenda. The Database Trifecta: Simplified Management, Less Capacity, Better Performance Program Agenda The Database Trifecta: Simplified Management, Less Capacity, Better Performance Data Growth and Complexity Hybrid Columnar Compression Case Study & Real-World Experiences

More information

the past doesn t impact the future!

the past doesn t impact the future! Memoryless property: suppose time between session arrivals Z is exponentially distributed note: Pr{Z >y} = y be bt dt = e by suppose a session has not arrived for y seconds what is the probability that

More information

MAID for Archiving. Aloke Guha. COPAN Systems

MAID for Archiving. Aloke Guha. COPAN Systems Logo of Your Logo of Your organization organization MAID for Archiving Aloke Guha COPAN Systems 1900 Pike Road, Longmont, CO 80501-6775, US Phone:+01 303-532-0250 FAX: +01 303-532-0302 E-mail: aloke.guha@copansys.com

More information

SCI - NIH/NCRR Site. Web Log Analysis Yearly Report Report Range: 01/01/ :00:00-12/31/ :59:59.

SCI - NIH/NCRR Site. Web Log Analysis Yearly Report Report Range: 01/01/ :00:00-12/31/ :59:59. SCI - NIH/NCRR Site Web Log Analysis Yearly Report 2003 Report Range: 01/01/2003 00:00:00-12/31/2003 23:59:59 www.webtrends.com Table of Contents General Statistics...5 Page Views Over Time...8 Top Pages

More information

Cache Management for TelcoCDNs. Daphné Tuncer Department of Electronic & Electrical Engineering University College London (UK)

Cache Management for TelcoCDNs. Daphné Tuncer Department of Electronic & Electrical Engineering University College London (UK) Cache Management for TelcoCDNs Daphné Tuncer Department of Electronic & Electrical Engineering University College London (UK) d.tuncer@ee.ucl.ac.uk 06/01/2017 Agenda 1. Internet traffic: trends and evolution

More information

TITLE. the IT Landscape

TITLE. the IT Landscape The Impact of Hyperconverged Infrastructure on the IT Landscape 1 TITLE Drivers for adoption Lower TCO Speed and Agility Scale Easily Operational Simplicity Hyper-converged Integrated storage & compute

More information

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December

IOmark- VM. IBM IBM FlashSystem V9000 Test Report: VM a Test Report Date: 5, December IOmark- VM IBM IBM FlashSystem V9000 Test Report: VM- 151205- a Test Report Date: 5, December 2015 Copyright 2010-2015 Evaluator Group, Inc. All rights reserved. IOmark- VM, IOmark- VDI, VDI- IOmark, and

More information

Chapter 7. GridStor Technology. Adding Data Paths. Data Paths for Global Deduplication. Data Path Properties

Chapter 7. GridStor Technology. Adding Data Paths. Data Paths for Global Deduplication. Data Path Properties Chapter 7 GridStor Technology GridStor technology provides the ability to configure multiple data paths to storage within a storage policy copy. Having multiple data paths enables the administrator to

More information

Flash Cache Best Practice Guide

Flash Cache Best Practice Guide Technical Report Flash Cache Best Practice Guide Skip Shapiro, NetApp November 2013 TR-3832 Abstract NetApp Flash Cache and Flash Cache 2 cards, and caching software embedded in the Data ONTAP operating

More information

Trusted Disk Loading in the Emulab Network Testbed. Cody Cutler, Mike Hibler, Eric Eide, Rob Ricci

Trusted Disk Loading in the Emulab Network Testbed. Cody Cutler, Mike Hibler, Eric Eide, Rob Ricci Trusted Disk Loading in the Emulab Network Testbed Cody Cutler, Mike Hibler, Eric Eide, Rob Ricci 1 Emulab Public network testbed Create complex experiments quickly 500+ nodes at Utah Emulab 2 Emulab Nodes

More information

for the SNIA Emerald TM Power Efficiency Measurement Specification Version 2.0 Revision 2

for the SNIA Emerald TM Power Efficiency Measurement Specification Version 2.0 Revision 2 User Guide for the SNIA Emerald TM Power Efficiency Measurement Specification Version 2.0 Revision 2 June 21, 2013 About the SNIA The Storage Networking Industry Association (SNIA) is a not for profit

More information

s390 zlinux at Citi Presented by Doctor P. Robinson June 5, 2013 Hillgang Citi Managing zlinux in a Heterogenous Enterprise

s390 zlinux at Citi Presented by Doctor P. Robinson June 5, 2013 Hillgang Citi Managing zlinux in a Heterogenous Enterprise s390 zlinux at Citi Presented by Doctor P. Robinson June 5, 2013 Hillgang Biography: Doctor P. Robinson Sr. Infrastructure Tech Specialist / SME 17 years with Citi Over 40 years in Operation and System

More information

EMC Integrated Infrastructure for VMware. Business Continuity

EMC Integrated Infrastructure for VMware. Business Continuity EMC Integrated Infrastructure for VMware Business Continuity Enabled by EMC Celerra and VMware vcenter Site Recovery Manager Reference Architecture Copyright 2009 EMC Corporation. All rights reserved.

More information

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year Compelling Economics: Traditional Storage vs. StorSimple Traditional Storage + Data Protection Architecture StorSimple Cloud-integrated Storage Servers Servers Primary Volume Disk Array ($100K; Double

More information

Deadline Guaranteed Service for Multi- Tenant Cloud Storage Guoxin Liu and Haiying Shen

Deadline Guaranteed Service for Multi- Tenant Cloud Storage Guoxin Liu and Haiying Shen Deadline Guaranteed Service for Multi- Tenant Cloud Storage Guoxin Liu and Haiying Shen Presenter: Haiying Shen Associate professor *Department of Electrical and Computer Engineering, Clemson University,

More information

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family Dell MD Family Modular storage The Dell MD storage family Dell MD Family Simplifying IT The Dell MD Family simplifies IT by optimizing your data storage architecture and ensuring the availability of your

More information

Distributed File Systems Part II. Distributed File System Implementation

Distributed File Systems Part II. Distributed File System Implementation s Part II Daniel A. Menascé Implementation File Usage Patterns File System Structure Caching Replication Example: NFS 1 Implementation: File Usage Patterns Static Measurements: - distribution of file size,

More information

How Architecture Design Can Lower Hyperconverged Infrastructure (HCI) Total Cost of Ownership (TCO)

How Architecture Design Can Lower Hyperconverged Infrastructure (HCI) Total Cost of Ownership (TCO) Economic Insight Paper How Architecture Design Can Lower Hyperconverged Infrastructure (HCI) Total Cost of Ownership (TCO) By Eric Slack, Sr. Analyst December 2017 Enabling you to make the best technology

More information

White Paper Features and Benefits of Fujitsu All-Flash Arrays for Virtualization and Consolidation ETERNUS AF S2 series

White Paper Features and Benefits of Fujitsu All-Flash Arrays for Virtualization and Consolidation ETERNUS AF S2 series White Paper Features and Benefits of Fujitsu All-Flash Arrays for Virtualization and Consolidation Fujitsu All-Flash Arrays are extremely effective tools when virtualization is used for server consolidation.

More information

Cost Model Energy Benefits DirectAire & SmartAire Overview & Explanation

Cost Model Energy Benefits DirectAire & SmartAire Overview & Explanation Cost Model Energy Benefits DirectAire & SmartAire Overview & Explanation A cost model (See figure 1) has been created to provide the user a simplified method for directly comparing the energy cost of a

More information

Basic Memory Management. Basic Memory Management. Address Binding. Running a user program. Operating Systems 10/14/2018 CSC 256/456 1

Basic Memory Management. Basic Memory Management. Address Binding. Running a user program. Operating Systems 10/14/2018 CSC 256/456 1 Basic Memory Management Program must be brought into memory and placed within a process for it to be run Basic Memory Management CS 256/456 Dept. of Computer Science, University of Rochester Mono-programming

More information

Naming in Distributed Systems

Naming in Distributed Systems Naming in Distributed Systems Dr. Yong Guan Department of Electrical and Computer Engineering & Information Assurance Center Iowa State University Outline for Today s Talk Overview: Names, Identifiers,

More information

Smart Routing. Requests

Smart Routing. Requests Smart Routing of Requests Martin Klein 1 Lyudmila Balakireva 1 Harihar Shankar 1 James Powell 1 Herbert Van de Sompel 2 1 Research Library Los Alamos National Laboratory 2 Data Archiving and Networked

More information

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters ARTICLE Using Alluxio to Improve the Performance and Consistency of HDFS Clusters Calvin Jia Software Engineer at Alluxio Learn how Alluxio is used in clusters with co-located compute and storage to improve

More information

Addressing the Stranded Power Problem in Datacenters using Storage Workload Characterization. January 30 th, 2010 Sriram Sankar and Kushagra Vaid

Addressing the Stranded Power Problem in Datacenters using Storage Workload Characterization. January 30 th, 2010 Sriram Sankar and Kushagra Vaid Addressing the Stranded Power Problem in Datacenters using Storage Workload Characterization January 30 th, 2010 Sriram Sankar and Kushagra Vaid 1 Microsoft Online Services Across the company, all over

More information

Trusted Disk Loading in the Emulab Network Testbed. Cody Cutler, Eric Eide, Mike Hibler, Rob Ricci

Trusted Disk Loading in the Emulab Network Testbed. Cody Cutler, Eric Eide, Mike Hibler, Rob Ricci Trusted Disk Loading in the Emulab Network Testbed Cody Cutler, Eric Eide, Mike Hibler, Rob Ricci 1 Emulab Public network testbed Create complex experiments quickly 500+ nodes at Utah Emulab 2 Emulab Nodes

More information

ChunkStash: Speeding Up Storage Deduplication using Flash Memory

ChunkStash: Speeding Up Storage Deduplication using Flash Memory ChunkStash: Speeding Up Storage Deduplication using Flash Memory Biplob Debnath +, Sudipta Sengupta *, Jin Li * * Microsoft Research, Redmond (USA) + Univ. of Minnesota, Twin Cities (USA) Deduplication

More information

Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems

Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems NETAPP TECHNICAL REPORT Performance Report: Multiprotocol Performance Test of VMware ESX 3.5 on NetApp Storage Systems A Performance Comparison Study of FC, iscsi, and NFS Protocols Jack McLeod, NetApp

More information

#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks

#mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks 1 #mytweet via Instagram: Exploring User Behaviour Across Multiple Social Networks Bang Hui Lim, Dongyuan Lu Tao Chen, Min-yen Kan Web Information Retrieval / Natural Language Processing Group 1. http://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/

More information

UW-ATLAS Experiences with Condor

UW-ATLAS Experiences with Condor UW-ATLAS Experiences with Condor M.Chen, A. Leung, B.Mellado Sau Lan Wu and N.Xu Paradyn / Condor Week, Madison, 05/01/08 Outline Our first success story with Condor - ATLAS production in 2004~2005. CRONUS

More information

Locality of Reference

Locality of Reference Locality of Reference 1 In view of the previous discussion of secondary storage, it makes sense to design programs so that data is read from and written to disk in relatively large chunks but there is

More information

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation Hui Wang, Peter Varman Rice University FAST 14, Feb 2014 Tiered Storage Tiered storage: HDs and SSDs q Advantages:

More information

V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond David Adams & Dan Beckett. All rights reserved.

V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond David Adams & Dan Beckett. All rights reserved. Summit 97 V6 Programming Fundamentals: Part 1 Stored Procedures and Beyond by David Adams & Dan Beckett 1997 David Adams & Dan Beckett. All rights reserved. Content adapted from Programming 4th Dimension:

More information

Virtualization of the MS Exchange Server Environment

Virtualization of the MS Exchange Server Environment MS Exchange Server Acceleration Maximizing Users in a Virtualized Environment with Flash-Powered Consolidation Allon Cohen, PhD OCZ Technology Group Introduction Microsoft (MS) Exchange Server is one of

More information

Storage Infrastructure Optimization

Storage Infrastructure Optimization Brussels, Belgium, 13th TF-Storage Meeting Denis Stanarevic, IBM SIO South East Europe Storage Infrastructure Optimization Setup the scene Cost Categories Environment Time To run... You need The complete

More information

Chronological Backtracking Conflict Directed Backjumping Dynamic Backtracking Branching Strategies Branching Heuristics Heavy Tail Behavior

Chronological Backtracking Conflict Directed Backjumping Dynamic Backtracking Branching Strategies Branching Heuristics Heavy Tail Behavior PART III: Search Outline Depth-first Search Chronological Backtracking Conflict Directed Backjumping Dynamic Backtracking Branching Strategies Branching Heuristics Heavy Tail Behavior Best-First Search

More information

Storage Performance Validation for Panzura

Storage Performance Validation for Panzura Storage Performance Validation for Panzura Ensuring seamless cloud storage performance for Panzura s Quicksilver Product Suite WHITEPAPER Table of Contents Background on Panzura...3 Storage Performance

More information

repostor would like to lower your TCO

repostor would like to lower your TCO repostor would like to lower your TCO By reading this document you will get our answers to: ROI (Return on Investment) Up to 43 times space savings TCO (Total Cost of Ownership) RTO (Recovery Time Objective)

More information

Small verse Large. The Performance Tester Paradox. Copyright 1202Performance

Small verse Large. The Performance Tester Paradox. Copyright 1202Performance Small verse Large The Performance Tester Paradox The Paradox Why do people want performance testing? To stop performance problems in production How do we ensure this? Performance test with Realistic workload

More information

Isilon InsightIQ. Version User Guide

Isilon InsightIQ. Version User Guide Isilon InsightIQ Version 4.1.1 User Guide Copyright 2009-2017 Dell Inc. or its subsidiaries. All rights reserved. Published January 2017 Dell believes the information in this publication is accurate as

More information

Technical Note P/N REV A01 March 29, 2007

Technical Note P/N REV A01 March 29, 2007 EMC Symmetrix DMX-3 Best Practices Technical Note P/N 300-004-800 REV A01 March 29, 2007 This technical note contains information on these topics: Executive summary... 2 Introduction... 2 Tiered storage...

More information

Simply Top Talkers Jeroen Massar, Andreas Kind and Marc Ph. Stoecklin

Simply Top Talkers Jeroen Massar, Andreas Kind and Marc Ph. Stoecklin IBM Research - Zurich Simply Top Talkers Jeroen Massar, Andreas Kind and Marc Ph. Stoecklin 2009 IBM Corporation Motivation and Outline Need to understand and correctly handle dominant aspects within the

More information

Memory Systems IRAM. Principle of IRAM

Memory Systems IRAM. Principle of IRAM Memory Systems 165 other devices of the module will be in the Standby state (which is the primary state of all RDRAM devices) or another state with low-power consumption. The RDRAM devices provide several

More information

Cascade Mapping: Optimizing Memory Efficiency for Flash-based Key-value Caching

Cascade Mapping: Optimizing Memory Efficiency for Flash-based Key-value Caching Cascade Mapping: Optimizing Memory Efficiency for Flash-based Key-value Caching Kefei Wang and Feng Chen Louisiana State University SoCC '18 Carlsbad, CA Key-value Systems in Internet Services Key-value

More information

Multimedia Streaming. Mike Zink

Multimedia Streaming. Mike Zink Multimedia Streaming Mike Zink Technical Challenges Servers (and proxy caches) storage continuous media streams, e.g.: 4000 movies * 90 minutes * 10 Mbps (DVD) = 27.0 TB 15 Mbps = 40.5 TB 36 Mbps (BluRay)=

More information

10/23/12. Outline. Part 6. Trees (3) Example: A B-tree of degree 5. B-tree of degree m. Inserting 55. Inserting 55. B-Trees External Methods

10/23/12. Outline. Part 6. Trees (3) Example: A B-tree of degree 5. B-tree of degree m. Inserting 55. Inserting 55. B-Trees External Methods Outline Part 6. Trees (3) B-Trees External Methods CS 200 Algorithms and Data Structures 1 2 B-tree of degree m All leaves are at the same level Each node contains between m-1 and floor((m-2)/2) s (except

More information

Meet the Walkers! Accelerating Index Traversals for In-Memory Databases"

Meet the Walkers! Accelerating Index Traversals for In-Memory Databases Meet the Walkers! Accelerating Index Traversals for In-Memory Databases Onur Kocberber Boris Grot, Javier Picorel, Babak Falsafi, Kevin Lim, Parthasarathy Ranganathan Our World is Data-Driven! Data resides

More information

The storage challenges of virtualized environments

The storage challenges of virtualized environments The storage challenges of virtualized environments The virtualization challenge: Ageing and Inflexible storage architectures Mixing of platforms causes management complexity Unable to meet the requirements

More information

HOW DATA DEDUPLICATION WORKS A WHITE PAPER

HOW DATA DEDUPLICATION WORKS A WHITE PAPER HOW DATA DEDUPLICATION WORKS A WHITE PAPER HOW DATA DEDUPLICATION WORKS ABSTRACT IT departments face explosive data growth, driving up costs of storage for backup and disaster recovery (DR). For this reason,

More information

CSE544 Database Architecture

CSE544 Database Architecture CSE544 Database Architecture Tuesday, February 1 st, 2011 Slides courtesy of Magda Balazinska 1 Where We Are What we have already seen Overview of the relational model Motivation and where model came from

More information

RAMCloud. Scalable High-Performance Storage Entirely in DRAM. by John Ousterhout et al. Stanford University. presented by Slavik Derevyanko

RAMCloud. Scalable High-Performance Storage Entirely in DRAM. by John Ousterhout et al. Stanford University. presented by Slavik Derevyanko RAMCloud Scalable High-Performance Storage Entirely in DRAM 2009 by John Ousterhout et al. Stanford University presented by Slavik Derevyanko Outline RAMCloud project overview Motivation for RAMCloud storage:

More information

Next-Generation Cloud Platform

Next-Generation Cloud Platform Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology

More information

WHY SECURE MULTI-TENANCY WITH DATA DOMAIN SYSTEMS?

WHY SECURE MULTI-TENANCY WITH DATA DOMAIN SYSTEMS? Why Data Domain Series WHY SECURE MULTI-TENANCY WITH DATA DOMAIN SYSTEMS? Why you should take the time to read this paper Provide data isolation by tenant (Secure logical data isolation for each tenant

More information

G Disco. Robert Grimm New York University

G Disco. Robert Grimm New York University G22.3250-001 Disco Robert Grimm New York University The Three Questions! What is the problem?! What is new or different?! What are the contributions and limitations? Background: ccnuma! Cache-coherent

More information

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team Isilon: Raising The Bar On Performance & Archive Use Cases John Har Solutions Product Manager Unstructured Data Storage Team What we ll cover in this session Isilon Overview Streaming workflows High ops/s

More information

Backup 2.0: Simply Better Data Protection

Backup 2.0: Simply Better Data Protection Simply Better Protection 2.0: Simply Better Protection Gain Net Savings of $15 for Every $1 Invested on B2.0 Technologies Executive Summary Traditional backup methods are reaching their technology end-of-life.

More information

CSE 486/586 Distributed Systems

CSE 486/586 Distributed Systems CSE 486/586 Distributed Systems Content Distribution Networks Slides by Steve Ko Computer Sciences and Engineering University at Buffalo CSE 486/586 Understanding Your Workload Engineering principle Make

More information

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE WHITEPAPER DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE A Detailed Review ABSTRACT While tape has been the dominant storage medium for data protection for decades because of its low cost, it is steadily

More information

Inter-Data-Center Network Traffic Prediction with Elephant Flows

Inter-Data-Center Network Traffic Prediction with Elephant Flows Inter-Data-Center Network Traffic Prediction with Elephant Flows Yi Li, Hong Liu, Wenjun Yang, Dianming Hu, Wei Xu Institute for Interdisciplinary Information Sciences, Tsinghua University Baidu Inc. Inter-Data-Center

More information

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family Dell PowerVault MD Family Modular storage The Dell PowerVault MD storage family Dell PowerVault MD Family The affordable choice The Dell PowerVault MD family is an affordable choice for reliable storage.

More information

Performance Analysis in the Real World of Online Services

Performance Analysis in the Real World of Online Services Performance Analysis in the Real World of Online Services Dileep Bhandarkar, Ph. D. Distinguished Engineer 2009 IEEE International Symposium on Performance Analysis of Systems and Software My Background:

More information

Deduplication has been around for several

Deduplication has been around for several Demystifying Deduplication By Joe Colucci Kay Benaroch Deduplication holds the promise of efficient storage and bandwidth utilization, accelerated backup and recovery, reduced costs, and more. Understanding

More information