A Case for Packing and Indexing in Cloud File Systems

Similar documents
A Case for Packing and Indexing in Cloud File Systems

MATE-EC2: A Middleware for Processing Data with Amazon Web Services

Database Workload. from additional misses in this already memory-intensive databases? interference could be a problem) Key question:

Amazon Elastic File System

Amazon Aurora Deep Dive

Big Data solution benchmark

BlobPorter Documentation. Jesus Aguilar

ZBD: Using Transparent Compression at the Block Level to Increase Storage Space Efficiency

PebblesDB: Building Key-Value Stores using Fragmented Log Structured Merge Trees

Azure-persistence MARTIN MUDRA

Pocket: Elastic Ephemeral Storage for Serverless Analytics

AWS Storage Gateway. Not your father s hybrid storage. University of Arizona IT Summit October 23, Jay Vagalatos, AWS Solutions Architect

YCSB++ Benchmarking Tool Performance Debugging Advanced Features of Scalable Table Stores

4 Myths about in-memory databases busted

Data Processing at the Speed of 100 Gbps using Apache Crail. Patrick Stuedi IBM Research

Amazon Aurora Deep Dive

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018

Analyzing the High Performance Parallel I/O on LRZ HPC systems. Sandra Méndez. HPC Group, LRZ. June 23, 2016

How to go serverless with AWS Lambda

SMORE: A Cold Data Object Store for SMR Drives

Experiences with Serverless Big Data

DASH COPY GUIDE. Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 31

Amazon Aurora Deep Dive

Atlas: Baidu s Key-value Storage System for Cloud Data

Get ready to be what s next.

Strata: A Cross Media File System. Youngjin Kwon, Henrique Fingler, Tyler Hunt, Simon Peter, Emmett Witchel, Thomas Anderson

SHRD: Improving Spatial Locality in Flash Storage Accesses by Sequentializing in Host and Randomizing in Device

Data Processing at the Speed of 100 Gbps using Apache Crail. Patrick Stuedi IBM Research

Cumulus: Filesystem Backup to the Cloud

Georgia Institute of Technology ECE6102 4/20/2009 David Colvin, Jimmy Vuong

Flash-Conscious Cache Population for Enterprise Database Workloads

ADAPTIVE STREAMING AT. Justin Ruggles Lead Engineer, Transcoding & Delivery

AWS: Basic Architecture Session SUNEY SHARMA Solutions Architect: AWS

The Impact of SSD Selection on SQL Server Performance. Solution Brief. Understanding the differences in NVMe and SATA SSD throughput

SQL SERVER Lubo Goryl Solution Professional Microsoft Slovakia

Data Center TCP (DCTCP)

Dynamic Vertical Memory Scalability for OpenJDK Cloud Applications

YCSB++ benchmarking tool Performance debugging advanced features of scalable table stores

HashKV: Enabling Efficient Updates in KV Storage via Hashing

Google File System. Arun Sundaram Operating Systems

MONITORING SERVERLESS ARCHITECTURES

FlashBlox: Achieving Both Performance Isolation and Uniform Lifetime for Virtualized SSDs

OS-caused Long JVM Pauses - Deep Dive and Solutions

Agenda. Introduction Storage Primer Block Storage Shared File Systems Object Store On-Premises Storage Integration

POSTGRESQL ON AWS: TIPS & TRICKS (AND HORROR STORIES) ALEXANDER KUKUSHKIN. PostgresConf US

Sparse Indexing: Large-Scale, Inline Deduplication Using Sampling and Locality

Improving throughput for small disk requests with proximal I/O

Vlad Vinogradsky

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

MapReduce. U of Toronto, 2014

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

CA485 Ray Walshe Google File System

Scalability and Performance of Dell EMC OpenManage Essentials (OME) Version 2.3

SSD Architecture for Consistent Enterprise Performance

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters

Live Migration: Even faster, now with a dedicated thread!

Architectural challenges for building a low latency, scalable multi-tenant data warehouse

10/26/2017 Universal Java GC analysis tool - Java Garbage collection log analysis made easy

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

Single-pass restore after a media failure. Caetano Sauer, Goetz Graefe, Theo Härder

IBM Education Assistance for z/os V2R2

White Paper: Bulk Data Migration in the Cloud. Migration in the Cloud

Reshaping Text Data for Efficient Processing on Amazon EC2. Gabriela Turcu, Ian Foster, Svetlozar Nestorov

Erik Riedel Hewlett-Packard Labs

ParaFS: A Log-Structured File System to Exploit the Internal Parallelism of Flash Devices

and data combined) is equal to 7% of the number of instructions. Miss Rate with Second- Level Cache, Direct- Mapped Speed

Performance Characterization of ONTAP Cloud in Amazon Web Services with Application Workloads

Using Transparent Compression to Improve SSD-based I/O Caches

PYTHIA: Improving Datacenter Utilization via Precise Contention Prediction for Multiple Co-located Workloads

AMAZON S3 FOR SCIENCE GRIDS: A VIABLE SOLUTION?

Cloudera s Enterprise Data Hub on the Amazon Web Services Cloud: Quick Start Reference Deployment October 2014

Cloud scaling of Visual Weather

GearDB: A GC-free Key-Value Store on HM-SMR Drives with Gear Compaction

Staged Memory Scheduling

NPTEL Course Jan K. Gopinath Indian Institute of Science

HIGH PERFORMANCE SANLESS CLUSTERING THE POWER OF FUSION-IO THE PROTECTION OF SIOS


Predictable Time-Sharing for DryadLINQ Cluster. Sang-Min Park and Marty Humphrey Dept. of Computer Science University of Virginia

vrealize Business for Cloud User Guide

Lecture 15: Datacenter TCP"

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL

Serverless Computing: Design, Implementation, and Performance. Garrett McGrath and Paul R. Brenner

D E N A L I S T O R A G E I N T E R F A C E. Laura Caulfield Senior Software Engineer. Arie van der Hoeven Principal Program Manager

Appendix B. Standards-Track TCP Evaluation

Diagnosis via monitoring & tracing

Virtual SQL Servers. Actual Performance. 2016

CS252 S05. CMSC 411 Computer Systems Architecture Lecture 18 Storage Systems 2. I/O performance measures. I/O performance measures

Name: Instructions. Problem 1 : Short answer. [48 points] CMU / Storage Systems 25 Feb 2009 Spring 2010 Exam 1

File Systems Management and Examples

Operating Systems. File Systems. Thomas Ropars.

Cloud e Datacenter Networking

EECS750: Advanced Operating Systems. 2/24/2014 Heechul Yun

Lessons from Post-processing Climate Data on Modern Flash-based HPC Systems

Exam4Tests. Latest exam questions & answers help you to pass IT exam test easily

pblk the OCSSD FTL Linux FAST Summit 18 Javier González Copyright 2018 CNEX Labs

Data Centers and Cloud Computing

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

data within a computer system are stored in one of 2 physical states (hence the use of binary digits)

Amazon Aurora Relational databases reimagined.

Deep Dive on Amazon Elastic File System

Transcription:

A Case for Packing and Indexing in Cloud File Systems Saurabh Kadekodi, Bin Fan*, Adit Madan*, Garth Gibson and Greg Ganger University, *Alluxio Inc.

In a nutshell Cloud object stores have a per-operation pricing structure Small object workloads have poor performance and high cost Packing Performance Price http://www.pdl.cmu.edu/!2

Cloud file systems Client S3, Azure, GCS AWS, GCE, etc. Object store Natural approach is 1:1 file to object mapping; bad for small files Packing (batching or coalescing) small files improves local FS performance How much does packing help in cloud file systems? http://www.pdl.cmu.edu/!3

Motivation Performance Price http://www.pdl.cmu.edu/!4

Performance Motivation Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 r4.4xlarge r4.4xlarge r4.4xlarge r4.4xlarge 4KiB objects 1 object per 1 file to S3 3.2M objects of 4KiB each = 12.5GiB http://www.pdl.cmu.edu/!5

Performance Motivation Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 r4.4xlarge r4.4xlarge r4.4xlarge r4.4xlarge 4MiB objects 1 object per 1K files to S3 3.2K objects of 4MiB each = 12.5GiB http://www.pdl.cmu.edu/!5

Performance Motivation Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 r4.4xlarge r4.4xlarge r4.4xlarge r4.4xlarge 40MiB objects 1 object per 10K files to S3 320 objects of 40MiB each = 12.5GiB http://www.pdl.cmu.edu/!5

Performance Motivation Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 r4.4xlarge r4.4xlarge r4.4xlarge r4.4xlarge 400MiB objects 1 object per 100K files to S3 32 objects of 400MiB each = 12.5GiB http://www.pdl.cmu.edu/!5

Throughput By Object Size Throughput (MB / s) 500 375 250 125 0.42 0 4KB (1:1) 4MB (1K:1) 40MB (10K:1) 400MB (100K:1) 355.5 Object Size 457.1 492.3 http://www.pdl.cmu.edu/!6

S3 Throttling in Action PUT req / sec PUT Req / min 166 10000 1.66 100 333 83 20000 5000 S3 Exceptions / min S3 Exceptions / sec 02:45 02:50 02:55 03:00 03:05 03:10 03:15 03:20 03:25 03:30 03:35 03:40 03:45 03:50 03:55 04:00 04:05 04:10 04:15 04:20 Time (hh:mm) Request rate throttling by S3 for 4KiB writes S3 warns routinely requesting >100 PUT req / s subject to throttling NW bandwidth of r4.4xlarge instances ~ 10 Gb / sec minimum 13MB writes to avoid being throttled packing http://www.pdl.cmu.edu/!7

Motivation Performance Price http://www.pdl.cmu.edu/!8

Price Motivation S3 PUT, COPY, POST GET Data Retrieval Data Storage Standard $0.05 / 10K req $0.04 / 100K req Free (for certain data center locations) $0.023 / GB-month 1400 Operation cost Storage cost Say we want to store 1TB per month as 4KiB objects, approx 100 req/s for a month Price ($) 1050 700 350 Percentage of operation costs 98% Nil 16% <0.001% 0 S3 EFS DynamoDB Packing Data ingest methodology http://www.pdl.cmu.edu/

Price Motivation S3 PUT, COPY, POST GET Data Retrieval Data Storage Standard $0.05 / 10K req $0.04 / 100K req Free (for certain data center locations) $0.023 / GB-month 1400 Operation cost Storage cost Say we want to store 1TB per month as 4KiB objects, approx 100 req/s for a month Price ($) 1050 700 350 Percentage of data storage costs 2% 100% 84% >99.999% 0 S3 EFS DynamoDB Packing Data ingest methodology http://www.pdl.cmu.edu/

Price Motivation S3 PUT, COPY, POST GET Data Retrieval Data Storage Standard $0.05 / 10K req $0.04 / 100K req Free (for certain data center locations) $0.023 / GB-month 1400 Operation cost Storage cost Say we want to store 1TB per month as 4KiB objects, approx 100 req/s for a month Price ($) 1050 700 350 Percentage of data storage costs Packing cost split almost equal to EFS, but EFS storage cost >10x S3 storage costs 2% 100% 84% >99.999% 0 S3 EFS DynamoDB Packing Data ingest methodology http://www.pdl.cmu.edu/

Packing in Alluxio Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 to S3 http://www.pdl.cmu.edu/!10

Packing in Alluxio AWS Master Packed blob Alluxio Worker 1 Worker 2 Worker 3 Buffer for packing Worker 4 Index to S3 http://www.pdl.cmu.edu/!10

Packing in Alluxio Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 to S3 http://www.pdl.cmu.edu/!10

Packing in Alluxio Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 to S3 http://www.pdl.cmu.edu/!10

Packing in Alluxio Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 to S3 http://www.pdl.cmu.edu/!10

A Packed Blob Packing policy determines what to pack Triggered by dirty bytes & timeout Embedded Index blob-extent 1 blob-extent 2 blob-extent k Blob Extent: alluxio-path:logical-offset:physical-offset:length http://www.pdl.cmu.edu/!11

Evaluation http://www.pdl.cmu.edu/!12

Configuration Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 http://www.pdl.cmu.edu/!13

Configuration Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 http://www.pdl.cmu.edu/!13

Configuration AWS Master r4.4xlarge EC2 instance Alluxio Worker 1 Worker 2 Worker 3 Worker 4 http://www.pdl.cmu.edu/!13

Configuration AWS Master r4.4xlarge EC2 instance Alluxio 8 workload generators Worker 1 Worker 2 Worker 3 Worker 4 http://www.pdl.cmu.edu/!13

Configuration AWS Master r4.4xlarge EC2 instance Alluxio 8 workload generators Worker 1 Worker 2 Worker 3 Worker 4 100K files; 8KB each http://www.pdl.cmu.edu/!13

Configuration Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 http://www.pdl.cmu.edu/!13

Configuration Alluxio AWS Master Worker 1 Worker 2 Worker 3 Worker 4 800K files 800K files 800K files 800K files 3.2M files of 8KiB = 24.4GB http://www.pdl.cmu.edu/!13

Packing Configuration Max blob size: 1 GB Packing interval: 5 sec # Packing threads: 16 Index # Master threads: 16 Backup interval: 1 min http://www.pdl.cmu.edu/!14

Motivation Revisited Performance Price http://www.pdl.cmu.edu/!15

Write Performance Comparison 100 71.5 Direct 10000 Packed 21780 60x 30 22.5 Data Metadata 0.9 MB / s 10 60x Seconds 100 355 GB 15 24.4 24.4 7.5 1 1.2 Throughput Comparison 1 Runtime Comparison http://www.pdl.cmu.edu/!16 0 Direct Packing

Motivation Revisited Performance Price http://www.pdl.cmu.edu/!17

S3 Data Ingest Price 10000 1729 1000000 257820 25000x Direct Packed 1 1000 0 0.00067 Data Ingest Cost 1 Rate-Limited Retries Request rate is throttled much more than data rate http://www.pdl.cmu.edu/!18

Research questions (feedback) 1 Price with packing = price without packing 25000 Will fine-tuning help? By how much? Are there workload specific tuning opportunities / challenges? Garbage collection of packed blobs Different from LSM Trees, LFS, TableFS, etc. Cost-driven GC policies? Interference with foreground workload? http://www.pdl.cmu.edu/!19

Conclusion Packing Performance Price Price reduction because of: Matching application write sizes to storage system write sizes Elimination of retries due to aggressive rate-limiting imposed by S3-like services saukad@cs.cmu.edu http://www.pdl.cmu.edu/!20