Active Disks - Remote Execution

Similar documents
Active Disks For Large-Scale Data Mining and Multimedia

Active Disks - Remote Execution

Erik Riedel Hewlett-Packard Labs

Active Storage For Large-Scale Data Mining and Multimedia Applications

Active Disks For Large-Scale Data Mining

NASD: Network-Attached Secure Disks

Active Storage For Large-Scale Data Mining and Multimedia

CPS104 Computer Organization and Programming Lecture 18: Input-Output. Outline of Today s Lecture. The Big Picture: Where are We Now?

Quiz for Chapter 6 Storage and Other I/O Topics 3.10

Object-based Storage Devices (OSD) T10 Standard

Slides compliment of Yong Chen and Xian-He Sun From paper Reevaluating Amdahl's Law in the Multicore Era. 11/16/2011 Many-Core Computing 2

Database Workload. from additional misses in this already memory-intensive databases? interference could be a problem) Key question:

Hardware Acceleration of Database Operations

How TokuDB Fractal TreeTM. Indexes Work. Bradley C. Kuszmaul. MySQL UC 2010 How Fractal Trees Work 1

Chapter 1: Fundamentals of Quantitative Design and Analysis

File Server Scaling with Network-Attached Secure Disks (NASD)

TECHNOLOGY BRIEF. Compaq 8-Way Multiprocessing Architecture EXECUTIVE OVERVIEW CONTENTS

Introduction to TCP/IP Offload Engine (TOE)

Sorting. Overview. External sorting. Warm up: in memory sorting. Purpose. Overview. Sort benchmarks

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP

EE/CSCI 451: Parallel and Distributed Computation

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security

TCP offload engines for high-speed data processing

Algorithm Performance Factors. Memory Performance of Algorithms. Processor-Memory Performance Gap. Moore s Law. Program Model of Memory I

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Introduction to I/O and Disk Management

CSE 451: Operating Systems Spring Module 12 Secondary Storage. Steve Gribble

Computing architectures Part 2 TMA4280 Introduction to Supercomputing

Introduction to I/O and Disk Management

Jignesh M. Patel. Blog:

Outline. Parallel Database Systems. Information explosion. Parallelism in DBMSs. Relational DBMS parallelism. Relational DBMSs.

Availability and Utility of Idle Memory in Workstation Clusters. Anurag Acharya, UC-Santa Barbara Sanjeev Setia, George Mason Univ

SCSI vs. ATA More than an interface. Dave Anderson, Jim Dykes, Erik Riedel Seagate Research April 2003

Windows NT Server Configuration and Tuning for Optimal SAS Server Performance

BlueGene/L. Computer Science, University of Warwick. Source: IBM

Qiong Zhang, Roger D. Chamberlain, Ronald S. Indeck, Benjamin West, and Jason White

Workload Optimized Systems: The Wheel of Reincarnation. Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013

Communication has significant impact on application performance. Interconnection networks therefore have a vital role in cluster systems.

Lecture 1: Gentle Introduction to GPUs

Intel Enterprise Processors Technology

PC I/O. May 7, Howard Huang 1

EMC XTREMCACHE ACCELERATES VIRTUALIZED ORACLE

Outline. Execution Environments for Parallel Applications. Supercomputers. Supercomputers

FFS. CS380L: Mike Dahlin. October 9, Overview of file systems trends, future. Phase 1: Basic file systems (e.g.

Performance of computer systems

CS 61C: Great Ideas in Computer Architecture. MapReduce

COS 318: Operating Systems. Storage Devices. Vivek Pai Computer Science Department Princeton University

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

Experiences with the Parallel Virtual File System (PVFS) in Linux Clusters

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE

The Server-Storage Performance Gap

COSC 6385 Computer Architecture - Thread Level Parallelism (I)

Performance of DB2 Enterprise-Extended Edition on NT with Virtual Interface Architecture

Future Routing Schemes in Petascale clusters

An Oracle White Paper June Exadata Hybrid Columnar Compression (EHCC)

Control Hazards. Prediction

Chapter 12: Query Processing

TDT4260/DT8803 COMPUTER ARCHITECTURE EXAM

Object-based Storage (OSD) Architecture and Systems

A Better Storage Solution

Introduction to I/O. April 30, Howard Huang 1

Ingo Brenckmann Jochen Kirsten Storage Technology Strategists SAS EMEA Copyright 2003, SAS Institute Inc. All rights reserved.

The Impact of Optics on HPC System Interconnects

Oasis: An Active Storage Framework for Object Storage Platform

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads

Assessing and Comparing HP Parallel SCSI and HP Small Form Factor Enterprise Hard Disk Drives in Server Environments

CSE 451: Operating Systems Spring Module 12 Secondary Storage

CSCI 402: Computer Architectures. Parallel Processors (2) Fengguang Song Department of Computer & Information Science IUPUI.

Microarchitecture Overview. Performance

Network Design Considerations for Grid Computing

Table of contents. OpenVMS scalability with Oracle Rdb. Scalability achieved through performance tuning.

Distributed Video Systems Chapter 3 Storage Technologies

OpenVMS History. Ruth Goldenberg.

Computer Architecture Dr. Charles Kim Howard University

Computer Systems Architecture I. CSE 560M Lecture 19 Prof. Patrick Crowley

Column Stores vs. Row Stores How Different Are They Really?

EMC XTREMCACHE ACCELERATES MICROSOFT SQL SERVER

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE

Microarchitecture Overview. Performance

The Future of High-Performance Networking (The 5?, 10?, 15? Year Outlook)

Module 6: INPUT - OUTPUT (I/O)

On Deploying and Executing Data-Intensive Code on SMart Autonomous Storage (SmAS) Disks

Association Rule Discovery

QuickSpecs. What's New New 146GB Pluggable Ultra320 SCSI 15,000 rpm Universal Hard Drive. HP SCSI Ultra320 Hard Drive Option Kits (Servers) Overview

Welcome to Part 3: Memory Systems and I/O

Evaluation Report: HP StoreFabric SN1000E 16Gb Fibre Channel HBA

I/O Commercial Workloads. Scalable Disk Arrays. Scalable ICDA Performance. Outline of This Talk: Related Work on Disk Arrays.

Veritas Storage Foundation and. Sun Solaris ZFS. A performance study based on commercial workloads. August 02, 2007

Intel s s Memory Strategy for the Wireless Phone

PacketShader: A GPU-Accelerated Software Router

Introduction to Input and Output

Dynamic Fine Grain Scheduling of Pipeline Parallelism. Presented by: Ram Manohar Oruganti and Michael TeWinkle

Table of Contents. HP A7173A PCI-X Dual Channel Ultra320 SCSI Host Bus Adapter. Performance Paper for HP PA-RISC Servers

Introduction to parallel computers and parallel programming. Introduction to parallel computersand parallel programming p. 1

Deduplication Storage System

Simplify System Complexity

Association Rule Discovery

Lecture 23. Finish-up buses Storage

APP: MINIMIZING INTERFERENCE USING AGGRESSIVEPIPELINED PREFETCHING IN MULTI-LEVEL BUFFER CACHES

Transcription:

- Remote Execution for Network-Attached Storage Erik Riedel, University www.pdl.cs.cmu.edu/active UC-Berkeley Systems Seminar 8 October 1998

Outline Network-Attached Storage Opportunity Applications Performance Model Prototype Summary

Today s Server-Attached Disks Store-and-forward data copy through server machine File/Database Server Local Area Network Clients Separate storage and client networks storage moving to packetized FC clients moving to scalable switches

Network-Attached Secure Disks Eliminate server bottleneck w/ network-attached File Manager Object Storage Object Storage Network Security Network Security Switched Network Local Area Network Object Storage Object Storage Network Security Network Security Clients Combined storage and client networks single, switched infrastructure delivers max. bandwidth to clients drives must handle security

Excess Device Cycles Are Coming Higher and higher levels of integration in drive electronics specialized drive chips combined into single ASIC technology trends push toward integrated control processor 100 MHz, 32-bit superscalar w/ 2 MB on-chip RAM in 98

Opportunity Large database systems - lots of disks, lots of power System Processing (MHz) Data Rate (MB/s) CPU Disks I/O Bus Disks Compaq Proliant TPC-C 4 x 200=800 113 x 25=2,825 133 1,130 Microsoft Terraserver 4 x 400=1,600 320 x 25=8,000 532 3,200 Digital AlphaServer 500 TPC-C 1 x 500=500 61 x 25=1,525 266 610 Digital AlphaServer 4100 TPC-D 4 x 466=1,864 82 x 25=2,050 532 820 assume disk offers equivalent of 25 host MHz assume disk sustained data rate of 10 MB/s Lots more cycles and MB/s in disks than in host

Advantage - Basic advantages of an system parallel processing - lots of disks bandwidth reduction - filtering operations common scheduling - little bit of computation can go a long way Appropriate applications execution time dominated by data-intensive core allows parallel implementation of core small memory footprint small number of cycles per byte of data processed

Example Application Throughput (MB/s) Data mining - association rules [Agrawal95] 25.0 20.0 15.0 10.0 5.0 0.0 frequent sets summary counts count of 1-itemsets and 2-itemsets milk & bread => cheese diapers & beer Frequent Sets 1 Prototype Server 2 4 6 8 10 Number of Disks Throughput (MB/s) 250.0 200.0 150.0 100.0 50.0 0.0 Scaling Up Frequent Sets Server 20 40 60 80 100 Number of Disks

Basic Performance Model Execution = max( processing, transfer, disk access ) selectivity is #bytes-input / #bytes-output assume fully overlapped pipeline (avoids Amdahl s law) Processing time per byte Host: #cycles/byte / host-cpu-speed Disks: #cycles/byte / (disk-cpu-speed * #disks) Transfer time per overall byte Host: 1 / interconnect-data-rate Disks: (1 / selectivity) / interconnect-data-rate Disk access time per overall byte Both: 1 / (disk-data-rate * #disks)

Throughput Model Scalable throughput speedup = (#disks)/(host-cpu-speed/disk-cpu-speed) (host-cpu/disk-cpu-speed) ~ 5 (two processor generations) selectivity = #bytes-input / #bytes-output Throughput transfer saturation host saturation disk saturation disk-cpu saturation active disks traditional host-cpu disk-cpu selectivity Number of Disks

Database - select Additional Applications extract records that match a particular predicate Database - nearest neighbor search k records closest to input record with large number of attributes, reduces to scan Multimedia - edge detection [Smith95] detect edges in an image Multimedia - image registration [Welling97] find rotation and translation from reference image

Traditional Server Database Server Ultra Ultra Digital AlphaServer 500/500 500 MHz, 256 MB memory disks - Seagate Cheetah 4.5 GB, 10,000 RPM, 11.2 MB/s

Server with Server Obj Stor Obj Stor Network Security Network Security Switched Network ATM Obj Stor Obj Stor Network Security Network Prototype Digital AXP 3000/400 workstation 133 MHz, software NASD prototype Seagate Medallist disks Security

Performance with Throughput (MB/s) 60.0 40.0 20.0 Search Server Throughput (MB/s) 25.0 20.0 15.0 10.0 5.0 Frequent Sets Server 0.0 2 4 6 8 10 Number of Disks 0.0 2 4 6 8 10 Number of Disks Throughput (MB/s) 5.0 4.0 3.0 2.0 1.0 Edge Detection Server Throughput (MB/s) 0.80 0.60 0.40 0.20 Image Registration Server 0.0 2 4 6 8 10 Number of Disks 0.00 2 4 6 8 10 Number of Disks

Application Characteristics Critical properties for Active Disk performance cycles/byte => maximum throughput memory footprint selectivity => network bandwidth application input computation throughput memory selectivity bandwidth (cycles/byte) (MB/s) (KB) (factor) (KB/s) Select m=1% 7 28.6-100 290 Search k=10 7 28.6 72 80,500 0.4 Frequent Sets s=0.25% 16 12.5 620 15,000 0.8 Edge Detection t=75 303 0.67 1776 110 6.1 Image Registration - 4740* 0.04 672 180 0.2 Select m=20% 7 28.6-5 5,700 Frequent Sets s=0.025% 16 12.5 2,000 14,000 0.9 Edge Detection t=20 394 0.51 1750 3 170

Summary Technology trends provide the opportunity excess cycles large systems => lots of disks => lots of power Dramatic benefits possible application examples - data mining and multimedia characteristics for big wins - parallelism, selectivity basic advantage - compute close to the data Prototype speedup of 2x on 10 disks scales to 15x in 60 disk system bottleneck can be above 1000s of disks Throughput (MB/s) 250.0 200.0 150.0 100.0 50.0 Frequent Sets Server 0.0 60 20 40 80 100 Number of Disks

Conclusions & Future Work Leverage for powerful drive chips available now Siemens Tri-Core [announced March 98, first silicon Sept 98] Cirrus Logic 3CI [announced June 98] higher-level storage interfaces & security architecture NASD [Sigmetrics 97, ASPLOS 98] Object-oriented disks [Seagate and X3 T10], NSIC, SNIA aggressive applications data mining [Center for Automated Learning & Discovery] multimedia [Informedia, Digital Libraries] Challenges programming model - partitioning, mobility, interfaces resources - driven by cost, reliability, volume management - disk come in boxes of ten additional application classes - sort/join, storage management

Related Work Database Machines (CASSM, RAP, Gamma) higher disk bandwidth, parallelism general-purpose programmability OS/Database Extensions application-specific specialization/extension (SPIN, VINO) data type extensions (Sybase, Informix) Parallel Programming automatic data parallelism (HPF), task parallelism (Fx) parallel I/O (Kotz, IBM, Intel) Other Smart Disks offload SMP database functions, disk layout (Berkeley) select, sort, images via extended (Santa Barbara)

It is Why Isn t This Parallel Programming? parallel cores distributed computation serial portion needs to be small Disks are different must protect the data must continue to serve demand requests memory/cpu ratios driven by cost, reliability, volume come in boxes of ten advantage - compute close to the data Opportunistically use this power e.g. data mining possible on an OLTP system