A Flexible Data Warehouse Architecture

Similar documents
Oracle Exadata X2/X3-8: A Critical Review

Increasing Performance of Existing Oracle RAC up to 10X

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

Exadata. Presented by: Kerry Osborne. February 23, 2012

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

Software Defined Storage at the Speed of Flash. PRESENTATION TITLE GOES HERE Carlos Carrero Rajagopal Vaideeswaran Symantec

Under the Hood of Oracle Database Appliance. Alex Gorbachev

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer

IBM s Data Warehouse Appliance Offerings

Copyright 2012 EMC Corporation. All rights reserved.

Oracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years

HP visoko-performantna OLTP rješenja

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers

Was ist dran an einer spezialisierten Data Warehousing platform?

The World s Fastest Backup Systems

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

Optimizing the Data Center with an End to End Solutions Approach

Software-defined Shared Application Acceleration

ORACLE SUPERCLUSTER T5-8

Solaris Engineered Systems

Oracle Exadata. Smart Database Platforms - Dramatic Performance and Cost Advantages. Juan Loaiza Senior Vice President Oracle Database Systems

Exadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant

MicroTerabyte - NVMesh - Oracle RAC Reference Architecture

Building blocks for high performance DWH Computing

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

Open storage architecture for private Oracle database clouds

Hyper-converged storage for Oracle RAC based on NVMe SSDs and standard x86 servers

How to Pick SQL Server Hardware

Fusion iomemory PCIe Solutions from SanDisk and Sqrll make Accumulo Hypersonic

EMC VMAX 400K SPC-2 Proven Performance. Silverton Consulting, Inc. StorInt Briefing

Recent Innovations in Data Storage Technologies Dr Roger MacNicol Software Architect

SAN Storage Array Workbook September 11, 2012

The Oracle Database Appliance I/O and Performance Architecture

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

Oracle for administrative, technical and Tier-0 mass storage services

Books by Michael R. Ault

Exam : Title : High-End Disk for Open Systems V2. Version : DEMO

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

ORACLE EXADATA DATABASE MACHINE X2-8

REQUEST FOR PROPOSAL FOR PROCUREMENT OF

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

An Oracle White Paper March A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server

Dell PowerEdge R730xd Servers with Samsung SM1715 NVMe Drives Powers the Aerospike Fraud Prevention Benchmark

Oracle 1Z0-515 Exam Questions & Answers

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

An Oracle White Paper November A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

2009. October. Semiconductor Business SAMSUNG Electronics

Storage Optimization with Oracle Database 11g

Lenovo Database Configuration

<Insert Picture Here> Controlling resources in an Exadata environment

Maximize Database Join and Sort Performance Utilizing Exadata Flash

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 4 th, 2014 NEC Corporation

ORACLE EXADATA DATABASE MACHINE X2-2

ZD-XL SQL Accelerator 1.6

Accelerate Applications Using EqualLogic Arrays with directcache

MACHINE X2-8. Exadata Storage Server and the Oracle Databases storage hierarchy. With 14 ORACLE DATA SHEET ORACLE EXADATA DATABASE FEATURES AND FACTS

Three Paths to Better Business Decisions

Fusion-io: Driving Database Performance

Lenovo Database Configuration for Microsoft SQL Server TB

Storage Best Practices for Microsoft Server Applications

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740

Lenovo Database Configuration

Configuring a Single Oracle ZFS Storage Appliance into an InfiniBand Fabric with Multiple Oracle Exadata Machines

Getting the Most Out of SSDs IT System Optimization Best Practices

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Microsoft SQL Server 2012 Fast Track Reference Architecture Using PowerEdge R720 and Compellent SC8000

Holistic Oracle Tuning. Seeing the forest in the trees Mike Ault Oracle Guru, TMS, Inc

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

SUN ORACLE DATABASE MACHINE

UCS Invicta: A New Generation of Storage Performance. Mazen Abou Najm DC Consulting Systems Engineer

Consolidating OLTP Workloads on Dell PowerEdge R th generation Servers

Hitachi Converged Platform for Oracle

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Emulex LPe16000B 16Gb Fibre Channel HBA Evaluation

vstart 50 VMware vsphere Solution Specification

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

How to Speed up Database Applications with a Purpose-Built SSD Storage Solution

HP solutions for mission critical SQL Server Data Management environments

ORACLE Linux / TSC.

REFERENCE ARCHITECTURE Microsoft SQL Server 2016 Data Warehouse Fast Track. FlashStack 70TB Solution with Cisco UCS and Pure Storage FlashArray//X

Oracle #1 RDBMS Vendor

Solving the I/O bottleneck with Flash

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

Pivot3 Acuity with Microsoft SQL Server Reference Architecture

Oracle Exadata: The World s Fastest Database Machine

SAP High-Performance Analytic Appliance on the Cisco Unified Computing System

Identifying Performance Bottlenecks with Real- World Applications and Flash-Based Storage

DELL EMC DATA DOMAIN DEDUPLICATION STORAGE SYSTEMS

EMC Tiered Storage for Microsoft SQL Server 2008 Enabled by EMC CLARiiON CX4 and Enterprise Flash Drives

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

Dell Microsoft Reference Configuration Performance Results

16GFC Sets The Pace For Storage Networks

Got Isilon? Need IOPS? Get Avere.

Dell EMC SAP HANA Appliance Backup and Restore Performance with Dell EMC Data Domain

EMC CLARiiON CX3-80. Enterprise Solutions for Microsoft SQL Server 2005

B. Using Data Guard Physical Standby to migrate from an 11.1 database to Exadata is beneficial because it allows you to adopt HCC during migration.

ACCELERATE YOUR ANALYTICS GAME WITH ORACLE SOLUTIONS ON PURE STORAGE

All-Flash Storage Solution for SAP HANA:

Reasons to Deploy Oracle on EMC Symmetrix VMAX

Transcription:

A Flexible Data Warehouse Architecture Building the Ideal Data Warehouse Platform Mike Ault Oracle Guru Texas Memory Systems NYOUG Dec 2010

Michael R. Ault Oracle Guru - Nuclear Navy 6 years - Nuclear Chemist/Programmer 10 years - Kennedy Western University Graduate - Bachelors Degree Computer Science - Certified in all Oracle Versions Since 6 - Oracle DBA, author, since 1990

Books by Michael R. Ault

Statspackanalyzer.com Free Statspack/AWR Analysis Sponsored by Texas Memory Systems -Looks for IO bottlenecks and other configuration issues. -Straightforward tuning advice

General Requirements Flexible Architecture Can use multiple OS Can use multiple Databases Not locked to a single DBS or OS Expandable Low cost (relatively speaking)

What is a Data Warehouse? It is not just a large database It has a specific purpose It has a specific design

Typical Data Warehouse Design

Data Warehouse Characteristics Usually very large (hundreds of GB to TB) Structured for data retrieval Provides for Summaries Requires large numbers of IOPS Usually only a small number of users

DWH IO Characteristics Usually access is dimensions into facts Bitmap index merges usually most efficient Called a star join May have full table scans Can exceed 200,000 IOPS for large joins with sorts

Hard Disk IO Characteristics Limited by physics Maximum of 200 random IOPS Latency of 2-5 milliseconds for a 15K rpm drive Regardless of storage capacity!

IOPS Hard Drive IOPS (28 15K) 28 15K rpm 146GB drives 300 Gigabyte data, 600 Gigabyte total TPC-H profile HD IOPS 100000.00 10000.00 1000.00 100.00 10.00 1.00 0 5000 10000 15000 20000 Se conds Perm IO Temp IO Total IO

IOPS SSD IOPS Profile 1-1 TB Flash unit 2-128 GB DDR units Same DB and query set IOPS-SSD 1000000.00 100000.00 10000.00 1000.00 100.00 10.00 1.00 0 500 1000 1500 2000 2500 3000 3500 4000 4500 Seconds D-DI D-TI D-Total

To Get to 100,000 IOPS You need about 500 hard drives 3-CX3 from EMC should do it However

Can t get away from the Latency! EMC IOPS and Latency (From: http://blogs.vmware.com/performance/2008/05/100000-io-opera.html)

Why Am I Concerned? Latency will kill DWH performance Can t get below 2-5ms latency with disks Have to get rid of physical movement to retrieve data

Comparative Latencies IOPS Comparison for Various SANs (Source: www.storageperformance.org)

New SSD Technology EMC putting SSD into disk drive size package 4 per tray maximum EMC SSD Response time: 1-2 MS EMC HDD Response time: 4-8 MS (Source: EMC Tech Talk: Enterprise Flash Drives, Barry A. Burke, Chief Strategy Officer, Symmetrix Product Group, EMC Storage Division, June 25, 2008

So Disks too slow (4-8 ms) Even new SSD technology in disk format is slow (1-2 ms) Performance oriented SSD best choice (.015-0.2 ms)

DWH Processing Characteristics Sorts Summaries Parallel Operations Need multiple CPUS Need expansion capabilities

Available Server Technology Multi-CPU Blade Servers with RAC Large Multi-CPU Monolithic servers Multi-CPU Individual Servers with RAC

So What About Blades Flexible (within limits) Can t switch to other manufacturer Limited to their upgrade/release cycle May have to dump entire stack if new architecture comes along Flexible to DB Flexible to OS

How About Monolithic Severs? Not flexible Locked to a single vendor Need to toss the whole thing if new architecture comes along Limited growth Flexible to DB (sometimes) Flexible to OS (sometimes)

Individual Servers Each is self-contained Can be upgraded individually Flexible to OS Flexible to DB Expandable

So So Far We Need: Large data storage capacity Able to sustain large numbers of IOPS Able to support high degrees of parallel processing (supports large numbers of CPUs) Large core memory for each server/cpu Easily increase data size and IOPS capacity Easily increase processing capability

Oracle Requirements Real Application clusters Partitioning Parallel Query Maybe Analytical

Server Requirements Multi-high speed CPU servers Multiple servers High speed interconnect such as infiniband between the servers Multiple High bandwidth connections to the IO subsystem

IO Subsystem Should: Be easily expandable Provide high numbers of low latency IOPS Provide high bandwidth

Let s build it: Servers DELL R905 PowerEdge. 4-quadcore Opteron 8393,3.1GHz processors, dual 1 gb NIC 2 dual channel 4 Gb fibre channel connections. 10Ghz NIC 2 servers giving us 32 3.1 GHz processors

Why? The 8393 at 3.1 GHz gives best SPEC benchmark results for the DELL 905 server

IO Subsystem

Tier Zero Storage Lowest Latency, smaller needed capacity 4 Mirrored RamSan 440 s loaded with 512 Gigabytes of DDR Ram (1 terabyte usable from mirror) 15 microsecond latency 600,000 IOPS each Projected cost: $668K

Tier 1 Storage Not as fast Need more capacity 2-mirrored RamSan-630 s (8 terabytes of capacity) 250 microseconds response time 500,000 IOPS Projected Cost: $697K

Tier 2 Storage Actually only used for Backup Should provide for de-duplication and compression An appliance would be best DataDomain DD120 Projected cost: $12.5K

Switches Must provide for all connections Must be low latency high bandwidth Need 4-16 channel 4 Gb switches I suggest QLogic SanBox 5600Q Projected cost: $13.1K 10 Gig Ethernet router Fujitsu XG700, projected cost $6.5K Total switches: $19.6K

Misc. Cabinet Cables Etc. Estimate at $1.5K

Total: Servers: 36,484.00 RamSan-440: 668,400.00 RamSan-620: 697,400.00 DataDomain: 12,500.00 Switches: 19,600.00 Misc. 1,500.00 (cables, rack, etc.) Total 1,435,884.00

But What About Oracle? Just can make a guess For a similar CPU architecture with RAC and Parallel Query Cost in June 2009 was $440K with three years maintenance based on a 12 named user license

Grand Total: $1,875,654.00 Should I wrap that for you?

How does that Compare? Let s look at the new Exadata Won t need a full build We will look at a half Rack configuration 4 - servers 7 - Exadata cells Meets space requirements but only provides 18,900 IOPS Would need 741 cells to match IOPS at a cost for hardware and software in excess of $62 million

Projected Oracle DBM Cost Half Rack: $671,000.00 Cell Licenses: $960,000.00 Support (3 yr): $432,200.00 Oracle Licenses: $440,000.00 Total $2,503,200.00

For those in the back $2,503,200.00

Let s Think Green The energy consumption of the Exadata cell is projected to be 900 watts. For 7 cells about 6.3KW 600W for each of the RamSan440 systems 500W for each RamSan630 total of 3.4KW. Over one year of operation the difference in energy and cooling costs could be close to half those of the Exadata using the SSD solution Once all of the license and long term ongoing costs are considered the ideal solution provides quite a savings.

Final Scorecard Consideration Flexible Configuration Oracle DWHM OS Flexible Yes No DB Flexible Yes No Expandable Yes Yes High IO Bandwidth Yes Yes but low IOPS Low Latency Yes No High IOPS Yes No Initial cost Lower Higher Long term cost Good Poor

Summary I have shown a flexible data warehouse server architecture. This type of system is a moving target An ideal architecture allows high IO bandwidth. Low latency, capability for high degree of parallel operations and flexibility as far as future growth, database system and OS are concerned. Weigh all factors before selecting a solution

Questions? Mike.ault@texmemsys.com www.superssd.com www.ramsan.com