VLDB. Partitioning Compression

Similar documents
Oracle 11g Partitioning new features and ILM

Oracle Advanced Compression: Reduce Storage, Reduce Costs, Increase Performance Bill Hodak Principal Product Manager

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Oracle Advanced Compression. An Oracle White Paper June 2007

Copyright 2013, Oracle and/or its affiliates. All rights reserved.

An Oracle White Paper June Exadata Hybrid Columnar Compression (EHCC)

Oracle 1Z0-515 Exam Questions & Answers

Oracle Advanced Compression. An Oracle White Paper April 2008

An Oracle White Paper October Advanced Compression with Oracle Database 11g

Storage Optimization with Oracle Database 11g

Exadata Implementation Strategy

An Oracle White Paper February Optimizing Storage for Oracle PeopleSoft Applications

Data Warehousing & Big Data at OpenWorld for your smartphone

Oracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years

Data Warehousing 11g Essentials

ORACLE DATA SHEET ORACLE PARTITIONING

Oracle Database 11g Release 2 for SAP Advanced Compression. Christoph Kersten Oracle Database for SAP Global Technology Center (Walldorf, Germany)

Exadata Implementation Strategy

Oracle: From Client Server to the Grid and beyond

Optimizing Storage with SAP and Oracle Database 12c O R A C L E W H I T E P A P E R M A Y

Hybrid Columnar Compression (HCC) on Oracle Database 18c O R A C L E W H IT E P A P E R FE B R U A R Y

Was ist dran an einer spezialisierten Data Warehousing platform?

10/29/2013. Program Agenda. The Database Trifecta: Simplified Management, Less Capacity, Better Performance

Oracle Zero Data Loss Recovery Appliance (ZDLRA)

Actual4Test. Actual4test - actual test exam dumps-pass for IT exams

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

Automating Information Lifecycle Management with

Copyright 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12

An Oracle White Paper August Advanced Compression Option (ACO) with Oracle Database 11g

Table Compression in Oracle9i Release2. An Oracle White Paper May 2002

An Oracle White Paper June Partitioning with Oracle Database 12c

Oracle #1 RDBMS Vendor

Oracle EXAM - 1Z Oracle Exadata Database Machine Administration, Software Release 11.x Exam. Buy Full Product

An Oracle White Paper July Oracle Advanced Compression with Oracle Database 12c

Exadata X3 in action: Measuring Smart Scan efficiency with AWR. Franck Pachot Senior Consultant

Zero Data Loss Recovery Appliance DOAG Konferenz 2014, Nürnberg

Oracle Database In-Memory

An Oracle White Paper June Oracle Advanced Compression with Oracle Database 12c

<Insert Picture Here> Controlling resources in an Exadata environment

Oracle Advanced Compression with Oracle Database 12c Release 2 O R A C L E W H I T E P A P E R M A R C H

Oracle Advanced Compression with Oracle Database 12c Release 2 O R A C L E W H I T E P A P E R S E P T E MB E R

A Technical Overview of the HP Oracle Exadata Storage Server. An Oracle White Paper October 2008

<Insert Picture Here> Introducing Exadata X3

Oracle Partitioning in Oracle Database 12c Release 2

Key to A Successful Exadata POC

Oracle Advanced Compression Helps Global Fortune 500 Company Meet Storage Savings Initiative O R A C L E W H I T E P A P E R F E B R U A R Y

In-Memory Data Management

So you think you know everything about Partitioning?

Oracle Exam 1z0-027 Oracle Exadata Database Machine X3 Administration Version: 6.13 [ Total Questions: 72 ]

Oracle Database In-Memory

Segregating Data Within Databases for Performance Prepared by Bill Hulsizer

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

Evolving To The Big Data Warehouse

B. Using Data Guard Physical Standby to migrate from an 11.1 database to Exadata is beneficial because it allows you to adopt HCC during migration.

Oracle Exadata: Strategy and Roadmap

Information Lifecycle Management for Business Data. An Oracle White Paper September 2005

Oracle Hyperion Profitability and Cost Management

<Insert Picture Here> Exadata MAA Best Practices Series Session 1: E-Business Suite on Exadata

Oracle Exadata Implementation Strategy HHow to Implement Exadata In-house

Oracle Database 11g for Data Warehousing and Business Intelligence

Oracle Exadata X7. Uwe Kirchhoff Oracle ACS - Delivery Senior Principal Service Delivery Engineer

1 Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Oracle Exadata. Smart Database Platforms - Dramatic Performance and Cost Advantages. Juan Loaiza Senior Vice President Oracle Database Systems

Copyright 2014, Oracle and/or its affiliates. All rights reserved.

1Z Oracle Exadata X5 Administration Exam Summary Syllabus Questions

<Insert Picture Here> South Fla Oracle Users Group Oracle/Sun Exadata Database Machine June 3, 2010

An Oracle White Paper September Oracle Utilities Meter Data Management Demonstrates Extreme Performance on Oracle Exadata/Exalogic

Oracle Database 18c and Autonomous Database

<Insert Picture Here> Controlling resources in an Exadata environment

In-Memory Data Management Jens Krueger

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Course Contents of ORACLE 9i

Four Steps to Unleashing The Full Potential of Your Database

An Oracle White Paper March A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server

SAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine

Oracle CoreTech Update OASC Opening 17. November 2014

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

The Oracle Database Appliance I/O and Performance Architecture

Data Vault Partitioning Strategies WHITE PAPER

Oracle Database In-Memory

OpenWorld 2015 Oracle Par22oning

Oracle Database 10g: New Features for Administrators Release 2

What Really Sets Apart Exadata from the Rest

An Oracle White Paper April 2010

An Oracle White Paper November A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server

Storwize/IBM Technical Validation Report Performance Verification

Top Trends in DBMS & DW

Rio-2 Hybrid Backup Server

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Oracle Database 11g Data Guard

CHAPTER. Oracle Database 11g Architecture Options

Mellanox InfiniBand Solutions Accelerate Oracle s Data Center and Cloud Solutions

Deep Dive Into Storage Optimization When And How To Use Adaptive Compression. Thomas Fanghaenel IBM Bill Minor IBM

Oracle Advanced Compression Proof-of-Concept (POC) Insights and Best Practices

SUN ORACLE DATABASE MACHINE

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE

Validating the NetApp Virtual Storage Tier in the Oracle Database Environment to Achieve Next-Generation Converged Infrastructures

Oracle Database 10g : Administration Workshop II (Release 2) Course 36 Contact Hours

Andy Mendelsohn, Oracle Corporation

ORACLE EXADATA DATABASE MACHINE X2-8

Transcription:

VLDB Partitioning Compression

<Insert Picture Here> Oracle Partitioning in Oracle Database 11g

Oracle Partitioning Ten Years of Development Core functionality Performance Manageability Oracle8 Range partitioning Global range indexes Static partition pruning Basic maintenance operations: add, drop, exchange Oracle8i Hash and composite range-hash partitioning Partition-wise joins Dynamic pruning Merge operation Oracle9i List partitioning Oracle9i R2 Composite range-list partitioning Oracle10g Global hash indexes Oracle10g R2 1M partitions per table Global index maintenance Fast partition split Local Index maintenance Multi-dimensional pruning Fast drop table

Oracle Partitioning Ten Years of Development Core functionality Performance Manageability Oracle8 Range partitioning Global range indexes Static partition pruning Basic maintenance operations: add, drop, exchange Oracle8i Hash and composite range-hash partitioning Partition-wise joins Dynamic pruning Merge operation Oracle9i List partitioning Oracle9i R2 Composite range-list partitioning Oracle10g Global hash indexes Oracle10g R2 1M partitions per table Oracle Database 11g More composite choices REF Partitioning Virtual Column Partitioning Global index maintenance Fast partition split Local Index maintenance Multi-dimensional pruning Fast drop table Interval Partitioning Partition Advisor

Oracle Partitioning Fundamental Concepts for Success While performance seems to be the most visible one, don't forget about the rest partitioning must address all business-relevant areas of Performance, Manageability, and Availability Partition autonomy is crucial fundamental requirement for any partition maintenance operations acknowledge partitions as metadata in the data dictionary

Oracle Partitioning Enhancements Complete the basic partitioning strategies defines HOW data is going to be partitioned new composite partitioning methods Introduce partitioning extensions defines WHAT controls the data placement enhance the manageability and automation virtual column based partitioning REF partitioning interval partitioning partition advisor

<Insert Picture Here> Composite Partitioning in Oracle Database 11g

Extended Composite Partitioning Strategies Concept of composite partitioning Data is partitioned along two dimensions (A,B) A distinct value pair for the two dimensions uniquely determines the target partitioning Composite partitioning is complementary to multi- column range partitioning Extensions in Oracle Database 11g.. New 11g List Range Strategy Range - Range List - Hash List - List Use Case Geography ShipDate -Time Geography OrderDate Geography OrderID Product

Composite Partitioning - Concept Table SALES RANGE(order_date)-RANGE(ship_date) ship_date Jan 2006 Feb 2006 May 2006 Jan 2006 Feb 2006 order_date Mar 2006 Jan 2007

Composite Partitioning - Concept Table SALES RANGE(order_date)-RANGE(ship_date) ship_date All records with Jan 2006 Feb 2006 May 2006 Jan 2006 Feb 2006 order_date Mar Mar2006 2006 Jan 2007 order_date in March 2006

Composite Partitioning - Concept Table SALES RANGE(order_date)-RANGE(ship_date) ship_date Jan 2006 Feb 2006 May May 2006 Jan 2006 Feb 2006 order_date Mar 2006 Jan 2007 All records with ship_date in May 2006

Composite Partitioning - Concept Table SALES RANGE(order_date)-RANGE(ship_date) ship_date Jan 2006 Feb 2006 May May 2006 Jan 2006 Feb 2006 order_date Mar Mar2006 2006 Jan 2007 All records with order_date in March 2006 AND ship_date in May 2006

<Insert Picture Here> Virtual Column based Partitioning

Virtual Columns Business Problem Extended Schema attributes are fully derived and dependent on existing common data Redundant storage or extended view definitions are solving this problem today requires additional maintenance and creates overhead Solution Oracle Database 11g introduces virtual columns purely virtual, meta-data only Treated as real columns except no DML can have statistics eligible as partitioning key Enhanced performance and manageability

Virtual Columns - Example Base table with all attributes CREATE TABLE accounts (acc_no number(10) not null, acc_name varchar2(50) not null, 12500 12507 12666 12875 Adams Blake King Smith

Virtual Columns - Example Base table with all attributes is extended with the virtual (derived) column CREATE TABLE accounts (acc_no number(10) not null, acc_name varchar2(50) not null, acc_branch number(2) generated always as (to_number(substr(to_char(acc_no),1,2))) 12500 12507 12666 12875 Adams Blake King Smith 12 12 12 12

Virtual Columns - Example Base table with all attributes is extended with the virtual (derived) column and the virtual column is used as partitioning key CREATE TABLE accounts (acc_no number(10) not null, acc_name varchar2(50) not null, acc_branch number(2) generated always as (to_number(substr(to_char(acc_no),1,2))) partition by list (acc_branch) 12500 12507 12666 12875 Adams Blake King Smith 12 12 12 12 32320 32407 32758 32980 Jones Clark Hurd Phillips 32 32 32 32

<Insert Picture Here> Interval Partitioning

Interval Partitioning Partitioning is key-enabling functionality for managing large volumes of data Application one logical object for application transparency multiple physical segments for administration but Physical segmentation requires additional data management overhead new partitions must be created on-time for new data Automate the partition management SQL CDRs Mar Jan Feb

Interval Partitioning Interval Partitioning extension to range partitioning full automation for equi-sized range partitions Partitions are created as metadata information only start partition is made persistent Segments are allocated as soon as new data arrives no need to create new partitions local indexes are created and maintained as well No need for any partition management

Interval Partitioning How it works CREATE TABLE sales (order_date DATE, ) PARTITON BY RANGE (order_date) INTERVAL(NUMTOYMINTERVAL(1,'month') (PARTITION p_first VALUES LESS THAN ('01-FEB-2006'); Table SALES Jan 2006 Feb 2006 Mar 2006 First segment is created Jan 2007 Oct 2009 Nov 2009

Interval Partitioning How it works CREATE TABLE sales (order_date DATE, ) PARTITON BY RANGE (order_date) INTERVAL(NUMTOYMINTERVAL(1,'month') (PARTITION p_first VALUES LESS THAN ('01-FEB-2006'); Table SALES Jan 2006 Feb 2006 Mar 2006 Jan 2007 Oct 2009 Other partitions only exist in metadata Nov 2009

Interval Partitioning How it works CREATE TABLE sales (order_date DATE, ) PARTITON BY RANGE (order_date) INTERVAL(NUMTOYMINTERVAL(1,'month') (PARTITION p_first VALUES LESS THAN ('01-FEB-2006'); Table SALES Jan 2006 Feb 2006 Mar 2006 Jan 2007 Oct 2009 Nov 2009 New segment is automatically allocated INSERT INTO sales (order_date DATE, ) VALUES ('04-MAR-2006',);

Interval Partitioning How it works CREATE TABLE sales (order_date DATE, ) PARTITON BY RANGE (order_date) INTERVAL(NUMTOYMINTERVAL(1,'month') (PARTITION p_first VALUES LESS THAN ('01-FEB-2006'); Table SALES Jan 2006 Feb 2006 Mar 2006 Jan 2007 Oct 2009 Nov 2009 whenever data for a new partition arrives INSERT INTO sales (order_date DATE, ) VALUES ('17-OCT-2009',);

Interval Partitioning How it works Interval partitioned table can have classical range and automated interval section Automated new partition management plus full partition maintenance capabilities: Best of both worlds Table SALES Jan 2006 Feb 2006 Mar 2006 Jan 2007 Oct 2009 Nov 2009

Interval Partitioning How it works Interval partitioned table can have classical range and automated interval section Automated new partition management plus full partition maintenance capabilities: Best of both worlds Table SALES 2006 Range partition section MERGE and move old partitions for ILM Jan 2007 Oct 2009 Nov 2009

Interval Partitioning How it works Interval partitioned table can have classical range and automated interval section Automated new partition management plus full partition maintenance capabilities: Best of both worlds Table SALES 2006 Range partition section Jan 2007 Oct 2009 Interval partition section Nov 2009 MERGE and move old partitions for ILM Insert new data - Automatic segment creation INSERT INTO sales (order_date DATE, ) VALUES ('13-NOV-2009',);

<Insert Picture Here> REF Partitioning

REF Partitioning Business Problem Related tables benefit from same partitioning strategy e.g. order lineitem Redundant storage of the same information solves this problem data overhead maintenance overhead Solution Oracle Database 11g introduces REF Partitioning child table inherits the partitioning strategy of parent table through PK-FK relationship intuitive modelling Enhanced Performance and Manageability

Before REF Partitioning Table ORDERS Jan 2006 RANGE(order_date) Primary key order_id Feb 2006 Redundant storage of order_date Redundant maintenance Table LINEITEMS Jan 2006 Feb 2006 RANGE(order_date) Foreign key order_id

REF Partitioning Table ORDERS Jan 2006 RANGE(order_date) Primary key order_id Feb 2006 PARTITION BY REFERENCE Partitioning key inherited through PK-FK relationship Table LINEITEMS Jan 2006 Feb 2006 RANGE(order_date) Foreign key order_id

Partitioning Advisor Packaged Apps Custom Apps SQL Workload SQL Advisor SQL SQL Plan Access Structure Tuning Analysis Analysis SQL Profile SQL Advice Indexes & MVs New! Partition Analysis Partition Advice Considers entire query workload to improve query performance Advises on partitioning methods Range (equal-interval), range key and interval Hash, hash key Integrated, non-conflicting advice with Indexes, MVs Well-tuned SQL & Schema

Oracle Partitioning Summary Proven functionality in 7th generation experience comes with age and customer usage Fundamental concepts for success the most comprehensive and complete offering 8th generation introduces ground-breaking new functionality extended COMPOSITE strategies further flexibility and performance INTERVAL for simplicity, ease of management VIRTUAL COLUMNS extend applicability with low maintenance REF enhanced performance for related tables

<Insert Picture Here> Advanced Compression Option

Challenges Explosion in data volume managed by Enterprises Government regulations (Sarbanes-Oxley, HIPPA, etc) User generated content (Web 2.0) IT managers must support larger volumes of data with limited technology budgets Need to optimize storage consumption Also maintain acceptable application performance Intelligent and efficient compression technology can help address these challenges

Introducing Advanced Compression Option Oracle Database 11g introduces a comprehensive set of compression capabilities Structured/Relational data compression Unstructured data compression Compression for backup data Network transport compression Reduces resource requirements and costs Storage System Network Bandwidth Memory Usage Redo logs Standby Backups

Table Compression Introduced in Oracle9i Release 2 Supports compression during bulk load operations (Direct Load, CTAS) Data modified using conventional DML not compressed Optimized compression algorithm for relational data Improved performance for queries accessing large amounts of data Fewer IOs Buffer Cache efficiency Data is compressed at the database block level Each block contains own compression metadata improves IO efficiency Local symbol table dynamically adapts to data changes Compression can be specified at either the table or partition levels Completely transparent to applications Noticeable impact on write performance

OLTP Table Compression Oracle Database 11g extends compression for OLTP data Support for conventional DML Operations (INSERT, UPDATE, DELETE) New algorithm significantly reduces write overhead Batched compression ensures no impact for most OLTP transactions No impact on reads Reads may actually see improved performance due to fewer IOs and enhanced memory efficiency

OLTP Table Compression Overhead Inserts are again uncompressed Free Space Uncompressed Compressed Block usage reaches PCTFREE triggers Compression Inserts are uncompressed Block usage reaches PCTFREE triggers Compression Adaptable, continuous compression Compression automatically triggered when block usage reaches PCTFREE Compression eliminates holes created due to deletions and maximizes contiguous free space in block

Using OLTP Table Compression Requires database compatibility level at 11.1 or greater New Syntax extends the COMPRESS keyword COMPRESS [FOR {ALL DIRECT_LOAD} OPERATIONS] DIRECT_LOAD (DEFAULT) Refers to Bulk load operations from 10g and prior releases ALL OLTP + Direct loads Enable compression for a new table CREATE TABLE t1 COMPRESS FOR ALL OPERATIONS Enable only direct load compression on existing table ALTER TABLE t2 COMPRESS only new rows are compressed, existing rows are uncompressed

SecureFiles - Deduplication Secure hash Enables storage of a single physical image for duplicate data Significantly reduces space consumption Dramatically improves write and copy operations No adverse impact on read operations may actually improve read performance for cache data Duplicate detection happens within a table, partition or sub-partition Specially useful for content management, email applications and data archival applications

SecureFiles - Compression Huge storage savings Industry standard compression algorithms 2-3x compression for typical files (doc, pdf, xml) Minimal CPU overhead during compression Automatically detects if SecureFile data is compressible Skips compression for already compressed data Auto-turn off compression when space savings are minimal or zero Two levels of compression provide different compression ratios Compression Levels: MEDIUM (default), HIGH Higher the degree of compression, higher the latency and CPU overhead incurred SecureFiles Compression is independent of table or index compression Server-side compression Allows for random reads and writes to SecureFile data Can be specified at a partition level

Data Pump Compression Metadata compression available since Oracle Database 10g Oracle Database 11g extends compression to table data during exports no need to decompress before import Single step compression of both data and metadata compressed data directly hits disk resulting in reduced disk space requirements 75% reduction in dump file size on export of sample OE and SH schemas Compression factor comparable to GNU gzip utility Application transparent complete Data Pump functionality available on compressed files

Backup data and Network transport Compression Fast RMAN Compression compresses the backup set contents before writing them to disk or tape no extra decompression steps are required during recovery when you use RMAN compression. high performance, industry standard compression algorithm 40% faster backup compression versus Oracle Database 10g suitable for fast, incremental daily backups reduces network usage Data Guard Network Compression compression of redo traffic over the network improves redo transport performance gap resolution is up to 2x faster

Summary Advanced Compression Option contains comprehensive data compression capabilities for all types of data Structured, Unstructured, Backup, Network Transport Reduces storage consumption by 2 to 3 times Improves read performance Enhances memory, buffer cache utilization Complete application transparency Benefits diverse application workloads

VLDB Partitioning Compression

EXADATA Extreme Performance Data Warehousing

The Performance Challenge Storage Data Bandwidth Bottleneck Large data warehouses want to scan dozens, hundreds, or thousands of disks at full disk speed Pipes between disks and servers constrain bandwidth by 10x or more Result is that warehouses can become slower as they get bigger

Solutions To Data Bandwidth Bottleneck Ship less data through pipes Add more pipes Make pipes bigger

Exadata A New Architecture Breaks Data Bandwidth Bottleneck Exadata Ships Less Data Through Pipes Query processing is moved into storage to dramatically reduce data sent to servers while offloading server CPUs Exadata has More Pipes Modular storage cell building blocks organized into Massively Parallel Grid Bandwidth scales with capacity Exadata has Bigger Pipes InfiniBand interconnect transfers data 5x faster than Fibre Channel Exadata Moves a Lot Less Data a Lot Faster

Exadata A New Architecture Exadata delivers brawny hardware for use by Oracle s brainy software Hardware by HP Software by Oracle Sold by Oracle Support point of contact is Oracle

HP Oracle Exadata Storage Server Storage Server (Cell) is building block of Exadata Storage Grid Cells ship with all hardware and software components pre-installed Runs Exadata Storage Software, Oracle Infiniband protocol, Oracle Enterprise Linux 5.1, and HP hardware management software Does not support non-database storage Hewlett Packard is exclusive hardware provider HP has worldwide presence, great reputation, outstanding x86 platforms, high volume leadership, excellent support organization Co-branded by HP and Oracle Absolutely no custom hardware - all parts off the shelf high-volume

HP Oracle Database Machine Configured for High Performance Data Warehousing 8 Oracle Database servers 64 Intel processor cores Oracle Enterprise Linux Oracle Real Application Clusters 14 Exadata Storage Servers 50 to 168 TB raw storage InfiniBand switches Optimized, Certified, and Supported by Oracle

Add Performance and Capacity by Adding More Database Machines

Extreme Performance in Action Scan TB of User Data In 3.5 sec. 80 70 70 60 60 50 50 Elapsed Time Secs 40 40 Scan Rate(GB/Sec) 30 30 20 20 10 10 0 0 14 28 42 56 70 84

Scalable Pure Storage Add racks to scale further Scale to 18 cells in one rack Each cell connects to 2 InfiniBand switches for Redundancy SAS raw capacity per rack: 65TB SATA raw capacity per rack: 216TB Peak throughput per rack : >18GB/s InfiniBand links across racks for full connectivity

Exadata Product Capacity Raw Storage User Data Data Bandwidth HP Exadata Storage Server Hardware SAS 12x450 3.6 TB 1 TB 1 GB/s HP Exadata Storage Server Hardware SATA 12x1000 12 TB 3.3 TB 0.75 GB/s HP Oracle Database Machine Hardware SAS 450 50 TB 14 TB 14 GB/s 168 TB 46 TB 10.5 GB/s HP Oracle Database Machine Hardware SATA 1000

Exadata Changes the Game Exadata delivers brawny hardware for use by Oracle s brainy software A new approach and architecture Delivers database intelligence and massively parallel scaling in the storage tier Using state of the art industry standard hardware Eliminates all bottlenecks preventing high performance query processing Dramatic performance improvement for data warehousing Simplicity of appliance seamlessly integrated with the database