IBM DB2 Analytics Accelerator

IBM DB2 Analytics Accelerator for z/OS and DB2 Analytics Accelerator for z/OS on Cloud: Update
Peter Bendel, IBM STSM
June 2017

Disclaimer: IBM's statements regarding its plans, directions, and intent are subject to change or withdrawal without notice and at IBM's sole discretion. Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion. Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon many factors, including considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve results similar to those stated here.

Agenda
- Level-set: What is IDAA and the recent enhancements
- Big picture: Long-term strategy
- Work in progress: Top themes

IBM DB2 Analytics Accelerator: the hybrid computing platform on System z

Transaction processing and analytical workload
- Supports transaction processing and analytics workloads concurrently, efficiently and cost-effectively
- Delivers industry-leading performance for mixed workloads
- The unique heterogeneous scale-out platform
- Superior availability, reliability and security

DB2 Accelerator personalities
- Turbo-charged access path with hardware-assisted early filtering
- Full-width index
- Specialty engine
- Archive
- Tablespace
- ETL/ELT and in-database analytics acceleration
- Integration hub: fast federated joins across heterogeneous sources
- Hybrid cloud

V5.1 Highlights
- Accelerator-only tables can benefit statistics and analytics tools that use temporary data for reports. They enable acceleration of data transformations implemented via SQL statements. Storing interim results in accelerator-only tables enables subsequent queries or data transformations to process all relevant data on the accelerator at high speed.
- In-database analytics capabilities enable acceleration of predictive analytics applications. This enables SPSS/Netezza Analytics data mining and in-database modeling to be processed within the Accelerator.
- System-temporal and bitemporal table support
  - Active and history tables loaded to the Accelerator
  - TIMESTAMP(12) data type now supported
  - Truncation to TIMESTAMP(6), new options for ZPARM QUERY_ACCEL_OPTIONS
  - Queries using temporal SQL expressions can be routed to the Accelerator
- Transparent archive table support (since V4 PTF 3)
  - Archive-enabled table and archive table loaded to the Accelerator
  - Older partitions of the archive table can be archived in IDAA
- Encryption of data in motion
  - Encrypt network traffic between z Systems and the Accelerator using Internet Protocol Security (IPsec)
  - Network traffic includes DRDA traffic (e.g. queries, table loads), Configuration Console traffic, and Incremental Update traffic
  - Requires configuration and enablement on z Systems and on the Accelerator
- Incremental update operation enhancements
  - Ability to disable query acceleration automatically for suspended tables
  - Continuous replication and re-load of replicated tables without taking a table lock on the source
- Improved installation and maintenance process
  - Extended compatibility between stored procedure version level and accelerator version level
  - Enhanced product packaging
- More supported SQL functions
  - Date/time arithmetic functions
  - Inline scalar UDFs

V6.1 Highlights
- IDAA on Cloud (DB2 Analytics Accelerator for z/OS on Cloud)
  - Overwhelming general market trends and IBM focus
  - Demonstrating z analytics vitality
  - Fast deployment, quick time to value
  - Basis for hybrid cloud offering
  - Basis for change of accelerator engine (dashDB)
  - Gaining necessary new skills
- Mix and match
  - Simultaneous deployment of on-premises and cloud IDAA servers
- R support in IDAA on premises
  - Support for data scientists' most popular tool
  - Enabling in-database analytics

Introducing the accelerator-only table type in DB2 for z/OS

Creation (DDL) and access remain through DB2 for z/OS in all cases.
- Non-accelerator DB2 table: data in DB2 only
- Accelerator-shadow table: data in DB2 and the Accelerator
- Accelerator-archived table / partition: empty read-only partition in DB2; partition data is in the Accelerator only
- Accelerator-only table (AOT): proxy table in DB2; data is in the Accelerator only
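The four table types differ only in where the data lives. A minimal Python sketch of that distinction follows; the names `DataLocation`, `TABLE_TYPES` and `all_data_on_accelerator` are hypothetical and exist only for illustration, not as part of any DB2 or Accelerator interface.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataLocation:
    in_db2: bool          # table (or partition) data is stored in DB2 for z/OS
    in_accelerator: bool  # table (or partition) data is stored on the Accelerator


# Illustrative model of the four table types listed on the slide.
TABLE_TYPES = {
    "non-accelerator": DataLocation(in_db2=True, in_accelerator=False),
    "accelerator-shadow": DataLocation(in_db2=True, in_accelerator=True),
    "accelerator-archived": DataLocation(in_db2=False, in_accelerator=True),
    "accelerator-only": DataLocation(in_db2=False, in_accelerator=True),
}


def all_data_on_accelerator(table_types):
    """True if every referenced table type keeps its data on the Accelerator."""
    return all(TABLE_TYPES[t].in_accelerator for t in table_types)


def data_missing_from_db2(table_types):
    """Table types whose data cannot be read from DB2 itself."""
    return [t for t in table_types if not TABLE_TYPES[t].in_db2]


print(all_data_on_accelerator(["accelerator-shadow", "accelerator-only"]))
print(data_missing_from_db2(["accelerator-shadow", "accelerator-only"]))
```

In this model, a query that mixes a non-accelerator table with an accelerator-only table has no single place where all of its data is available, which is why the table type matters for query planning.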

New use cases with Accelerator-only tables
1. Accelerate existing workload: adding application support for temporary objects (multi-step reporting, QMF, Campaign)
2. Reduce IT sprawl: in-database transformation; consolidation of ETL/ELT processing in DB2 for z/OS; support for DataStage Balanced Optimization
3. Derive new business insight: individual ad-hoc analysis; data scientist work area
4. Include external & historical data: integrate more data sources for analytics; use DB2 Analytics Accelerator Loader to integrate with IMS data or data from other sources; leverage High Performance Storage Saver

IBM DB2 Analytics Accelerator Loader for z/OS v2.1
- Extending analytic capabilities by bringing non-DB2 data to the accelerator and z Systems
  - Significant cost and time reduction by eliminating manual ETL processes for non-DB2 data
  - Insight into more data types such as IMS, VSAM, sequential files, Oracle, Adabas, etc.
  - Accelerate operational analytics by supporting SMF, RMF, and Syslog data
- Ensure high availability for critical analytics applications
  - Loading multiple accelerators in parallel: a single invocation and a single unload from DB2
  - Applies to non-DB2 sources as well
- Add data to an existing table to avoid reloading the entire table
- z Systems synergy: zIIP usage
- Sources shown: VSAM, IMS, DB2, sequential files, Adabas, SMF, Oracle

IDAA V5 PTF 4 new features
- Changing the IP address or the TCP/IP port of the Capture Agent
- Restoring archived partitions from multiple accelerators
- Directing accelerated queries to a specific accelerator
- Supporting multiple-row insert for accelerator-only tables (AOTs)

DB2 Analytics Accelerator Loader 2.1 PTFs
- Backup and recovery for accelerator tables, including accelerator-only tables
- Available via APAR PI70981 / PTF UI44867; see http://www.worldofdb2.com/profiles/blog/show?id=2837977%3ablogpost%3a116258&xgs=1&xg_source=msg_share_post

Directing accelerated queries to a specific accelerator

For users with multiple accelerators: a new special register and bind option allow users to direct accelerated queries to a specific accelerator.
- DB2 11 only
- Delivered via DB2 PTFs only (PI72150, PI73563); no Accelerator update to PTF 4 required

Currently, a user cannot specify the target accelerator on which an accelerated query is executed. If a user has multiple accelerators where the same set of DB2 tables is accelerated, the accelerator workload balancing algorithm distributes the queries depending on the queue length on each accelerator.

Use cases:
- The user has an additional accelerator at a remote location (DR location) in addition to locally deployed accelerators. The current algorithm does not take into account the latency caused by the long network distance, so some high-priority queries might be sent to the remote accelerator and incur increased elapsed time.
- The user wants to direct queries to different accelerators depending on the priority of the workload, with high-priority queries directed to the fastest, highest-capacity accelerator and low-priority queries directed to a slower accelerator.

Directing accelerated queries to a specific accelerator: interfaces

CURRENT ACCELERATOR (special register)
- Specifies the name of the preferred accelerator(s) to which DB2 should send accelerated dynamic queries.
- If none of the accelerator servers named by CURRENT ACCELERATOR is available or eligible, DB2 will consider other available accelerator servers.
- Syntax: SET CURRENT ACCELERATOR = <accelerator-name>
- <accelerator-name> is a single accelerator name or an accelerator logical name as recorded in SYSIBM.LOCATIONS (the logical name represents one or more accelerator server names).

ACCELERATOR (bind option)
- Sets the target accelerator(s) to be used for static queries: a single accelerator name or an accelerator logical name.
- Only affects the runtime behavior of an accelerated query.
- If one of the specified accelerators does not exist at bind time, DB2 issues a warning message and does not fail the bind.
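The fallback behavior of CURRENT ACCELERATOR can be pictured with a small Python model. This is a sketch under stated assumptions, not DB2's actual algorithm: the function names are hypothetical, and picking the shortest queue among the candidates is an assumption based only on the statement that workload balancing depends on queue length.

```python
def set_current_accelerator_sql(name):
    """Render the statement shown on the slide for a given accelerator name."""
    return f"SET CURRENT ACCELERATOR = {name}"


def choose_accelerator(preferred, available, queue_length):
    """Pick a target accelerator for one query (illustrative model only).

    preferred:    accelerator names resolved from CURRENT ACCELERATOR
    available:    set of accelerators that are available and eligible
    queue_length: dict mapping accelerator name to its current queue length
    """
    # Preferred accelerators win as long as at least one of them is usable.
    candidates = [name for name in preferred if name in available]
    if not candidates:
        # None of the preferred accelerators is usable: consider the others.
        candidates = sorted(available)
    if not candidates:
        return None  # no accelerator can take the query
    # Assumption: among the candidates, the shortest queue is chosen.
    return min(candidates, key=lambda name: (queue_length.get(name, 0), name))


queues = {"ACCEL_LOCAL": 5, "ACCEL_DR": 0}
print(set_current_accelerator_sql("ACCEL_LOCAL"))
print(choose_accelerator(["ACCEL_LOCAL"], {"ACCEL_LOCAL", "ACCEL_DR"}, queues))
print(choose_accelerator(["ACCEL_LOCAL"], {"ACCEL_DR"}, queues))
```

The second call shows the point of the feature: the local accelerator is chosen even though the remote one has a shorter queue. The third call shows the fallback when the preferred accelerator is unavailable.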

Multiple-row insert for accelerator-only tables (AOTs)
- Support of multiple-row INSERTs for accelerator-only tables
- Allows faster inserts into accelerator-only tables through JDBC, ODBC or local SQL applications
- Note: this feature is still in beta state
- DB2 11 only
- Required: PTF 4 on the Accelerator and DB2 PTF PI66573

Performance numbers from a test in the lab:
- Multiple-row insert is ~66 times faster than single-row insert
- JDBC program with a batch size of 32000 rows for multiple-row insert

Use cases:
- Populating AOTs using INSERT statements from external sources
- ETL processing outside DB2, then inserting the result into an AOT
- Populating AOTs from analytics tools such as IBM Campaign or DataStage
- From a DB2 INSERT statement

Example for multiple-row insert using JDBC
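The JDBC code from this slide is not part of the transcript. As a stand-in, the following Python sketch shows the general batching idea: rows are sent in chunks of up to 32000 through the standard DB-API `executemany` call. The helper names and the table `SALES_AOT` are hypothetical, an in-memory `sqlite3` database stands in for a DB2 connection so that the sketch runs anywhere, and whether a given DB2 driver turns `executemany` into a true multiple-row insert is an assumption to verify against that driver.

```python
import sqlite3
from itertools import islice


def batches(rows, batch_size):
    """Yield lists of at most batch_size rows from any iterable."""
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, batch_size))
        if not chunk:
            return
        yield chunk


def insert_in_batches(cursor, table, columns, rows, batch_size=32000):
    """Insert rows in batches and return the number of batches sent.

    table and columns are interpolated into the SQL text, so they must come
    from trusted code, never from user input. The '?' parameter markers
    assume a driver with the 'qmark' parameter style.
    """
    markers = ", ".join("?" for _ in columns)
    sql = f"INSERT INTO {table} ({', '.join(columns)}) VALUES ({markers})"
    sent = 0
    for chunk in batches(rows, batch_size):
        cursor.executemany(sql, chunk)  # one call per batch, not per row
        sent += 1
    return sent


# Demo: sqlite3 is only a placeholder for the real target database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE SALES_AOT (ID INTEGER, NAME TEXT)")
demo_rows = ((i, f"name-{i}") for i in range(7))
calls = insert_in_batches(conn.cursor(), "SALES_AOT", ["ID", "NAME"], demo_rows, batch_size=3)
count = conn.execute("SELECT COUNT(*) FROM SALES_AOT").fetchone()[0]
print(calls, count)
```

With seven rows and a batch size of three, the demo sends three batches (3, 3 and 1 rows) and the table ends up with seven rows.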

Strategy
1. Bring analytics to z data
   - DB2 for z/OS as the control point
   - Deeply integrated and transparent to typical personas that use the database
   - Integration hub for z and non-z data
   - Industry leading: complex query performance, in-database transformation acceleration, archiving
2. Hybrid platform architecture
   - Workload-optimized system of systems
   - Best of breed for each component
   - Hybrid analytics/transactional processing (HTAP)
3. Support hybrid cloud
   - Uniform experience, simultaneous use and easy transition between on-premises, private cloud and public cloud implementations

Planned major features delivered in upcoming V5 PTFs
- HTAP: real-time processing on real-time data, on best-of-breed technologies for transactional and analytical workloads
- Schema change support: adjust the schema on the IDAA side to the changed schema in DB2
- Federated access support: enable read access to all data in one accelerator through a single DB2 system

HTAP: real-time processing on real-time data
- Top-priority theme: Hybrid Transactional/Analytical Processing (HTAP)
- Real-time processing on real-time data, on best-of-breed technologies for transactional and analytical workloads
- The IBM research team has pioneered this technology
- A patented just-in-time currency replication protocol guarantees transactional data coherency for analytical requests
- IBM places very high focus on the ability to use DB2 for z/OS data on the IBM DB2 Analytics Accelerator without recognizable latency and views this as a high-priority theme
- The result is a unique-in-industry, heterogeneous scale-out solution for enterprise-grade HTAP

Basic idea: reading the most recent committed data during asynchronous replication

Figure (flow diagram):
- DB2 receives write requests and OLTP read requests; data is replicated asynchronously from DB2 to IDAA.
- IDAA receives OLAP read requests. For each request: is the most recent committed data required? If yes: is the most recent committed data available? If no: initiate apply and wait for the given time period.

Externals

Introducing a new ZPARM / bind option / special register:
- CURRENT QUERY ACCELERATION WAITFORDATA = n.m
- Range: 0.0 to 3600.0 seconds; 0.0 = no wait; default: 0.0
- 0.0: immediately execute in the accelerator without waiting
- > 0.0: wait until the most recently committed changes made before the query have been applied by asynchronous replication, then run the query
  - or: fail the query if the specified maximum wait time is exceeded
  - or: execute the query in DB2 if WITH FAILBACK is specified in the CURRENT QUERY ACCELERATION special register
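The WAITFORDATA rules can be read as a small decision function. The Python sketch below models them under stated assumptions: `route_query`, its outcome strings, and the `catch_up_seconds` input (how long replication would need to apply the outstanding committed changes) are hypothetical names chosen for illustration, and the real implementation is not described at this level of detail in the slides.

```python
def route_query(waitfordata, catch_up_seconds, with_failback):
    """Model the outcome of one accelerated query (illustration only).

    waitfordata:      value of WAITFORDATA in seconds, 0.0 to 3600.0
    catch_up_seconds: hypothetical time replication needs to apply the
                      changes committed before the query arrived
    with_failback:    True if CURRENT QUERY ACCELERATION includes WITH FAILBACK
    """
    if not 0.0 <= waitfordata <= 3600.0:
        raise ValueError("WAITFORDATA must be between 0.0 and 3600.0 seconds")
    if waitfordata == 0.0:
        # No wait requested: run at once on whatever data has been replicated.
        return "RUN_ON_ACCELERATOR_WITHOUT_WAITING"
    if catch_up_seconds <= waitfordata:
        # Replication catches up within the allowed time.
        return "RUN_ON_ACCELERATOR_AFTER_WAIT"
    # The maximum wait time is exceeded.
    return "RUN_IN_DB2" if with_failback else "FAIL_QUERY"


print(route_query(0.0, 12.0, with_failback=False))
print(route_query(5.0, 2.5, with_failback=False))
print(route_query(5.0, 12.0, with_failback=True))
print(route_query(5.0, 12.0, with_failback=False))
```

The four calls cover the four outcomes in the list: no wait, successful wait, failback to DB2, and query failure.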

Processing flow example: CURRENT QUERY ACCELERATION = ENABLE WITH FAILBACK and the query is routable to IDAA

Schema change support: adding a column to IDAA
Figure: DB2 table with columns ID, Name; IDAA table with columns ID, Name.
2014 IBM Corporation

Adding a column to IDAA
Figure: DB2 table with columns ID, Name, Team; IDAA table with columns ID, Name.
- Queries referencing the new column cannot be offloaded
- The table in IDAA cannot be loaded anymore
- Data changes in DB2 are no longer replicated

Workaround as of today
Figure: DB2 table with columns ID, Name, Team; IDAA table with columns ID, Name.
1. Remove the table from the accelerator
2. Re-add the table to the accelerator
3. Re-load the data of the table
Table outage during the entire process.

Workaround: HPSS schema change
Figure: DB2 table with columns ID, Name, Team; IDAA table with columns ID, Name.
1. Restore data in DB2
2. Remove the table from the accelerator
3. Re-add the table to the accelerator
4. Re-load the data of the table
5. Re-archive data

Adding a column to IDAA with schema change support
Figure: DB2 table with columns ID, Name, Team; IDAA table with columns ID, Name, Team.

Schema change support

After a column has been added to or altered in a DB2 table, the column can be added or altered in the associated table on the Accelerator by executing a synchronization operation.
- It is no longer necessary to remove the table from the Accelerator, re-add it and re-load it.

Supported table types:
- Accelerator-shadow tables (with or without Incremental Update enabled)
- Accelerator-archived tables

Supported schema changes:
- ALTER TABLE ADD COLUMN (adding a nullable column)
- ALTER TABLE ADD COLUMN ... NOT NULL WITH DEFAULT (adding a non-nullable column with a default value). Default values are supported as long as they are explicit. Default values such as TIMESTAMP WITH DEFAULT are not supported, because in this case there is no way to determine the DB2-side value at the time of the ALTER statement.
- ALTER TABLE ALTER COLUMN <name> SET DATA TYPE (limited to increasing the length of a VARCHAR)
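The list of supported schema changes amounts to a small set of rules. The Python sketch below encodes them as a checker; the classes `AddColumn` and `AlterVarcharLength` and the function `can_synchronize` are hypothetical names used for illustration, and the sketch covers only the cases named in the list.

```python
from dataclasses import dataclass


@dataclass
class AddColumn:
    nullable: bool
    has_default: bool = False
    default_is_explicit: bool = False  # False for e.g. TIMESTAMP WITH DEFAULT


@dataclass
class AlterVarcharLength:
    old_length: int
    new_length: int


def can_synchronize(change):
    """Return True if the change is one of the supported kinds listed above."""
    if isinstance(change, AddColumn):
        if change.nullable:
            return True  # ALTER TABLE ADD COLUMN with a nullable column
        # NOT NULL WITH DEFAULT is supported only for explicit default values.
        return change.has_default and change.default_is_explicit
    if isinstance(change, AlterVarcharLength):
        # SET DATA TYPE is limited to increasing the VARCHAR length.
        return change.new_length > change.old_length
    return False  # anything else is treated as unsupported in this sketch


print(can_synchronize(AddColumn(nullable=True)))
print(can_synchronize(AddColumn(nullable=False, has_default=True, default_is_explicit=False)))
print(can_synchronize(AlterVarcharLength(old_length=20, new_length=50)))
```

A check like this could run before a synchronization is attempted, so that unsupported changes fall back to the remove, re-add and re-load procedure.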

Planned interfaces for schema change support
- New stored procedure ACCEL_SYNCHRONIZE_SCHEMA to synchronize the schema and alter the table on the Accelerator
- New Accelerator Studio controls

Agenda
- Level-set: What is IDAA and the recent enhancements
- Big picture: Long-term strategy
- Work in progress: Top themes
  - Hybrid Transactional/Analytical Processing
  - Schema change support
  - Federated access support

Federated access support: what works today
Figure: DB2A and DB2B issuing SELECT statements against IDAA.

Federated access support: what works today
Figure: DB2A and DB2B issuing SELECT statements, including a join, against IDAA.

Federated access support: work in progress
Figure: DB2A and DB2B with SELECT statements and a join on IDAA.

Federated access support: how is it going to work

For the owning DB2 subsystem (DB2O):
- Invoke a new IDAA stored procedure to GRANT read access for tables to other, referencing DB2 subsystems (not a DB2 GRANT):
  ACCEL_GRANT_TABLES_REFERENCE (ACCEL1, DB2R, Set<acceleratedtables>)

For the referencing DB2 subsystem (DB2R):
- Invoke another new IDAA stored procedure to CREATE references to tables of other DB2 subsystems:
  ACCEL_CREATE_REFERENCE_TABLES (ACCEL1, DB2O, Set<acceleratedtable-on-db2o>)
- For each specified table, this stored procedure creates an AOT as a reference table and a synonym that points to the table of the owning subsystem on IDAA
- No data duplication on IDAA, just references (reference AOT and synonym)

Afterwards, the referencing DB2 subsystem DB2R can use the created reference AOT in queries and join it with other tables on IDAA (either its own tables or other reference tables).
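The grant-then-reference flow can be sketched as a toy in-memory model. Everything in this Python example is hypothetical: the `Accelerator` class and its methods only mirror the two planned stored procedures by name and are not their real signatures or behavior. The point illustrated is that a reference resolves to the owner's table instead of copying its data.

```python
class Accelerator:
    """Toy model of one accelerator shared by several DB2 subsystems."""

    def __init__(self, name):
        self.name = name
        self.tables = {}      # (owning subsystem, table) -> rows, stored once
        self.grants = set()   # (owner, referencing subsystem, table)
        self.references = {}  # (referencing subsystem, table) -> (owner, table)

    def grant_tables_reference(self, owner, referencing, tables):
        """Owner side: allow another subsystem to reference accelerated tables."""
        for table in tables:
            if (owner, table) not in self.tables:
                raise KeyError(f"{table} is not accelerated for {owner}")
            self.grants.add((owner, referencing, table))

    def create_reference_tables(self, referencing, owner, tables):
        """Referencing side: create a reference (no data copy) per table."""
        for table in tables:
            if (owner, referencing, table) not in self.grants:
                raise PermissionError(f"{referencing} has no grant on {table}")
            self.references[(referencing, table)] = (owner, table)

    def read(self, subsystem, table):
        """Resolve a reference if one exists, else read the subsystem's own table."""
        key = self.references.get((subsystem, table), (subsystem, table))
        return self.tables[key]


accel = Accelerator("ACCEL1")
accel.tables[("DB2O", "ORDERS")] = [(1, "widget"), (2, "gadget")]
accel.grant_tables_reference("DB2O", "DB2R", ["ORDERS"])
accel.create_reference_tables("DB2R", "DB2O", ["ORDERS"])
shared = accel.read("DB2R", "ORDERS")
print(shared is accel.tables[("DB2O", "ORDERS")])
```

The identity check at the end confirms that DB2R reads the very same stored rows as DB2O, which corresponds to "no data duplication on IDAA" in the list above.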
