Data Warehouse appliances: IBM Pure Data for Analytics
|
|
- Isabella Carter
- 6 years ago
- Views:
Transcription
1 May 2017 Data Warehouse appliances: IBM Pure Data for Analytics Fabio Bresciani, Cloud & Cognitive, IBM Italia
2 1997 IBM ebusiness 1927 Italy 1956 Data storage industry creation 1969 IBM technology guided Apollo mission to the moon 1981 The IBM PC 2011 IBM Watson 1911 Computing-Tabulating- Recording (CTR) 1924 International Business Machines 1935 Training courses for Women 1944 First machine to handle long calculations automatically 1961 The Selectric Typewriter 1962 First computerdriven airline reservation system 1971 Floppy disk 1969 Magnetic strips on credit cards 1973 UPC bar codes 1986 IBM scientists won the Nobel Prize 1997 Supercomputer defeated the best chess player 2
3 Cosa fa IBM? Analytics Consulting Services Security IBM Technical and Infrastructure Services Research Cloud Commerce Healthcare Systems Internet of Things
4 IBM Research Centri di Ricerca IBM 5.7 B$ in R&D (6% del fatturato) Almaden Austin New York São Paulo/ Rio de Janeiro Dublin Zurich Haifa Delhi/Bengaluru Nairobi Johannesburg Beijing/Shanghai Tokyo Melbourne 13 centri di Ricerca in 6 continenti, fra cui quello di Zurigo guidato dall italiano Alessandro Curioni Per 24 anni consecutivi l impresa leader nei brevetti brevetti U.S. nel premi Nobel master inventor in 43 paesi Concentrati in aree strategiche: Cloud Computing, Analytics, Security Cognitive Computing, Healthcare 4
5 Traditional Data Warehouses are just too complex They do NOT meet the demands of advanced analytics on big data. Too complex an infrastructure Too complicated to deploy Too much tuning required Too long to get answers Too inefficient at analytics Too many people needed to maintain Too costly to operate 5
6 Big Data Floods Traditional Database Systems 6
7 Let s Simplify This Mess 7
8 And Bring Analytics In To The Warehouse 8
9 Netezza Simplicity Legacy RDBMS Create Table - Logical Model Netezza DDL Create Table - Logical Model CREATE TABLE CRRADMIN.OT_ORDER_EVENTS CREATE TABLE CRRADMIN.OT_ORDER_EVENTS ( ( TRADE_DATE DATE NOT NULL, ORIGIN_SYS_CD VARCHAR2(32 BYTE) NOT NULL, TRADE_DATE DATE NOT NULL, ORIGIN_SYS_EVENT_SEQ VARCHAR2(32 BYTE) NOT NULL, ORIGIN_SYS_CD VARCHAR (32) NOT NULL, EVENT_ID Allocate Space NUMBER(9) NOT NULL, ORIGIN_SYS_EVENT_SEQ VARCHAR (32) NOT NULL, EVENT_CLASS_CD VARCHAR2(32 BYTE) NOT NULL, EVENT_ID INTEGER NOT NULL, EVENT_DATETIME TABLESPACE OTR_DATA" DATE LOCAL NOT NULL, EVENT_CLASS_CD VARCHAR (32) NOT NULL, ORIGIN_SYS_REF (PARTITION BY RANGE VARCHAR2(32 (TRADE_DATE) BYTE) NOT ( NULL, EVENT_DATETIME TIMESTAMP NOT NULL, ORIGIN_SYS_PARENT_REF VARCHAR2(32 BYTE), PARTITION P VALUES LESS THAN ( ) ORIGIN_SYS_REF VARCHAR (32) NOT NULL, ORIGIN_SYS_ORDER_REF VARCHAR2(32 BYTE), ORIGIN_SYS_RELATED_REF PCTFREE 10 INITRANS VARCHAR2(32 2 MAXTRANS BYTE), 255 ORIGIN_SYS_PARENT_REF VARCHAR (32), Create Indexes ORIGIN_SYS_GROUP_REF STORAGE(INITIAL VARCHAR2( NEXT BYTE), MINEXTENTS ORIGIN_SYS_ORDER_REF 1 VARCHAR (32), ORIGIN_SYS_DATETIME MAXEXTENTS DATE NOT NULL, ORIGIN_SYS_RELATED_REF VARCHAR (32), CREATE INDEX OTOE_EVENT_ID TRADE_ID PCTINCREASE 0 FREELISTS NUMBER(9), 1 FREELIST GROUPS 1 BUFFER_POOL ORIGIN_SYS_GROUP_REF VARCHAR (32), BASKET_ID DEFAULT, PCTFREE NUMBER(9), ON 10 INITRANS 2 MAXTRANS 255 ORIGIN_SYS_DATETIME TIMESTAMP NOT NULL, ORDER_ID CRRADMIN.OT_ORDER_EVENTS(EVENT_ID) STORAGE(INITIAL NUMBER(9), NEXT MINEXTENTS TRADE_ID 1 INTEGER, BASKET_NAME TABLESPACE VARCHAR2(32 OTR_IDX BYTE), MAXEXTENTS BASKET_ID INTEGER, SQC_SQN NOLOGGING VARCHAR2(20 BYTE), EXECFAC_ID PCTINCREASE 0 FREELISTS NUMBER(9), 1 FREELIST GROUPS 1 BUFFER_POOL ORDER_ID INTEGER, PCTFREE 10 CUSTOMER_REF DEFAULT, INITRANS VARCHAR2(255 2 BYTE), BASKET_NAME VARCHAR (32), Logical INSTRUMENT_ID PARTITION Model Only P NUMBER(9), VALUES LESS THAN ( ) SQC_SQN VARCHAR (20), MAXTRANS 255 No SYMBOL indexes PCTFREE 10 INITRANS VARCHAR2(64 2 MAXTRANS BYTE), 255 EXECFAC_ID INTEGER, STORAGE(BUFFER_POOL DEFAULT) STORAGE(INITIAL NEXT MINEXTENTS CUSTOMER_REF 1 VARCHAR (255), ); No Physical Tuning/Admin NOPARALLEL MAXEXTENTS INSTRUMENT_ID INTEGER, NOCOMPRESS Distribute PCTINCREASE Data by Columns 0 FREELISTS 1 FREELIST GROUPS 1 BUFFER_POOL / or Round Robin SYMBOL VARCHAR (64), DEFAULT, CREATE INDEX OTOE_TRADE_ID PARTITION P VALUES LESS THAN ( ) ) 9 ON PCTFREE 10 INITRANS 2 MAXTRANS 255 DISTRIBUTE ON (ORIGIN_SYS_REF); CRRADMIN.OT_ORDER_EVENTS(TRADE_ID)
10 IBM PureData System for Analytics The Simple Data Warehouse Appliance for Serious Analytics Purpose-built analytics appliance Integrated database, server and storage Standard interfaces Low total cost of ownership What makes it different? Speed x faster than traditional custom systems 1 Simplicity - minimal administration and tuning Scalability - petabyte+ scale user data capacity Smart - high performance, advanced analytics 10
11 Massively Parallel Processing Architecture Divide and conquer MPP Shared Nothing concept Divides the work in smaller tasks A big task is sliced vertically into a series of smaller tasks Benefits The smaller tasks run independently The work is automatically balanced among the tasks to minimize the time to complete Each task is assigner the same amount of physical resources Communication between is made only at the beginning and end of the task A large task completes in a short elapsed time Maximizes use of resources Points of Attention Complexity on administration and management Communication bottlenecks 11
12 Data Warehouse Workload Fewer requests, lots of data manipulation Transactional System used for BI Request Request CPU General Purpose Storage 12
13 Data Warehouse Workload Transaction systems are inefficient for data shuffling Transactional System used for BI Results Request CPU General Purpose Storage 13
14 Data Warehouse Blades Designed for Tera-scale Business Intelligence IBM Pure Data System Results Request CPU Intelligent Storage Asymmetric Massively Parallel Processing 14
15 Data Warehouse Blades Highly efficient data movement IBM Pure Data System Results 2% of CPU requirements 1% of network traffic Request CPU Intelligent Storage Asymmetric Massively Parallel Processing 15
16 Asymmetric Massively Parallel Processing SOLARIS AIX Netezza Appliance Client TRU64 HP-UX WINDOWS LINUX ODBC 3.X JDBC Type 4 OLE-DB SQL/92 SQL Compiler 1 2 S-Blade Processor & streaming DB logic S-Blade Processor & streaming DB logic Query Plan Execution Engine 3 S-Blade Processor & streaming DB logic Source Systems ETL Server DBA CLI 3rd Party Apps High-Speed Loader/Unloader Optimize Admin Front End DBOS SMP Host Network Fabric Ÿ Ÿ Ÿ 920 High-Performance Database Engine Streaming joins, aggregations, sorts S-Blade Processor & streaming DB logic Massively Parallel Intelligent Storage High Performance Loader 16
17 Asymmetric Massively Parallel Processing SOLARIS AIX Netezza TwinFin Appliance Client TRU64 HP-UX Source Systems WINDOWS ETL Server DBA CLI 3rd Party Apps LINUX SQL High-Speed Loader/Unloader SQL Compiler Query Plan Optimize Admin SQL Front End SMP Host Snippets Execution Engine DBOS Network Fabric Ÿ Ÿ Ÿ 920 S-Blade S-Blade S-Blade Processor & streaming DB logic High-Performance Database Engine Streaming joins, aggregations, sorts S-Blade Processor & streaming DB logic Processor & streaming DB logic Processor & streaming DB logic Massively Parallel Intelligent Storage High Performance Loader 17
18 S-Blade Data Stream Processing select DISTRICT, PRODUCTGRP, sum(nrx) from MTHLY_RX_TERR_DATA where MONTH = ' ' and MARKET = and SPECIALTY = 'GASTRO' FPGA Core CPU Core Slice of table MTHLY_RX_TERR_DATA (compressed) 18 Uncompress Project select DISTRICT, PRODUCTGRP, sum(nrx) Restrict, Visibility Complex Joins, Aggs, etc. where MONTH = ' ' and MARKET = and SPECIALTY = 'GASTRO' sum(nrx)
19 Asymmetric Massively Parallel Processing SOLARIS AIX Netezza TwinFin Appliance Client TRU64 HP-UX Source Systems WINDOWS ETL Server DBA CLI 3rd Party Apps LINUX ODBC 3.X JDBC Type 4 OLE-DB SQL/92 High-Speed Loader/Unloader SQL Compiler Query Plan Optimize Admin Front End SMP Host Consolidate Execution Engine DBOS Network Fabric Ÿ Ÿ Ÿ 920 S-Blade S-Blade S-Blade Processor & streaming DB logic High-Performance Database Engine Streaming joins, aggregations, sorts, etc. S-Blade Processor & streaming DB logic Processor & streaming DB logic Processor & streaming DB logic Massively Parallel Intelligent Storage High Performance Loader 19
20 Inside the IBM PureData System for Analytics N3001 Optimized Hardware + Software Hardware accelerated AMPP Purpose-built for high performance analytics Requires no tuning SMP Hosts SQL Compiler Query Plan Optimize Admin Disk Enclosures User data, mirror, swap partitions High speed data streaming Snippet Blades Hardware-based query acceleration with FPGAs Blistering fast results Complex analytics executed as the data streams from disk 20
21 Disk Mirroring and Failover Primary Mirror Temp All user data and temp space mirrored Disk failures transparent to queries and transactions Failed drives automatically regenerated Bad sectors automatically rewritten or relocated 21
22 S-Blade Failover and Query Continuity S-Blades Drives automatically reassigned to remaining S-Blades within a chassis Read-only queries (that have not returned data yet) automatically restarted Transactions and loads interrupted Loads automatically restarted from last successful checkpoint 22
23 23
24 ZoneMap Pure Data's Anti-Index: Automatic Query Acceleration Col 1: Date Col 2: Zip Zone Maps Base Table Data Blocks Indexes are additional structures on disk, derived from the base table to accelerate locating information ZoneMaps are a method within the storage system without the need for add l structures on the disk to indicate where data DOES NOT reside The NPS system > Automatically stores min. & max. values of all integer columns in each file extent > Uses the ZoneMap information to determine if a given extent should be read Zone Maps: 18 out of 48 Extents Read 24 Both indices and ZoneMaps are techniques to avoid full table scans, but Netezza s ZoneMap approach is: Automatic; and Does not require a separate structure to create, tune & maintain
25 25
26 26
27 Response Time Distributions and Performance SPU Node 1 CPU Disk I/O Network Response time is affected by the completion time for all of the SPUs in the AMPP array. A distribution method that distributes data evenly across all SPUs is the single most important factor that can influence overall performance! 27
28 Response Time Hash Distributions and Data Skew SPU Node 1 CPU Disk I/O Network Gender = M or F will distribute all table records on 2 SPUs 6 7 Select a distribution key with unique values and high cardinality 28
29 Response Time Hash Distributions and Processing Skew SPU Node CPU Disk I/O Network Jan Feb Mar Apr May Jun Jul Using a DATE column as the distribution key may distribute rows evenly across all S- Blades. However, most analysis (queries) is performed on a date range. Massive parallel processing won t be achieved when all of the records to be processed for a given date range are located on a single or a few S-Blades) 29
30 Commonly JOINed Tables: Use the Same Distribution Key For tables commonly joined (WHERE clause) use the same column/distribution key used in the JOIN! CREATE TABLE customer ( c_custkey integer, c_name character varying(25), c_address character varying(40), c_nationkey integer, c_phone character(15), c_acctbal numeric(15,2), c_mktsegment character(10), c_comment character varying(117) ) DISTRIBUTE ON ( c_custkey ); CREATE TABLE orders ( o_orderkey integer, o_custkey integer, o_orderstatus character(1), o_totalprice numeric(15,2), o_orderdate date, o_orderpriority character(15), o_clerk character(15), o_shippriority integer, o_comment character varying(79) ) DISTRIBUTE ON ( o_custkey ); 30
31 Impact of Distribution Key on Table Join Performance Identical Distribution Keys CREATE TABLE ORDERS (ORDER_NO, CUST_NO, ) DISTRIBUTE BY HASH (CUST_NO) CREATE TABLE CUSTOMERS (CUST_NO, ) DISTRIBUTE BY HASH (CUST_NO) SELECT FROM ORDERS O, CUSTOMERS C WHERE O.CUST_NO = C.CUST_NO ORDERS Table 100, 1, 135, 4, 190, 4, 222, 8, 118, 6, 149, 7, 206, 3, 282, 11, 112, 2, 168, 5, 174, 12, 211, 2, No data movement is required Join Processing Join Processing Join Processing CUSTOMERS Table 1, 4, 8, 10, 3, 6, 7, 11, 2, 5, 9, 12, 31
32 Impact of Distribution Key on Table Join (cont.) Different Distribution Keys CREATE TABLE ORDERS (ORDER_NO, CUST_NO, ) DISTRIBUTE BY HASH (ORDER_NO) CREATE TABLE CUSTOMERS (CUST_NO, ) DISTRIBUTE BY HASH (CUST_NO) SELECT FROM ORDERS O, CUSTOMERS C WHERE O.CUST_NO = C.CUST_NO ORDERS Table 118, 6, 135, 4, 174, 12, 282, 11, 112, 2, 168, 5, 206, 3, 222, 8, 100, 1, 149, 7, 190, 4, 211, 2, Data shipping Data movement is required Shipped rows Shipped rows Shipped rows Join Processing Join Processing Join Processing CUSTOMERS Table 1, 4, 8, 10, 3, 6, 7, 11, 2, 5, 9, 12, 32
33 Workload Management Workload Management (WLM) provided optional functionality to manage resources and prioritize usage across a diverse multi-user environment to meet the need of mixed user workloads Guaranteed Resource Allocation (GRA) Mechanism to allocate NPS resources among groups of users in a multi-user environment Prioritized Query Execution (PQE) Finer control over resource allocation by extending the notion of query priorities from scheduling to execution Short Query Bias (SQB) Ensures users with short queries receive faster, higher, biased query response time under heavy system workloads Workload Limits (GRA) You can use the JOB MAXIMUM attribute of the group definition to control the number of actively running jobs submitted by that group User Requests Request Queues Power User Minimum Resource Guarantees Departmental User Admin Tasks 33
34 Appliances are easy to monitor 34
35 Traditional storage is not ready for the digital transformation Object storage solves the problems of scale, management and costs BLOCK & FILE Traditional Storage Block storage = fixed size blocks in rigid arrangement, ideal for enterprise databases. File storage - sharing files in hierarchically nested folders, ideal for active documents. OBJECT Storage for unstructured data (photos, videos, audios, ) and big data. Object is data with metadata. Basis for cloud storage, spans geographies. High scalability (seamless, multi-dimensional scaling). Ease of use. Lower cost of operations. 35 Page 35
36 36 Page 36
37 37 Page 37
38 Writing Data to IBM Cloud Object Storage text Original Data Let s store a Video! $ Accesser 1 Objects are sent to the Accesser via the S3 Compatible API or Openstack Swift Compatible API $ 2 Each object is segmented into 4MB segments e.g a 1GB object will be segmented into 250 segments. 4MB 4MB 4MB 4MB 4MB 38 38
39 Writing Data to IBM Cloud Object Storage text $ 4MB 4MB 4MB 4MB Each segment is encrypted and then sliced. $ 4MB 4MB 4MB 4MB Erasure Coding Expansion 4 Erasure coding is used to transform the data into a customizable number of slices 39 39
40 Writing Data to IBM Cloud Object Storage text $ 4MB 4MB 4MB 4MB Each slice is written to a separate storage node. In this example, the storage nodes are geographically dispersed across 3 sites. SITE 1 SITE 2 SITE 3 Storage Nodes SITE 1 SITE 2 SITE 3 SITE 2 SITE
41 Reading Data from IBM Cloud Object Storage text 4MB 4MB 4MB 4MB 4MB Storage Nodes SITE 1 SITE 2 SITE 3 SITE 1 SITE 2 SITE 3 SITE 2 SITE 3 With this 12/7 Information Dispersal Algorithm, a read can still be executed with any five storage nodes being unavailable
42 Reading Data from IBM Cloud Object Storage text $ Storage Nodes SITE 1 SITE 2 SITE 3 SITE 1 SITE 2 SITE 3 SITE 2 SITE 3 Even an entire site outage (plus one additional storage node outage) can be tolerated
43 IBM Cloud Object Storage EFFICIENCY How to build a highly reliable storage system for 1 Petabyte of usable data? RAID 6 + Replication IBM Cloud Object Storage Original 1.20 PB Raw Onsite mirror 1.20 PB Raw Remote copy 1.20 PB Raw 1 PB 3.6 PB x 3.6x 3 FTE Replication/backup Usable Storage Raw Storage 6TB Disks Racks Required Floor Space Ops Staffing Extra Software 1 PB 1.7 PB x 1.7x.5 FTE None $ 70% + TCO Savings Page 43
Netezza The Analytics Appliance
Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for
More informationIBM s Data Warehouse Appliance Offerings
IBM s Data Warehouse Appliance Offerings RChaitanya IBM India Software Labs Agenda 1 IBM Smart Analytics System (D5600) System Overview Technical Architecture Software / Hardware stack details 2 Netezza
More informationData Warehouse Appliance: Main Memory Data Warehouse
Data Warehouse Appliance: Main Memory Data Warehouse Robert Wrembel Poznan University of Technology Institute of Computing Science Robert.Wrembel@cs.put.poznan.pl www.cs.put.poznan.pl/rwrembel SAP Hana
More informationBusiness Analytics in System z: The IBM DB2 Analytics Accelerator Carlos Guardia
Business Analytics in System z: The IBM DB2 Analytics Accelerator Carlos Guardia zim Lead Architect IBM Software Group Business challenges and technology trends Change in business requirements BI/DW is
More informationVendor: IBM. Exam Code: Exam Name: IBM Certified Specialist Netezza Performance Software v6.0. Version: Demo
Vendor: IBM Exam Code: 000-553 Exam Name: IBM Certified Specialist Netezza Performance Software v6.0 Version: Demo QUESTION NO: 1 Which CREATE DATABASE attributes are required? A. The database name. B.
More informationIBM DB2 Analytics Accelerator for z/os, v2.1 Providing extreme performance for complex business analysis
IBM DB2 Analytics Accelerator for z/os, v2.1 Providing extreme performance for complex business analysis Willie Favero IBM Silicon Valley Lab Data Warehousing on System z Swat Team Thursday, March 15,
More informationOn-Disk Bitmap Index Performance in Bizgres 0.9
On-Disk Bitmap Index Performance in Bizgres 0.9 A Greenplum Whitepaper April 2, 2006 Author: Ayush Parashar Performance Engineering Lab Table of Contents 1.0 Summary...1 2.0 Introduction...1 3.0 Performance
More informationIBM PureData System for Analytics The Next Generation. Ralf Götz Client Technical Professional Big Data IBM Deutschland GmbH
IBM PureData System for Analytics The Next Generation Ralf Götz Client Technical Professional Big Data IBM Deutschland GmbH April 19, 2013 The Future of Analytics made easy is already here... The good
More informationDeveloping a Dynamic Mapping to Manage Metadata Changes in Relational Sources
Developing a Dynamic Mapping to Manage Metadata Changes in Relational Sources 1993-2016 Informatica LLC. No part of this document may be reproduced or transmitted in any form, by any means (electronic,
More informationVendor: IBM. Exam Code: C Exam Name: IBM PureData System for Analytics v7.0. Version: Demo
Vendor: IBM Exam Code: C2090-540 Exam Name: IBM PureData System for Analytics v7.0 Version: Demo QUESTION: 1 A SELECT statement spends all its time returning 1 billion rows. What can be done to make this
More informationVendor: IBM. Exam Code: Exam Name: IBM PureData System for Analytics v7.0. Version: Demo
Vendor: IBM Exam Code: 000-540 Exam Name: IBM PureData System for Analytics v7.0 Version: Demo QUESTION 1 A SELECT statement spends all its time returning 1 billion rows. What can be done to make this
More informationOracle Data Pump Internals
Oracle Data Pump Internals Dean Gagne Oracle USA Nashua NH USA Keywords: Oracle Data Pump, export, import, transportable tablespace, parallel, restart Introduction Oracle Data Pump was a new feature introduced
More informationSolutions for Netezza Performance Issues
Solutions for Netezza Performance Issues Vamsi Krishna Parvathaneni Tata Consultancy Services Netezza Architect Netherlands vamsi.parvathaneni@tcs.com Lata Walekar Tata Consultancy Services IBM SW ATU
More informationAgenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache
Databases on AWS 2017 Amazon Web Services, Inc. and its affiliates. All rights served. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon Web Services,
More informationIBM Exam Questions & Answers
IBM 000-540 Exam Questions & Answers Number: 000-540 Passing Score: 800 Time Limit: 120 min File Version: 56.6 http://www.gratisexam.com/ IBM 000-540 Exam Questions & Answers Exam Name: IBM PureData System
More informationAppliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1
Appliances and DW Architecture John O Brien President and Executive Architect Zukeran Technologies 1 OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationExam Questions P
Exam Questions P2090-047 IBM PureData System for Transactions Technical Mastery Test v1 https://www.2passeasy.com/dumps/p2090-047/ 1. A group has a resource allocation maximum of 50% and the job maximum
More informationExam Name: Netezza Platform Software v6
Vendor: IBM Exam Code: 000-553 Exam Name: Netezza Platform Software v6 Version: DEMO 1.Which CREATE DATABASE attributes are required? A. The database name. B. The database name and the redo log file name.
More informationDQpowersuite. Superior Architecture. A Complete Data Integration Package
DQpowersuite Superior Architecture Since its first release in 1995, DQpowersuite has made it easy to access and join distributed enterprise data. DQpowersuite provides an easy-toimplement architecture
More informationWas ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
More informationIBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop
#IDUG IBM Data Retrieval Technologies: RDBMS, BLU, IBM Netezza, and Hadoop Frank C. Fillmore, Jr. The Fillmore Group, Inc. The Baltimore/Washington DB2 Users Group December 11, 2014 Agenda The Fillmore
More informationEvolving To The Big Data Warehouse
Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from
More informationHP NonStop Database Solution
CHOICE - CONFIDENCE - CONSISTENCY HP NonStop Database Solution Marco Sansoni, HP NonStop Business Critical Systems 9 ottobre 2012 Agenda Introduction to HP NonStop platform HP NonStop SQL database solution
More informationOracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years
Extreme Performance HP Oracle Machine & Exadata Storage Server October 16, 2008 Robert Stackowiak Vice President, EPM & Data Warehousing Solutions, Oracle ESG Oracle #1 for Data Warehousing Microsoft 14.8%
More informationIntroduction to Database Services
Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational
More informationWorkload Optimized Systems: The Wheel of Reincarnation. Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013
Workload Optimized Systems: The Wheel of Reincarnation Michael Sporer, Netezza Appliance Hardware Architect 21 April 2013 Outline Definition Technology Minicomputers Prime Workstations Apollo Graphics
More informationP Exam Questions Demo IBM. Exam Questions P
IBM Exam Questions P2090-050 IBM PureData System for Analytics Technical Mast ery Test v1 Version:Demo 1. What is required to troubleshoot a query? A. nzevents and the pg.log. B. nzsql and the ODBC config.
More informationCopyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Oracle Partitioning für Einsteiger Hermann Bär Partitioning Produkt Management 2 Disclaimer The goal is to establish a basic understanding of what can be done with Partitioning I want you to start thinking
More informationVLDB. Partitioning Compression
VLDB Partitioning Compression Oracle Partitioning in Oracle Database 11g Oracle Partitioning Ten Years of Development Core functionality Performance Manageability Oracle8 Range partitioning
More informationFour Steps to Unleashing The Full Potential of Your Database
Four Steps to Unleashing The Full Potential of Your Database This insightful technical guide offers recommendations on selecting a platform that helps unleash the performance of your database. What s the
More informationSecureFiles Migration O R A C L E W H I T E P A P E R F E B R U A R Y
SecureFiles Migration O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 8 Table of Contents Disclaimer 1 Introduction 2 Using SecureFiles 2 Migration Techniques 3 Migration with Online Redefinition
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationOracle Database In-Memory
Oracle Database In-Memory Mark Weber Principal Sales Consultant November 12, 2014 Row Format Databases vs. Column Format Databases Row SALES Transactions run faster on row format Example: Insert or query
More informationFLASHARRAY//M Smart Storage for Cloud IT
FLASHARRAY//M Smart Storage for Cloud IT //M AT A GLANCE PURPOSE-BUILT to power your business: Transactional and analytic databases Virtualization and private cloud Business critical applications Virtual
More informationAutomating Information Lifecycle Management with
Automating Information Lifecycle Management with Oracle Database 2c The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
More informationORACLE 8 OBJECT ORIENTED TECHNOLOGY ORACLE SOFTWARE STRUCTURES SERVER SIDE BACKGROUND PROCESSES DATABASE SERVER AND DATABASE INSTANCE
ORACLE 8 IS ORDBMS HANDLES VLDB - GIGABYTES/TERABYTES 10,000 CONCURRENT USERS PARTITIONED TABLES AND INDEXES SINGLE DATA BLOCK IS INACCESSSIBLE CAN ACCESS MULTIPLE PARTITION IN PARALLEL FOR CONCURRENT
More informationNetezza PureData System Administration Course
Course Length: 2 days CEUs 1.2 AUDIENCE After completion of this course, you should be able to: Administer the IBM PDA/Netezza Install Netezza Client Software Use the Netezza System Interfaces Understand
More informationCAST(HASHBYTES('SHA2_256',(dbo.MULTI_HASH_FNC( tblname', schemaname'))) AS VARBINARY(32));
>Near Real Time Processing >Raphael Klebanov, Customer Experience at WhereScape USA >Definitions 1. Real-time Business Intelligence is the process of delivering business intelligence (BI) or information
More informationHive and Shark. Amir H. Payberah. Amirkabir University of Technology (Tehran Polytechnic)
Hive and Shark Amir H. Payberah amir@sics.se Amirkabir University of Technology (Tehran Polytechnic) Amir H. Payberah (Tehran Polytechnic) Hive and Shark 1393/8/19 1 / 45 Motivation MapReduce is hard to
More informationAll-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP
All-Flash High-Performance SAN/NAS Solutions for Virtualization & OLTP All-flash configurations are designed to deliver maximum IOPS and throughput numbers for mission critical workloads and applicati
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More informationData Warehousing 11g Essentials
Oracle 1z0-515 Data Warehousing 11g Essentials Version: 6.0 QUESTION NO: 1 Indentify the true statement about REF partitions. A. REF partitions have no impact on partition-wise joins. B. Changes to partitioning
More informationDATABASE SCALE WITHOUT LIMITS ON AWS
The move to cloud computing is changing the face of the computer industry, and at the heart of this change is elastic computing. Modern applications now have diverse and demanding requirements that leverage
More informationDeploying an IBM Industry Data Model on an IBM Netezza data warehouse appliance
Deploying an IBM Industry Data Model on an IBM Netezza data warehouse appliance Whitepaper Page 2 About This Paper Contents Introduction Page 3 Transforming the Logical Data Model to a Physical Data Model
More informationRickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers
Rickard Linck Client Technical Professional Core Database and Lifecycle Management Common Analytic Engine Cloud Data Servers On-Premise Data Servers Watson Data Platform Reference Architecture Business
More informationEMC VPLEX Geo with Quantum StorNext
White Paper Application Enabled Collaboration Abstract The EMC VPLEX Geo storage federation solution, together with Quantum StorNext file system, enables a global clustered File System solution where remote
More informationIBM Db2 Analytics Accelerator Version 7.1
IBM Db2 Analytics Accelerator Version 7.1 Delivering new flexible, integrated deployment options Overview Ute Baumbach (bmb@de.ibm.com) 1 IBM Z Analytics Keep your data in place a different approach to
More informationAutomated Netezza Migration to Big Data Open Source
Automated Netezza Migration to Big Data Open Source CASE STUDY Client Overview Our client is one of the largest cable companies in the world*, offering a wide range of services including basic cable, digital
More informationData Analytics using MapReduce framework for DB2's Large Scale XML Data Processing
IBM Software Group Data Analytics using MapReduce framework for DB2's Large Scale XML Data Processing George Wang Lead Software Egnineer, DB2 for z/os IBM 2014 IBM Corporation Disclaimer and Trademarks
More informationVOLTDB + HP VERTICA. page
VOLTDB + HP VERTICA ARCHITECTURE FOR FAST AND BIG DATA ARCHITECTURE FOR FAST + BIG DATA FAST DATA Fast Serve Analytics BIG DATA BI Reporting Fast Operational Database Streaming Analytics Columnar Analytics
More informationIBM System Storage DCS3700
IBM System Storage DCS3700 Maximize performance, scalability and storage density at an affordable price Highlights Gain fast, highly dense storage capabilities at an affordable price Deliver simplified
More informationWelcome to the presentation. Thank you for taking your time for being here.
Welcome to the presentation. Thank you for taking your time for being here. In this presentation, my goal is to share with you 10 practical points that a single partitioned DBA needs to know to get head
More informationIn-Memory Data Management Jens Krueger
In-Memory Data Management Jens Krueger Enterprise Platform and Integration Concepts Hasso Plattner Intitute OLTP vs. OLAP 2 Online Transaction Processing (OLTP) Organized in rows Online Analytical Processing
More informationMicrosoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud
Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions
More informationData Sheet: Storage Management Veritas Storage Foundation for Oracle RAC from Symantec Manageability and availability for Oracle RAC databases
Manageability and availability for Oracle RAC databases Overview Veritas Storage Foundation for Oracle RAC from Symantec offers a proven solution to help customers implement and manage highly available
More informationEMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH
White Paper EMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH A Detailed Review EMC SOLUTIONS GROUP Abstract This white paper discusses the features, benefits, and use of Aginity Workbench for EMC
More informationCloud Computing & Visualization
Cloud Computing & Visualization Workflows Distributed Computation with Spark Data Warehousing with Redshift Visualization with Tableau #FIUSCIS School of Computing & Information Sciences, Florida International
More informationExperience the GRID Today with Oracle9i RAC
1 Experience the GRID Today with Oracle9i RAC Shig Hiura Pre-Sales Engineer Shig_Hiura@etagon.com 2 Agenda Introduction What is the Grid The Database Grid Oracle9i RAC Technology 10g vs. 9iR2 Comparison
More informationIntroduction to Hadoop. Owen O Malley Yahoo!, Grid Team
Introduction to Hadoop Owen O Malley Yahoo!, Grid Team owen@yahoo-inc.com Who Am I? Yahoo! Architect on Hadoop Map/Reduce Design, review, and implement features in Hadoop Working on Hadoop full time since
More information<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure
MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure Mario Beck (mario.beck@oracle.com) Principal Sales Consultant MySQL Session Agenda Requirements for
More informationSAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine
SAP IQ Software16, Edge Edition The Affordable High Performance Analytical Database Engine Agenda Agenda Introduction to Dobler Consulting Today s Data Challenges Overview of SAP IQ 16, Edge Edition SAP
More informationCopyright 2012, Oracle and/or its affiliates. All rights reserved.
1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle
More informationScality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale
Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale What You Will Learn Cisco and Scality provide a joint solution for storing and protecting file, object, and OpenStack data at
More informationBest Practices. How DB2 Performance Structures Improve Performance. DB2 for z/os. Sheryl M. Larsen IBM WW DB2 for z/os Evangelist
DB2 for z/os Best Practices How DB2 Performance Structures Improve Performance Sheryl M. Larsen IBM WW DB2 for z/os Evangelist smlarsen@us.ibm.com Sheryl M. Larsen smlarsen@us.ibm.com Sheryl Larsen is
More informationOracle GoldenGate and Oracle Streams: The Future of Oracle Replication and Data Integration
Oracle GoldenGate and Oracle Streams: The Future of Oracle Replication and Data Integration Sachin Chawla, Ali Kutay, Juan Loaiza, Hasan Rizvi Oracle Corporation The following is intended to outline our
More informationScalable Analytics: IBM System z Approach
Namik Hrle IBM Distinguished Engineer hrle@de.ibm.com Scalable Analytics: IBM System z Approach Symposium on Scalable Analytics - Industry meets Academia FGDB 2012 FG Datenbanksysteme der Gesellschaft
More informationOLAP Introduction and Overview
1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata
More informationDURATION : 03 DAYS. same along with BI tools.
AWS REDSHIFT TRAINING MILDAIN DURATION : 03 DAYS To benefit from this Amazon Redshift Training course from mildain, you will need to have basic IT application development and deployment concepts, and good
More informationXTREMIO: TRANSFORMING APPLICATIONS, ENABLING THE AGILE DATA CENTER
1 XTREMIO: TRANSFORMING APPLICATIONS, ENABLING THE AGILE DATA CENTER MAX FISHMAN XTREMIO PRODUCT MANAGEMENT 2 THE ALL FLASH ARRAY REVOLUTION ALL FLASH ARRAY 3 XTREMIO ENABLES THE AGILE DATA CENTER 10%
More informationCopyright 2013, Oracle and/or its affiliates. All rights reserved.
2 Copyright 23, Oracle and/or its affiliates. All rights reserved. Oracle Database 2c Heat Map, Automatic Data Optimization & In-Database Archiving Platform Technology Solutions Oracle Database Server
More informationSQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024
Current support level End Mainstream End Extended SQL Server 2005 SQL Server 2008 and 2008 R2 SQL Server 2012 SQL Server 2005 SP4 is in extended support, which ends on April 12, 2016 SQL Server 2008 and
More informationTeradata Analyst Pack More Power to Analyze and Tune Your Data Warehouse for Optimal Performance
Data Warehousing > Tools & Utilities Teradata Analyst Pack More Power to Analyze and Tune Your Data Warehouse for Optimal Performance By: Rod Vandervort, Jeff Shelton, and Louis Burger Table of Contents
More informationOracle #1 RDBMS Vendor
Oracle #1 RDBMS Vendor IBM 20.7% Microsoft 18.1% Other 12.6% Oracle 48.6% Source: Gartner DataQuest July 2008, based on Total Software Revenue Oracle 2 Continuous Innovation Oracle 11g Exadata Storage
More informationDatabase Acceleration Solution Using FPGAs and Integrated Flash Storage
Database Acceleration Solution Using FPGAs and Integrated Flash Storage HK Verma, Xilinx Inc. August 2017 1 FPGA Analytics in Flash Storage System In-memory or Flash storage based DB reduce disk access
More informationHIGH PERFORMANCE SANLESS CLUSTERING THE POWER OF FUSION-IO THE PROTECTION OF SIOS
HIGH PERFORMANCE SANLESS CLUSTERING THE POWER OF FUSION-IO THE PROTECTION OF SIOS Proven Companies and Products Fusion-io Leader in PCIe enterprise flash platforms Accelerates mission-critical applications
More informationSão Paulo. August,
São Paulo August, 28 2018 A Modernização das Soluções de Armazeamento e Proteção de Dados DellEMC Mateus Pereira Systems Engineer, DellEMC mateus.pereira@dell.com Need for Transformation 81% of customers
More informationQLIK INTEGRATION WITH AMAZON REDSHIFT
QLIK INTEGRATION WITH AMAZON REDSHIFT Qlik Partner Engineering Created August 2016, last updated March 2017 Contents Introduction... 2 About Amazon Web Services (AWS)... 2 About Amazon Redshift... 2 Qlik
More informationData Transformation and Migration in Polystores
Data Transformation and Migration in Polystores Adam Dziedzic, Aaron Elmore & Michael Stonebraker September 15th, 2016 Agenda Data Migration for Polystores: What & Why? How? Acceleration of physical data
More informationGreenplum Architecture Class Outline
Greenplum Architecture Class Outline Introduction to the Greenplum Architecture What is Parallel Processing? The Basics of a Single Computer Data in Memory is Fast as Lightning Parallel Processing Of Data
More informationOracle 1Z0-515 Exam Questions & Answers
Oracle 1Z0-515 Exam Questions & Answers Number: 1Z0-515 Passing Score: 800 Time Limit: 120 min File Version: 38.7 http://www.gratisexam.com/ Oracle 1Z0-515 Exam Questions & Answers Exam Name: Data Warehousing
More informationIntroduction to K2View Fabric
Introduction to K2View Fabric 1 Introduction to K2View Fabric Overview In every industry, the amount of data being created and consumed on a daily basis is growing exponentially. Enterprises are struggling
More informationGreenplum Database 4.0: Critical Mass Innovation. Architecture White Paper August 2010
Greenplum Database 4.0: Critical Mass Innovation Architecture White Paper August 2010 Greenplum Database 4.0: Critical Mass Innovation Table of Contents Meeting the Challenges of a Data-Driven World 2
More informationTop Trends in DBMS & DW
Oracle Top Trends in DBMS & DW Noel Yuhanna Principal Analyst Forrester Research Trend #1: Proliferation of data Data doubles every 18-24 months for critical Apps, for some its every 6 months Terabyte
More informationIBM DB2 Analytics Accelerator
June, 2017 IBM DB2 Analytics Accelerator DB2 Analytics Accelerator for z/os on Cloud for z/os Update Peter Bendel IBM STSM Disclaimer IBM s statements regarding its plans, directions, and intent are subject
More informationNetezza Basics Class Outline
Netezza Basics Class Outline CoffingDW education has been customized for every customer for the past 20 years. Our classes can be taught either on site or remotely via the internet. Education Contact:
More informationFAST SQL SERVER BACKUP AND RESTORE
WHITE PAPER FAST SQL SERVER BACKUP AND RESTORE WITH PURE STORAGE TABLE OF CONTENTS EXECUTIVE OVERVIEW... 3 GOALS AND OBJECTIVES... 3 AUDIENCE... 3 PURE STORAGE INTRODUCTION... 4 SOLUTION SUMMARY... 4 FLASHBLADE
More informationCrescando: Predictable Performance for Unpredictable Workloads
Crescando: Predictable Performance for Unpredictable Workloads G. Alonso, D. Fauser, G. Giannikis, D. Kossmann, J. Meyer, P. Unterbrunner Amadeus S.A. ETH Zurich, Systems Group (Funded by Enterprise Computing
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More informationOracle Exadata: Strategy and Roadmap
Oracle Exadata: Strategy and Roadmap - New Technologies, Cloud, and On-Premises Juan Loaiza Senior Vice President, Database Systems Technologies, Oracle Safe Harbor Statement The following is intended
More informationCombine Native SQL Flexibility with SAP HANA Platform Performance and Tools
SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been
More informationFLASHARRAY//M Business and IT Transformation in 3U
FLASHARRAY//M Business and IT Transformation in 3U TRANSFORM IT Who knew that moving to all-flash storage could help reduce the cost of IT? FlashArray//m makes server and workload investments more productive,
More informationInternational Journal of Computer Engineering and Applications,
EXADATA DATABASE MACHINE: CLOUD IMPLEMENTATION- IN A BOX: Lecturer, Dept of Information technology Ranchi Women s College, Ranchi-834001, Jharkhand. dollyviv@gmail.com ABSTRACT: After remaining happy with
More informationAn Oracle White Paper November A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server
An Oracle White Paper November 2012 A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server Introduction... 2 Exadata Product Family... 4 The Exadata Engineered System...
More informationSAP IQ - Business Intelligence and vertical data processing with 8 GB RAM or less
SAP IQ - Business Intelligence and vertical data processing with 8 GB RAM or less Dipl.- Inform. Volker Stöffler Volker.Stoeffler@DB-TecKnowledgy.info Public Agenda Introduction: What is SAP IQ - in a
More informationOracle EXAM - 1Z Oracle Exadata Database Machine Administration, Software Release 11.x Exam. Buy Full Product
Oracle EXAM - 1Z0-027 Oracle Exadata Database Machine Administration, Software Release 11.x Exam Buy Full Product http://www.examskey.com/1z0-027.html Examskey Oracle 1Z0-027 exam demo product is here
More informationVeritas Storage Foundation for Oracle RAC from Symantec
Veritas Storage Foundation for Oracle RAC from Symantec Manageability, performance and availability for Oracle RAC databases Data Sheet: Storage Management Overviewview offers a proven solution to help
More informationUnderstanding Oracle RAC ( ) Internals: The Cache Fusion Edition
Understanding (12.1.0.2) Internals: The Cache Fusion Edition Subtitle Markus Michalewicz Director of Product Management Oracle Real Application Clusters (RAC) November 19th, 2014 @OracleRACpm http://www.linkedin.com/in/markusmichalewicz
More informationOracle Exadata: The World s Fastest Database Machine
10 th of November Sheraton Hotel, Sofia Oracle Exadata: The World s Fastest Database Machine Daniela Milanova Oracle Sales Consultant Oracle Exadata Database Machine One architecture for Data Warehousing
More informationApproaching the Petabyte Analytic Database: What I learned
Disclaimer This document is for informational purposes only and is subject to change at any time without notice. The information in this document is proprietary to Actian and no part of this document may
More information