Appliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1

Similar documents
Data Warehouse and Data Mining

Data Warehouse and Data Mining

Netezza The Analytics Appliance

CHAPTER 3 Implementation of Data warehouse in Data Mining

Evolving To The Big Data Warehouse

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

Top Trends in DBMS & DW

Microsoft. Designing Business Intelligence Solutions with Microsoft SQL Server 2012

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

VERITAS Storage Foundation 4.0 for Oracle

Data Warehouse and Data Mining

Modern Data Warehouse The New Approach to Azure BI

Data Mining Concepts & Techniques

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Increasing Performance of Existing Oracle RAC up to 10X

Oracle 1Z0-515 Exam Questions & Answers

Sub-Second Response Times with New In-Memory Analytics in MicroStrategy 10. Onur Kahraman

Information Management course

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

SAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine

Proceedings of the IE 2014 International Conference AGILE DATA MODELS

Data Computing Appliance

Evolution of Database Systems

Full file at

SAP HANA Inspirience Day

This presentation is for informational purposes only and may not be incorporated into a contract or agreement.

5 Fundamental Strategies for Building a Data-centered Data Center

Avaya IQ 5.1 Database Server Configuration Recommendations And Oracle Guidelines

Application-Tier In-Memory Analytics Best Practices and Use Cases

Drawing the Big Picture

Tasting the Flavors of Analysis Services 2012

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.

The storage challenges of virtualized environments

DURATION : 03 DAYS. same along with BI tools.

CIS 601 Graduate Seminar. Dr. Sunnie S. Chung Dhruv Patel ( ) Kalpesh Sharma ( )

Microsoft Analytics Platform System (APS)

IBM DB2 Analytics Accelerator use cases

Protect enterprise data, achieve long-term data retention

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus

Postgres Plus and JBoss

The advantages of architecting an open iscsi SAN

Best Practices For Backup And Restore In Sql Server 2005

Oracle #1 for Data Warehousing. Data Warehouses Growing Rapidly Tripling In Size Every Two Years

powered by Cloudian and Veritas

Leveraging Customer Behavioral Data to Drive Revenue the GPU S7456

Teradata Certified Professional Program Teradata V2R5 Certification Guide

Approaching the Petabyte Analytic Database: What I learned

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g

April Copyright 2013 Cloudera Inc. All rights reserved.

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

e-warehousing with SPD

An Overview of Data Warehousing and OLAP Technology

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

VERITAS Storage Foundation for Windows FlashSnap Option

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data

Migrate from Netezza Workload Migration

Call: SAS BI Course Content:35-40hours

Microsoft SQL Server HA and DR with DVX

Four Steps to Unleashing The Full Potential of Your Database

EMC GREENPLUM DATA COMPUTING APPLIANCE: PERFORMANCE AND CAPACITY FOR DATA WAREHOUSING AND BUSINESS INTELLIGENCE

Storage Optimization with Oracle Database 11g

Data-Intensive Distributed Computing

Microsoft SQL Server Training Course Catalogue. Learning Solutions

Modernizing Business Intelligence and Analytics

What is a Data Warehouse?

Solutions for Netezza Performance Issues

Oracle Database 11g: Data Warehousing Fundamentals

HP NonStop Database Solution

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

Data Warehousing Concepts

Data Warehouse and Mining

<Insert Picture Here> MySQL Web Reference Architectures Building Massively Scalable Web Infrastructure

EMC DATA DOMAIN PRODUCT OvERvIEW

The Oracle Database Appliance I/O and Performance Architecture

FlexPod. The Journey to the Cloud. Technical Presentation. Presented Jointly by NetApp and Cisco

DATABASE SCALE WITHOUT LIMITS ON AWS

IBM s Data Warehouse Appliance Offerings

VOLTDB + HP VERTICA. page

Data Sheet: Storage Management Veritas Storage Foundation for Oracle RAC from Symantec Manageability and availability for Oracle RAC databases

Migrating Enterprise BI to Azure

Automated Netezza Migration to Big Data Open Source

DAHA AKILLI BĐR DÜNYA ĐÇĐN BĐLGĐ ALTYAPILARIMIZI DEĞĐŞTĐRECEĞĐZ

Data Warehousing 11g Essentials

Introduction to K2View Fabric

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

ELTMaestro for Spark: Data integration on clusters

Microsoft certified solutions associate

After completing this course, participants will be able to:

EMC Backup and Recovery for Microsoft SQL Server

Veritas Storage Foundation for Oracle RAC from Symantec

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Avaya IQ 5.0 Database Server Configuration Recommendations And Oracle Guidelines

Data Mining & Data Warehouse

Transcription:

Appliances and DW Architecture John O Brien President and Executive Architect Zukeran Technologies 1

OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances fit within your existing DW architecture 2

WHAT IS AN APPLIANCE? General Purpose Multi-purpose Purpose specific 3

AN APPLIANCE IS BORN Compatibility Applications (Operational, BI, DSS) Database Operating System (Windows, Unix, Linux, etc) Computing Resources (, memory) Networking Resources Storage Resources (Persistence) Purpose Purpose Specific Appliance Horizontal versus Vertical Layers When does it flip? 4

PROVEN APPLIANCES Google search appliance Symantec VPN appliance Linksys Wireless router TiVo 5

CHARACTERISTICS OF A GOOD APPLIANCE Clear, specific purpose Compatible with existing infrastructure Ease of installation & use Highly cost effective 6

MAKING A DW SPECIFIC APPLIANCE? Optimize for mixed BI workloads Store and access lots of data Perform analytical queries Support operational report requirements Expand data access to the mainstream DW user Combine Server + Optimization Software + Storage Ensure ease of use Offer low maintenance and TCO Integrate with existing infrastructure BI tools with ANSI SQL Ability to load data easily Compatible with standard operating procedures Build in flexibility to support changing requirements 7

PRE-CONFIGURED vs. APPLIANCE Appliances are not a pre-configuration or assembly of tuned products and technologies An appliance is built for a specific purpose and can leverage commodity parts Appliances require no user serviceable parts Test: Does it matter what s inside the appliance? 8

DW APPLIANCE: HIGHLY COST EFFECTIVE Reduce DBA expertise and administration No tablespaces, extents to manage No archive, redo logs to manage No partitioning management Reduce need for storage expertise SAN storage architects LUNs, meta volumes Relatively low acquisition cost vs. traditional model Server manufacturer annual maintenance Operating System license Storage hardware Efficient power consumption 9

FUTURE DATA WAREHOUSE APPLIANCES? Will we see these specific appliances in the future? Do they meet the criteria for success? ETL or ELT appliance? BI appliance? EII appliance? OLAP appliance? Audit appliance? 10

WHERE DO DW APPLIANCES MAKE SENSE? Source: Inmon Data Systems http://www.inmoncif.com 11

DW APPLIANCE: HIGH PERFORMANCE Massive Parallelism inherent to the architecture Improved I/O throughput rates with effective I/O Ethernet ODBC JDBC Host Database Backplane 12

DW APPLIANCE: LARGE SCALE DATABASES How do you effortlessly scale beyond 2TB as your data warehouse matures? Federated architecture Rapid expansion and data redistribution 13

SCALABILITY: FEDERATED ARCHITECTURE Logical areas of the data warehouse architecture move to new appliances Data marts, ODS, staging Subject areas or conformed dimensions stay together unless the capability to perform database joins across platforms exist EII tool, remote tables Logical groups such as North America DW, Euro DW or corporate entities 14

SCALABILITY: DATA REDISTRIBUTION Adding capacity may cause data to be redistributed Buckets SLA Driven Redistribution 15

SCALABILITY: DATA REDISTRIBUTION Ethernet ODBC JDBC MPP Engine Backplane Existing DW Appliance MPP Engine Backplane Adding new DW Appliance 16

MAINTAINING PERFORMANCE WITH SCALABILITY Ethernet ODBC JDBC MPP Engine Backplane Existing DW Appliance MPP Engine Backplane Adding new DW Appliance 17

DW APPLIANCE: BACKUP AND RECOVERY Built-in RAID mirroring and striping The challenge is that large systems take massive amount of computing and network resources to backup changes in a high volume environment Recommended to compress and save load files since loads are faster than recoveries In a fully mirrored environment, chances of needing to restore from a backup are low Veritas or other standard APIs are available backups 18

DW APPLIANCE: FAST LOAD/UNLOAD Minimum 500GB per hour load rates Near physical limitations disk I/O and Ethernet connection to appliance Utilizes high performance ODBC drivers and special loader utilities 19

IMPACT OF DATA WAREHOUSE APPLIANCES 20

WHAT A FAST DATABASE CAN DO FOR YOU Before ETL tools ETL was hand coded programs ETL was code in database procedural languages ETL tools offered Faster development with 3GL Better performance than database code & SQL High Performance data warehouse appliance Faster queries on large data sets High Performance SQL 21

ELT A PARADIGM SHIFT Transformations typically are performed after extraction and before loading limiting the database workload which is thought to be query intense for data warehouses. High performance data warehouse appliances are being used to transform the data inside the high performance database and the results stored in the appliance or in other databases Some companies point to ETL license savings as part of the ROI for appliances Sunopsis is a tool being used in these cases 22

ROLAP vs. MOLAP Multidimensional database engines were created to make up for the performance deficiencies of relational OLTP databases at that time. MOLAP cubes are known for their high performance interactive capabilities, complex calculations and dimensional and hierarchical analysis but are challenged in: Real-time environments Large datasets High degree of dimensional and hierarchical branches MOLAP cubes are also a duplication a data in another database format, the multidimensional database. ROLAP leverages on the underlying database for performance characteristics ROLAP metrics can be atomic level, pre-calculated, pre-rolled up depending on what works best 23

FOCUS ON BUSINESS VALUE OF DATA Data Architecture and Modeling High performance databases shouldn t make up for poor data analysis and modeling efforts Good Requirements, Analysis and Reports Make sure that the wrong answer doesn t just come back faster 24

WHAT THE BUSINESS WANTS FROM DW On-Demand New products and projects Lower Cost Overall TCO and long term maintainability Flexibility Simpler infrastructure to build and go Able to update as the business needs change New capabilities Scalability Ability to store more data with less cost 25

Questions and Discussion 26