Sub-Second Response Times with New In-Memory Analytics in MicroStrategy 10. Onur Kahraman

Similar documents
PRIME Deep Dive. Introduction... 1 Data Model and Load... 2 OLAP Cubes... 2 MTDI Cubes... 5

How to Deploy Enterprise Analytics Applications With SAP BW and SAP HANA

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.

Guide Users along Information Pathways and Surf through the Data

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data

Accelerating BI on Hadoop: Full-Scan, Cubes or Indexes?

Hadoop 2.x Core: YARN, Tez, and Spark. Hortonworks Inc All Rights Reserved

Appliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1

SAP HANA. Jake Klein/ SVP SAP HANA June, 2013

Approaching the Petabyte Analytic Database: What I learned

Safe Harbor Statement

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

SAP BW and MicroStrategy

Performance Issue : More than 30 sec to load. Design OK, No complex calculation. 7 tables joined, 500+ millions rows

Part 1: Indexes for Big Data

Case Study: Tata Communications Delivering a Truly Interactive Business Intelligence Experience on a Large Multi-Tenant Hadoop Cluster

SAP HANA Scalability. SAP HANA Development Team

Designing dashboards for performance. Reference deck

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

Processing a Trillion Cells per Mouse Click

In-memory Analytics Guide

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

Step-by-step data transformation

HANA Performance. Efficient Speed and Scale-out for Real-time BI

Accessibility Features in the SAS Intelligence Platform Products

BI4Dynamics NAV Installation Requirements

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

SAP IQ Software16, Edge Edition. The Affordable High Performance Analytical Database Engine

HYRISE In-Memory Storage Engine

VOLTDB + HP VERTICA. page

IDS V11.50 and Informix Warehouse Feature V11.50 Offerings Packaging

Interactive SQL-on-Hadoop from Impala to Hive/Tez to Spark SQL to JethroData

Apache Kylin. OLAP on Hadoop

BI4Dynamics NAV Installation Requirements

A Comparison of Memory Usage and CPU Utilization in Column-Based Database Architecture vs. Row-Based Database Architecture

Microsoft Analytics Platform System (APS)

Dremel: Interactice Analysis of Web-Scale Datasets

Microsoft certified solutions associate

SQL Server Analysis Services

Lenovo Database Configuration

Optimize OLAP & Business Analytics Performance with Oracle 12c In-Memory Database Option

Leveraging Customer Behavioral Data to Drive Revenue the GPU S7456

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

TM Why Upgrade?

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

Crescando: Predictable Performance for Unpredictable Workloads

Cognos Dynamic Cubes

Increasing Performance of Existing Oracle RAC up to 10X

BI4Dynamics AX Installation Requirements

Přehled novinek v SQL Server 2016

Evolution of Database Systems

IBM Security QRadar Deployment Intelligence app IBM

SSAS Multidimensional vs. SSAS Tabular Which one do I choose?

Data-Intensive Distributed Computing

SAP Crystal Reports and SAP HANA: Options and Opportunities (0301)

BI ENVIRONMENT PLANNING GUIDE

DATABASE SCALE WITHOUT LIMITS ON AWS

OLAP Introduction and Overview

Ingo Brenckmann Jochen Kirsten Storage Technology Strategists SAS EMEA Copyright 2003, SAS Institute Inc. All rights reserved.

Advanced Database Technologies NoSQL: Not only SQL

After completing this course, participants will be able to:

Dremel: Interac-ve Analysis of Web- Scale Datasets

Power BI Architecture

Performance Tuning in SAP BI 7.0

Teradata Aggregate Designer

Optimize OLAP & Business Analytics Performance with Oracle 12c In-Memory Database Option

Two Success Stories - Optimised Real-Time Reporting with BI Apps

Performance Benchmark and Capacity Planning. Version: 7.3

Bringing Data to Life

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework

QLIK INTEGRATION WITH AMAZON REDSHIFT

Qlik Sense Performance Benchmark

CISC 7610 Lecture 2b The beginnings of NoSQL

Fast and Efficient A/B Testing Analysis with Shiny and SQL. Charlie Thompson Storyblocks

Utilizing Databases in Grid Engine 6.0

Cluster-Based Computing

Stream Processing Platforms Storm, Spark,.. Batch Processing Platforms MapReduce, SparkSQL, BigQuery, Hive, Cypher,...

Lenovo Database Configuration for Microsoft SQL Server TB

Stream Processing Platforms Storm, Spark,.. Batch Processing Platforms MapReduce, SparkSQL, BigQuery, Hive, Cypher,...

Data contains value and knowledge

Creating Connection With Hive. Version: 16.0

SAP NetWeaver BW Performance on IBM i: Comparing SAP BW Aggregates, IBM i DB2 MQTs and SAP BW Accelerator

<Insert Picture Here> MySQL Cluster What are we working on

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20

Advanced Data Management Technologies Written Exam

The Reality of Qlik and Big Data. Chris Larsen Q3 2016

Modern Data Warehouse The New Approach to Azure BI

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Revolutionizing Data Warehousing in Telecom with the Vertica Analytic Database

An Overview of Data Warehousing and OLAP Technology

Achieving Horizontal Scalability. Alain Houf Sales Engineer

In-Memory Data Management Jens Krueger

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

Performance and Scalability Overview

Welcome to the Era of Open Analytics

Fattane Zarrinkalam کارگاه ساالنه آزمایشگاه فناوری وب

Built for Speed: Comparing Panoply and Amazon Redshift Rendering Performance Utilizing Tableau Visualizations

Using Tableau to report on SharePoint lists and libraries: A step by step guide

Transcription:

Sub-Second Response Times with New In-Memory Analytics in MicroStrategy 10 Onur Kahraman

High Performance Is No Longer A Nice To Have In Analytical Applications Users expect Google Like performance from analytic applications, especially on mobile devices Exploding data volumes & variety require In- Memory consolidation and aggregation Modern analytical applications contain 100 s of viz, distributed to 1000 s of users daily Drivers of High Performance Drastic drop in cost of memory combined with parallel processing delivers cost effective performance 2

MicroStrategy s New In-Memory Architecture Combines 3 Breakthroughs In-Memory Data Store Massively Parallel Processing on Commodity Hardware Look-Ahead Analytics Integrated Data and Visualization Layers Interactive Exploration of Terabyte Datasets by 100,000s of Users 3

Parallel Partitioned In-Memory Cubes Parallel Relational In-memory Engine Linear scalability to 1000s of CPUS Flexible schema and partitioned data 3x to 10x faster 7x to 20x more users Tightly-coupled interactive exploration Parallel data connections for higher fetch rate. Parallel rendering of visualizations from in-memory cubes. Much more flexible cube schema. No unnecessary prejoins. Better memory management Highly improved inmemory layer. Support more than 2B rows per cube. Spread data into multiple 2B chunks Tightly coupled inmemory engine with visualization engine. Fast response times for applications. 4

MicroStrategy Parallel Partitioned Cubes Co-exists With Existing Databases Parallel Partitioned Cubes Does not replace databases Functions as Hot data layer for apps requiring high performance Drill through to databases for detail Load from databases or directly from files and Hadoop SOURCE DATA

Massively Parallel Processing On Commodity Hardware Traditional BI Parallel Execution with MSTR 10 Query Engines Bottleneck Parallel Execution Shared Memory Distributed Data Memory Memory Memory Parallel Query execution and loading Inexpensive Commodity Hardware

Look Ahead Analytics Tightly Integrated Data & Visualization Layers Traditional BI Look Ahead Analytics Visualization Layer Analytics layer optimizes queries for data Visualization Layer Loosely Coupled Data Layer Data layer analyzes dashboard and optimizes structures Data Layer Data layer has no knowledge of analytics layer design Connections Optimized for the lowest common denominator Tightly integrated layers enable optimization Analytics layer globally optimizes queries sent to data based on data structures Data layer looks ahead and plans based on knowledge of dashboard

Faster Data Loads with Parallel Partitioned In-Memory Cubes Bottleneck Intelligent cubes take really long time because ODBC single thread data fetch is too slow. VISUALIZATION API Data can now be loaded in parallel leading to faster cube publishing. Application Engines Tune the number of connections through VLDB. Default being 2. Analytics Engines Analytics Engines DATA DATA DATA DATA DATA 8

Broader Analytical Capabilities with Parallel Partitioned In-Memory Cubes Bottleneck Subset reports were limited to single pass analytics Parallel partitioning will support the generation of multi pass CSI to be able to support full range of analytics and be at par with SQL Engine. Multi pass analytics will include support for metric qualifications, relationship filters etc. 9

Larger Data Volumes with Parallel Partitioned In-Memory Cubes Bottleneck Data limitation of 2B rows per cube. Overcome the 2B row limitation by enabling data partitioning across several cores of a CPU and to be able to split the data within the cube into multiple 2B chunks. Leverage existing CPU cores more efficiently. 9.4.1 OLAP Cube 2B MSTR 10 Parallel Partitioned Cube. 2B 2B 2B. DATA DATA DATA DATA DATA. 1 2 3. 16 1 2 3. 16 16 core CPU 16 core CPU 10

In-Memory OLAP Services vs. Parallel Partitioned In-Memory Cubes Access the Database With Higher Throughout Create and Publish the Cube With Higher Data Scalability Analyze the Data With Faster Response Time 9.4.1: OLAP Services Data: 5M rows Fetch Rate: 5074kb/sec 9.4.1: OLAP Services Data: 2.35B rows Failed due to 2 billion row limit 9.4.1: OLAP Services Data: 8M rows Response Time: 0:06:33 MSTR 10: 8Thread Parallel Load Data: 5M rows Fetch Rate: 22454kb/sec MSTR 10: 8Thread Partitioning Data: 2.35B rows Publish time: 5:14:23 Cube size: 265GB MSTR 10: 8Thread Parallel Access Data: 8M rows Response Time: 0:04:25 Upload data 4 times faster Increase the data scalability up to 80 times 50% Faster Data Interactions 11

In-Memory OLAP Services vs. Parallel Partitioned In-Memory Cubes Parallel non-partitioned cube has up to 20% higher capacity than OLAP Services Average Response Time (sec) 0 1 2 3 4 5 0 5 10 15 Power Rating (KiloCycles) Average Response Time (sec) 1 2 3 4 5 6 7 0 5 10 Power Rating (KiloCycles) MSTR 10 Cube 8-partition on 8-core Linux MSTR 10 Cube 1-partition on 8-core Linux MSTR 10 Cube non-partition on 8-core Linux MSTR 10 Cube 8-partition on 8-core Win64 MSTR 10 Cube 1-partition on 8-core Win64 MSTR 10 Cube non-partition on 8-core Win64 12

In-Memory OLAP Services vs. Parallel Partitioned In-Memory Cubes 35 490 30 390 Response Time (sec) 25 20 15 10 5 Response Time (sec) 290 190 90 0 R1 R3 R5 R7 R9 R11 R13 R15 R17 R19 R21 R23 R25 Customer A Reports 9.4.1 Cube MSTR10 Cube Non Partition MSTR10 Cube 32-Partition -10 R1 R3 R5 R7 R9 R11 R13 R15 R17 R19 R21 R23 R25 R27 R29 Customer B Reports 9.4.1 Cube MSTR10 Cube Non Partition MSTR10 Cube 32-Partition The partitioned in-memory cube has significant performance gain in almost all cases for both Customer A and B Non partitioned and OLAP cubes have almost similar response time 13