Guided Analytics with large Detail Data

Similar documents
Guide Users along Information Pathways and Surf through the Data

QLIKVIEW SCALABILITY BENCHMARK WHITE PAPER

Case Study: Tata Communications Delivering a Truly Interactive Business Intelligence Experience on a Large Multi-Tenant Hadoop Cluster

Built for Speed: Comparing Panoply and Amazon Redshift Rendering Performance Utilizing Tableau Visualizations

Microsoft Exam

SMP User Manual Sales, Marketing and Information Services

Excel and Tableau. A Beautiful Partnership. Faye Satta, Senior Technical Writer Eriel Ross, Technical Writer

HP Integration with Incorta: Connection Guide. HP Vertica Analytic Database

DATA VISUALIZATION Prepare the data for visualization Data presentation architecture (DPA) is a skill-set that seeks to identify, locate, manipulate,

TUTORIAL Get Started with Tableau Desktop

Data Mining Concepts & Techniques

Magento U. Getting Started with Magento Business Intelligence Essentials

Business Analytics Enhancements

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

Welcome to the Introduction to the Modeling Process for SAP Business One, version for SAP HANA.

The Definitive Guide to Preparing Your Data for Tableau

Management Information Systems Review Questions. Chapter 6 Foundations of Business Intelligence: Databases and Information Management

SERVER FOR PRESCRIBING INFORMATION REPORTING AND ANALYSIS (SPIRA) USER GUIDE CONTENTS

Excel Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts. 6. You can use a table s sizing handle to add columns or rows to a table.

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Information Database Database System Data 1. Known facts that can be recorded and have an implicit meaning.

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

Intermediate Tableau Public Workshop

UAccess ANALYTICS. Fundamentals of Reporting. updated v.1.00

The Seven Steps to Implement DataOps

Welcome to the SAP Business One on HANA release 9.1 delta training on creating advanced dashboards. An advanced dashboard is a type of supplementary

Getting Started Guide

Hands-On Lab. Developing BI Applications. Lab version: Last updated: 2/23/2011

2. How Metric Insights gets data from Tableau Server

UNIT

STATISTICAL TECHNIQUES. Interpreting Basic Statistical Values

VLOOKUP vs. SUMIFS. Battle of the Excel Heavyweights. made with

Step-by-step data transformation

SOFTWARE ENGINEERING Prof.N.L.Sarda Computer Science & Engineering IIT Bombay. Lecture #10 Process Modelling DFD, Function Decomp (Part 2)

Step 5 - Scan Applications. Step 5 - Scan Applications

Data Explorer in Pentaho Data Integration (PDI)

Application Architectures - Introduction

Power BI Desktop Lab

Getting Started Guide. Sage MAS Intelligence 500

Data Validation Option Best Practices

Tableau Online. Understanding Data Updates

The Kanban Course. Workbook

Basics of Dimensional Modeling

1 Lecture 5: Advanced Data Structures

The Kanban Course. Workbook

STRAIGHT LINE GRAPHS THE COORDINATES OF A POINT. The coordinates of any point are written as an ordered pair (x, y)

Introduction to MDDBs

Ultimate, Real-Time Mail Tracking and Notifications

DATA 301 Introduction to Data Analytics Visualization. Dr. Ramon Lawrence University of British Columbia Okanagan

NeoMedia s Patent on Using Identification Code to Access Networked Computers. U.S. Patent No. 6,199,048

Adding Information to a Worksheet

33 Exploring Report Data using Drill Mode

How to integrate data into Tableau

Data Analytics at Logitech Snowflake + Tableau = #Winning

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex

1/12/2012. Jim Williams HONP-112 Week 3

Performance Issue : More than 30 sec to load. Design OK, No complex calculation. 7 tables joined, 500+ millions rows

1. Data Table. 1.1 The One-Variable Data Table

SAP HANA Scalability. SAP HANA Development Team

A Beginner s Guide to Successful Marketing

The strategic advantage of OLAP and multidimensional analysis

Using FIS & The Federal Funds Interest Calculator (for Excel 2007)

Data Mining & Analytics Data Mining Reference Model Data Warehouse Legal and Ethical Issues. Slides by Michael Hahsler

In this unit we are continuing our discussion of IT security measures.

Rocky Mountain Technology Ventures

Detailed instructions for video analysis using Logger Pro.

Six Core Data Wrangling Activities. An introductory guide to data wrangling with Trifacta

Professional Edition Tutorial: Excel Spreadsheets

v.5 General Ledger: Best Practices (Course #V221)

Destiny. Understanding Roles and Assigning Permissions Webinar. Participant Guide

Data Warehousing. Overview

You Can t Move Forward Unless You Can Roll Back. By: Michael Black

Tableau COURSE CONTENT

Excel Template Instructions for the Glo-Brite Payroll Project (Using Excel 2010 or 2013)

Sisulizer Three simple steps to localize

Pegasus Opera II. Hints and Tips (2) From AMA Business Systems Ltd Tech Support Team

Microsoft Access. Note: This slideshow is a continuation of the previous slideshow.

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model

Vertica Knowledge Base Article. Vertica QuickStart for Tableau

Informatica PowerExchange for Tableau User Guide

Multidimensional Grouping Made Easy

Microsoft Microsoft Excel 2013 Expert Part 1. Download Full Version :

HANA Performance. Efficient Speed and Scale-out for Real-time BI

The Hadoop Paradigm & the Need for Dataset Management

Anaplan Snapshotting Learning App User Guide

Data Warehouse and Data Mining

Analytics Open Labs: Introduction to Tableau at the University of Kentucky

Decision Support Systems aka Analytical Systems

Rockwell Automation ODVA Annual Meeting

Sections in this manual

Table of contents. 01 Adding a parameters sheet to the Report Designer How to perform a check test on a database 8

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data

Sage 500 ERP Business Intelligence

The Kanban Course. Facilitator s Guide

Parts Updater. Last Update: August 10, (440)

US FOODS ONLINE 2015: Overview

The WideRuled Story Generator Tutorial Alex Mitchell Communications and New Media Programme National University of Singapore

FAQ: Database Development and Management

using cells to create dynamic formulas

FAQ: Advanced Functions

Transcription:

Guided Analytics with large Detail Data When we think of guided analytics, we need to think of it as two pieces of the same puzzle. 1. What is it we want the user to be able to accomplish? 2. How do we create an infrastructure to get him there? The first piece deals with the business problem being addressed, and, is not the focus of this paper. The second piece deals with the technical structure of the solution. This paper attempts to detail one way of drilling from one set of data, to another, in an efficient manner. Done properly, it is possible to walk through a logical process, to drill into massive amounts of data, without ever having to scan a large table. A Tableau Packaged workbook (Guided_Analytics_For_Large_Data.twbx) is part of this document package. For this example, we ll use the well known Tableau Super Store Sales data, and, we ll set up a dashboard that facilitates drilling from Region/Quarter, to State/Month, to Zip Code/Day, to individual order line. Please use your imagination, pretending that the total detail data in hundreds of millions of rows Note: This is just a very basic view of the total Guided Analytic process. A full dissertation would read like War And Peace! The concept: In a real world, IT created, data warehouse, the best performance solutions come from pre computed aggregate tables, each structured to allow for drilling from small tables, into slightly larger ones, in a repetitive process, until we end up at the largest table, looking at a very small subset of the total data. Use Tableau s Data Extract capability to build the individual aggregate tables needed. Build views from each of the individual Data Extracts.

Use Tableau Actions to link from each Data extract TO the next larger Data Extract. Since we are dealing with LARGE data sets, processes should be implemented to limit the amount of data being pulled to the user view. 1. Actions are Single Select ONLY. 2. Clearing the selection should show no data. The structure looks like this: File sizes for the raw data, and, the Tableau Data Extracts used in this effort; The level 1 TDE 46k The level 2 TDE 63K The level 3 TDE 244k The level 4 TDE 543k The initial raw data 1,670k

Before looking at the individual steps, let s look at the final product: Ugly and boring but, visual best practices is not the intent of this paper.

Now, let s look at each of the four views, individually, commenting on the pieces needed for the drilling process. #1 - Region-Quarter Summary very small, and, very fast. Nothing special about this one #2 State-Month - A little larger, BUT, we are only accessing a subset of the data. Note the Level of Detail Shelf

#3 ZipCode Date. Note the Level of detail Shelf, again #4 Detail. From the prior view, we ll pass ZipCode and date each must be somewhere in the detail view.

Once these view are built, we create the Tableau Actions needed to do the linking. Remember the two key rules: Single Select AND No data on Clearing Selection! There are three actions required:

The first one drills from the Region-Quarter view TO the State-Month view:

The second one drills from the State-Month view TO the ZipCode-Day view:

The final one drills from the ZipCode-Day view TO the Detail view:

OTHER things to consider when creating this type of process: 1. While creating the various Data Extracts, he raw data should NOT be changing. Otherwise the user could be drilling into what appears to be incorrect data when one extract is created from data in one state, and, a different extract is created from data in another state. 2. Even though this example uses nothing but extracts, a real world process would, likely, do the final drill into the live data. Make sure that data is structure for the key information being passed. Example: in this example, if we were drilling into the live data for the Detail view, we d want to be certain the detail table had indexes on the date and the zip code columns. Otherwise, we d be doing a full table scan against that table! 3. An additional topic to consider is using MENU selections, instead of SELECT, for the Actions. Give the user choices in the drill process. The same concept (extracts linking to extracts) is followed. 4. As the extracts are built, be sure to select (as appropriate): The goal is to make each Data Extract as small and as pertinent as possible.