Business Intelligence. You can t manage what you can t measure. You can t measure what you can t describe. Ahsan Kabir

Similar documents
IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

Updating your Business Intelligence Skills to Microsoft SQL Server 2012

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Recently Updated Dumps from PassLeader with VCE and PDF (Question 1 - Question 15)

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A; 3 Days, Instructor-led

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

Guide Users along Information Pathways and Surf through the Data

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format.

After completing this course, participants will be able to:

6+ years of experience in IT Industry, in analysis, design & development of data warehouses using traditional BI and self-service BI.

OLAP Introduction and Overview

Data Strategies for Efficiency and Growth

Evolution of Database Systems

Advanced Solutions of Microsoft SharePoint Server 2013

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

The Data Organization

Chapter 4, Data Warehouse and OLAP Operations

Data Warehousing and OLAP

An Overview of Data Warehousing and OLAP Technology

What is a Data Warehouse?

CHAPTER 3 Implementation of Data warehouse in Data Mining

QA Microsoft Designing Business Intelligence Solutions with Microsoft SQL Server 2012

1Z0-526

BUSINESS INTELLIGENCE. SSAS - SQL Server Analysis Services. Business Informatics Degree

Data Mining Concepts & Techniques

Microsoft End to End Business Intelligence Boot Camp

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

XML: Changing the data warehouse

MS-55045: Microsoft End to End Business Intelligence Boot Camp

Dta Mining and Data Warehousing

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint 2013

Basics of Dimensional Modeling

Data Warehousing. Overview

Full file at

Decision Support Systems

Implementing a Data Warehouse with Microsoft SQL Server 2012

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

A Multi-Dimensional Data Model

Microsoft Implementing a SQL Data Warehouse

Exam /Course 20767B: Implementing a SQL Data Warehouse

Acknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process.

20767B: IMPLEMENTING A SQL DATA WAREHOUSE

Deccansoft Software Services. SSIS Syllabus

by Prentice Hall

Information Management course

Dr. Michael Curry. Oregon. The Big Picture: SQL Overview and Getting the Most from SQL Saturday

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Modern Data Warehouse The New Approach to Azure BI

Part I. Introduction. Chapter 1: Introduction to Data Warehousing and SQL Server 2008 Analysis Services

Modelling Data Warehouses with Multiversion and Temporal Functionality

Chapter 6 VIDEO CASES

DATA MINING TRANSACTION

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

20466C - Version: 1. Implementing Data Models and Reports with Microsoft SQL Server

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

Database Vs. Data Warehouse

Business Intelligence and Decision Support Systems

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Data Warehouse and Mining

Implementing a SQL Data Warehouse

DATA MINING AND WAREHOUSING

Question Bank. 4) It is the source of information later delivered to data marts.

Implementing a SQL Data Warehouse

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Training 24x7 DBA Support Staffing. MCSA:SQL 2016 Business Intelligence Development. Implementing an SQL Data Warehouse. (40 Hours) Exam

Venezuela: Teléfonos: / Colombia: Teléfonos:

Microsoft Power BI for O365

Fig 1.2: Relationship between DW, ODS and OLTP Systems

collection of data that is used primarily in organizational decision making.

6 SSIS Expressions SSIS Parameters Usage Control Flow Breakpoints Data Flow Data Viewers

Lecture 2 and 3 - Dimensional Modelling

Accurate study guides, High passing rate! Testhorse provides update free of charge in one year!

Knowledge Modelling and Management. Part B (9)

Teradata Aggregate Designer

Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D)

Data warehouse architecture consists of the following interconnected layers:

Advanced Data Management Technologies Written Exam

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Information Management course

Implement a Data Warehouse with Microsoft SQL Server

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile

Implementing a Data Warehouse with Microsoft SQL Server 2014

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

MICROSOFT EXAM QUESTIONS & ANSWERS

Sql Fact Constellation Schema In Data Warehouse With Example

PowerPivot, an Introduction. By: Steve Lewis Principal Pyxis Analytics

Implementing a Data Warehouse with Microsoft SQL Server

Data transfer, storage and analysis for data mart enlargement

Introduction to Data Warehousing

Transcription:

Business Intelligence You can t manage what you can t measure. You can t measure what you can t describe Ahsan Kabir

A broad category of applications and technologies for gathering, storing, analyzing, sharing and providing access to data to help enterprise users make better business decisions -Gartner

Why BI? Performance management Identify trends Cash flow trend Fine-tune operations Sales pipeline analysis Future projections business Forecasting Decision Making Tools Convert data into information

What happened? What is happening? Why did it happen? What will happen? What do I want to happen? Past Present Future ERP CRM SCM 3Pty Black books Data How to Think?

Major Players in BI Market

Improving organizations by providing business insights to all employees leading to better, faster, more relevant decisions Advanced Analytics Self Service Reporting End-User Analysis Business Performance Management Operational Applications Microsoft Business Intelligence Vision

Corporate BI Commonly design, implement and maintain data warehouses, data models and integrated reporting and analytics. It require significant time, expertise and money but total business is not covered. Self-service BI (SSBI) SSBI is to empower analysts so that they can design, customize and maintain their own BI solutions. SSBI is a combination of corporate BI and extensions to empower analysts to more fully exploit it. Managed BI Ensuring responsible BI by managing review, approve and audit solutions Data is delivered in a compliant, responsive and secure way and access permissions are enforced BI implementations

SharePoint Dashboards & Scorecards SharePoint Collaboration Excel Workbooks PowerPivot Applications Analysis Services (SSAS) Integration Services (SSIS) DQS Reporting Services (SSRS) Master Data Services (MDS) ERP/CRM DB Cloud Born Data Social Network Microsoft Business Intelligence Components

Step 1 : Business Analysis Step 2 : SSIS Different Source of Data (RDBMS, FTP, Web Services, XML, CSV, EXCEL, etc.) DQS (Data Quality Services) Integration, cleansing, profiling MDS (Master Data Service ) Centrally managing organizational master data ETL (Extraction, Transformation and Loading) framework Step 3 : SSAS Create an OLAP multi-dimensional structure making data available for analytics and reporting SSAS can pre-calculates, summarizes and stores the data in a highly compressed format Reporting is provided by data through SSAS cubes Step 4 : SSRS SSRS (SQL Server Reporting Services) allows creating formatted and interactive reports Step 5 : PowerPivot, Power View, Excel services provide rapid data exploration, visualization, and presentation experience for users. It allows users to interrogate the data from various aspects by using charts, graphs, drill-down paths etc. Excel and PowerPivot services can be used for deploying Excel or PowerPivot to SharePoint in order to make it available to other people, turning Personal BI into Organizational BI. Microsoft Business Intelligence Road Map

Data Warehouse was designed specifically to be a central repository for all data in a company disparate data from transactional systems

DW is a relational database that is designed for query and analysis Ship and integrate data from different sources to the analyst Contains data derived from transaction, internal-external data & archived data But it s not a copy of a source database Characteristics DW

High query performance Analysis queries place extra load on transactional systems Query optimization is hard to do well Queries not visible outside warehouse Local processing at sources unaffected Can operate when sources unavailable Can query data not stored in a DBMS Summarized and Extremal data at warehouse Advantages of Warehousing

Before enter into warehouse Data is processed (cleansed and transformed) Data Marts Users query the data warehouse Warehouse Data is kept in a specific business line wise. DW Architecture

Data Warehouse Corporate/Enterprise-wide Union of all data marts Organized on E-R model* Data Mart Departmental Single business process Star-join* DW vs. Data Mart

Transactional Databases 1. ER modeling is used 2. 3NF Normalized 3. Data is spited into tables 4. Hard to visualize 5. Slows down the response time of the query and report Warehouse Database 1. Dimensional modeling 2. De-normalized 3. Data is kept in fact and dimension 4. Flexible for user perspective 5. Response time and increases the performance Transactional Databases vs. Data warehouse

Warehouse WID (PK) Location Address district WU_Code Client_Information CID (PK) Name Address Credit_Limit User_Profile UId (PK) Name Address Email CellNo Requisition RID(PK) CID (FK) WID (FK) UID (FK) Requestion_Date Product_Profile PID (PK) description brand category Requisition_Details RID (PK) RDD (FK) PID (FK) promotion_key (FK) dollars_sold units_sold dollars_cost Entity Relation Diagram 16

TIME time_key (PK) SQL_date day_of_week month STORE store_key (PK) store_id store_name address district floor_type CLERK clerk_key (PK) clerk_id clerk_name clerk_grade Sales - FACT time_key (FK) store_key (FK) clerk_key (FK) product_key (FK) customer_key (FK) promotion_key (FK) dollars_sold units_sold dollars_cost PRODUCT product_key (PK) SKU description brand category CUSTOMER customer_key (PK) customer_name purchase_profile credit_profile Address City country PROMOTION promotion_key (PK) promotion_name price_type ad_type DIMENSONAL MODEL 17

Data Warehouse Federated Database Extraction Query Rewritten Queries Query Answer Warehouse Answer Mediator Data warehouse Create a copy of all the data and Execute queries against the copy Federated database Pull data from source systems as needed to answer queries Source Systems Federated Databases vs. Data warehouse

Before After Name Address City House No DoB State Country Ahsan CDA Avenue CTG 181/1 05/11/1978 BD Kabir RB Avn CTG 41/6 23/04/1991 DHK Bangladesh Name Address City House No Data Quality problems DoB State Country Ahsan CDA Avenue CTG 181/1 05/11/1978 CT Bangladesh Kabir RB Avenue DHK 41/6 23/04/1991 DHK Bangladesh Indication : Completeness Accuracy Conformity Consistency

Data Quality Issue Sample Data Problem Standard Are data elements consistently defined and understood? Gender code = M, F, U in one system and Gender code = 0, 1, 2 in another system Complete Is all necessary data present? 20% of customers last name is blank, 50% of zip-codes are 99999 Accurate Valid Does the data accurately represent reality or a verifiable source? Do data values fall within acceptable ranges? A Supplier is listed as Active but went out of business six years ago Salary values should be between 60,000-120,000 Unique Data appears several times Both John Ryan and Jack Ryan appear in the system are they the same person? Data Quality Issues

Data Quality Services (DQS) is a Knowledge-Driven data quality solution, enabling to easily improve the quality of their data Data Quality Services (DQS)

Simplicity Users should understand the design Data model should match users conceptual model Queries should be easy and intuitive to write Expressiveness Include enough information to answer all important queries Include all relevant data (without irrelevant data) Performance An efficient physical design should be possible DW Design Consideration

DW consists of Fact tables and dimensions. The relationship between a Fact table and dimensions are based on the foreign key and primary key. Facts are numeric measurements or additive value that represent a specific business aspect or activity. Examples : Unit Cost, Sale Amount, Quantity Sold Salary Amount Purchase amount Dimension has a primary key, which is called the surrogate key. The primary key of the source system will be stored in the dimension table as the business key Dimension tables are tables that contain descriptive information. Dimension table contains a list of columns Example : Incase of Product Product Name Origin Category Manufacturer Date Sales Date The Fact table is a table with foreign keys pointing to surrogate keys of the dimension tables Component of Data Warehousing

TIME time_key (PK) SQL_date day_of_week month STORE store_key (PK) store_id store_name address district floor_type Sales - FACT time_key (FK) store_key (FK) clerk_key (FK) product_key (FK) customer_key (FK) promotion_key (FK) dollars_sold units_sold dollars_cost PRODUCT product_key (PK) SKU description brand category CUSTOMER customer_key (PK) customer_name purchase_profile credit_profile Address City country CLERK clerk_key (PK) clerk_id clerk_name clerk_grade PROMOTION promotion_key (PK) promotion_name price_type ad_type Dimensional Modeling 24

The diagram resembles a star Center of the star consists of one fact table Points of the star are the dimension tables Optimizes performance by keeping queries simple and Providing fast response time Star schema 25

Fact table Date Promotion ONE fact table Sales 4 dimension tables Product Dimension tables Store Star Schema for the retailer s DW 26

TIME time_key (PK) SQL_date day_of_week month STORE store_key (PK) store_id store_name address district floor_type CLERK clerk_key (PK) clerk_id clerk_name clerk_grade Sales - FACT time_key (FK) store_key (FK) clerk_key (FK) product_key (FK) customer_key (FK) promotion_key (FK) dollars_sold units_sold dollars_cost PRODUCT product_key (PK) SKU description brand category CUSTOMER customer_key (PK) customer_name purchase_profile credit_profile Address City country PROMOTION promotion_key (PK) promotion_name price_type ad_type DIMENSONAL MODEL

Simplicity Users should understand the design Data model should match users conceptual model Queries should be easy and intuitive to write Expressiveness Include enough information to answer all important queries Include all relevant data (without irrelevant data) Performance An efficient physical design should be possible Goals for Logical Design 28

Step 1 : Identify business subjects and fields of information of relevant subjects Step 2 : Discover entities and attributes and relationships Step 3 : Identify which information belongs to a central fact table Step 4 : Which information belongs to its associated dimension tables Step 5 : Identify cleansing points Step 6 : Which data need to mange centrally Step 7 : Define surrogate key and business key Step 8 : Make ETL Package Step 9 : Organize data structures on disk Steps of DW Implementation 29

Thanks