Data sources. Gartner, The State of Data Warehousing in 2012

Similar documents
Data sources. Gartner, The State of Data Warehousing in 2012

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD

Oskari Heikkinen. New capabilities of Azure Data Factory v2

BIG DATA COURSE CONTENT

Alexander Klein. #SQLSatDenmark. ETL meets Azure

Microsoft Analytics Platform System (APS)

Data Architectures in Azure for Analytics & Big Data

Duration: 5 Days. EZY Intellect Pte. Ltd.,

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Microsoft Developer Day

Modern Data Warehouse The New Approach to Azure BI

Stages of Data Processing

Azure Data Factory. Data Integration in the Cloud

Ian Choy. Technology Solutions Professional

Overview of Data Services and Streaming Data Solution with Azure

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

Updating your Business Intelligence Skills to Microsoft SQL Server 2012

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

What is Gluent? The Gluent Data Platform

Ayush Ganeriwal Senior Principal Product Manager, Oracle. Benjamin Perez-Goytia Principal Solution Architect A-Team, Oracle

Alexander Klein. ETL in the Cloud

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Implementing a SQL Data Warehouse

Implementing a SQL Data Warehouse

microsoft

Transitioning From SSIS to Azure Data Factory. Meagan Longoria, Solution Architect, BlueGranite

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

White Paper / Azure Data Platform: Ingest

Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A; 3 Days, Instructor-led

Microsoft Implementing a SQL Data Warehouse

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.

Welcome! Power BI User Group (PUG) Copenhagen

Exam /Course 20767B: Implementing a SQL Data Warehouse

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

28 February 1 March 2018, Trafo Baden. #techsummitch

Microsoft Exam

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks

HDInsight > Hadoop. October 12, 2017

@Pentaho #BigDataWebSeries

Processing Unstructured Data. Dinesh Priyankara Founder/Principal Architect dinesql Pvt Ltd.

Implementing a SQL Data Warehouse (20767)

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

Things I Learned The Hard Way About Azure Data Platform Services So You Don t Have To -Meagan Longoria

20767B: IMPLEMENTING A SQL DATA WAREHOUSE

Přehled novinek v SQL Server 2016

Modern ETL Tools for Cloud and Big Data. Ken Beutler, Principal Product Manager, Progress Michael Rainey, Technical Advisor, Gluent Inc.

20767: Implementing a SQL Data Warehouse

17/05/2017. What we ll cover. Who is Greg? Why PaaS and SaaS? What we re not discussing: IaaS

Cortana Intelligence Suite; Where the Magic Happens

Compact Solutions Connector FAQ

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Modernizing Business Intelligence and Analytics

Implementing a SQL Data Warehouse

Streaming Integration and Intelligence For Automating Time Sensitive Events

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST

Developing in Power BI. with Streaming Datasets and Real-time Dashboards

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP

ETL Best Practices and Techniques. Marc Beacom, Managing Partner, Datalere

Azure SQL Data Warehouse. Andrija Marcic Microsoft

IT directors, CIO s, IT Managers, BI Managers, data warehousing professionals, data scientists, enterprise architects, data architects

VOLTDB + HP VERTICA. page

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D)

BI ENVIRONMENT PLANNING GUIDE

Introduction to SSIS. Or you want to take some data, change it, and put it somewhere else? Then boy do I have THE tool for you!

The Cortana Intelligence Suite

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014

Franck Mercier. Technical Solution Professional Data + AI Azure Databricks

Training 24x7 DBA Support Staffing. MCSA:SQL 2016 Business Intelligence Development. Implementing an SQL Data Warehouse. (40 Hours) Exam

Microsoft. Perform Data Engineering on Microsoft Azure HDInsight Version: Demo. Web: [ Total Questions: 10]

Microsoft Perform Data Engineering on Microsoft Azure HDInsight.

Index. Scott Klein 2017 S. Klein, IoT Solutions in Microsoft s Azure IoT Suite, DOI /

Capture Business Opportunities from Systems of Record and Systems of Innovation

exam. Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Version 1.0

Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud

Implementing a Data Warehouse with Microsoft SQL Server 2012

Neues Dream Team Azure Data Factory v2 und SSIS

Migrating Enterprise BI to Azure

Open Source Tools as a platform for research on Microsoft Azure

##SQLSatMadrid. Project [Vélib by Cortana]

Azure Learning Circles

Data Warehouse Design Decisions

SQL Server Pre Lanzamiento. Federico Marty. Mariano Kovo. Especialista en Plataforma de Aplicaciones Microsoft Argentina & Uruguay

NYC Cloud Machine Learning Meetup. Introduction to Cortana Analytics

Activator Library. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success.

Accelerate Your Data Pipeline for Data Lake, Streaming and Cloud Architectures

Data-Intensive Distributed Computing

WHITEPAPER. MemSQL Enterprise Feature List

Implementing a Data Warehouse with Microsoft SQL Server 2014

The Enterprise Data Marketplace

Implementing a Data Warehouse with Microsoft SQL Server 2012

Microsoft Big Data and Hadoop

Understanding the latent value in all content

Evolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo

Cloud has become the New Normal

DURATION : 03 DAYS. same along with BI tools.

Bridge the cloud divide with hybrid business intelligence in SharePoint 2016 and Office 365

MCSA SQL SERVER 2012

Transcription:

data warehousing has reached the most significant tipping point since its inception. The biggest, possibly most elaborate data management system in IT is changing. Gartner, The State of Data Warehousing in 2012 Data sources

5 2 Real-time data 1 Increasing data volumes New data sources & types Data sources Non-Relational Data 3 4 Cloud-born data

Extract Transform Load Original Data ETL Tool (SSIS, etc) Transformed Data EDW (SQL Svr, Teradata, etc) BI Tools Data Marts Data Lake(s) Dashboards Apps

Extract Transform Load Original Data ETL Tool (SSIS, etc) Transformed Data EDW (SQL Svr, Teradata, etc) BI Tools Data Marts Data Lake(s) Dashboards Ingest (EL) Original Data Apps

Extract Transform Load Original Data ETL Tool (SSIS, etc) Transformed Data EDW (SQL Svr, Teradata, etc) BI Tools Data Marts Data Lake(s) Ingest (EL) Original Data Scale-out Storage & Compute (HDFS, Blob Storage, etc) Dashboards Apps Streaming data Transform & Load

Extract Transform Load Original Data ETL Tool (SSIS, etc) Transformed Data EDW (SQL Svr, Teradata, etc) BI Tools Data Marts Data Lake(s) Ingest (EL) Original Data Scale-out Storage & Compute (HDFS, Blob Storage, etc) Dashboards Apps Streaming data Transform & Load

Data Sources (Import From) Ingest Data Hub (Storage & Compute) BI Tools Data Marts Move data among Hubs Data Lake(s) Data Sources (Import From) Ingest Data Hub (Storage & Compute) Move to data mart, etc Dashboards Apps Information Production: Connect & Collect Transform & Enrich Publish

Data Sources (Import From) Data Connector: Import from source to Hub Data Hub (Storage & Compute) Coordination & Scheduling Monitoring & Mgmt Data Lineage BI Tools Data Connector: Import/Export among Hubs Data Marts Data Lake(s) Data Sources (Import From) Data Connector: Import from source to Hub Data Hub (Storage & Compute) Data Connector: Export from Hub to data store Dashboards Apps Information Production: Connect & Collect Transform & Enrich Publish

Example Scenario: Data warehouse sales to Azure pipeline

Raw sales (Custom view on top of DW tables) Sales by category by day Hive processing Qty Unit OrderDate Company Category Sales Order Ordered Price 6/1/2004 Action Bicycle Specialists Accessories 1716 22.0393SO71784 6/1/2004 Action Bicycle Specialists Bikes 2288 864.0452SO71784 6/1/2004 Action Bicycle Specialists Clothing 2340 26.8155SO71784 6/1/2004 Action Bicycle Specialists Components 598 329.8538SO71784 6/1/2004 Aerobic Exercise Company Components 338 133.8744SO71915 6/1/2004Action Bicycle Specialists Accessories 910 25.1057SO71938

Data Factory Walkthrough

New-AzureDataFactory -Name DW-Demo2 -Location West-US New-AzureDataFactory -Name HaloTelemetry -Location West-US

Azure Data Factory New User View On Premises SQL Server Azure Blob Storage

View Of Azure Data Factory New Sales Aggregated sales AdventureWorksLTDW2014 On Premises SQL Server Azure Blob Storage

View Of View Of Azure Data Factory Pipeline New Sales Copy NewSales to Blob Storage Cloud New Sales New User Activity New User View On Premises SQL Server Azure Blob Storage

View Of Azure Data Factory Pipeline New Sales Copy New Sales to Blob Storage Cloud New Sales Pipeline OnPrem SSIS package Aggregated Sales Cloud New Sales Aggregate AggregatedSales HDInsight New User View On Premises SQL Server Azure Blob Storage

"availability": { "frequency": "Day", interval": 6 } Activity: (e.g. Hive): Hourly 12-6 6-12 12-6 AggregatesSales

Hourly Sales From DW 12-1 1-2 2-3 Daily Monday Daily Sales Dataset3 Hive Activity Tuesday Daily other source Dataset2 Wednesday Monday Tuesday Wednesday

Is my data successfully getting produced? Is it produced on time? Am I alerted quickly of failures? What about troubleshooting information? Are there any policy warnings or errors?

Easily move data to my existing data marts for consumption by my existing BI tools Azure DB SQL Server on premises Oracle Files Azure Blob content

Coordination: Rich scheduling Complex dependencies Incremental rerun Authoring: JSON & Powershell/C# Management: Lineage Data production policies (late data, rerun, latency, etc) Hub: Azure Hub (HDInsight + Blob storage) Activities: Hive, Pig, C# Data Connectors: Blobs, Tables, Azure DB, On Prem SQL Server, Oracle

Contact me: ChristianCote@IA-TechConsulting.com

http://channel9.msdn.com/events/teched www.microsoft.com/learning http://microsoft.com/technet http://developer.microsoft.com