Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

Size: px
Start display at page:

Download "Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22"

Transcription

1 Take P, R or U and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

2 Oliver Engels CEO, oh22data Datamonster from Germany MS Data Platform MVP President of PASS Germany e.v. Regional Mentor Germany SQL Saturday / SQL Konferenz Organizer Chapter Leader Frankfurt Microsoft ptsp Tillmann Eitelberg CEO oh22information Datamonster from Germany MS Data Platform MVP Vice-President of PASS Germany e.v. Regional Mentor Germany SQL Saturday / SQL Konferenz Organizer Chapter Leader Rheinland

3 Our topic today: OMG! BAD DATA!

4

5 What is Data Quality? Data Quality represents the degree to which the data is suitable for business usages Data Quality is built through: People + Technology + Processes Bad Bata = Bad Business

6 Cost of Data Quality The costs of bad data quality are most of the time hidden or implicite accepted Data quality cost Costs because of bad data quality Cost of optimizing data quality Direct Indirect Prevention Discovery Cleansing

7 Cost of Data Quality Cost Costs due to bad data quality Costs to avoid or correct bad data quality 80% Quality 100%

8 Reasons for bad data quality Data Capturing Expiration Usage Data Quality Processes Architecture Definition

9 Workflow in Data Quality Optimization Validate & Profile

10 DQ in Modern Data Platform landscapes Cloud born data Hybrid & Hyperscale Data Platform Landscape External Data File, WebServices (e.g. Market Research) IaaS Data (e.g. Hosted Custom Application) Presentation Layer Visualization Layer Visual Output PaaS Data (e.g. Custom Application) Governance Layer SaaS Data (e.g. CRM, Logistics) Analytical Layer Analytical Output Streaming Data (e.g. Sensor, Devices) Unstructured / Semi-structured (e.g. Video, Image) Storage Layer Ingest Layer Integration Layer Raw Output On Prem born data Legacy systems (e.g. ERP, CRM) Relational and Multi Dimensional Data (e.g. Custom App) Document based data (e.g. XLS, DOC, PPT, PDF, CSV) Document based data (e.g. XLS, DOC, PPT, PDF, CSV)

11 DQ in Modern Data Platform landscapes # Layer Function 1 Ingest Ingestion of data to a centralized storage environment (ETL, ELT, Raw data pump, Replication) 2 Storage Data Lake with unlimited data capacity and storage of structured, semi-structured and unstructured data 3 Integration Integration of Meta-, Master data as well as Data Quality, Profiling and Lineage 4 Analytical Relational, Multidimensional, Predictive, Stream analytical compute systems with distributed or in-memory capabilities 5 Governance Data Governance and Data Catalog incl. KPI portfolio, Corporate Identify, Security 6 Visualization Explorational data discovery and standard reporting 7 Presentation Any device & multi platform presentation layer

12 DQ in Modern Data Platform landscapes Portal Integration App Integration Search Integration Mobile Client Broadcast and Subscriber Pixel Perfect Visualization Explorative Visualization Statistic Visualization (R) Spatial Visualization Ad-hoc Functional Mapping Data Governance Record Management Performance Management Compliance Visual Master Data Text and Media Analytics Cognitive Analytics Predictive Analytics Azure Stream Analytics Transactional Analytics Multidimensional Analytics Statistic Stream Analytics Federated Search Master Data Management Meta Data Data Lineage Data Profiling Relational Storage Semi Structured Data Storage Unstructured Data Storage Data Lake ETL and ELT Replication and Sync Raw Data Pump

13 MS Mapping On Prem Cloud (Hyperscale) SharePoint SSRS PBI Server Power BI Mobile Client Office 365 SharePoint Azure Apps R-Visual Studio or R-Studio Power BI Desktop (+R) SQL Server Reporting Services (+R) Excel Power BI Service Power BI Embedded SQL Server Master Data Services SQL Server Data Quality Services Azure Data Catalog Power BI Content Packs Power Apps Common Data Services SQL Server Polybase Engine SQL Server ML Engine Azure Machine Learning Azure HDInsight Spark/Hadoop Azure Stream Analytics / Storm SQL Server Operational Analytics Engine SQL Server Multidimensional Tabular Engine Azure SQL Database & DWH Azure Analysis Service Azure Data Lake Analytics (U-SQL) SQL Server Master Data Services SQL Server Data Quality Services Azure Data Catalog Power Apps Common Data Services SQL Server Relational Engine File Storage Azure SQL DB & Data Warehouse Azure Data Lake Azure BLOB Storage SQL Server Integration Services Azure Data Factory Azure Logic Apps

14 Hmm! Data Quality Services (DQS) Microsofts Solution for Data Quality

15 DQS (Data Quality Services) Great idea to integrate DQ functions in the MS Data Platform BUT: Since 2012 no real further development Product still a version 1.0 with small depth in functionality, performance and UI issues No roadmap available Usable for small projects, but not for bigger things! Any help?

16 DQ Solutions in the MS Data Platform In this session we will introduce you to three magic letters of scripting languages, you can use for Data Quality in the MS Data Platform: P (Python), R and U (U-SQL) can be greatly leveraged to help you with your DQ problems and we show you in our samples how

17 Script languages we use for DQ Python is a widely used high-level programming interpreted language for general-purpose programming, It emphasizes code readability and implementation in small and large scale applications R is an open source programming interpreted language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners for developing statistical software and data analysis U-SQL is a query language designed for Azure Data Lake. It provides a way to mingle SQL keywords with syntactic C# expressions, so that within a single script, a programmer can schematize the data from an unstructured source, use SQL to aggregate the data into the desired form (Wikipedia) (Stackoverflow)

18 Applying P, R & U to the DQ workflow You will see the following samples as a subset of possibilities with these languages DQ Scoring DQ KB DQ Dedub

19 Using R as DQ Tool Using R Libraries Develop in Visual Studio Operationalize in SQL Server and Spark Demo: Profile & Scoring Analysis with R

20 DEMO R for Data Quality

21 Recap

22 Using P as DQ Tool Using Python Modules Develop in Visual Studio / Rodeo / Jupyter Operationalize in SQL Server and Spark Demo: Knowledge Base Techniques with P Deduplication Algorithms used in P

23 Knowledge Base Matches > 90 % Cleaned Data Source Data Knowledge Base (Synonyms) Matches > 80 % Knowledge Base (e.g. MDS, SharePoint) Data Steward Manual Validation

24 DEMO P for Data Quality

25 Using ADLA (U-SQL) as DQ Tool Using U-SQL with C# Assemblies Develop in Visual Studio Operationalize in Azure Data Analytics Demo: Deduplication with U-SQL Data Enrichment through Federation

26 Using ADLA (U-SQL) as DQ Tool Duplicates classification Is Duplicate Non-Duplicate Found as Duplicate Not found as Duplicate

27 DEMO U for Data Quality

28 Data Quality Dashboard with PBI on ADL

29 DQ with P, R and U > Findings Advantages Disadvantages Great variety of libraries and modules for DQ in P and R, good extensibility of U through assemblies Flexibility to adjust to your companies DQ problems A lot of possibilities to integrate into current data landscapes on prem and in the cloud Ability to scale Not for non technical users No GUI available Not a service Framework with different libraries and versions difficult to know what is useful

30 Enjoy the SQL Server Konferenz!

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality? Oliver Engels & Tillmann Eitelberg Big Data! Big Quality? Like to visit Germany? PASS Camp 2017 Main Camp 5.12 7.12.2017 (4.12 Kick Off Evening) Lufthansa Training & Conference Center, Seeheim SQL Konferenz

More information

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality? Oliver Engels & Tillmann Eitelberg Big Data! Big Quality? Sponsors help us to run this event! THX! You Rock! Sponsor Gold Sponsor Silver Sponsor Bronze Sponsor You Rock! Sponsor Session 13:45 Track 1 Das

More information

17/05/2017. What we ll cover. Who is Greg? Why PaaS and SaaS? What we re not discussing: IaaS

17/05/2017. What we ll cover. Who is Greg? Why PaaS and SaaS? What we re not discussing: IaaS What are all those Azure* and Power* services and why do I want them? Dr Greg Low SQL Down Under greg@sqldownunder.com Who is Greg? CEO and Principal Mentor at SDU Data Platform MVP Microsoft Regional

More information

Přehled novinek v SQL Server 2016

Přehled novinek v SQL Server 2016 Přehled novinek v SQL Server 2016 Martin Rys, BI Competency Leader martin.rys@adastragrp.com https://www.linkedin.com/in/martinrys 20.4.2016 1 BI Competency development 2 Trends, modern data warehousing

More information

BIG DATA COURSE CONTENT

BIG DATA COURSE CONTENT BIG DATA COURSE CONTENT [I] Get Started with Big Data Microsoft Professional Orientation: Big Data Duration: 12 hrs Course Content: Introduction Course Introduction Data Fundamentals Introduction to Data

More information

Data Architectures in Azure for Analytics & Big Data

Data Architectures in Azure for Analytics & Big Data Data Architectures in for Analytics & Big Data October 20, 2018 Melissa Coates Solution Architect, BlueGranite Microsoft Data Platform MVP Blog: www.sqlchick.com Twitter: @sqlchick Data Architecture A

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014 ABOUT THIS COURSE This five-day instructor-led course teaches students how to use the enhancements and new features that have been added to SQL Server and the Microsoft data platform since the release

More information

"Charting the Course... MOC B Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Summary

Charting the Course... MOC B Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Summary Course Summary Description This five-day instructor-led course teaches students how to use the enhancements and new features that have been added to SQL Server and the Microsoft data platform since the

More information

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp. Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020

More information

Overview of Data Services and Streaming Data Solution with Azure

Overview of Data Services and Streaming Data Solution with Azure Overview of Data Services and Streaming Data Solution with Azure Tara Mason Senior Consultant tmason@impactmakers.com Platform as a Service Offerings SQL Server On Premises vs. Azure SQL Server SQL Server

More information

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER ABOUT THIS COURSE The focus of this five-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement multidimensional and tabular data models, deliver reports

More information

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data

More information

White Paper / Azure Data Platform: Ingest

White Paper / Azure Data Platform: Ingest White Paper / Azure Data Platform: Ingest Contents White Paper / Azure Data Platform: Ingest... 1 Versioning... 2 Meta Data... 2 Foreword... 3 Prerequisites... 3 Azure Data Platform... 4 Flowchart Guidance...

More information

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize Preparation Modeling Ingest Transform Cleanse Denormalize Profile Explore Visualize Feature & Algorithm Selection Model Testing & Validation Operationalization Models Visualizations Deploy Apps, Services

More information

WHAT S NEW IN SQL SERVER 2016 REPORTING SERVICES?

WHAT S NEW IN SQL SERVER 2016 REPORTING SERVICES? WHAT S NEW IN SQL SERVER 2016 REPORTING SERVICES? Timothy P. McAliley CISA, CISM, CISSP, ITIL V3, MCSA, MCSE, MCT, PMP Microsoft Account Technology Strategist Try It Yourself! Two TechNet Virtual Labs

More information

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD Azure Data Factory VS. SSIS Reza Rad, Consultant, RADACAD 2 Please silence cell phones Explore Everything PASS Has to Offer FREE ONLINE WEBINAR EVENTS FREE 1-DAY LOCAL TRAINING EVENTS VOLUNTEERING OPPORTUNITIES

More information

Data sources. Gartner, The State of Data Warehousing in 2012

Data sources. Gartner, The State of Data Warehousing in 2012 data warehousing has reached the most significant tipping point since its inception. The biggest, possibly most elaborate data management system in IT is changing. Gartner, The State of Data Warehousing

More information

R Language for the SQL Server DBA

R Language for the SQL Server DBA R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz Thank You microsoft.com idera.com attunity.com

More information

Microsoft Power BI for O365

Microsoft Power BI for O365 Microsoft Power BI for O365 Next hour.. o o o o o o o o Power BI for O365 Data Discovery Data Analysis Data Visualization & Power Maps Natural Language Search (Q&A) Power BI Site Data Management Self Service

More information

Microsoft Developer Day

Microsoft Developer Day Microsoft Developer Day Pradeep Menon Microsoft Developer Day Solutions Architect Agenda Microsoft Developer Day Traditional Business Intelligence Architecture Structured Sources Extract Transform Structurize

More information

Alexander Klein. #SQLSatDenmark. ETL meets Azure

Alexander Klein. #SQLSatDenmark. ETL meets Azure Alexander Klein ETL meets Azure BIG Thanks to SQLSat Denmark sponsors Save the date for exiting upcoming events PASS Camp 2017 Main Camp 05.12. 07.12.2017 (04.12. Kick-Off abends) Lufthansa Training &

More information

Ian Choy. Technology Solutions Professional

Ian Choy. Technology Solutions Professional Ian Choy Technology Solutions Professional XML KPIs SQL Server 2000 Management Studio Mirroring SQL Server 2005 Compression Policy-Based Mgmt Programmability SQL Server 2008 PowerPivot SharePoint Integration

More information

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. 17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations

More information

Migrating Enterprise BI to Azure

Migrating Enterprise BI to Azure Migrating Enterprise BI to Azure Best Practices Wlodek Bielski SQLSat Kyiv Team Yevhen Nedashkivskyi Mykola Pobyivovk Denis Reznik Eugene Polonichko Oksana Borysenko Oksana Tkach Sponsors Session will

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Cortana Intelligence Suite; Where the Magic Happens

Cortana Intelligence Suite; Where the Magic Happens Cortana Intelligence Suite; Where the Magic Happens Reza Rad, Leila Etaati #509 Brisbane 2016 About Us Reza Rad Leila Etaati MVP BI Consultant and Trainer Author of Books Speaker in conferences; PASS Summit,

More information

Microsoft vision for a new era

Microsoft vision for a new era Microsoft vision for a new era United platform for the modern service provider MICROSOFT AZURE CUSTOMER DATACENTER CONSISTENT PLATFORM SERVICE PROVIDER Enterprise-grade Global reach, scale, and security

More information

Data sources. Gartner, The State of Data Warehousing in 2012

Data sources. Gartner, The State of Data Warehousing in 2012 data warehousing has reached the most significant tipping point since its inception. The biggest, possibly most elaborate data management system in IT is changing. Gartner, The State of Data Warehousing

More information

BI ENVIRONMENT PLANNING GUIDE

BI ENVIRONMENT PLANNING GUIDE BI ENVIRONMENT PLANNING GUIDE Business Intelligence can involve a number of technologies and foster many opportunities for improving your business. This document serves as a guideline for planning strategies

More information

HDInsight > Hadoop. October 12, 2017

HDInsight > Hadoop. October 12, 2017 HDInsight > Hadoop October 12, 2017 2 Introduction Mark Hudson >20 years mixing technology with data >10 years with CapTech Microsoft Certified IT Professional Business Intelligence Member of the Richmond

More information

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions

More information

From the Source to the Dashboard: SAP Agile Data Warehousing for Self-Service BI

From the Source to the Dashboard: SAP Agile Data Warehousing for Self-Service BI From the Source to the Dashboard: SAP Agile Data Warehousing for Self-Service BI Michael D Rutland, Sr SE, SAP / @TDWI, 9 October 2017, Savannah Disclaimer The information in this presentation is confidential

More information

microsoft

microsoft 70-775.microsoft Number: 70-775 Passing Score: 800 Time Limit: 120 min Exam A QUESTION 1 Note: This question is part of a series of questions that present the same scenario. Each question in the series

More information

Azure Data Lake Analytics Introduction for SQL Family. Julie

Azure Data Lake Analytics Introduction for SQL Family. Julie Azure Data Lake Analytics Introduction for SQL Family Julie Koesmarno @MsSQLGirl www.mssqlgirl.com jukoesma@microsoft.com What we have is a data glut Vernor Vinge (Emeritus Professor of Mathematics at

More information

Oracle Big Data Discovery

Oracle Big Data Discovery Oracle Big Data Discovery Turning Data into Business Value Harald Erb Oracle Business Analytics & Big Data 1 Safe Harbor Statement The following is intended to outline our general product direction. It

More information

SQL Server 2019 Big Data Clusters

SQL Server 2019 Big Data Clusters SQL Server 2019 Big Data Clusters Ben Weissman @bweissman > SOLISYON GMBH > FÜRTHER STRAßE 212 > 90429 NÜRNBERG > +49 911 990077 20 Who am I? Ben Weissman @bweissman b.weissman@solisyon.de http://biml-blog.de/

More information

Swimming in the Data Lake. Presented by Warner Chaves Moderated by Sander Stad

Swimming in the Data Lake. Presented by Warner Chaves Moderated by Sander Stad Swimming in the Data Lake Presented by Warner Chaves Moderated by Sander Stad Thank You microsoft.com hortonworks.com aws.amazon.com red-gate.com Empower users with new insights through familiar tools

More information

Welcome! Power BI User Group (PUG) Copenhagen

Welcome! Power BI User Group (PUG) Copenhagen Welcome! Power BI User Group (PUG) Copenhagen Connect to Data in Power BI Desktop Just Thorning Blindbæk Consultant, Trainer and Speaker Connect to Data in Power BI Desktop Basic introduction to data connectivity

More information

Microsoft certified solutions associate

Microsoft certified solutions associate Microsoft certified solutions associate MCSA: BI Reporting This certification demonstrates your expertise in analyzing data with both Power BI and Excel. Exam 70-778/Course 20778 Analyzing and Visualizing

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

ETL is No Longer King, Long Live SDD

ETL is No Longer King, Long Live SDD ETL is No Longer King, Long Live SDD How to Close the Loop from Discovery to Information () to Insights (Analytics) to Outcomes (Business Processes) A presentation by Brian McCalley of DXC Technology,

More information

Bridge the cloud divide with hybrid business intelligence in SharePoint 2016 and Office 365

Bridge the cloud divide with hybrid business intelligence in SharePoint 2016 and Office 365 Bridge the cloud divide with hybrid business intelligence in SharePoint 2016 and Office 365 Chris McNulty, Microsoft John White, MVP John White CTO/Co-Founder of UnlimitedViz MSc. MVP SQL Server v-ts

More information

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics Optimizing and Modeling SAP Business Analytics for SAP HANA Iver van de Zand, Business Analytics Early data warehouse projects LIMITATIONS ISSUES RAISED Data driven by acquisition, not architecture Too

More information

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024

SQL Server SQL Server 2008 and 2008 R2. SQL Server SQL Server 2014 Currently supporting all versions July 9, 2019 July 9, 2024 Current support level End Mainstream End Extended SQL Server 2005 SQL Server 2008 and 2008 R2 SQL Server 2012 SQL Server 2005 SP4 is in extended support, which ends on April 12, 2016 SQL Server 2008 and

More information

SQL Server 2017 Power your entire data estate from on-premises to cloud

SQL Server 2017 Power your entire data estate from on-premises to cloud SQL Server 2017 Power your entire data estate from on-premises to cloud PREMIER SPONSOR GOLD SPONSORS SILVER SPONSORS BRONZE SPONSORS SUPPORTERS Vulnerabilities (2010-2016) Power your entire data estate

More information

The Cortana Intelligence Suite

The Cortana Intelligence Suite Slide 1 The Cortana Intelligence Suite Foundations Data Discovery and Ingestion Microsoft Machine Learning and Data Science Team CortanaIntelligence.com Main page: http://cortanaanalytics.com To begin

More information

Azure Data Lake Store

Azure Data Lake Store Azure Data Lake Store Analytics 101 Kenneth M. Nielsen Data Solution Architect, MIcrosoft Our Sponsors About me Kenneth M. Nielsen Worked with SQL Server since 1999 Data Solution Architect at Microsoft

More information

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a Data Warehouse with Microsoft SQL Server 2012 Implementing a Data Warehouse with Microsoft SQL Server 2012 Course 10777A 5 Days Instructor-led, Hands-on Introduction Data warehousing is a solution organizations use to centralize business data for

More information

Playing Outside Your Sandbox INTERACTING WITH OTHER SYSTEMS USING SHAREPOINT BCS

Playing Outside Your Sandbox INTERACTING WITH OTHER SYSTEMS USING SHAREPOINT BCS Playing Outside Your Sandbox INTERACTING WITH OTHER SYSTEMS USING SHAREPOINT BCS David Drever o Digital Workplace Services Lead o Office Servers & Services MVP o Saskatchewan SharePoint/O365 User Group

More information

Updating your Business Intelligence Skills to Microsoft SQL Server 2012

Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A: Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course Details Course Outline Module 1: Introduction to SQL Server 2012 for Business Intelligence This module provides

More information

Understanding the latent value in all content

Understanding the latent value in all content Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence

More information

Index. Scott Klein 2017 S. Klein, IoT Solutions in Microsoft s Azure IoT Suite, DOI /

Index. Scott Klein 2017 S. Klein, IoT Solutions in Microsoft s Azure IoT Suite, DOI / Index A Advanced Message Queueing Protocol (AMQP), 44 Analytics, 9 Apache Ambari project, 209 210 API key, 244 Application data, 4 Azure Active Directory (AAD), 91, 257 Azure Blob Storage, 191 Azure data

More information

Azure Integration Services

Azure Integration Services Azure Integration Services 2018 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and views expressed in this document, including URL and other Internet Web site

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

EMEA USERS CONFERENCE BERLIN, GERMANY. Copyright 2016 OSIsoft, LLC

EMEA USERS CONFERENCE BERLIN, GERMANY. Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Franco Camba, OSIsoft Matt Ziegler, OSIsoft Frank Ruland, SAP Audience Poll Have you invested or are you looking into Business Intelligence tools?

More information

Alexander Klein. ETL in the Cloud

Alexander Klein. ETL in the Cloud Alexander Klein ETL in the Cloud Sponsors help us to run this event! THX! You Rock! Sponsor Gold Sponsor Silver Sponsor Bronze Sponsor You Rock! Sponsor Session 13:45 Track 1 Das super nerdige Solisyon

More information

Bull Fast Track/PDW and Big Data

Bull Fast Track/PDW and Big Data Bull Fast Track/PDW and Big Data Add High Performance BI to your Big Data Roger Van Unen Expert Microsoft / BI roger.van-unen@bull.net http://www.bull.fr/bi/fastrack.html Michael Schmitter BI Sales Germany

More information

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile Course Content Course Description: This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with

More information

28 February 1 March 2018, Trafo Baden. #techsummitch

28 February 1 March 2018, Trafo Baden. #techsummitch #techsummitch 28 February 1 March 2018, Trafo Baden #techsummitch Transform your data estate with cloud, data and AI #techsummitch The world is changing Data will grow to 44 ZB in 2020 Today, 80% of organizations

More information

MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION. Includes certifications for Microsoft Azure and Windows Server

MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION. Includes certifications for Microsoft Azure and Windows Server MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION Includes certifications for Microsoft Azure and Windows Server Microsoft Azure MCSA: Cloud Platform Pass 2 required exams. M20532 M20533 M20535

More information

SQL Server Machine Learning Marek Chmel & Vladimir Muzny

SQL Server Machine Learning Marek Chmel & Vladimir Muzny SQL Server Machine Learning Marek Chmel & Vladimir Muzny @VladimirMuzny & @MarekChmel MCTs, MVPs, MCSEs Data Enthusiasts! vladimir@datascienceteam.cz marek@datascienceteam.cz Session Agenda Machine learning

More information

Why Choose MS Azure?

Why Choose MS Azure? Why Choose MS Azure? Bio Dave Williams Cloud Architect AWS and Azure Expertise Agenda How to choose a public cloud service provider Microsoft Azure Strengths: Identity & Azure Active Directory Infrastructure

More information

Azure File Sync. Webinaari

Azure File Sync. Webinaari Azure File Sync Webinaari 12.3.2018 Agenda Why use Azure? Moving to the Cloud Azure Storage Backup and Recovery Azure File Sync Demo Q&A What is Azure? A collection of cloud services from Microsoft that

More information

Transitioning From SSIS to Azure Data Factory. Meagan Longoria, Solution Architect, BlueGranite

Transitioning From SSIS to Azure Data Factory. Meagan Longoria, Solution Architect, BlueGranite Transitioning From SSIS to Azure Data Factory Meagan Longoria, Solution Architect, BlueGranite Microsoft Data Platform MVP I enjoy contributing to and learning from the Microsoft data community. Blogger

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

Go USA! Get There Faster. 6/29/2010 Process World

Go USA! Get There Faster. 6/29/2010 Process World Go USA! Get There Faster. 6/29/2010 Process World 2010 2 Go Germany! Get There Faster. BPM + BI = Process Intelligence Top 10 Business Priorities Rank Top 10 Technology Priorities Rank Business process

More information

Welcome to the Era of Open Analytics

Welcome to the Era of Open Analytics Welcome to the Era of Open Analytics Sumit Sarkar, Chief Data Evangelist, Progress sumit.sarkar@progress.com @SAsInSumit www.linkedin.com/in/meetsumit The embedded analytics market is estimated to nearly

More information

Processing Big Data. with AZURE DATA LAKE ANALYTICS. Sean Forgatch - Senior Consultant. 6/23/ TALAVANT. All Rights Reserved.

Processing Big Data. with AZURE DATA LAKE ANALYTICS. Sean Forgatch - Senior Consultant. 6/23/ TALAVANT. All Rights Reserved. Processing Big Data with AZURE DATA LAKE ANALYTICS Sean Forgatch - Senior Consultant 6/23/2018 2018 TALAVANT. All Rights Reserved. 1 SQL Saturday Iowa 2018 6/23/2018 2018 TALAVANT. All Rights Reserved.

More information

Master BIG DATA with SQL Server 2012

Master BIG DATA with SQL Server 2012 Roy Pasternak Data Platform & BI Lead Ori Weinroth Product Marketing Manager, SQL Server Master BIG DATA with SQL Server 2012 Characteristics of Big Data Large Data Volumes The Twitter Community generates

More information

An InterSystems Guide to the Data Galaxy. Benjamin De Boe Product Manager

An InterSystems Guide to the Data Galaxy. Benjamin De Boe Product Manager An InterSystems Guide to the Data Galaxy Benjamin De Boe Product Manager Analytics 3 InterSystems Corporation. All rights reserved. 4 InterSystems Corporation. All rights reserved. 5 InterSystems Corporation.

More information

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R Dr. SubraMANI Paramasivam Think & Work like a Data Scientist with SQL 2016 & R About the Speaker Group Leader Dr. SubraMANI Paramasivam PhD., MVP, MCT, MCSE (x2), MCITP (x2), MCP, MCTS (x3), MCSA CEO,

More information

PowerPivot, an Introduction. By: Steve Lewis Principal Pyxis Analytics

PowerPivot, an Introduction. By: Steve Lewis Principal Pyxis Analytics PowerPivot, an Introduction By: Steve Lewis Principal Pyxis Analytics Agenda What is the BISM Model? Components of the BISM Model DAX Overview Walkthroughs What is the BISM Model Business Intelligence

More information

Capture Business Opportunities from Systems of Record and Systems of Innovation

Capture Business Opportunities from Systems of Record and Systems of Innovation Capture Business Opportunities from Systems of Record and Systems of Innovation Amit Satoor, SAP March Hartz, SAP PUBLIC Big Data transformation powers digital innovation system Relevant nuggets of information

More information

Microsoft Analytics Platform System (APS)

Microsoft Analytics Platform System (APS) Microsoft Analytics Platform System (APS) The turnkey modern data warehouse appliance Matt Usher, Senior Program Manager @ Microsoft About.me @two_under Senior Program Manager 9 years at Microsoft Visual

More information

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap

Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation. Chris Herrera Hashmap Apache Ignite - Using a Memory Grid for Heterogeneous Computation Frameworks A Use Case Guided Explanation Chris Herrera Hashmap Topics Who - Key Hashmap Team Members The Use Case - Our Need for a Memory

More information

Boost your Analytics with ML for SQL Nerds

Boost your Analytics with ML for SQL Nerds Boost your Analytics with ML for SQL Nerds SQL Saturday Spokane Mar 10, 2018 Julie Koesmarno @MsSQLGirl mssqlgirl.com jukoesma@microsoft.com Principal Program Manager in Business Analytics for SQL Products

More information

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks Asanka Padmakumara ETL 2.0: Data Engineering with Azure Databricks Who am I? Asanka Padmakumara Business Intelligence Consultant, More than 8 years in BI and Data Warehousing A regular speaker in data

More information

Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A; 3 Days, Instructor-led

Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A; 3 Days, Instructor-led Updating your Business Intelligence Skills to Microsoft SQL Server 2012 Course 40009A; 3 Days, Instructor-led Course Description This three-day instructor-led course provides existing SQL Server Business

More information

Implement a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server Implement a Data Warehouse with Microsoft SQL Server 20463D; 5 days, Instructor-led Course Description This course describes how to implement a data warehouse platform to support a BI solution. Students

More information

BI, Big Data, Mission Critical. Eduardo Rivadeneira Specialist Sales Manager

BI, Big Data, Mission Critical. Eduardo Rivadeneira Specialist Sales Manager BI, Big Data, Mission Critical Eduardo Rivadeneira Specialist Sales Manager Required 9s & Protection Blazing-Fast Performance Enhanced Security & Compliance Rapid Data Exploration & Visualization Managed

More information

Cloudy with 100% chance of data

Cloudy with 100% chance of data # T C 1 8 Cloudy with 100% chance of data Phillip Cheung Technical Program Manager Tableau Kevin Boske Analytics Manager Tableau SUPPORTING 3,300 EMPLOYEES 196 SERVER CORES 150 EXTERNAL FACING TABLEAU

More information

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST

Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0. WEBINAR MAY 15 th, PM EST 10AM PST Making Data Integration Easy For Multiplatform Data Architectures With Diyotta 4.0 WEBINAR MAY 15 th, 2018 1PM EST 10AM PST Welcome and Logistics If you have problems with the sound on your computer, switch

More information

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI)

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI) Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI) About the Speaker Dr. SubraMANI Paramasivam PhD., MCT, MCSE, MCITP, MCP, MCTS, MCSA CEO, Principal Consultant & Trainer

More information

One is the Loneliest Number: Scaling out your Data Warehouse

One is the Loneliest Number: Scaling out your Data Warehouse One is the Loneliest Number: Scaling out your Data Warehouse Greg Galloway SQL Saturday Dallas #396 BI Edition Page 1 Agenda Common data warehouse pain points Analytics Platform System (APS) overview Analytics

More information

Power BI for the Enterprise

Power BI for the Enterprise Power BI for the Enterprise Paul Turley Principal Consultant, Intelligent Business LLC Microsoft Data Platform MVP, SolidQ Mentor e: Paul@IntelligentBiz.net t: @paul_turley b: SqlServerBiBlog.com Please

More information

Processing Unstructured Data. Dinesh Priyankara Founder/Principal Architect dinesql Pvt Ltd.

Processing Unstructured Data. Dinesh Priyankara Founder/Principal Architect dinesql Pvt Ltd. Processing Unstructured Data Dinesh Priyankara Founder/Principal Architect dinesql Pvt Ltd. http://dinesql.com / Dinesh Priyankara @dinesh_priya Founder/Principal Architect dinesql Pvt Ltd. Microsoft Most

More information

SAP HANA as an Accelerator for PLM Processes HANA Basics and Scenarios

SAP HANA as an Accelerator for PLM Processes HANA Basics and Scenarios SAP HANA as an Accelerator for PLM Processes HANA Basics and Scenarios Michael Dietz, Principal Solution Architect HANA Public Agenda SAP HANA Platform Usage Scenarios Potentials in Product Lifecycle Management

More information

Oregon SQL Welcomes You to SQL Saturday Oregon

Oregon SQL Welcomes You to SQL Saturday Oregon Oregon SQL Welcomes You to SQL Saturday Oregon 2012-11-03 Introduction to SQL Server 2012 MDS and DQS Peter Myers Bitwise Solutions Presenter Introduction Peter Myers BI Expert, Bitwise Solutions BBus,

More information

Security & Management

Security & Management Common Identity Data Platform Security & Management Unified Development Category Azure Technology On-Premise Technology Hybrid Storage Azure Storage StorSimple Hybrid Backup & DR Azure Backup + Azure Site

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 6 Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: 4 days; Instructor-Led Introduction This course

More information

Oracle Big Data Science IOUG Collaborate 16

Oracle Big Data Science IOUG Collaborate 16 Oracle Big Data Science IOUG Collaborate 16 Session 4762 Tim and Dan Vlamis Tuesday, April 12, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+ Oracle

More information

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) Microsoft Solution Latest Sl Area Refresh No. Course ID Run ID Course Name Mapping Date 1 AZURE202x 2 Microsoft

More information

Get ready to be what s next.

Get ready to be what s next. Get ready to be what s next. Jared Shockley http://jaredontech.com Senior Service Engineer Prior Experience @jshoq Primary Experience Areas David Leveille http://www.linkedin.com/in/leveille Senior Solutions

More information

Azure SQL Database. Indika Dalugama. Data platform solution architect Microsoft datalake.lk

Azure SQL Database. Indika Dalugama. Data platform solution architect Microsoft datalake.lk Azure SQL Database Indika Dalugama Data platform solution architect Microsoft indalug@microsoft.com datalake.lk Agenda Overview Azure SQL adapts Azure SQL Instances (single,e-pool and MI) How to Migrate

More information

Open Source Tools as a platform for research on Microsoft Azure

Open Source Tools as a platform for research on Microsoft Azure Open Source Tools as a platform for research on Microsoft Azure Alessandro Jannuzi Open Source Lead Microsoft Brasil Jaime Puente Director Microsoft Research Azure, Microsoft Cloud Platform 24 Regions

More information

Heute in der Suppenküche: Cognitive Services Allerlei

Heute in der Suppenküche: Cognitive Services Allerlei Heute in der Suppenküche: Cognitive Services Allerlei Marcel Tilly Microsoft marcel.tilly@microsoft.com Constantin Kostja Klein Freudenberg IT @KostjaKlein ckl@sqlpass.de Our Sponsors What product is Joe

More information

Implementing a Data Warehouse with Microsoft SQL Server 2014

Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: Implementing a Data Warehouse with Microsoft SQL Server 2014 Page 1 of 5 Implementing a Data Warehouse with Microsoft SQL Server 2014 Course 20463D: 4 days; Instructor-Led Introduction This

More information