ETL is No Longer King, Long Live SDD

Size: px
Start display at page:

Download "ETL is No Longer King, Long Live SDD"

Transcription

1 ETL is No Longer King, Long Live SDD How to Close the Loop from Discovery to Information () to Insights (Analytics) to Outcomes (Business Processes) A presentation by Brian McCalley of DXC Technology, Glenn Field of SiriusIQ and Gavin Robertson of WhamTech, Inc.

2 No secret that most organizations face major datarelated hurdles Location System Access Security Container Format Age Dirty Typo/Transposition Missing Meaning Duplication Obfuscation Governance

3 and ANALYTICS is the prime driver to lower costs and increase revenue which, in turn, drives the need for applications* to have clean and understood data in specific formats *Reporting, BI, analytics, CDI-MDM, CRM, SCM, fraud detection, antimoney laundering, ERP, etc.

4 Goals for an optimal data architecture 1. Complete, clean, transformed, standardized and secure data, and master data, for multiple applications 2. Near real-time minimal update and query latency 3. Automation, including workflow and event processing 4. Support reporting, BI and analytics, including graph database 5. Minimize copies of data 6. discovery, metadata repository and data governance 7. Write back to data sources

5 Managing data in, or from, multiple disparate systems requires a new approach CONVENTIONAL APPROACHES ETL: Copy and transform schemas and data to a one-size-fits-all data warehouse Copy: To a single Lake/Big repository Federate: Submit queries through adapters to source systems Search: E.g., Solr /Elasticsearch - copy, read, parse and index data, process queries to provide data Big options for storing data CULTURAL HURDLES TECHNICAL HURDLES SECURITY & PRIVACY HURDLES The new approach leverages the advantages of each conventional approach

6 Typical Warehouse/Big / Lake/Search Application(s) Source Source Source Extract Extract Extract and Schema Transform and Schema Transform and Schema Transform Load Load Load Warehouse/ Big / Lake/ Search Queries resolved in the Warehouse/Big / Lake/ Search Expensive in terms of time and cost to implement and maintain

7 Typical federated data access with conventional adapters Source Connector Adapter Source Connector Adapter Middleware Application(s) Source Connector Adapter Queries resolved mainly in the data source Expensive to implement and limits capabilities

8 ETL has been the only option to come close to meeting the goals for an optimal data architecture, until now Introducing Software Defined (SDD) consisting of unconventional federated adapters that Read, Transform (process) and Index (RTI) source data and process queries against these indexes

9 SDD initial discovery, index and adapter configuration, index build and Standard View (SDV) mapping Discovery and raw indexbased Profiling Metadata Discovery and Semantic Mapping Classification and Security Develop and test Transforms using profiles Distributed Metadata Repository, incl. Governance Source Discovery Network Asset and Device Discovery Source Read, Transform/ clean-up (and Index) Indexes SDD Adapter w/sdv Twelve ways to build and maintain indexes Index schema and names same as data source Indexes do not store data only queryable representations Indexes mapped to Standard View (SDV)

10 SDD index update, query processing and results retrieval Distributed Metadata Repository, incl. Governance Queries resolved in the adapter and indexes Result-set pointers to data in source Source Read, Transform/ clean-up (and Index) User-level access Continuous EIQ Indexes updates Indexes SDD Adapter w/sdv Raw results data transformed/cleaned-up from source SDD Federation Server (submiddleware) w/sdv Applications/middleware connect with standard drivers, APIs, Web/data services and SQL Middleware Application(s) Results provided in almost any format SDD Federation Server Multiple other data sources

11 F I R E W A L L F I R E W A L L SDD adapters can co-exist with other types of adapters to System of Records RDBMS Indexes SDD Adapter Social Media Feed Indexes SDD Adapter SDD Federation Server Hadoop Indexes SDD Adapter Mainframe ERP System Indexes SDD Conventional Adapter SDD Adapter SDD Federation Server SDD Federation Server TCP / IP ODBC/ JDBC Driver REST API Etc. Application(s) Salesforce 3 rd Party Adapter

12 SDD indexes and adapters can be deployed and accessed anywhere, and at any level in multiple combinations Indexes are 100% contiguous, regardless of where or how deployed

13 How does Software Defined compare with other approaches?

14 Goal #1: Complete, clean, transformed, standardized and secure data, and master data, for multiple applications Goal #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( )

15 Goal Goal #2: Near real-time minimize update and query latency #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( )

16 Goal Goal #3: Automation, including workflow and event processing #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( ) #3 Automation, including workflow and event processing ( )

17 Goal Goal #4: Support reporting, BI and analytics, including graph database #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( ) #3 Automation, including workflow and event processing ( ) #4 Support reporting, BI and analytics, including graph database ( ) ( ) ( ) ( )

18 Goal #5: Minimize copies of data Goal #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( ) #3 Automation, including workflow and event processing ( ) #4 Support reporting, BI and analytics, including graph database ( ) ( ) ( ) ( ) #5 Minimize copies of data

19 Goal Goal #6: discovery, metadata repository and data governance #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( ) #3 Automation, including workflow and event processing ( ) #4 Support reporting, BI and analytics, including graph database ( ) ( ) ( ) ( ) #5 Minimize copies of data #6 discovery, metadata repository and data governance ( ) ( ) ( )

20 Goal #7: Write back to data sources Goal #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications SDD ETL to a Warehouse Big Lake Conventional Federated Adapters Solr/ Elasticsearch ( ) #2 Near real-time minimize update and query latency ( ) ( ) ( ) ( ) #3 Automation, including workflow and event processing ( ) #4 Support reporting, BI and analytics, including graph database ( ) ( ) ( ) ( ) #5 Minimize copies of data #6 discovery, metadata repository and data governance ( ) ( ) ( ) #7 Write back to data sources

21 How SDD meets goals for an optimal data architecture Goal #1 Complete, clean, transformed, standardized and secure data, and master data, for multiple applications Software Defined Process source data as building and maintaining indexes and master data, and as reading raw results data Multiple indexes, views, means of access and result formats #2 Near real-time minimize update and query latency Changed data capture High performance, parallel distributed processing almost no load on data sources #3 Automation, including workflow and event processing Index monitoring, REST APIs and workflow integration #4 Support reporting, BI and analytics, including graph database Indexed views, provision highly curated data to analytics, run analytics, and built-in virtual graph database and link analysis #5 Minimize copies of data Can leave and secure data in sources, a Lake or indexes #6 discovery, metadata repository and data governance Use raw indexes for discovery, metadata and combining with IAM and RBAC for data governance from edge/bottom up #7 Write back to data sources Can read as well as insert, delete and update

22 Software Defined (SDD) Implementation and alignment of use-cases is the key to driving Enterprise IP. Technology prohibits this due to binding of data elements within applications Freeing data to create workflows will dramatically reduce time to market Incrementally developing enterprise use-cases through SDD drives innovation to next-gen path allowing a reinvention of the enterprise Agnostic de-coupling of silo solutions drives speed to market Use-case consumption for any user, on any device, anywhere securely enhances collaboration and productivity

23 Software Defined (SDD) Process to eliminate upwards of 95% of today s regression issues translates into 50-75% calendar time savings Business logic and code base functions grow incrementally as business dictates Cumulative code growth ensures reuse, optimal performance and agnostic access. You also benefit from globally available logic Zero impact deployments eliminates downtime and simplifies the SDLC Dynamic, intelligent workflows consume new features when live

24 AI and NLP are the new UI for many applications Leveraging next-gen cognitive rapidly delivers functionality and results Millennial workforce alignment Dramatic decrease in training requirements Dramatic decrease in time to market on features and results Disconnect 3 rd party backend solutions from natural language UI Allow seamless app upgrades by using AI UI, which will interact with both old and new systems

25 Abstracted 3-tier architecture connected through a Smart Fabric PLATFORM MANAGEMENT SERVICES Ecosystem Solutions SYSTEM OF ENGAGEMENT - APPLICATIONS Line-of-Business Users SYSTEM OF INSIGHTS ANALYTICS WAREHOUSE SYSTEM OF RECORDS INTERNAL / EXTERNAL DATA SOURCES SDD Fabric API Catalog

26 SDD is the First Paradigm Shift in how data and analytics are managed in a common meta-object framework *Indexes per data source on structured, unstructured and semistructured data that can either store or not store (default) data SYSTEM OF ENGAGEMENT - APPLICATIONS SYSTEM OF INSIGHTS ANALYTICS WAREHOUSE SYSTEM OF RECORDS INTERNAL / EXTERNAL DATA SOURCES Identity Management Access Control Other Corporate Governance Application and Other Corporate Security Standard drivers, APIs, Web/data services, SQL, Spark SQL and other query languages Distributed Management, Including Source Results Read and Write Back INDEXES* Metadata Repository Governance Role-based Access Control API Management Business Rules Discovery Profiling Security Analytics Quality Transformation Standardization Relationships BPM Software Master Management Audit Logs Virtual Graph base Optional Lake Real-time, Incremental or Batch Updated, Centralized or Distributed

27 The Second Paradigm Shift is the concept of the Analytics Warehouse

28 The Second Paradigm Shift is the concept of the Analytics Warehouse

29 An example of an Analytics Warehouse architecture data ingestion/warehouse model

30 An example of an Analytics Warehouse architecture spanning enterprise systems federated model

31 Examples of applying SDD and an Analytics Warehouse to healthcare analytics

32 3rd Party Partner Ecosystem Healthcare Cloud SDD Fabric Healthcare Clinical Network Management 3-Tier Architecture End-End Integrated Application Ecosystem Environment across Continuum of Care SoE Analytical Insights Catalog and Ecosystem Clinical, Financial, Administrative, Operational SoI Enterprise/ Ecosystem-wide Master Management Platform Longitudinal Patient Record SoR Development and IT Operations Support Environment Human Capital Management, Finance, Sales & Marketing AI Driven Operations Automation End-to-End Security

33 SDD Fabric enables a Longitudinal Patient Record (LPR) view across multiple System of Records, across multiple enterprises Transparent distributed data management layer that plugs-and-plays in existing IT infrastructures Complements and leverages existing IT systems, tools and applications Leave and guard data in sources, copies, e.g., Lake, or stored in indexes a hybrid approach Address upfront data discovery, security, quality, standards, MDM and other data-related processes

34 Use cases from healthcare that combine data and analytics management Use Cases Applications Clinical Applications Diabetes, Hypertension, Heart Failure, etc. Gaps in Care Predictive Readmissions Management Clinical Wellness Management Operational Management Operational Management Hospital Operational Management Physician Practices Physician Quality Reporting Scores Financial Performance Financial Management Hospital Financial Management Physician Practices Claims Analytics Regulatory Reporting Hospital Value Based Purchasing (HVBP) HEDIS Patient Centered Medical Home Scorecard MU 2 Clinical Quality Measures-Hospitals/Physicians MU2 Usage Scorecard - Physicians ACO Quality Reporting Hospital Outpatient Quality Reporting Patient Similarity Comparative Effectiveness Research Predictive models Chronic disease management Ability to create patient cohorts Cohort Manager/Chronic Condition Management Population Management Population Focus & Population Care

35 Conclusion of why ETL is no longer King, long live SDD SDD enables a data management paradigm shift SDD supports an analytics management paradigm shift ETL still has value, but not exclusively SDD can greatly enhance, complement and/or replace ETL in the future SDD is more suited than ETL to the new world of: everywhere API data services/catalog Parallel distributed processing Event and workflow processing Near real-time architectures

36 Thank you A presentation by Brian McCalley of DXC Technology, Glenn Field of SiriusIQ and Gavin Robertson of WhamTech, Inc.

37 Q&A

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1

Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Comparison of SmartData Fabric with Cloudera and Hortonworks Revision 2.1 Page 1 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com August 2018 Page 2 of 11 www.whamtech.com (972) 991-5700 info@whamtech.com

More information

SmartData Fabric (SDF) aka SmartData Lake (SDL) aka Distributed Data Virtualization (DDV) Basic Overview

SmartData Fabric (SDF) aka SmartData Lake (SDL) aka Distributed Data Virtualization (DDV) Basic Overview Smart Fabric (SDF) aka Smart Lake (SDL) aka Distributed Virtualization (DDV) Basic Overview October 2018 Revision 8.6 Copyright 2018 WhamTech, Inc. 1 Smart Fabric Unique indexed adapter-based data virtualization,

More information

Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric

Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric Distributed Hybrid MDM, aka Virtual MDM Optional Add-on, for WhamTech SmartData Fabric Revision 2.1 Page 1 of 17 www.whamtech.com (972) 991-5700 info@whamtech.com August 2018 Contents Introduction... 3

More information

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2.

SmartData Fabric distributed virtual data, graph data and master data management, analytics and security. Solutions and Key Features Revision 2. s and Key Features Revision 2.5 Page 1 of 7 www.whamtech.com (972) 991-5700 info@whamtech.com March 2018 ID SOL1 Automated Data Discovery and Classification (ADDC) Key Feature ID KF01 KF02 KF03 Key Feature

More information

Building a Data Strategy for a Digital World

Building a Data Strategy for a Digital World Building a Data Strategy for a Digital World Jason Hunter, CTO, APAC Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies Data Hub 100 s of Service

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

The Value of Data Governance for the Data-Driven Enterprise

The Value of Data Governance for the Data-Driven Enterprise Solution Brief: erwin Data governance (DG) The Value of Data Governance for the Data-Driven Enterprise Prepare for Data Governance 2.0 by bringing business teams into the effort to drive data opportunities

More information

ADABAS & NATURAL 2050+

ADABAS & NATURAL 2050+ ADABAS & NATURAL 2050+ Guido Falkenberg SVP Global Customer Innovation DIGITAL TRANSFORMATION #WITHOUTCOMPROMISE 2017 Software AG. All rights reserved. ADABAS & NATURAL 2050+ GLOBAL INITIATIVE INNOVATION

More information

WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE WEBMETHODS. What you can expect from webmethods

WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE WEBMETHODS. What you can expect from webmethods WEBMETHODS WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE What you can expect from webmethods Software AG s vision is to power the Digital Enterprise. Our technology, skills and expertise enable you to

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems 1 Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems The Defacto Choice For Convergence 2 ABSTRACT & SPEAKER BIO Dealing with enormous data growth is a key challenge for

More information

Transforming IT: From Silos To Services

Transforming IT: From Silos To Services Transforming IT: From Silos To Services Chuck Hollis Global Marketing CTO EMC Corporation http://chucksblog.emc.com @chuckhollis IT is being transformed. Our world is changing fast New Technologies New

More information

Title DC Automation: It s a MARVEL!

Title DC Automation: It s a MARVEL! Title DC Automation: It s a MARVEL! Name Nikos D. Anagnostatos Position Network Consultant, Network Solutions Division Classification ISO 27001: Public Data Center Evolution 2 Space Hellas - All Rights

More information

REDEFINING THE ENTERPRISE

REDEFINING THE ENTERPRISE REDEFINING THE ENTERPRISE ENABLING IT AND BUSINESS TRANSFORMATION WITH INDUSTRY BENCHMARKS 1 TODAY S BUSINESS CHALLENGES REACT FASTER TO FIND NEW GROWTH CUT OPERATIONAL COSTS & LEGACY MORE THAN EVER 2

More information

Data Analytics at Logitech Snowflake + Tableau = #Winning

Data Analytics at Logitech Snowflake + Tableau = #Winning Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief

More information

The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications

The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications By Donna Burbank Managing Director, Global Data Strategy, Ltd www.globaldatastrategy.com Sponsored

More information

Tamr Technical Whitepaper

Tamr Technical Whitepaper Tamr Technical Whitepaper 1. Executive Summary Tamr was founded to tackle large-scale data management challenges in organizations where extreme data volume and variety require an approach different from

More information

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp. Data 101 Which DB, When Joe Yong (joeyong@microsoft.com) Azure SQL Data Warehouse, Program Management Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020

More information

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG The #1 Challenge in Successfully Deploying a Data Catalog The data cataloging space is relatively new. As a result, many organizations don

More information

Optimized Data Integration for the MSO Market

Optimized Data Integration for the MSO Market Optimized Data Integration for the MSO Market Actions at the speed of data For Real-time Decisioning and Big Data Problems VelociData for FinTech and the Enterprise VelociData s technology has been providing

More information

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp.

Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. 17-18 March, 2018 Beijing Data 101 Which DB, When Joe Yong Sr. Program Manager Microsoft Corp. The world is changing AI increased by 300% in 2017 Data will grow to 44 ZB in 2020 Today, 80% of organizations

More information

Dell Boomi Cloud MDM Overview

Dell Boomi Cloud MDM Overview Dell Boomi Cloud MDM Overview Dell Boomi s Multi-Purpose PaaS Boomi as the Multi-Purpose PaaS for enterprise data management Move: AtomSphere Integration Manage: Master Data Management (MDM) Govern: API

More information

What is Gluent? The Gluent Data Platform

What is Gluent? The Gluent Data Platform What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the

More information

The Value of Data Modeling for the Data-Driven Enterprise

The Value of Data Modeling for the Data-Driven Enterprise Solution Brief: erwin Data Modeler (DM) The Value of Data Modeling for the Data-Driven Enterprise Designing, documenting, standardizing and aligning any data from anywhere produces an enterprise data model

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Government IT Modernization and the Adoption of Hybrid Cloud

Government IT Modernization and the Adoption of Hybrid Cloud Government IT Modernization and the Adoption of Hybrid Cloud An IDC InfoBrief, Sponsored by VMware June 2018 Federal and National Governments Are at an Inflection Point Federal and national governments

More information

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics

Optimizing and Modeling SAP Business Analytics for SAP HANA. Iver van de Zand, Business Analytics Optimizing and Modeling SAP Business Analytics for SAP HANA Iver van de Zand, Business Analytics Early data warehouse projects LIMITATIONS ISSUES RAISED Data driven by acquisition, not architecture Too

More information

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud Microsoft Azure Databricks for data engineering Building production data pipelines with Apache Spark in the cloud Azure Databricks As companies continue to set their sights on making data-driven decisions

More information

Energy Management with AWS

Energy Management with AWS Energy Management with AWS Kyle Hart and Nandakumar Sreenivasan Amazon Web Services August [XX], 2017 Tampa Convention Center Tampa, Florida What is Cloud? The NIST Definition Broad Network Access On-Demand

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Delivering a 360 o View in Healthcare and Life Sciences With Agile Data

Delivering a 360 o View in Healthcare and Life Sciences With Agile Data Delivering a 360 o View in Healthcare and Life Sciences With Agile Data Imran Chaudhri, @imrantech, Solutions Director, Healthcare & Life Sciences Mark Ferneau, @ferneau, Practice Manager, Healthcare &

More information

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets

The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets The NIH Collaboratory Distributed Research Network: A Privacy Protecting Method for Sharing Research Data Sets Jeffrey Brown, Lesley Curtis, and Rich Platt June 13, 2014 Previously The NIH Collaboratory:

More information

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List) Microsoft Solution Latest Sl Area Refresh No. Course ID Run ID Course Name Mapping Date 1 AZURE202x 2 Microsoft

More information

Overview of Data Services and Streaming Data Solution with Azure

Overview of Data Services and Streaming Data Solution with Azure Overview of Data Services and Streaming Data Solution with Azure Tara Mason Senior Consultant tmason@impactmakers.com Platform as a Service Offerings SQL Server On Premises vs. Azure SQL Server SQL Server

More information

Realizing the Full Potential of MDM 1

Realizing the Full Potential of MDM 1 Realizing the Full Potential of MDM SOLUTION MDM Augmented with Data Virtualization INDUSTRY Applicable to all Industries EBSITE www.denodo.com PRODUCT OVERVIE The Denodo Platform offers the broadest access

More information

API, DEVOPS & MICROSERVICES

API, DEVOPS & MICROSERVICES API, DEVOPS & MICROSERVICES RAPID. OPEN. SECURE. INNOVATION TOUR 2018 April 26 Singapore 1 2018 Software AG. All rights reserved. For internal use only THE NEW ARCHITECTURAL PARADIGM Microservices Containers

More information

BRINGING DATA LINEAGE TO YOUR FINGERTIPS

BRINGING DATA LINEAGE TO YOUR FINGERTIPS DATA INTELLIGENCE ASG TECHNOLOGIES LINEAGE APPLIANCE Tailor Data Lineage to Your Enterprise Embed Data Lineage from ASG s Enterprise Data Intelligence Solution Wherever You Need It BRINGING DATA LINEAGE

More information

Modernization and how to implement Digital Transformation. Jarmo Nieminen Sales Engineer, Principal

Modernization and how to implement Digital Transformation. Jarmo Nieminen Sales Engineer, Principal Modernization and how to implement Digital Transformation Jarmo Nieminen Sales Engineer, Principal jarmo.nieminen@progress.com 2 Reinvented 8000 years old tool...? Leveraxe!! 3 In this Digital Economy...

More information

Configuration for Registry*, Repository or Hybrid** Master Data Management. *aka Federated or Virtual **aka Coexistence

Configuration for Registry*, Repository or Hybrid** Master Data Management. *aka Federated or Virtual **aka Coexistence Configuration for Registry*, Repository or Hybrid** Master Data Management *aka Federated or Virtual **aka Coexistence October 2017 (Best viewed in slideshow mode) Revision 3.6 Copyright 2017 WhamTech,

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved.

Gain Insights From Unstructured Data Using Pivotal HD. Copyright 2013 EMC Corporation. All rights reserved. Gain Insights From Unstructured Data Using Pivotal HD 1 Traditional Enterprise Analytics Process 2 The Fundamental Paradigm Shift Internet age and exploding data growth Enterprises leverage new data sources

More information

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018 Big Data com Hadoop Impala, Hive e Spark VIII Sessão - SQL Bahia 03/03/2018 Diógenes Pires Connect with PASS Sign up for a free membership today at: pass.org #sqlpass Internet Live http://www.internetlivestats.com/

More information

Big Data with Hadoop Ecosystem

Big Data with Hadoop Ecosystem Diógenes Pires Big Data with Hadoop Ecosystem Hands-on (HBase, MySql and Hive + Power BI) Internet Live http://www.internetlivestats.com/ Introduction Business Intelligence Business Intelligence Process

More information

MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION. Includes certifications for Microsoft Azure and Windows Server

MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION. Includes certifications for Microsoft Azure and Windows Server MICROSOFT CLOUD PLATFORM AND INFRASTRUCTURE CERTIFICATION Includes certifications for Microsoft Azure and Windows Server Microsoft Azure MCSA: Cloud Platform Pass 2 required exams. M20532 M20533 M20535

More information

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract PERSPECTIVE Data Virtualization A Potential Antidote for Big Data Growing Pains Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and value. Now they

More information

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 Note Before using this information

More information

Understanding the latent value in all content

Understanding the latent value in all content Understanding the latent value in all content John F. Kennedy (JFK) November 22, 1963 INGEST ENRICH EXPLORE Cognitive skills Data in any format, any Azure store Search Annotations Data Cloud Intelligence

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure

Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure Modernizing Healthcare IT for the Data-driven Cognitive Era Storage and Software-Defined Infrastructure An IDC InfoBrief, Sponsored by IBM April 2018 Executive Summary Today s healthcare organizations

More information

Building the Digital Media Workstream: From project pitch to program purchase

Building the Digital Media Workstream: From project pitch to program purchase MEDIA & ENTERTAINMENT Building the Digital Media Workstream: From project pitch to program purchase Charles Matheson, Industry Strategist, M&E, OpenText The old ways are fading away Big footprint Capital

More information

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software, Inc. June 2011 TABLE OF CONTENTS INTRODUCTION... 3 DATA FEDERATION... 4 PROBLEM DATA CONSOLIDATION

More information

Virtuoso Infotech Pvt. Ltd.

Virtuoso Infotech Pvt. Ltd. Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

Red Hat JBoss Middleware Integration Products Roadmap. Ken Johnson Director, Product Management, Red Hat

Red Hat JBoss Middleware Integration Products Roadmap. Ken Johnson Director, Product Management, Red Hat Red Hat JBoss Middleware Integration Products Roadmap Ken Johnson Director, Product Management, Red Hat The Plan... Integration Products Overview Product-by-product Intro Roadmap Cross-product initiatives

More information

2-4 April 2019 Taets Art and Event Park, Amsterdam CLICK TO KNOW MORE

2-4 April 2019 Taets Art and Event Park, Amsterdam CLICK TO KNOW MORE Co-Host Host 2-4 April 2019 Taets Art and Event Park, Amsterdam CLICK TO KNOW MORE Oracle Cloud Computing Strategy Han Wammes Public Sector Market Development Manager 1 Copyright 2012, Oracle and/or its

More information

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality? Oliver Engels & Tillmann Eitelberg Big Data! Big Quality? Like to visit Germany? PASS Camp 2017 Main Camp 5.12 7.12.2017 (4.12 Kick Off Evening) Lufthansa Training & Conference Center, Seeheim SQL Konferenz

More information

Solving the Enterprise Data Dilemma

Solving the Enterprise Data Dilemma Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business

More information

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22 Take P, R or U and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22 Oliver Engels CEO, oh22data AG @oengels Datamonster from Germany MS Data Platform MVP President of PASS Germany

More information

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU

Digital Enterprise Platform for Live Business. Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Digital Enterprise Platform for Live Business Kevin Liu SAP Greater China, Vice President General Manager of Big Data and Platform BU Rethinking the Future Competing in today s marketplace means leveraging

More information

Informatica Data Quality Product Family

Informatica Data Quality Product Family Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity

More information

SOLUTION TRACK Finding the Needle in a Big Data Innovator & Problem Solver Cloudera

SOLUTION TRACK Finding the Needle in a Big Data Innovator & Problem Solver Cloudera SOLUTION TRACK Finding the Needle in a Big Data Haystack @EvaAndreasson, Innovator & Problem Solver Cloudera Agenda Problem (Solving) Apache Solr + Apache Hadoop et al Real-world examples Q&A Problem Solving

More information

Data Governance Industrial Internet & Big Data

Data Governance Industrial Internet & Big Data Data Governance Kari Hiekkanen 29.3.2018 CS-E5340 Introduction to Industrial Internet Industrial Internet & Big Data (IDC Data Age 2025, April 2017) 1 Industrial Internet & Big Data (Statista, 2017) Data

More information

Microsoft certified solutions associate

Microsoft certified solutions associate Microsoft certified solutions associate MCSA: BI Reporting This certification demonstrates your expertise in analyzing data with both Power BI and Excel. Exam 70-778/Course 20778 Analyzing and Visualizing

More information

Creating a Recommender System. An Elasticsearch & Apache Spark approach

Creating a Recommender System. An Elasticsearch & Apache Spark approach Creating a Recommender System An Elasticsearch & Apache Spark approach My Profile SKILLS Álvaro Santos Andrés Big Data & Analytics Solution Architect in Ericsson with more than 12 years of experience focused

More information

Good analytics needs good data and that needs good metadata

Good analytics needs good data and that needs good metadata Good analytics needs good data and that needs good metadata 28 th February 2018 Mandy Chessell CBE FREng CEng FBCS Distinguished Engineer, Master Inventor Analytics Chief Data Office mandy_chessell@uk.ibm.com

More information

Selling Improved Testing

Selling Improved Testing Selling Improved Testing Reducing Customer Pain Technology Advance Partners Technology Advance Partners is a software services consulting firm with deep experience in Information Technology management,

More information

FEATURES BENEFITS SUPPORTED PLATFORMS. Reduce costs associated with testing data projects. Expedite time to market

FEATURES BENEFITS SUPPORTED PLATFORMS. Reduce costs associated with testing data projects. Expedite time to market E TL VALIDATOR DATA SHEET FEATURES BENEFITS SUPPORTED PLATFORMS ETL Testing Automation Data Quality Testing Flat File Testing Big Data Testing Data Integration Testing Wizard Based Test Creation No Custom

More information

USERS CONFERENCE Copyright 2016 OSIsoft, LLC

USERS CONFERENCE Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Matt Ziegler, OSIsoft Complexity Problem Complexity Drives the Need for Integrators Disparate assets or interacting one-by-one Monitoring Real-time

More information

Hybrid Data Platform

Hybrid Data Platform UniConnect-Powered Data Aggregation Across Enterprise Data Warehouses and Big Data Storage Platforms A Percipient Technology White Paper Author: Ai Meun Lim Chief Product Officer Updated Aug 2017 2017,

More information

Transform to Your Cloud

Transform to Your Cloud Transform to Your Cloud Presented by VMware 2012 VMware Inc. All rights reserved Agenda Corporate Overview Cloud Infrastructure & Management Cloud Application Platform End User Computing The Journey to

More information

& Cross-Channel Customer Engagement RFP Guide

& Cross-Channel Customer Engagement RFP Guide Email & Cross-Channel Customer Engagement RFP Guide Customer Engagement in a Perpetually Connected World Today s perpetually connected customer is interacting with your brand through digital, mobile &

More information

How Insurers are Realising the Promise of Big Data

How Insurers are Realising the Promise of Big Data How Insurers are Realising the Promise of Big Data Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies

More information

Metadata and the Rise of Big Data Governance: Active Open Source Initiatives. October 23, 2018

Metadata and the Rise of Big Data Governance: Active Open Source Initiatives. October 23, 2018 Metadata and the Rise of Big Data Governance: Active Open Source Initiatives October 23, 2018 Today s speakers John Mertic, Director of Program Management, Linux Foundation David Radley, ODPi Egeria maintainer,

More information

Applying Auto-Data Classification Techniques for Large Data Sets

Applying Auto-Data Classification Techniques for Large Data Sets SESSION ID: PDAC-W02 Applying Auto-Data Classification Techniques for Large Data Sets Anchit Arora Program Manager InfoSec, Cisco The proliferation of data and increase in complexity 1995 2006 2014 2020

More information

Powering Transformation With Cisco

Powering Transformation With Cisco Shape Your Business For the Future: Powering Transformation With Cisco Enabling Data Center Evolution Towards Cloud Computing Yudi Wiradarma TSO Lead, PT NetApp Indonesia Agenda The Challenge Cloud infrastructure

More information

Big data easily, efficiently, affordably. UniConnect 2.1

Big data easily, efficiently, affordably. UniConnect 2.1 Connecting Data. Delivering Intelligence Big data easily, efficiently, affordably UniConnect 2.1 The UniConnect platform is designed to unify data in a highly scalable and seamless manner, by building

More information

Top 7 Data API Headaches (and How to Handle Them) Jeff Reser Data Connectivity & Integration Progress Software

Top 7 Data API Headaches (and How to Handle Them) Jeff Reser Data Connectivity & Integration Progress Software Top 7 Data API Headaches (and How to Handle Them) Jeff Reser Data Connectivity & Integration Progress Software jreser@progress.com Agenda Data Variety (Cloud and Enterprise) ABL ODBC Bridge Using Progress

More information

Renovating your storage infrastructure for Cloud era

Renovating your storage infrastructure for Cloud era Renovating your storage infrastructure for Cloud era Nguyen Phuc Cuong Software Defined Storage Country Sales Leader Copyright IBM Corporation 2016 2 Business SLAs Challenging Traditional Storage Approaches

More information

Optimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower

Optimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower Optimizing Your Analytics Life Cycle with SAS & Teradata Rick Lower 1 Agenda The Analytic Life Cycle Common Problems SAS & Teradata solutions Analytical Life Cycle Exploration Explore All Your Data Preparation

More information

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP 07.29.2015 LANDING STAGING DW Let s start with something basic Is Data Lake a new concept? What is the closest we can

More information

IBM Software IBM InfoSphere Information Server for Data Quality

IBM Software IBM InfoSphere Information Server for Data Quality IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence

More information

Please give me your feedback

Please give me your feedback #HPEDiscover Please give me your feedback Session ID: B4385 Speaker: Aaron Spurlock Use the mobile app to complete a session survey 1. Access My schedule 2. Click on the session detail page 3. Scroll down

More information

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015 Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document

More information

ODPi and Data Governance Free Your MetaData! October 10, 2018

ODPi and Data Governance Free Your MetaData! October 10, 2018 ODPi and Data Governance Free Your MetaData! October 10, 2018 Today s reality @ODPiOrg Imagine An enterprise data catalogue that lists all of your data, where it is located, its origin (lineage), owner,

More information

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours Advanced Solutions of Microsoft SharePoint Server 2013 Course 20332 36 Contact Hours Course Overview This course examines how to plan, configure, and manage a Microsoft SharePoint Server 2013 environment.

More information

Active Archive and the State of the Industry

Active Archive and the State of the Industry Active Archive and the State of the Industry Taking Data Archiving to the Next Level Abstract This report describes the state of the active archive market. New Applications Fuel Digital Archive Market

More information

CHARTING THE FUTURE OF SOFTWARE DEFINED NETWORKING

CHARTING THE FUTURE OF SOFTWARE DEFINED NETWORKING www.hcltech.com CHARTING THE FUTURE OF SOFTWARE DEFINED NETWORKING Why Next-Gen Networks? The rapid and large scale adoption of new age disruptive digital technologies has resulted in astronomical growth

More information

Sentara s Digital Vision for the Future: A DevSecOps Story

Sentara s Digital Vision for the Future: A DevSecOps Story FEBRUARY 11, 2019 ORLANDO, FL Sentara s Digital Vision for the Future: A DevSecOps Story Natalie Kaszubowski, VP Software Development Jeff Thomas, VP & CTO Dan Bowden, VP & CISO www.himssconference.org

More information

Advanced Solutions of Microsoft SharePoint 2013

Advanced Solutions of Microsoft SharePoint 2013 Course 20332A :Advanced Solutions of Microsoft SharePoint 2013 Page 1 of 9 Advanced Solutions of Microsoft SharePoint 2013 Course 20332A: 4 days; Instructor-Led About the Course This four-day course examines

More information

Smarts Application Discovery Manager 5.0: Accelerating Server/Data Center Consolidations, Application Migrations, and CMDB Projects

Smarts Application Discovery Manager 5.0: Accelerating Server/Data Center Consolidations, Application Migrations, and CMDB Projects Smarts Application Discovery Manager 5.0: Accelerating Server/ Center Consolidations, Application Migrations, and CMDB Projects Glenn O Donnell EMC Corporation 1 Smarts Application Discovery Manager Application

More information

Data Vault Brisbane User Group

Data Vault Brisbane User Group Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples

More information

Advanced Solutions of Microsoft SharePoint Server 2013

Advanced Solutions of Microsoft SharePoint Server 2013 Course Duration: 4 Days + 1 day Self Study Course Pre-requisites: Before attending this course, students must have: Completed Course 20331: Core Solutions of Microsoft SharePoint Server 2013, successful

More information

Be a VDI hero with Nutanix

Be a VDI hero with Nutanix Be a VDI hero with Nutanix Francis O Haire Director, Technology & Strategy DataSolutions Evolution of Enterprise Infrastructure 1990s Today App App App App Virtualization Server Server Server Server Scale-Out

More information

EMEA USERS CONFERENCE BERLIN, GERMANY. Copyright 2016 OSIsoft, LLC

EMEA USERS CONFERENCE BERLIN, GERMANY. Copyright 2016 OSIsoft, LLC Bridge IT and OT with a process data warehouse Presented by Franco Camba, OSIsoft Matt Ziegler, OSIsoft Frank Ruland, SAP Audience Poll Have you invested or are you looking into Business Intelligence tools?

More information