The Rules of Subsurface Analytics Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 4 October 2017

Similar documents
How Data Management can put the Science into Data Science. Dr Duncan Irving, Lead Consultant Oil & Gas Digital Energy Journal event, KLCC 2016

The future of Subsurface Data Management? Building a Data Science Lab Data Lake Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 3

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

Virtuoso Infotech Pvt. Ltd.

Provisioning Data Science in E&P Dr Duncan Irving

DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK)

Embedding Big Data in Risk Management. David Socha. Practice Partner, Industrial IoT Group, Teradata International

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

Six Weeks to Security Operations The AMP Story. Mike Byrne Cyber Security AMP

PREPARE FOR TAKE OFF. Accelerate your organisation s journey to the Cloud.

About Us. Services CONSULTING OUTSOURCING TRAINING MENTORING STAFF AUGMENTATION 9/9/2016

TDWI Data Governance Fundamentals: Managing Data as an Asset

Data Governance: Are Governance Models Keeping Up?

Managing The Digital Network Workforce Transformation

Upstream Data Management in the 2020s: New uses for old skills in the world of Big Data. Dr Duncan Irving DEJ Aberdeen September 29 th, 2015

Data Management Glossary

Importance of the Data Management process in setting up the GDPR within a company CREOBIS

E&P Data Management Challenges

Increasing the ROI of your Data Lake. Dave Camden DEJ London, 19-Nov Copyright 2018, Flare Solutions Limited 1

Enterprise Data Architect

Solving the Enterprise Data Dilemma

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

Predictive Insight, Automation and Expertise Drive Added Value for Managed Services

Oracle Big Data Discovery

Data Governance. Mark Plessinger / Julie Evans December /7/2017

PERSPECTIVE. Effective Data Governance. Abstract

DIGITAL INNOVATION HYBRID CLOUD COSTS AGILITY PRODUCTIVITY

Deploying, Managing and Reusing R Models in an Enterprise Environment

ALIGNING CYBERSECURITY AND MISSION PLANNING WITH ADVANCED ANALYTICS AND HUMAN INSIGHT

Transformation in Technology Barbara Duck Chief Information Officer. Investor Day 2018

Database Engineering. Percona Live, Amsterdam, September, 2015

The Data Catalog The Key to Managing Data, Big and Small. April Reeve May

The Emerging Data Lake IT Strategy

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE

DATACENTER SERVICES DATACENTER

Optimize tomorrow today.

2 The IBM Data Governance Unified Process

Cloud Connections SEE Partner Summit Janos Strausz Product Sales Specialist, DC

PROJECT MANAGEMENT (PMGT)

SAFe AGILE TRAINING COURSES

Intelligence for the connected world How European First-Movers Manage IoT Analytics Projects Successfully

Data Mining: Approach Towards The Accuracy Using Teradata!

Informatica Data Quality Product Family

Analytics & Sport Data

3,500. The Developer Division at Microsoft

Building a Data Strategy for a Digital World

New Zealand Government IBM Infrastructure as a Service

Survey Report Industry Survey. Data Governance, Technology & Analytics Trends Q1 2014

How to Become a DATA GOVERNANCE EXPERT

1INSURER DATA MASTERY MODEL. Structure out of Chaos

Organizing for the Cloud

Shane Olivo. Selected UX Project Portfolio. Phone

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE WEBMETHODS. What you can expect from webmethods

Noam Ikar R&DVP. Complex Event Processing and Situational Awareness in the Digital Age

TRANSFORMING WEST MIDLANDS POLICE A BOLD NEW MODEL FOR POLICING

Intro to BI Architecture Warren Sifre

Day One Success for DevSecOps and Automation on Azure

Good analytics needs good data and that needs good metadata

INTELLIGENCE DRIVEN GRC FOR SECURITY

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

Marc Hornbeek DevOps-the-Gray Principal DevOps Consultant, Trace3 Author, DevOps Test Engineering Course The DevOps Institute

SYMANTEC: SECURITY ADVISORY SERVICES. Symantec Security Advisory Services The World Leader in Information Security

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI)

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN

Randy House Vice President of Health Informatics. Saint Luke s Health System. Lynsey McNeal Director Data of Governance. Saint Luke s Health System

Cybersecurity. Securely enabling transformation and change

IT Service. Demystifying ITIL. J. Andrew Atencio Andy. 1. Introduction and History. 2. Why Service Management/ITIL?

The Value of Data Governance for the Data-Driven Enterprise

PULLING OUR SOCS UP VODAFONE GROUP AT RSAC Emma Smith. Andy Talbot. Group Technology Security Director Vodafone Group Plc

Vinnie Saini Cloud Solution Architect Big Data & AI

Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR

T103 PlantPAx System Fundamentals

ADABAS & NATURAL 2050+

Trillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets

NISTCSF Enterprise Training Solutions. By David Nichols & Rick Lemieux December 2018

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

EXPERT SERVICES FOR IoT CYBERSECURITY AND RISK MANAGEMENT. An Insight Cyber White Paper. Copyright Insight Cyber All rights reserved.

How to choose the right Data Governance resources. by First San Francisco Partners

VMware Cloud Operations Management Technology Consulting Services

Implementing a Successful Data Governance Program

SAFe Atlassian Style (Updated version with SAFe 4.5) Whitepapers & Handouts

Enabling Innovation in the Digital Economy

Lambda Architecture for Batch and Stream Processing. October 2018

Lenovo Data Center Group. Define a different future

Lean-Thinking. Re-Defined. Going Beyond Toyota. Alan Shalloway.

SDx and the Future of Infrastructure

Data ownership within governance: getting it right

Selling Improved Testing

STEP Data Governance: At a Glance

A CONFUSED TESTER IN AGILE WORLD

Living With Agility

The Fine Art of Creating A Transformational Cyber Security Strategy

The Data Governance Journey at Principal

LEARNING SOLUTIONS & CERTIFICATE PATHS

WHITE PAPER. Master Data s role in a data-driven organization DRIVEN BY DATA

Building Actionable Data Governance through Data Models & Metadata Donna Burbank Global Data Strategy Ltd.

ITIL Service Lifecycle Strategy

Introduction to Data Science

for Modernization Accelerate Your Modernization Project Faster return on your investment goals

Transcription:

The Rules of Subsurface Analytics Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 4 October 2017

Agenda Why subsurface analytics is different The Rules Rule 1: Right People Rule 2: Right Platform Rule 3: Good Enough Data Management Rule 4: Agile Approach Rule 5: Business Buy-in Recap 2 2016 Teradata

Why Subsurface Analytics is different 3 2016 Teradata

4 1940 s business computing: where it all started for business analytics

But for us, the IT/OT divide happened IT and the business diverge on business critical solutions so IT never get near it beyond providing power and network bandwidth IT failed to keep up with competitive computing developments which become app suites that only subject matter experts understand and manage Business units and teams become fertile ground for point solutions which become silos 5

Subsurface Data Management Culture Custodianship v Stewardship Custodian Controls access Avoids unknown data Transfer data Avoids risk Hates change Acquires knowledge Creates walls Steward Shares data Celebrates variety Enables access Embraces risk Owns change Shares insights Teaches governance 6

Rule 1: Right People 7 2016 Teradata

Right People, plural. And T-shaped. Too many disciplines for any one person to know it all T-shaped people who go wide across many disciplines but deep into their specific domain Need outstanding data management and data engineering skills (and culture) Need platform expertise for sustainability and deployment Need Subject Matter Expertise 8 2016 Teradata

Analytics / data science workflow Ingest Curate Analyse Visualise 2017 Teradata 9 9 footer

Here s one we did earlier in Malaysia Working closely with the customer Subject matter expertise Source system expertise Teradata Subject Matter Expert Data management skills Data platform skills Coding skills Data science skills Frontend/visualisation skills 10

Rule 2: Right Platform 11 2016 Teradata

The problems with existing data stores Knowledge development applications come with import filters for specific file types and specific tasks Data is modelled logically for well-defined (and hence brittle) processes that may not reflect all (or even any!) use cases Only perfect data can be imported into applications or schemas Kerry Blinston, CGG, ECIM 2014 New data types, or new combinations, challenge all of this 12 2014 Teradata

Build a new platform that all disciplines can use If we don t give them a purpose-built platform for analytics, we will be in Desktop/Excel Hell. A platform that Accepts data from any discipline Makes it easy for data scientists to use their tools R, Python etc Provides the right level of governance and data quality Call it a Data Lake, or call it an Analytical Data Platform Just build something where they can get data and ideally write new data back 13 2016 Teradata

Rule 3: Good enough Data Management 14 2016 Teradata

Good Enough Data Management Stewards mentor and support citizen data management Everyone cares about the data and its quality Everyone can do something about it when they find bad data Data governance is a function of data value Good Enough means: Good: don t compromise on quality Enough: don t boil the ocean 15

What should data look like? Don t be a data hoarder! Why not store data at a granularity good enough to extract value? Granular enough Dimensioned (time, space) enough Resample, interpolate, aggregate Profile data so you can get an idea of it quickly 16 2014 Teradata

Difficult file formats (Multi-structured data) Parse out the measurement data Link it through time and space Relate using metadata and master data Form a view on whether a hypothesis is worth developing 17 2014 Teradata

Dealing with unstructured data Text Language Typos Consistency Quality Use simple characterisation tools to understand what is in the data Don t try to build a whole text input and cleansing framework 18 2014 Teradata

Rule 4: Agile Approach 19 2016 Teradata

Agile, Scrum, DevOps, Interactive Visualisation 20 2016 Teradata

What not to do Common resourcesinks: Point solutions Technology projects Waterfalls Brittle data modelling We want ML/AI! xkcd.com 21

Iterate. One project at a time. Deliver value often. Prepare Contextualise and plan Form problem statement Prioritise by impact Communicate analysis plan and responsibilities Review Create concise business story Highlight overall business impact Include assumptions and sources Follow up with business on the actions Execute Build on prior work Validate data Recheck hypotheses Drive insights and recommendations Assimilate Store well-commented SQL Document in wiki Train BU in tool usage Deliver Post code to repository Stakeholder contacts Final presentations 22 2017 Teradata

Agility needs the right mindset Working together Willing to take risks Proactive Speed / Time-to-insight 23 2017 Teradata

Rule 5: Business Buy-in 24 2016 Teradata

Data management to enable business-aligned data science Business Value Data Science Data Management What financial, operational or environmental impact are you delivering? What techniques, functions, workflows and skills are required? What data is required and in what form? 25 2017 Teradata

Business-focused data management Embed data ownership in the business units Engage with business leadership to plan, budget and deliver data-driven initiatives Define and drive data exploitation strategy Understand data value and leverage high value data for business impact Source: Andrew White, Gartner 2017 do we need a Chief Data Officer? 26

Recap 27 2016 Teradata

The 5 Rules all equally important Right People Right Platform Good Enough Data Management Agile Approach Business Buy-in 28 2016 Teradata

Stated another way - Data Management 2.0 Stop doing Brittle data management Silos Disposable data science Keep doing Applying domain expertise High levels of governance Driving data quality Start doing Aligning with business Applying context Data profiling Enriching data Transfer and analysis in Excel Learning Applying critical thinking 29 2017 Teradata

Closing thoughts Trying to plan for the future without a sense of history is like trying to plant cut flowers. Daniel J Boorstin (1914 2004) 12 th Librarian of the US Congress 1975-1987 Jane McConnell Practice Partner O&G, Industrial IoT Group Thing Big Analytics, a Teradata Company Jane.mcconnell@thinkbiganalytics.com +47 98282110 My blog on Forbes My blog on Teradata.com Follow me on Twitter @jane_mcconnell My profile 30

Jane McConnell Practice Partner O&G, Industrial IoT Group Thing Big Analytics, a Teradata Company Jane.mcconnell@thinkbiganalytics.com +47 98282110 My blog on Forbes My blog on Teradata.com Follow me on Twitter @jane_mcconnell My profile 2016 Teradata 31

What is the point of data science?. Data science, also known as data-driven science, is an interdisciplinary field about scientific methods, processes, and systems to extract knowledge or insights from data in various forms, either structured or unstructured, similar to data mining. Wikipedia, 2017 32

Data science workflow Ingest Curate Analyse Visualise 2017 Teradata 3 3 33 footer

Not just machine learning Finding relationships with complex data sets Characterising behaviour and understanding the demographics of data It can be applied to: Data profiling and QC Data preparation Data mining Operational processes Data art 34

How other industries grew an analytics culture Prepare Contextualise and plan Form problem statement Prioritise by impact Communicate analysis plan and responsibilities Review Create concise business story Highlight overall business impact Include assumptions and sources Follow up with business on the actions Execute Build on prior work Validate data Recheck hypotheses Drive insights and recommendations Assimilate Store well-commented SQL Document in wiki Train BU in tool usage Document Post code to repository Stakeholder contacts Final presentations 35 2017 Teradata footer