Decision models for the Digital Economy

Similar documents
VOLTDB + HP VERTICA. page

Demystifying Machine Learning

Making Sense of Artificial Intelligence: A Practical Guide

OPERATIONALIZING MACHINE LEARNING USING GPU ACCELERATED, IN-DATABASE ANALYTICS

IOT AND THE DATA-DRIVEN ENTERPRISE:

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

What's New in MATLAB for Engineering Data Analytics?

Machine Learning on VMware vsphere with NVIDIA GPUs

Based on Big Data: Hype or Hallelujah? by Elena Baralis

Machine Learning with Python

Refactoring Earthquake-Tsunami Causality and Messaging via Big Data Analytics: The Transformative Potential of Credible Tweets

Oracle Machine Learning Notebook

SQL Server in Azure. Marek Chmel. Microsoft MVP: Data Platform Microsoft MCSE: Data Management & Analytics Certified Ethical Hacker

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Your Speakers. Iwan e1 Rahabok Linkedin.com/in/e1ang

Flash Storage Complementing a Data Lake for Real-Time Insight

Self Programming Networks

TCO REPORT. NAS File Tiering. Economic advantages of enterprise file management

Oracle Real Application Clusters (RAC) 12c Release 2 What s Next?

vcard Global Compensation Plan

Contents PART I: CLOUD, BIG DATA, AND COGNITIVE COMPUTING 1

Oracle CoreTech Update OASC Opening 17. November 2014

Build a system health check for Db2 using IBM Machine Learning for z/os

Storage Class Memory in Scalable Cognitive Systems

Challenges for Data Driven Systems

Oracle NoSQL Database and Cisco- Collaboration that produces results. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Faculté Polytechnique

IBM TS4300 with IBM Spectrum Storage - The Perfect Match -

Searching for Meaning in the Era of Big Data and IoT

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

BI4Dynamics NAV Installation Requirements

Predictive Analytics: Demystifying Current and Emerging Methodologies. Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA

Wearable Technology Orientation Using Big Data Analytics for Improving Quality of Human Life

Oracle Machine Learning Notebook

Web Analytics Key Metrics and KPIs Version 1.0

ODL based AI/ML for Networks Prem Sankar Gopannan, Ericsson YuLing Chen, Cisco

ImageNet Classification with Deep Convolutional Neural Networks

Real-Time & Big Data GIS: Best Practices. Suzanne Foss Josh Joyner

Adapted from: TRENDS AND ATTRIBUTES OF HORIZONTAL AND VERTICAL COMPUTING ARCHITECTURES

Real-Time & Big Data GIS: Best Practices. Josh Joyner Adam Mollenkopf

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

SAP HANA as an Accelerator for PLM Processes HANA Basics and Scenarios

Tutorial on Machine Learning Tools

My Other Car is a Redis. Etan Grundstein & Sasha Popov DYNAMIC YIELD

Designing dashboards for performance. Reference deck

Overview. Data Mining for Business Intelligence. Shmueli, Patel & Bruce

Massive Scalability With InterSystems IRIS Data Platform

BI4Dynamics NAV Installation Requirements

BIG DATA TESTING: A UNIFIED VIEW

Opportunities and challenges in personalization of online hotel search

Network Monitoring & Detection. Fast Data: Real Time. Customer Service Department. NTT Communications Corp.

Real-life technical decision points in using cloud & container technology:

Data Integrity in Stateful Services. Velocity, China, 2016

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Safe Harbor Statement

Exploiting the OpenPOWER Platform for Big Data Analytics and Cognitive. Rajesh Bordawekar and Ruchir Puri IBM T. J. Watson Research Center

2010 Publishing Site Upgrade to SharePoint 2013

BIG DATA: Data Analytics with MATLAB Christophe POUILLOT Senior Consultant MathWorks

Conceptual Modeling on Tencent s Distributed Database Systems. Pan Anqun, Wang Xiaoyu, Li Haixiang Tencent Inc.

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

IBM Leading High Performance Computing and Deep Learning Technologies

Big Data on AWS. Big Data Agility and Performance Delivered in the Cloud. 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Data Mining: Models and Methods

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

TELSTRA CLOUD SERVICES CLOUD INFRASTRUCTURE PRICING GUIDE UNITED KINGDOM

Graph Analytics and Machine Learning A Great Combination Mark Hornick

Introduction to Data Mining and Data Analytics

Data Integrity in Stateful Services. Percona Live, Santa Clara, 2017

Data Driven Networks

Big Data Security Internal Threat Detection. The Critical Role of Machine Learning.

Analytics Fundamentals by Mark Peco

Intelligent Edge Computing and ML-based Traffic Classifier. Kwihoon Kim, Minsuk Kim (ETRI) April 25.

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

Accelerate Big Data Insights

C. The system is equally reliable for classifying any one of the eight logo types 78% of the time.

Data Analytics at Logitech Snowflake + Tableau = #Winning

Persistent Memory Productization driven by AI & ML. Danny Sabour VP Marketing, Avalanche Technology

Cloud Monitoring as a Service. Built On Machine Learning

Integrate MATLAB Analytics into Enterprise Applications

How to Troubleshoot Databases and Exadata Using Oracle Log Analytics

HOW TO BUILD A MODERN AI

data-based banking customer analytics

Data Engineering for Data Science

Big Data The end of Data Warehousing?

Big Data on AWS. Peter-Mark Verwoerd Solutions Architect

Is your IT Infrastructure Ready for Machine Learning & Artificial Intelligence?

Use cases and requirements of network intelligence. Yong-Geun Hong (ETRI) January

Infrastructure as a Service (IaaS) Compute with Storage and Backup PRICING DOCUMENT

API, DEVOPS & MICROSERVICES

Enable IoT Solutions using Azure

Why Scale Matters & How the Cloud Really is Different

Orchestrating an OpenStack* based IoT Smart Home

The Root Cause of Unstructured Data Problems is Not What You Think

+ 11.5% % + 7.3%

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

Oracle and Tangosol Acquisition Announcement

OBIEE Performance Improvement Tips and Techniques

Accelerate AI with Cisco Computing Solutions

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Transcription:

Decision Camp 2017 Birbeck, University of London Decision models for the Digital Economy Vijay Bandekar InteliOps Inc.

Agenda Problem Statement Proposed Solution Case studies and results Key takeaways Discussions 6/16/2017 Copyright InteliOps, Inc. 2017 2

Digital Economy and Data Explosion 45 However 90% data goes unused 40 Zb of data by 2020, 44X in 11 Years 26 Bn IoT Devices by 2020 40 35 30 25 20 15 10 5 0 2011 2013 2015 2017 2019 2021 Velocity Latency Mission Critical Real-Time Decisions In-Memory High Performance DB DWH Big Data Platform Variety Real-Time Recommendations Big Data Analysis Cloud Storage Volume Capacity Digital economy demands autonomous, self-learning, real-time decision making systems that sense, comprehend and act. Decisions must adapt to changing business environments. Online Video / Photo Sharing Cloud/Cold Storage 6/16/2017 Copyright InteliOps, Inc. 2017 3

Decision Models: Inference, Reasoning and Deductions What are decision models? How do they work? Available technologies? Business Logic Templates. Workflows, rules engine, Rete, logic programs, SQL Rules development and maintenance is cumbersome. Machine learning can generate rules/models, however, they must be manually incorporated into a business rules engines (BRE). 6/16/2017 Copyright InteliOps, Inc. 2017 4

Reasoning and Inference Data driven productions - Rete Underutilized. Implementations Performance varies. Architectures Complex and Inefficient. Environments Cumbersome tools. 6/16/2017 Copyright InteliOps, Inc. 2017 5

Architectures: Stateless v/s State-full Stateless Working Memory DB 1. Create knowledge session Rete 2. Re-create state 3. Insert new facts 4. Call fireall 5. Persist state to DB 6. Dispose knowledge session State-full Working Memory 1. Create knowledge session Rete 2. Insert new facts 3. Call fireall 4. Go to 2 Response time = time to create knowledge session + time to re-create state + time to insert all facts + time to fire all rules + time to persist state to the DB + time to dispose session Response time = time to create knowledge session + time to insert all facts + time to fire all rules 6/16/2017 Copyright InteliOps, Inc. 2017 6

Working Memory: Agenda Control & Transparency 6/16/2017 Copyright InteliOps, Inc. 2017 7

Learning models: Machine Learning and Cognition Supervised and Unsupervised Learning Store-first-process-later does not work. The traditional big-data analytics is too slow, and requires custom integration with decision engines. Deep-learning is not the answer to all ML problems. 6/16/2017 Copyright InteliOps, Inc. 2017 8

Solution: Adaptive Decisions Framework Reasoning and Inference Continuous Learning Data driven productions in a state-full knowledge session Rule engines Intelligent agents Deductions and inferences Data patterns and events Comprehen d Sense Act Incremental model building Statistical classification Decision tree Clusters Regression Neural networks Resilient back propagation Recurrent Convolutional 6/16/2017 Copyright InteliOps, Inc. 2017 9

Incremental Bayesian Learner Rules manage learning categories Incrementally add sample records Real-time update of probabilities Learner operates in dual mode Continuous learning Inference/Prediction 6/16/2017 Copyright InteliOps, Inc. 2012 10

Learned decision tree transformed into Rete 6/16/2017 Copyright InteliOps, Inc. 2017 11

Models injected into Rete 6/16/2017 Copyright InteliOps, Inc. 2017 12

Case Study 1: Media Purchase Decisions CHANNEL PERFORMANCE < << 2016 Week 15 >> > Top 5 Bottom 5 Rank Channel Aggregate Weight Rank Channel Aggregate Weight 1 EMAIL 0.40 25 DISPLAY 0.010 2 INTERNAL 0.33 24 SOCIAL NETWORKS 0.010 3 NATURAL SEARCH 0.29 23 AFFILIATE 0.015 4 PAID SEARCH 0.21 22 DIRECT 0.017 5 REFERRING DOMAINS 0.14 21 PLA 0.019 C. 1. Channel attribution (A) 2. Channel combination probabilities (B) 3. Channel-Brand combination (C) A. probability of a channel producing revenue = revenue attributed / total revenue for the period B. 6/16/2017 Copyright InteliOps, Inc. 2017 13

Statistics on KPIs and categories Frequency distributions LOW MEDIUM HIGH LOW MEDIUM HIGH LOW MEDIUM HIGH Depending on the distribution, mapping to categorical values will utilize mean and median. Adaptive Decision Framework provides Statistics template to automate this normalization process. 6/16/2017 Copyright InteliOps, Inc. 2017 14

Results: Single retailer daily process Daily Data Volume per Retailer Facts count Sales Omniture tracking Rules count Rules fired Initial Final Response Time 30,184 80,000 20 373,668 115,704 127,264 793 Run State-full Stateless % Improvement 1 140 360 61% 2 166 360 54% 3 66 360 82% 4 113 360 69% 5 138 360 62% 6 79 360 78% 7 68 360 81% 8 64 360 82% 9 71 360 80% 10 62 360 83% 96.7 360 73% 6/16/2017 Copyright InteliOps, Inc. 2017 15

Case Study 2: MBS Performance Prediction Predict individual loan performance 25 M residential mortgages acquired over 10 year 1.1 B monthly performance records over 10 year Cluster loans and form MBS pools Interest rate distance Origination distance Simulate MBS pool performance Random loan pools based on clusters Monte Carlo simulation of MBS yield 6/16/2017 Copyright InteliOps, Inc. 2017 16

Performance Benchmark: 2 Core 24 GB RAM VM with 100 GB hard disk. 45 Quarters of loan acquisition and performance data since Q1 2000: Stream loan data form published web site, and Insert facts into state-full working memory 4.5 hours. (Stateless: 16.75 hours estimated) Add sample records to learning category: 1.35 minutes/million records. Training time: 6 hours. (Stateless: 22.3 hours estimated) 6/16/2017 Copyright InteliOps, Inc. 2017 17

Key takeaway State-full working memory enables real-time decisions. Combining decision engine with ML facilitates adaptive/autonomous behavior. 6/16/2017 Copyright InteliOps, Inc. 2012 18

Conclusion Architecture using state-full working memory wins Continuous learning within decision engine enables adaptive behavior Performance of adaptive decision framework meets the needs of Digital Economy. Volume Velocity 6/16/2017 Copyright InteliOps, Inc. 2012 19