Oracle Machine Learning Notebook

Oracle Machine Learning Notebook
Included in Autonomous Data Warehouse Cloud
Charlie Berger, MS Engineering, MBA
Sr. Director Product Management, Machine Learning, AI and Cognitive Analytics
charlie.berger@oracle.com
www.twitter.com/charliedatamine

Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle's products remains at the sole discretion of Oracle.

Introducing Oracle Autonomous Data Warehouse Cloud: Value Proposition
Easy
  Provision a data warehouse in as little as 15 seconds
  Automated management of database administration
  Simple "Load and Go" with automated tuning
  Dedicated cloud-ready migration tools, including for Redshift
Fast
  Up to 14x performance advantage over Redshift (1)
  High concurrency supports multi-user access and workloads
  Based on Exadata for extreme performance
Elastic
  Only pay for what you use, with user-defined sizing, on-demand scaling and idle shut-off
  Independent scaling of compute and storage
  Instant scaling with zero downtime

Oracle Autonomous Data Warehouse Cloud: Key Features
Oracle Machine Learning
High-Performance Queries and Concurrent Workloads: Optimized query performance with preconfigured resource profiles for different types of users
Oracle SQL: Autonomous DW Cloud is compatible with all business analytics tools that support Oracle Database
Self Driving: Fully automated database for self-tuning, patching and upgrading itself while the system is running
Highly Elastic: Independently scale compute and storage, without having to overpay for fixed blocks of resources
Built-in Web-Based SQL ML Tool: Apache Zeppelin Oracle Machine Learning notebooks ready to run ML from browser
Database migration utility: Dedicated cloud-ready migration tools for easy migration from Amazon Redshift, SQL Server and other databases
Cloud-Based Data Loading: Fast, scalable data loading from Oracle Object Store, AWS S3, or on-premises
Enterprise Grade Security: Data is encrypted by default in the cloud, as well as in transit and at rest

Architecture for Modern Cloud Data Warehousing
[Architecture diagram. Recoverable labels: Developer Tools; Data Warehouse Services (EDWs, DW, departmental marts and sandboxes); Autonomous Data Warehouse Cloud; Service Management; Built-in Access Tools; Advanced Analytics; Oracle SQL Developer; Data Integration Services; Oracle Exadata Cloud Service; Oracle Database Cloud Service; Service Console; Express Cloud Service; Oracle Machine Learning; Oracle Analytics Cloud; Oracle Data Integration Platform Cloud; Autonomous Database Cloud; 3rd Party DI on Oracle Cloud Compute; Oracle Object Storage Cloud; Flat Files and Staging; 3rd Party DI On-premises]

Introducing: Oracle Machine Learning SQL Notebook Copyright 2017, Oracle and/or its affiliates. All rights reserved.

Oracle Machine Learning: Machine Learning Notebook for Autonomous Data Warehouse Cloud
Key Features
  Collaborative UI for data scientists
  Packaged with Autonomous Data Warehouse Cloud (V1)
  Easy access to shared notebooks, templates, permissions, scheduler, etc.
  SQL ML algorithms API (V1)
  Supports deployment of ML analytics

Oracle Machine Learning and Advanced Analytics: Strategy and Road Map
Support multiple data platforms, analytical engines, languages, UIs and deployment strategies
[Diagram. Recoverable labels: GUI (Data Miner, RStudio); Notebooks; SQL, R, Python, etc.; ML Algorithms (common core, parallel, distributed); Advanced Analytics; Big Data / Big Data Cloud; Relational; Oracle Database Cloud; DWCS]

Oracle's Machine Learning/Advanced Analytics Platforms
Machine Learning Algorithms Embedded in the Data Management Platforms
Analytics Producers: Data Scientists, R Users, Citizen Data Scientists
Analytics Consumers: BI Analysts, Managers, Functional Users (HCM, CRM)
Data Management + Advanced Analytical Platform
  Big Data SQL
  Big Data Cloud Service: Oracle Machine Learning Big Data Cloud, ORAAH. Machine learning algorithms, statistical functions + R integration for scalable, parallel, distributed execution
  Database Cloud: Oracle Machine Learning Database Edition. Machine learning algorithms, statistical functions + R integration for scalable, parallel, distributed, in-DB execution

Oracle's Machine Learning/Advanced Analytics Platforms
Machine Learning Algorithms Embedded in the Data Management Platforms
Analytics Producers: Data Scientists, R Users, Citizen Data Scientists
New Zeppelin notebook-based UI for data scientists collaborating and sharing ML analytical methodologies in Clouds
Data Management + Advanced Analytical Platform
  Big Data SQL
  Big Data Cloud Service: Oracle Machine Learning Big Data Cloud, ORAAH. Machine learning algorithms, statistical functions + R integration for scalable, parallel, distributed execution
  Database Cloud: Oracle Machine Learning Database Edition. Machine learning algorithms, statistical functions + R integration for scalable, parallel, distributed, in-DB execution

Oracle's ML/AA Functionality Copyright 2016, Oracle and/or its affiliates. All rights reserved.

Oracle's Machine Learning/Advanced Analytics
Fastest Way to Deliver Scalable Enterprise-wide Predictive Analytics
Key Features
  Parallel, scalable machine learning algorithms and R integration
  In-Database + Hadoop: don't move the data
  Data analysts, data scientists & developers
  Drag and drop workflow, R and SQL APIs
  Extends data management into powerful advanced/predictive analytics platform
  Enables enterprise predictive analytics deployment + applications

Multiple User Profiles
Oracle's Machine Learning/Advanced Analytics
  Application Developers
  DBAs
  R User, Data Scientist
  Data Analyst, Citizen Data Scientist

Manage and Analyze All Your Data
Data Scientists, R Users, Citizen Data Scientists
Architecturally, Many Options and Flexibility
[Diagram. Recoverable labels: SQL / R; "Boil down the Data Lake"; Big Data SQL / R; Object Store]
Engineered Features: derived attributes that reflect domain knowledge, key to best models, e.g.:
  Counts
  Totals
  Changes over time
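The engineered features named on this slide (counts, totals, changes over time) can be sketched in a few lines. This is a minimal plain-Python illustration, not Oracle's in-database implementation; the transaction records, the feature names and the `engineer_features` helper are all hypothetical.

```python
from collections import defaultdict

# Hypothetical raw transactions: (customer_id, month, amount).
transactions = [
    ("C1", 1, 100.0),
    ("C1", 1, 50.0),
    ("C1", 2, 300.0),
    ("C2", 1, 20.0),
    ("C2", 2, 10.0),
]


def engineer_features(rows):
    """Derive per-customer count, total, and first-to-last monthly change."""
    monthly = defaultdict(lambda: defaultdict(float))  # customer -> month -> total
    counts = defaultdict(int)                          # customer -> number of rows
    for customer, month, amount in rows:
        monthly[customer][month] += amount
        counts[customer] += 1

    features = {}
    for customer, by_month in monthly.items():
        months = sorted(by_month)
        first, last = by_month[months[0]], by_month[months[-1]]
        features[customer] = {
            "txn_count": counts[customer],            # a count
            "total_amount": sum(by_month.values()),   # a total
            "change_first_to_last": last - first,     # a change over time
        }
    return features


features = engineer_features(transactions)
```

Each derived column summarizes many raw rows into one value per customer, which is the shape a model-building table typically needs (one row per case).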

Oracle Advanced Analytics 12.2 Model Build Time Performance (Unofficial)
"Wow! That's Fast!"
Model Build Time (Secs / Degree of Parallelism)
OAA 12.2 Algorithm                   Rows (Ms)  T7-4 (Sparc & Solaris)  X5-4 (Intel and Linux)
Attribute Importance                 640        28s / 512               44s / 72
K-Means Clustering                   640        161s / 256              268s / 144
Expectation Maximization             159        455s / 512              588s / 144
Naive Bayes Classification           320        17s / 256               23s / 72
GLM Classification                   640        154s / 512              363s / 144
GLM Regression                       640        55s / 512               93s / 144
Support Vector Machine (IPM solver)  640        404s / 512              1411s / 144
Support Vector Machine (SGD solver)  640        84s / 256               188s / 72
The way to read these results is that they compare 2 chips: X5 (Intel and Linux) and T7 (Sparc and Solaris). They measure scalability (time in seconds) with increased degree of parallelism (DOP). The data also has high-cardinality categorical columns, which translates into 9K mining attributes (when algorithms require explosion). There are no comparisons to 12.1, and it is fair to say that the 12.1 algorithms could not run on data of this size.
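One way to read the build-time figures is as a ratio of X5-4 time to T7-4 time per algorithm. The sketch below is plain arithmetic on the slide's unofficial numbers; the `time_ratios` helper is hypothetical. Because the two systems ran at different degrees of parallelism, the ratio compares whole-system runs and is not a per-core measure.

```python
# Build times in seconds copied from the slide: algorithm -> (T7-4, X5-4).
build_times = {
    "Attribute Importance": (28, 44),
    "K-Means Clustering": (161, 268),
    "Expectation Maximization": (455, 588),
    "Naive Bayes Classification": (17, 23),
    "GLM Classification": (154, 363),
    "GLM Regression": (55, 93),
    "SVM (IPM solver)": (404, 1411),
    "SVM (SGD solver)": (84, 188),
}


def time_ratios(times):
    """Return X5-4 build time divided by T7-4 build time, rounded to 2 places."""
    return {name: round(x5 / t7, 2) for name, (t7, x5) in times.items()}


ratios = time_ratios(build_times)

# Print the algorithms from largest to smallest ratio.
for name, ratio in sorted(ratios.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:28s} {ratio:5.2f}x")
```

On these figures the largest gap is for the SVM IPM solver and the smallest for Expectation Maximization.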

Fraud Prediction Demo: Automated In-DB Analytical Methodology
-- Clean up any previous settings table and model
drop table CLAIMS_SET;
exec dbms_data_mining.drop_model('claimsmodel');
-- Settings table: choose the SVM algorithm and turn on automatic data preparation
create table CLAIMS_SET (setting_name varchar2(30), setting_value varchar2(4000));
insert into CLAIMS_SET values ('ALGO_NAME','ALGO_SUPPORT_VECTOR_MACHINES');
insert into CLAIMS_SET values ('PREP_AUTO','ON');
commit;
-- Build the model; the target argument is null, so this is a one-class SVM (anomaly detection)
begin
  dbms_data_mining.create_model('claimsmodel', 'CLASSIFICATION', 'CLAIMS', 'POLICYNUMBER', null, 'CLAIMS_SET');
end;
/
-- Top 5 most suspicious fraud policy holder claims
select * from
  (select POLICYNUMBER, round(prob_fraud*100,2) percent_fraud,
          rank() over (order by prob_fraud desc) rnk
   from (select POLICYNUMBER, prediction_probability(claimsmodel, '0' using *) prob_fraud
         from CLAIMS
         where PASTNUMBEROFCLAIMS in ('2to4', 'morethan4')))
where rnk <= 5
order by percent_fraud desc;
Automated Monthly Application! Just add:
Create View CLAIMS2_30 As Select * from CLAIMS2 Where mydate > SYSDATE - 30
Time measure: set timing on;
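The ranking step of the query on this slide (keep repeat claimants, convert the probability to a percentage, return the five highest) can be mirrored outside the database for illustration. This is a plain-Python sketch of the selection logic only. The scores are made up, the category values 'none' and '1' are assumed for contrast, and the `top_suspicious` helper is hypothetical; in the demo the probabilities come from prediction_probability inside the database.

```python
# Hypothetical scored claims: (policy_number, past_number_of_claims, prob_fraud).
# All values are invented for illustration.
scored_claims = [
    (101, "2to4", 0.91),
    (102, "none", 0.99),
    (103, "morethan4", 0.42),
    (104, "2to4", 0.77),
    (105, "morethan4", 0.88),
    (106, "1", 0.95),
    (107, "2to4", 0.65),
    (108, "morethan4", 0.30),
]


def top_suspicious(claims, limit=5):
    """Keep repeat claimants, then return the highest fraud probabilities as percentages."""
    repeat = [c for c in claims if c[1] in ("2to4", "morethan4")]
    ranked = sorted(repeat, key=lambda c: c[2], reverse=True)
    return [(policy, round(prob * 100, 2)) for policy, _, prob in ranked[:limit]]


top5 = top_suspicious(scored_claims)
```

Policies 102 and 106 have the highest scores but are filtered out before ranking, just as the WHERE clause on PASTNUMBEROFCLAIMS does in the SQL. One difference: SQL rank() can return more than five rows when probabilities tie at the cutoff, while this slice always returns at most five.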

Oracle Advanced Analytics: Real-Time Scoring, Predictions and Recommendations
On-the-fly, single-record apply with new data (e.g. from call center)
Select prediction_probability(clas_dt_1_15, 'Yes'
         USING 7800 as bank_funds, 125 as checking_amount, 20 as credit_balance,
               55 as age, 'Married' as marital_status,
               250 as MONEY_MONTLY_OVERDRAWN, 1 as house_ownership)
from dual;
[Diagram. Recoverable labels: Oracle Cloud; Call Center; Social Media; Get Advice; Branch Office; Web; Mobile; Email; R; Likelihood to respond]

Build Predictive Models on an Attribute
Oracle's Machine Learning Accelerates New Possibilities
Machine Learning Model: Function(X1, X2, ..., Xn) -> Y (LTV_BIN); Probability

Build Predictive Models on an Attribute
Oracle's Machine Learning Accelerates New Possibilities
Machine Learning Models: Function(X1, X2, ..., Xn) -> Y2 (BankFunds), Y (LTV_BIN); Probability

The Core Ingredients of Good Machine Learning
[Figure. Recoverable labels: Domain Knowledge + Data; Machine Learning Algorithms; Insights, Predictions; data attributes A1 to A7]

Most Important Factor in Machine Learning? Deployment!!
[Figure. Recoverable labels: data attributes A1 to A7; A Thinking Database; Machine Learning Algorithms]

Community and Resources

www.biwasummit.org www.analyticsanddatasummit.org

Getting Started: Oracle ML/AA Resources & Links
Oracle Advanced Analytics Overview Information
  Oracle's Machine Learning and Advanced Analytics 12.2c and Oracle Data Miner 4.2 New Features preso
  Oracle Advanced Analytics Public Customer References
  Oracle's Machine Learning and Advanced Analytics Data Management Platforms white paper on OTN
  Oracle INTERNAL ONLY OAA Product Management Wiki and Beehive Workspace (contains latest presentations, demos, product, etc. information)
YouTube recorded Oracle Advanced Analytics Presentations and Demos, White Papers
  Oracle's Machine Learning & Advanced Analytics 12.2 & Oracle Data Miner 4.2 New Features YouTube video
  Library of YouTube Movies on Oracle Advanced Analytics, Data Mining, Machine Learning (7+ live demos, e.g. Oracle Data Miner 4.0 New Features, Retail, Fraud, Loyalty, Overview, etc.)
  Overview YouTube video of Oracle's Advanced Analytics and Machine Learning
Getting Started/Training/Tutorials
  Link to OAA/Oracle Data Miner Workflow GUI Online (free) Tutorial Series on OTN
  Link to OAA/Oracle R Enterprise (free) Tutorial Series on OTN
  Link to Try the Oracle Cloud Now!
  Link to Getting Started w/ ODM blog entry
  Link to New OAA/Oracle Data Mining 2-Day Instructor Led Oracle University course
  Oracle Data Mining Sample Code Examples
Additional Resources, Documentation & OTN Discussion Forums
  Oracle Advanced Analytics Option on OTN page
  OAA/Oracle Data Mining on OTN page, ODM Documentation & ODM Blog
  OAA/Oracle R Enterprise page on OTN page, ORE Documentation & ORE Blog
  Oracle SQL based Basic Statistical functions on OTN
  Oracle R Advanced Analytics for Hadoop (ORAAH) on OTN
Analytics and Data Summit: All Analytics, All Data, No Nonsense. March 20-22, 2018, Redwood Shores, CA