Deploying, Managing and Reusing R Models in an Enterprise Environment

Similar documents
This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

TIBCO Spotfire Statement of Direction. Spotfire Product Management

TIBCO Analytics Meetup. Michael O Connell and the TIBCO Data Science Team April 25th, 2017

Spotfire: Brisbane Breakfast & Learn. Thursday, 9 November 2017

From Insight to Action: Analytics from Both Sides of the Brain. Vaz Balasingham Director of Solutions Consulting

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Extending R to the Enterprise

TIBCO Spotfire Hybrid Cloud Architecture Deep Dive

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Overview of TIBCO Cloud Integration

Think Small: API Architecture For The Enterprise

Spotfire and Tableau Positioning. Summary

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Putting it all together: Creating a Big Data Analytic Workflow with Spotfire

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager

Introducing Oracle Machine Learning

TIBCO Spotfire Analytics Investments

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

Week 1 Unit 1: Introduction to Data Science

How to Troubleshoot Databases and Exadata Using Oracle Log Analytics

End-to-End data mining feature integration, transformation and selection with Datameer Datameer, Inc. All rights reserved.

Introducing SAS Model Manager 15.1 for SAS Viya

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

From the Source to the Dashboard: SAP Agile Data Warehousing for Self-Service BI

Streaming iphone sensor data to SAS Event Stream Processing

Oracle Big Data Connectors

Intelligence for the connected world How European First-Movers Manage IoT Analytics Projects Successfully

Spotfire and Qlik Sense Positioning. Summary

WEBMETHODS AGILITY FOR THE DIGITAL ENTERPRISE WEBMETHODS. What you can expect from webmethods

Optimizing Data Integration Solutions by Customizing the IBM InfoSphere Information Server Deployment Architecture IBM Redbooks Solution Guide

Informatica Enterprise Information Catalog

Spotfire Advanced Data Services. Lunch & Learn Tuesday, 21 November 2017

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

Qlik Sense Desktop. Data, Discovery, Collaboration in minutes. Qlik Sense Desktop. Qlik Associative Model. Get Started for Free

Hitachi Vantara Overview Pentaho 8.0 and 8.1 Roadmap. Pedro Alves

TABLE OF CONTENTS DOCUMENT HISTORY 3

Introducing Microsoft SQL Server 2016 R Services. Julian Lee Advanced Analytics Lead Global Black Belt Asia Timezone

Oracle API Platform Cloud Service

Capture Business Opportunities from Systems of Record and Systems of Innovation

Integrating MATLAB Analytics into Business-Critical Applications Marta Wilczkowiak Senior Applications Engineer MathWorks

REDUCE TCO AND IMPROVE BUSINESS AND OPERATIONAL EFFICIENCY

Turning Data Science into a reality with TIBCO Spotfire

Oracle Big Data Discovery

Fluentd + MongoDB + Spark = Awesome Sauce

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You

Oracle WebCenter Interaction: Roadmap for BEA AquaLogic User Interaction. Ajay Gandhi Sr. Director of Product Management Enterprise 2.

API, DEVOPS & MICROSERVICES

CONFIDENTLY INTEGRATE VMWARE CLOUD ON AWS WITH INTELLIGENT OPERATIONS

SAP HANA SPS 08 - What s New? SAP HANA Interactive Education - SHINE (Delta from SPS 07 to SPS 08) SAP HANA Product Management May, 2014

Oracle Adapter for Salesforce Lightning. Winter 18. New Feature Summary

Oracle Mobile Hub. Complete Mobile Platform

Data Protection for Virtualized Environments

Optimize Your Databases Using Foglight for Oracle s Performance Investigator

THE USE OF APL IN SIMCORP DIMENSION

TIBCO Complex Event Processing Evaluation Guide

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

SAS Event Stream Processing

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data

Approaching the Petabyte Analytic Database: What I learned

Accelerate critical decisions and optimize network use with distributed computing

The TIBCO Insight Platform Actions with Analytics

Powering Knowledge Discovery. Insights from big data with Linguamatics I2E

Noam Ikar R&DVP. Complex Event Processing and Situational Awareness in the Digital Age

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

WHAT S NEW IN QLIKVIEW 11

INTRODUCTION... 2 FEATURES OF DARWIN... 4 SPECIAL FEATURES OF DARWIN LATEST FEATURES OF DARWIN STRENGTHS & LIMITATIONS OF DARWIN...

Qlik. 10 key elements of a successful data strategy and modern analytics platform. February 2019 Julie Kae Executive Director, Qlik.

WEB-APIs DRIVING DIGITAL INNOVATION

Oracle Enterprise Manager Configuration Management Unleashed: Top 10 Expert Tips

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data

ebook ADVANCED LOAD BALANCING IN THE CLOUD 5 WAYS TO SIMPLIFY THE CHAOS

MicroStrategy Desktop MicroStrategy 10.2: New features overview. microstrategy.com 1

Introduction to the Azure Portal

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1

UGKnowledge. SAP User Groups

Fast Innovation requires Fast IT

Oracle R Technologies

Analytics Fundamentals by Mark Peco

ORACLE DATABASE LIFECYCLE MANAGEMENT PACK

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Frequently Asked Questions Oracle Content Management Integration. An Oracle White Paper June 2007

Scaling MATLAB. for Your Organisation and Beyond. Rory Adams The MathWorks, Inc. 1

Oracle9i Data Mining. Data Sheet August 2002

Integrate MATLAB Analytics into Enterprise Applications

Boost your Analytics with Machine Learning for SQL Nerds. Julie mssqlgirl.com

Luncheon Webinar Series April 25th, Governance for ETL Presented by Beate Porst Sponsored By:

Evaluating Cloud Databases for ecommerce Applications. What you need to grow your ecommerce business

UX402 SAP SAPUI5 Development

IBM DB2 Analytics Accelerator Trends and Directions

Boost your Analytics with ML for SQL Nerds

THE SIX ESSENTIAL CAPABILITIES OF AN ANALYTICS-DRIVEN SIEM

Third generation of Data Virtualization

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

Build a system health check for Db2 using IBM Machine Learning for z/os

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap

What s New in Spotfire DXP 1.1. Spotfire Product Management January 2007

SOLUTION BRIEF NETWORK OPERATIONS AND ANALYTICS. How Can I Predict Network Behavior to Provide for an Exceptional Customer Experience?

Understanding the latent value in all content

Transcription:

Deploying, Managing and Reusing R Models in an Enterprise Environment Making Data Science Accessible to a Wider Audience Lou Bajuk-Yorgan, Sr. Director, Product Management Streaming and Advanced Analytics TIBCO Software

DISCLAIMER During the course of this presentation, TIBCO or its representatives may make forward-looking statements regarding future events, TIBCO s future results or our future financial performance. Although we believe that the expectations reflected in the forward-looking statements contained in this presentation are reasonable, these expectations or any of the forward-looking statements could prove to be incorrect and actual results or financial performance could differ materially from those stated herein. TIBCO could experience factors that could cause actual results or financial performance to differ materially from those contained in any forward-looking statement made in connection with this presentation. TIBCO does not undertake to update any forward-looking statements that may be made from time to time or on its behalf. This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and availability dates for TIBCO products and services. This document is provided for informational purposes only and its contents are subject to change without notice. TIBCO makes no warranties, express or implied, in or relating to this document or any information in it, including, without limitation, that the information is error-free or meets any conditions of merchantability or fitness for a particular purpose. This document may not be reproduced or transmitted in any form or by any means without our prior written permission. The material provided is for informational purposes only, and should not be relied on in making a purchasing decision. The information is not a commitment, promise or legal obligation to deliver any material, code, or functionality. The development, release, and timing of any features or functionality described for our products remains at our sole discretion.

Multiple paths from Data to Decision and Action Data Science helps deliver better decisions faster

Scarcity of Data Science Skills General Population Citizen Data Scientists (Analysts, Engineers, Scientists) Data Scientists Number of users Analytical complexity of task and capability of user

Skeptical about Citizen Data Scientists? Citizen Data Scientist: aspire beyond pretty pictures and simplistic dashboards By 2019, citizen data scientists will surpass data scientists in the amount of advanced analysis produced. By 2020, more than 40% of data science tasks will be automated, resulting in increased productivity and broader usage by citizen data scientists. This is the trend. How do we make sure people have the the right tools, to get the right answers? 5 http://www.gartner.com/newsroom/id/3570917

Where does R fit in? Pros Easy prototyping of new models and analysis Huge array of analytic methods available The best method to solve a given problem is likely available Lots of people learning R in university Cons Performance: Not designed for real time or Big Data applications Hard for non-data Scientist to use directly exacerbates the Data Science skills scarcity, by requiring both coding and Data Science knowledge Challenging to deploy, integrate and manage in enterprise applications Performance, commercial support and Intellectual Property concerns Result: Compromises which impact Agility Recode in a new, less agile environment Rewrite, use specialized R packages to solve one problem better 6

TIBCO Analytics Business User and Citizen Data Scientists Data Discovery - Insight TIBCO Spotfire Data Scientist Analytics - Model TIBCO Statistica and TIBCO Enterprise Runtime for R (TERR) Create and publish R models and scripts to Spotfire Library, with authorship, user access control, etc. INSIGHT Embed R models and scripts in Spotfire visualizations for wider use Deploy to the cloud and web-based applications Numerical TERR is a Models commercially-supported, proprietary engine for the R Analytic language, Apps built for high performance MODEL TERR embedded in TIBCO products for native R scripting Statistica provides model governance: authorship, user access control, version tracking, etc. Developer Real time - Action TIBCO Streambase Call embedded models for real time scoring Model deployment via centralized ACTION service, with authorship, user access control, approval to deployment, etc. Update models in live real-time applications Copyright 2000-2017 TIBCO Software Inc.

FIND AND ACT ON "CRITICAL BUSINESS MOMENTS" Deliver proactive customer service Smart cross-sell offers Predict impending equipment failure Real-Time inventory Management Optimize Pricing Prevent Fraud Optimize Routes Anticipate and handle disruptions Critical business moments occur in every facet of enterprise operations, they drive competitive differentiation, customer satisfaction and business success!

#1. Smart Visual Analytics Recommendation driven insights Visual analytics is like a bicycle for your business mind.

TIBCO Spotfire Visual Analytics Smart Recommendation-driven insights Multiple dynamic perspectives no old school single page Fastest in and out of memory data engine for data big and small Rich, multilayer, accurate maps Threaded, searchable conversations with annotations and bookmarks Easy configured process specific analytic applications Over 40 relational, big data, cloud & proprietary sources 10

#2. Numerical Models Analytic Apps

S Point and Click Data Science Contextual, one click calculations make powerful methods easy to use: descriptive stats, similarity, clustering, correlations, fitting, forecast Unique commercial engine for R language TIBCO Enterprise Runtime for R (TERR) Any statistic can be part of Spotfire visual aggregations or expression language Easily leverage the work of your Data Scientists from R, Statistica, SAS, Matlab, Python Access to Machine Learning, Deep Learning platforms TIBCO Community shares data science components

Embedded TERR in Spotfire Write R code directly in Spotfire; TERR executes locally or on server Manage TERR analytics locally or in Server to reuse across community Deploy TERR-powered applications to the web

Draw layers on Spotfire Maps with R/TERR scripts Spotfire TERR Data Function contourlines(x,y)

Power of Embedded Advanced Analytics

TIBCO Spotfire with H2O Integration Example: Predictive Analytics for Manufacturing ( scrap parts as early as possible )

TIBCO Statistica Analytic Apps Comprehensive Stats and Predictive Analytics Simple UX 1000 s of stats, machine and deep learning, Bayesian methods Algorithm marketplaces Azure ML, Algorithmia, Apervita, H2O Open source R, Python, C#, Spark, H2O, CNTK Deep NN Data Blending any data, anywhere Model & Rule Lifecycle Management Create workspace, manage, version control, deploy, embed Citizen Data Scientists scale best practices with Web UI IoT Analytics device and gateway publish, scoring Security & Governance Repeatable, auditable; GXP validation : audit logs, version control Non-traditional data image & audio; text mining; Network Analytics with OrientDB, in-database analytics 17 Copyright 2000-2017 TIBCO Software Inc.

TIBCO Statistica: Highlights Simple UX for Data Scientist Drag-and-drop UI for model + rule creation and deployment Simplified data preparation, mash-up, and ETL Comprehensive palette of math and analytics Machine learning, deep learning, Bayesian methods Business User Image, audio, text, Graph-db In-db and In-memory algorithms Flexible integration with R, Python, Scala, SAS, C++, C#, Java Model & Rule Management and Deployment Data Scientist Metadata repository for model & rule version control, governance, security and audit trail Model version and rule lineage; champion/challenger Model & rule publish and embed everywhere Publish to TIBCO Streambase for streaming analytics on live data feed IoT applications - publish to edge Developer Copyright 2000-2017 TIBCO Software Inc.

#3. Streaming Analytics automation Continuous algorithmic awareness &

Streaming Analytics with Spotfire and TERR LiveView Dashboard Alerting Real Time Visualizations Spotfire Visualization for context, drill down for root cause analysis

Streaming Analytics Low/no code workflows for accessing, transforming and acting on Real Time Data Visual Powerful Scalable Fast Extensible Score R models in Real Time applications using native TERR node (+PMML, SparkML, H2O, etc.) Deploy models via centralized service, with approval before production deployment 21

community.tibco.com Copyright 2000-2017 TIBCO

Spotfire Wiki community.tibco.com Copyright 2000-2017 TIBCO Software Inc.

Spotfire Machine Learning Community Spotfire (R) Data Functions Machine Learning / Deep Learning Gradient Boosting Random Forests Anomaly Detection: Autoencoder Segmentation Propensity Affinity Non-Linear Regression; Decline Curves Modeling & Simulation Genetic Algorithms Optimization

This document (including, without limitation, any product roadmap or statement Copyright of direction 2000-2017 data) illustrates TIBCO the planned Software testing, Inc. release and availability dates for TIBCO products and services. It is

TIBCO Analytics Business User and Citizen Data Scientists Data Discovery - Insight TIBCO Spotfire Data Scientist Analytics - Model TIBCO Statistica and TIBCO Enterprise Runtime for R (TERR) Create and publish R models and scripts to Spotfire Library, with authorship, user access control, etc. INSIGHT Embed R models and scripts in Spotfire visualizations for wider use Deploy to the cloud and web-based applications Numerical TERR is a Models commercially-supported, proprietary engine for the R Analytic language, Apps built for high performance MODEL TERR embedded in TIBCO products for native R scripting Statistica provides model governance: authorship, user access control, version tracking, etc. Developer Real time - Action TIBCO Streambase Call embedded models for real time scoring Model deployment via centralized ACTION service, with authorship, user access control, approval to deployment, etc. Update models in live real-time applications Copyright 2000-2017 TIBCO Software Inc.

Summary More demand than ever for Data Science, with too few skilled Data Scientists Rise of Citizen Data Scientists, who need the right tools, guidance and frameworks Importance of leveraging the work of Data Scientists R is a key part of the solution, if R models can be managed, deployed, embedded, reused, TIBCO Analytics: Easy to embed/leverage/deploy the work of Data Scientists, from R and beyond In Spotfire Visual Applications, used by business users and Citizen Data Scientists In real time applications, to automate decision making Easier for Data Scientists to create and reuse predictive analytics in Statistica While leveraging the best of open source R, Python, etc. Rich community with examples, reusable assets, etc. While maintaining necessary analytic governance and model management