Oracle R Technologies

Similar documents
Introducing Oracle Machine Learning

Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016

Introducing Oracle R Enterprise 1.4 -

Oracle Big Data Science

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT.

Modern and Fast: A New Wave of Database and Java in the Cloud. Joost Pronk Van Hoogeveen Lead Product Manager, Oracle

Oracle Secure Backup 12.2 What s New. Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Oracle Big Data Science IOUG Collaborate 16

FastR: Status and Outlook

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Performance Innovations with Oracle Database In-Memory

DNS Level 100. Rohit Rahi November Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Oracle Big Data Connectors

Oracle Database 18c and Autonomous Database

Javaentwicklung in der Oracle Cloud

Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud

DATA INTEGRATION PLATFORM CLOUD. Experience Powerful Data Integration in the Cloud

Verarbeitung von Vektor- und Rasterdaten auf der Hadoop Plattform DOAG Spatial and Geodata Day 2016

CONTAINER CLOUD SERVICE. Managing Containers Easily on Oracle Public Cloud

Continuous delivery of Java applications. Marek Kratky Principal Sales Consultant Oracle Cloud Platform. May, 2016

DBAs can use Oracle Application Express? Why?

Big Data with Hadoop Ecosystem

Safe Harbor Statement

Stages of Data Processing

Automating Information Lifecycle Management with

Oracle Application Express fast = true

Oracle Enterprise Manager 12c IBM DB2 Database Plug-in

Power your cloud infrastructure with Oracle VM and Cisco!

COMPUTE CLOUD SERVICE. Moving to SPARC in the Oracle Cloud

MySQL Cluster Web Scalability, % Availability. Andrew

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

Oracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data

Apache Spark is a fast and general-purpose engine for large-scale data processing Spark aims at achieving the following goals in the Big data context

Application Container Cloud

Safe Harbor Statement

Do-It-Yourself 1. Oracle Big Data Appliance 2X Faster than

NOSQL DATABASE CLOUD SERVICE. Flexible Data Models. Zero Administration. Automatic Scaling.

Latest from the Lab: What's New Machine Learning Sam Buhler - Machine Learning Product/Offering Manager

Autonomous Data Warehouse in the Cloud

Oracle Solaris Virtualization: From DevOps to Enterprise

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap

Deploying, Managing and Reusing R Models in an Enterprise Environment

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You

Oracle Big Data Discovery

Oracle GoldenGate for Big Data

Survey of Oracle Database

Taking R to New Heights for Scalability and Performance

<Insert Picture Here> Virtualisierung mit Oracle VirtualBox und Oracle Solaris Containern

<Insert Picture Here>

Deploying Spatial Applications in Oracle Public Cloud

MySQL CLOUD SERVICE. Propel Innovation and Time-to-Market

How to Troubleshoot Databases and Exadata Using Oracle Log Analytics

Accelerate Big Data Insights

2/26/2017. Originally developed at the University of California - Berkeley's AMPLab

Getting Started with Advanced Analytics in Finance, Marketing, and Operations

EsgynDB Enterprise 2.0 Platform Reference Architecture

Copyright 2014 Oracle and/or its affiliates. All rights reserved.

Oracle Enterprise Manager 12c Sybase ASE Database Plug-in

Innovatus Technologies

SQLDevTipsTricks. Jeff Smith Senior Principal Product Database Tools, Oracle Corp

@Pentaho #BigDataWebSeries

File Storage Level 100

Session 7: Oracle R Enterprise OAAgraph Package

Truffle A language implementation framework

IBM DB2 Analytics Accelerator Trends and Directions

Taming Structured And Unstructured Data With SAP HANA Running On VCE Vblock Systems

Database Level 100. Rohit Rahi November Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Oracle Java SE Advanced for ISVs

Performance and Load Testing R12 With Oracle Applications Test Suite

Netezza The Analytics Appliance

Oracle Linux, Virtualization & OEM12 Discussion Sahil Mahajan / Sundeep Dhall

Oracle Big Data Fundamentals Ed 1

Oracle R Enterprise Platform and Configuration Requirements Oracle R Enterprise runs on 64-bit platforms only.

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

Open And Linked Data Oracle proposition Subtitle

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Evolving To The Big Data Warehouse

Connecting your Microservices and Cloud Services with Oracle Integration CON7348

August Oracle - GoldenGate Statement of Direction

Security and Performance advances with Oracle Big Data SQL

Oracle Exadata Statement of Direction NOVEMBER 2017

Oracle R Technologies Overview

Understanding Oracle RAC ( ) Internals: The Cache Fusion Edition

Oracle Database 11g for Data Warehousing & Big Data: Strategy, Roadmap Jean-Pierre Dijcks, Hermann Baer Oracle Redwood City, CA, USA

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018

Information empowerment for your evolving data ecosystem

What is Gluent? The Gluent Data Platform

What s New with Oracle Database 12c on Windows: On-Premises and in the Cloud

Veritas NetBackup and Oracle Cloud Infrastructure Object Storage ORACLE HOW TO GUIDE FEBRUARY 2018

Copyright 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12

Big Data Hadoop Stack

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data

MySQL HA Solutions Selecting the best approach to protect access to your data

MOHA: Many-Task Computing Framework on Hadoop

Microsoft Big Data and Hadoop

IBM Data Science Experience White paper. SparkR. Transforming R into a tool for big data analytics

Oracle Machine Learning Notebook

<Insert Picture Here> Introduction to Big Data Technology

Activator Library. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success.

Transcription:

Oracle R Technologies R for the Enterprise Mark Hornick, Director, Oracle Advanced Analytics @MarkHornick mark.hornick@oracle.com

Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle. 3

What technologies does Oracle provide? Oracle R Distribution ROracle Oracle R Enterprise Oracle R Advanced Analytics for Hadoop 4

Oracle R Distribution Ability to dynamically load Intel Math Kernel Library AMD Core Math Library Solaris Sun Performance Library Oracle Support An Oracle-Supported Redistribution of Open Source R Enhanced linear algebra performance via dynamically loaded libraries Improve scalability at client and database for embedded R execution Enterprise support for customers of Oracle Advanced Analytics option, Big Data Appliance, and Oracle Linux Free download Oracle contributes bug fixes and enhancements to open source R

ROracle R package enabling scalable and performant connectivity to Oracle Database Open source, publicly available on CRAN Oracle is maintainer Oracle Database Interface (DBI) for R Re-implemented and optimized driver based on OCI Execute SQL statements from R interface Enables transactional behavior for insert, update, and delete Oracle Database ROracle

Oracle R Enterprise Oracle Advanced Analytics Option to Oracle Database Eliminate memory constraint of client R engine Minimize or eliminate data movement latency Leverage Oracle Database as HPC environment Execute R scripts through database server machine for scalability and performance Leverage parallel, distributed in-database data mining algorithms Execute and manage R scripts via SQL Operationalize R scripts in production applications eliminate porting R code Avoid reinventing code to integrate R results into existing applications Client R Engine ORE packages Oracle Database User tables In-db stats SQL Interfaces SQL*Plus, SQLDeveloper, Database Server Machine 7

HCache Hadoop Abstraction Layer Oracle R Advanced Analytics for Hadoop ORD R Client R script {CRAN packages} Hadoop Job Mapper Reducer R HDFS R MapReduce R Hive R sqoop/olh Hadoop Cluster MapReduce Nodes {CRAN packages} HDFS Nodes ORD Transparent access to Hadoop Cluster from R Manipulate data in HDFS, Hive, database, and file system Write and execute MapReduce jobs with R Leverage CRAN R packages to work on HDFS-resident data Prepackaged parallel, distributed algorithms Oracle Database

Analytics Pain Points for example It takes too long to get my data or to get the right data I can t analyze or mine all of my data it has to be sampled Putting R models and results into production is ad hoc and complex Recoding R models into SQL, C, or Java takes time and is error prone Our company is concerned about data security, backup and recovery We need to build 10s of thousands of models fast to meet business objectives 9

Oracle Strategy 1. Provide choice of data management infrastructure for analytics Bring algorithms to data to eliminate data movement, but don t dictate data management infrastructure Achieve scalability and parallelism using open source R environment as interface 2. Exploit evolving technology trends for reduced time to insight Distributed memory & distributed computation for fast terabyte-scale analytics Use as much data as business problem requires for a quality solution avoid compromise due to tool limits Solve real-world problems with infrastructure on demand with high perf, in-mem, parallel distributed algorithms 3. Enable agility across enterprise user types One size doesn t fit all GUI users, data scientists, and application developers have different needs Deployment simplicity and speed is critical for all analytics 4. Leverage and contribute to open source Facilitate deploying open source packages in production Enable open source algorithms to work close to where data exists 10

Research Prototype: FastR New implementation of R in Java Uses the new Truffle interpreter framework and Graal optimizing compiler in conjunction with the HotSpot JVM for high performance, scalability and portability Dynamically compiles, adaptively optimizes and deoptimizes at run time Joint effort: Oracle Labs (Germany, USA, Austria), JKU Linz (Austria), Purdue University (USA), TU Dortmund (Germany) Open-source project GPLv2 https://bitbucket.org/allr/fastr 11

To Learn More about Oracle s R Technologies http://oracle.com/goto/r 12

Oracle Confidential Internal/Restricted/Highly Restricted 15