Introducing Oracle R Enterprise 1.4 -

Size: px
Start display at page:

Download "Introducing Oracle R Enterprise 1.4 -"

Transcription

1 Hello, and welcome to this online, self-paced lesson entitled Introducing Oracle R Enterprise. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I will be your guide for the next 45 minutes of interactive lectures and review sessions on this lesson. 1

2 2

3 3

4 Introducing Oracle R Enterprise is the first lesson of eight self-study sessions on Oracle R Enterprise. 4

5 In this lesson, you ll learn: What R is, who uses it, and why they use it. Next, we ll examine several common user interfaces for R. Finally, you ll learn about Oracle s strategy for supporting the R community. So, let s start with the first topic: Using R: What, Who, and Why? 5

6 R is a language and environment for statistical computing and graphics. This GNU Project is similar to the S language and environment, which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered a different implementation of S. There are some important differences, but much code written for S runs unaltered under R. R is an open-source language and environment that supports: Statistical computing and data visualization Data manipulations and transformations And sophisticated graphical displays 6

7 With over 2 million R users worldwide, R is increasingly being used as the statistical tool in the academic world. Many colleges and universities worldwide are using R today in their statistics classes. In addition, more and more corporate analysts are using R. R benefits from around 5000 open-source packages, which can be thought of as a collection of related functions. This number grows continuously with new packages submissions from the R user community. Each package provides specialized functionality in such areas as bioinformatics and financial market analysis. In the slide, the list on the right shows CRAN Task Views. CRAN stands for the Comprehensive R Archive Network, which is a network of FTP and web servers that store identical, up-to-date versions of R code and documentation. The CRAN Task Views list areas of concentration for a set of packages. Each link contains information that is available on a wide range of topics. 7

8 So, why do statisticians and data analysts use R? As mentioned previously, R is a statistics language that is similar to SAS or SPSS. R is a powerful and extensible environment, with a wide range of statistics and data visualization capabilities. - Powerful: Users can perform data analysis and visualization with a minimal amount of R code. - Extensible: Users can write their own R functions and packages that can be used locally, shared within their organizations, or shared with the broader R community through CRAN. It s easy to install and use. And it s free and downloadable from the R Project website. 8

9 Although it s a powerful and effective statistical environment, R has limitations. First, R was conceived as a single-user tool that was not multithreaded. The client and server components are bundled together as a single executable, much like Excel. R is limited by the memory and processing power of the machine on which it runs. Also, R can t automatically leverage the CPU capacity on a user s multiprocessor laptop without special packages and programming. Second, R suffers from another scalability limitation that is associated with RAM. R requires data that it operates on to be first loaded into memory. In addition, R s approach to passing data between function invocations results in data duplication. This call by value approach to parameter passing can use up memory quickly. So inherently, R is really not designed for use with big data. Some users have provided packages to overcome some of the memory limitations, but the users must explicitly program with these packages. 9

10 R provides a wealth of resources to help users, including: Many R-related books that are available on the R project website Many user groups and user conferences that are available to the R community Online libraries of reusable code from the CRAN website Documented R packages with sample data and code 10

11 Next, let s examine several common user interfaces for R. 11

12 First, let s take a quick look at the interface that comes with open-source R by default, called the R Console. This default open-source R graphical user interface (GUI) includes a command-line interface for running scripts or individual functions, as shown in the slide. In addition, open-source R supports many third-party graphics packages. - In this example, we load a popular third-party graphics package named ggplot2. - Then, the graphics package is called from the R Console command line. The second qplot function call displays the graphic on the right. - Here, the qplot function is invoked on the mtchars data set, which comes with R. In the graph, we plot miles per gallon against weight, with the size of each dot indicating the number of cylinders. 12

13 In addition to the default open-source R GUI, you can use a third-party integrated development environment (IDE), such as RStudio, which is shown in the slide. With RStudio: You can use the upper-left pane to view R scripts and select portions of an R script for execution. In the Console pane, you can execute R scripts or functions at the command line, in a similar fashion as the default R GUI. You can execute selected portions of R scripts in the top window by clicking the Run button. With this method, selected lines are pasted into the Console pane and executed. You can view graph results in the right pane. In this case, the Plots tab is selected. In this next view, the R script that we saw previously is displayed in the viewer window. Here, the first portion of the script is selected. This code requests help on the gplot() function. When the Run button is clicked, the selected code is pasted into the Console pane and then executed. In the display pane, you can select (and switch between) different tabbed output views on the Files, Plots, Packages, and Help tabs. In this case, the Help tab is selected to display results from the R help command. In this final view, the last function in the R script is selected. This same gplot() function was shown previously in the default R GUI. The Run button is clicked and the code is executed. The Plots tab shows the current output. In fact, RStudio also lets you view previously generated plots. RStudio is only one of many third-party R IDEs. As shown in the table of this 2011 poll, RStudio is the second most commonly used interface, behind the built-in R console we looked at earlier. However, it s often user preference that decides which IDE will be used. 13

14 Data visualization helps convey information faster than most other means. The link shown in the slide is for the R Graph Gallery, where you can find a variety of graphic types for R. 14

15 Here are a few examples of graphs in R. Of course, there are many others. Moving from left to right, and top to bottom, we show: A box plot Perspective graphs of mathematical surfaces 3-D scatter plots with points A regression plane Multivariate facet crafts Smooth scatter plots Venn diagrams And even chromosome mappings from the bioconductor package 15

16 In this final section of the lesson, you ll learn about Oracle s strategy for supporting the R community. This section includes the following topics: Goals Software term definitions High- and mid-level architectural overviews Software component features R user-community definitions 16

17 Scalability, performance, and production deployment are key requirements for the enterprise data analytics arena. What may work fine on your laptop for thousands or even millions of rows, won t scale to 100s of millions and billions of rows. Similarly, performance may be adequate on smaller scale data, but will moving to big data allow you to keep up? The cost of moving data to a separate server can render an application unusable. Finally, when you ve finished your project in a lab environment, how easy is it to deploy that result into production? Each of these requirements are met by Oracle s strategy for supporting the R community. 17

18 Oracle s goal for supporting open-source R is to deliver enterprise-level advanced analytics based on the R environment. The strategy is implemented through the release of the following Oracle technologies: Oracle R Enterprise (ORE), which is part of the Oracle Advanced Analytics option for Oracle Database 12c and 11g, release 2. ORE contains a statistics engine, and provides transparent access to database-resident data from R, as you will learn in this tutorial series. Oracle R distribution, which supports configurations of open-source R on various platforms. In addition, Oracle contributes bug fixes and enhancements to open-source R. ROracle, the open-source Oracle database interface for R. Oracle R Advanced Analytics for Hadoop, or ORAAH, provides an R interface to an Oracle Hadoop cluster on the BDA,and also to non-oracle Hadoop clusters. It enables you to access and manipulate data in the Hadoop Distributed File System, in the Oracle Database, and on the file system. 18

19 Now, let s examine an architectural view of ORE. The R workspace console may be the default R GUI or any of the third-party R GUIs. Users execute R scripts here. Then, the ORE transparency layer intercepts functions that operate on database tables or views. It translates the request into SQL for execution in Oracle Database for transformations and statistical computations. In Oracle Database, the statistics engine consists of native database functionality that leverages SQL and the various database management system (DBMS) packages, as well as enhancements that are specific to ORE. Finally, the results can be leveraged by enterprise systems, such as Oracle Business Intelligence Enterprise Edition (OBIEE), or web services-based applications. This design results in: No changes to the R user experience in the development environment The ability to scale to large data sets in the production environment And, the ability to embed results in operational systems, such as Oracle OBIEE Dashboards 19

20 This architectural view illustrates how ORE can work with Oracle R Advanced Analytics for Hadoop. ORAAH enables native R access to the Hadoop cluster for both: MapReduce programming in R, and Access to Hadoop Distributed File System (HDFS) data, in either the Big Data Appliance (as shown in the slide), or non-oracle Hadoop clusters 20

21 Oracle Big Data Appliance has been mentioned a couple of time so far in this lesson. So, what is it? Oracle BDA: Is an optimized solution for storing and integrating low-density data into Exadata. Is a preintegrated configuration with 18 of Oracle's Sun servers that include InfiniBand and Ethernet connectivity to simplify implementation and management. Has the Cloudera distribution, including Apache Hadoop to acquire and organize data, along with Oracle NoSQL Database Community Edition to acquire data. Includes additional system software: Oracle Linux, Oracle Java Hotspot Virtual Machine, and an open-source distribution of R. Oracle Big Data Connectors is an option for BDA. It consists of: Oracle Loader for Hadoop Oracle Data Integrator Application Adapter for Hadoop Oracle Direct Connector for HDFS Oracle R Advanced Analytics for Hadoop You can use ORAAH to access data in Exadata, and perform R calculations on HDFS data by using scalable map-reduce methods. 21

22 Now, let s take a brief look at the components of Oracle R Enterprise. From a software perspective, ORE consists of R packages, database libraries, and SQL extensions. We ll divide the features into three main groups: the Transparency Layer, the Statistics Engine, and SQL extensions. The Transparency Layer is a set of packages that map R data types to Oracle Database objects. This feature automatically generates SQL for R expressions on mapped data types, enabling direct interaction with data in Oracle Database while using R language constructs. Functionally, this mapping provides access to database tables from R as a type of data.frame: a base R data representation with rows and columns. ORE calls this an ore.frame. Therefore, when you invoke an R function on an ore.frame, the R operation is sent to the database for execution as SQL. The Statistics Engine is a database library that supports a variety of statistical computations. This engine includes existing in-database advanced analytics and new features added specifically in ORE. SQL extensions enable in-database embedded R execution, which is particularly valuable for third-party R packages, or custom functions, that do not have equivalent in-database functionality. 22 These SQL extensions also facilitate embedding R in operational systems.

23 If we look at ORE from the perspective of a collaborative execution model, it leverages three layers of computational engines. The first one is the client (or user) R engine, which resides on the desktop. This R engine consists of the base R packages, the ORE packages, and any other R packages that the user may have installed. At this level, the Transparency Layer intercepts R functions for in-database execution. It also enables interactive display of graphical results, while flow control remains with the R environment. From the client, users can submit entire R scripts for execution by Oracle Database, using embedded R execution. And, although not explicitly depicted here, users can connect to a Hadoop Cluster by using Oracle R Connector for Hadoop. The second compute engine is Oracle Database. This database allows scaling to large data sets. R users are able to access tables, views, and external tables, as well as data that is accessible through database links. The SQL generator through the Transparency Layer can automatically leverage database parallelism. It can also leverage both new and existing in-database statistical and data mining capabilities. The third compute engine (or engines) are those spawned and managed by Oracle Database, and they execute on the database server machine. These embedded R engines enable more efficient data transfer between the database and R. Because these engines run on the database server, rather than on the client, they are likely to have greater memory capacity and compute power. Exadata is an example. The embedded R execution enables parallel data transfer, returning rich XML or PNG image output, SQL access to R, and the ability to run parallel simulations. The embedded R engines also enable use of 3rd party packages or custom functions that do not have in-database SQL equivalent functionality. The engines also enable R users to write and test map-reduce scripts before rolling them out to a Hadoop cluster. Finally, these engines enable lights-out execution of R scripts; that is, scheduling or triggering R script packages inside a SQL or PL/SQL query. 23

24 The ORE target environment design provides a comprehensive, database-centric environment for end-to-end analytic processes in R, with immediate deployment to production environments. It provides many benefits, including: Elimination of R client engine memory constraint Execution of R scripts through the Oracle Database server machine for scalability and performance Seamless integration of Oracle Database as the HPC environment for R scripts, providing data parallelism and resource management The ability to operationalize entire R scripts in production applications Scoring of R models in Oracle Database 24

25 R and ORE can receive data from many sources. In this figure, we depict the R engine running on the user s laptop, as shown in the previous slide. Through a series of R packages, R itself is able to access data stored in both files, and in databases. In addition, ORE provides transparent access to data stored in the local Oracle Database, as we previously discussed. In addition, ORE has access to: Data in other databases, which are accessible through database links Data in external tables And, of course, data in HDFS. In addition to bulk import, ORE makes it possible to access Hadoop directly, in a similar fashion to external tables, by using HDFS connect. This means that you can join Hadoop data with database data. 25

26 26

27 Here are a few resources on Oracle R related technology. 27

28 28

29 So, in this lesson, we covered three primary topics. First, you learned what R is, who uses it, and why they use it. Then, we looked at some common user interfaces for R. Finally, we discussed Oracle s strategy for supporting the R community, including an overview of goals, definitions of software terms, high- and mid-level architecture, software component features, and R user-community definitions. 29

30 You ve just completed Introducing Oracle R Enterprise. Please move on to the next lesson in the series: Getting Started with ORE. 30

31 31

32 32

My name is Brian Pottle. I will be your guide for the next 45 minutes of interactive lectures and review on this lesson.

My name is Brian Pottle. I will be your guide for the next 45 minutes of interactive lectures and review on this lesson. Hello, and welcome to this online, self-paced lesson entitled ORE Embedded R Scripts: SQL Interface. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle.

More information

Oracle Big Data Science

Oracle Big Data Science Oracle Big Data Science Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com @VlamisSoftware Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri

More information

Getting Started with ORE - 1

Getting Started with ORE - 1 Hello, and welcome to this online, self-paced lesson entitled Getting Started with ORE. This session is part of an eight-lesson tutorial series on Oracle R Enterprise. My name is Brian Pottle. I will be

More information

Oracle Big Data Science IOUG Collaborate 16

Oracle Big Data Science IOUG Collaborate 16 Oracle Big Data Science IOUG Collaborate 16 Session 4762 Tim and Dan Vlamis Tuesday, April 12, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed 200+ Oracle

More information

<Insert Picture Here>

<Insert Picture Here> 1 Oracle R Enterprise Training Sessions Session 1: Getting Started with Oracle R Enterprise Mark Hornick, Senior Manager, Development Oracle Advanced Analytics The following is intended

More information

Oracle R Technologies

Oracle R Technologies Oracle R Technologies R for the Enterprise Mark Hornick, Director, Oracle Advanced Analytics @MarkHornick mark.hornick@oracle.com Safe Harbor Statement The following is intended to outline our general

More information

Oracle Big Data Connectors

Oracle Big Data Connectors Oracle Big Data Connectors Oracle Big Data Connectors is a software suite that integrates processing in Apache Hadoop distributions with operations in Oracle Database. It enables the use of Hadoop to process

More information

Brendan Tierney. Running R in the Database using Oracle R Enterprise 05/02/2018. Code Demo

Brendan Tierney. Running R in the Database using Oracle R Enterprise 05/02/2018. Code Demo Running R in the Database using Oracle R Enterprise Brendan Tierney Code Demo Data Warehousing since 1997 Data Mining since 1998 Analytics since 1993 1 Agenda What is R? Oracle Advanced Analytics Option

More information

Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016

Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016 Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold BIWA Summit 2016 Mark Hornick, Director, Advanced Analytics January 27, 2016 Safe Harbor Statement The following

More information

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT.

Oracle Big Data. A NA LYT ICS A ND MA NAG E MENT. Oracle Big Data. A NALYTICS A ND MANAG E MENT. Oracle Big Data: Redundância. Compatível com ecossistema Hadoop, HIVE, HBASE, SPARK. Integração com Cloudera Manager. Possibilidade de Utilização da Linguagem

More information

Oracle Big Data Fundamentals Ed 1

Oracle Big Data Fundamentals Ed 1 Oracle University Contact Us: +0097143909050 Oracle Big Data Fundamentals Ed 1 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, learn to use Oracle's Integrated Big Data

More information

Oracle BDA: Working With Mammoth - 1

Oracle BDA: Working With Mammoth - 1 Hello and welcome to this online, self-paced course titled Administering and Managing the Oracle Big Data Appliance (BDA). This course contains several lessons. This lesson is titled Working With Mammoth.

More information

Oracle 1Z Oracle Big Data 2017 Implementation Essentials.

Oracle 1Z Oracle Big Data 2017 Implementation Essentials. Oracle 1Z0-449 Oracle Big Data 2017 Implementation Essentials https://killexams.com/pass4sure/exam-detail/1z0-449 QUESTION: 63 Which three pieces of hardware are present on each node of the Big Data Appliance?

More information

Oracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data

Oracle Big Data SQL. Release 3.2. Rich SQL Processing on All Data Oracle Big Data SQL Release 3.2 The unprecedented explosion in data that can be made useful to enterprises from the Internet of Things, to the social streams of global customer bases has created a tremendous

More information

Security and Performance advances with Oracle Big Data SQL

Security and Performance advances with Oracle Big Data SQL Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,

More information

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data

Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data Spotfire Data Science with Hadoop Using Spotfire Data Science to Operationalize Data Science in the Age of Big Data THE RISE OF BIG DATA BIG DATA: A REVOLUTION IN ACCESS Large-scale data sets are nothing

More information

Introduction to the Oracle Big Data Appliance - 1

Introduction to the Oracle Big Data Appliance - 1 Hello and welcome to this online, self-paced course titled Administering and Managing the Oracle Big Data Appliance (BDA). This course contains several lessons. This lesson is titled Introduction to the

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

Spotfire: Brisbane Breakfast & Learn. Thursday, 9 November 2017

Spotfire: Brisbane Breakfast & Learn. Thursday, 9 November 2017 Spotfire: Brisbane Breakfast & Learn Thursday, 9 November 2017 CONFIDENTIALITY The following information is confidential information of TIBCO Software Inc. Use, duplication, transmission, or republication

More information

Oracle Big Data SQL brings SQL and Performance to Hadoop

Oracle Big Data SQL brings SQL and Performance to Hadoop Oracle Big Data SQL brings SQL and Performance to Hadoop Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data SQL, Hadoop, Big Data Appliance, SQL, Oracle, Performance, Smart Scan Introduction

More information

Big Data The end of Data Warehousing?

Big Data The end of Data Warehousing? Big Data The end of Data Warehousing? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Big data, data warehousing, advanced analytics, Hadoop, unstructured data Introduction If there was an Unwort

More information

Evolving To The Big Data Warehouse

Evolving To The Big Data Warehouse Evolving To The Big Data Warehouse Kevin Lancaster 1 Copyright Director, 2012, Oracle and/or its Engineered affiliates. All rights Insert Systems, Information Protection Policy Oracle Classification from

More information

Oracle NoSQL Database and Cisco- Collaboration that produces results. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Oracle NoSQL Database and Cisco- Collaboration that produces results. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Oracle NoSQL Database and Cisco- Collaboration that produces results 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. What is Big Data? SOCIAL BLOG SMART METER VOLUME VELOCITY VARIETY

More information

Narration Script for ODI Adapter for Hadoop estudy

Narration Script for ODI Adapter for Hadoop estudy Narration Script for ODI Adapter for Hadoop estudy MODULE 1: Overview of Oracle Big Data Title Hello, and welcome to this Oracle self-study course entitled Oracle Data Integrator Application Adapter for

More information

Oracle Big Data Fundamentals Ed 2

Oracle Big Data Fundamentals Ed 2 Oracle University Contact Us: 1.800.529.0165 Oracle Big Data Fundamentals Ed 2 Duration: 5 Days What you will learn In the Oracle Big Data Fundamentals course, you learn about big data, the technologies

More information

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Copyright 2012, Oracle and/or its affiliates. All rights reserved. 1 Big Data Connectors: High Performance Integration for Hadoop and Oracle Database Melli Annamalai Sue Mavris Rob Abbott 2 Program Agenda Big Data Connectors: Brief Overview Connecting Hadoop with Oracle

More information

IBM SPSS Statistics and open source: A powerful combination. Let s go

IBM SPSS Statistics and open source: A powerful combination. Let s go and open source: A powerful combination Let s go The purpose of this paper is to demonstrate the features and capabilities provided by the integration of IBM SPSS Statistics and open source programming

More information

Securing the Oracle BDA - 1

Securing the Oracle BDA - 1 Hello and welcome to this online, self-paced course titled Administering and Managing the Oracle Big Data Appliance (BDA). This course contains several lessons. This lesson is titled Securing the Oracle

More information

How to choose the right approach to analytics and reporting

How to choose the right approach to analytics and reporting SOLUTION OVERVIEW How to choose the right approach to analytics and reporting A comprehensive comparison of the open source and commercial versions of the OpenText Analytics Suite In today s digital world,

More information

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

Blended Learning Outline: Cloudera Data Analyst Training (171219a) Blended Learning Outline: Cloudera Data Analyst Training (171219a) Cloudera Univeristy s data analyst training course will teach you to apply traditional data analytics and business intelligence skills

More information

Boost your Analytics with ML for SQL Nerds

Boost your Analytics with ML for SQL Nerds Boost your Analytics with ML for SQL Nerds SQL Saturday Spokane Mar 10, 2018 Julie Koesmarno @MsSQLGirl mssqlgirl.com jukoesma@microsoft.com Principal Program Manager in Business Analytics for SQL Products

More information

1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions

1Z Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions 1Z0-449 Oracle Big Data 2017 Implementation Essentials Exam Summary Syllabus Questions Table of Contents Introduction to 1Z0-449 Exam on Oracle Big Data 2017 Implementation Essentials... 2 Oracle 1Z0-449

More information

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization

Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software Data Virtualization The Five Most Popular Uses of Data Virtualization Composite Software, Inc. June 2011 TABLE OF CONTENTS INTRODUCTION... 3 DATA FEDERATION... 4 PROBLEM DATA CONSOLIDATION

More information

Innovatus Technologies

Innovatus Technologies HADOOP 2.X BIGDATA ANALYTICS 1. Java Overview of Java Classes and Objects Garbage Collection and Modifiers Inheritance, Aggregation, Polymorphism Command line argument Abstract class and Interfaces String

More information

Oracle Application Express: Administration 1-2

Oracle Application Express: Administration 1-2 Oracle Application Express: Administration 1-2 The suggested course agenda is displayed in the slide. Each lesson, except the Course Overview, will be followed by practice time. Oracle Application Express:

More information

Massive Scalability With InterSystems IRIS Data Platform

Massive Scalability With InterSystems IRIS Data Platform Massive Scalability With InterSystems IRIS Data Platform Introduction Faced with the enormous and ever-growing amounts of data being generated in the world today, software architects need to pay special

More information

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security Bringing OpenStack to the Enterprise An enterprise-class solution ensures you get the required performance, reliability, and security INTRODUCTION Organizations today frequently need to quickly get systems

More information

Oracle R Enterprise Platform and Configuration Requirements Oracle R Enterprise runs on 64-bit platforms only.

Oracle R Enterprise Platform and Configuration Requirements Oracle R Enterprise runs on 64-bit platforms only. Oracle R Enterprise Release Notes Release 1.5.1 E83205-02 April 2017 These release notes contain important information about Release 1.5.1 of Oracle R Enterprise. New Features in Oracle R Enterprise 1.5.1

More information

BEST BIG DATA CERTIFICATIONS

BEST BIG DATA CERTIFICATIONS VALIANCE INSIGHTS BIG DATA BEST BIG DATA CERTIFICATIONS email : info@valiancesolutions.com website : www.valiancesolutions.com VALIANCE SOLUTIONS Analytics: Optimizing Certificate Engineer Engineering

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

Getting Started with Advanced Analytics in Finance, Marketing, and Operations

Getting Started with Advanced Analytics in Finance, Marketing, and Operations Getting Started with Advanced Analytics in Finance, Marketing, and Operations Southwest Regional Oracle Applications User Group Dan Vlamis February 24, 2017 @VlamisSoftware Vlamis Software Solutions Vlamis

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

Quick Deployment Step- by- step instructions to deploy Oracle Big Data Lite Virtual Machine

Quick Deployment Step- by- step instructions to deploy Oracle Big Data Lite Virtual Machine Quick Deployment Step- by- step instructions to deploy Oracle Big Data Lite Virtual Machine Version 4.1.0 Please note: This appliance is for testing and educational purposes only; it is unsupported and

More information

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine Version 4.11 Last Updated: 1/10/2018 Please note: This appliance is for testing and educational purposes only;

More information

Information empowerment for your evolving data ecosystem

Information empowerment for your evolving data ecosystem Information empowerment for your evolving data ecosystem Highlights Enables better results for critical projects and key analytics initiatives Ensures the information is trusted, consistent and governed

More information

Specialist ICT Learning

Specialist ICT Learning Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.

More information

<Insert Picture Here> Introduction to Big Data Technology

<Insert Picture Here> Introduction to Big Data Technology Introduction to Big Data Technology The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

More information

Getting Started With Intellicus. Version: 7.3

Getting Started With Intellicus. Version: 7.3 Getting Started With Intellicus Version: 7.3 Copyright 2015 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied or derived

More information

SAS Data Integration Studio 3.3. User s Guide

SAS Data Integration Studio 3.3. User s Guide SAS Data Integration Studio 3.3 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Data Integration Studio 3.3: User s Guide. Cary, NC: SAS Institute

More information

JMP and SAS : One Completes The Other! Philip Brown, Predictum Inc, Potomac, MD! Wayne Levin, Predictum Inc, Toronto, ON!

JMP and SAS : One Completes The Other! Philip Brown, Predictum Inc, Potomac, MD! Wayne Levin, Predictum Inc, Toronto, ON! Paper JM08-2014 JMP and SAS : One Completes The Other Philip Brown, Predictum Inc, Potomac, MD Wayne Levin, Predictum Inc, Toronto, ON ABSTRACT Integrating JMP with SAS creates a formidable data management

More information

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data June 2006 Note: This document is for informational purposes. It is not a commitment to deliver any material, code, or functionality,

More information

Oracle Big Data SQL High Performance Data Virtualization Explained

Oracle Big Data SQL High Performance Data Virtualization Explained Keywords: Oracle Big Data SQL High Performance Data Virtualization Explained Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data SQL, SQL, Big Data, Hadoop, NoSQL Databases, Relational Databases,

More information

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data

Massively Parallel Processing. Big Data Really Fast. A Proven In-Memory Analytical Processing Platform for Big Data Big Data Really Fast A Proven In-Memory Analytical Processing Platform for Big Data 2 Executive Summary / Overview: Big Data can be a big headache for organizations that have outgrown the practicality

More information

QLIKVIEW ARCHITECTURAL OVERVIEW

QLIKVIEW ARCHITECTURAL OVERVIEW QLIKVIEW ARCHITECTURAL OVERVIEW A QlikView Technology White Paper Published: October, 2010 qlikview.com Table of Contents Making Sense of the QlikView Platform 3 Most BI Software Is Built on Old Technology

More information

Oracle NoSQL Database Enterprise Edition, Version 18.1

Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database Enterprise Edition, Version 18.1 Oracle NoSQL Database is a scalable, distributed NoSQL database, designed to provide highly reliable, flexible and available data management across

More information

INTRODUCTION... 2 FEATURES OF DARWIN... 4 SPECIAL FEATURES OF DARWIN LATEST FEATURES OF DARWIN STRENGTHS & LIMITATIONS OF DARWIN...

INTRODUCTION... 2 FEATURES OF DARWIN... 4 SPECIAL FEATURES OF DARWIN LATEST FEATURES OF DARWIN STRENGTHS & LIMITATIONS OF DARWIN... INTRODUCTION... 2 WHAT IS DATA MINING?... 2 HOW TO ACHIEVE DATA MINING... 2 THE ROLE OF DARWIN... 3 FEATURES OF DARWIN... 4 USER FRIENDLY... 4 SCALABILITY... 6 VISUALIZATION... 8 FUNCTIONALITY... 10 Data

More information

From Design to Deployment:

From Design to Deployment: From Design to Deployment: Leveraging the combined power of LabVIEW and Mathematica throughout the development cycle Copyright 2002 BetterVIEW Consulting Introduction As research and production cycles

More information

Boost your Analytics with Machine Learning for SQL Nerds. Julie mssqlgirl.com

Boost your Analytics with Machine Learning for SQL Nerds. Julie mssqlgirl.com Boost your Analytics with Machine Learning for SQL Nerds Julie Koesmarno @MsSQLGirl mssqlgirl.com 1. Y ML 2. Operationalizing ML 3. Tips & Tricks 4. Resources automation delighting customers Deepen Engagement

More information

Intellicus Getting Started

Intellicus Getting Started Intellicus Getting Started Intellicus Web-based Reporting Suite Version 4.5 Enterprise Professional Smart Developer Smart Viewer Intellicus Technologies info@intellicus.com www.intellicus.com Copyright

More information

Introduction to Big Data

Introduction to Big Data Introduction to Big Data OVERVIEW We are experiencing transformational changes in the computing arena. Data is doubling every 12 to 18 months, accelerating the pace of innovation and time-to-value. The

More information

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a)

Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Blended Learning Outline: Developer Training for Apache Spark and Hadoop (180404a) Cloudera s Developer Training for Apache Spark and Hadoop delivers the key concepts and expertise need to develop high-performance

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Anlytics platform PENTAHO PERFORMANCE ENGINEERING TEAM

More information

Qlik Sense Enterprise architecture and scalability

Qlik Sense Enterprise architecture and scalability White Paper Qlik Sense Enterprise architecture and scalability June, 2017 qlik.com Platform Qlik Sense is an analytics platform powered by an associative, in-memory analytics engine. Based on users selections,

More information

Oracle Database 11g for Data Warehousing & Big Data: Strategy, Roadmap Jean-Pierre Dijcks, Hermann Baer Oracle Redwood City, CA, USA

Oracle Database 11g for Data Warehousing & Big Data: Strategy, Roadmap Jean-Pierre Dijcks, Hermann Baer Oracle Redwood City, CA, USA Oracle Database 11g for Data Warehousing & Big Data: Strategy, Roadmap Jean-Pierre Dijcks, Hermann Baer Oracle Redwood City, CA, USA Keywords: Big Data, Oracle Big Data Appliance, Hadoop, NoSQL, Oracle

More information

Dealing with Data Especially Big Data

Dealing with Data Especially Big Data Dealing with Data Especially Big Data INFO-GB-2346.01 Fall 2017 Professor Norman White nwhite@stern.nyu.edu normwhite@twitter Teaching Assistant: Frenil Sanghavi fps241@stern.nyu.edu Administrative Assistant:

More information

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Big Data Hadoop Developer Course Content Who is the target audience? Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours Complete beginners who want to learn Big Data Hadoop Professionals

More information

Using Machine Learning in OBIEE for Actionable BI. By Lakshman Bulusu Mitchell Martin Inc./ Bank of America

Using Machine Learning in OBIEE for Actionable BI. By Lakshman Bulusu Mitchell Martin Inc./ Bank of America Using Machine Learning in OBIEE for Actionable BI By Lakshman Bulusu Mitchell Martin Inc./ Bank of America Using Machine Learning in OBIEE for Actionable BI Using Machine Learning (ML) via Oracle R Technologies

More information

SQL Server Machine Learning Marek Chmel & Vladimir Muzny

SQL Server Machine Learning Marek Chmel & Vladimir Muzny SQL Server Machine Learning Marek Chmel & Vladimir Muzny @VladimirMuzny & @MarekChmel MCTs, MVPs, MCSEs Data Enthusiasts! vladimir@datascienceteam.cz marek@datascienceteam.cz Session Agenda Machine learning

More information

Getting Started with Intellicus. Version: 16.0

Getting Started with Intellicus. Version: 16.0 Getting Started with Intellicus Version: 16.0 Copyright 2016 Intellicus Technologies This document and its content is copyrighted material of Intellicus Technologies. The content may not be copied or derived

More information

TIBCO Statistica Release Notes

TIBCO Statistica Release Notes TIBCO Statistica Release Notes Software Release 13.3.1 November 2017 Two-Second Advantage Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED

More information

DATA SCIENCE USING SPARK: AN INTRODUCTION

DATA SCIENCE USING SPARK: AN INTRODUCTION DATA SCIENCE USING SPARK: AN INTRODUCTION TOPICS COVERED Introduction to Spark Getting Started with Spark Programming in Spark Data Science with Spark What next? 2 DATA SCIENCE PROCESS Exploratory Data

More information

ORAAH Change List Summary. ORAAH Change List Summary

ORAAH Change List Summary. ORAAH Change List Summary ORAAH 2.7.1 Change List Summary i ORAAH 2.7.1 Change List Summary ORAAH 2.7.1 Change List Summary ii REVISION HISTORY NUMBER DATE DESCRIPTION NAME ORAAH 2.7.1 Change List Summary iii Contents 1 ORAAH 2.7.1

More information

Apache Spark 2 X Cookbook Cloud Ready Recipes For Analytics And Data Science

Apache Spark 2 X Cookbook Cloud Ready Recipes For Analytics And Data Science Apache Spark 2 X Cookbook Cloud Ready Recipes For Analytics And Data Science We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing

More information

The History and Use of R. Joseph Kambourakis

The History and Use of R. Joseph Kambourakis The History and Use of R Joseph Kambourakis Ground Rules Interrupt me These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup Slides will be available Joseph

More information

Fusion Registry 9 SDMX Data and Metadata Management System

Fusion Registry 9 SDMX Data and Metadata Management System Registry 9 Data and Management System Registry 9 is a complete and fully integrated statistical data and metadata management system using. Whether you require a metadata repository supporting a highperformance

More information

This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem.

This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem. About the Tutorial Sqoop is a tool designed to transfer data between Hadoop and relational database servers. It is used to import data from relational databases such as MySQL, Oracle to Hadoop HDFS, and

More information

Talend Big Data Sandbox. Big Data Insights Cookbook

Talend Big Data Sandbox. Big Data Insights Cookbook Overview Pre-requisites Setup & Configuration Hadoop Distribution Download Demo (Scenario) Overview Pre-requisites Setup & Configuration Hadoop Distribution Demo (Scenario) About this cookbook What is

More information

Learning R Series Session 5: Oracle R Enterprise 1.3 Integrating R Results and Images with OBIEE Dashboards Mark Hornick Oracle Advanced Analytics

Learning R Series Session 5: Oracle R Enterprise 1.3 Integrating R Results and Images with OBIEE Dashboards Mark Hornick Oracle Advanced Analytics Learning R Series Session 5: Oracle R Enterprise 1.3 Integrating R Results and Images with OBIEE Dashboards Mark Hornick Oracle Advanced Analytics Learning R Series 2012 Session Title

More information

Intellicus Enterprise Reporting and BI Platform

Intellicus Enterprise Reporting and BI Platform Getting Started with Intellicus Intellicus Enterprise Reporting and BI Platform Intellicus Technologies info@intellicus.com www.intellicus.com Getting Started with Intellicus i Copyright 2012 Intellicus

More information

Profitability Application Pack Installation Guide Release

Profitability Application Pack Installation Guide Release Profitability Application Pack Installation Guide Release 8.0.6.1.0 October 2018 Document Versioning Version Number Revision Date Changes Done 1.0 10 September 2018 Final version released 1.1 5 October

More information

Part 1: Indexes for Big Data

Part 1: Indexes for Big Data JethroData Making Interactive BI for Big Data a Reality Technical White Paper This white paper explains how JethroData can help you achieve a truly interactive interactive response time for BI on big data,

More information

Acquiring Big Data to Realize Business Value

Acquiring Big Data to Realize Business Value Acquiring Big Data to Realize Business Value Agenda What is Big Data? Common Big Data technologies Use Case Examples Oracle Products in the Big Data space In Summary: Big Data Takeaways

More information

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation SAS Enterprise Miner Performance on IBM System p 570 Jan, 2008 Hsian-Fen Tsao Brian Porter Harry Seifert IBM Corporation Copyright IBM Corporation, 2008. All Rights Reserved. TABLE OF CONTENTS ABSTRACT...3

More information

QuickSpecs. ISG Navigator for Universal Data Access M ODELS OVERVIEW. Retired. ISG Navigator for Universal Data Access

QuickSpecs. ISG Navigator for Universal Data Access M ODELS OVERVIEW. Retired. ISG Navigator for Universal Data Access M ODELS ISG Navigator from ISG International Software Group is a new-generation, standards-based middleware solution designed to access data from a full range of disparate data sources and formats.. OVERVIEW

More information

KNIME for the life sciences Cambridge Meetup

KNIME for the life sciences Cambridge Meetup KNIME for the life sciences Cambridge Meetup Greg Landrum, Ph.D. KNIME.com AG 12 July 2016 What is KNIME? A bit of motivation: tool blending, data blending, documentation, automation, reproducibility More

More information

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Topics AGENDA Challenges with Big Data Analytics How SAS can help you to minimize time to value with

More information

Spatial Analytics Built for Big Data Platforms

Spatial Analytics Built for Big Data Platforms Spatial Analytics Built for Big Platforms Roberto Infante Software Development Manager, Spatial and Graph 1 Copyright 2011, Oracle and/or its affiliates. All rights Global Digital Growth The Internet of

More information

Spotfire and Tableau Positioning. Summary

Spotfire and Tableau Positioning. Summary Licensed for distribution Summary So how do the products compare? In a nutshell Spotfire is the more sophisticated and better performing visual analytics platform, and this would be true of comparisons

More information

Scalable Machine Learning in R. with H2O

Scalable Machine Learning in R. with H2O Scalable Machine Learning in R with H2O Erin LeDell @ledell DSC July 2016 Introduction Statistician & Machine Learning Scientist at H2O.ai in Mountain View, California, USA Ph.D. in Biostatistics with

More information

Perform scalable data exchange using InfoSphere DataStage DB2 Connector

Perform scalable data exchange using InfoSphere DataStage DB2 Connector Perform scalable data exchange using InfoSphere DataStage Angelia Song (azsong@us.ibm.com) Technical Consultant IBM 13 August 2015 Brian Caufield (bcaufiel@us.ibm.com) Software Architect IBM Fan Ding (fding@us.ibm.com)

More information

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism Big Data and Hadoop with Azure HDInsight Andrew Brust Senior Director, Technical Product Marketing and Evangelism Datameer Level: Intermediate Meet Andrew Senior Director, Technical Product Marketing and

More information

Netezza The Analytics Appliance

Netezza The Analytics Appliance Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for

More information

Overview of R. Biostatistics

Overview of R. Biostatistics Overview of R Biostatistics 140.776 Stroustrup s Law There are only two kinds of languages: the ones people complain about and the ones nobody uses. R is a dialect of S What is R? What is S? S is a language

More information

What does SAS Data Management do? For whom is SAS Data Management designed? Key Benefits

What does SAS Data Management do? For whom is SAS Data Management designed? Key Benefits FACT SHEET SAS Data Management Transform raw data into a valuable business asset What does SAS Data Management do? SAS Data Management helps transform, integrate, govern and secure data while improving

More information

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training:: Module Title Duration : Cloudera Data Analyst Training : 4 days Overview Take your knowledge to the next level Cloudera University s four-day data analyst training course will teach you to apply traditional

More information

Analyzing Big Data with Microsoft R

Analyzing Big Data with Microsoft R Analyzing Big Data with Microsoft R 20773; 3 days, Instructor-led Course Description The main purpose of the course is to give students the ability to use Microsoft R Server to create and run an analysis

More information

Hortonworks Data Platform

Hortonworks Data Platform Hortonworks Data Platform Workflow Management (August 31, 2017) docs.hortonworks.com Hortonworks Data Platform: Workflow Management Copyright 2012-2017 Hortonworks, Inc. Some rights reserved. The Hortonworks

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri and Shamkant B. Navathe CHAPTER 1 Databases and Database Users Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Slide 1-2 OUTLINE Types of Databases and Database Applications

More information

Conditionally control code flow (loops, control structures). Create stored procedures and functions.

Conditionally control code flow (loops, control structures). Create stored procedures and functions. TEMARIO Oracle Database: Program with PL/SQL Ed 2 Duration: 5 Days What you will learn This Oracle Database: Program with PL/SQL training starts with an introduction to PL/SQL and then explores the benefits

More information

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight ESG Lab Review InterSystems Data Platform: A Unified, Efficient Data Platform for Fast Business Insight Date: April 218 Author: Kerry Dolan, Senior IT Validation Analyst Abstract Enterprise Strategy Group

More information