R Language for the SQL Server DBA

Similar documents
A Crash-Course in Biml. Tim Mitchell, Principal Data Architect, Tyleris Data Solutions Moderated By: Cathrine Wilhelmsen

A Closer Look at Distributed Availability Groups. Allan Hirt, Managing Partner, SQLHA LLC Moderated By: George Carlisle

Locking, Blocking, Versions: Concurrency for Maximum Performance. Kalen Delaney, Moderated By: Daniel Janik

The Ambiguous Case of Off-Row Storage in In- Memory OLTP. Dmitri Korotkevitch, aboutsqlserver.com Moderated By: Sander Stad

Swimming in the Data Lake. Presented by Warner Chaves Moderated by Sander Stad

New Paradigm for Performance Tuning in SQL Server Presented by Robert Davis

BIG DATA COURSE CONTENT

Stages of Data Processing

Asanka Padmakumara. ETL 2.0: Data Engineering with Azure Databricks

Columnstore Technology Improvements in SQL Server Presented by Niko Neugebauer Moderated by Nagaraj Venkatesan

Big Data Architect.

Using JSON with SQL Server Presented by Steve Hughes Moderated by Sarah Huang

Microsoft Big Data and Hadoop

Data Architectures in Azure for Analytics & Big Data

SQL Operations Studio - a new multi-platform tool for SQL Server database development, administration, and monitoring

SQL Server Machine Learning Marek Chmel & Vladimir Muzny

Overview. : Cloudera Data Analyst Training. Course Outline :: Cloudera Data Analyst Training::

SQT03 Big Data and Hadoop with Azure HDInsight Andrew Brust. Senior Director, Technical Product Marketing and Evangelism

Ian Choy. Technology Solutions Professional

Microsoft certified solutions associate

IT directors, CIO s, IT Managers, BI Managers, data warehousing professionals, data scientists, enterprise architects, data architects

HDInsight > Hadoop. October 12, 2017

Webinar Series TMIP VISION

Azure Data Factory VS. SSIS. Reza Rad, Consultant, RADACAD

Securing SQL Server Processes with Certificates. Robert, Davis, Database Engineer, BlueMountain Capital Management Moderated By: Ivan Sanders

Blended Learning Outline: Cloudera Data Analyst Training (171219a)

microsoft

Big Data Hadoop Developer Course Content. Big Data Hadoop Developer - The Complete Course Course Duration: 45 Hours

Přehled novinek v SQL Server 2016

Modeling. Preparation. Operationalization. Profile Explore. Model Testing & Validation. Feature & Algorithm Selection. Transform Cleanse Denormalize

BEST BIG DATA CERTIFICATIONS

Big Data Technology Ecosystem. Mark Burnette Pentaho Director Sales Engineering, Hitachi Vantara

Cortana Intelligence Suite; Where the Magic Happens

What is Gluent? The Gluent Data Platform

SQL Server Internals: The Practical Angle Sneak Peek. Dmitri Korotkevitch Moderated by Roberto Fonseca

Take P, R or U. and solve your data quality problems Oliver Engels & Tillmann Eitelberg, OH22

Big Data. Big Data Analyst. Big Data Engineer. Big Data Architect

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

DATA SCIENCE USING SPARK: AN INTRODUCTION

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Big Data com Hadoop. VIII Sessão - SQL Bahia. Impala, Hive e Spark. Diógenes Pires 03/03/2018

Hadoop. Introduction / Overview

COURSE 10977A: UPDATING YOUR SQL SERVER SKILLS TO MICROSOFT SQL SERVER 2014

Innovatus Technologies

@Pentaho #BigDataWebSeries

Monitoring Page Splits in SQL Server

Mastering Data Warehouse Aggregates Solutions For Star Schema Performance

Data 101 Which DB, When. Joe Yong Azure SQL Data Warehouse, Program Management Microsoft Corp.

Ooops, data breach? Not with Always Encrypted. Daniel de Sousa, BI Specialist, Dominos Pizza Enterprise Moderated By: Shane O'Neill

Hadoop course content

Monitoring & Tuning Azure SQL Database

SpagoBI and Talend jointly support Big Data scenarios

Oracle Big Data Science IOUG Collaborate 16

Microsoft. Exam Questions Perform Data Engineering on Microsoft Azure HDInsight (beta) Version:Demo

Big Data Specialized Studies

Big Data Infrastructures & Technologies

CERTIFICATE IN SOFTWARE DEVELOPMENT LIFE CYCLE IN BIG DATA AND BUSINESS INTELLIGENCE (SDLC-BD & BI)

Data Science and Open Source Software. Iraklis Varlamis Assistant Professor Harokopio University of Athens

An InterSystems Guide to the Data Galaxy. Benjamin De Boe Product Manager

Exam Questions

The age of Big Data Big Data for Oracle Database Professionals

Things Every Oracle DBA Needs to Know about the Hadoop Ecosystem. Zohar Elkayam

Microsoft. Exam Questions Perform Data Engineering on Microsoft Azure HDInsight (beta) Version:Demo

Think & Work like a Data Scientist with SQL 2016 & R DR. SUBRAMANI PARAMASIVAM (MANI)

Big Data with Hadoop Ecosystem

Goldilocks and The Three Linux Bears

MCSE Cloud Platform & Infrastructure CLOUD PLATFORM & INFRASTRUCTURE.

The Hadoop Ecosystem. EECS 4415 Big Data Systems. Tilemachos Pechlivanoglou

Microsoft, Open Source, R: You Gotta be Kidding Me!

MCSE Mobility Earned: MCSE Cloud Platform & Infrastructure Earned: 2017 MCSE MCSE. MCSD App Builder. MCSE Business Applications Earned 2017

BigInsights and Cognos Stefan Hubertus, Principal Solution Specialist Cognos Wilfried Hoge, IT Architect Big Data IBM Corporation

Extending Applications Securely Using Service Broker. Ed Leighton-Dick, Founder, Kingfisher Technologies Moderated By: Lance Harra

Activator Library. Focus on maximizing the value of your data, gain business insights, increase your team s productivity, and achieve success.

Oracle Big Data Science

"Charting the Course... MOC B Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Summary

Apache Spark 2 X Cookbook Cloud Ready Recipes For Analytics And Data Science

Specialist ICT Learning

Cloud Computing 3. CSCI 4850/5850 High-Performance Computing Spring 2018

Processing Unstructured Data. Dinesh Priyankara Founder/Principal Architect dinesql Pvt Ltd.

SQL Server Evolution. SQL 2016 new innovations. Trond Brande

Fast Innovation requires Fast IT

Microsoft Azure Databricks for data engineering. Building production data pipelines with Apache Spark in the cloud

Saving ETL Costs Through Data Virtualization Across The Enterprise

Oracle Big Data Connectors

exam. Microsoft Perform Data Engineering on Microsoft Azure HDInsight. Version 1.0

MODERN BIG DATA DESIGN PATTERNS CASE DRIVEN DESINGS

Big Data Syllabus. Understanding big data and Hadoop. Limitations and Solutions of existing Data Analytics Architecture

Delving Deep into Hadoop Course Contents Introduction to Hadoop and Architecture

Introduction to NoSQL by William McKnight

Microsoft Exam

Talend Big Data Sandbox. Big Data Insights Cookbook

Oliver Engels & Tillmann Eitelberg. Big Data! Big Quality?

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

Bring Context To Your Machine Data With Hadoop, RDBMS & Splunk

Big Data Analytics using Apache Hadoop and Spark with Scala

Microsoft Perform Data Engineering on Microsoft Azure HDInsight.

Big Data Analytics. Yossi Elkayam Sr. BI Architect Microsoft Services

STREAMLINED CERTIFICATION PATHS

Oracle Data Integrator 12c: Integration and Administration

Chapter 6 VIDEO CASES

Transcription:

R Language for the SQL Server DBA Beginning with R Ing. Eduardo Castro, PhD, Principal Data Analyst Architect, LP Consulting Moderated By: Jose Rolando Guay Paz

Thank You microsoft.com idera.com attunity.com Empower users with new insights through familiar tools while balancing the need for IT to monitor and manage user created content. Deliver access to all data types across structured and unstructured sources. IDERA s award-winning SQL Server database solutions and multi-platform database, application and cloud monitoring tools ensure your business never slows down. Attunity, a leader in data integration and management software, helps move, transform and analyze data efficiently in SQL Server/Azure environments. 2

JOIN PASS PASS is a not-for-profit organization which offers year-round learning opportunities to data professionals Membership is free, join today at www.sqlpass.org Access to online training and content Join Local Chapters and Virtual Chapters Enjoy discounted event rates Get advance notice of member exclusives

Save on PASS Summit 2016 Registration! The world s largest gathering of SQL Server & BI professionals Learn from the world s top data experts, in over 190 technical sessions More than 4000 attendees from all over the world Meet the Microsoft engineering team! Save $200 right now using discount code 24HOP200! $2,195 until September 18, 2016 www.passsummit.com

BIO Ing. Eduardo Castro, PhD Microsoft Data Platform MVP and PASS Board of Advisor for LATAM, is a well known LATAM SQL Server Expert and focuses on architecture, Business Intelligence and Data Analytics, Eduardo has an specialization in Data Analysis and Big Data. ecastrom @edocastro http://tinyurl.com/h35nqt4

Session objective R and Phyton are the new tools for data professionals. The SQL Server DBA should know how to integrate R Scripts into data analytics and data warehouses. In this session, you will learn how to use the new feature in SQL Server 2016 to run R Scripts.

Data Science and Data Analytics Statistics, machine learning algorithms applied to data analysis Hypotheses, experiments, facts with tools popular among statistics experts.

Data wrangling Big data Data mining & machine learning Statistics

New data sources in the Data Analysis Pipe 010101010101010101 1010101010101010 01010101010101 101010101010 Data Transformation Big Data Tools R Language Big Data Unstructured Data Sources Tabular OLAP SQL PowerBI

Tools Chart from "Data Science Salary Survey 2014" (ISBN 978-1-491-91842-5) 2015 O'Reilly Media, used with permission. Arrows mine. For more info, and great titles on data science, visit oreilly.com

Popular Tools SPSS, Matlab, SAS NoSQL, Mongo DB, Couchbase, Cassandra Microsoft Excel Java, R, Python, Clojure, Haskell, Scala Hadoop, HDFS MapReduce, Spark, Storm HBase, Pig Hive, Shark ETL, Webscrapers,Flume, SqoopSQL, RDBMS, DW, OLAP Knime, Weka, RapidMiner

Tools by Microsoft Hadoop in the cloud + Storm (real time analysis) +HBase (NoSQL) +Mahoot (Macine Learning Power BI: Power Query, Power View, and Dashboards Excel Azure Data Factory (ETL in the cloud) Analytics Platform System (SQL Server on steroids + Hadoop + hardware) Streaming Data from Cloud Based in HDInsight / Hadoop

Tools by Microsoft Let s you run Scrips inside Visual Studio Integrate R Scripts Integrate R Graphs Open Source and Enterprise Editions

What is R? Interpreted Language Emphasis in statistical software packages 5000+ IDE: R Studio http://www.rstudio.com/ Open Source, free, multiplatform R Core: http://cran.r-project.org/ Revolution Analytics: parallelism and Performance: http://www.revolutionanalytics.com/ Azure ML: built-in

First steps with R R is a language popular among statistics experts and data scientists Open Source R is extensible, the are hundreds of packages that add new functionalities to R How to install R http://www.r-project.org/ Multiplatform Windows, Mac, Linux To install an IDE R Studio: IDE for R http://www.rstudio.com/ First install R then R Studio

R Studio

The Open Source R R loads data in memory R only has ONE thread Is not easy to create a R Cluster R Open is supported by the community Microsoft R Server doesn t have this limitations

Microsoft R Server previously Revolution Server

Microsof R Server Versions Microsoft R Open Microsoft R Enterprise

Integrating R inside SQL Server 2016 Fraud detection Sales forecast Predictive Maintenance R Language R Scripting 010010 100100 010101 010010 100100 010101 010010 100100 010101 Analytical library 010010 100100 010101 T-SQL Interface Relational data 010010 100100 010101 SQL Server 2016 Data scientists interact directly data Data Developer / DBA Data management and analytical in the same engine Azure Machine Learning Support R Language and Phyton

Installing R Support in SQL 2016

Installing R Support in SQL 2016

Installing R Support in SQL 2016

R integration within SQL Server 2016 exec sp_configure'external scripts enabled', 1; reconfigure; "C: \ Program files \ RRO \ RRO-3.2.2-for-RRE-7.5.0 \ R-3.2.2 \ library \RevoScaleR\rxLibs\ X64 \ registerrext.exe "/ install

R integration within SQL Server 2016 USE <target database name> GO CREATE LOGIN [<login name>] WITH PASSWORD = '<password>', CHECK_EXPIRATION = OFF, CHECK_POLICY = OFF; CREATE USER [<user name>] FOR LOGIN [<login name>] WITH DEFAULT_SCHEMA = [db_datareader] ALTER ROLE [db_datareader] ADD MEMBER [<user name>]

R integration within SQL Server 2016 USE [master] GO CREATE USER [<user name>] FOR LOGIN [<login name>] WITH DEFAULT_SCHEMA = [db_rrerole] ALTER ROLE [db_rrerole] ADD MEMBER [<user name>]

Demo. Installing R Support

What tool should I use?

Using R Studio

Demo. Using R Studio

Review R inside SQL Server 2016 Fraud detection Sales forecast Predictive Maintenance R Language R Scripting 010010 100100 010101 010010 100100 010101 010010 100100 010101 Analytical library 010010 100100 010101 T-SQL Interface Relational data 010010 100100 010101 SQL Server 2016 Data scientists interact directly data Data Developer / DBA Data management and analytical in the same engine Azure Machine Learning Support R Language and Phyton

Demo. Running R Scripts inside SQL Server

Summary There are new requirements for the DBA Often they come from the Data Science area In this session we had shown how to leverage the new features in SQL Server 2016 to include R Scripts inside the database in an integrated way