Week 1 Unit 1: Introduction to Data Science

Similar documents
HA301. SAP HANA 2.0 SPS03 - Advanced Modeling COURSE OUTLINE. Course Version: 15 Course Duration:

HA100 SAP HANA Introduction

BW405. BW/4HANA Query Design and Analysis COURSE OUTLINE. Course Version: 14 Course Duration: 5 Day(s)

S4H01. Introduction to SAP S/4HANA COURSE OUTLINE. Course Version: 04 Course Duration: 2 Day(s)

HA300 SAP HANA Modeling

BOD410 SAP Lumira 2.0 Designer

HA100 SAP HANA Introduction

HA355. SAP HANA Smart Data Integration COURSE OUTLINE. Course Version: 12 Course Duration: 3 Day(s)

HA215 SAP HANA Monitoring and Performance Analysis

HA150. SAP HANA 2.0 SPS02 - SQL and SQLScript for SAP HANA COURSE OUTLINE. Course Version: 14 Course Duration: 3 Day(s)

Device Operation Process Diagrams. SAP Mobile Secure rapid-deployment solution September 2014

Week 2 Unit 3: Creating a JDBC Application. January, 2015

HA300 SAP HANA Modeling

C4C30. SAP Cloud Applications Studio COURSE OUTLINE. Course Version: 21 Course Duration: 4 Day(s)

HA100 SAP HANA Introduction

BC414. Programming Database Updates COURSE OUTLINE. Course Version: 15 Course Duration: 2 Day(s)

CA611 Testing with ecatt

HA 450. Application Development for SAP HANA COURSE OUTLINE. Course Version: 12 Course Duration:

CLD100. Cloud for SAP COURSE OUTLINE. Course Version: 16 Course Duration: 2 Day(s)

HA215 SAP HANA Monitoring and Performance Analysis

HA100 SAP HANA Introduction

HA150. SAP HANA 2.0 SPS03 - SQL and SQLScript for SAP HANA COURSE OUTLINE. Course Version: 15 Course Duration:

BW305H. Query Design and Analysis with SAP Business Warehouse Powered by SAP HANA COURSE OUTLINE. Course Version: 15 Course Duration: 5 Day(s)

COURSE LISTING. Courses Listed. Training for Database & Technology with Modeling in SAP HANA. Last updated on: 30 Nov 2018.

SAP EarlyWatch Alert. SAP HANA Deployment Best Practices Active Global Support, SAP AG 2015

BOCRC. SAP Crystal Reports Compact Course COURSE OUTLINE. Course Version: 15 Course Duration: 3 Day(s)

SLT100. Real Time Replication with SAP LT Replication Server COURSE OUTLINE. Course Version: 13 Course Duration: 3 Day(s)

Week 2 Unit 1: Introduction and First Steps with EJB. January, 2015

MDG100 Master Data Governance

DBW4H. Data Warehousing with SAP BW/4HANA - Delta from SAP BW powered by SAP HANA COURSE OUTLINE. Course Version: 13 Course Duration: 2 Day(s)

S4H410. SAP S/4HANA Embedded Analytics and Modeling with Core Data Services (CDS) Views COURSE OUTLINE. Course Version: 05 Course Duration: 2 Day(s)

Device Application Onboarding Process Diagrams. SAP Mobile Secure: SAP Afaria 7 SP5 September 2014

Complementary Demo Guide

BC470. Form Printing with SAP Smart Forms COURSE OUTLINE. Course Version: 18 Course Duration:

D75AW. Delta ABAP Workbench SAP NetWeaver 7.0 to SAP NetWeaver 7.51 COURSE OUTLINE. Course Version: 18 Course Duration:

HA150 SQL Basics for SAP HANA

Let s Exploit DITA: How to automate an App Catalog

S4D430 Building Views in Core Data Services ABAP (CDS ABAP)

SAP Hybris Billing, Pricing Simulation Extended Functions Release 2.0, SP03

ADM505. Oracle Database Administration COURSE OUTLINE. Course Version: 15 Course Duration: 3 Day(s)

COURSE LISTING. Courses Listed. Training for Database & Technology with Modeling in SAP HANA. Einsteiger. Fortgeschrittene.

HA240 Authorization, Security and Scenarios

SAP Analytics Cloud model maintenance Restoring invalid model data caused by hierarchy conflicts

UX402 SAP SAPUI5 Development

HA240 SAP HANA 2.0 SPS02

DS10. Data Services - Platform and Transforms COURSE OUTLINE. Course Version: 15 Course Duration: 3 Day(s)

FAQs Data Sources SAP Hybris Cloud for Customer PUBLIC

SAP HANA SPS 08 - What s New? SAP HANA Interactive Education - SHINE (Delta from SPS 07 to SPS 08) SAP HANA Product Management May, 2014

BC403 Advanced ABAP Debugging

SAP HANA SPS 09 - What s New? SAP River

UX300 SAP Screen Personas 3.0 Development

FAQs Data Cleansing SAP Hybris Cloud for Customer PUBLIC

FAQs OData Services SAP Hybris Cloud for Customer PUBLIC

BW310H. Data Warehousing with SAP Business Warehouse powered by SAP HANA COURSE OUTLINE. Course Version: 15 Course Duration: 5 Day(s)

SAP 3D Visual Enterprise 9.0: Localization of Authoring Content

BC405 Programming ABAP Reports

HA400 ABAP Programming for SAP HANA

BW305. SAP Business Warehouse Query Design and Analysis COURSE OUTLINE. Course Version: 15 Course Duration: 5 Day(s)

BC404. ABAP Programming in Eclipse COURSE OUTLINE. Course Version: 16 Course Duration: 3 Day(s)

BW462 SAP BW/4HANA COURSE OUTLINE. Course Version: 16 Course Duration: 5 Day(s)

FAQs Facebook Integration with SAP Hybris Cloud for Customer SAP Hybris Cloud for Customer PUBLIC

ADM506. Database Administration Oracle II COURSE OUTLINE. Course Version: 15 Course Duration: 2 Day(s)

BOID10. SAP BusinessObjects Information Design Tool COURSE OUTLINE. Course Version: 17 Course Duration: 5 Day(s)

Device Configuration Process Diagrams. SAP Mobile Secure: SAP Afaria 7 SP5 September 2014

UX400. OpenUI5 Development Foundations COURSE OUTLINE. Course Version: 02 Course Duration: 5 Day(s)

FAQs Data Workbench SAP Hybris Cloud for Customer PUBLIC

SAP Mobile Secure Rapiddeployment. Software Requirements

SCM380 SAP MII - Manufacturing Integration and Intelligence Fundamentals

SAP HANA Cloud Integration for data services What s new in (Sept 2015) Ben Hofmans, Product Manager

ADM110. Installing and Patching SAP S/4HANA and SAP Business Suite Systems COURSE OUTLINE. Course Version: 17 Course Duration: 4 Day(s)

ADM110. Installing and Patching SAP S/4HANA and SAP Business Suite Systems COURSE OUTLINE. Course Version: 18 Course Duration: 4 Day(s)

BC401. ABAP Objects COURSE OUTLINE. Course Version: 18 Course Duration:

SAP HANA SPS 08 - What s New? SAP HANA Modeling (Delta from SPS 07 to SPS 08) SAP HANA Product Management May, 2014

SAP HANA SPS 08 - What s New? SAP HANA Web-based Development Workbench. (Delta from SPS 07 to SPS 08) SAP HANA Product Management May, 2014

SAP Business One Integration Framework

ADM535. DB2 LUW Administration for SAP COURSE OUTLINE. Course Version: Course Duration: 3 Day(s)

Software and Delivery Requirements

BW362. SAP BW Powered by SAP HANA COURSE OUTLINE. Course Version: 11 Course Duration: 5 Day(s)

TADM51. SAP NetWeaver AS - DB Operation (Oracle) COURSE OUTLINE. Course Version: 15 Course Duration: 5 Day(s)

BW350H. SAP BW Powered by SAP HANA - Data Acquisition COURSE OUTLINE. Course Version: 15 Course Duration: 5 Day(s)

SAP HANA Data Warehousing Foundation Data Distribution Optimizer / Data Life Cycle Manager DWF SP03

BIT660 Data Archiving

SAP HANA Operation Expert Summit PLAN - Hardware Landscapes. Addi Brosig, SAP HANA Product Management May 2014

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

opensap TEXT ANALYTICS WITH SAP HANA PLATFORM WEEK 1

RDP203 - Enhanced Support for SAP NetWeaver BW Powered by SAP HANA and Mixed Scenarios. October 2013

COURSE LISTING. Courses Listed. Training for Cloud with SAP Hybris in Sales Cloud (C4C) 25 August 2018 (01:04 BST) Fortgeschrittene.

Analyze Big Data Faster and Store It Cheaper

SAP Hybris Billing, pricing simulation Application Operations Guide Release 2.0, SP03

SAP SMS 365 SAP Messaging Proxy 365 Product Description August 2016 Version 1.0

COURSE LISTING. Courses Listed. Training for Database & Technology with Administration in Database Migration. 3 September 2018 (21:31 BST)

How to create a What If simulation in SAP Analytics Cloud

SAP: Speeding GRC Control Testing by 90% with SAP Solutions for GRC

SAP HANA SPS 08 - What s New? SAP HANA Platform Lifecycle Management (Delta from SPS 07 to SPS 08) SAP HANA Product Management May, 2014

System x Server for SAP Business One, version for SAP HANA

SAP Single Sign-On 2.0 Overview Presentation

opensap: Big Data with SAP HANA Vora Course Week 03 - Exercises

COURSE LISTING. Courses Listed. Training for Cloud with SAP Ariba in Buy Side. 27 July 2018 (05:54 BST) Grundlagen. Fortgeschrittene.

Using SAP SuccessFactors Integration Center for generating exports on Interview Central. SAP SuccessFactors Recruiting Management

Two-Factor Authentication over Mobile: Simplifying Security and Authentication

Transcription:

Week 1 Unit 1: Introduction to Data Science

The next 6 weeks What to expect in the next 6 weeks? 2

Curriculum flow (weeks 1-3) Business & Data Understanding 1 2 3 Data Preparation Modeling (1) Introduction to Data Science Introduction to Project Methodologies Business Understanding Phase Overview Defining Project Success Criteria Data Understanding Phase Overview Initial Data Analysis & Exploratory Data Analysis Weekly Assignment Data Preparation Phase Overview Predictive Modeling Methodology Overview Data Manipulation Selecting Data Variable and Feature Selection Data Encoding Weekly Assignment Modeling Phase Overview Detecting Anomalies Association Analysis Cluster Analysis Classification Analysis with Regression Weekly Assignment 3

Curriculum flow (weeks 4-6) 4 Modeling (2) 5 Evaluation 6 Deployment & Maintenance Classification Analysis with Decision Trees Classification Analysis with KNN, NN, and SVM Time Series Analysis Ensemble Methods Simulation & Optimization Automated Modeling Evaluation Phase Overview Model Performance Metrics Model Testing Improving Model Performance Deployment Phase Overview Deployment Options Monitoring & Maintenance Automating Deployment & Maintenance Myths & Challenges Data Science Applications and References Weekly Assignment Weekly Assignment Weekly Assignment Final Exam 4

Cumulative points lead to record of achievement Watch the deadlines! Participate in Weekly Assignment (Weeks1-6) Final Exam (Week 7) Record of Achievement 6 assignments 6 x 30 = 180 points 180 points When results above 180 points 5

What is data science? Data science is an interdisciplinary field about processes and systems that enable the extraction of knowledge or insights from data. Data science employs techniques and theories drawn from a wide range of disciplines. 6

Data science personas Business Users Data Analysts / Citizen Data Scientists Data Scientists SAP HANA Application Developers Analytics skills from low to high Embedded Analytics Business User / Data Analyst Driven Analytics Custom Analytics Embedded Analytics SAP Suite / Application Innovation / Industry / LoB / CDP SAP Hybris Marketing, IoT Predictive Maintenance, Fraud SAP Predictive Analytics Application Function Modeler (AFM) Predictive in SAP HANA PAL, APL, R, AFLs e.g. UDF, OFL Data Science Solutions from SAP 7

Data science solutions from SAP SAP Predictive Analytics SAP Lumira SAP HANA Studio / AFM SAP RDS Analytics Solutions SAP Industry & LoB Solutions Partner Analytical BI & Tools SAP HANA Predictive Analysis Library (PAL) Text Search Business Function Library Text Analysis and Mining Automated Predictive Library Simulation Optimization Spatial Analysis Graph Engine Rules Engine R SAP IQ HADOOP SAP ESP 3 rd Party Data Source SAP Data Services Data Connectors Data types Connect to SAP HANA directly or via Sybase IQ / Hadoop / ESP / Data Services Transaction Data Unstructured Data Real-Time Data Location Data Machine Data Others 8

SAP HANA Predictive Analysis Library (PAL) Build High-Performance Predictive Apps The SAP HANA Predictive Analysis Library (PAL) is a built-in C++ library for performing in-memory data mining and statistical calculations. PAL is designed to provide high performance on large datasets for real-time analytics. Hadoop / Sybase IQ, Sybase ASE, Teradata Spatial, Machine, Real-Time Data Virtual Tables SAP HANA Main Memory Text Analysis Spatial Data SQLScript Optimized Query Plan PAL R Scripts Unstructured KNN classification K-means ABC classification Weighted score tables R Engine Regression C4.5 decision tree Association analysis: market basket SAP HANA Studio/AFM, Apps & Tools 9

SAP HANA Predictive Analysis Library (PAL) algorithms SAP HANA Predictive Analysis Library (PAL) contains a wide range of algorithms that can be deployed for in-hana and standalone data science applications. A wide range of algorithms are available for the following types of analysis: SAP HANA Predictive Analysis Library Association Analysis Classification Analysis Regression Cluster Analysis Time Series Analysis Probability Distribution Outlier Detection Link Prediction Data Preparation Statistic Functions (Univariate) Statistic Functions (Multivariate) 10

SAP HANA Automated Predictive Library (APL) algorithms SAP HANA APL is an application function library (AFL) that lets you use the data mining capabilities of the SAP Predictive Analytics automated analytics engine on your customer datasets stored in SAP HANA. You can create a wide range of models to answer your business questions. Regression Models Classification Models Social Network Analysis APL Clustering Models Time Series Analysis Recommendation 11

R integration for SAP HANA and standalone Application SAP HANA Database R SQL Interface R Calculation Engine R Operator R Client R Trigger R Write R Rserve Font R Rserve Rserve R Runtime Results Tables Tables 12

SAP Predictive Analytics SAP Predictive Analytics is built for both data scientists and business / data analysts, making predictive analytics accessible to a broad spectrum of users. Automated and expert modes Used to automate data preparation, predictive modeling, and deployment tasks Rich pre-built modelling functionality PAL, APL, and R language support Advanced visualization Native integration with SAP HANA 13

Application function modeler (AFM) Graphical tool to build advanced applications in SAP HANA Web-based flow-graph editor Support for AFL, R, SDI, & SDQ Used to create procedures or task runtime operations Interoperability with SAP HANA studio AFM SAP HANA studio-based AFM PAL function support including time series, clustering, classification, and statistics General usability enhancements for an easier, simpler, and more functional experience 14

Thank you Contact information: open@sap.com

2016 SAP SE or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP SE or an SAP affiliate company. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP SE (or an SAP affiliate company) in Germany and other countries. Please see http://global12.sap.com/corporate-en/legal/copyright/index.epx for additional trademark information and notices. Some software products marketed by SAP SE and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP SE or an SAP affiliate company for informational purposes only, without representation or warranty of any kind, and SAP SE or its affiliated companies shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP SE or SAP affiliate company products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. In particular, SAP SE or its affiliated companies have no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation, and SAP SE s or its affiliated companies strategy and possible future developments, products, and/or platform directions and functionality are all subject to change and may be changed by SAP SE or its affiliated companies at any time for any reason without notice. The information in this document is not a commitment, promise, or legal obligation to deliver any material, code, or functionality. All forwardlooking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions. 16