SAS Enterprise Miner : What does the future hold?

Size: px
Start display at page:

Download "SAS Enterprise Miner : What does the future hold?"

Transcription

1 SAS Enterprise Miner : What does the future hold? David Duling EM Development Director SAS Inc. Sascha Schubert Product Manager Data Mining SAS International

2 Topics for Discussion: EM 4.2/SAS 9.0 AF/SCL Architecture EM 5.0/SAS tier Architecture EM Demo of the Alpha EM 5.0 Java UI

3 EM Two Paths for Two Goals! Evolutionary Development of Data Mining Functionality! Keep up the quality! Upgrade release for current sites! Stay on top of the market! Revolutionary Development of Data Mining Architecture! Address scalability and performance! Address the limitations of current architecture! Make new architecture future-proof Copyright 2002, SAS Institute Inc. All rights reserved.

4 Time Line Project Mercury + DM Apr 02 Jun 02 Nov 02 Feb 03 SAS V9 EM 4.2 Evolutionary Release EA LA GA SAS V9.1 EM 5.0 Revolutionary Release DP EA LA GA

5 Goals for EM 4.2! Maintain current product! Fix known defects! Evolve beta tools to production status! Interactive Grouping! Improve scalability (parallel processing)

6 EM 4.2 Evolve Beta Tools to Production Status! Memory Based Reasoning! DM Neural! Two-Stage Model! Time Series! Link Analysis! J-Score, XML

7 Interactive Grouping Node! Was developed as part of Credit Scoring Solution! Will be fully integrated in EM 4.2 / 5.0! Used to calculate weights of evidence! also useful for general interactive grouping! Interactive grouping of variables into natural groups in relation to target! now possible for class and interval variables

8 Publishing Enterprise Miner Models via the Open Meta Server Save Enterprise Miner Register Read HTTP/JSP WWW clients Search Models Retrieve Models Reports Score code Open Meta Server WWW Server

9 Mining Model Repository! SAS Code, C Code, Java Code! Statistics, Charts, Reports! Input and Output Variables described in XML Process flow report in HTML format Fit and assessment statistics in SAS data sets Cscore code Cscore meta information stored in XML Fit and assessment statistics stored in CSV Target and input data set info stored in text Formats, score, and macro code as SAS code Metadata info about the model in a SAS catalog

10 Performance and Scalability! XOT! enables parallel input (read) of partitioned data sets)! Using XOT for data I/O! TK (Threaded Kernel)! Multi Threading, making use of multiple CPUs! TK for PROC DMDB, PROC DMINE (Vsel), PROC DMREG! Optional for all listed procedures

11 Scale-Up Proc DMINE Stones (S64) 64 bit Solaris - 8 CPUs Time XOT-TK Unthreaded Number of Threads

12 Benchmarking TK (Proc DMDB) 100K obs 100 interval vars 100K obs 50 interval vars 50 class vars 100K obs 50 class vars Single Threaded real time 7.77 seconds cpu time 7.77 seconds real time seconds cpu time seconds real time seconds cpu time seconds Multi-Threaded (4 Threads) real time 1.95 seconds cpu time 4.82 seconds real time 1.95 seconds cpu time 4.82 seconds real time seconds cpu time seconds 5M obs 2 interval vars real time 6.50 seconds cpu time 6.50 seconds real time 1.51 seconds cpu time 4.92 seconds

13 EM 5.0 The Future of Enterprise Miner

14 Plans for EM 5.0! Create a new 3-tier architecture SAS server - Batch and interactive modes - Use existing tools and expertise Java foundation services - Metadata services - Configuration management Java client - API Integration projects - GUI Swing-based Data Mining from everywhere

15 Goals for EM 5.0 Create a new EM 5.0! SAS server Batch and interactive modes Use existing tools and expertise! Java middleware Metadata services Configuration management! Java client API Integration projects GUI Swing-based New procedures PATH production ARBOR production (replace split) TAXONOMY experimental SVM experimental Production version of MFC Tree viewer PROC ARBOR IOM procedure interface for interactive training Production Model Repository EM 5.0 model registration EM 4.2 model registration Web GUI Warehouse Admin. Scoring

16 Current AF / SCL Architecture Project persistence SAS Server Data Persistence SAS Version 8.2 EM 4.x classes SAS Version 8.2 SAS EM Client! SAS AF/SCL Infrastructure! Project Stored Locally on the Windows Client as well as the SAS installation! EM models trained on EM server (single threaded)

17 Distributed Architecture in EM 5.0 Data Mining Compute Server Project Data Persistence SAS System Metadata Persistence EM 5.0 Java API EM 5.0 Java UI Java EM Client Middleware Server EM 5.0 Java Middlware

18 Distributed Architecture in EM 5.0 Reporting Project Data Persistence Compute Server SAS System Metadata Persistence EM 5.0 Java API EM 5.0 Java UI Middleware Server EM 5.0 Java Middlware JSP Server SAS Open Metadata Server Web Client

19 Distributed Architecture in EM 5.0 Warehousing Compute Server Project Data Persistence SAS System Metadata Persistence EM 5.0 Java API EM 5.0 Java UI Middleware Server EM 5.0 Java Middlware JSP Server SAS Open Metadata Server Web Client Data Builder Java Client

20 EM 5.0 Configuration Options! Stand alone client! SAS Server, Java middleware, GUI on the same machine! Client server! SAS server, Java middleware server, clients connect through Java GUI! Distributed computing! All components on different machines, user connect from anywhere

21 Reasons for n-tier Architecture Client 1 SAS Server Client 1 SAS Server EM Server Client 2 OMS Client 2 OMS Central administration Easier thin-client deployment Reduce client footprint Offers centralized location for file storage Improved security control of all login processes Easier configuration More persistence options controlled by administrator Better resource monitoring Who s using the system How many processes are running Copyright 2002, SAS Institute Inc. All rights reserved.

22 New GUI Based on Java Swing! Improved Graphics! Deployed through the web allowing multiple user access! Platform independent! Server independent! Configurable! On-line help! Extendable! XML import/export of diagrams! Start and stop processes

23 Sample EM 5.0 Results Exploratory Plots Assessment Plots

24 Interactive Tree Results Viewer

25 EM5.0 Reporting! SPK=SAS Publish and Subscribe! SAS distributes a package reader! Tables stored as CSV files => activate MS Excel! Can be registered in OMS and Model Repository

26 Enhanced Performance! Uses MP CONNECT technologies to distribute mining processes across multiple CPUs providing the ability to run nodes in parallel.! DMINE and DMREG procedures have been reengineered to take advantage of the TK and XOT frameworks of V9.! Supports Stop Processing of an EM process.

27 User 1 User 2 EM 5.0 Performance! GUI sessions get dedicated SAS/IOM workspace Middleware IOM user session: user1 IOM user session: user2 IOM process session: user2 SAS: Train Model 1 SAS Server! Model training gets dedicated SAS/IOM workspace! Parallel branches in process flow run in dedicated SAS/IOM workspaces! xot procedures with spds libname engine start multiple data read threads! tk enabled procedures start multiple computational threads SAS: Train Model 2 tk 1 tk 2 tk 3 tk 4 Server Operating System CPU CPU CPU CPU Event Threads Total User 1 Connects 1 1 User 2 Connects 1 2 User 2 Starts process 1 3 User 2 Disconnects -1 2 Process starts model 1 training 1 3 Process starts model 2 training 1 4 Model 2 starts four threads running 4 8 Model 2 completes -4 4 Process completes -3 1 User 2 Reconnects 1 2

28 EM 5.0 Batch Processing! Java API/UI for batch processing Runs in middleware Opens existing workspace and starts training process Loads XML diagram files! XML files API Save entire diagrams as XML files Mail from one user to another Scheduled execution %EM5(xmlfile=) macro for running diagrams!data set API Nodes data set: all nodes and properties Connections data set: flow of logic from one node to another Actions data set: nodes and actions to perform on nodes Workspace data set: library and files locations Variables meta data sets: input, target, rejected, etc %EM5(nodes=,connect=, ) macro for running diagrams

29 EM 5.0 Batch Processing! Compatible with all EM5 file structures! Run the same diagram from UI or batch! Automate model training from diagrams built in the GUI! All SAS language capabilities! Encapsulates EM processing! BATCH.SAS always created for every node! Automate creation of new diagrams! Distribute diagrams! Consulting: initial setup and delivery! May include results, or not

30 EM 5.0 Batch Processing! API to Allow Java Programs to Call EM! String ids_id=myworkspace.addnode( Datasource );! String reg_id=myworkspace.addnode( Regression );! myworkspace.connectnode(ids_id,reg_id);! myworkspace.runnode(reg_id);

31 Integrated with OMS and Data Builder! OMS persists metadata about SAS servers, EM project locations, results packages, and data dictionaries for training tables! Scoring processes as well as input/output data sets can be defined and exchanged with other SAS companion products through registration of EM metadata and processes within the SAS OMR.

32 Other Major Enhancements! New Mining Algorithms:! Support Vector Machines popular algorithm for general classification problems! Web Path Analysis provides efficient and scalable mining of frequent paths from click-stream data.! Taxonomy supports hierarchical associations to populate rules at different levels in the hierarchy.! Improved decision tree algorithm to enable interactive training on the server and provide improved performance of disk resident data.

33 New Procedures! PROC PATH! PROC SVM! PROC ARBOR! PROX TAXONOMY

34 New Path node (production)! PROC PATH - a new procedure to mine frequent paths from preprocessed click stream data! Features:! Efficient, scalable and fast! Path completion - Reintroduce missing requests (e.g., back button clicks)! Detecting path breaks - Identify separate subpaths! Generating longest contiguous sub-paths! Correctly handling page reload requests

35 Path Analysis! Improved customer experience! Tuning web-site structure based on browsing patterns! Build customer relationships! Customizing content at individual or segment level! Real-time target marketing! Cross-sell, up-sell product recommendations! Ad/Rebate placement! Predict site abandonment! Browsing behavior as input to predictive modeling! Segmentation based on browsing behavior

36 Support Vector Machines (experimental)! Supervised learning tool for creating functions from a set of labeled training data! A binary classifier! A general regression function! Applications! Suitable for general classification problems! Text Categorization! Biosequence Analysis; Micro Arrays

37 SVM Classification is achieved by a linear or nonlinear separating surface in the input space of the dataset.! Linear SVMs operate by finding a hypersurface in the space of possible inputs. This hypersurface will attempt to split the positive examples from the negative examples. The split will be chosen to have the largest distance from the hypersurface to the nearest of the positive and negative examples.! If the training examples are not linearly separable, SVMs work by mapping the training data into a higher dimension feature space using an appropriate kernel function.

38 Other new Nodes/Procedures! Taxonomy Hierarchical associations (exp)! ARBOR Replacement for SPLIT.! Support client/server interactive training As an interactive procedure As an engine for a client side Windows Application! Improved performance of disk-resident data! Documented at the level of SAS/STAT procedures! All procedures will use a dynamic DMDB! No permanent physical DMDB data set is created

39 Early Adopters for EM 5! Looking for Early Adopters in SeUGI time frame! 5 20 sites worldwide recommended from local offices! Different regions and different industries! Following scenarios

40 Early Adopters for EM 5! Following scenarios desired! distribute the EM Java thin client to multiple users that are geographically dispersed to test 3-tier architecture! small to medium sized firm to evaluate EM 5.0 running entirely on a local client! site to test Java API to integrate EM analytics and scoring services into site specific mining applications! site to test EM analytical deployment test Model Repository! sites with excellent statistical/ai modeling skills and applications to evaluate the new algorithms (SVM, Path analysis node, Interactive Tree, Hierarchical Associations)

41 EM 5.0 Summary! Delivered as a modern, distributed client-server system for data mining! Enables wide area collaboration on data mining projects and extensive integration opportunities! SAS server uses new parallel and multi-processing features of the SAS V9.0 system and includes an API for running data mining processes and for adding new data mining tools.! Java middleware manages SAS server sessions, user identity, metadata, and report delivery.! Data mining sessions can be created and managed through a Java API.! The user interface is based on Java Swing libraries containing advanced graphics and visualization techniques! New mining algorithms

42 EM Summary! Provide renowned data mining functionality based on modern future-proof architecture! Clear differentiation between data processing, meta data management and flexible user interface! Architecture open for integration with other SAS and 3 rd party applications! Ensure backward compatibility by parallel maintenance of traditional AF solution

43 Other Data Mining Presentations at SeUGI! Wed, 16:25, TKC Distributed Data Mining with SAS Enterprise Miner! Wed, 11:40, Analytical Expertise stream, SAS Text Miner! Wed, 17:05, TKC, SAS Text Mining! Analytical Demo Station in TKC

44 DEMO

45

Enterprise Miner Version 4.0. Changes and Enhancements

Enterprise Miner Version 4.0. Changes and Enhancements Enterprise Miner Version 4.0 Changes and Enhancements Table of Contents General Information.................................................................. 1 Upgrading Previous Version Enterprise Miner

More information

Enterprise Miner Software: Changes and Enhancements, Release 4.1

Enterprise Miner Software: Changes and Enhancements, Release 4.1 Enterprise Miner Software: Changes and Enhancements, Release 4.1 The correct bibliographic citation for this manual is as follows: SAS Institute Inc., Enterprise Miner TM Software: Changes and Enhancements,

More information

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:

More information

ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA

ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA INSIGHTS@SAS: ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA AGENDA 09.00 09.15 Intro 09.15 10.30 Analytics using SAS Enterprise Guide Ellen Lokollo 10.45 12.00 Advanced Analytics using SAS

More information

Page 1. Oracle9i OLAP. Agenda. Mary Rehus Sales Consultant Patrick Larkin Vice President, Oracle Consulting. Oracle Corporation. Business Intelligence

Page 1. Oracle9i OLAP. Agenda. Mary Rehus Sales Consultant Patrick Larkin Vice President, Oracle Consulting. Oracle Corporation. Business Intelligence Oracle9i OLAP A Scalable Web-Base Business Intelligence Platform Mary Rehus Sales Consultant Patrick Larkin Vice President, Oracle Consulting Agenda Business Intelligence Market Oracle9i OLAP Business

More information

SAS Enterprise Miner 7.1

SAS Enterprise Miner 7.1 SAS Enterprise Miner 7.1 Data Mining using SAS IASRI Satyajit Dwivedi Transforming the World DATA MINING SEMMA Process Sample Explore Modify Model Assess Utility 2 SEMMA Process - Creating Library Select

More information

ENTERPRISE MINER: 1 DATA EXPLORATION AND VISUALISATION

ENTERPRISE MINER: 1 DATA EXPLORATION AND VISUALISATION ENTERPRISE MINER: 1 DATA EXPLORATION AND VISUALISATION JOZEF MOFFAT, ANALYTICS & INNOVATION PRACTICE, SAS UK 10, MAY 2016 DATA EXPLORATION AND VISUALISATION AGENDA SAS Webinar 10th May 2016 at 10:00 AM

More information

Gain Greater Productivity in Enterprise Data Mining

Gain Greater Productivity in Enterprise Data Mining Clementine 9.0 Specifications Gain Greater Productivity in Enterprise Data Mining Discover patterns and associations in your organization s data and make decisions that lead to significant, measurable

More information

Introducing SAS Model Manager 15.1 for SAS Viya

Introducing SAS Model Manager 15.1 for SAS Viya ABSTRACT Paper SAS2284-2018 Introducing SAS Model Manager 15.1 for SAS Viya Glenn Clingroth, Robert Chu, Steve Sparano, David Duling SAS Institute Inc. SAS Model Manager has been a popular product since

More information

Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience. IOUG 2003 Paper 406

Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience. IOUG 2003 Paper 406 Developing Applications with Business Intelligence Beans and Oracle9i JDeveloper: Our Experience IOUG 2003 Paper 406 Chris Claterbos claterbos@vlamis.com Vlamis Software Solutions, Inc. (816) 781-2880

More information

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS

Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Outrun Your Competition With SAS In-Memory Analytics Sascha Schubert Global Technology Practice, SAS Topics AGENDA Challenges with Big Data Analytics How SAS can help you to minimize time to value with

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

to-end Solution Using OWB and JDeveloper to Analyze Your Data Warehouse

to-end Solution Using OWB and JDeveloper to Analyze Your Data Warehouse An End-to to-end Solution Using OWB and JDeveloper to Analyze Your Data Warehouse Presented at ODTUG 2003 Dan Vlamis dvlamis@vlamis.com Vlamis Software Solutions, Inc. (816) 781-2880 http://www.vlamis.com

More information

Accessibility Features in the SAS Intelligence Platform Products

Accessibility Features in the SAS Intelligence Platform Products 1 CHAPTER 1 Overview of Common Data Sources Overview 1 Accessibility Features in the SAS Intelligence Platform Products 1 SAS Data Sets 1 Shared Access to SAS Data Sets 2 External Files 3 XML Data 4 Relational

More information

Certkiller.A QA

Certkiller.A QA Certkiller.A00-260.70.QA Number: A00-260 Passing Score: 800 Time Limit: 120 min File Version: 3.3 It is evident that study guide material is a victorious and is on the top in the exam tools market and

More information

Optimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower

Optimizing Your Analytics Life Cycle with SAS & Teradata. Rick Lower Optimizing Your Analytics Life Cycle with SAS & Teradata Rick Lower 1 Agenda The Analytic Life Cycle Common Problems SAS & Teradata solutions Analytical Life Cycle Exploration Explore All Your Data Preparation

More information

Intelligence Platform

Intelligence Platform SAS Publishing SAS Overview Second Edition Intelligence Platform The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Intelligence Platform: Overview, Second Edition.

More information

Data Mining Overview. CHAPTER 1 Introduction to SAS Enterprise Miner Software

Data Mining Overview. CHAPTER 1 Introduction to SAS Enterprise Miner Software 1 CHAPTER 1 Introduction to SAS Enterprise Miner Software Data Mining Overview 1 Layout of the SAS Enterprise Miner Window 2 Using the Application Main Menus 3 Using the Toolbox 8 Using the Pop-Up Menus

More information

Types of Data Mining

Types of Data Mining Data Mining and The Use of SAS to Deploy Scoring Rules South Central SAS Users Group Conference Neil Fleming, Ph.D., ASQ CQE November 7-9, 2004 2W Systems Co., Inc. Neil.Fleming@2WSystems.com 972 733-0588

More information

1 Copyright 2013, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2013, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2013, Oracle and/or its affiliates. All rights reserved. Oracle Application Express 2 Copyright 2013, Oracle and/or its affiliates. All rights reserved. Fully supported no-cost feature of Oracle

More information

What Is SAS? CHAPTER 1 Essential Concepts of Base SAS Software

What Is SAS? CHAPTER 1 Essential Concepts of Base SAS Software 3 CHAPTER 1 Essential Concepts of Base SAS Software What Is SAS? 3 Overview of Base SAS Software 4 Components of the SAS Language 4 SAS Files 4 SAS Data Sets 5 External Files 5 Database Management System

More information

Enterprise Miner Tutorial Notes 2 1

Enterprise Miner Tutorial Notes 2 1 Enterprise Miner Tutorial Notes 2 1 ECT7110 E-Commerce Data Mining Techniques Tutorial 2 How to Join Table in Enterprise Miner e.g. we need to join the following two tables: Join1 Join 2 ID Name Gender

More information

What s New In Sawmill 8 Why Should I Upgrade To Sawmill 8?

What s New In Sawmill 8 Why Should I Upgrade To Sawmill 8? What s New In Sawmill 8 Why Should I Upgrade To Sawmill 8? Sawmill 8 is a major new version of Sawmill, the result of several years of development. Nearly every aspect of Sawmill has been enhanced, and

More information

Test On Line: reusing SAS code in WEB applications Author: Carlo Ramella TXT e-solutions

Test On Line: reusing SAS code in WEB applications Author: Carlo Ramella TXT e-solutions Test On Line: reusing SAS code in WEB applications Author: Carlo Ramella TXT e-solutions Chapter 1: Abstract The Proway System is a powerful complete system for Process and Testing Data Analysis in IC

More information

Paper SAS Taming the Rule. Charlotte Crain, Chris Upton, SAS Institute Inc.

Paper SAS Taming the Rule. Charlotte Crain, Chris Upton, SAS Institute Inc. ABSTRACT Paper SAS2620-2016 Taming the Rule Charlotte Crain, Chris Upton, SAS Institute Inc. When business rules are deployed and executed--whether a rule is fired or not if the rule-fire outcomes are

More information

CMPUT 695 Fall 2004 Assignment 2 Xelopes

CMPUT 695 Fall 2004 Assignment 2 Xelopes CMPUT 695 Fall 2004 Assignment 2 Xelopes Paul Nalos, Ben Chu November 5, 2004 1 Introduction We evaluated Xelopes, a data mining library produced by prudsys 1. Xelopes is available for Java, C++, and CORBA

More information

Pre-Requisites: CS2510. NU Core Designations: AD

Pre-Requisites: CS2510. NU Core Designations: AD DS4100: Data Collection, Integration and Analysis Teaches how to collect data from multiple sources and integrate them into consistent data sets. Explains how to use semi-automated and automated classification

More information

Using SAS Enterprise Guide with the WIK

Using SAS Enterprise Guide with the WIK Using SAS Enterprise Guide with the WIK Philip Mason, Wood Street Consultants Ltd, United Kingdom ABSTRACT Enterprise Guide provides an easy to use interface to SAS software for users to create reports

More information

SAS offers technology to facilitate working with CDISC standards : the metadata perspective.

SAS offers technology to facilitate working with CDISC standards : the metadata perspective. SAS offers technology to facilitate working with CDISC standards : the metadata perspective. Mark Lambrecht, PhD Principal Consultant, Life Sciences SAS Agenda SAS actively supports CDISC standards Tools

More information

The PMBR Procedure. Overview Procedure Syntax PROC PMBR Statement VAR Statement TARGET Statement CLASS Statement. The PMBR Procedure

The PMBR Procedure. Overview Procedure Syntax PROC PMBR Statement VAR Statement TARGET Statement CLASS Statement. The PMBR Procedure The PMBR Procedure Overview Procedure Syntax PROC PMBR Statement VAR Statement TARGET Statement CLASS Statement Overview The PMBR procedure is used for prediction as an alternative to other predictive

More information

Oracle Machine Learning Notebook

Oracle Machine Learning Notebook Oracle Machine Learning Notebook Included in Autonomous Data Warehouse Cloud Charlie Berger, MS Engineering, MBA Sr. Director Product Management, Machine Learning, AI and Cognitive Analytics charlie.berger@oracle.com

More information

INTRODUCTION... 2 FEATURES OF DARWIN... 4 SPECIAL FEATURES OF DARWIN LATEST FEATURES OF DARWIN STRENGTHS & LIMITATIONS OF DARWIN...

INTRODUCTION... 2 FEATURES OF DARWIN... 4 SPECIAL FEATURES OF DARWIN LATEST FEATURES OF DARWIN STRENGTHS & LIMITATIONS OF DARWIN... INTRODUCTION... 2 WHAT IS DATA MINING?... 2 HOW TO ACHIEVE DATA MINING... 2 THE ROLE OF DARWIN... 3 FEATURES OF DARWIN... 4 USER FRIENDLY... 4 SCALABILITY... 6 VISUALIZATION... 8 FUNCTIONALITY... 10 Data

More information

Data Set. What is Data Mining? Data Mining (Big Data Analytics) Illustrative Applications. What is Knowledge Discovery?

Data Set. What is Data Mining? Data Mining (Big Data Analytics) Illustrative Applications. What is Knowledge Discovery? Data Mining (Big Data Analytics) Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://user.engineering.uiowa.edu/~ankusiak/

More information

Automatic Detection of Section Membership for SAS Conference Paper Abstract Submissions: A Case Study

Automatic Detection of Section Membership for SAS Conference Paper Abstract Submissions: A Case Study 1746-2014 Automatic Detection of Section Membership for SAS Conference Paper Abstract Submissions: A Case Study Dr. Goutam Chakraborty, Professor, Department of Marketing, Spears School of Business, Oklahoma

More information

Tools to Develop New Linux Applications

Tools to Develop New Linux Applications Tools to Develop New Linux Applications IBM Software Development Platform Tools for every member of the Development Team Supports best practices in Software Development Analyst Architect Developer Tester

More information

Intellicus Getting Started

Intellicus Getting Started Intellicus Getting Started Intellicus Web-based Reporting Suite Version 4.5 Enterprise Professional Smart Developer Smart Viewer Intellicus Technologies info@intellicus.com www.intellicus.com Copyright

More information

Enterprise Guide. Delivering Information to the People Who Need to Know. Adrian Bruty Product Manager Business Intelligence SAS EMEA

Enterprise Guide. Delivering Information to the People Who Need to Know. Adrian Bruty Product Manager Business Intelligence SAS EMEA Enterprise Guide Delivering Information to the People Who Need to Know Adrian Bruty Product Manager Business Intelligence SAS EMEA Introduction Enterprise Guide Objectives Version 1.2 Demo 2 Challenges

More information

HYPERION SYSTEM 9 PERFORMANCE SCORECARD

HYPERION SYSTEM 9 PERFORMANCE SCORECARD HYPERION SYSTEM 9 PERFORMANCE SCORECARD RELEASE 9.2 NEW FEATURES Welcome to Hyperion System 9 Performance Scorecard, Release 9.2. This document describes the new or modified features in this release. C

More information

An Interactive GUI Front-End for a Credit Scoring Modeling System by Jeffrey Morrison, Futian Shi, and Timothy Lee

An Interactive GUI Front-End for a Credit Scoring Modeling System by Jeffrey Morrison, Futian Shi, and Timothy Lee An Interactive GUI Front-End for a Credit Scoring Modeling System by Jeffrey Morrison, Futian Shi, and Timothy Lee Abstract The need for statistical modeling has been on the rise in recent years. Banks,

More information

Security and Performance advances with Oracle Big Data SQL

Security and Performance advances with Oracle Big Data SQL Security and Performance advances with Oracle Big Data SQL Jean-Pierre Dijcks Oracle Redwood Shores, CA, USA Key Words SQL, Oracle, Database, Analytics, Object Store, Files, Big Data, Big Data SQL, Hadoop,

More information

Netezza The Analytics Appliance

Netezza The Analytics Appliance Software 2011 Netezza The Analytics Appliance Michael Eden Information Management Brand Executive Central & Eastern Europe Vilnius 18 October 2011 Information Management 2011IBM Corporation Thought for

More information

Qlik Sense Enterprise architecture and scalability

Qlik Sense Enterprise architecture and scalability White Paper Qlik Sense Enterprise architecture and scalability June, 2017 qlik.com Platform Qlik Sense is an analytics platform powered by an associative, in-memory analytics engine. Based on users selections,

More information

SAS E-MINER: AN OVERVIEW

SAS E-MINER: AN OVERVIEW SAS E-MINER: AN OVERVIEW Samir Farooqi, R.S. Tomar and R.K. Saini I.A.S.R.I., Library Avenue, Pusa, New Delhi 110 012 Samir@iasri.res.in; tomar@iasri.res.in; saini@iasri.res.in Introduction SAS Enterprise

More information

SAS STUDIO. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada. Copyr i g ht 2014, SAS Ins titut e Inc. All rights res er ve d.

SAS STUDIO. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada. Copyr i g ht 2014, SAS Ins titut e Inc. All rights res er ve d. JUNE 2014 PRESENTER: MARY HARDING Education SAS Canada NEW SAS PROGRAMMING ENVIRONMENT Available Consistent Assistive AVAILABLE THROUGH ALL MODERN WEB BROWSERS Available Consistent Assistive ONE INTERFACE

More information

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R

Dr. SubraMANI Paramasivam. Think & Work like a Data Scientist with SQL 2016 & R Dr. SubraMANI Paramasivam Think & Work like a Data Scientist with SQL 2016 & R About the Speaker Group Leader Dr. SubraMANI Paramasivam PhD., MVP, MCT, MCSE (x2), MCITP (x2), MCP, MCTS (x3), MCSA CEO,

More information

Liberate, a component-based service orientated reporting architecture

Liberate, a component-based service orientated reporting architecture Paper TS05 PHUSE 2006 Liberate, a component-based service orientated reporting architecture Paragon Global Services Ltd, Huntingdon, U.K. - 1 - Contents CONTENTS...2 1. ABSTRACT...3 2. INTRODUCTION...3

More information

OASUS Spring 2014 Questions and Answers

OASUS Spring 2014 Questions and Answers OASUS Spring 2014 Questions and Answers The following answers are provided to the benefit of the OASUS Users Group and are not meant to replace SAS Technical Support. Also, the Enterprise Guide project

More information

SAS Enterprise Miner : Tutorials and Examples

SAS Enterprise Miner : Tutorials and Examples SAS Enterprise Miner : Tutorials and Examples SAS Documentation February 13, 2018 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2017. SAS Enterprise Miner : Tutorials

More information

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE Luigi Grimaudo (luigi.grimaudo@polito.it) DataBase And Data Mining Research Group (DBDMG) Summary RapidMiner project Strengths

More information

DEPARTMENT OF COMPUTER SCIENCE

DEPARTMENT OF COMPUTER SCIENCE Department of Computer Science 1 DEPARTMENT OF COMPUTER SCIENCE Office in Computer Science Building, Room 279 (970) 491-5792 cs.colostate.edu (http://www.cs.colostate.edu) Professor L. Darrell Whitley,

More information

Summary. RapidMiner Project 12/13/2011 RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE

Summary. RapidMiner Project 12/13/2011 RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE RAPIDMINER FREE SOFTWARE FOR DATA MINING, ANALYTICS AND BUSINESS INTELLIGENCE Luigi Grimaudo (luigi.grimaudo@polito.it) DataBase And Data Mining Research Group (DBDMG) Summary RapidMiner project Strengths

More information

SAS Factory Miner 14.2: User s Guide

SAS Factory Miner 14.2: User s Guide SAS Factory Miner 14.2: User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2016. SAS Factory Miner 14.2: User s Guide. Cary, NC: SAS Institute

More information

Installation and Configuration Instructions. SAS Model Manager API. Overview

Installation and Configuration Instructions. SAS Model Manager API. Overview Installation and Configuration Instructions SAS Model Manager 2.1 This document is intended to guide an administrator through the pre-installation steps, the installation process, and post-installation

More information

Empowering Self-Service Capabilities with Agile Analytics

Empowering Self-Service Capabilities with Agile Analytics Empowering Self-Service Capabilities with Agile Analytics Paul Segal, Teradata Corporation, San Diego, California Tho Nguyen, Teradata Corporation, Raleigh, North Carolina Bob Matsey Teradata Corporation,

More information

ORACLE SERVICES FOR APPLICATION MIGRATIONS TO ORACLE HARDWARE INFRASTRUCTURES

ORACLE SERVICES FOR APPLICATION MIGRATIONS TO ORACLE HARDWARE INFRASTRUCTURES ORACLE SERVICES FOR APPLICATION MIGRATIONS TO ORACLE HARDWARE INFRASTRUCTURES SERVICE, SUPPORT AND EXPERT GUIDANCE FOR THE MIGRATION AND IMPLEMENTATION OF YOUR ORACLE APPLICATIONS ON ORACLE INFRASTRUCTURE

More information

An Interactive GUI Front-End for a Credit Scoring Modeling System

An Interactive GUI Front-End for a Credit Scoring Modeling System Paper 6 An Interactive GUI Front-End for a Credit Scoring Modeling System Jeffrey Morrison, Futian Shi, and Timothy Lee Knowledge Sciences & Analytics, Equifax Credit Information Services, Inc. Abstract

More information

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content

EMC Documentum xdb. High-performance native XML database optimized for storing and querying large volumes of XML content DATA SHEET EMC Documentum xdb High-performance native XML database optimized for storing and querying large volumes of XML content The Big Picture Ideal for content-oriented applications like dynamic publishing

More information

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis. www..com www..com Set No.1 1. a) What is data mining? Briefly explain the Knowledge discovery process. b) Explain the three-tier data warehouse architecture. 2. a) With an example, describe any two schema

More information

Using Java to Front SAS Software: A Detailed Design for Internet Information Delivery

Using Java to Front SAS Software: A Detailed Design for Internet Information Delivery Using Java to Front SAS Software: A Detailed Design for Internet Information Delivery Jonathan Stokes, JJT Inc., Austin, Texas, USA ABSTRACT As the Java language evolves from its rudimentary stages into

More information

SAS Data Integration Studio 3.3. User s Guide

SAS Data Integration Studio 3.3. User s Guide SAS Data Integration Studio 3.3 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Data Integration Studio 3.3: User s Guide. Cary, NC: SAS Institute

More information

System Requirements. SAS Profitability Management 2.3. Deployment Options. Supported Operating Systems and Versions. Windows Server Operating Systems

System Requirements. SAS Profitability Management 2.3. Deployment Options. Supported Operating Systems and Versions. Windows Server Operating Systems SAS Profitability Management 2.3 This document provides the requirements for installing and running SAS Profitability Management. This document has been updated for the first maintenance release of SAS

More information

ThinProway A Java client to a SAS application. A successful story. Exactly what you need?

ThinProway A Java client to a SAS application. A successful story. Exactly what you need? ThinProway A Java client to a SAS application. A successful story. Exactly what you need? Author: Riccardo Proni TXT Ingegneria Informatica Abstract ThinProway is a software solution dedicated to the manufacturing

More information

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry Data Mining Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335 5934

More information

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry Data Mining Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University it of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335

More information

Specialist ICT Learning

Specialist ICT Learning Specialist ICT Learning APPLIED DATA SCIENCE AND BIG DATA ANALYTICS GTBD7 Course Description This intensive training course provides theoretical and technical aspects of Data Science and Business Analytics.

More information

Microsoft SharePoint 2010 The business collaboration platform for the Enterprise and the Web. We have a new pie!

Microsoft SharePoint 2010 The business collaboration platform for the Enterprise and the Web. We have a new pie! Microsoft SharePoint 2010 The business collaboration platform for the Enterprise and the Web We have a new pie! 2 Introduction Key Session Objectives Agenda More Scalable More Flexible More Features Intranet

More information

Metadata implementation for a Business Intelligence environment. Yuriy Verbitskiy William Yeoh Andy Koronios

Metadata implementation for a Business Intelligence environment. Yuriy Verbitskiy William Yeoh Andy Koronios Metadata implementation for a Business Intelligence environment Yuriy Verbitskiy William Yeoh Andy Koronios Outline Research background Introduction to BI and its issues Research methodology Research environment

More information

ArcGIS Enterprise: Architecture & Deployment. Anthony Myers

ArcGIS Enterprise: Architecture & Deployment. Anthony Myers ArcGIS Enterprise: Architecture & Deployment Anthony Myers 1 2 3 4 5 Web GIS Overview of ArcGIS Enterprise Federation & Hosted Server Deployment Patterns Implementation 1 Web GIS ArcGIS Enabling GIS for

More information

Community edition(open-source) Enterprise edition

Community edition(open-source) Enterprise edition Suseela Bhaskaruni Rapid Miner is an environment for machine learning and data mining experiments. Widely used for both research and real-world data mining tasks. Software versions: Community edition(open-source)

More information

SAS System Powers Web Measurement Solution at U S WEST

SAS System Powers Web Measurement Solution at U S WEST SAS System Powers Web Measurement Solution at U S WEST Bob Romero, U S WEST Communications, Technical Expert - SAS and Data Analysis Dale Hamilton, U S WEST Communications, Capacity Provisioning Process

More information

Oracle9i Data Mining. Data Sheet August 2002

Oracle9i Data Mining. Data Sheet August 2002 Oracle9i Data Mining Data Sheet August 2002 Oracle9i Data Mining enables companies to build integrated business intelligence applications. Using data mining functionality embedded in the Oracle9i Database,

More information

Oracle Mobile Hub. Complete Mobile Platform

Oracle Mobile Hub. Complete Mobile Platform Oracle Mobile Hub Mobile is everywhere and has changed nearly every facet of our lives. The way we work, play, socialize and interact with one another have all been revolutionized by mobile devices. More

More information

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation SAS Enterprise Miner Performance on IBM System p 570 Jan, 2008 Hsian-Fen Tsao Brian Porter Harry Seifert IBM Corporation Copyright IBM Corporation, 2008. All Rights Reserved. TABLE OF CONTENTS ABSTRACT...3

More information

A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database

A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database Paper 11 A SAS/AF Application for Parallel Extraction, Transformation, and Scoring of a Very Large Database Daniel W. Kohn, Ph.D., Torrent Systems Inc., Cambridge, MA David L. Kuhn, Ph.D., Innovative Idea

More information

SAS Platform Strategy Prepared for FANS usergroup. Mike Frost, Director, Product Management Fiona McNeill, Global Product Marketing

SAS Platform Strategy Prepared for FANS usergroup. Mike Frost, Director, Product Management Fiona McNeill, Global Product Marketing SAS Platform Strategy Prepared for FANS usergroup Mike Frost, Director, Product Management Fiona McNeill, Global Product Marketing Information is subject to change. Q1 2017 Q2 2017 Q3 2017 Q4 2017 H1

More information

Scoring with Analytic Stores

Scoring with Analytic Stores Scoring with Analytic Stores Merve Yasemin Tekbudak, SAS Institute Inc., Cary, NC In supervised learning, scoring is the process of applying a previously built predictive model to a new data set in order

More information

Web Serving Architectures

Web Serving Architectures Web Serving Architectures Paul Dantzig IBM Global Services 2000 without the express written consent of the IBM Corporation is prohibited Contents Defining the Problem e-business Solutions e-business Architectures

More information

SAS Enterprise Miner 14.1

SAS Enterprise Miner 14.1 SAS Enterprise Miner 14.1 Administration and Configuration SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS Enterprise Miner 14.1: Administration

More information

Grid Computing Systems: A Survey and Taxonomy

Grid Computing Systems: A Survey and Taxonomy Grid Computing Systems: A Survey and Taxonomy Material for this lecture from: A Survey and Taxonomy of Resource Management Systems for Grid Computing Systems, K. Krauter, R. Buyya, M. Maheswaran, CS Technical

More information

Oracle Database Competency Center

Oracle Database Competency Center Oracle Database Competency Center Suchai Yenruedee Consulting & Customer Support Director Advanced Solutions Application Hosting Services Database Competency Center Space: 167.54 sqm. Location: 7th Floor

More information

Oracle 1Z0-515 Exam Questions & Answers

Oracle 1Z0-515 Exam Questions & Answers Oracle 1Z0-515 Exam Questions & Answers Number: 1Z0-515 Passing Score: 800 Time Limit: 120 min File Version: 38.7 http://www.gratisexam.com/ Oracle 1Z0-515 Exam Questions & Answers Exam Name: Data Warehousing

More information

Business Intelligence Roadmap HDT923 Three Days

Business Intelligence Roadmap HDT923 Three Days Three Days Prerequisites Students should have experience with any relational database management system as well as experience with data warehouses and star schemas. It would be helpful if students are

More information

1 Topic. Image classification using Knime.

1 Topic. Image classification using Knime. 1 Topic Image classification using Knime. The aim of image mining is to extract valuable knowledge from image data. In the context of supervised image classification, we want to assign automatically a

More information

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SHRI ANGALAMMAN COLLEGE OF ENGINEERING & TECHNOLOGY (An ISO 9001:2008 Certified Institution) SIRUGANOOR,TRICHY-621105. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year / Semester: IV/VII CS1011-DATA

More information

9. Conclusions. 9.1 Definition KDD

9. Conclusions. 9.1 Definition KDD 9. Conclusions Contents of this Chapter 9.1 Course review 9.2 State-of-the-art in KDD 9.3 KDD challenges SFU, CMPT 740, 03-3, Martin Ester 419 9.1 Definition KDD [Fayyad, Piatetsky-Shapiro & Smyth 96]

More information

SAS Enterprise Miner: Code Node Tips

SAS Enterprise Miner: Code Node Tips SAS Enterprise Miner: Code Node Tips October 16, 2013 Lorne Rothman, PhD, PStat, Principal Statistician Lorne.Rothman@sas.com SAS Institute (Canada) Inc. Copyright 2010 SAS Institute Inc. All rights reserved.

More information

Enterprise Client Software for the Windows Platform

Enterprise Client Software for the Windows Platform Paper 154 Enterprise Client Software for the Windows Platform Gail Kramer, SAS Institute Inc., Cary, NC Carol Rigsbee, SAS Institute Inc., Cary, NC John Toebes, SAS Institute Inc., Cary, NC Jeff Polzin,

More information

SAS Model Manager 2.3

SAS Model Manager 2.3 SAS Model Manager 2.3 Administrator's Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2010. SAS Model Manager 2.3: Administrator's Guide. Cary,

More information

Classification using Weka (Brain, Computation, and Neural Learning)

Classification using Weka (Brain, Computation, and Neural Learning) LOGO Classification using Weka (Brain, Computation, and Neural Learning) Jung-Woo Ha Agenda Classification General Concept Terminology Introduction to Weka Classification practice with Weka Problems: Pima

More information

What s New in VMware vsphere 5.1 VMware vcenter Server

What s New in VMware vsphere 5.1 VMware vcenter Server What s New in VMware vsphere 5.1 VMware vsphere 5.1 TECHNICAL MARKETING DOCUMENTATION V 1.0/UPDATED JULY 2012 Table of Contents Introduction... 3 Key Components... 3 vcenter Single Sign-On Server.... 3

More information

Connecting ESRI to Anything: EAI Solutions

Connecting ESRI to Anything: EAI Solutions Connecting ESRI to Anything: EAI Solutions Frank Weiss P.E., ESRI User s Conference 2002 Agenda Introduction What is EAI? Industry trends Key integration issues Point-to-point interfaces vs. Middleware

More information

Oracle Application Express: Administration 1-2

Oracle Application Express: Administration 1-2 Oracle Application Express: Administration 1-2 The suggested course agenda is displayed in the slide. Each lesson, except the Course Overview, will be followed by practice time. Oracle Application Express:

More information

Remove complexity in protecting your virtual infrastructure with. IBM Spectrum Protect Plus. Data availability made easy. Overview

Remove complexity in protecting your virtual infrastructure with. IBM Spectrum Protect Plus. Data availability made easy. Overview Overview Challenge In your organization, backup management is too complex and consumes too much time and too many IT resources. Solution IBM Spectrum Protect Plus dramatically simplifies data protection

More information

docalpha 5.0 Server Configuration Utility User Guide

docalpha 5.0 Server Configuration Utility User Guide docalpha 5.0 Server Configuration Utility User Guide Contents 1. docalpha Architecture Overview 3 1.1. docalpha Server Overview 3 2. Working with docalpha Server Configuration Utility 4 2.1. Starting docalpha

More information

Architectural Styles I

Architectural Styles I Architectural Styles I Software Architecture VO/KU (707023/707024) Roman Kern KTI, TU Graz 2015-01-07 Roman Kern (KTI, TU Graz) Architectural Styles I 2015-01-07 1 / 86 Outline 1 Non-Functional Concepts

More information

SAS IT Resource Management Forecasting. Setup Specification Document. A SAS White Paper

SAS IT Resource Management Forecasting. Setup Specification Document. A SAS White Paper SAS IT Resource Management Forecasting Setup Specification Document A SAS White Paper Table of Contents Introduction to SAS IT Resource Management Forecasting... 1 Getting Started with the SAS Enterprise

More information

Data Virtualization Implementation Methodology and Best Practices

Data Virtualization Implementation Methodology and Best Practices White Paper Data Virtualization Implementation Methodology and Best Practices INTRODUCTION Cisco s proven Data Virtualization Implementation Methodology and Best Practices is compiled from our successful

More information

System Requirements. SAS Activity-Based Management 7.2. Deployment

System Requirements. SAS Activity-Based Management 7.2. Deployment System Requirements SAS Activity-Based Management 7.2 This document provides the requirements for installing and running SAS Activity-Based Management. You must update your computer to meet the minimum

More information

bold The requirements for this software are: Software must be able to build, debug, run, and col ect data from Discrete Event Simulation models

bold The requirements for this software are: Software must be able to build, debug, run, and col ect data from Discrete Event Simulation models The following items in bold are requirements specified by an experienced simulation user who was evaluating new products. The Simio narrative responses are the parts not in bold. The requirements for this

More information

Session Questions and Responses

Session Questions and Responses Product: Topic: Audience: Updated: OpenText Image Crawler Webinar Questions ILTA February 10, 2015 Discover How to Make your Scanned Images Searchable with OpenText Image Crawler Session Questions and

More information