Enterprise Miner Software: Changes and Enhancements, Release 4.1

Similar documents
SAS Enterprise Miner : Tutorials and Examples

SAS Web Infrastructure Kit 1.0. Overview

Introduction. LOCK Statement. CHAPTER 11 The LOCK Statement and the LOCK Command

Chapter 3 Managing Results in Projects. Chapter Table of Contents

SAS Factory Miner 14.2: User s Guide

Data Representation. Variable Precision and Storage Information. Numeric Variables in the Alpha Environment CHAPTER 9

Using Data Transfer Services

Getting Started with SAS Factory Miner 14.2

Chapter 28 Saving and Printing Tables. Chapter Table of Contents SAVING AND PRINTING TABLES AS OUTPUT OBJECTS OUTPUT OBJECTS...

Permission Program. Support for Version 6 Only. Allowing SAS/SHARE Client Access to SAS Libraries or Files CHAPTER 40

Chapter 6 Creating Reports. Chapter Table of Contents

Chapter 25 Editing Windows. Chapter Table of Contents

Using Cross-Environment Data Access (CEDA)

SAS Publishing SAS. Forecast Studio 1.4. User s Guide

SAS Enterprise Miner TM 6.1. Migration Guide

APPENDIX 2 Customizing SAS/ASSIST Software

Formats. Formats Under UNIX. HEXw. format. $HEXw. format. Details CHAPTER 11

Introduction to MDDBs

SAS Visual Analytics 7.2, 7.3, and 7.4: Getting Started with Analytical Models

SAS 9.2 Foundation Services. Administrator s Guide

SAS Model Manager 2.2. Tutorials

Chapter 23 Animating Graphs. Chapter Table of Contents ANIMATING SELECTION OF OBSERVATIONS ANIMATING SELECTED GRAPHS...347

Chapter 27 Saving and Printing Graphics

SAS Data Integration Studio 3.3. User s Guide

SAS Web Report Studio 3.1

Tasks Menu Reference. Introduction. Data Management APPENDIX 1

SAS Model Manager 2.3

Graphics. Chapter Overview CHAPTER 4

ADVANCED ANALYTICS USING SAS ENTERPRISE MINER RENS FEENSTRA

SAS Web Infrastructure Kit 1.0. Overview, Second Edition

Enterprise Miner Version 4.0. Changes and Enhancements

Optimizing System Performance

SAS. IT Service Level Management 2.1: Migration Documentation

SAS I/O Engines. Definition. Specifying a Different Engine. How Engines Work with SAS Files CHAPTER 36

SAS Clinical Data Integration 2.4

SAS Enterprise Guide 4.3

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...

SAS. Installation Guide Fifth Edition Intelligence Platform

Chapter 2 User Interface Features. networks Window. Drawing Panel

SAS Clinical Data Integration 2.6

Storing and Reusing Macros

Licensing SAS DataFlux Products

Administering SAS Enterprise Guide 4.2

Creating and Executing Stored Compiled DATA Step Programs

SAS IT Resource Management 3.3

SAS Marketing Operations Management 6.0 R14 Update 2

Choosing the Right Procedure

SAS 9.3 Intelligence Platform

Examples That Use Remote Objecting

The correct bibliographic citation for this manual is as follows: SAS Institute Inc Proc EXPLODE. Cary, NC: SAS Institute Inc.

SAS Contextual Analysis 13.2: Administrator s Guide

Chapter 25 PROC PARETO Statement. Chapter Table of Contents. OVERVIEW SYNTAX SummaryofOptions DictionaryofOptions...

SAS Structural Equation Modeling 1.3 for JMP

SCL Arrays. Introduction. Declaring Arrays CHAPTER 4

SAS. Information Map Studio 3.1: Creating Your First Information Map

Informats. Informats Under UNIX. HEXw. informat. $HEXw. informat. Details CHAPTER 13

The EXPLODE Procedure

Chapter 13 Multivariate Techniques. Chapter Table of Contents

Overview. CHAPTER 2 Using the SAS System and SAS/ ASSIST Software

Data Set Options. Specify a data set option in parentheses after a SAS data set name. To specify several data set options, separate them with spaces.

SAS 9.4 Foundation Services: Administrator s Guide

Chapter 28 Command Reference. Chapter Table of Contents TSVIEW COMMAND FORECAST COMMAND

External Files. Definition CHAPTER 38

SAS Enterprise Case Management 2.1. Administrator s Guide

SAS Workflow Manager 2.2: Administrator s Guide

SAS Business Rules Manager 1.2

SAS Visual Analytics 6.1

Getting Started with. SAS Enterprise Miner 5.3

CHAPTER 13 Importing and Exporting External Data

SAS Catalogs. Definition. Catalog Names. Parts of a Catalog Name CHAPTER 32

Getting Started with SAS Enterprise Miner 12.1

SAS Clinical Data Integration Server 2.1

SAS Enterprise Miner TM 6.1

SAS BI Dashboard 3.1. User s Guide Second Edition

SAS Drug Development 3.3_03. December 14, 2007

SAS/Warehouse Metadata API. Reference Second Edition

SAS Publishing. Configure SAS. Forecast Server 1.4. Stored Processes

Installation Instructions for SAS/ACCESS Interface to SAP BW

Using the SQL Editor. Overview CHAPTER 11

CREATING THE DISTRIBUTION ANALYSIS

Using CVS Repositories with SAS

SAS Fair Banking 8.1 Installation Instructions

Macro Facility. About the Macro Facility. Automatic Macro Variables CHAPTER 14

JMP Clinical. Release Notes. Version 5.0

CHAPTER 7 Examples of Combining Compute Services and Data Transfer Services

Enterprise Miner Tutorial Notes 2 1

CHAPTER 7 Using Other SAS Software Products

SAS Support for TELNET on Windows

SAS Forecast Server 3.1. Administrator s Guide to Installation and Configuration

SAS Add-In 7.1 for Microsoft Office: Getting Started in Microsoft Excel, Microsoft Word, and Microsoft PowerPoint, Second Edition

SAS Contextual Analysis 14.3: Administrator s Guide

SAS Drug Development. SAS Macro API 1.3 User s Guide

SAS. Contextual Analysis 13.2: User s Guide. SAS Documentation

Administrator for Enterprise Clients: User s Guide. Second Edition

Alternative ODS HTML Statements for Running Examples in Different Operating Environments

Installation Instructions for Release 5.1 of the SAS Performance Management Solutions

SAS File Management. Improving Performance CHAPTER 37

Chapter 23 Introduction to the OPTEX Procedure

Getting Started with SAS Enterprise Miner 14.2

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application

Transcription:

Enterprise Miner Software: Changes and Enhancements, Release 4.1

The correct bibliographic citation for this manual is as follows: SAS Institute Inc., Enterprise Miner TM Software: Changes and Enhancements, Release 4.1, Cary, NC: SAS Institute Inc., 20001. Enterprise Miner TM Software: Changes and Enhancements, Release 4.1 Copyright 2001 by SAS Institute Inc., Cary, NC, USA. All rights reserved. Produced in the United States of America. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher, SAS Institute Inc. U.S. Government Restricted Rights Notice. Use, duplication, or disclosure of this software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR 52.227 19 Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary, North Carolina 27513. 1st printing, February 2001 SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. indicates USA registration. IBM and all other International Business Machines Corporation product or service names are registered trademarks or trademarks of International Business Machines Corporation in the USA and other countries. Oracle and all other Oracle Corporation product or service names are registered trademarks or trademarks of Oracle Corporation in the USA and other countries. Other brand and product names are registered trademarks or trademarks of their respective companies.

Enterprise Miner Software: Changes and Enhancements, Release 4.1 Table of Contents SAS Process Monitor.. 1 Summary of Node Updates. 2 New Enterprise Miner Nodes and Node Features... 4 Experimental Text-Mining Nodes... 6

SAS Process Monitor The Regression, Neural Network, and SOM/Kohonen nodes use the SAS Process Monitor to track nodes and stop them from running. The following display shows the contents of the SAS Process Monitor Graph tab while training a neural network: You can change the settings of the SAS Process Monitor in the Process Monitor window. To open the Process Monitor window in the Regression and SOM/Kohonen nodes, 1. Open the node. 2. From the main menu, select Tools Training Monitor Setup. To open the Process Monitor window in the Neural Network node, 1. Open the node. 2. In the General tab, select the Configure button next to the Training process monitor check box. For more information about the SAS Process Monitor, select from the Enterprise Miner main menu Help EM Reference SAS Process Monitor.

Summary of Node Updates Enhancements to the Assessment node and to the Model Manager include the following: A scatter plot is available, for interval targets only. The plot enables you to view the pair-wise distribution of any two variables. To create a scatter plot, select one model from the Assessment Results Browser or from the Model Manager window and then select Tools Predicted Plot. The following display shows a scatter plot of the predicted target plotted against the actual target: The Export Model is experimental. This feature enables you to export both model results and metadata and either store them in a file on the PC, or register them in a repository. Enhancements to the C*Score node include the following: In the C Code tab, the Produce XML function and data description check box enables you to create the XML-generated output at the top of the C Code tab. Also, the Use individual input and output values as separate arguments check box can be used to cause each data set variable in the function to be listed as a separate parameter to the function. The Edit button in the C Code tab of the C*Score Results Browser opens a PREVIEW window. The generated C code is displayed in the PREVIEW window and can be edited, printed, and saved as a separate file.

Distributions of variables can now be viewed in the Variables tab when the Data Set Attributes node is open. Enhancements to the Filter Outliers node include the following: In the Settings tab, the new Use sample and Use entire data radio buttons enable you to specify the data set for the automatic and manual filtering. In the Settings tab, the names of the Apply all and Apply unchanged buttons have been changed to Apply these filters to all vars and Apply only to vars without existing filters, respectively, to be more descriptive. For class variables, you can select the values of a variable to be excluded in the Variable Histogram window by clicking a bar. The Variable Histogram window is enhanced to enable you to view the frequencies in a table format. To open a table view of frequencies of a variable, select from the main menu View Frequency table. In the frequency table, select the values for exclusion. For interval variables, you can set the range of inclusion in the Select Value window. When the Select Value window is open, it displays the most recently saved values of MIN and MAX. If both of the displayed boundary values are within the actual data range, you cannot set the values outside the actual data range. Enhancements to the Regression node include the following: The Regression node now supports nominal target variables. The Minimize resource usage check box has been added to the Advanced tab. This is useful when the model has a large number of parameters to be estimated, especially for nominal target variables. In the General subtab of the Defaults tab, a Replace unknown level with check box has been added. This is useful when the class variables in the score data set contain values that are not in the training data set. You can replace the unknown value by selecting either most frequent value (count) or missing value.

Enhancements to the Sampling node include the following: Distributions of variables can be viewed in the Variables tab when the Sampling node is open. In the Stratified tab, the Enforce proportions check box has been added. Use it to specify what proportion of the sample will come from each stratum. Note: When a stratum of the stratified variable does not have enough observations to meet the requested proportions, selecting this check box adjusts the requested sample size to preserve the requested proportion. If this check box is not selected, the node will try to meet the sample size requirement and select as many observations as possible from each stratum. This might lead to sample proportions that are different from what you requested. The map plot in the Map tab of the Results Browser now resembles a map, instead of a grid plot. New Enterprise Miner Nodes and Node Features The Link Analysis node is experimental. The Link Analysis node transforms data from differing sources into a data model that can be graphed. The data model supports simple statistical measures, presents a simple interactive graph for basic analytical exploration, and generates cluster scores from raw data that can be used for data reduction and segmentation. The Memory-Based Reasoning node is experimental. The Memory-Based Reasoning node is a modeling tool that uses a k-nearest neighbor algorithm to categorize or predict observations.

The Princomp/Dmneural node fits an additive nonlinear model that uses the bucketed principal components as inputs to predict a binary or an interval target variable. It also performs a stand alone principal components analysis and passes the scored principal components to the successor nodes. The Time Series node is experimental. The Time Series node converts transactional data to time series data, and performs seasonal and trend analysis on an interval target variable. The Two Stage Model node computes a two-stage model to predict a class target and an interval target. The interval target variable is usually the value that is associated with a level of the class target variable. The Tree Results Viewer is experimental. Enhancements to the Tree Results Viewer include the following: The presentation quality of the Tree view is improved. Abbreviated splitting rules are displayed. Zooming capability is improved. New views that display competing and surrogate splitting rules are added. Node colors in the Tree Results Viewer can be specified. Context Help has been added and can be accessed by pressing the F1 key.

Experimental Text-Mining Nodes The following experimental text-mining nodes require a separate setinit. For more information, contact your SAS sales representative. The Text Parsing node decomposes textual data into term and document frequency tables that are suitable for data mining. The input data is a collection of documents or text that you want to examine for patterns or that you want to mine for information. The Singular Value Decomposition (SVD) node provides a technique to reduce the dimensions from thousands to one or two hundred. In text-mining applications, it resolves the problem of determining when two documents are similar by performing a latent semantic analysis, in which terms and documents that are closely associated are placed near one another. Also, documents that have similar conceptual meaning but dissimilar term patterns are generally placed close together. The Expectation-Maximization Clustering (E-M Clustering) node performs observation clustering by identifying primary and secondary clusters for a given data set. In the Expectation-Maximization Clustering node, observations are not grouped into clusters, but belong to each primary cluster with a certain probability.