Surviving Survival Forecasting of Product Failure

Size: px
Start display at page:

Download "Surviving Survival Forecasting of Product Failure"

Transcription

1 Surviving Survival Forecasting of Product Failure Ryan Carr Advisory Statistical Data Scientist SAS #AnalyticsX

2 Agenda Survival Model Concepts Censoring & time Alignment Preparing the data for analysis Parametric Models Exponential Log-linear Weibull 2p Weibull 3p Generalized Gamma Process to Forecast % failure at fixed points from in-service date Process to Forecast weekly failure based on in-service dates Conclusion

3 Survival Model Concepts Right Censored Unit 1 1 Mar Jun 16 Unit 1 Failed after 16 weeks Unit 2 1 Mar 16 Today The failure time for Unit 2 is considered censored since it did not fail during our study period

4 Survival Model Concepts Time Aligned Unit 1 Unit 3 1 Mar Mar 16 Today Unit 1 was placed into service the first week Unit 2 was not placed into service until 4 weeks later Unit 1 Unit 3 Week 0 Week 0 Week 20 Time align each unit by using relative times (hours, days, weeks ) rather than absolute times.

5 Survival Model Concepts Preparing the data In Service Data Returns by Week FOR each in-service date Two ways to get 1 week in service.

6 Survival Model Concepts Preparing the data Censored / time Aligned Data Censored replacements are actually units still in service this row has only been in service 1 week before the end of the study. These 44 units actually failed only 1 week after being placed in service. It is the sum of any failing 1 week after any in-service date

7 Survival Model Concepts Preparing the data Forecast Template For forecasting (after the model is fit) We focus back on only those units still in service.

8 Comparing Models Distributions Exponential Weibull 2p Weibull 3p Lognormal Generalized Gamma SOME of the Relationships among the distributions: Exponential is Weibull 2p with Scale=1 Weibull 2p is Generalized Gamma with Shape=1 Weibull 3p is Weibull 2p with an offset parameter LogNormal is Generalized Gamma with Shape=0 NOTE: distribution information from and

9 Comparing Models Exponential CDF G t = exp( α t ) ods output ParameterEstimates = exp_pe2 ; proc lifereg data=returns_censored outest=pe_exponential ; model WeeksInService*censor(1)= / distribution=exponential ; weight replacements ; output out=resid_exponential sres=sresiduals ; probplot / hlower=.05 ; inset ; run; α = 10443

10 Comparing Models Weibull 2p CDF G t = exp( α t γ ) ods output ParameterEstimates = w2p_pe2 ; proc lifereg data=returns_censored outest=pe_weibull2p ; model WeeksInService*censor(1)= / distribution=weibull ; weight replacements ; output out=resid_weibull2p sres=sresiduals ; probplot / hlower=0.05 ; inset ; α = 1800 γ = 1.374

11 Comparing Models Weibull 3p CDF G t = exp( α t δ γ ) ods output ParmEst = pe_w3p ; proc reliability data=returns_censored ; freq replacements ; distribution W3 ; probplot WeeksInService*Censor(1) ; α = 3382 γ = δ = The Weibull 3p model is a generalization of the Weibull 2p model where a location or offset parameter is added. This offset represents the minimum time to event.

12 Comparing Models LogNormal CDF ods output ParameterEstimates = ln_pe2 ; proc lifereg data=returns_censored outest=pe_lognormal ; model WeeksInService*censor(1)= / distribution=lognormal ; weight replacements ; output out=resid_lognormal sres=sresiduals ; probplot / hlower=0.05 ; inset ; run; μ = σ = 2.443

13 Comparing Models Generalized Gamma CDF ods output ParameterEstimates = gg_pe2 ; proc lifereg data=returns_censored inest=in_estw outest=pe_ggamma ; model WeeksInService*censor(1)= / distribution=gamma ; weight replacements ; output out=resid_ggamma sres=sresiduals ; probplot ; inset ; run; a = d= p=

14 Generalized Gamma Setting initial parameters proc lifereg data=returns_censored outest=out_estw noprint ; model WeeksInService*censor(1)= / distribution=weibull maxiter=5000 ; weight replacements ; data in_estw ; set out_estw ; _dist_ = "Gamma" ; _shape1_ = 1 ; * Weibull 2p * ; proc lifereg data=returns_censored inest=in_estw outest=pe_ggamma ; model WeeksInService*censor(1)= / distribution=gamma maxiter=10000 ; weight replacements ; output out=resid_ggamma sres=sresiduals ; probplot ; inset ; run; NOTE: The Generalized Gamma is a fairly complex distribution and may have convergence problems in maximum likelihood parameter estimation Two steps to help with convergence are: 1) Start parameter search at a reasonable position like the Weibull 2p estimates 2) Set the maximum iterations to a higher number

15 Applying Models to forecasts Have models Have parameter estimates How do we apply these to get estimated future values?

16 Applying Models Points in time Exponential CDF G t = exp( α t ) α = Direct Formula ods output ParameterEstimates = exp_pe2 ; proc sql ; select estimate into :alpha from exp_pe2 where parameter ='Weibull Scale' ; 0.25% Chance of failure by week 26 data prob_failure ; do WeeksInService = 4, 13, 26 ; cdf = 1 - exp(- WeeksInService/&alpha.) ; output ; end ;

17 Applying Models Points in time Weibull 2p CDF G t = exp( α t γ ) α = 1800 γ = Direct Formula proc sql ; select put(estimate, 15.10) as estimate into :alpha from w2p_pe2 where parameter ='Weibull Shape' ; select put(estimate, 15.10) as estimate into :gamma from w2p_pe2 where parameter ='Weibull Scale' ; 0.30% Chance of failure by week 26 data prob_failure ; do WeeksInService = 4, 13, 26 ; cdf = (1- (exp(-((weeksinservice)/&gamma.)**&alpha.)) ) ; output ; end ;

18 Applying Models Points in time Weibull 3p Direct Formula CDF G t = exp( α t δ γ ) α = 3382,γ = 1.195, δ = select estimate into :delta from pe_w3p where parameter ='Weibull Threshold' ; 0.28% Chance of failure by week 26 data prob_failure ; do WeeksInService = 4, 13, 26 ; cdf = (1- (exp(-((weeksinservice-&delta.)/&gamma.)**&alpha.)) ) ; output ; end ;

19 Applying Models Points in time LogNormal CDF μ = 9.999, σ = CDF Function proc lifereg data=returns_censored outest=pe_lognormal ;... proc sql ; select intercept, _scale_ into :mu, :sigma from pe_lognormal; data prob_failure ; do WeeksInService = 4, 13, 26 ; cdf2 = cdf('lognormal', WeeksInService, &mu., &sigma.) ; output ; end ; 0.29% Chance of failure by week 26

20 Applying Models Points in time Generalized Gamma CDF a = 9.838, d= , p= Lifereg maxiter=0 proc lifereg outest=pe_ggamma ; data pfail_in ; censor = 0 ; replacements = 1 ; do WeeksInService = 4, 13, 26 ; output ; end ; 0.25% Chance of failure by week 26 proc lifereg data=pfail_in inest=pe_ggamma noprint ; model WeeksInService*censor(1)= / distribution=gamma maxiter=0 ; output out=prob_failurel cdf=cdf ;

21 Applying Models Weekly Returns Generate periods / weeks for projection Apply models to get probability of failure each week Determine units still in service from each source (ship week) For each source (ship week) Apply weekly failure rates. Remove units from service for next week Predict next week s failure Align returns by source (ship week) and summarize expected returns each future week.

22 Applying Models Weekly Returns Generate periods / weeks for projection data forecastin ; fcstrange = &numperiods. ; do WeeksInService = 1 to fcstrange ; replacements=1 ; censor = 0 ; output ; end ; ForecastIn

23 Applying Models Weekly Returns Apply models to get probability of failure each week proc lifereg data=forecastin inest=pe_ggamma noprint ; model WeeksInService*censor(1)= / distribution=gamma maxiter=0 ; weight replacements ; output out=predcdf cdf=cdf ; data predpct ; set predcdf (keep=weeksinservice CDF) ; prevcdf = lag(cdf) ; if _n_ = 1 then prevcdf = 0 ; retn_pct = cdf - prevcdf ;

24 Applying Models Weekly Returns Determine units still in service from each source (ship week) proc sql ; create table UnitsInService as select ship_week, WeeksInService, censor, field_pop from ShippedStillInField a where censor = 1 group by WeeksInService, censor ;

25 Applying Models Weekly Returns For each source (ship week) Apply weekly failure rates. Remove units from service for next week Predict next week s failure proc transpose data=predpct out=predpctt prefix=retnpct ; var retn_pct ; data fct ; set UnitsInService ; if _n_ = 1 then set predpctt (drop=_name_) ; retain retnpct: ; array retnpct(*) retnpct: ; array forc[&numperiods.] ; offset = WeeksInService - 1 ; do i = 1 to (&numperiods.-offset) ; forc[i] = round(field_pop * retnpct(i+offset), 1) ; field_pop = field_pop - forc[i] ; end ;

26 Applying Models Weekly Returns Align returns by source (ship week) and summarize expected returns each future week. data forecast; merge shipsum fct_sort; by descending WeeksInService ; proc print data=forecast noobs label ; id ship_week ; var field_pop forc1-forc26 ; sum field_pop forc1-forc26 ; label field_pop="units in Service" ; format field_pop forc: comma9. ;

27 Applying Models Weekly Returns Comparing results with graphs Generalized Gamma Weibull 3p

28 Applying Models Summary Distribution Formula CDF() LIFEREG Exponential Yes Yes Yes Weibull2p Yes Yes Yes Weibull3p Yes Yes With mods LogNormal Yes Yes Generalized Gamma Yes NOTE: searching the internet for applications of the Generalized Gamma in SAS leads to many unanswered questions. The few answers I could find focused on implementation of the partial gamma function via SAS IML. The use of LIFEREG with maxiter=0 as a means of forecasting with an existing model and new data was not directly documented.

29 Conclusion selecting model Can structure comparisons leveraging relationships of models and nested ML Lognormal, Exponential and Weibull 2p are all instances of Generalized Gamma But Weibull 3p is not? Could structure comparison using RMSE of actual vs predicted Ultimately Test against reality Understand essentially, all models are wrong, but some are useful George E. P. Box. Use simplest projection method(s)

30 #AnalyticsX

Chapter 30 The RELIABILITY Procedure

Chapter 30 The RELIABILITY Procedure Chapter 30 The RELIABILITY Procedure Chapter Table of Contents OVERVIEW...923 GETTING STARTED...925 Analysis of Right-Censored Data from a Single Population...925 Weibull Analysis Comparing Groups of Data....928

More information

Table of Contents. The RETAIN Statement. The LAG and DIF Functions. FIRST. and LAST. Temporary Variables. List of Programs.

Table of Contents. The RETAIN Statement. The LAG and DIF Functions. FIRST. and LAST. Temporary Variables. List of Programs. Table of Contents List of Programs Preface Acknowledgments ix xvii xix The RETAIN Statement Introduction 1 Demonstrating a DATA Step with and without a RETAIN Statement 1 Generating Sequential SUBJECT

More information

SAS/QC 14.2 User s Guide. The RELIABILITY Procedure

SAS/QC 14.2 User s Guide. The RELIABILITY Procedure SAS/QC 14.2 User s Guide The RELIABILITY Procedure This document is an individual chapter from SAS/QC 14.2 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

SAS/ETS 14.2 User s Guide. The SEVERITY Procedure

SAS/ETS 14.2 User s Guide. The SEVERITY Procedure SAS/ETS 14.2 User s Guide The SEVERITY Procedure This document is an individual chapter from SAS/ETS 14.2 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.

More information

/* Parametric models: AFT modeling */ /* Data described in Chapter 3 of P. Allison, "Survival Analysis Using the SAS System." */

/* Parametric models: AFT modeling */ /* Data described in Chapter 3 of P. Allison, Survival Analysis Using the SAS System. */ /* Parametric models: AFT modeling */ /* Data described in Chapter 3 of P. Allison, "Survival Analysis Using the SAS System." */ options ls =79; data recidall; input week arrest fin age race wexp mar paro

More information

A User Manual for the Multivariate MLE Tool. Before running the main multivariate program saved in the SAS file Part2-Main.sas,

A User Manual for the Multivariate MLE Tool. Before running the main multivariate program saved in the SAS file Part2-Main.sas, A User Manual for the Multivariate MLE Tool Before running the main multivariate program saved in the SAS file Part-Main.sas, the user must first compile the macros defined in the SAS file Part-Macros.sas

More information

JMP 12.1 Quick Reference Windows and Macintosh Keyboard Shortcuts

JMP 12.1 Quick Reference Windows and Macintosh Keyboard Shortcuts Data Table Actions JMP 12.1 Quick Reference and Keyboard s Select the left or right cell. If a blinking cursor is inserted in a cell, move one character left or right through the cell contents. Select

More information

Stat 302 Statistical Software and Its Applications SAS: Distributions

Stat 302 Statistical Software and Its Applications SAS: Distributions Stat 302 Statistical Software and Its Applications SAS: Distributions Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 39 Distributions in R and SAS Distribution R SAS Beta

More information

Package citools. October 20, 2018

Package citools. October 20, 2018 Type Package Package citools October 20, 2018 Title Confidence or Prediction Intervals, Quantiles, and Probabilities for Statistical Models Version 0.5.0 Maintainer John Haman Functions

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:

More information

Accelerated Life Testing Module Accelerated Life Testing - Overview

Accelerated Life Testing Module Accelerated Life Testing - Overview Accelerated Life Testing Module Accelerated Life Testing - Overview The Accelerated Life Testing (ALT) module of AWB provides the functionality to analyze accelerated failure data and predict reliability

More information

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were

More information

SAS/STAT 14.1 User s Guide. The LIFEREG Procedure

SAS/STAT 14.1 User s Guide. The LIFEREG Procedure SAS/STAT 14.1 User s Guide The LIFEREG Procedure This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

Chapter 6: Modifying and Combining Data Sets

Chapter 6: Modifying and Combining Data Sets Chapter 6: Modifying and Combining Data Sets The SET statement is a powerful statement in the DATA step. Its main use is to read in a previously created SAS data set which can be modified and saved as

More information

Changes and Enhancements

Changes and Enhancements Changes and Enhancements Overview This section summarizes the major changes and enhancements to SAS/QC software in Versions 7 and 8. Information about changes and enhancements that were implemented in

More information

Chapter 5 INSET Statement. Chapter Table of Contents

Chapter 5 INSET Statement. Chapter Table of Contents Chapter 5 INSET Statement Chapter Table of Contents OVERVIEW...191 GETTING STARTED...192 DisplayingSummaryStatisticsonaHistogram...192 Formatting Values and Customizing Labels..... 193 AddingaHeaderandPositioningtheInset...194

More information

Be#er%Accelerated%Tests%

Be#er%Accelerated%Tests% `` Be#er%Accelerated%Tests% Wayne%Nelson,%Sta5s5cal%Consultant% Wnconsult@aol.com,%%% (518)346C5138% ASTR%2015,%Sep%9%C%11,%Cambridge,%MA% 1% Copyright Wayne Nelson 2015 1 `PURPOSE DESCRIBE AVAILABLE STATISTICAL

More information

AURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1

AURA ACADEMY SAS TRAINING. Opposite Hanuman Temple, Srinivasa Nagar East, Ameerpet,Hyderabad Page 1 SAS TRAINING SAS/BASE BASIC THEORY & RULES ETC SAS WINDOWING ENVIRONMENT CREATION OF LIBRARIES SAS PROGRAMMING (BRIEFLY) - DATASTEP - PROC STEP WAYS TO READ DATA INTO SAS BACK END PROCESS OF DATASTEP INSTALLATION

More information

Matlab statistics toolbox-fitting Distributions to Data 刘静远

Matlab statistics toolbox-fitting Distributions to Data 刘静远 Matlab statistics toolbox-fitting Distributions to Data 刘静远 0711160008 2007.12.18 Analyzing Survival or Reliability Data In biological or medical applications survival analysis. The times may represent

More information

SAS Online Training: Course contents: Agenda:

SAS Online Training: Course contents: Agenda: SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation

More information

ECE 510 Midterm 13 Feb 2013

ECE 510 Midterm 13 Feb 2013 ECE 510 Midterm 13 Feb 2013 Questions (short answer) 1. What does QRE stand for? 2. Name 3 job activities a QRE might perform. 3. Would 100 DPM at the end of an expected life of 5 years be a typical goal

More information

Contents of SAS Programming Techniques

Contents of SAS Programming Techniques Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal

More information

Using Templates Created by the SAS/STAT Procedures

Using Templates Created by the SAS/STAT Procedures Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity

More information

The EMCLUS Procedure. The EMCLUS Procedure

The EMCLUS Procedure. The EMCLUS Procedure The EMCLUS Procedure Overview Procedure Syntax PROC EMCLUS Statement VAR Statement INITCLUS Statement Output from PROC EMCLUS EXAMPLES-SECTION Example 1: Syntax for PROC FASTCLUS Example 2: Use of the

More information

How data analytics can be used to improve the network performance: some use cases

How data analytics can be used to improve the network performance: some use cases How data analytics can be used to improve the network performance: some use cases lvelasco@ac.upc.edu ACP 2016 Outline Three use cases will be presented where the use of data analytics helps to improve

More information

Reducing Credit Union Member Attrition with Predictive Analytics

Reducing Credit Union Member Attrition with Predictive Analytics Reducing Credit Union Member Attrition with Predictive Analytics Nate Derby Stakana Analytics Seattle, WA PhilaSUG 10/29/15 Nate Derby Reducing Credit Union Attrition 1 / 28 Outline 1 2 Duplicating the

More information

MATLAB COMPUTATIONAL FINANCE CONFERENCE Quantitative Sports Analytics using MATLAB

MATLAB COMPUTATIONAL FINANCE CONFERENCE Quantitative Sports Analytics using MATLAB MATLAB COMPUTATIONAL FINANCE CONFERENCE 2017 Quantitative Sports Analytics using MATLAB Robert Kissell, PhD Robert.Kissell@KissellResearch.com September 28, 2017 Important Email and Web Addresses AlgoSports23/MATLAB

More information

Chapter 10: Extensions to the GLM

Chapter 10: Extensions to the GLM Chapter 10: Extensions to the GLM 10.1 Implement a GAM for the Swedish mortality data, for males, using smooth functions for age and year. Age and year are standardized as described in Section 4.11, for

More information

Purchase this book at

Purchase this book at Chapter 2 2 Creating Simple Stored Processes BASE SAS gives programmers the exponential ability to query and report about data from their desktops; however, this limitation means that a user can access

More information

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS

More information

A Side of Hash for You To Dig Into

A Side of Hash for You To Dig Into A Side of Hash for You To Dig Into Shan Ali Rasul, Indigo Books & Music Inc, Toronto, Ontario, Canada. ABSTRACT Within the realm of Customer Relationship Management (CRM) there is always a need for segmenting

More information

Fitting Statistical Models with PROCs NLMIXED and MCMC

Fitting Statistical Models with PROCs NLMIXED and MCMC Paper 902-2017 Fitting Statistical Models with PROCs NLMIXED and MCMC Robin High and Wael ElRayes, University of Nebraska Medical Center SAS /Stat software has several procedures which estimate parameters

More information

Excel Functions & Tables

Excel Functions & Tables Excel Functions & Tables SPRING 2016 Spring 2016 CS130 - EXCEL FUNCTIONS & TABLES 1 Review of Functions Quick Mathematics Review As it turns out, some of the most important mathematics for this course

More information

Macros to Report Missing Data: An HTML Data Collection Guide Patrick Thornton, University of California San Francisco, SF, California

Macros to Report Missing Data: An HTML Data Collection Guide Patrick Thornton, University of California San Francisco, SF, California Macros to Report Missing Data: An HTML Data Collection Guide Patrick Thornton, University of California San Francisco, SF, California ABSTRACT This paper presents SAS macro programs that calculate missing

More information

General Factorial Models

General Factorial Models In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 34 It is possible to have many factors in a factorial experiment. In DDD we saw an example of a 3-factor study with ball size, height, and surface

More information

Box-Cox Transformation for Simple Linear Regression

Box-Cox Transformation for Simple Linear Regression Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are

More information

Package simsurv. May 18, 2018

Package simsurv. May 18, 2018 Type Package Title Simulate Survival Data Version 0.2.2 Date 2018-05-18 Package simsurv May 18, 2018 Maintainer Sam Brilleman Description Simulate survival times from standard

More information

Appendix A Microsoft Office Specialist exam objectives

Appendix A Microsoft Office Specialist exam objectives A 1 Appendix A Microsoft Office Specialist exam objectives This appendix covers these additional topics: A Excel 2013 Specialist exam objectives, with references to corresponding coverage in ILT Series

More information

Minitab detailed

Minitab detailed Minitab 18.1 - detailed ------------------------------------- ADDITIVE contact sales: 06172-5905-30 or minitab@additive-net.de ADDITIVE contact Technik/ Support/ Installation: 06172-5905-20 or support@additive-net.de

More information

Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7)

Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7) Stat 5100 Handout #12.f SAS: Time Series Case Study (Unit 7) Data: Weekly sales (in thousands of units) of Super Tech Videocassette Tapes over 161 weeks [see Bowerman & O Connell Forecasting and Time Series:

More information

SD10 A SAS MACRO FOR PERFORMING BACKWARD SELECTION IN PROC SURVEYREG

SD10 A SAS MACRO FOR PERFORMING BACKWARD SELECTION IN PROC SURVEYREG Paper SD10 A SAS MACRO FOR PERFORMING BACKWARD SELECTION IN PROC SURVEYREG Qixuan Chen, University of Michigan, Ann Arbor, MI Brenda Gillespie, University of Michigan, Ann Arbor, MI ABSTRACT This paper

More information

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K.

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K. GAMs semi-parametric GLMs Simon Wood Mathematical Sciences, University of Bath, U.K. Generalized linear models, GLM 1. A GLM models a univariate response, y i as g{e(y i )} = X i β where y i Exponential

More information

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov ECE521: Week 11, Lecture 20 27 March 2017: HMM learning/inference With thanks to Russ Salakhutdinov Examples of other perspectives Murphy 17.4 End of Russell & Norvig 15.2 (Artificial Intelligence: A Modern

More information

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency.

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency. Paper 76-28 Comparative Efficiency of SQL and Base Code When Reading from Database Tables and Existing Data Sets Steven Feder, Federal Reserve Board, Washington, D.C. ABSTRACT In this paper we compare

More information

INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering

INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering Erik Velldal University of Oslo Sept. 18, 2012 Topics for today 2 Classification Recap Evaluating classifiers Accuracy, precision,

More information

Anomaly detection for NFSen/nfdump netflow engine - with Holt-Winters algorithm

Anomaly detection for NFSen/nfdump netflow engine - with Holt-Winters algorithm Anomaly detection for NFSen/nfdump netflow engine - with Holt-Winters algorithm János Mohácsi, Gábor Kiss NIIF/HUNGARNET Motivation Usual work of CSIRT teams: Find abnormal behaviour Visual detection of

More information

Data Preparation for Analytics

Data Preparation for Analytics Data Preparation for Analytics Dr. Gerhard Svolba SAS-Austria Vienna http://sascommunity.org/wiki/gerhard_svolba Agenda Data Preparation for Analytics General Thoughts Data Structures for Analytics Case

More information

Package UnivRNG. R topics documented: January 10, Type Package

Package UnivRNG. R topics documented: January 10, Type Package Type Package Package UnivRNG January 10, 2018 Title Univariate Pseudo-Random Number Generation Version 1.1 Date 2018-01-10 Author Hakan Demirtas, Rawan Allozi Maintainer Rawan Allozi

More information

An Animated Guide: Proc Transpose

An Animated Guide: Proc Transpose ABSTRACT An Animated Guide: Proc Transpose Russell Lavery, Independent Consultant If one can think about a SAS data set as being made up of columns and rows one can say Proc Transpose flips the columns

More information

Clustering Lecture 5: Mixture Model

Clustering Lecture 5: Mixture Model Clustering Lecture 5: Mixture Model Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced topics

More information

STAT 5200 Handout #25. R-Square & Design Matrix in Mixed Models

STAT 5200 Handout #25. R-Square & Design Matrix in Mixed Models STAT 5200 Handout #25 R-Square & Design Matrix in Mixed Models I. R-Square in Mixed Models (with Example from Handout #20): For mixed models, the concept of R 2 is a little complicated (and neither PROC

More information

Monte Carlo Integration and Random Numbers

Monte Carlo Integration and Random Numbers Monte Carlo Integration and Random Numbers Higher dimensional integration u Simpson rule with M evaluations in u one dimension the error is order M -4! u d dimensions the error is order M -4/d u In general

More information

On the Parameter Estimation of the Generalized Exponential Distribution Under Progressive Type-I Interval Censoring Scheme

On the Parameter Estimation of the Generalized Exponential Distribution Under Progressive Type-I Interval Censoring Scheme arxiv:1811.06857v1 [math.st] 16 Nov 2018 On the Parameter Estimation of the Generalized Exponential Distribution Under Progressive Type-I Interval Censoring Scheme Mahdi Teimouri Email: teimouri@aut.ac.ir

More information

Package intccr. September 12, 2017

Package intccr. September 12, 2017 Type Package Package intccr September 12, 2017 Title Semiparametric Competing Risks Regression under Interval Censoring Version 0.2.0 Author Giorgos Bakoyannis , Jun Park

More information

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint PharmaSUG 2018 - Paper DV-01 Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint Jane Eslinger, SAS Institute Inc. ABSTRACT An output table is a square. A slide

More information

StatsMate. User Guide

StatsMate. User Guide StatsMate User Guide Overview StatsMate is an easy-to-use powerful statistical calculator. It has been featured by Apple on Apps For Learning Math in the App Stores around the world. StatsMate comes with

More information

Checking for Duplicates Wendi L. Wright

Checking for Duplicates Wendi L. Wright Checking for Duplicates Wendi L. Wright ABSTRACT This introductory level paper demonstrates a quick way to find duplicates in a dataset (with both simple and complex keys). It discusses what to do when

More information

Random Sampling For the Non-statistician Diane E. Brown AdminaStar Solutions, Associated Insurance Companies Inc.

Random Sampling For the Non-statistician Diane E. Brown AdminaStar Solutions, Associated Insurance Companies Inc. Random Sampling For the Non-statistician Diane E. Brown AdminaStar Solutions, Associated Insurance Companies Inc. Random samples can be drawn based on: - Size: an approximate number, an exact number, a

More information

Statements with the Same Function in Multiple Procedures

Statements with the Same Function in Multiple Procedures 67 CHAPTER 3 Statements with the Same Function in Multiple Procedures Overview 67 Statements 68 BY 68 FREQ 70 QUIT 72 WEIGHT 73 WHERE 77 Overview Several statements are available and have the same function

More information

Two useful macros to nudge SAS to serve you

Two useful macros to nudge SAS to serve you Two useful macros to nudge SAS to serve you David Izrael, Michael P. Battaglia, Abt Associates Inc., Cambridge, MA Abstract This paper offers two macros that augment the power of two SAS procedures: LOGISTIC

More information

SAS/STAT 15.1 User s Guide The RMSTREG Procedure

SAS/STAT 15.1 User s Guide The RMSTREG Procedure SAS/STAT 15.1 User s Guide The RMSTREG Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

Package bayesdp. July 10, 2018

Package bayesdp. July 10, 2018 Type Package Package bayesdp July 10, 2018 Title Tools for the Bayesian Discount Prior Function Version 1.3.2 Date 2018-07-10 Depends R (>= 3.2.3), ggplot2, survival, methods Functions for data augmentation

More information

Technical Report of ISO/IEC Test Program of the M-DISC Archival DVD Media June, 2013

Technical Report of ISO/IEC Test Program of the M-DISC Archival DVD Media June, 2013 Technical Report of ISO/IEC 10995 Test Program of the M-DISC Archival DVD Media June, 2013 With the introduction of the M-DISC family of inorganic optical media, Traxdata set the standard for permanent

More information

Data Should Not be a Four Letter Word Microsoft Excel QUICK TOUR

Data Should Not be a Four Letter Word Microsoft Excel QUICK TOUR Toolbar Tour AutoSum + more functions Chart Wizard Currency, Percent, Comma Style Increase-Decrease Decimal Name Box Chart Wizard QUICK TOUR Name Box AutoSum Numeric Style Chart Wizard Formula Bar Active

More information

Applied Clustering Techniques. Jing Dong

Applied Clustering Techniques. Jing Dong Applied Clustering Techniques Jing Dong Nov 31, 2016 What is cluster analysis? What is Cluster Analysis? Cluster: o Similar to one another within the same cluster o Dissimilar to the objects in other clusters

More information

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots

Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots STAT 350 (Spring 2015) Lab 3: SAS Solutions 1 Lab 3 (80 pts.) - Assessing the Normality of Data Objectives: Creating and Interpreting Normal Quantile Plots Note: The data sets are not included in the solutions;

More information

DDP ANOVA Model. May 9, ddpanova-package... 1 ddpanova... 2 ddpsurvival... 5 post.pred... 7 post.sim DDP ANOVA with univariate response

DDP ANOVA Model. May 9, ddpanova-package... 1 ddpanova... 2 ddpsurvival... 5 post.pred... 7 post.sim DDP ANOVA with univariate response R topics documented: DDP ANOVA Model May 9, 2007 ddpanova-package..................................... 1 ddpanova.......................................... 2 ddpsurvival.........................................

More information

CSSS 510: Lab 2. Introduction to Maximum Likelihood Estimation

CSSS 510: Lab 2. Introduction to Maximum Likelihood Estimation CSSS 510: Lab 2 Introduction to Maximum Likelihood Estimation 2018-10-12 0. Agenda 1. Housekeeping: simcf, tile 2. Questions about Homework 1 or lecture 3. Simulating heteroskedastic normal data 4. Fitting

More information

Excel Functions & Tables

Excel Functions & Tables Excel Functions & Tables Winter 2012 Winter 2012 CS130 - Excel Functions & Tables 1 Review of Functions Quick Mathematics Review As it turns out, some of the most important mathematics for this course

More information

Function Approximation. Pieter Abbeel UC Berkeley EECS

Function Approximation. Pieter Abbeel UC Berkeley EECS Function Approximation Pieter Abbeel UC Berkeley EECS Outline Value iteration with function approximation Linear programming with function approximation Value Iteration Algorithm: Start with for all s.

More information

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions:

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Online Supplementary Appendix for Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Power, Sample Size, and Resource Considerations 1 Appendix

More information

Weibull Reliability Analyses

Weibull Reliability Analyses Visual-XSel Software-Guide for Weibull The Weibull analysis shows the failure frequencies or the unreliability of parts and components in the Weibull-net and interprets them. Basics and more details can

More information

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe SAS Example A10 data sales infile U:\Documents\...\sales.txt input Region : $8. State $2. +1 Month monyy5. Headcnt Revenue Expenses format Month monyy5. Revenue dollar12.2 proc sort by Region State Month

More information

A SAS Solution to Create a Weekly Format Susan Bakken, Aimia, Plymouth, MN

A SAS Solution to Create a Weekly Format Susan Bakken, Aimia, Plymouth, MN Paper S126-2012 A SAS Solution to Create a Weekly Format Susan Bakken, Aimia, Plymouth, MN ABSTRACT As programmers, we are frequently asked to report by periods that do not necessarily correspond to weeks

More information

General Factorial Models

General Factorial Models In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 1 1 / 31 It is possible to have many factors in a factorial experiment. We saw some three-way factorials earlier in the DDD book (HW 1 with 3 factors:

More information

The Proc Transpose Cookbook

The Proc Transpose Cookbook ABSTRACT PharmaSUG 2017 - Paper TT13 The Proc Transpose Cookbook Douglas Zirbel, Wells Fargo and Co. Proc TRANSPOSE rearranges columns and rows of SAS datasets, but its documentation and behavior can be

More information

SAS is the most widely installed analytical tool on mainframes. I don t know the situation for midrange and PCs. My Focus for SAS Tools Here

SAS is the most widely installed analytical tool on mainframes. I don t know the situation for midrange and PCs. My Focus for SAS Tools Here Explore, Analyze, and Summarize Your Data with SAS Software: Selecting the Best Power Tool from a Rich Portfolio PhD SAS is the most widely installed analytical tool on mainframes. I don t know the situation

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 245 Introduction This procedure generates R control charts for variables. The format of the control charts is fully customizable. The data for the subgroups can be in a single column or in multiple

More information

Statistical Programming in SAS. From Chapter 10 - Programming with matrices and vectors - IML

Statistical Programming in SAS. From Chapter 10 - Programming with matrices and vectors - IML Week 12 [30+ Nov.] Class Activities File: week-12-iml-prog-16nov08.doc Directory: \\Muserver2\USERS\B\\baileraj\Classes\sta402\handouts From Chapter 10 - Programming with matrices and vectors - IML 10.1:

More information

Content-based image and video analysis. Machine learning

Content-based image and video analysis. Machine learning Content-based image and video analysis Machine learning for multimedia retrieval 04.05.2009 What is machine learning? Some problems are very hard to solve by writing a computer program by hand Almost all

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) SAS Analytics:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

CSE 546 Machine Learning, Autumn 2013 Homework 2

CSE 546 Machine Learning, Autumn 2013 Homework 2 CSE 546 Machine Learning, Autumn 2013 Homework 2 Due: Monday, October 28, beginning of class 1 Boosting [30 Points] We learned about boosting in lecture and the topic is covered in Murphy 16.4. On page

More information

SAS (Statistical Analysis Software/System)

SAS (Statistical Analysis Software/System) SAS (Statistical Analysis Software/System) Clinical SAS:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic

More information

Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems

Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems Eric Heien 1, Derrick Kondo 1, Ana Gainaru 2, Dan LaPine 2, Bill Kramer 2, Franck Cappello 1, 2 1 INRIA, France 2 UIUC, USA Context

More information

Learning Objectives. Continuous Random Variables & The Normal Probability Distribution. Continuous Random Variable

Learning Objectives. Continuous Random Variables & The Normal Probability Distribution. Continuous Random Variable Learning Objectives Continuous Random Variables & The Normal Probability Distribution 1. Understand characteristics about continuous random variables and probability distributions 2. Understand the uniform

More information

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Predictive Analysis: Evaluation and Experimentation. Heejun Kim Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training

More information

Using PC SAS/ASSIST* for Statistical Analyses

Using PC SAS/ASSIST* for Statistical Analyses Using PC SAS/ASSIST* for Statistical Analyses Margaret A. Nemeth, Monsanto Company lptroductjon SAS/ASSIST, a user friendly, menu driven applications system, is available on several platforms. This paper

More information

Multiple Comparisons of Treatments vs. a Control (Simulation)

Multiple Comparisons of Treatments vs. a Control (Simulation) Chapter 585 Multiple Comparisons of Treatments vs. a Control (Simulation) Introduction This procedure uses simulation to analyze the power and significance level of two multiple-comparison procedures that

More information

CMISS the SAS Function You May Have Been MISSING Mira Shapiro, Analytic Designers LLC, Bethesda, MD

CMISS the SAS Function You May Have Been MISSING Mira Shapiro, Analytic Designers LLC, Bethesda, MD ABSTRACT SESUG 2016 - RV-201 CMISS the SAS Function You May Have Been MISSING Mira Shapiro, Analytic Designers LLC, Bethesda, MD Those of us who have been using SAS for more than a few years often rely

More information

MR-2010I %PHChoice Macro %PHChoice Macro

MR-2010I %PHChoice Macro %PHChoice Macro MR-2010I %PHChoice Macro 1173 %PHChoice Macro The %PHChoice autocall macro customizes the output from PROC PHREG for choice modeling. Typically, you run the following macro once to customize the PROC PHREG

More information

Software Size Estimation II

Software Size Estimation II Copyright 1994 Carnegie Mellon University Disciplined Software Engineering - Lecture 4 1 Software Size Estimation II Material adapted from: Disciplined Software Engineering Software Engineering Institute

More information

Package survivalmpl. December 11, 2017

Package survivalmpl. December 11, 2017 Package survivalmpl December 11, 2017 Title Penalised Maximum Likelihood for Survival Analysis Models Version 0.2 Date 2017-10-13 Author Dominique-Laurent Couturier, Jun Ma, Stephane Heritier, Maurizio

More information

Laplace Transform of a Lognormal Random Variable

Laplace Transform of a Lognormal Random Variable Approximations of the Laplace Transform of a Lognormal Random Variable Joint work with Søren Asmussen & Jens Ledet Jensen The University of Queensland School of Mathematics and Physics August 1, 2011 Conference

More information

CXO-Cockpit Hands-On Workshop

CXO-Cockpit Hands-On Workshop CXO-Cockpit Hands-On Workshop Agenda - Day 1 o Introduction o Main tools and basic concepts o Architecture o Demo of CXO-Cockpit web application o Demo of CXO-Cockpit designer o Hands-on Exercises: create

More information

Markov Random Fields and Segmentation with Graph Cuts

Markov Random Fields and Segmentation with Graph Cuts Markov Random Fields and Segmentation with Graph Cuts Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project Proposal due Oct 27 (Thursday) HW 4 is out

More information

Calculation of the Mean and Variance of Lognormal Data Which Contains Left-Censored Observations

Calculation of the Mean and Variance of Lognormal Data Which Contains Left-Censored Observations Calculation of the Mean and Variance of Lognormal Data Which Contains Left-Censored Observations Stephen J. Ganocy The Goodyear Tire & Rubber Company Akron, Ohio Abstract The mean and variance of data

More information

Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University

Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University 9/1/2005 Beta-Regression with SPSS 1 Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University (email: Michael.Smithson@anu.edu.au) SPSS Nonlinear Regression syntax

More information

Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD

Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD Penetrating the Matrix Justin Z. Smith, William Gui Zupko II, U.S. Census Bureau, Suitland, MD ABSTRACT While working on a time series modeling problem, we needed to find the row and column that corresponded

More information

Resource and Performance Distribution Prediction for Large Scale Analytics Queries

Resource and Performance Distribution Prediction for Large Scale Analytics Queries Resource and Performance Distribution Prediction for Large Scale Analytics Queries Prof. Rajiv Ranjan, SMIEEE School of Computing Science, Newcastle University, UK Visiting Scientist, Data61, CSIRO, Australia

More information