CREATING A SUMMARY TABLE OF NORMALIZED (Z) SCORES
|
|
- Camron Davis
- 5 years ago
- Views:
Transcription
1 CREATING A SUMMARY TABLE OF NORMALIZED (Z) SCORES Walter W. OWen The Biostatistics Center The George Washington University ABSTRACT Data from the behavioral sciences are often analyzed by normalizing the scores for individuals in experimental subgroups to a reference population. Normalized scores, called Z-scores, may then be used to compare performance relative to the reference group either across the experimental subgroups or among different variables. Summary procedures al~ow group statistics to be output to SAS data sets. These data sets may be reshaped using the MATRIX and TRANSPOSE procedures before being brought together via SET and MERGE statements. The result is a compact table of normalized scores with SAS variable labels identifying the tests presented. Population: Sample: Where: Z Z = (Xi Z = (Xi - X ref ) I s ref Normalized Score Individualized Raw Score Reference Mean Scores Reference Standard Deviation Addition of the reference values to the table allows the reader to extrapolate information about the experimental subgroup means and to compare the reference group to other populations reported in the literature. Further, the percentage of each subgroup having absolute Z-scores greater than an arbitrary cutoff could be added yielding an even better definition of the experimental subgroups. For example, absolute scores greater than 1.64 indicate that an individual is performing at a level different from 90% of a normally distributed reference group. INTRODUCTION The use of normalized (Z) scores is widespread in the behavioral sciences. The process of normalization involves the transformation of data from experimental subgroups using the performance of a standard, or control, group as the initial point of reference. The resultant Z-scores allow researchers a common ground on which to compare a wide variety of tests that may be scored on different scales or are essentially objective in nature. The formulas for computing individual Z-scores based on reference populations and samples are listed below. The resultant Z values are unitless scores indicating the number of standard deviations by which the corresponding raw score lies above or below the mean of the reference distribution. The reference group is characterized by a Z-score mean of zero and standard deviation of one. If an individual has an absolute Z-score of 1.64 or greater, he is performing at a level different from 90% of a normally distributed reference population. Similarly, an absolute Z-score greater than 1.96 puts the individual outside the range of 95% of that reference group. When reporting the scores of several different subgroups for a battery of tests, it is desirable to present the results in tabular form. SAS provides several paths by which to create such a table. This paper will focus on gathering the information and the usefulness of different table layouts rather than elaborate methods for putting the information on paper. Accordingly, PROe PRINT, with LINESIZE options, was shown. used to output the tables 1116
2 METHODS Several requirements for an incoming data set should be established before elaborating on other methodology. Though the techniques described below work equally well for any number of subgroups, the groups must be classified by a single variable (perhaps GROUP) that identifies the experimental groupings as well as the reference group, in a mutually exclusive format. The appropriate PRoe FORMAT statement should include values for all groups and labels suitable as SAS variable names (i.e., eight characters or fewer with no spaces). This format should be permanently assigned to the GROUPing variable when creating the groups. Global macro variables should be established to give the number of variables (called by &N in the programming segments that follow) and the number of groups used (called by &G, not including the reference group), thus allowing much of the remaining programming to be generalized to accept variations in these values (see Program Segment 1). Also, a macro (called by %VARS) listing the actual variables to be normalized is fundamental if the program is to be easily adapted for various purposes. The number of observations per group is output from PROe FREQuency, TRANSPOSEd into a single observation, and saved for later use (see Program Segment 2). It is convenient to create a permanent length of 40 for the variable labels at this point. This will allow any label up to 40 characters to be printed in the final table without worrying about truncation in a subsequent MERGE statement. PROe PRINT will adjust spacing if no label requires this much space. The next step in the process is to create two data sets, one for the reference group and the other for all of the subgroup data. The mean and standard deviation for each variable in the reference group is output as a single observation using PROe MEANS. A copy of this data set is reshaped by PROe MATRIX and output for use in the final tables as reference parameters (see Program Segment 3). PROe SUMMARY could also be used, but requires a separate PRINT statement to look at the data. For smaller data sets, PROe MEANS is preferred even if it is slightly less efficient. Appending the reference statistics to each observation of the subgroup data set allows the calculation of individual Z scores (see Program Segment 4). The scores should replace the original raw values, thus retaining the variable labels for future use. A word of caution -- be aware that the variables must be able to accommodate the decimal portion of the newly created Z-score. A series of counting variables may be created to record which Z-scores are outside a desired range (perhaps 1.64 or 1.96 as described previously). If a value of 100 is used in these counting variables to mark a score as deviant and a value of zero if it is not, the mean of the values will automatically yield the percentage of individuals outside the specified range. PRoe MEANS (or SUMMARY) is used again, this time BY the GROUPing variable to output a data set of mean Z-scores with an observation for each subgroup (see' Program Segment 5). PRoe TRANSPOSE, using GROUP as the 10 variable, will produce data ready for the final table. The same general process is used to prepare the percentages of outliers for the table (see Program Segment 6). Data manipulation is completed by match MERGEing the reference statistics with the Z-score and percentage means for each subgroup. The group sizes may now be SET with the information collected for the variables in the previous step (see Program Segment 7). THE TABLES Now that all of the necessary information is together in a single data set having one observation for each variable plus one observation containing the group sizes, the tables may be PRINTed. The SPLIT option for labeling columns of PRoe PRINT should be used to give better definition to the table. The most simplistic output (see Table 1) gives only the mean Z-scores for each subgroup. Addition of the reference group means and standard deviations (see Table 2) will define where the values are centered and allows the reader to determine the means of the experimental subgroups. This is done 1117
3 by multiplying the reference standard deviation by the subgroup mean Z-score and then adding this value to the reference mean. The final bit of information to add is the percentage of each subgroup which lies outside of the specified range. These values are based on the number of subjects in each group who actually took the test and can enhance the information already listed by indicating the possible skewness of the subgroups. See Program Segment 8 for the PROe PRINT used to produce Table 3. The statements to produce Tables 1 and 2 are comparable. SAS is the registered trademark of SAS Institute, Inc., Cary, NC, USA. Address Correspondence To: Walter W. Owen The Biostatistics Center 7979 Old Georgetown Road, Suite 500 Bethesda, MD SUMMARY The use of the global macro variables G and N, defining the number of subgroups and variables respectively, allows flexibility in the programming. Simply by varying the value of N, along with the appropriate modifications in the VARS macro containing the list of variables used, the table may reflect different subsets of test items. The format of any of these tables may, of course, be changed to reflect the desired number of significant digits. If measurement units for the variables are needed, they should be included in the SAS variable labels. Units apply only for the reference group as Z-scores and percentages are unitless values. The SAS macro language offers some intriguing possibilities for the ambitious programmer. If further generalizations were added, a procedure-style macro could be set up with defining parameters to cover many of the requirements set forth for the incoming data mentioned earlier in this paper. It has proven to be a formidable challenge to put group sizes into macro variables for use in labeling the output, but SAS capabilities should make this possible. Also there is the possibility of using PUT statements to print the tables, although more information is generally needed to allow for varying column lengths, particularly for the variable labels. 1118
4 Table 1 VARIABLE GROUP GROUP 2 DESCRIPTION MEAN Z MEAN Z N= SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE O.BO Table 2 NORMALIZED TO REFERENCE PARAMETERS VARIABLE REFERENCE REFERENCE GROUP GROUP 2 DESCRIPTION MEAN STD DEV MEAN Z MEAN Z N= SAS LABEL FOR VARIABLE B SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE Table 3 NORMALIZED TO REFERENCE PARAMETERS PERCENTAGE OF GROUP WITH Izl > 1.64 SHOWN VARIABLE REFERENCE REFERENCE GROUP PCT GROUP 2 PCT 2 DESCRIPTION MEAN STn DEV MEAN Z MEAN Z N= SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE SAS LABEL FOR VARIABLE
5 PROGRAMMING SEGMENTS Program Segment 1 Macro Definitions Call &G &N &CUT %VARS Heaning number of groups number of variables Z score cutoff list of raw score variables Program Segment 2 Obtaining Group N's Assignment %LET G = 2; XLET N = 5; %LET CUT = 1.64; %KACRO V ARS; variable list %HEND VARS; PROC FREQ; TABLES GROUP / OUT=FREQSET NOPRINT; PROC TRANSPOSE DATA=FREQSET OUT=FREQSET; ID GROUP; VAR COUNT; DATA FREQSET; LENGTH VARLABEL $40; SET FREqSET (RENAME=( NAME =VARNAME REFGRP=REFMEAN»; VARLABEL='N='; Program Segment 3 Obtaining Reference Statistics PROC MEANS DATA=REFGRPS NOPRINT; OUTPUT OUT=REFMEANS MEAN=MEAN1-MEAN&N STD=STD1-STD&N; PRoe MATRIX; FETCH X DATA=REFHEANS; Y = SHAPE(X.&N); z = y'; OUTPUT Z OUT=REFSET(RENAME3 (COL1=REFMEAN COL2=REFSTD»; Program Segment 4 Calculate Individual Z-scores DATA ZSCORES; IF N =1 THEN SET REFHEANS; SET-CROUPS; ARRAY Z (8) %VARS; ARRAY V (H) %VARS; ARRAY M (H) MEANI-MEAN&N; ARRAY S (H) STD1-STD&N; 00 OVER S; IF S HE 0 THEN 00; Z = (V-M)/S; ELSE Z =.; Program Segment 5 Calculate Hean Z-scores PRoe SORT DATA=ZSCORES; PRoe MEANS DATA=ZSCORES NOPRINT; OUTPUT OUT=ZMEANS MEAN= %VARS; PROC TRANSPOSE DATA=ZMEANS OUT=ZMEANS; ID GROUP; Program Segment 6 Obtain the Percentage of Deviate Z-scores DATA PCTZ; SET ZSCORES; ARRAY Z (H) %VARS; ARRAY CNT (H) CNT1-CNT&N; 00 OVER Z; IF ABS(Z) GT &CUT THEN CNT=IOO; ELSE IF Z NE THEN CNT=O; PROC SORT DATA=PCTZ; PROC MEANS DATA=PCTZ NOPRINT; VAR CNTl-CNT&N; OUTPUT OUT=PCTZ HEAN=%VARS; PROC TRANSPOSE DATA=PCTZ OUT=PCTZ PREFIX=PCT; Program Segment 1 Combine and Concatenate Data DATA COMBINE; MERGE ZMEANS PCTZ REFSET (DROP=ROW); RENAME NAME =VARNAME =LABEL_=VARLABEL; DATA FINAL; SET FREQSET COHBINE; LABEL REFHEAN =REFERENCE* Mean REFSTD =REFERENCE* Std Dev VARNAME =VARIABLE VARLABEL=VARIABLE*DESCRIPTION GROUPl =GROUP l*mean Z GROUP2 =GROUP 2*Mean Z PCT1 =PCT 1 PCT2 =PCT 2 Program Segment 8 Printing Table ~: PRoe PRINT SPLIT=*; ID VARLABEL; VAR REFMEAN REFSTD GROUP I PCTI GROUP2 PCT2; FORHAT REFMEAN REFSTD GROUPI-GROUP&G 8.2 PCTl-PCT&G 3.0; 1120
Using SAS Macro to Include Statistics Output in Clinical Trial Summary Table
Using SAS Macro to Include Statistics Output in Clinical Trial Summary Table Amy C. Young, Ischemia Research and Education Foundation, San Francisco, CA Sharon X. Zhou, Ischemia Research and Education
More informationChapter 6: Modifying and Combining Data Sets
Chapter 6: Modifying and Combining Data Sets The SET statement is a powerful statement in the DATA step. Its main use is to read in a previously created SAS data set which can be modified and saved as
More informationThe STANDARD Procedure
1135 CHAPTER 35 The STANDARD Procedure Overview 1135 Procedure Syntax 1137 PROC STANDARD Statement 1138 BY Statement 1140 FREQ Statement 1141 VAR Statement 1141 WEIGHT Statement 1142 Results 1142 Missing
More informationThere s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA
Paper HW04 There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA ABSTRACT Clinical Trials data comes in all shapes and sizes depending
More informationPaper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency.
Paper 76-28 Comparative Efficiency of SQL and Base Code When Reading from Database Tables and Existing Data Sets Steven Feder, Federal Reserve Board, Washington, D.C. ABSTRACT In this paper we compare
More informationA SAS Macro for Producing Benchmarks for Interpreting School Effect Sizes
A SAS Macro for Producing Benchmarks for Interpreting School Effect Sizes Brian E. Lawton Curriculum Research & Development Group University of Hawaii at Manoa Honolulu, HI December 2012 Copyright 2012
More informationSAS Enterprise Miner : Tutorials and Examples
SAS Enterprise Miner : Tutorials and Examples SAS Documentation February 13, 2018 The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2017. SAS Enterprise Miner : Tutorials
More informationThe DMSPLIT Procedure
The DMSPLIT Procedure The DMSPLIT Procedure Overview Procedure Syntax PROC DMSPLIT Statement FREQ Statement TARGET Statement VARIABLE Statement WEIGHT Statement Details Examples Example 1: Creating a Decision
More informationCreating Macro Calls using Proc Freq
Creating Macro Calls using Proc Freq, Educational Testing Service, Princeton, NJ ABSTRACT Imagine you were asked to get a series of statistics/tables for each country in the world. You have the data, but
More informationAn Application of PROC NLP to Survey Sample Weighting
An Application of PROC NLP to Survey Sample Weighting Talbot Michael Katz, Analytic Data Information Technologies, New York, NY ABSTRACT The classic weighting formula for survey respondents compensates
More informationInterleaving a Dataset with Itself: How and Why
cc002 Interleaving a Dataset with Itself: How and Why Howard Schreier, U.S. Dept. of Commerce, Washington DC ABSTRACT When two or more SAS datasets are combined by means of a SET statement and an accompanying
More informationA Side of Hash for You To Dig Into
A Side of Hash for You To Dig Into Shan Ali Rasul, Indigo Books & Music Inc, Toronto, Ontario, Canada. ABSTRACT Within the realm of Customer Relationship Management (CRM) there is always a need for segmenting
More informationContents. About This Book...1
Contents About This Book...1 Chapter 1: Basic Concepts...5 Overview...6 SAS Programs...7 SAS Libraries...13 Referencing SAS Files...15 SAS Data Sets...18 Variable Attributes...21 Summary...26 Practice...28
More informationHandling Numeric Representation SAS Errors Caused by Simple Floating-Point Arithmetic Computation Fuad J. Foty, U.S. Census Bureau, Washington, DC
Paper BB-206 Handling Numeric Representation SAS Errors Caused by Simple Floating-Point Arithmetic Computation Fuad J. Foty, U.S. Census Bureau, Washington, DC ABSTRACT Every SAS programmer knows that
More informationUsing PROC SQL to Generate Shift Tables More Efficiently
ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline
More informationABSTRACT DATA CLARIFCIATION FORM TRACKING ORACLE TABLE INTRODUCTION REVIEW QUALITY CHECKS
Efficient SAS Quality Checks: Unique Error Identification And Enhanced Data Management Analysis Jim Grudzinski, Biostatistics Manager Of SAS Programming Covance Periapproval Services Inc, Radnor, PA ABSTRACT
More informationPaper SAS Programming Conventions Lois Levin, Independent Consultant, Bethesda, Maryland
Paper 241-28 SAS Programming Conventions Lois Levin, Independent Consultant, Bethesda, Maryland ABSTRACT This paper presents a set of programming conventions and guidelines that can be considered in developing
More informationSummarizing Impossibly Large SAS Data Sets For the Data Warehouse Server Using Horizontal Summarization
Summarizing Impossibly Large SAS Data Sets For the Data Warehouse Server Using Horizontal Summarization Michael A. Raithel, Raithel Consulting Services Abstract Data warehouse applications thrive on pre-summarized
More informationBase and Advance SAS
Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS
More informationSAS Online Training: Course contents: Agenda:
SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation
More informationTechdata Solution. SAS Analytics (Clinical/Finance/Banking)
+91-9702066624 Techdata Solution Training - Staffing - Consulting Mumbai & Pune SAS Analytics (Clinical/Finance/Banking) What is SAS SAS (pronounced "sass", originally Statistical Analysis System) is an
More informationSAS (Statistical Analysis Software/System)
SAS (Statistical Analysis Software/System) SAS Analytics:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic
More informationSAS (Statistical Analysis Software/System)
SAS (Statistical Analysis Software/System) Clinical SAS:- Class Room: Training Fee & Duration : 23K & 3 Months Online: Training Fee & Duration : 25K & 3 Months Learning SAS: Getting Started with SAS Basic
More informationChoosing the Right Procedure
3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues
More informationKeeping Track of Database Changes During Database Lock
Paper CC10 Keeping Track of Database Changes During Database Lock Sanjiv Ramalingam, Biogen Inc., Cambridge, USA ABSTRACT Higher frequency of data transfers combined with greater likelihood of changes
More informationA Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment
A Practical and Efficient Approach in Generating AE (Adverse Events) Tables within a Clinical Study Environment Abstract Jiannan Hu Vertex Pharmaceuticals, Inc. When a clinical trial is at the stage of
More informationWorkload Characterization Techniques
Workload Characterization Techniques Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse567-08/
More informationGenerating Customized Analytical Reports from SAS Procedure Output Brinda Bhaskar and Kennan Murray, RTI International
Abstract Generating Customized Analytical Reports from SAS Procedure Output Brinda Bhaskar and Kennan Murray, RTI International SAS has many powerful features, including MACRO facilities, procedures such
More informationFrom Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX
Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were
More informationSAS Programming Conventions Lois Levin, Independent Consultant
SAS Programming Conventions Lois Levin, Independent Consultant INTRODUCTION This paper presents a set of programming conventions and guidelines that can be considered in developing code to ensure that
More informationT.I.P.S. (Techniques and Information for Programming in SAS )
Paper PO-088 T.I.P.S. (Techniques and Information for Programming in SAS ) Kathy Harkins, Carolyn Maass, Mary Anne Rutkowski Merck Research Laboratories, Upper Gwynedd, PA ABSTRACT: This paper provides
More informationABSTRACT INTRODUCTION TRICK 1: CHOOSE THE BEST METHOD TO CREATE MACRO VARIABLES
An Efficient Method to Create a Large and Comprehensive Codebook Wen Song, ICF International, Calverton, MD Kamya Khanna, ICF International, Calverton, MD Baibai Chen, ICF International, Calverton, MD
More informationLearning SAS by Example
Learning SAS by Example A Programmer's Guide Second Edition.sas Ron Cody The correct bibliographic citation for this manual is as follows: Cody, Ron. 2018. Learning SAS by Example: A Programmer's Guide,
More informationVirtual Accessing of a SAS Data Set Using OPEN, FETCH, and CLOSE Functions with %SYSFUNC and %DO Loops
Paper 8140-2016 Virtual Accessing of a SAS Data Set Using OPEN, FETCH, and CLOSE Functions with %SYSFUNC and %DO Loops Amarnath Vijayarangan, Emmes Services Pvt Ltd, India ABSTRACT One of the truths about
More informationHow to Incorporate Old SAS Data into a New DATA Step, or What is S-M-U?
Paper 54-25 How to Incorporate Old SAS Data into a New DATA Step, or What is S-M-U? Andrew T. Kuligowski Nielsen Media Research Abstract / Introduction S-M-U. Some people will see these three letters and
More informationUsing Taylor s Linearization Technique in StEPS to Estimate Variances for Non-Linear Survey Estimators 1
Using Taylor s Linearization Technique in StEPS to Estimate Variances for Non-Linear Survey Estimators Roger L. Goodwin, U.S. Bureau of the Census, Washington, DC 033 Katherine J. Thompson, U.S. Bureau
More informationIntroduction to SAS Mike Zdeb ( , #1
Mike Zdeb (402-6479, msz03@albany.edu) #1 (10) REARRANGING DATA If you want to conduct an analysis across observations in a data set, you can use SAS procedures. If you want to conduct an analysis within
More informationMastering the Basics: Preventing Problems by Understanding How SAS Works. Imelda C. Go, South Carolina Department of Education, Columbia, SC
SESUG 2012 ABSTRACT Paper PO 06 Mastering the Basics: Preventing Problems by Understanding How SAS Works Imelda C. Go, South Carolina Department of Education, Columbia, SC There are times when SAS programmers
More informationEpidemiology Principles of Biostatistics Chapter 3. Introduction to SAS. John Koval
Epidemiology 9509 Principles of Biostatistics Chapter 3 John Koval Department of Epidemiology and Biostatistics University of Western Ontario What we will do today We will learn to use use SAS to 1. read
More information2 = Disagree 3 = Neutral 4 = Agree 5 = Strongly Agree. Disagree
PharmaSUG 2012 - Paper HO01 Multiple Techniques for Scoring Quality of Life Questionnaires Brandon Welch, Rho, Inc., Chapel Hill, NC Seungshin Rhee, Rho, Inc., Chapel Hill, NC ABSTRACT In the clinical
More informationIntroduction to PROC SQL
Introduction to PROC SQL Steven First, Systems Seminar Consultants, Madison, WI ABSTRACT PROC SQL is a powerful Base SAS Procedure that combines the functionality of DATA and PROC steps into a single step.
More informationIt s Proc Tabulate Jim, but not as we know it!
Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on
More informationAutomating Preliminary Data Cleaning in SAS
Paper PO63 Automating Preliminary Data Cleaning in SAS Alec Zhixiao Lin, Loan Depot, Foothill Ranch, CA ABSTRACT Preliminary data cleaning or scrubbing tries to delete the following types of variables
More informationSmall Sample Equating: Best Practices using a SAS Macro
SESUG 2013 Paper BtB - 11 Small Sample Equating: Best Practices using a SAS Macro Anna M. Kurtz, North Carolina State University, Raleigh, NC Andrew C. Dwyer, Castle Worldwide, Inc., Morrisville, NC ABSTRACT
More informationGraphical Analysis of Data using Microsoft Excel [2016 Version]
Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.
More informationTaming a Spreadsheet Importation Monster
SESUG 2013 Paper BtB-10 Taming a Spreadsheet Importation Monster Nat Wooding, J. Sargeant Reynolds Community College ABSTRACT As many programmers have learned to their chagrin, it can be easy to read Excel
More informationMacro to compute best transform variable for the model
Paper 3103-2015 Macro to compute best transform variable for the model Nancy Hu, Discover Financial Service ABSTRACT This study is intended to assist Analysts to generate the best of variables using simple
More informationKEYWORDS ARRAY statement, DO loop, temporary arrays, MERGE statement, Hash Objects, Big Data, Brute force Techniques, PROC PHREG
PharmaSUG 2015 - Paper TT06 Using Arrays to Quickly Perform Fuzzy Merge Look-ups: Case Studies in Efficiency Arthur L. Carpenter California Occidental Consultants, Anchorage, AK ABSTRACT Merging two data
More informationTo conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic.
Automating the process of choosing among highly correlated covariates for multivariable logistic regression Michael C. Doherty, i3drugsafety, Waltham, MA ABSTRACT In observational studies, there can be
More informationChecking for Duplicates Wendi L. Wright
Checking for Duplicates Wendi L. Wright ABSTRACT This introductory level paper demonstrates a quick way to find duplicates in a dataset (with both simple and complex keys). It discusses what to do when
More informationIntroducing a Colorful Proc Tabulate Ben Cochran, The Bedford Group, Raleigh, NC
Paper S1-09-2013 Introducing a Colorful Proc Tabulate Ben Cochran, The Bedford Group, Raleigh, NC ABSTRACT Several years ago, one of my clients was in the business of selling reports to hospitals. He used
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Exam Name 1) The table Design view shows 1) A) the relationships established for the table. B) the formatting applied to the table. C) the structure of the table. D) the number of records in the table.
More informationExcel 2016: Core Data Analysis, Manipulation, and Presentation; Exam
Microsoft Office Specialist Excel 2016: Core Data Analysis, Manipulation, and Presentation; Exam 77-727 Successful candidates for the Microsoft Office Specialist Excel 2016 certification exam will have
More informationRanking Between the Lines
Ranking Between the Lines A %MACRO for Interpolated Medians By Joe Lorenz SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in
More informationSAS (Statistical Analysis Software/System)
SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:
More informationSAS PROGRAM EFFICIENCY FOR BEGINNERS. Bruce Gilsen, Federal Reserve Board
SAS PROGRAM EFFICIENCY FOR BEGINNERS Bruce Gilsen, Federal Reserve Board INTRODUCTION This paper presents simple efficiency techniques that can benefit inexperienced SAS software users on all platforms.
More informationSAS PROGRAM EFFICIENCY FOR BEGINNERS. Bruce Gilsen, Federal Reserve Board
SAS PROGRAM EFFICIENCY FOR BEGINNERS Bruce Gilsen, Federal Reserve Board INTRODUCTION This paper presents simple efficiency techniques that can benefit inexperienced SAS software users on all platforms.
More informationBruce Gilsen, Federal Reserve Board
SAS PROGRAM EFFICIENCY FOR BEGINNERS Bruce Gilsen, Federal Reserve Board INTRODUCTION This paper presents simple efficiency techniques that can benefit inexperienced SAS software users on all platforms
More informationChapter Two: Descriptive Methods 1/50
Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained
More informationKnow What You Are Missing: How to Catalogue and Manage Missing Pieces of Historical Data
Know What You Are Missing: How to Catalogue and Manage Missing Pieces of Historical Data Shankar Yaddanapudi, SAS Consultant, Washington DC ABSTRACT In certain applications it is necessary to maintain
More informationUsing Data Set Options in PROC SQL Kenneth W. Borowiak Howard M. Proskin & Associates, Inc., Rochester, NY
Using Data Set Options in PROC SQL Kenneth W. Borowiak Howard M. Proskin & Associates, Inc., Rochester, NY ABSTRACT Data set options are an often over-looked feature when querying and manipulating SAS
More informationData Manipulation with JMP
Data Manipulation with JMP Introduction JMP was introduced in 1989 by its Principle developer, John Sall at SAS. It is a richly graphic environment and very useful for both preliminary data exploration
More informationUsing SAS Macros to Extract P-values from PROC FREQ
SESUG 2016 ABSTRACT Paper CC-232 Using SAS Macros to Extract P-values from PROC FREQ Rachel Straney, University of Central Florida This paper shows how to leverage the SAS Macro Facility with PROC FREQ
More informationIntroduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions
Introduction to SAS Procedures SAS Basics III Susan J. Slaughter, Avocet Solutions DATA versus PROC steps Two basic parts of SAS programs DATA step PROC step Begin with DATA statement Begin with PROC statement
More informationWe have already seen the transportation problem and the assignment problem. Let us take the transportation problem, first.
Advanced Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Lecture 19 Network Models In this lecture, we will discuss network models. (Refer
More informationPROGRAM DOCUMENTATION FOR SAS PLS PROGRAM I. This document describes the source code for the first program described in the article titled:
PROGRAM DOCUMENTATION FOR SAS PLS PROGRAM I General This document describes the source code for the first program described in the article titled: SAS PLS for analysis of spectroscopic data published in
More informationIf You Need These OBS and These VARS, Then Drop IF, and Keep WHERE Jay Iyengar, Data Systems Consultants LLC
Paper 2417-2018 If You Need These OBS and These VARS, Then Drop IF, and Keep WHERE Jay Iyengar, Data Systems Consultants LLC ABSTRACT Reading data effectively in the DATA step requires knowing the implications
More informationSAS Scalable Performance Data Server 4.3
Scalability Solution for SAS Dynamic Cluster Tables A SAS White Paper Table of Contents Introduction...1 Cluster Tables... 1 Dynamic Cluster Table Loading Benefits... 2 Commands for Creating and Undoing
More informationSimplifying Effective Data Transformation Via PROC TRANSPOSE
MWSUG 2016 - Paper SA05 Simplifying Effective Data Transformation Via PROC TRANSPOSE Arthur X. Li, City of Hope Comprehensive Cancer Center, Duarte, CA ABSTRACT You can store data with repeated measures
More informationDATA Step Debugger APPENDIX 3
1193 APPENDIX 3 DATA Step Debugger Introduction 1194 Definition: What is Debugging? 1194 Definition: The DATA Step Debugger 1194 Basic Usage 1195 How a Debugger Session Works 1195 Using the Windows 1195
More informationAn Experiment in Visual Clustering Using Star Glyph Displays
An Experiment in Visual Clustering Using Star Glyph Displays by Hanna Kazhamiaka A Research Paper presented to the University of Waterloo in partial fulfillment of the requirements for the degree of Master
More informationMultiple Graphical and Tabular Reports on One Page, Multiple Ways to Do It Niraj J Pandya, CT, USA
Paper TT11 Multiple Graphical and Tabular Reports on One Page, Multiple Ways to Do It Niraj J Pandya, CT, USA ABSTRACT Creating different kind of reports for the presentation of same data sounds a normal
More informationStatements with the Same Function in Multiple Procedures
67 CHAPTER 3 Statements with the Same Function in Multiple Procedures Overview 67 Statements 68 BY 68 FREQ 70 QUIT 72 WEIGHT 73 WHERE 77 Overview Several statements are available and have the same function
More informationMS Office for Engineers
MS Office for Engineers Lesson 4 Excel 2 Pre-reqs/Technical Skills Basic knowledge of Excel Completion of Excel 1 tutorial Basic computer use Expectations Read lesson material Implement steps in software
More informationGary L. Katsanis, Blue Cross and Blue Shield of the Rochester Area, Rochester, NY
Table Lookups in the SAS Data Step Gary L. Katsanis, Blue Cross and Blue Shield of the Rochester Area, Rochester, NY Introduction - What is a Table Lookup? You have a sales file with one observation for
More informationUsing Templates Created by the SAS/STAT Procedures
Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity
More informationEXAMPLE 3: MATCHING DATA FROM RESPONDENTS AT 2 OR MORE WAVES (LONG FORMAT)
EXAMPLE 3: MATCHING DATA FROM RESPONDENTS AT 2 OR MORE WAVES (LONG FORMAT) DESCRIPTION: This example shows how to combine the data on respondents from the first two waves of Understanding Society into
More informationOptimizing System Performance
243 CHAPTER 19 Optimizing System Performance Definitions 243 Collecting and Interpreting Performance Statistics 244 Using the FULLSTIMER and STIMER System Options 244 Interpreting FULLSTIMER and STIMER
More informationCut Out The Cut And Paste: SAS Macros For Presenting Statistical Output ABSTRACT INTRODUCTION
Cut Out The Cut And Paste: SAS Macros For Presenting Statistical Output Myungshin Oh, UCLA Department of Biostatistics Mel Widawski, UCLA School of Nursing ABSTRACT We, as statisticians, often spend more
More informationPROC JACK REG - A SAS~ PROCEDURE FOR JACKKNIFE REGRESSION Panayiotis Hambi, West Virginia University
ABSTRACT PROC JACK REG - A SAS~ PROCEDURE FOR JACKKNIFE REGRESSION Panayiotis Hambi, West Virginia University Daniel M. Chl1ko, West Virginia UniverSity Gerry Hobos, West Virginia University PROC JACKREG
More informationIvy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V)
Ivy s Business Analytics Foundation Certification Details (Module I + II+ III + IV + V) Based on Industry Cases, Live Exercises, & Industry Executed Projects Module (I) Analytics Essentials 81 hrs 1. Statistics
More informationSAS Linear Model Demo. Overview
SAS Linear Model Demo Yupeng Wang, Ph.D, Data Scientist Overview SAS is a popular programming tool for biostatistics and clinical trials data analysis. Here I show an example of using SAS linear regression
More informationPruning the SASLOG Digging into the Roots of NOTEs, WARNINGs, and ERRORs
Pruning the SASLOG Digging into the Roots of NOTEs, WARNINGs, and ERRORs Andrew T. Kuligowski Nielsen Media Research ABSTRACT "Look at your SASLOG." You hear it from instructors of courses related to the
More informationRight-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table
Q Cheat Sheets What to do when you cannot figure out how to use Q What to do when the data looks wrong Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help
More informationTOP 10 (OR MORE) WAYS TO OPTIMIZE YOUR SAS CODE
TOP 10 (OR MORE) WAYS TO OPTIMIZE YOUR SAS CODE Handy Tips for the Savvy Programmer SAS PROGRAMMING BEST PRACTICES Create Readable Code Basic Coding Recommendations» Efficiently choosing data for processing»
More informationProcessing SAS Data Sets
Statistical Data Analysis 1 Processing SAS Data Sets Namhyoung Kim Dept. of Applied Statistics Gachon University nhkim@gachon.ac.kr 1 Using OUT Dataset OUTPUT Statement OUTPUT
More informationIt s Not All Relative: SAS/Graph Annotate Coordinate Systems
Paper TU05 It s Not All Relative: SAS/Graph Annotate Coordinate Systems Rick Edwards, PPD Inc, Wilmington, NC ABSTRACT This paper discusses the SAS/Graph Annotation coordinate systems and how a combination
More informationSAS CURRICULUM. BASE SAS Introduction
SAS CURRICULUM BASE SAS Introduction Data Warehousing Concepts What is a Data Warehouse? What is a Data Mart? What is the difference between Relational Databases and the Data in Data Warehouse (OLTP versus
More informationEffectively Utilizing Loops and Arrays in the DATA Step
Paper 1618-2014 Effectively Utilizing Loops and Arrays in the DATA Step Arthur Li, City of Hope National Medical Center, Duarte, CA ABSTRACT The implicit loop refers to the DATA step repetitively reading
More informationCreating output datasets using SQL (Structured Query Language) only Andrii Stakhniv, Experis Clinical, Ukraine
ABSTRACT PharmaSUG 2015 Paper QT22 Andrii Stakhniv, Experis Clinical, Ukraine PROC SQL is one of the most powerful procedures in SAS. With this tool we can easily manipulate data and create a large number
More informationSAS CLINICAL SYLLABUS. DURATION: - 60 Hours
SAS CLINICAL SYLLABUS DURATION: - 60 Hours BASE SAS PART - I Introduction To Sas System & Architecture History And Various Modules Features Variables & Sas Syntax Rules Sas Data Sets Data Set Options Operators
More informationSo Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines
Paper TT13 So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines Anthony Harris, PPD, Wilmington, NC Robby Diseker, PPD, Wilmington, NC ABSTRACT
More informationThe EMCLUS Procedure. The EMCLUS Procedure
The EMCLUS Procedure Overview Procedure Syntax PROC EMCLUS Statement VAR Statement INITCLUS Statement Output from PROC EMCLUS EXAMPLES-SECTION Example 1: Syntax for PROC FASTCLUS Example 2: Use of the
More informationEasing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide
Paper 809-2017 Easing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide ABSTRACT Marje Fecht, Prowerk Consulting Whether you have been programming in SAS for years, are new to
More information%MISSING: A SAS Macro to Report Missing Value Percentages for a Multi-Year Multi-File Information System
%MISSING: A SAS Macro to Report Missing Value Percentages for a Multi-Year Multi-File Information System Rushi Patel, Creative Information Technology, Inc., Arlington, VA ABSTRACT It is common to find
More informationBUSINESS ANALYTICS. 96 HOURS Practical Learning. DexLab Certified. Training Module. Gurgaon (Head Office)
SAS (Base & Advanced) Analytics & Predictive Modeling Tableau BI 96 HOURS Practical Learning WEEKDAY & WEEKEND BATCHES CLASSROOM & LIVE ONLINE DexLab Certified BUSINESS ANALYTICS Training Module Gurgaon
More information9 Ways to Join Two Datasets David Franklin, Independent Consultant, New Hampshire, USA
9 Ways to Join Two Datasets David Franklin, Independent Consultant, New Hampshire, USA ABSTRACT Joining or merging data is one of the fundamental actions carried out when manipulating data to bring it
More informationPDF Multi-Level Bookmarks via SAS
Paper TS04 PDF Multi-Level Bookmarks via SAS Steve Griffiths, GlaxoSmithKline, Stockley Park, UK ABSTRACT Within the GlaxoSmithKline Oncology team we recently experienced an issue within our patient profile
More informationAutomated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ
PharmaSUG 2015 - Paper QT41 Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ ABSTRACT Most often clinical trial data analysis has tight deadlines with very
More informationThe Dataset Diet How to transform short and fat into long and thin
Paper TU06 The Dataset Diet How to transform short and fat into long and thin Kathryn Wright, Oxford Pharmaceutical Sciences, UK ABSTRACT What do you do when you are given a dataset with one observation
More information