SUAVe Open Problem (May 8 th 2018 meeting)

Size: px
Start display at page:

Download "SUAVe Open Problem (May 8 th 2018 meeting)"

Transcription

1 SUAVe Open Problem (May 8 th 2018 meeting) For this Open Problem, you are given a list of people and the set of (ICD9) diagnoses coded for each person by one or more physicians within a 12 month period. Each person has between one and five distinct ICD9 codes. The problem dataset contains two columns: Name and ICD9. There is one row for each ICD9 code that applies to each person. Your challenge is to find matched pairs of two people each, where both people have the exact same set of ICD9 codes. That is, the full set of ICD9 codes for both persons must be identical. A matched pair could consist of two people who have the same single diagnosis, up to two people who have the same five ICD9 codes. People without a match can be dropped. Here are a couple of sample matches, drawn from the data below: Dora Ted (both have exactly these two codes only: 401 & 53081) Susan Dawn (both have exactly this list of codes: 493, 709, 7099, V048) When there are more than two people with the same set of diagnoses, it is not necessary to identify all pairings among those people (unless you want to). For example, if k people have the exact same codes, it is sufficient to identify the k 1 pairs that include one particular person in the set, or instead a list of k 2 mutually exclusive pairs of people, rather than all ( k ) pairs. However you handle this scenario, describe your approach. 2 To load the problem data into a SAS dataset, you can run the following code. data op_data; infile cards4; input name $ icd9 cards4; Aiden 250 Alfred 311 Alfred 595 Alfred 7099 Alfred 7832 Brian 493 Brian 709 Brian 7099 Brian V048 Brianna 250 Charles 244 Charles 250 Cheryl 7099 Cheryl 7832 Cynthia V25 Dawn 493 Dawn 709 Dawn 7099 Dawn V048 Dora 401 Dora Dylan 401 Dylan Dylan 709 Dylan 715 Dylan V048 Eric V25 Ethan 493 Ethan 7099 Hannah 401 Hannah Hannah 709 Hannah 715 Hannah V048 Helen 244 Janet 311 Janet Janet 595 Janet 7099 Janet 7832 John 244 John 250 Mary 244 Mary 250 Matthew 250 Michael 244 Norbert 7099 Norbert 7832 Robert 401 Robert Samantha 250 Sean 493 Sean 7099 Sean 7832 Sophia 493 Sophia 7099 Steven 595 Steven 715 Steven V048 Susan 493 Susan 709 Susan 7099 Susan V048 Ted 401 Ted Violet 401 Violet Willliam 311 Willliam 595 Willliam 7099 Willliam 7832 ;;;; 1

2 Solution 1. David Mah, BC Ministry of Health * put the diag codes into separate columns; data TMP1; set OP_DATA; if first.name then ct=1; else ct+1; proc sort data=tmp1; by name ct icd9; data TMP; set TMP1; format diag1-diag5 $5.; array diag(5) diag1-diag5; do i=1 to 5; if i=ct then do; diag(i) = icd9; keep name diag1-diag5; * combine the diag codes into one row per person; proc sort data=tmp; by name diag1-diag5; data UPDT; update TMP (obs=0) TMP; data COMBINE; set UPDT; diag=diag1 diag2 diag3 diag4 diag5; d2=compress(diag,''); drop diag1-diag5 diag; proc sort data=combine; by d2 name; data RETDAT; set COMBINE; by d2 name; format diagcd $20.; if first.d2 then do; retain d2; diagcd=d2; * count the number of distinct people in the data and put the number in a global var; create table PEEPS as select count(*) as peepcnt from RETDAT; 2

3 data NULL; set PEEPS; %global nums; call symput('nums',peepcnt); * create separate datasets for each person; %macro DODS (y); %let x=&y.; %do z=1 %to &x.; data DS&z.; set RETDAT; if _n_ = &z. then output DS&z.; % %m %DODS(&nums.); * match datasets; * checks for matching pairs of datasets; * deletes empty datasets; %macro MATCHDS (y); %global f; *number of successful matches; %let a=1; %let z=1; %do %until (&z.=&y.); %let z1=&z.; %let z2=%eval(&z1.+1); create table MATCH&z1. as select a.name, b.name as name2, a.diagcd, b.d2 as diagcd2 from DS&z1. a, DS&z2. b where a.d2=b.d2; * check for empty dataset looking at the number of observations in each dataset; data _null_; call symput('obscnt',0); set MATCH&z1.; call symput('obscnt',_n_); stop; * deletes empty datasets; %If &obscnt=0 %then %do; %put "There are no records in the dataset MATCH&z."; proc delete data=match&z1. (gennum=all); * delete dataset; % %else %do; create table SUCCESS&a. as select * from MATCH&z1.; %let a=%eval(&a.+1); %let f=%eval(&a.-1); % %let z=%eval(&z1.+1); % 3

4 %m %MATCHDS (&nums.); * combine all non empty datasets into one final dataset; data FINAL; set SUCCESS1-SUCCESS&f.; Solution 2. Arthur Tabachneck, Analyst Finder, Inc. proc transpose data=op_data out=op_data_wide (drop=_:) prefix=icd9_; var icd9; /* Sort table in order of icd9 codes */ proc sort data=op_data_wide; by icd9_1-icd9_5; /* Select mutually exclusive pairs for first person with matching codes */ data want (keep=pair); set op_data_wide; by icd9_1-icd9_5; retain first; if not (first.icd9_5 and last.icd9_5) then do; if first.icd9_5 then first=name; else do; pair=catx('+',first,name); Solution 3. Peter Ott, BC Ministry of FLNRORD data convert; *convert codes to a wide character string; length string $ 30; set op_data; retain string; if first.name then do; string=''; string=catx(",",string,icd9); if last.name then keep name string; 4

5 data pairs; *all possible unique matching pairs; set convert; pos=_n_; do k=pos+1 to nobs; *note starting index of loop; set convert(rename=(name=cname string=cstring)) point=k nobs=nobs; if string=cstring then do; pair1=name; pair2=cname; keep pair1 pair2 string; Solution 4. Robert Ellsworth, Ellsworth Stewart Consulting /*** The approach is to build a key with all the codes a person has (max 5). I sort by the name so I can create a compound key as I scan through the rows. I sort by the code so the codes will be in the same order for everyone. Join the table to itself using my constructed key to get all the matches. I use the a.name < b.name so I don t get a person matching themselves or the same match in a different order. ***/ data op_unique; retain icd9by5; length icd9by5 $25; set op_data; if first.name then icd9by5 = icd9; else icd9by5 = trim(icd9by5) "," icd9; if last.name ; create table matches as select a.name as name_1, b.name as name_2, a.icd9by5 from op_unique a join op_unique b on a.icd9by5 = b.icd9by5 and a.name < b.name order by a.icd9by5, a.name, b.name; proc print data =matches noobs; var name_1 name_2 icd9by5; 5

Open Problem for SUAVe User Group Meeting, November 26, 2013 (UVic)

Open Problem for SUAVe User Group Meeting, November 26, 2013 (UVic) Open Problem for SUAVe User Group Meeting, November 26, 2013 (UVic) Background The data in a SAS dataset is organized into variables and observations, which equate to rows and columns. While the order

More information

Chapter 6: Modifying and Combining Data Sets

Chapter 6: Modifying and Combining Data Sets Chapter 6: Modifying and Combining Data Sets The SET statement is a powerful statement in the DATA step. Its main use is to read in a previously created SAS data set which can be modified and saved as

More information

INTERMEDIATE SQL GOING BEYOND THE SELECT. Created by Brian Duffey

INTERMEDIATE SQL GOING BEYOND THE SELECT. Created by Brian Duffey INTERMEDIATE SQL GOING BEYOND THE SELECT Created by Brian Duffey WHO I AM Brian Duffey 3 years consultant at michaels, ross, and cole 9+ years SQL user What have I used SQL for? ROADMAP Introduction 1.

More information

Example1D.1.sas. * Procedures : ; * 1. print to show the dataset. ;

Example1D.1.sas. * Procedures : ; * 1. print to show the dataset. ; Example1D.1.sas * SAS example program 1D.1 ; * 1. Create a dataset called prob from the following data: ; * age prob lb ub ; * 24.25.20.31 ; * 36.26.21.32 ; * 48.28.24.33 ; * 60.31.28.36 ; * 72.35.32.39

More information

proc print data=account; <insert statement here> run;

proc print data=account; <insert statement here> run; Statistics 6250 Name: Fall 2012 (print: first last ) Prof. Fan NetID #: Midterm Three Instructions: This is an in-class and open book midterm. You must write your answers on the provide spaces. Give concise

More information

Ranking Between the Lines

Ranking Between the Lines Ranking Between the Lines A %MACRO for Interpolated Medians By Joe Lorenz SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in

More information

Taming a Spreadsheet Importation Monster

Taming a Spreadsheet Importation Monster SESUG 2013 Paper BtB-10 Taming a Spreadsheet Importation Monster Nat Wooding, J. Sargeant Reynolds Community College ABSTRACT As many programmers have learned to their chagrin, it can be easy to read Excel

More information

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS

INTRODUCTION TO SAS HOW SAS WORKS READING RAW DATA INTO SAS TO SAS NEED FOR SAS WHO USES SAS WHAT IS SAS? OVERVIEW OF BASE SAS SOFTWARE DATA MANAGEMENT FACILITY STRUCTURE OF SAS DATASET SAS PROGRAM PROGRAMMING LANGUAGE ELEMENTS OF THE SAS LANGUAGE RULES FOR SAS

More information

Macros are a block of code that can be executed/called on demand

Macros are a block of code that can be executed/called on demand What Are These Macros are a block of code that can be executed/called on demand Global variables are variables that you assign a value to, which can be referenced anywhere within your program. (Leah s

More information

Let SAS Help You Easily Find and Access Your Folders and Files

Let SAS Help You Easily Find and Access Your Folders and Files Paper 11720-2016 Let SAS Help You Easily Find and Access Your Folders and Files ABSTRACT Ting Sa, Cincinnati Children s Hospital Medical Center In this paper, a SAS macro is introduced that can help users

More information

Quality Control of Clinical Data Listings with Proc Compare

Quality Control of Clinical Data Listings with Proc Compare ABSTRACT Quality Control of Clinical Data Listings with Proc Compare Robert Bikwemu, Pharmapace, Inc., San Diego, CA Nicole Wallstedt, Pharmapace, Inc., San Diego, CA Checking clinical data listings with

More information

Introduction to SAS Mike Zdeb ( , #1

Introduction to SAS Mike Zdeb ( , #1 Mike Zdeb (402-6479, msz03@albany.edu) #1 (10) REARRANGING DATA If you want to conduct an analysis across observations in a data set, you can use SAS procedures. If you want to conduct an analysis within

More information

PharmaSUG Paper TT11

PharmaSUG Paper TT11 PharmaSUG 2014 - Paper TT11 What is the Definition of Global On-Demand Reporting within the Pharmaceutical Industry? Eric Kammer, Novartis Pharmaceuticals Corporation, East Hanover, NJ ABSTRACT It is not

More information

11/27/2011. Derek Chapman, PhD December Data Linkage Techniques: Tricks of the Trade. General data cleaning issue

11/27/2011. Derek Chapman, PhD December Data Linkage Techniques: Tricks of the Trade. General data cleaning issue Derek Chapman, PhD December 2011 Data Linkage Techniques: Tricks of the Trade General data cleaning issue Linkage can create more duplicates Easier to deal with before linkage Accurate counts are important

More information

My objective is twofold: Examine the capabilities of MP Connect and apply those capabilities to a real-world application.

My objective is twofold: Examine the capabilities of MP Connect and apply those capabilities to a real-world application. Abstract MP CONNECT: Warp Engine for SAS (Multi-Processing in the Sun Solaris Environment). Pablo J. Nogueras CitiFinancial International, Risk Management Technology, Irving, Texas When you are assigned

More information

T.I.P.S. (Techniques and Information for Programming in SAS )

T.I.P.S. (Techniques and Information for Programming in SAS ) Paper PO-088 T.I.P.S. (Techniques and Information for Programming in SAS ) Kathy Harkins, Carolyn Maass, Mary Anne Rutkowski Merck Research Laboratories, Upper Gwynedd, PA ABSTRACT: This paper provides

More information

Tutorial 5 Advanced Queries and Enhancing Table Design

Tutorial 5 Advanced Queries and Enhancing Table Design Tutorial 5 Advanced Queries and Enhancing Table Design (Sessions 1 and 3 only) The Clinic Database Clinic.accdb file for Tutorials 5-8 object names include tags no spaces in field names to promote upsizing

More information

An exercise in separating client-specific parameters from your program

An exercise in separating client-specific parameters from your program An exercise in separating client-specific parameters from your program Erik Tilanus The Netherlands WIILSU 2015 Milwaukee Do you recognize this? You write a 'one-time' program for one particular situation

More information

A Macro that can Search and Replace String in your SAS Programs

A Macro that can Search and Replace String in your SAS Programs ABSTRACT MWSUG 2016 - Paper BB27 A Macro that can Search and Replace String in your SAS Programs Ting Sa, Cincinnati Children s Hospital Medical Center, Cincinnati, OH In this paper, a SAS macro is introduced

More information

SAS 101. Based on Learning SAS by Example: A Programmer s Guide Chapter 21, 22, & 23. By Tasha Chapman, Oregon Health Authority

SAS 101. Based on Learning SAS by Example: A Programmer s Guide Chapter 21, 22, & 23. By Tasha Chapman, Oregon Health Authority SAS 101 Based on Learning SAS by Example: A Programmer s Guide Chapter 21, 22, & 23 By Tasha Chapman, Oregon Health Authority Topics covered All the leftovers! Infile options Missover LRECL=/Pad/Truncover

More information

Networks and Web for Health Informatics (HINF 6220) Tutorial 13 : PHP 29 Oct 2015

Networks and Web for Health Informatics (HINF 6220) Tutorial 13 : PHP 29 Oct 2015 Networks and Web for Health Informatics (HINF 6220) Tutorial 13 : PHP 29 Oct 2015 PHP Arrays o Arrays are single variables that store multiple values at the same time! o Consider having a list of values

More information

Correcting for natural time lag bias in non-participants in pre-post intervention evaluation studies

Correcting for natural time lag bias in non-participants in pre-post intervention evaluation studies Correcting for natural time lag bias in non-participants in pre-post intervention evaluation studies Gandhi R Bhattarai PhD, OptumInsight, Rocky Hill, CT ABSTRACT Measuring the change in outcomes between

More information

Chaining Logic in One Data Step Libing Shi, Ginny Rego Blue Cross Blue Shield of Massachusetts, Boston, MA

Chaining Logic in One Data Step Libing Shi, Ginny Rego Blue Cross Blue Shield of Massachusetts, Boston, MA Chaining Logic in One Data Step Libing Shi, Ginny Rego Blue Cross Blue Shield of Massachusetts, Boston, MA ABSTRACT Event dates stored in multiple rows pose many challenges that have typically been resolved

More information

Getting Started with Arrays and DO LOOPS. Created 01/2004, Updated 03/2008, Afton Royal Training & Consulting

Getting Started with Arrays and DO LOOPS. Created 01/2004, Updated 03/2008, Afton Royal Training & Consulting Getting Started with Arrays and DO LOOPS Created 01/2004, Updated 03/2008, Afton Royal Training & Consulting What to Expect We have 30 minutes to learn how to use DO LOOPS and arrays. We will assume you

More information

PhoneBook (Sorting) You will be modifying your PhoneBook class to sort by different fields in a contact.

PhoneBook (Sorting) You will be modifying your PhoneBook class to sort by different fields in a contact. PhoneBook (Sorting) Background: You will be modifying your PhoneBook class to sort by different fields in a contact. Assignment: 1. Create a project called PhoneBookSort. Copy your PhoneBook class and

More information

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO

Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO Using PROC SQL to Calculate FIRSTOBS David C. Tabano, Kaiser Permanente, Denver, CO ABSTRACT The power of SAS programming can at times be greatly improved using PROC SQL statements for formatting and manipulating

More information

A Format to Make the _TYPE_ Field of PROC MEANS Easier to Interpret Matt Pettis, Thomson West, Eagan, MN

A Format to Make the _TYPE_ Field of PROC MEANS Easier to Interpret Matt Pettis, Thomson West, Eagan, MN Paper 045-29 A Format to Make the _TYPE_ Field of PROC MEANS Easier to Interpret Matt Pettis, Thomson West, Eagan, MN ABSTRACT: PROC MEANS analyzes datasets according to the variables listed in its Class

More information

Program Validation: Logging the Log

Program Validation: Logging the Log Program Validation: Logging the Log Adel Fahmy, Symbiance Inc., Princeton, NJ ABSTRACT Program Validation includes checking both program Log and Logic. The program Log should be clear of any system Error/Warning

More information

Checking for Duplicates Wendi L. Wright

Checking for Duplicates Wendi L. Wright Checking for Duplicates Wendi L. Wright ABSTRACT This introductory level paper demonstrates a quick way to find duplicates in a dataset (with both simple and complex keys). It discusses what to do when

More information

Base and Advance SAS

Base and Advance SAS Base and Advance SAS BASE SAS INTRODUCTION An Overview of the SAS System SAS Tasks Output produced by the SAS System SAS Tools (SAS Program - Data step and Proc step) A sample SAS program Exploring SAS

More information

Essentials of PDV: Directing the Aim to Understanding the DATA Step! Arthur Xuejun Li, City of Hope National Medical Center, Duarte, CA

Essentials of PDV: Directing the Aim to Understanding the DATA Step! Arthur Xuejun Li, City of Hope National Medical Center, Duarte, CA PharmaSUG 2013 - Paper TF17 Essentials of PDV: Directing the Aim to Understanding the DATA Step! Arthur Xuejun Li, City of Hope National Medical Center, Duarte, CA ABSTRACT Beginning programmers often

More information

SAS Application Development Using Windows RAD Software for Front End

SAS Application Development Using Windows RAD Software for Front End Applications Development SAS Application Development Using Windows RAD Software for Front End Zhuan (John) Xu Blue Cross Blue Shield ofiowa & Big Creek Software, Des Moines, IA Abstract This paper presents

More information

The Art of Defensive Programming: Coping with Unseen Data

The Art of Defensive Programming: Coping with Unseen Data INTRODUCTION Paper 1791-2018 The Art of Defensive Programming: Coping with Unseen Data Philip R Holland, Holland Numerics Limited, United Kingdom This paper discusses how you cope with the following data

More information

LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC

LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC ABSTRACT PharmaSUG 2013 - Paper PO01 LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC The need for producing error free programming

More information

GiftWorks Import Guide Page 2

GiftWorks Import Guide Page 2 Import Guide Introduction... 2 GiftWorks Import Services... 3 Import Sources... 4 Preparing for Import... 9 Importing and Matching to Existing Donors... 11 Handling Receipting of Imported Donations...

More information

SAS Online Training: Course contents: Agenda:

SAS Online Training: Course contents: Agenda: SAS Online Training: Course contents: Agenda: (1) Base SAS (6) Clinical SAS Online Training with Real time Projects (2) Advance SAS (7) Financial SAS Training Real time Projects (3) SQL (8) CV preparation

More information

SUGI 29 Data Warehousing, Management and Quality

SUGI 29 Data Warehousing, Management and Quality Building a Purchasing Data Warehouse for SRM from Disparate Procurement Systems Zeph Stemle, Qualex Consulting Services, Inc., Union, KY ABSTRACT SAS Supplier Relationship Management (SRM) solution offers

More information

Planting Your Rows: Using SAS Formats to Make the Generation of Zero- Filled Rows in Tables Less Thorny

Planting Your Rows: Using SAS Formats to Make the Generation of Zero- Filled Rows in Tables Less Thorny Planting Your Rows: Using SAS Formats to Make the Generation of Zero- Filled Rows in Tables Less Thorny Kathy Hardis Fraeman, United BioSource Corporation, Bethesda, MD ABSTRACT Often tables or summary

More information

UMBC CMSC 331 Final Exam

UMBC CMSC 331 Final Exam UMBC CMSC 331 Final Exam Name: UMBC Username: You have two hours to complete this closed book exam. We reserve the right to assign partial credit, and to deduct points for answers that are needlessly wordy

More information

DO NOT SEND DUPLICATE COPIES OF YOUR LOG AND DO NOT SEND A PRINTED COPY.

DO NOT SEND DUPLICATE COPIES OF YOUR LOG AND DO NOT SEND A PRINTED COPY. AMERICAN BOARD OF UROLOGY 2018 LIFE LONG LEARNING (LLL) LEVEL 2 PEDIATRIC UROLOGY SUBSPECIALTY CERTIFICATION EXAMINATION PROCESS INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions

More information

DO Loop Processing. DO Loop Processing. Repetitive Coding. DO Loop Processing

DO Loop Processing. DO Loop Processing. Repetitive Coding. DO Loop Processing DO Loop Processing DO Loop Processing Statements within a DO loop execute for a specific number of iterations or until a specific condition stops the loop. DATA statement SAS statements DO statement iterated

More information

Ready To Become Really Productive Using PROC SQL? Sunil K. Gupta, Gupta Programming, Simi Valley, CA

Ready To Become Really Productive Using PROC SQL? Sunil K. Gupta, Gupta Programming, Simi Valley, CA PharmaSUG 2012 - Paper HW04 Ready To Become Really Productive Using PROC SQL? Sunil K. Gupta, Gupta Programming, Simi Valley, CA ABSTRACT Using PROC SQL, can you identify at least four ways to: select

More information

LIFE LONG LEARNING LEVEL INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS

LIFE LONG LEARNING LEVEL INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS AMERICAN BOARD OF UROLOGY LIFE LONG LEARNING LEVEL 2 2018 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions carefully before preparing your log. It is imperative that you carefully

More information

2017 MOC PEDIATRIC PRACTICE LOG TEMPLATE:

2017 MOC PEDIATRIC PRACTICE LOG TEMPLATE: AMERICAN BOARD OF UROLOGY 2017 MAINTENANCE OF CERTIFICATION (MOC) Level 4 PEDIATRIC UROLOGY SUBSPECIALTY CERTIFICATION EXAMINATION PROCESS INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all

More information

Greenspace: A Macro to Improve a SAS Data Set Footprint

Greenspace: A Macro to Improve a SAS Data Set Footprint Paper AD-150 Greenspace: A Macro to Improve a SAS Data Set Footprint Brian Varney, Experis Business Intelligence and Analytics Practice ABSTRACT SAS programs can be very I/O intensive. SAS data sets with

More information

Using Templates Created by the SAS/STAT Procedures

Using Templates Created by the SAS/STAT Procedures Paper 081-29 Using Templates Created by the SAS/STAT Procedures Yanhong Huang, Ph.D. UMDNJ, Newark, NJ Jianming He, Solucient, LLC., Berkeley Heights, NJ ABSTRACT SAS procedures provide a large quantity

More information

EE221 Databases Practicals Manual

EE221 Databases Practicals Manual EE221 Databases Practicals Manual Lab 1 An Introduction to SQL Lab 2 Database Creation and Querying using SQL Assignment Data Analysis, Database Design, Implementation and Relation Normalisation School

More information

Introduction to SQL 4/24/2017. University of Iowa SAS Users Group. 1. Introduction and basic uses 2. Joins and Views 3. Reporting examples

Introduction to SQL 4/24/2017. University of Iowa SAS Users Group. 1. Introduction and basic uses 2. Joins and Views 3. Reporting examples University of Iowa SAS Users Group 1. Introduction and basic uses 2. Joins and Views 3. Reporting examples 1 Patient Sex YOB Blood Type Visit # Weight Joe male 73 A+ 1 150 Joe male 73 A+ 2 153 Joe male

More information

A SAS Macro for Producing Benchmarks for Interpreting School Effect Sizes

A SAS Macro for Producing Benchmarks for Interpreting School Effect Sizes A SAS Macro for Producing Benchmarks for Interpreting School Effect Sizes Brian E. Lawton Curriculum Research & Development Group University of Hawaii at Manoa Honolulu, HI December 2012 Copyright 2012

More information

1 Files to download. 3 Macro to list the highest and lowest N data values. 2 Reading in the example data file

1 Files to download. 3 Macro to list the highest and lowest N data values. 2 Reading in the example data file 1 2 22S:172 Lab session 10 Macros for data cleaning July 17, 2003 GENDER VISIT HR SBP DBP DX AE = "Gender" = "Visit Date" = "Heart Rate" = "Systolic Blood Pressure" = "Diastolic Blood Pressure" = "Diagnosis

More information

Effectively Utilizing Loops and Arrays in the DATA Step

Effectively Utilizing Loops and Arrays in the DATA Step Paper 1618-2014 Effectively Utilizing Loops and Arrays in the DATA Step Arthur Li, City of Hope National Medical Center, Duarte, CA ABSTRACT The implicit loop refers to the DATA step repetitively reading

More information

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions Introduction to SAS Procedures SAS Basics III Susan J. Slaughter, Avocet Solutions DATA versus PROC steps Two basic parts of SAS programs DATA step PROC step Begin with DATA statement Begin with PROC statement

More information

Using Microsoft Access

Using Microsoft Access Using Microsoft Access Creating Select Queries Norm Downey Chapter 2 pages 173 193 and Chapter 3 pages 218 249 2 1 This PowerPoint uses the Sample Databases on the class website Please download them now

More information

Sample Questions. SAS Advanced Programming for SAS 9. Question 1. Question 2

Sample Questions. SAS Advanced Programming for SAS 9. Question 1. Question 2 Sample Questions The following sample questions are not inclusive and do not necessarily represent all of the types of questions that comprise the exams. The questions are not designed to assess an individual's

More information

My SAS Grid Scheduler

My SAS Grid Scheduler ABSTRACT Paper 1148-2017 My SAS Grid Scheduler Patrick Cuba, Cuba BI Consulting No Batch Scheduler? No problem! This paper describes the use of a SAS DI Studio job that can be started by a time dependent

More information

Using Microsoft Access

Using Microsoft Access Using Microsoft Access Creating Queries Norm Downey This PowerPoint uses the Sample Databases on the class website Please download them now 2 1 What is a Query? Queries allow the user to manipulate the

More information

Macros for Two-Sample Hypothesis Tests Jinson J. Erinjeri, D.K. Shifflet and Associates Ltd., McLean, VA

Macros for Two-Sample Hypothesis Tests Jinson J. Erinjeri, D.K. Shifflet and Associates Ltd., McLean, VA Paper CC-20 Macros for Two-Sample Hypothesis Tests Jinson J. Erinjeri, D.K. Shifflet and Associates Ltd., McLean, VA ABSTRACT Statistical Hypothesis Testing is performed to determine whether enough statistical

More information

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency.

Paper DB2 table. For a simple read of a table, SQL and DATA step operate with similar efficiency. Paper 76-28 Comparative Efficiency of SQL and Base Code When Reading from Database Tables and Existing Data Sets Steven Feder, Federal Reserve Board, Washington, D.C. ABSTRACT In this paper we compare

More information

To conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic.

To conceptualize the process, the table below shows the highly correlated covariates in descending order of their R statistic. Automating the process of choosing among highly correlated covariates for multivariable logistic regression Michael C. Doherty, i3drugsafety, Waltham, MA ABSTRACT In observational studies, there can be

More information

KEYWORDS Metadata, macro language, CALL EXECUTE, %NRSTR, %TSLIT

KEYWORDS Metadata, macro language, CALL EXECUTE, %NRSTR, %TSLIT MWSUG 2017 - Paper BB15 Building Intelligent Macros: Driving a Variable Parameter System with Metadata Arthur L. Carpenter, California Occidental Consultants, Anchorage, Alaska ABSTRACT When faced with

More information

David Franklin Independent SAS Consultant TheProgramersCabin.com

David Franklin Independent SAS Consultant TheProgramersCabin.com Countdown of the Top 10 Ways to Merge Data Trivia The film The Poseidon Adventure is based on a real life event that involved the Queen Mary in 1942 the ship was hit by a 92 foot wave which listed the

More information

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX

From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX Paper 152-27 From Manual to Automatic with Overdrive - Using SAS to Automate Report Generation Faron Kincheloe, Baylor University, Waco, TX ABSTRACT This paper is a case study of how SAS products were

More information

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ PharmaSUG 2015 - Paper QT41 Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ ABSTRACT Most often clinical trial data analysis has tight deadlines with very

More information

Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA

Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA ABSTRACT This paper describes for an intermediate SAS user the use of PROC REPORT to create

More information

An Introduction to SAS University Edition

An Introduction to SAS University Edition An Introduction to SAS University Edition Ron Cody From An Introduction to SAS University Edition. Full book available for purchase here. Contents List of Programs... xi About This Book... xvii About the

More information

Laboratory: Recursion Basics

Laboratory: Recursion Basics Fundamentals of Computer Science I: Media Computing (CS151.02 2007F) Laboratory: Recursion Basics Summary: In this laboratory, you will explore some basic concepts in recursing over lists. Contents: Preparation

More information

Intro to Structured Query Language Part I

Intro to Structured Query Language Part I Intro to Structured Query Language Part I The Select Statement In a relational database, data is stored in tables. An example table would relate Social Security Number, Name, and Address: EmployeeAddressTable

More information

In This Lecture. More SQL Select. SQL SELECT Overview. Aliases. Example. Example. More SQL Select. Database Systems Lecture 8 Natasha Alechina

In This Lecture. More SQL Select. SQL SELECT Overview. Aliases. Example. Example. More SQL Select. Database Systems Lecture 8 Natasha Alechina In This Lecture More SQL Select Database Systems Lecture 8 Natasha Alechina More SQL Select Aliases Self-joins IN, EXISTS, ANY, ALL For more information Connoly and Begg Chapter 5 Ullman and Widom Chapter

More information

Contents of SAS Programming Techniques

Contents of SAS Programming Techniques Contents of SAS Programming Techniques Chapter 1 About SAS 1.1 Introduction 1.1.1 SAS modules 1.1.2 SAS module classification 1.1.3 SAS features 1.1.4 Three levels of SAS techniques 1.1.5 Chapter goal

More information

AMERICAN BOARD OF UROLOGY 2017 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS

AMERICAN BOARD OF UROLOGY 2017 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS AMERICAN BOARD OF UROLOGY 2017 INSTRUCTIONS FOR SUBMISSION OF ELECTRONIC LOGS Please read all instructions carefully before preparing your log. It is imperative that you carefully review the data contained

More information

22S:172. Duplicates. may need to check for either duplicate ID codes or duplicate observations duplicate observations should just be eliminated

22S:172. Duplicates. may need to check for either duplicate ID codes or duplicate observations duplicate observations should just be eliminated 22S:172 1 2 Duplicates Data Cleaning involving duplicate IDs and duplicate records may need to check for either duplicate ID codes or duplicate observations duplicate observations should just be eliminated

More information

Getting Classy: A SAS Macro for CLASS Statement Automation

Getting Classy: A SAS Macro for CLASS Statement Automation Getting Classy: A SAS Macro for CLASS Statement Automation ABSTRACT When creating statistical models that include multiple covariates, it is important to address which variables are considered categorical

More information

Stat 302 Statistical Software and Its Applications SAS Functions

Stat 302 Statistical Software and Its Applications SAS Functions Stat 302 Statistical Software and Its Applications SAS Functions Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 31 Creating New Variables Here we create new variables using

More information

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions Introduction to SAS Procedures SAS Basics III Susan J. Slaughter, Avocet Solutions SAS Essentials Section for people new to SAS Core presentations 1. How SAS Thinks 2. Introduction to DATA Step Programming

More information

Imelda C. Go, South Carolina Department of Education, Columbia, SC

Imelda C. Go, South Carolina Department of Education, Columbia, SC ABSTRACT Paper P02-04 Matching SAS Data Sets: If at First You Don t Succeed, Match, Match Again Imelda C. Go, South Carolina Department of Education, Columbia, SC Two data sets are often matched by using

More information

Using a HASH Table to Reference Variables in an Array by Name. John Henry King, Hopper, Arkansas

Using a HASH Table to Reference Variables in an Array by Name. John Henry King, Hopper, Arkansas PharmaSUG 2011 - Paper TT04 Using a HASH Table to Reference Variables in an Array by Name John Henry King, Hopper, Arkansas ABSTRACT Array elements are referenced by their index value using a constant,

More information

Macro Method to use Google Maps and SAS to Geocode a Location by Name or Address

Macro Method to use Google Maps and SAS to Geocode a Location by Name or Address Paper 2684-2018 Macro Method to use Google Maps and SAS to Geocode a Location by Name or Address Laurie Smith, Cincinnati Children s Hospital Medical Center, Cincinnati, Ohio ABSTRACT Google Maps is a

More information

The new SAS 9.2 FCMP Procedure, what functions are in your future? John H. Adams, Boehringer Ingelheim Pharmaceutical, Inc.

The new SAS 9.2 FCMP Procedure, what functions are in your future? John H. Adams, Boehringer Ingelheim Pharmaceutical, Inc. PharmaSUG2010 - Paper AD02 The new SAS 9.2 FCMP Procedure, what functions are in your future? John H. Adams, Boehringer Ingelheim Pharmaceutical, Inc., Ridgefield, CT ABSTRACT Our company recently decided

More information

The Building Blocks of SAS Datasets. (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group

The Building Blocks of SAS Datasets. (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group The Building Blocks of SAS Datasets S-M-U (Set, Merge, and Update) Andrew T. Kuligowski FCCI Insurance Group S-M-U What is S M U? 2 S-M-U What is S M U? Shmoo? 3 S-M-U What is S M U? Southern Methodist

More information

ECLT 5810 SAS Programming - Introduction

ECLT 5810 SAS Programming - Introduction ECLT 5810 SAS Programming - Introduction Why SAS? Able to process data set(s). Easy to handle multiple variables. Generate useful basic analysis Summary statistics Graphs Many companies and government

More information

Merge Processing and Alternate Table Lookup Techniques Prepared by

Merge Processing and Alternate Table Lookup Techniques Prepared by Merge Processing and Alternate Table Lookup Techniques Prepared by The syntax for data step merging is as follows: International SAS Training and Consulting This assumes that the incoming data sets are

More information

TOP 10 (OR MORE) WAYS TO OPTIMIZE YOUR SAS CODE

TOP 10 (OR MORE) WAYS TO OPTIMIZE YOUR SAS CODE TOP 10 (OR MORE) WAYS TO OPTIMIZE YOUR SAS CODE Handy Tips for the Savvy Programmer SAS PROGRAMMING BEST PRACTICES Create Readable Code Basic Coding Recommendations» Efficiently choosing data for processing»

More information

Databases - Classic Models

Databases - Classic Models Databases - Classic Models Gordon Royle School of Mathematics & Statistics University of Western Australia Gordon Royle (UWA) Classic Models 1 / 33 This lecture This lecture analyses a freely available

More information

Updates - Insert. SQL Other Features and Commands. Updates Bulk Insert. Outline. Update commands Views: more details NULL values Constraints

Updates - Insert. SQL Other Features and Commands. Updates Bulk Insert. Outline. Update commands Views: more details NULL values Constraints Updates - Insert SQL Other Features and ommands Davood afiei emp(sin, name, phone, city) INSET INSET INTO emp VLUES ( 999, John Doe, 444-555, Edmonton ) INSET INTO emp(sin,name,city) VLUES ( 999, Jim Gray,

More information

The Proc Transpose Cookbook

The Proc Transpose Cookbook ABSTRACT PharmaSUG 2017 - Paper TT13 The Proc Transpose Cookbook Douglas Zirbel, Wells Fargo and Co. Proc TRANSPOSE rearranges columns and rows of SAS datasets, but its documentation and behavior can be

More information

Getting Started with the XML Engine

Getting Started with the XML Engine 3 CHAPTER 1 Getting Started with the XML Engine What Does the XML Engine Do? 3 Understanding How the XML Engine Works 3 Assigning a Libref to an XML Document 3 Importing an XML Document 4 Exporting an

More information

Better Metadata Through SAS II: %SYSFUNC, PROC DATASETS, and Dictionary Tables

Better Metadata Through SAS II: %SYSFUNC, PROC DATASETS, and Dictionary Tables Paper 3458-2015 Better Metadata Through SAS II: %SYSFUNC, PROC DATASETS, and Dictionary Tables ABSTRACT Louise Hadden, Abt Associates Inc., Cambridge, MA SAS provides a wealth of resources for users to

More information

Are you Still Afraid of Using Arrays? Let s Explore their Advantages

Are you Still Afraid of Using Arrays? Let s Explore their Advantages Paper CT07 Are you Still Afraid of Using Arrays? Let s Explore their Advantages Vladyslav Khudov, Experis Clinical, Kharkiv, Ukraine ABSTRACT At first glance, arrays in SAS seem to be a complicated and

More information

Contents. Overview How SAS processes programs Compilation phase Execution phase Debugging a DATA step Testing your programs

Contents. Overview How SAS processes programs Compilation phase Execution phase Debugging a DATA step Testing your programs SAS Data Step Contents Overview How SAS processes programs Compilation phase Execution phase Debugging a DATA step Testing your programs 2 Overview Introduction This section teaches you what happens "behind

More information

Using a Control Dataset to Manage Production Compiled Macro Library Curtis E. Reid, Bureau of Labor Statistics, Washington, DC

Using a Control Dataset to Manage Production Compiled Macro Library Curtis E. Reid, Bureau of Labor Statistics, Washington, DC AP06 Using a Control Dataset to Manage Production Compiled Macro Library Curtis E. Reid, Bureau of Labor Statistics, Washington, DC ABSTRACT By default, SAS compiles and stores all macros into the WORK

More information

Moving Data and Results Between SAS and Excel. Harry Droogendyk Stratia Consulting Inc.

Moving Data and Results Between SAS and Excel. Harry Droogendyk Stratia Consulting Inc. Moving Data and Results Between SAS and Excel Harry Droogendyk Stratia Consulting Inc. Introduction SAS can read ( and write ) anything Introduction In the end users want EVERYTHING in. Introduction SAS

More information

Arrays: The How and the Why of it All Jane Stroupe

Arrays: The How and the Why of it All Jane Stroupe Arrays: The How and the Why of it All Jane Stroupe When you need to join data sets, do you even ask yourself Why don t I just use an array? Perhaps because arrays are a mystery or perhaps because you have

More information

Certkiller.A QA

Certkiller.A QA Certkiller.A00-260.70.QA Number: A00-260 Passing Score: 800 Time Limit: 120 min File Version: 3.3 It is evident that study guide material is a victorious and is on the top in the exam tools market and

More information

The Problem With NODUPLICATES, Continued

The Problem With NODUPLICATES, Continued The Problem With NODUPLICATES, Continued Jack Hamilton First Health West Sacramento, California JackHamilton@FirstHealth.com moredupsov.doc Wednesday, Wed Apr 28 1999 12:37 PM Page 1 of 11 What Should

More information

Introduction to MS Access: creating tables, keys, and relationships

Introduction to MS Access: creating tables, keys, and relationships Introduction to MS Access: creating tables, keys, and relationships BSAD 141 Dave Novak Topics Covered Brief introduction to MS Access Name and save a DB file Create tables and keys Create and enforce

More information

Dictionary.coumns is your friend while appending or moving data

Dictionary.coumns is your friend while appending or moving data ABSTRACT SESUG Paper CC-41-2017 Dictionary.coumns is your friend while appending or moving data Kiran Venna, Dataspace Inc. Dictionary.columns is a dictionary table, which gives metadata information of

More information

Automating Preliminary Data Cleaning in SAS

Automating Preliminary Data Cleaning in SAS Paper PO63 Automating Preliminary Data Cleaning in SAS Alec Zhixiao Lin, Loan Depot, Foothill Ranch, CA ABSTRACT Preliminary data cleaning or scrubbing tries to delete the following types of variables

More information

2. Don t forget semicolons and RUN statements The two most common programming errors.

2. Don t forget semicolons and RUN statements The two most common programming errors. Randy s SAS hints March 7, 2013 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, March 8, 2013 ***************; 2. Don t forget semicolons and

More information

Creating Macro Calls using Proc Freq

Creating Macro Calls using Proc Freq Creating Macro Calls using Proc Freq, Educational Testing Service, Princeton, NJ ABSTRACT Imagine you were asked to get a series of statistics/tables for each country in the world. You have the data, but

More information

3N Validation to Validate PROC COMPARE Output

3N Validation to Validate PROC COMPARE Output ABSTRACT Paper 7100-2016 3N Validation to Validate PROC COMPARE Output Amarnath Vijayarangan, Emmes Services Pvt Ltd, India In the clinical research world, data accuracy plays a significant role in delivering

More information