Paper DM07. An Animated Guide: Comparing Files without Proc Compare Alejandro Jaramillo (Data Means Corp) and Russ Lavery contractor

Size: px
Start display at page:

Download "Paper DM07. An Animated Guide: Comparing Files without Proc Compare Alejandro Jaramillo (Data Means Corp) and Russ Lavery contractor"

Transcription

1 Paper DM07 An Animated Guide: Comparing Files without Proc Compare Alejandro Jaramillo (Data Means Corp) and Russ Lavery contractor ABSTRACT This paper provides an easy to understand SAS-based programmed method for comparing files and has several advantages over Proc Compare when the files are expected to be different This method produces a small volume of output and allows easy creation of reports to summarize and analyze the differences This automated analysis of differences helps resolve differences It should be noted that Proc compare is excellent if differences are small This paper discusses comparing files in the context of a system migration, or a change of vendors, where many (hundreds) files from an old vendor/system must be compared to files from a new vendor/system If the programmer is asked to compare hundreds of files and there are issues with the new files- the programming problem becomes embedded in a management problem Some effort must be spent tracking: file checked, files passed/rejected, status of rework of rejected files, communicating results and managing the timeline of the project This paper discusses code for doing the comparisons (it provides fully functioning sample code) and the project structure and task flow DETAIL LEVEL COMPARISON Proc compare is the method of choice for comparing files that are expected to have no, of few, errors If the comparison is expected to produce many failures to match, (thousands of observations fail for a variety of reasons), there is a need to post process information on failures and SAS is an excellent tool for doing this A logic for comparison is explained below Logic ---Left File--- Store Reg Prod LQ1 LQ2 LQ Merged File--- Store Reg Prod LQ1 LQ2 LQ3 1 RQ1 RQ2 RQ Match Match Data Checking Keys and Data gives exact answers key error Key ODD Data Right File--- Store Reg Prod RQ1 RQ2 RQ ODD Key Match lines on keys and use array & loop to compare data values This method checks EVERY value in the file Figure 1 Please look at Figure 1 We are going to be comparing two files (call the two files left and right) and will separate variables in the files into Key variables and data variables Store, region and prod (in red boxes) are key information and identify a unique row in the file These files have three numeric variables: Q1, Q2 and Q3 When we finally do a detailed comparison of the files we will merge the files on key information and use arrays to loop over data elements to check that the data associated matches We can have four states of nature and they are illustrated above Our eventual task is to report on all these

2 conditions: 1) keys and data match see black above stores 001 and 022 2) keys match but data elements differ see blue above-stores 044 (all data are bad) and 311 (Q3 data element is bad) 3) there was a data entry issue with the key and the observation can not be matched In our example, these were actually from the same store 956, but a 056 was keyed instead of a 956 4) a file has an observation that is not in the other file green above-stores 211 and 555 When switching from one vendor to another, the mechanisms the vendors use to collect data can differ and in that case a 100 percent match might not possible Sometime, management must decide on close enough %let Fuzz=00; /*how close is the match*/ data both badleft badright; merge Left (in=l rename=(q1=lq1 q2=lq2 q3=lq3)) right(in=r rename=(q1=rq1 q2=rq2 q3=rq3)); by store reg Prod; if L*R=1 then infile="both "; else if L=1 and R=0 then infile="left "; else if L=0 and R=1 then infile="right"; Array ALsales(*) Lq1-LQ3; Array ARSales(*) RQ1-RQ3; Array ADiff(*) Diff1-Diff3 (0 0 0) ; Left_vs_Right=L*10 +R*1; /*two measures of how much is at risk if obs are not "good"*/ if L then LSales=sum(of LQ1-LQ3); else LSales=0; if R then RSales=sum(of RQ1-RQ3); else RSales=0; if L*R EQ 1 then Total_Sales=sum(Lsales,Rsales)/2; else if L=1 and R=0 then Total_Sales=sum(LSales); else if L=0 and R=1 then Total_Sales=sum(RSales); SumDiff=0; Diff1=0; Diff2=0; Diff3=0;/*measures of size of data mismatch*/ if L*R NE 1 then KeyFlag="Bad ";/*this flags if key good/bad*/ Mismatch="N";/*Lower the data element mismatch flag*/ if L*R then /*if keys match, loop through an array to check the data elements*/ do; do i= 1 to 3;/*do loop-check data elements only if keys match*/ ADiff(i)=0; if ALsales(I) LT ARSales(i)*(1-&fuzz) /*if "close is OK, set fuzz to be GT 0*/ or ALsales(I) GT ARSales(i)*(1+&fuzz) then do; ADiff(i)=ALsales(i)-ARsales(I); if abs(adiff(i)) GT MaxDiff then MaxDiff=abs(Adiff(i));/*biggest difference */ SumDiff+abs(Adiff(i)); /*sum of differences */ mismatch="y"; /*raise the data element mismatch flag */ end; end; end; else mismatch="y"; Absolute value output both; if L=1 and R=0 then Output BadLeft; if L=0 and R=1 then Output BadRight; run; Absolute value use fuzz if exact match not achievable Rename data elements Infile is a flag for observation source If we had char & num data elements, we would need char & num arrays How important is obs? Use Avg if keys match cause data elements might not match /*if keys are bad, raise the data element mismatch flag*/ 2

3 The resulting file BOTH is shown below L e f T t o _ t v a M s l S K i M i _ L R _ u e s a S n D D D R S S S m y m x t P f i i i i a a a D F a D o r r L L L R R R i f f f g l l l i l t i r e o Q Q Q Q Q Q l f f f h e e e f a c f e g d e t s s s f g h i f Both N Both N Both Y Left Bad Y Left Bad Y Both Y Right Bad Y Right Bad Y The file both can be used for creating managerially useful reports as below: proc format ; value ObsFrom 1 =" 1 = obs in Right only" 10="10 Obs in Left only" 11="11 obs in both "; value fflag 0="Yes, Problems with key or data" 1="NO Problems with Key or data"; value $ YandN Y ="Yes: Problems with key or data" N ="NO problems with key or data"; run; Figure 2 shows that there were two observations that were only in the left file and that there were two observations that were only in the right file The keys not matching caused this situation This situation, keys failing to match, is worth some consideration If keys do not match, there is little more to be done at the detail level It is very difficult 3

4 for a QC person to fix, or even do much worthwhile investigation, of bad keys As we can see above, where a 9 in the store ID was coded as a 0 (the number next to it on the keyboard), if keys are miscoded and the data is sorted by keys, observations that should match can be anywhere in the resulting file If keys fail to match, the files likely have different sources (or the same source but the data was pulled from different time periods) Figure 3 shows that there were two observations that matched both keys and data and that there were two observations that matched on keys and had data issues In this example, the percent of bad observations was very high What if the data table were like this pretty good mismatch Left_vs_Right 1= Obs 10= Obs 11= Obs Total Frequency in Right in in both Percent Only LEFT only NO Problems Yes:Data Problems Total Management might want to know the importance of the six observations that were in the problem cells If they were a small enough percentage of total sales, management might accept the file mismatch Left_vs_Right 1= Obs 10= Obs 11= Obs Total Frequency in Right in in both Percent Only LEFT only NO problems with key or data Yes: Problems with key or data Total proc freq data =both; tables mismatch*left_vs_right; format mismatch $ YandN Left_vs_Right ObsFrom ; run; Store Reg Prod LQ1 LQ2 LQ3 R Q1 R Q2 R Q we are comparing data in one file to missing values - there is a data problem we are comparing data in one file to missing values there is a data problem obs in only one file obs in only one file Figure 2 Note that if an observation was classified as bad if 1) keys failed to match or if 2) ANY data of the data elements failed to match If there are many data elements on a row, classifying an observation as bad if one pair of data elements differs by just a small amount can be a harsh judgment As an example, Store 311 was classified as a mismatch because one data element failed to match It is possible this classification is too extreme Management often wants to know how important these bad data elements are in relation to the total sales in the file Management might want to consider more than the percent of bad elements Information about the importance of bad observations can be gotten form a Proc Means and from using weighted frequencies in a Proc Freq Code and output for Proc Freq and Proc Means are shown in Figure 4 The total sales in the whole data set is 2435 and only 2382 percent of the total is in observations that are perfect in keys and 4

5 data If the data sets have mismatches in keys and on data elements and those mismatches make up a small percent of the total sales (or whatever number the file contains) management may decide to accept the file This is the final piece of detail analysis covered in this paper mismatch Left_vs_Right 1= Obs 10= Obs 11= Obs Total Frequency in Right in in both Percent Only LEFT only NO problems with key or data Yes: Problems with key or data Total Ideally, all obs are be here Keys Match, issues w/data Figure 3 Store Reg Prod LQ1 LQ2 LQ3 RQ1 RQ2 RQ Match Match Data Key ODD Data ODD Key Frequency 1 = obs 10 Obs i 11 obs i Total Percent in Righ n Left o n both Col Pct t only nly NO problems with key or data Yes: Problems with key or data Total proc freq data=both; tables mismatch*left_vs_right / missing norow; format mismatch $ YandN Left_vs_Right ObsFrom; weight Total_Sales; proc means data=both n min mean max sum; var maxdiff sumdiff total_sales; total Max Sum Mis Store Reg Prod LQ1 LQ2 LQ3 R Q1 R Q2 R Q3 sales Diff Diff match N N Y 056 Variable N Min 12 Mean 10 8 Max Sum Y MaxDiff SumDiff Y 311 Total_Sales Y Y Y Figure 4 OVERVIEW AND PROJECT MANAGEMENT The SAS code for comparison of files is all one would need if there were only one file that needed to be compared However, if this is part of s system migration or of changing a to a new data supplier, many files might need to be compared In the case of a large project, the one bit of SAS code, shown above, needs to be put in context of it being part of a large project A large project would need an audit train of documents showing that files were proven good when accepted and providing actionable feedback to the vendor when they fail A large project would need inter-departmental co-ordination, reports, tracking of jobs completed, time estimates and to run the above code 5

6 (which can take time to program and to run), only when necessary Starting with the last item- running the code when necessary If the project is one where many files are being delivered that do not pass, labor can be saved by making the running of the code, shown above, the last step in a drill down process Vendors have been known to deliver the wrong file and it saves time if the QC team can just do a quick check and then do as much further work as the quality of the file warrants We first run Proc Contents on each file and paste the results into a word document If the number of obs in one file is ten times the number of obs in the other file, the keys and data comparison that we did above is not warranted If variables are missing or of the wrong type, the comparison of the files might stop at this point and the Proc Contents information sent to the vendor This output also helps decide what variables are to be used as key, what variables are to be numeric data elements and what variables are character data elements While our example only has numeric elements, in real life we need to loop over both numeric and character variables The Proc Summary information, as basic as it is, documents that the proper files were compared The run dates are important if the new file has been delivered many times Having the documentation for a file start with the Proc Contents information below puts to rest any question of the QC being done on improper files or version of the files Data Set Name WORKLEFT Observations 6 Created 01 Dec 2007 Sat 19:03:13 oclock Last Modified 01 Dec 2007 Sat 19:03:13 oclock File Name C:\DOCUME~1\user\LOCALS~1\Temp\SAS Temporary Files\_TD464\leftsas7bdat Alphabetic List of Variables and Attributes # Variable Type Len 3 Prod Char 8 4 Q1 Num 8 5 Q2 Num 8 6 Q3 Num 8 1 Store Char 8 2 reg Char 8 Figure 5 Data Set Name WORKRIGHT Observations 6 Created 01 Dec 2007 Sat 19:03:13 oclock Last Modified 01 Dec 2007 Sat 19:03:13 oclock File Name C:\DOCUME~1\user\LOCALS~1\Temp\SAS Temporary Alphabetic List of Variables and Attributes # Variable Type Len 3 Prod Char 8 4 Q1 Num 8 5 Q2 Num 8 6 Q3 Num 8 1 Store Char 8 2 reg Char 8 The creation of the Proc Summary output is only the first step in process of peeling back the onion on the data and stopping as soon as soon as we learn of major problems That process is shown in Figure 6 below The output from the Contents date and size step is shown in Figure 5, above Any data cleaning that need to be done can be done here This data cleaning has the object of fixing errors and making the files worth comparing 6

7 Process for QC Left File New or Right File Contents: date & size Check for duplicates Check for bad codes R P T Contents: date & size Check for duplicates Check for bad codes Freq - important vars Freq - important vars Clean the file Merge-Calc Diff by important Vars Rpt Clean the file Identify every row with problem Merge- Calc High Level Diffs Rpt electronic copy electronic copy Key Analysis Rpt Problem Analysis Row Problem Analysis sales electronic copy Figure 6 We suggest checking both files for duplicate observations Errors have been found in files that have been in use for years Sometimes the QC of a new data stream cleans up errors in old data streams Here we move to some basic principals of the peeling back of the onion Summary data is easier/faster to read than detail data If the data matches at the detail level, it will match at the summary level If the summaries are off, consider stopping It is NOT true that data that matches at the summary level will match at the detail level- ERRORS cancel If there are important variables, like dates, if is often useful to compute and compare counts by these important variables we have sees situations where both left and right had the same range of dates, but one file was missing several weeks- simply had no obs for a few weeks The totals for the files were close in number of obs and for total sales, but one file was bad A useful way to do this was to do Proc Summaries on both files and merge the summary files This provides a wealth of data at many different levels of summary Code, output, and is shown below proc summary data=left missing; class store reg prod; output out=lefthisummry n()= sum(q:)= /autoname; /*Left has the Sum in the name*/ types () store reg prod Store*reg*prod; proc summary data=right missing; class store reg prod; output out=righthisummry n()= sum(q:)=; /*Right has original vars*/ types () store reg prod Store*reg*prod; 7

8 *merge the file and do a comparison of the Sales info; data hiboth(keep=_type_ vartype FreqDiff store reg prod source salesdiff:) ; source=" "; /*wide enough for *Total* computed in Proc Report*/ merge LeftHiSummry(in=L rename=(_freq_=lfreq)) RightHiSummry(in=R rename=(_freq_=rfreq)); by _type_ store reg prod; if L*R then source="both "; Else if L then source="left " ; Else if R then source="right "; Array LeftSales(3) Q1_Sum Q2_Sum Q3_Sum; Array RightSales(3) Q1 Q2 Q3; Array flagsales(3) SalesFlag1-SalesFlag3; Array SalesDiff(3) SalesDiff1-SalesDiff3; array numvars(*) _numeric_; do i=1 to dim(numvars); if NumVars(i)= then numvars(i)=0; end; Zero fill in obs that are only from one file /* calculate differences tween Left and Calculate right*/ differences for freq and data values FreqDiff=LFreq-RFreq; do i=1 to 3; salesdiff(i)=leftsales(i) - RightSales(i); Knowing the order of subtraction is helpful end; vartype="sales (Left-Right)"; Output; run; options linesize=100 ps=50; proc report data=hiboth spacing=0 missing nowd; columns _type_ store reg prod source vartype FreqDiff salesdiff:; define _type_ /order width=4 "Type"; define store /order width=6 spacing=1; define reg /display width=7 spacing=1 "Region"; define prod /display width=8 spacing=1 "Product/Code"; define source /display width=7 spacing=1 "data/source"; Define vartype /display spacing=1 width=21 ; Define FreqDiff /sum width=5 spacing=1 "Freq/Diff"; Define SalesDiff1 /sum width=5 spacing=4 "Q1 /Diff"; Define SalesDiff2 /sum width=5 spacing=1 "Q2 /Diff"; Define SalesDiff3 /sum width=5 spacing=1 "Q3 /Diff"; Break after _type_ / suppress summarize ol ul skip; compute after _type_ ; source="*total*"; endcomp; run; 8

9 Product data Freq Q1 Q2 Q3 Type Store Region Code Source vartype Diff Diff Diff Diff 0 Both Sales (Left-Right) *Total* Both Sales (Left-Right) Right Sales (Left-Right) Left Sales (Left-Right) Both Sales (Left-Right) *Total* Right Sales (Left-Right) Both Sales (Left-Right) Both Sales (Left-Right) *Total* Both Sales (Left-Right) Both Sales (Left-Right) Both Sales (Left-Right) Left Sales (Left-Right) Left Sales (Left-Right) Both Sales (Left-Right) Right Sales (Left-Right) Right Sales (Left-Right) *Total* Both Sales (Left-Right) Both Sales (Left-Right) Both Sales (Left-Right) Left Sales (Left-Right) Left Sales (Left-Right) Both Sales (Left-Right) Right Sales (Left-Right) Right Sales (Left-Right) *Total* Figure 7 The red box sows the danger of just checking summary totals for the files Positive and negative errors will cancel out and make the overall numbers look good This canceling happens in real life and not just in made up exercises Totals can look very good and sub-totals or details can look bad The output shown in Figure 7 becomes part of the documentation for the project The QC documentation for a file might be: Proc Summary output - shown in Figure 5 High-level comparisons shown in Figure 7 Detail Level comparison summaries shown in Figures 3 and 4 9

10 QC PROCESS FLOWCHART AND TIMELINES The final section is a short discussion of the time line for QC-ing a file Management sometimes does not understand the process and has unreasonable expectations for timelines The fact is that QC-ing a file is not a time consuming process, in any case Often a programmer can program ahead and run multiple sessions of SAS and be QC-ing several files at the same time If QC- programs are run interactively, output from one section can be assembled into a report while the next section of code is running This quasi-quick process IF THE FILES ARE GOOD If the files are bad, the QC-ing step is still quick, but the approval of the file process becomes much slower as the process starts to loop back on itself If the files are identical, the Green box meeting is very quick and the orange box can be eliminated If the files are close or bad, the meeting can take quite a while If the files are judged to be close the pink box must happen The comparison of standard reports to see if the differences between the files is visible to report users, can take a while Finally, if the files are bad, the green meeting can take quite a long time When the files are bad, the reports shown in this paper have been able to provide good initial direction as the cause of the problem However, when the files are bad, the tasks in the pink box must happen and these tasks can take quite a while Finding discrepancies is an easy task, fixing the discrepancies can be very difficult and requires commitment Timeline Write programs, for series of files, in anticipation of file delivery A batch of files, to be compared, is delivered The QC Process Run QC Programs on the batch of files (using multiple SAS sessions if required/possible) Assemble report on the batch of files (often concurrent w/ run) QC Programming Reviews and annotates report (1 day) Arrange meeting with group producing the series of files (1 week) Discuss report w/ group producing files and create action items (1 day) File is OK or Close Figure 8 File(s) FAIL Investigate / fix action items (1 week) F A I L If files are close, users of files run old reports with new files and see if results look odd (1 week) Create new version of files (2 weeks) File Passes log file as Done CONCLUSION Proc compare provides a quick and easy way to compare files that are well known and expected to have small differences or not differences at all When data sources and process change use of a systematic approach as the outlined in this paper to compare data at the top and record level provides an efficient mechanism to track progress, identify and resolve potential problems CONTACT INFORMATION Your comments and questions are valued and encouraged Contact the authors at: Alejandro Jaramillo, Data Means Corp, ajaramillo@datameanscom, wwwdatameanscom Russ Lavery, Independent Consultant for Numeric Resources, RussLavery@verizonnet SAS and all other SAS Institute Inc product or service names are registered trademarks or trademarks of SAS Institute Inc in the USA and other countries indicates USA registration Other brand and product names are trademarks of their respective companies 10

An Animated Guide: Proc Transpose

An Animated Guide: Proc Transpose ABSTRACT An Animated Guide: Proc Transpose Russell Lavery, Independent Consultant If one can think about a SAS data set as being made up of columns and rows one can say Proc Transpose flips the columns

More information

Uncommon Techniques for Common Variables

Uncommon Techniques for Common Variables Paper 11863-2016 Uncommon Techniques for Common Variables Christopher J. Bost, MDRC, New York, NY ABSTRACT If a variable occurs in more than one data set being merged, the last value (from the variable

More information

SC-15. An Annotated Guide: Using Proc Tabulate And Proc Summary to Validate SAS Code Russ Lavery, Contractor, Ardmore, PA

SC-15. An Annotated Guide: Using Proc Tabulate And Proc Summary to Validate SAS Code Russ Lavery, Contractor, Ardmore, PA SC-15 An Annotated Guide: Using Proc Tabulate And Proc Summary to Validate SAS Code Russ Lavery, Contractor, Ardmore, PA ABSTRACT This paper discusses how Proc Tabulate and Proc Summary can be used to

More information

An Efficient Tool for Clinical Data Check

An Efficient Tool for Clinical Data Check PharmaSUG 2018 - Paper AD-16 An Efficient Tool for Clinical Data Check Chao Su, Merck & Co., Inc., Rahway, NJ Shunbing Zhao, Merck & Co., Inc., Rahway, NJ Cynthia He, Merck & Co., Inc., Rahway, NJ ABSTRACT

More information

10 The First Steps 4 Chapter 2

10 The First Steps 4 Chapter 2 9 CHAPTER 2 Examples The First Steps 10 Invoking the Query Window 11 Changing Your Profile 11 ing a Table 13 ing Columns 14 Alias Names and Labels 14 Column Format 16 Creating a WHERE Expression 17 Available

More information

Automating Preliminary Data Cleaning in SAS

Automating Preliminary Data Cleaning in SAS Paper PO63 Automating Preliminary Data Cleaning in SAS Alec Zhixiao Lin, Loan Depot, Foothill Ranch, CA ABSTRACT Preliminary data cleaning or scrubbing tries to delete the following types of variables

More information

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ

Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ PharmaSUG 2015 - Paper QT41 Automated Checking Of Multiple Files Kathyayini Tappeta, Percept Pharma Services, Bridgewater, NJ ABSTRACT Most often clinical trial data analysis has tight deadlines with very

More information

CC13 An Automatic Process to Compare Files. Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ

CC13 An Automatic Process to Compare Files. Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ CC13 An Automatic Process to Compare Files Simon Lin, Merck & Co., Inc., Rahway, NJ Huei-Ling Chen, Merck & Co., Inc., Rahway, NJ ABSTRACT Comparing different versions of output files is often performed

More information

Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA

Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA Using PROC REPORT to Cross-Tabulate Multiple Response Items Patrick Thornton, SRI International, Menlo Park, CA ABSTRACT This paper describes for an intermediate SAS user the use of PROC REPORT to create

More information

An Introduction to PROC REPORT

An Introduction to PROC REPORT Paper BB-276 An Introduction to PROC REPORT Kirk Paul Lafler, Software Intelligence Corporation, Spring Valley, California Abstract SAS users often need to create and deliver quality custom reports and

More information

There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA

There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA Paper HW04 There s No Such Thing as Normal Clinical Trials Data, or Is There? Daphne Ewing, Octagon Research Solutions, Inc., Wayne, PA ABSTRACT Clinical Trials data comes in all shapes and sizes depending

More information

Features and Benefits

Features and Benefits AutoCAD 2005 Features and s AutoCAD 2005 software provides powerhouse productivity tools that help you create single drawings as productively as possible, as well as new features for the efficient creation,

More information

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex

Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex Basic Topics: Formulas, LookUp Tables and PivotTables Prepared for Aero Controlex Review ribbon terminology such as tabs, groups and commands Navigate a worksheet, workbook, and multiple workbooks Prepare

More information

Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA

Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA ABSTRACT Ditch the Data Memo: Using Macro Variables and Outer Union Corresponding in PROC SQL to Create Data Set Summary Tables Andrea Shane MDRC, Oakland, CA Data set documentation is essential to good

More information

PharmaSUG Paper PO12

PharmaSUG Paper PO12 PharmaSUG 2015 - Paper PO12 ABSTRACT Utilizing SAS for Cross-Report Verification in a Clinical Trials Setting Daniel Szydlo, Fred Hutchinson Cancer Research Center, Seattle, WA Iraj Mohebalian, Fred Hutchinson

More information

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 Note Before using this information and the product that it

More information

Combining Contiguous Events and Calculating Duration in Kaplan-Meier Analysis Using a Single Data Step

Combining Contiguous Events and Calculating Duration in Kaplan-Meier Analysis Using a Single Data Step Combining Contiguous Events and Calculating Duration in Kaplan-Meier Analysis Using a Single Data Step Hui Song, PRA International, Horsham, PA George Laskaris, PRA International, Horsham, PA ABSTRACT

More information

Microsoft Excel 2010 Handout

Microsoft Excel 2010 Handout Microsoft Excel 2010 Handout Excel is an electronic spreadsheet program you can use to enter and organize data, and perform a wide variety of number crunching tasks. Excel helps you organize and track

More information

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions Introduction to SAS Procedures SAS Basics III Susan J. Slaughter, Avocet Solutions DATA versus PROC steps Two basic parts of SAS programs DATA step PROC step Begin with DATA statement Begin with PROC statement

More information

A Side of Hash for You To Dig Into

A Side of Hash for You To Dig Into A Side of Hash for You To Dig Into Shan Ali Rasul, Indigo Books & Music Inc, Toronto, Ontario, Canada. ABSTRACT Within the realm of Customer Relationship Management (CRM) there is always a need for segmenting

More information

OC RDC HTML User Guide

OC RDC HTML User Guide CRA - Monitor OC RDC 4.5.3 HTML User Guide Page 1 of 46 TABLE OF CONTENTS Accessing OC RDC Steps for Access Logging On Change Password Computer and System Security Study and Site 3 4 5 5 6 Navigating OC

More information

Administering OLAP with SAS/Warehouse Administrator(TM)

Administering OLAP with SAS/Warehouse Administrator(TM) Paper 123 Administering OLAP with SAS/Warehouse Administrator(TM) Abstract: By Michael Burns, SAS Institute Inc., Austin, TX. When building an OLAP application, there are a wide variety of strategies that

More information

Using PROC SQL to Generate Shift Tables More Efficiently

Using PROC SQL to Generate Shift Tables More Efficiently ABSTRACT SESUG Paper 218-2018 Using PROC SQL to Generate Shift Tables More Efficiently Jenna Cody, IQVIA Shift tables display the change in the frequency of subjects across specified categories from baseline

More information

Excel Tips for Compensation Practitioners Weeks Text Formulae

Excel Tips for Compensation Practitioners Weeks Text Formulae Excel Tips for Compensation Practitioners Weeks 70-73 Text Formulae Week 70 Using Left, Mid and Right Formulae When analysing compensation data, you generally extract data from the payroll, the HR system,

More information

Defining Test Data Using Population Analysis Clarence Wm. Jackson, CQA - City of Dallas CIS

Defining Test Data Using Population Analysis Clarence Wm. Jackson, CQA - City of Dallas CIS Defining Test Data Using Population Analysis Clarence Wm. Jackson, CQA - City of Dallas CIS Abstract Defining test data that provides complete test case coverage requires the tester to accumulate data

More information

Beginning Tutorials. PROC FSEDIT NEW=newfilename LIKE=oldfilename; Fig. 4 - Specifying a WHERE Clause in FSEDIT. Data Editing

Beginning Tutorials. PROC FSEDIT NEW=newfilename LIKE=oldfilename; Fig. 4 - Specifying a WHERE Clause in FSEDIT. Data Editing Mouse Clicking Your Way Viewing and Manipulating Data with Version 8 of the SAS System Terry Fain, RAND, Santa Monica, California Cyndie Gareleck, RAND, Santa Monica, California ABSTRACT Version 8 of the

More information

Utilizing SAS for Cross- Report Verification in a Clinical Trials Setting

Utilizing SAS for Cross- Report Verification in a Clinical Trials Setting Utilizing SAS for Cross- Report Verification in a Clinical Trials Setting Daniel Szydlo, SCHARP/Fred Hutch, Seattle, WA Iraj Mohebalian, SCHARP/Fred Hutch, Seattle, WA Marla Husnik, SCHARP/Fred Hutch,

More information

Easing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide

Easing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide Paper 809-2017 Easing into Data Exploration, Reporting, and Analytics Using SAS Enterprise Guide ABSTRACT Marje Fecht, Prowerk Consulting Whether you have been programming in SAS for years, are new to

More information

Investigator Site OC RDC PDF User Guide

Investigator Site OC RDC PDF User Guide Investigator Site OC RDC PDF User Guide Version 1.0 Page 1 of 40 TABLE OF CONTENTS Accessing OC RDC Steps for Access 3 Logging On 4 Change Password 4 Laptop and System Security 5 Change Study 5 Navigating

More information

LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC

LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC ABSTRACT PharmaSUG 2013 - Paper PO01 LST in Comparison Sanket Kale, Parexel International Inc., Durham, NC Sajin Johnny, Parexel International Inc., Durham, NC The need for producing error free programming

More information

Compute Blocks in Report

Compute Blocks in Report Compute Blocks in Report Compute Blocks Though it is always possible to compute new variables inside a data step, PROC REPORT allows for similar computations to be done internally as well. Computations

More information

David Beam, Systems Seminar Consultants, Inc., Madison, WI

David Beam, Systems Seminar Consultants, Inc., Madison, WI Paper 150-26 INTRODUCTION TO PROC SQL David Beam, Systems Seminar Consultants, Inc., Madison, WI ABSTRACT PROC SQL is a powerful Base SAS Procedure that combines the functionality of DATA and PROC steps

More information

Paper TT17 An Animated Guide : Speed Merges with Key Merging and the _IORC_ Variable Russ Lavery Contractor for Numeric resources, Inc.

Paper TT17 An Animated Guide : Speed Merges with Key Merging and the _IORC_ Variable Russ Lavery Contractor for Numeric resources, Inc. Paper TT7 An Animated Guide : Speed Merges with Key Merging and the _IORC_ Variable Russ Lavery Contractor for Numeric resources, Inc. ABSTRACT The key mege (A.K.A. _IORC_ merge) is an efficiency technique.

More information

It s Proc Tabulate Jim, but not as we know it!

It s Proc Tabulate Jim, but not as we know it! Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on

More information

Data Quality Review for Missing Values and Outliers

Data Quality Review for Missing Values and Outliers Paper number: PH03 Data Quality Review for Missing Values and Outliers Ying Guo, i3, Indianapolis, IN Bradford J. Danner, i3, Lincoln, NE ABSTRACT Before performing any analysis on a dataset, it is often

More information

APPENDIX 4 Migrating from QMF to SAS/ ASSIST Software. Each of these steps can be executed independently.

APPENDIX 4 Migrating from QMF to SAS/ ASSIST Software. Each of these steps can be executed independently. 255 APPENDIX 4 Migrating from QMF to SAS/ ASSIST Software Introduction 255 Generating a QMF Export Procedure 255 Exporting Queries from QMF 257 Importing QMF Queries into Query and Reporting 257 Alternate

More information

1. Introduction to Microsoft Excel

1. Introduction to Microsoft Excel 1. Introduction to Microsoft Excel A spreadsheet is an online version of an accountant's worksheet, which can automatically do most of the calculating for you. You can do budgets, analyze data, or generate

More information

Paper CC16. William E Benjamin Jr, Owl Computer Consultancy LLC, Phoenix, AZ

Paper CC16. William E Benjamin Jr, Owl Computer Consultancy LLC, Phoenix, AZ Paper CC16 Smoke and Mirrors!!! Come See How the _INFILE_ Automatic Variable and SHAREBUFFERS Infile Option Can Speed Up Your Flat File Text-Processing Throughput Speed William E Benjamin Jr, Owl Computer

More information

footnote1 height=8pt j=l "(Rev. &sysdate)" j=c "{\b\ Page}{\field{\*\fldinst {\b\i PAGE}}}";

footnote1 height=8pt j=l (Rev. &sysdate) j=c {\b\ Page}{\field{\*\fldinst {\b\i PAGE}}}; Producing an Automated Data Dictionary as an RTF File (or a Topic to Bring Up at a Party If You Want to Be Left Alone) Cyndi Williamson, SRI International, Menlo Park, CA ABSTRACT Data dictionaries are

More information

Stat 302 Statistical Software and Its Applications SAS: Data I/O

Stat 302 Statistical Software and Its Applications SAS: Data I/O Stat 302 Statistical Software and Its Applications SAS: Data I/O Yen-Chi Chen Department of Statistics, University of Washington Autumn 2016 1 / 33 Getting Data Files Get the following data sets from the

More information

Microsoft Access XP (2002) - Advanced Queries

Microsoft Access XP (2002) - Advanced Queries Microsoft Access XP (2002) - Advanced Queries Group/Summary Operations Change Join Properties Not Equal Query Parameter Queries Working with Text IIF Queries Expression Builder Backing up Tables Action

More information

QUICK EXCEL TUTORIAL. The Very Basics

QUICK EXCEL TUTORIAL. The Very Basics QUICK EXCEL TUTORIAL The Very Basics You Are Here. Titles & Column Headers Merging Cells Text Alignment When we work on spread sheets we often need to have a title and/or header clearly visible. Merge

More information

Paper # Jazz it up a Little with Formats. Brian Bee, The Knowledge Warehouse Ltd

Paper # Jazz it up a Little with Formats. Brian Bee, The Knowledge Warehouse Ltd Paper #1495-2014 Jazz it up a Little with Formats Brian Bee, The Knowledge Warehouse Ltd Abstract Formats are an often under-valued tool in the SAS toolbox. They can be used in just about all domains to

More information

An Application of PROC NLP to Survey Sample Weighting

An Application of PROC NLP to Survey Sample Weighting An Application of PROC NLP to Survey Sample Weighting Talbot Michael Katz, Analytic Data Information Technologies, New York, NY ABSTRACT The classic weighting formula for survey respondents compensates

More information

Know What You Are Missing: How to Catalogue and Manage Missing Pieces of Historical Data

Know What You Are Missing: How to Catalogue and Manage Missing Pieces of Historical Data Know What You Are Missing: How to Catalogue and Manage Missing Pieces of Historical Data Shankar Yaddanapudi, SAS Consultant, Washington DC ABSTRACT In certain applications it is necessary to maintain

More information

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint

Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint PharmaSUG 2018 - Paper DV-01 Square Peg, Square Hole Getting Tables to Fit on Slides in the ODS Destination for PowerPoint Jane Eslinger, SAS Institute Inc. ABSTRACT An output table is a square. A slide

More information

3N Validation to Validate PROC COMPARE Output

3N Validation to Validate PROC COMPARE Output ABSTRACT Paper 7100-2016 3N Validation to Validate PROC COMPARE Output Amarnath Vijayarangan, Emmes Services Pvt Ltd, India In the clinical research world, data accuracy plays a significant role in delivering

More information

ABSTRACT INTRODUCTION TRICK 1: CHOOSE THE BEST METHOD TO CREATE MACRO VARIABLES

ABSTRACT INTRODUCTION TRICK 1: CHOOSE THE BEST METHOD TO CREATE MACRO VARIABLES An Efficient Method to Create a Large and Comprehensive Codebook Wen Song, ICF International, Calverton, MD Kamya Khanna, ICF International, Calverton, MD Baibai Chen, ICF International, Calverton, MD

More information

Part 1. Getting Started. Chapter 1 Creating a Simple Report 3. Chapter 2 PROC REPORT: An Introduction 13. Chapter 3 Creating Breaks 57

Part 1. Getting Started. Chapter 1 Creating a Simple Report 3. Chapter 2 PROC REPORT: An Introduction 13. Chapter 3 Creating Breaks 57 Part 1 Getting Started Chapter 1 Creating a Simple Report 3 Chapter 2 PROC REPORT: An Introduction 13 Chapter 3 Creating Breaks 57 Chapter 4 Only in the LISTING Destination 75 Chapter 5 Creating and Modifying

More information

Basic Concepts #6: Introduction to Report Writing

Basic Concepts #6: Introduction to Report Writing Basic Concepts #6: Introduction to Report Writing Using By-line, PROC Report, PROC Means, PROC Freq JC Wang By-Group Processing By-group processing in a procedure step, a BY line identifies each group

More information

Digital Intelligence Systems, LLC PeopleSoft Guide Vendors

Digital Intelligence Systems, LLC PeopleSoft Guide Vendors Digital Intelligence Systems, LLC PeopleSoft Guide Vendors Version 1.0 July 2016 CONTENTS INTRODUCTION... 3 1.1 Change Password... 3 PROFILE INFORMATION... 5 2.1 Identifying Information... 6 2.2 Address...

More information

Supplier SAP SNC User Guide

Supplier SAP SNC User Guide Supplier SAP SNC User Guide Version 1.0 July 29, 2014 AGCO Corporation Page 1 1 Introduction AGCO has chosen SAP Supplier Network Collaboration (SNC) to improve visibility and capability in North America

More information

STATISTICAL TECHNIQUES. Interpreting Basic Statistical Values

STATISTICAL TECHNIQUES. Interpreting Basic Statistical Values STATISTICAL TECHNIQUES Interpreting Basic Statistical Values INTERPRETING BASIC STATISTICAL VALUES Sample representative How would one represent the average or typical piece of information from a given

More information

Facilitate Statistical Analysis with Automatic Collapsing of Small Size Strata

Facilitate Statistical Analysis with Automatic Collapsing of Small Size Strata PO23 Facilitate Statistical Analysis with Automatic Collapsing of Small Size Strata Sunil Gupta, Linfeng Xu, Quintiles, Inc., Thousand Oaks, CA ABSTRACT Often in clinical studies, even after great efforts

More information

New Vs. Old Under the Hood with Procs CONTENTS and COMPARE Patricia Hettinger, SAS Professional, Oakbrook Terrace, IL

New Vs. Old Under the Hood with Procs CONTENTS and COMPARE Patricia Hettinger, SAS Professional, Oakbrook Terrace, IL Paper SS-03 New Vs. Old Under the Hood with Procs CONTENTS and COMPARE Patricia Hettinger, SAS Professional, Oakbrook Terrace, IL ABSTRACT There s SuperCE for comparing text files on the mainframe. Diff

More information

Producing Summary Tables in SAS Enterprise Guide

Producing Summary Tables in SAS Enterprise Guide Producing Summary Tables in SAS Enterprise Guide Lora D. Delwiche, University of California, Davis, CA Susan J. Slaughter, Avocet Solutions, Davis, CA ABSTRACT This paper shows, step-by-step, how to use

More information

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions

Introduction to SAS Procedures SAS Basics III. Susan J. Slaughter, Avocet Solutions Introduction to SAS Procedures SAS Basics III Susan J. Slaughter, Avocet Solutions SAS Essentials Section for people new to SAS Core presentations 1. How SAS Thinks 2. Introduction to DATA Step Programming

More information

DataMaster for Windows

DataMaster for Windows DataMaster for Windows Version 3.0 April 2004 Mid America Computer Corp. 111 Admiral Drive Blair, NE 68008-0700 (402) 426-6222 Copyright 2003-2004 Mid America Computer Corp. All rights reserved. Table

More information

CRA OC RDC Classic User Guide

CRA OC RDC Classic User Guide CRA OC RDC Classic User Guide Version 1.0 Page 1 of 37 TABLE OF CONTENTS Accessing OC RDC Steps for Access 3 Logging On 3 Change Password 5 Change Study 5 Laptop and System Security 6 Navigating OC RDC

More information

A Guided Tour Through the SAS Windowing Environment Casey Cantrell, Clarion Consulting, Los Angeles, CA

A Guided Tour Through the SAS Windowing Environment Casey Cantrell, Clarion Consulting, Los Angeles, CA A Guided Tour Through the SAS Windowing Environment Casey Cantrell, Clarion Consulting, Los Angeles, CA ABSTRACT The SAS system running in the Microsoft Windows environment contains a multitude of tools

More information

All Excel Topics Page 1 of 11

All Excel Topics Page 1 of 11 All Excel Topics Page 1 of 11 All Excel Topics All of the Excel topics covered during training are listed below. Pick relevant topics and tailor a course to meet your needs. Select a topic to find out

More information

PharmaSUG Paper IB11

PharmaSUG Paper IB11 PharmaSUG 2015 - Paper IB11 Proc Compare: Wonderful Procedure! Anusuiya Ghanghas, inventiv International Pharma Services Pvt Ltd, Pune, India Rajinder Kumar, inventiv International Pharma Services Pvt

More information

EXCEL ADVANCED Linda Muchow

EXCEL ADVANCED Linda Muchow EXCEL ADVANCED 2016 Alexandria Technical and Community College Customized Training Technology Specialist 1601 Jefferson Street, Alexandria, MN 56308 320-762-4539 Linda Muchow lindac@alextech.edu 1 Table

More information

Automate Clinical Trial Data Issue Checking and Tracking

Automate Clinical Trial Data Issue Checking and Tracking PharmaSUG 2018 - Paper AD-31 ABSTRACT Automate Clinical Trial Data Issue Checking and Tracking Dale LeSueur and Krishna Avula, Regeneron Pharmaceuticals Inc. Well organized and properly cleaned data are

More information

Visual Streamline FAQ

Visual Streamline FAQ Program Overview: Visual Streamline FAQ How does the program Map Import, located in: Inventory > Global Changes, work? This program enables users the flexibility to use their own excel spreadsheet, and

More information

Siebel Project and Resource Management Administration Guide. Siebel Innovation Pack 2013 Version 8.1/8.2 September 2013

Siebel Project and Resource Management Administration Guide. Siebel Innovation Pack 2013 Version 8.1/8.2 September 2013 Siebel Project and Resource Management Administration Guide Siebel Innovation Pack 2013 Version 8.1/ September 2013 Copyright 2005, 2013 Oracle and/or its affiliates. All rights reserved. This software

More information

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development

Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development PharmaSUG 2018 - Paper DS-15 Doctor's Prescription to Re-engineer Process of Pinnacle 21 Community Version Friendly ADaM Development Aakar Shah, Pfizer Inc; Tracy Sherman, Ephicacy Consulting Group, Inc.

More information

Best Practice for Creation and Maintenance of a SAS Infrastructure

Best Practice for Creation and Maintenance of a SAS Infrastructure Paper 2501-2015 Best Practice for Creation and Maintenance of a SAS Infrastructure Paul Thomas, ASUP Ltd. ABSTRACT The advantage of using metadata to control and maintain data and access to data on databases,

More information

SUGI 29 Data Warehousing, Management and Quality

SUGI 29 Data Warehousing, Management and Quality Building a Purchasing Data Warehouse for SRM from Disparate Procurement Systems Zeph Stemle, Qualex Consulting Services, Inc., Union, KY ABSTRACT SAS Supplier Relationship Management (SRM) solution offers

More information

Cleaning Duplicate Observations on a Chessboard of Missing Values Mayrita Vitvitska, ClinOps, LLC, San Francisco, CA

Cleaning Duplicate Observations on a Chessboard of Missing Values Mayrita Vitvitska, ClinOps, LLC, San Francisco, CA Cleaning Duplicate Observations on a Chessboard of Missing Values Mayrita Vitvitska, ClinOps, LLC, San Francisco, CA ABSTRACT Removing duplicate observations from a data set is not as easy as it might

More information

Using SAS/SCL to Create Flexible Programs... A Super-Sized Macro Ellen Michaliszyn, College of American Pathologists, Northfield, IL

Using SAS/SCL to Create Flexible Programs... A Super-Sized Macro Ellen Michaliszyn, College of American Pathologists, Northfield, IL Using SAS/SCL to Create Flexible Programs... A Super-Sized Macro Ellen Michaliszyn, College of American Pathologists, Northfield, IL ABSTRACT SAS is a powerful programming language. When you find yourself

More information

BI-09 Using Enterprise Guide Effectively Tom Miron, Systems Seminar Consultants, Madison, WI

BI-09 Using Enterprise Guide Effectively Tom Miron, Systems Seminar Consultants, Madison, WI Paper BI09-2012 BI-09 Using Enterprise Guide Effectively Tom Miron, Systems Seminar Consultants, Madison, WI ABSTRACT Enterprise Guide is not just a fancy program editor! EG offers a whole new window onto

More information

Document Imaging User Guide

Document Imaging User Guide Release 4.9 IMAGING TECHNOLOGY GROUP Document Imaging Systems Document Imaging User Guide IMAGING TECHNOLOGY GROUP IMIGIT tm Document Imaging User Guide Release 4.91 March 2007 Imaging Technology Group

More information

User Guide -Version 3.00

User Guide -Version 3.00 Software for production planning User Guide -Version 3.00 Summary SUMMARY... 1 INTRODUCTION... 3 General...3 The keyboard...3 Structure of a window...4 Units of measurement and program limits...4 OPERATION...

More information

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc.

Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc. PharmaSUG2011 - Paper DM03 Data Edit-checks Integration using ODS Tagset Niraj J. Pandya, Element Technologies Inc., NJ Vinodh Paida, Impressive Systems Inc., TX ABSTRACT In the Clinical trials data analysis

More information

The Proc Transpose Cookbook

The Proc Transpose Cookbook ABSTRACT PharmaSUG 2017 - Paper TT13 The Proc Transpose Cookbook Douglas Zirbel, Wells Fargo and Co. Proc TRANSPOSE rearranges columns and rows of SAS datasets, but its documentation and behavior can be

More information

Quality Gates User guide

Quality Gates User guide Quality Gates 3.3.5 User guide 06/2013 1 Table of Content 1 - Introduction... 4 2 - Navigation... 5 2.1 Navigation tool bar... 5 2.2 Navigation tree... 5 2.3 Folder Tree... 6 2.4 Test history... 7 3 -

More information

ABSTRACT INTRODUCTION SIMPLE COMPOSITE VARIABLE REVIEW SESUG Paper IT-06

ABSTRACT INTRODUCTION SIMPLE COMPOSITE VARIABLE REVIEW SESUG Paper IT-06 SESUG 2012 Paper IT-06 Review That You Can Do: A Guide for Systematic Review of Complex Data Lesa Caves, RTI International, Durham, NC Nicole Williams, RTI International, Durham, NC ABSTRACT Quality control

More information

So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines

So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines Paper TT13 So Much Data, So Little Time: Splitting Datasets For More Efficient Run Times and Meeting FDA Submission Guidelines Anthony Harris, PPD, Wilmington, NC Robby Diseker, PPD, Wilmington, NC ABSTRACT

More information

Hyperion Essbase Audit Logs Turning Off Without Notification

Hyperion Essbase Audit Logs Turning Off Without Notification Hyperion Essbase Audit Logs Turning Off Without Notification Audit logs, or SSAUDIT, are a crucial component of backing up Hyperion Essbase applications in many environments. It is the equivalent of a

More information

Localize Your Custom JMP Applications For International Users Michael Hecht, JMP Principal Systems Developer, SAS

Localize Your Custom JMP Applications For International Users Michael Hecht, JMP Principal Systems Developer, SAS Localize Your Custom JMP Applications For International Users Michael Hecht, JMP Principal Systems Developer, SAS Abstract JSL, JMP s powerful scripting language, can be used to extend JMP s features in

More information

Microsoft Excel Basics Ben Johnson

Microsoft Excel Basics Ben Johnson Microsoft Excel Basics Ben Johnson Topic...page # Basics...1 Workbook and worksheets...1 Sizing columns and rows...2 Auto Fill...2 Sort...2 Formatting Cells...3 Formulas...3 Percentage Button...4 Sum function...4

More information

Monitor Qlik Sense sites. Qlik Sense Copyright QlikTech International AB. All rights reserved.

Monitor Qlik Sense sites. Qlik Sense Copyright QlikTech International AB. All rights reserved. Monitor Qlik Sense sites Qlik Sense 2.1.2 Copyright 1993-2015 QlikTech International AB. All rights reserved. Copyright 1993-2015 QlikTech International AB. All rights reserved. Qlik, QlikTech, Qlik Sense,

More information

Getting it Done with PROC TABULATE

Getting it Done with PROC TABULATE ABSTRACT Getting it Done with PROC TABULATE Michael J. Williams, ICON Clinical Research, San Francisco, CA The task of displaying statistical summaries of different types of variables in a single table

More information

Getting Up to Speed with PROC REPORT Kimberly LeBouton, K.J.L. Computing, Rossmoor, CA

Getting Up to Speed with PROC REPORT Kimberly LeBouton, K.J.L. Computing, Rossmoor, CA SESUG 2012 Paper HW-01 Getting Up to Speed with PROC REPORT Kimberly LeBouton, K.J.L. Computing, Rossmoor, CA ABSTRACT Learning the basics of PROC REPORT can help the new SAS user avoid hours of headaches.

More information

Summarizing Impossibly Large SAS Data Sets For the Data Warehouse Server Using Horizontal Summarization

Summarizing Impossibly Large SAS Data Sets For the Data Warehouse Server Using Horizontal Summarization Summarizing Impossibly Large SAS Data Sets For the Data Warehouse Server Using Horizontal Summarization Michael A. Raithel, Raithel Consulting Services Abstract Data warehouse applications thrive on pre-summarized

More information

How a Code-Checking Algorithm Can Prevent Errors

How a Code-Checking Algorithm Can Prevent Errors How a Code-Checking Algorithm Can Prevent Errors SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries.

More information

Paper BI SAS Enterprise Guide System Design. Jennifer First-Kluge and Steven First, Systems Seminar Consultants, Inc.

Paper BI SAS Enterprise Guide System Design. Jennifer First-Kluge and Steven First, Systems Seminar Consultants, Inc. ABSTRACT Paper BI-10-2015 SAS Enterprise Guide System Design Jennifer First-Kluge and Steven First, Systems Seminar Consultants, Inc. A good system should embody the following characteristics: It is planned,

More information

Making Visio Diagrams Come Alive with Data

Making Visio Diagrams Come Alive with Data Making Visio Diagrams Come Alive with Data An Information Commons Workshop Making Visio Diagrams Come Alive with Data Page Workshop Why Add Data to A Diagram? Here are comparisons of a flow chart with

More information

Checking for Duplicates Wendi L. Wright

Checking for Duplicates Wendi L. Wright Checking for Duplicates Wendi L. Wright ABSTRACT This introductory level paper demonstrates a quick way to find duplicates in a dataset (with both simple and complex keys). It discusses what to do when

More information

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components

SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components PharmaSUG 2017 - Paper AD19 SAS Application to Automate a Comprehensive Review of DEFINE and All of its Components Walter Hufford, Vincent Guo, and Mijun Hu, Novartis Pharmaceuticals Corporation ABSTRACT

More information

BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc.

BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc. ABSTRACT BY S NOTSORTED OPTION Karuna Samudral, Octagon Research Solutions, Inc., Wayne, PA Gregory M. Giddings, Centocor R&D Inc., Malvern, PA What if the usual sort and usual group processing would eliminate

More information

Ten Great Reasons to Learn SAS Software's SQL Procedure

Ten Great Reasons to Learn SAS Software's SQL Procedure Ten Great Reasons to Learn SAS Software's SQL Procedure Kirk Paul Lafler, Software Intelligence Corporation ABSTRACT The SQL Procedure has so many great features for both end-users and programmers. It's

More information

Getting Started with. Office 2008

Getting Started with. Office 2008 Getting Started with Office 2008 Copyright 2010 - Information Technology Services Kennesaw State University This document may be downloaded, printed, or copied, for educational use, without further permission

More information

OLAP Drill-through Table Considerations

OLAP Drill-through Table Considerations Paper 023-2014 OLAP Drill-through Table Considerations M. Michelle Buchecker, SAS Institute, Inc. ABSTRACT When creating an OLAP cube, you have the option of specifying a drill-through table, also known

More information

Taking Attendance with a Spreadsheet

Taking Attendance with a Spreadsheet Appendix A Taking Attendance with a Spreadsheet In this chapter, we will learn the following to World Class standards: Designing an Attendance Spreadsheet Opening Microsoft Excel Build the Spreadsheet

More information

DREAMFACTORY SOFTWARE INC. Snapshot User Guide. Product Usage and Best Practices Guide. By Sathyamoorthy Sridhar June 25, 2012

DREAMFACTORY SOFTWARE INC. Snapshot User Guide. Product Usage and Best Practices Guide. By Sathyamoorthy Sridhar June 25, 2012 DREAMFACTORY SOFTWARE INC Snapshot User Guide Product Usage and Best Practices Guide By Sathyamoorthy Sridhar June 25, 2012 This document describes Snapshot s features and provides the reader with notes

More information

SESUG 2014 IT-82 SAS-Enterprise Guide for Institutional Research and Other Data Scientists Claudia W. McCann, East Carolina University.

SESUG 2014 IT-82 SAS-Enterprise Guide for Institutional Research and Other Data Scientists Claudia W. McCann, East Carolina University. Abstract Data requests can range from on-the-fly, need it yesterday, to extended projects taking several weeks or months to complete. Often institutional researchers and other data scientists are juggling

More information

Chapter 6: Modifying and Combining Data Sets

Chapter 6: Modifying and Combining Data Sets Chapter 6: Modifying and Combining Data Sets The SET statement is a powerful statement in the DATA step. Its main use is to read in a previously created SAS data set which can be modified and saved as

More information

Mapping Clinical Data to a Standard Structure: A Table Driven Approach

Mapping Clinical Data to a Standard Structure: A Table Driven Approach ABSTRACT Paper AD15 Mapping Clinical Data to a Standard Structure: A Table Driven Approach Nancy Brucken, i3 Statprobe, Ann Arbor, MI Paul Slagle, i3 Statprobe, Ann Arbor, MI Clinical Research Organizations

More information