CMU MSP: SAS FORMATs and INFORMATs, Howard Seltman, October 15, 2017


1) Informats are programs that convert ASCII (Unicode) text to binary. Formats are programs that convert binary to text. Both come in two forms, character (which always has a $ in its name) and numeric, named for the binary data type. Common uses include reading and writing dates, storing character categories as numbers, displaying numeric codes as words, binning numeric variables, collapsing categories, and data checking on input.

2) Making your own INFORMATs: INFORMATs determine the relationship between the external file's text (in the input buffer) and the data value that INPUT places into the program data vector (PDV).

   a. Create the informat(s) using a PROC FORMAT, and then use them in an INFORMAT (or INPUT) statement in a DATA step.

   b. A PROC FORMAT can have one or more INVALUE statements that define the informat(s). (You can also use several PROC FORMATs.) Unless you follow the procedures to store it permanently, the INFORMAT will not be available in the future without re-running the PROC FORMAT.

   c. Restrictions: INFORMAT names must start with $ if the stored (PDV, binary) value is a string, can be up to 31 characters long, and must NOT end with a number. Do not include the final "." when defining the INFORMAT.

   d. Syntax:

         INVALUE myinfmtname [(myoptions)]
           myvalrange[, myvalrange ...]=myvalue
           [myvalrange[, myvalrange ...]=myvalue ...];

      The left side of the equals sign is what is looked for in the input text, and the right side is what is stored in the PDV (with numbers converted to 8-byte internal format). Any input text that does not match any myvalrange is converted as if there were no INFORMAT (i.e., using the default INFORMAT).

      The most useful option is (UPCASE), which converts the data to upper case before comparing it to a myvalrange. The parentheses are required.

      myvalrange is either a value such as 15 or 'Fred', or a range of values of the form 3-5, 'A'-'C', 3-<5 for [3,5), low-3 for (-∞, 3], or 120-high for [120, ∞), or the keyword OTHER (unquoted) to indicate all other values.

      myvalue is either a value whose type matches the stored data type, or an existing informat of the correct type inside []'s to indicate processing by that informat. Also allowed are _ERROR_ to generate an error and _SAME_ for no conversion. For more details, see The FORMAT PROCEDURE: INVALUE Statement.

   e. Examples

      i.  PROC FORMAT;
            INVALUE trial 'A'-<'N'=1 'N'-'Z'=2 [...]=3 low-0=_error_;
          DATA temp;
            INFORMAT trial trial.;
            INPUT trial @@;
            [...]
          B12 M17 O23 a [...]

          Results are 1, 1, 2, . (with an error), 3, 3, 3, 12345, 3, . (with an error).
          @@ means keep the rest of the input line for the next iteration of the implied for loop.

      ii. PROC FORMAT;
            /* Note: the "$" is a prefix, not a suffix */
            INVALUE $gendfmt 1='M' 2='F' OTHER=_ERROR_;
          DATA temp;
            INFORMAT gender $gendfmt.;
            INPUT [...]
          [...] M F

          Results are M, F, F, M, ., ., ., . (with errors for the missing values).

          PROC FORMAT;
            INVALUE $gend2fmt 1,M=M 2,F=F "."=" " OTHER=_ERROR_;
          DATA temp;
            INFORMAT gender $gend2fmt.;
            INPUT [...]
          [...] M F

          Results are M, F, F, M, " ", " ", M, F, with an error message for the 0 only. Note that a value consisting entirely of spaces is the missing value for strings.

      iii. PROC FORMAT;
             INVALUE mytfformat T,True,TRUE,true=1 F,False,FALSE,false=0 OTHER=_ERROR_;
             INVALUE mytfuformat (UPCASE) T,TRUE=1 F,FALSE=0 OTHER=_ERROR_;
           DATA junk;
             INFILE [...]
             INFORMAT num mytfformat. numu mytfuformat.;
             INPUT num numu [...]
           True True
           TRUE TRUE
           false false
           FALSE FALSE
           T T
           0 0

           Results are 1, 1, 0, 0, 1, . for both variables.

      iv. In the above syntax (list input format), the informats on the INFORMAT line must be present (via re-running PROC FORMAT or using permanent (in)formats) to use the data set in a SET statement in the future. The modified list input format shown here does not require the INFORMAT to be known for future use in a SET statement.

           LIBNAME here ".";
           DATA here.junk;
             INFILE [...]
             INPUT num : [...]
           True TRUE false FALSE T 0

           PROC FORMAT;
             INVALUE $fuel G=Gas O=Oil S=Solar X="Other fuels" OTHER=_ERROR_;
           DATA here.crap;
             INPUT type : $fuel. amount;
             CARDS;
           G 10
           O 12.3
           X 11
           S 9
           O 21

   f. Note that there is a way to create format ranges from the fields in a data set (see below).
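The INVALUE rules above can be tried end to end with a small self-contained sketch. All names here (yesno, survey, respondent, answer) and the data values are hypothetical and chosen only for illustration; the sketch assumes modified list input (the colon) so that the informat does not have to be attached to the variable.

```sas
/* Sketch only: yesno, survey, respondent and answer are made-up names. */
PROC FORMAT;
  INVALUE yesno (UPCASE)          /* compare after converting to upper case */
    'Y', 'YES' = 1
    'N', 'NO'  = 0
    '.'        = .                /* an explicit missing-value marker       */
    OTHER      = _ERROR_;         /* anything else is flagged as bad data   */
RUN;

DATA survey;
  INPUT respondent $ answer : yesno.;
  DATALINES;
Ann yes
Bob N
Cal Yes
Dee maybe
Eve .
;
RUN;

PROC PRINT DATA=survey;
RUN;
```

With these five records, answer should come out as 1, 0, 1, ., . and the log should carry an invalid-data note for "maybe" only, since "." is mapped to missing explicitly rather than through OTHER.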

3) Making your own formats

   a. Create formats for output using PROC FORMAT. Use them in DATA steps to make them defaults for some variables. Use them in PROC steps to invoke them temporarily. This is usually clearer and safer than IF/THEN in DATA steps.

   b. A PROC FORMAT can have one (or more) VALUE statements that define the format(s). (Or you can use several PROC FORMATs.) Unless you follow the procedures to store it permanently, the FORMAT will not be available in the future without re-running the PROC FORMAT, which can cause a problem for permanent data sets.

   c. Restrictions: FORMAT names must start with $ if the stored data type (PDV, binary) is a string, and can be up to 32 characters long. Do not include the final "." when defining the FORMAT.

   d. Syntax:

         VALUE myfmtname
           myvalrange[, myvalrange ...]=myvalue
           [myvalrange[, myvalrange ...]=myvalue ...];

      The left side of the equals sign is what is looked for in the data, and the right side is what is output instead. Any data value that does not match any myvalrange is output as if there were no FORMAT (i.e., using the default FORMAT).

      myvalrange is either a value such as 15 or 'Fred', or a range of values of the form 3-5 or 'A'-'C' (it must match the format/data type), or the keyword OTHER (unquoted).

      myvalue is either a string value or an existing format of the correct type inside []'s. If it is a format, that format is used to create the result. For more details, see The FORMAT PROCEDURE: VALUE Statement.

   e. Examples

      i.  PROC FORMAT;
            VALUE $fuel G=Gas O=Oil S=Solar X="Other fuels" OTHER="Unknown type";
          DATA craps;
            INPUT type : $1. amount;
            FORMAT type $fuel.;
            CARDS;
          G 10
          O 12.3
          X 11
          S 9
          N 21

      ii. PROC FORMAT;
            VALUE fuel 0-1=Gas 2=Oil 3=Solar 4-9="Other fuels" .=missing other=error;
          DATA crapn;
            INPUT type Amount;
            FORMAT type fuel.;
            CARDS;
          [...]
          PROC MEANS DATA=crapN;
            /* WHERE type = "Gas"; is an ERROR */
            WHERE PUT(type, fuel.) = "Gas";
            VAR amount;

      iii. PROC FORMAT;
             VALUE ageranges low-<18 = "Minor"
                             18-<45  = "Young adult"
                             45-<60  = "Middle age"
                             60-high = "Elderly";
           PROC FREQ DATA=ageData;
             TABLES age / MISSING;
             FORMAT age ageranges.;

      iv. PROC FORMAT;
            VALUE pval 0-<0.005='<0.005' OTHER=[5.3];
          DATA pvals;
            INPUT p [...]
            FORMAT p pval.;
          [...]
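As a runnable companion to the binning examples above, here is a minimal sketch that includes its own data. The format name bpcat, the data set visits, the variable names, and the cutpoints 120 and 140 are all assumptions made up for illustration, not values from the handout.

```sas
/* Sketch only: bpcat, visits, id, sbp and the cutpoints are made-up. */
PROC FORMAT;
  VALUE bpcat
    LOW -< 120 = 'Normal'        /* LOW does not include numeric missing */
    120 -< 140 = 'Elevated'      /* [120, 140)                           */
    140 - HIGH = 'High'          /* [140, infinity)                      */
    .          = 'Not recorded';
RUN;

DATA visits;
  INPUT id sbp;
  DATALINES;
1 118
2 120
3 139.5
4 140
5 .
;
RUN;

/* The format is applied only inside this PROC; the stored values are unchanged. */
PROC FREQ DATA=visits;
  TABLES sbp / MISSING;
  FORMAT sbp bpcat.;
RUN;
```

The frequency table should show one Normal, two Elevated, one High, and one Not recorded. Because the FORMAT statement sits in the PROC step rather than the DATA step, the grouping is temporary, which is the usage described in 3a.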

4) Using your formats and informats to create new variables

   a. The PUT() function expresses a variable using a FORMAT. E.g., continuing the example from above, when type=5 then PUT(type, fuel.) returns the string value "Other fuels". The return value of PUT() is always a string, and the type of the argument must match the format: a character argument for a $ format, a numeric argument otherwise.

   b. The INPUT() function uses an INFORMAT to do the equivalent of reading data from a plain text file, but takes its input from a variable instead. INPUT("$12,123.45", comma10.2) returns 12123.45. The argument of INPUT() is always a string, and the type of the return value matches the informat: character for a $ informat, numeric otherwise.

   c. Example: longtype becomes the long string version of type. Neither a LENGTH statement nor a $ is needed, because that information (a character variable of length 11) is stored in the format.

         DATA crap2;
           SET craps;
           longtype = PUT(type, fuel.);

   d. Example: numtype becomes the numeric version of type.

         PROC FORMAT;
           VALUE $numfuel G="1,000" O="2,000" S="3,000" X=4 OTHER="-1";
         DATA crap3;
           SET craps;
           numtype = INPUT(PUT(type, $numfuel.), COMMA7.);

   e. Alternate date formats

         DATA dates(drop=s);
           LENGTH s $11;
           INPUT [...]
           IF UPCASE(SUBSTR(s, 4, 1)) >= 'A' & UPCASE(SUBSTR(s, 4, 1)) <= 'Z'
             THEN date = INPUT(s, DATE11.);
             ELSE date = INPUT(s, MMDDYY10.);
           FORMAT date DATE11.;
         [...]
         1/25/[...] [...]/2/[...] [...]Jan[...] [...]Dec[...]
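The two conversion directions in this section can be seen side by side in a short sketch. The names $sizefmt, convert, code, amount_txt, size_label and amount are hypothetical, as are the data values; COMMA12. is the standard SAS informat that strips dollar signs and commas.

```sas
/* Sketch only: all data set, variable and format names are made-up. */
PROC FORMAT;
  VALUE $sizefmt 'S'='Small' 'M'='Medium' 'L'='Large' OTHER='Unknown';
RUN;

DATA convert;
  LENGTH code $1 amount_txt $12;
  INPUT code $ amount_txt $;
  /* PUT: value -> text. The result is character, so no LENGTH or $ is needed. */
  size_label = PUT(code, $sizefmt.);
  /* INPUT: text -> value. A numeric informat gives a numeric result.          */
  amount = INPUT(amount_txt, COMMA12.);
  DATALINES;
S $1,250.50
L 980
X 12,000
;
RUN;

PROC PRINT DATA=convert;
RUN;
```

For these three records, size_label should be Small, Large, Unknown and amount should be 1250.5, 980, 12000. Nesting the two calls, as in example d above, is the usual way to turn a coded character variable into a number.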

5) Permanent (IN)FORMATs

   a. Permanent (IN)FORMATs are better for data management, precluding the need to re-run PROC FORMATs each time you use your data.

   b. Step 1: Use PROC FORMAT LIBRARY=myLibRef; when creating the (IN)FORMAT.

   c. Step 2: Use OPTIONS FMTSEARCH=(myLibRef); in current and future sessions before any steps that reference the formats. (Don't forget the parentheses.)

   d. Not doing this for FORMATs included in DATA steps that create permanent data sets will prevent the data set from being used in the future unless you manually re-run the PROC FORMAT.

   e. Not doing this for INFORMATs included in the DATA steps that create permanent data sets will prevent the data set from being the target of a SET statement in a DATA step in the future unless you manually re-run the PROC FORMAT.

   f. Example of permanent (IN)FORMAT creation code:

         LIBNAME heart "heartdata";
         FILENAME hdata "heartdata/hearts.txt";
         /* Special formats for reading/writing lab data */
         PROC FORMAT LIBRARY=heart;
           INVALUE readlab...;
           VALUE labfmt...;
         OPTIONS FMTSEARCH=(heart);
         /* create permanent data set */
         DATA heart.hrtstudy;
           INFILE hdata;
           INPUT id $ date lab1 lab2 outcome;
           INFORMAT date DDMMYY8.;
           INFORMAT lab1 lab2 readlab.;
           FORMAT date DATE11.;
           FORMAT lab1 lab2 labfmt.;

   g. Example of permanent (IN)FORMAT data use code:

         LIBNAME study "heartdata";
         OPTIONS FMTSEARCH=(study);
         DATA temp;
           SET study.hrtstudy;
           labratio = lab1/lab2;
         /* Analyze */
         PROC REG data=temp;
           MODEL outcome = labratio;
         QUIT; /* because REG is an interactive PROC */

6) INFORMATs from data sets

   a. The FORMAT procedure has an option CNTLIN=fmtDataSet for the PROC FORMAT statement, which reads the formatting information from the specified data set rather than from the body of the PROC.

   b. The data set supplies the formatting information using variables called FMTNAME, START, and LABEL, and possibly TYPE, END, SEXCL, and EEXCL (start exclude and end exclude).

   c. FMTNAME is the name of the format to be created. A given PROC FORMAT can create one or several formats using CNTLIN=. Each format typically consists of several data lines with the same FMTNAME.

   d. START specifies the value that is to be formatted, or START and END together specify a range of values. If both are included and a single value is to be formatted, both START and END must be set to that value. Typically, START is a string column, but if all values are numeric, a numeric type is allowed.

   e. LABEL contains the result of formatting, i.e., the value to be output. Typically, this column is a string, but if all values are numeric, numeric is allowed.

   f. If START and END are both supplied, you may include SEXCL and EEXCL (where S=start, E=end, and EXCL=exclude) and set each one to Y or N, where Y means the range excludes the specified endpoint and N means it does not.

   g. If the value to be formatted is a string, then either its FMTNAME must start with a $ or a TYPE column must be included. TYPE must be C for character (string) or N for numeric on every line (P for picture is also allowed).

   h. Example:

         DATA myfmts;
           LENGTH FMTNAME $5 START $1;
           INPUT FMTNAME START LABEL $15.;
           [...]
         foods 1 broccoli
         foods 2 tomatoes
         foods 3 brussel sprouts
         foods 4 chicken
         foods .
         PROC FORMAT CNTLIN=myFmts;
         DATA myrecipes;
           LENGTH name $9;
           INPUT name ingr1 ingr2 ingr3;
           LABEL ingr1="ingredient 1" ingr2="ingredient 2" ingr3="ingredient 3";
           FORMAT ingr1-ingr3 foods.;
           [...]
         ChickenBS 4 3 .
         BrocTom 1 2 .
         Veg [...]
         ChickTom 2 4 .
         PROC PRINT DATA=myRecipes;
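The range variables described in 6d and 6f (END, SEXCL, EEXCL) can be sketched with a numeric format built entirely from a data set. Everything named here (agectl, agegrp, people, age) is hypothetical, and the finite bounds 0 and 120 are an assumption made so that only the control variables listed above are needed.

```sas
/* Sketch only: agectl, agegrp, people and the age bounds are made-up. */
DATA agectl;
  LENGTH fmtname $8 type $1 sexcl eexcl $1 label $12;
  RETAIN fmtname 'agegrp' type 'N' sexcl 'N';  /* same on every row        */
  INPUT start end eexcl $ label $12.;          /* eexcl=Y excludes the END */
  DATALINES;
0 18 Y Minor
18 45 Y Young adult
45 60 Y Middle age
60 120 N Elderly
;
RUN;

/* Build the format agegrp. from the control data set. */
PROC FORMAT CNTLIN=agectl;
RUN;

DATA people;
  INPUT age @@;
  DATALINES;
17 18 44.9 45 60 75
;
RUN;

PROC FREQ DATA=people;
  TABLES age;
  FORMAT age agegrp.;
RUN;
```

With these six ages the table should show one Minor, two Young adult, one Middle age, and two Elderly, the same grouping a hand-written VALUE statement with 0-<18, 18-<45, 45-<60, and 60-120 would give. Keeping the ranges in a data set is mainly useful when the cutpoints come from another table or change often.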


More information

PASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year

PASS4TEST. IT Certification Guaranteed, The Easy Way!  We offer free update service for one year PASS4TEST \ http://www.pass4test.com We offer free update service for one year Exam : A00-212 Title : SAS Advanced Programming Exam for SAS 9 Vendor : SASInstitute Version : DEMO Get Latest & Valid A00-212

More information

An Animated Guide : Speed Merges: resource use by common procedures Russell Lavery, Contractor, Ardmore, PA

An Animated Guide : Speed Merges: resource use by common procedures Russell Lavery, Contractor, Ardmore, PA An Animated Guide : Speed Merges: resource use by common procedures Russell Lavery, Contractor, Ardmore, PA ABSTRACT This paper is a comparison of how resources are used by different SAS table lookup (Figure

More information

