Create a SAS Program to create the following files from the PREC2 sas data set created in LAB2.


Topics:
  Data step
  Subsetting
  Concatenation and Merging

Reference:
  Little SAS Book - Chapter 5, Sections 3.6 and 2.2
  Online documentation

LAB EXERCISE

The following is a lab exercise to give you experience combining SAS data sets. The data files (nmes, employee1-employee4, data1-data3) are located on the website on the LAB page under class3. Download the files from LAB on the website to your folder.

If you are taking the class for credit (either pass/fail or graded), please read the italicized instructions at the end of each section. Please save the logs, output sections and answers to the specified questions in one Word document and e-mail it to the class address, sas@jhsph.edu. Please do not send all of the logs and output windows. Label each section clearly and put your name and LAB3 in the subject line of the e-mail. Use TITLE statements.

Exercise I

Start the SAS program. Create a SAS program to create the following files from the PREC2 SAS data set created in LAB2.

1. Create a temporary file that contains only records with known values of systolic and diastolic pressure (msbp and mdbp).
2. Create another file that contains only males whose age in 1998 was less than 75 years. Do not include the variables wgt and hgt in this data set.
3. Create two files in the same data step that contain males and females separately.

Save the SAS log from these 3 data steps and send it in the exercise e-mail. Label this section Lab3 Exercise 1.
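The three subsetting techniques named in Exercise I can be sketched on a small made-up data set. This is only an illustration, not the solution for PREC2: the data set name toy, the variable names id, sex and age, and the coding sex 1 = male / 0 = female are assumptions, and only msbp, mdbp, wgt and hgt come from the exercise text.

```sas
/* Hypothetical toy data standing in for PREC2.
   id, sex and age are assumed names; sex 1 = male, 0 = female is assumed. */
data toy;
  input id sex age msbp mdbp wgt hgt;
  datalines;
1 1 70 120 80 180 70
2 0 65 . 75 140 64
3 1 80 135 . 200 72
4 0 50 110 70 130 65
;
run;

/* Subsetting IF: keep only records where both pressures are known. */
data known;
  set toy;
  if msbp ne . and mdbp ne .;
run;

/* WHERE plus the DROP= data set option on the output data set.
   A missing age compares as less than any number, so it is excluded explicitly. */
data males_u75 (drop=wgt hgt);
  set toy;
  where sex = 1 and not missing(age) and age < 75;
run;

/* Two output data sets from one DATA step using explicit OUTPUT statements. */
data males females;
  set toy;
  if sex = 1 then output males;
  else if sex = 0 then output females;
run;
```

After each step, the NOTE lines in the log report how many observations and variables each new data set received, which is a quick way to confirm that a subset behaved as intended.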

Exercise II

A. Concatenation and Merging

1. DATA1 and DATA2 are two SAS data sets, described below, that contain disease and follow-up information on a group of patients. The maximum number of disease codes (ICD-9 codes) is 6. We want to create a new file, DATA1_2, by combining these two files. Both files contain the variables described below. Type the following program into the ENHANCED EDITOR window and submit it to create one file with the data from these two files. Check the SAS log and answer the questions.

   Libname mylib 'insert your folder name';
   Data data1_2;
     Set ;
   Run;

   How many observations are in DATA1? How many observations are in DATA1_2? How many variables?

   Variable  Description                              Type
   ID        Patient ID                               Numeric
   DX1       Diagnosis 1                              Character
   DX2       Diagnosis 2                              Character
   DX3       Diagnosis 3                              Character
   DX4       Diagnosis 4                              Character
   DX5       Diagnosis 5                              Character
   DX6       Diagnosis 6                              Character
   Sex       0 = female, 1 = male                     Numeric
   Yearc     Year of last contact                     Numeric
   Yob       Year of birth                            Numeric
   Cvd       Cardiovascular disease, 0 = no, 1 = yes  Numeric
   Smoker    0 = no, 1 = yes                          Numeric
   Chol      Cholesterol, mg/dl                       Numeric
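As a rough illustration of what concatenation with SET does, the sketch below stacks two tiny made-up data sets. The names part1, part2 and both, and the data values, are hypothetical and are not the lab files; only the variable names id, dx1 and sex echo the table above.

```sas
/* Two hypothetical data sets with the same variables. */
data part1;
  input id dx1 $ sex;
  datalines;
1 4019 0
2 25000 1
;
run;

data part2;
  input id dx1 $ sex;
  datalines;
3 4140 1
;
run;

/* Listing both data sets on one SET statement stacks them:
   all observations from part1 first, then all observations from part2. */
data both;
  set part1 part2;
run;

/* PROC CONTENTS shows the observation and variable counts of the result. */
proc contents data=both;
run;
```

In this sketch the combined data set should hold the 2 + 1 = 3 observations of its inputs, and the log NOTE for each data set read reports the counts needed to answer questions like those above.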

2. We have additional patient information to add to the DATA1_2 file created in 1. DATA3 contains the additional information described below for the patients in the DATA1_2 file. Because we are adding data to existing records, use a MERGE step to combine these files.

   Description of the data in DATA3:

   Variable  Type     Description
   ID        numeric  id
   SBP       numeric  systolic blood pressure, mmHg
   DBP       numeric  diastolic blood pressure, mmHg
   NO_CIG    numeric  number of cigarettes per day: 0=none 1=1-10 2= = =40 or more
   BMI       numeric  body mass index, kg/m^2

   Remember that we need to sort both files by ID before merging (using PROC SORT).

   Proc Sort data= ; by id;
   Proc Sort data= ; by id;
   Data mylib.alldata;
     merge ;
   Proc print data=mylib.alldata;
   Run;

   Check the SAS log for errors. Although you may not have any errors, there is a major problem with the merge program. The program did not match-merge the data because the BY statement was missing. Instead, the files were matched sequentially, and data from different patients were combined into one record.

   How many observations are in the ALLDATA file? Compare the values of the ICD-9 codes for the first five records of the ALLDATA file to the first five records of DATA1_2. Notice the problems with the matching.

   Now return to the program editor window. Create a new SAS data set (ALLDATA) by match-merging the data in DATA1_2 with the data in DATA3 using a key variable (id). To do this, add a BY statement to the DATA step and rerun.

   How many observations are in the ALLDATA file? Compare the first five records to the records in DATA1_2.

3. We are going to use the data set option (in= ) to determine which records did not match. Return to the program in the Enhanced Editor and add the following instructions to the DATA step. Remember that the in variable for each file will equal one for each record on that file.

   Data mylib.alldata;
     merge (in=count) (in=count2);
     by id;
     If count=0 then put id= count=;
     If count2=0 then put id= count2=;
   Proc print data=mylib.alldata;
     Title 'With By statement';
   Run;

   Review the log window. How many records from the DATA1_2 file did not have a match in DATA3? How many records from the DATA3 file did not have a match in DATA1_2?

4. Suppose you want to include only those records that matched in the ALLDATA file. You can use the count and count2 variables in the DATA step to exclude the non-matches using IF-THEN clauses. Add the appropriate statement(s) to the program and run. Check the SAS log for errors.

   SAVE the SAS log from this final DATA step and the answer to the following question in the exercise e-mail. Label this section Lab3 Exercise II Part A.

   How many observations are in the ALLDATA file?

   NOTE: The SAS system has an option to prevent accidental merging without a BY statement. Look at the MERGENOBY system option in HELP for further details.

B. Concatenation and Merging

The following files contain employee information. Use the SET and MERGE statements to combine the following files.

1. Create a combined SAS data set named employee1_2 (temporary or permanent, you choose) by concatenating the employee1 and employee2 files (SAS data sets). The data sets contain different individuals with the following variables:

   Variable  Description
   SSN       Social Security number (XXXXXXXXX)
   Name      employee name, in the form lastname, first name
   Hire      hire date (date variable)
   Salary    annual salary
   Phone     office telephone number, in the form XXX-XXXX

   Add a LABEL statement to the DATA step to label the name, hire, and phone variables with the descriptions given above. Add a PROC CONTENTS step to list the contents of employee1_2. Review the LOG and OUTPUT windows.

   How many records are in the employee1_2 SAS data set?

2. Employee3 contains additional employees that we need to add to the file created in 1. Combine this file with the employee1_2 SAS data set created in step 1 and name the new SAS data set employee123. DO NOT INCLUDE the variable name in the employee123 file (DROP or KEEP data set option). The employee3 file includes the following variables:

   Variable  Description
   SSN       Social Security number (XXXXXXXXX)
   Name      employee name, in the form lastname, first name
   Gender    gender, F=female M=male
   Hire      hire date (date variable)
   Salary    annual salary

   Notice that employee3 does not contain the phone variable, but does include the gender variable. Use PROC PRINT to print the new data set employee123. It should contain all of the records in employee1, employee2 and employee3.

   SAVE the OUTPUT window (from #2 only) containing the listing of employee123. Make sure that you put the name EMPLOYEE 123 file as the title at the top of the listing. Include the listing and the answers to the following 3 questions in your exercise e-mail. Label this section Lab3 Exercise II Part B.1.

   1. How many observations are in employee123?
   2. What is the value for gender for SSN= ?
   3. What is the office telephone number for SSN= ?

3. Add the following data from the employee4 file to the records from the employee123 file created in 2. Employee4 contains additional information on the same employees in the employee123 file. SSN is the key variable to use to match the records.

   Variable  Description
   SSN       Social Security number
   Left      date left the company (date variable); blank if still an employee
   Phone     home phone number, in the form (XXX-XXXX)

   First, run PROC CONTENTS on the employee4 file. Notice the label for the phone variable: it is the home phone number. The variable phone on the employee123 file is the office telephone number. We want to merge the employee4 SAS data set with the employee123 SAS data set created in 2, BUT we want to keep both the home and office phone numbers. Remember that SAS will retain only one of the variables because they have the same name (Hint: use a data set option on the MERGE statement). Match-merge using SSN as the key variable and create a new SAS data set employee_total. Print the file using PROC PRINT.

   SAVE the LOG and OUTPUT windows (from #3) containing results from the program creating employee_total and the answers to the following questions. Please label this part of the report as Lab 3 Exercise II Part B.2 and include it in your exercise e-mail.

   1. How many records are in the employee123 and employee4 files?
   2. How many records and variables are in the employee_total file?
   3. List the SSN of the records that do not match. Use the IN data set option to identify the records that do not match and list them in the LOG window.
   4. How many variables does the file employee_total have?

4. Modify the DATA step that creates employee_total to use the IN data set option to include only those observations that exist in both files (employee123 and employee4). There will be 14 observations in employee_total.

   Save the SAS LOG (from #4) creating the new employee_total. Label this section as Lab 3 Exercise II Part B.3 and include it in your exercise e-mail.
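The merge techniques used in Exercise II (sorting, a BY statement, the IN= option and renaming a clashing variable) can be sketched together on two small made-up data sets. Everything here is hypothetical: the names staff, extra, matched, in_staff, in_extra and home_phone, and the data values, are illustrative assumptions and not the lab files or their solution.

```sas
/* Two hypothetical data sets that share a key (ssn) and a clashing variable (phone). */
data staff;
  input ssn phone $;
  datalines;
111 555-0101
222 555-0102
333 555-0103
;
run;

data extra;
  input ssn phone $;
  datalines;
222 555-0202
333 555-0203
444 555-0204
;
run;

/* Both inputs must be sorted by the key before a match-merge. */
proc sort data=staff; by ssn; run;
proc sort data=extra; by ssn; run;

data matched;
  /* IN= creates temporary 0/1 flags; RENAME= keeps both phone numbers
     by giving the second one a different name as it is read. */
  merge staff (in=in_staff)
        extra (in=in_extra rename=(phone=home_phone));
  by ssn;                      /* without BY, rows are paired by position */
  if in_staff = 0 then put 'Not in staff: ' ssn=;
  if in_extra = 0 then put 'Not in extra: ' ssn=;
  if in_staff and in_extra;    /* keep only keys found in both inputs */
run;

proc print data=matched;
  title 'Matched records only';
run;
```

In this sketch, ssn 111 appears only in staff and ssn 444 only in extra, so those two keys are written to the log and the output keeps the two keys present in both inputs, with phone and home_phone as separate variables. Removing the final subsetting IF would instead keep all four keys, with missing values where one side had no record.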

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3

INTRODUCTION to SAS STATISTICAL PACKAGE LAB 3 Topics: Data step Subsetting Concatenation and Merging Reference: Little SAS Book - Chapter 5, Section 3.6 and 2.2 Online documentation Exercise I LAB EXERCISE The following is a lab exercise to give you

More information

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide

WHO STEPS Surveillance Support Materials. STEPS Epi Info Training Guide STEPS Epi Info Training Guide Department of Chronic Diseases and Health Promotion World Health Organization 20 Avenue Appia, 1211 Geneva 27, Switzerland For further information: www.who.int/chp/steps WHO

More information

Creating New Variables in JMP Datasets Using Formulas Exercises

Creating New Variables in JMP Datasets Using Formulas Exercises Creating New Variables in JMP Datasets Using Formulas Exercises Exercise 3 Calculate the Difference of Two Columns 1. This Exercise will use the data table Cholesterol. This data table contains the following

More information

Remove this where. statement to produce the. report on the right with all 4 regions. Retain this where. statement to produce the

Remove this where. statement to produce the. report on the right with all 4 regions. Retain this where. statement to produce the Problem 4, Chapter 14, Ex. 2. Using the SAS sales data set, create the report shown in the text. Note: The report shown in the text for this question, contains only East & West region data. However, the

More information

ASSIGNMENT #2 ( *** ANSWERS ***) 1

ASSIGNMENT #2 ( *** ANSWERS ***) 1 ASSIGNMENT #2 ( *** ANSWERS ***) 1 * problem #1 *** WHERE WILL THE PERMANENT SAS DATA SET BE WRITTEN libname x 'i:\' CREATE A PERMANENT SAS DATA SET NAMED CLINICAL USE AN INFILE STATEMENT TO TELL SAS WHERE

More information

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University

Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University Maximizing Statistical Interactions Part II: Database Issues Provided by: The Biostatistics Collaboration Center (BCC) at Northwestern University While your data tables or spreadsheets may look good to

More information

1 Files to download. 3 Macro to list the highest and lowest N data values. 2 Reading in the example data file

1 Files to download. 3 Macro to list the highest and lowest N data values. 2 Reading in the example data file 1 2 22S:172 Lab session 10 Macros for data cleaning July 17, 2003 GENDER VISIT HR SBP DBP DX AE = "Gender" = "Visit Date" = "Heart Rate" = "Systolic Blood Pressure" = "Diastolic Blood Pressure" = "Diagnosis

More information

22S:166. Checking Values of Numeric Variables

22S:166. Checking Values of Numeric Variables 22S:1 Computing in Statistics Lecture 24 Nov. 2, 2016 1 Checking Values of Numeric Variables range checks when you know what the range of possible values is for a given quantitative variable internal consistency

More information

Using an ICPSR set-up file to create a SAS dataset

Using an ICPSR set-up file to create a SAS dataset Using an ICPSR set-up file to create a SAS dataset Name library and raw data files. From the Start menu, launch SAS, and in the Editor program, write the codes to create and name a folder in the SAS permanent

More information

Contents. About This Book...1

Contents. About This Book...1 Contents About This Book...1 Chapter 1: Basic Concepts...5 Overview...6 SAS Programs...7 SAS Libraries...13 Referencing SAS Files...15 SAS Data Sets...18 Variable Attributes...21 Summary...26 Practice...28

More information

B/ Use data set ADMITS to find the most common day of the week for admission. (HINT: Use a function or format.)

B/ Use data set ADMITS to find the most common day of the week for admission. (HINT: Use a function or format.) ASSIGNMENT #6 (*** ANSWERS ***) #1 DATES The following data are similar to data in example 8.3 in the notes. data admits format admit mmddyy10. input admit1 : mmddyy10. @@ datalines 11181998 12111998 02281998

More information

SAS and Data Management

SAS and Data Management SAS and Data Management Kim Magee Department of Biostatistics College of Public Health Review INFILE statement data bp; infile c:\sas\bp.csv dlm=, ; input clinic $ dbp1 sbp1 dbp2 sbp2; run; Name the dataset

More information

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2018/10/30) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2018/10/30) The numbers of figures in the SPSS_screenshot.pptx are shown in red. Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2018/10/30) The numbers of figures in the SPSS_screenshot.pptx are shown in red. 1. How to display English messages from IBM SPSS Statistics

More information

1. Study Registration. 2. Confirm Registration

1. Study Registration. 2. Confirm Registration USER MANUAL 1. Study Registration Diabetic patients are more susceptible to experiencing cardiovascular events, but this can be minimized with control of blood glucose levels and other risk factors (blood

More information

Task: Design an ER diagram for that problem. Specify key attributes of each entity type.

Task: Design an ER diagram for that problem. Specify key attributes of each entity type. Q1. Consider the following set of requirements for a university database that is used to keep track of students transcripts. (10 marks) 1. The university keeps track of each student s name, student number,

More information

STAT 7000: Experimental Statistics I

STAT 7000: Experimental Statistics I STAT 7000: Experimental Statistics I 2. A Short SAS Tutorial Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2009 Peng Zeng (Auburn University) STAT 7000 Lecture Notes Fall 2009

More information

SAS and Data Management Kim Magee. Department of Biostatistics College of Public Health

SAS and Data Management Kim Magee. Department of Biostatistics College of Public Health SAS and Data Management Kim Magee Department of Biostatistics College of Public Health Review of Previous Material Review INFILE statement data bp; infile c:\sas\bp.csv dlm=, ; input clinic $ dbp1 sbp1

More information

OneUSG Connect. Hire a New Employee. Hire a New Employee HR_JA002

OneUSG Connect. Hire a New Employee. Hire a New Employee HR_JA002 Description This process describes the steps necessary to a new employee into a Position. Conditions A Position has been created in HCM Source Documents Hire Documentation Identify Verification Documentation

More information

i2itracks Population Health Analytics (ipha) Custom Reports & Dashboards

i2itracks Population Health Analytics (ipha) Custom Reports & Dashboards i2itracks Population Health Analytics (ipha) Custom Reports & Dashboards 377 Riverside Drive, Suite 300 Franklin, TN 37064 707-575-7100 www.i2ipophealth.com Table of Contents Creating ipha Custom Reports

More information

NYSLRS NYSLRS. Enroll a Member (Optional)

NYSLRS NYSLRS. Enroll a Member (Optional) Enroll a Member (Optional) NYSLRS NYSLRS New York State and Local Retirement System This quick guide shows you (as a personnel contact for a participating employer) how to enroll a member. To begin the

More information

PROC FORMAT. CMS SAS User Group Conference October 31, 2007 Dan Waldo

PROC FORMAT. CMS SAS User Group Conference October 31, 2007 Dan Waldo PROC FORMAT CMS SAS User Group Conference October 31, 2007 Dan Waldo 1 Today s topic: Three uses of formats 1. To improve the user-friendliness of printed results 2. To group like data values without affecting

More information

A Simple Guide to Using SPSS (Statistical Package for the. Introduction. Steps for Analyzing Data. Social Sciences) for Windows

A Simple Guide to Using SPSS (Statistical Package for the. Introduction. Steps for Analyzing Data. Social Sciences) for Windows A Simple Guide to Using SPSS (Statistical Package for the Social Sciences) for Windows Introduction ٢ Steps for Analyzing Data Enter the data Select the procedure and options Select the variables Run the

More information

using and Understanding Formats

using and Understanding Formats using and Understanding SAS@ Formats Howard Levine, DynaMark, Inc. Oblectives The purpose of this paper is to enable you to use SAS formats to perform the following tasks more effectively: Improving the

More information

Database Concepts Using Microsoft Access

Database Concepts Using Microsoft Access lab Database Concepts Using Microsoft Access 9 Objectives: Upon successful completion of Lab 9, you will be able to Understand fundamental concepts including database, table, record, field, field name,

More information

GP Mac. Drug report. Figure 0.1 Patient s on a Drug pop-up box

GP Mac. Drug report. Figure 0.1 Patient s on a Drug pop-up box GP Mac 1. Creating a Diabetes Register First it is essential to find your patients with a diagnosis of Diabetes In GP Mac you can do a drug search for all patients prescribed Diabetes Meds/Test Strips.

More information

Introduction to SAS Statistical Package

Introduction to SAS Statistical Package Instructor: Introduction to SAS Statistical Package Biostatistics 140.632 Lecture 1 Lucy Meoni lmeoni@jhmi.edu Teaching Assistant : Sorina Eftim seftim@jhsph.edu Lecture/Lab: Room 3017 WEB site: www.biostat.jhsph.edu/bstcourse/bio632/default.htm

More information

Epidemiology Principles of Biostatistics Chapter 3. Introduction to SAS. John Koval

Epidemiology Principles of Biostatistics Chapter 3. Introduction to SAS. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 3 John Koval Department of Epidemiology and Biostatistics University of Western Ontario What we will do today We will learn to use use SAS to 1. read

More information

3. Almost always use system options options compress =yes nocenter; /* mostly use */ options ps=9999 ls=200;

3. Almost always use system options options compress =yes nocenter; /* mostly use */ options ps=9999 ls=200; Randy s SAS hints, updated Feb 6, 2014 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, first version: March 8, 2013 ***************; 2. Don

More information

If you have never used IACRA, your first step is to become registered as an applicant.

If you have never used IACRA, your first step is to become registered as an applicant. IACRA Registration: If you have never used IACRA, your first step is to become registered as an applicant. Go to http://iacra.faa.gov NOTE: If you receive a browser error, you will need to download the

More information

STEP BY STEP HOW TO COMPLETE THE ELECTRONIC BGC FORM

STEP BY STEP HOW TO COMPLETE THE ELECTRONIC BGC FORM Human Resources Background Check Program backgroundchecks.hr.ncsu.edu 2711 Sullivan Drive, Admin Services II Raleigh, NC 27695 background-checks@ncsu.edu STEP BY STEP HOW TO COMPLETE THE ELECTRONIC BGC

More information

CFAR Biometrics_REDCap Data Restructuring Using SAS

CFAR Biometrics_REDCap Data Restructuring Using SAS /* syntax00a_redcap_import PREPARE DATA. EXPORTED INSTRUMENTS: EXPORTED EVENTS: EXPORTED DATA: ALL (ONLY NEED: ENROLMENT VISIT, FOLLOWUP, AND LAB DATA). ALL. (I WILL PARSE BELOW.) TO SAS. SAVED THE PATHWAY

More information

Ten Great Reasons to Learn SAS Software's SQL Procedure

Ten Great Reasons to Learn SAS Software's SQL Procedure Ten Great Reasons to Learn SAS Software's SQL Procedure Kirk Paul Lafler, Software Intelligence Corporation ABSTRACT The SQL Procedure has so many great features for both end-users and programmers. It's

More information

Beyond FORMAT Basics Mike Zdeb, School of Public Health, Rensselaer, NY

Beyond FORMAT Basics Mike Zdeb, School of Public Health, Rensselaer, NY Beyond FORMAT Basics Mike Zdeb, University@Albany School of Public Health, Rensselaer, NY ABSTRACT Beginning and even intermediate level SAS users sometimes never venture beyond the basics in using formats.

More information

Introduction (SPSS) Opening SPSS Start All Programs SPSS Inc SPSS 21. SPSS Menus

Introduction (SPSS) Opening SPSS Start All Programs SPSS Inc SPSS 21. SPSS Menus Introduction (SPSS) SPSS is the acronym of Statistical Package for the Social Sciences. SPSS is one of the most popular statistical packages which can perform highly complex data manipulation and analysis

More information

Introduction to Database Concepts and Microsoft Access Database Concepts and Access Things to Do. Introduction Database Microsoft Access

Introduction to Database Concepts and Microsoft Access Database Concepts and Access Things to Do. Introduction Database Microsoft Access Introduction to Database Concepts and Microsoft Access 2016 Academic Health Center Training training@health.ufl.edu (352) 273 5051 Database Concepts and Access 2016 Introduction Database Microsoft Access

More information

Tanita Health Ware Help

Tanita Health Ware Help Tanita Health Ware Help Getting Started Managing Users Measurements Analysis Graphs Files & Sharing Exporting ANT Scale Installation Using Garmin Watches Bluetooth Scale Installation Getting Started The

More information

Merge Processing and Alternate Table Lookup Techniques Prepared by

Merge Processing and Alternate Table Lookup Techniques Prepared by Merge Processing and Alternate Table Lookup Techniques Prepared by The syntax for data step merging is as follows: International SAS Training and Consulting This assumes that the incoming data sets are

More information

2. Don t forget semicolons and RUN statements The two most common programming errors.

2. Don t forget semicolons and RUN statements The two most common programming errors. Randy s SAS hints March 7, 2013 1. Always begin your programs with internal documentation. * ***************** * Program =test1, Randy Ellis, March 8, 2013 ***************; 2. Don t forget semicolons and

More information

CPRD Aurum Frequently asked questions (FAQs)

CPRD Aurum Frequently asked questions (FAQs) CPRD Aurum Frequently asked questions (FAQs) Version 2.0 Date: 10 th April 2019 Authors: Helen Booth, Daniel Dedman, Achim Wolf (CPRD, UK) 1 Documentation Control Sheet During the course of the project

More information

Automating Unpredictable Processes:

Automating Unpredictable Processes: Automating Unpredictable Processes: Building Responsive Apps using Business Rules By Carl Hewitt, Chief Architect, Decisions and Heath Oderman, CTO, Decisions Copyright 2016 Building Responsive Apps: Comparing

More information

Lab #1: Introduction to Basic SAS Operations

Lab #1: Introduction to Basic SAS Operations Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon

More information

SAS Programs SAS Lecture 4 Procedures. Aidan McDermott, April 18, Outline. Internal SAS formats. SAS Formats

SAS Programs SAS Lecture 4 Procedures. Aidan McDermott, April 18, Outline. Internal SAS formats. SAS Formats SAS Programs SAS Lecture 4 Procedures Aidan McDermott, April 18, 2006 A SAS program is in an imperative language consisting of statements. Each statement ends in a semi-colon. Programs consist of (at least)

More information

Final Stat 302, March 17, 2014

Final Stat 302, March 17, 2014 First Name Last Name Student ID Final Stat 302, March 17, 2014 Fritz Scholz Questions 1-15 count as 4 points each, the rest as 6 points each (180 total). 1. Could Y and y refer to different objects within

More information

Update Experience USER GUIDE. NextGen. Contact Us. NextGen Update Experience User Guide. E: P: F:

Update Experience USER GUIDE. NextGen. Contact Us. NextGen Update Experience User Guide. E: P: F: NextGen Update Experience USER GUIDE Contact Us E: support@harriscomputer.com P: 800.239.6224 F: 251.544.4901 Harris School Solutions Update Experience User Guide Page 1 esupport: https://support.harriscomputer.com/home.aspx

More information

Intermediate SAS: Working with Data

Intermediate SAS: Working with Data Intermediate SAS: Working with Data OIT Technical Support Services 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Getting set up for the Intermediate SAS workshop:...

More information

Lecture 1 Getting Started with SAS

Lecture 1 Getting Started with SAS SAS for Data Management, Analysis, and Reporting Lecture 1 Getting Started with SAS Portions reproduced with permission of SAS Institute Inc., Cary, NC, USA Goals of the course To provide skills required

More information

BE/EE189 Design and Construction of Biodevices Lecture 2. BE/EE189 Design and Construction of Biodevices - Caltech

BE/EE189 Design and Construction of Biodevices Lecture 2. BE/EE189 Design and Construction of Biodevices - Caltech BE/EE189 Design and Construction of Biodevices Lecture 2 LabVIEW Programming More Basics, Structures, Data Types, VI Case structure Debugging techniques Useful shortcuts Data types in labview Concept of

More information

Import and Browse. Review data. bp_stages is a chart based on a graphic

Import and Browse. Review data. bp_stages is a chart based on a graphic Import and Browse Review data is a chart based on a graphic hrs_clin is clinical data patient id (anonymized) some interesting things to note here. female is a boolean age is a number depress_dx is a 0/1,

More information

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to

More information

Predicting Diabetes and Heart Disease Using Diagnostic Measurements and Supervised Learning Classification Models

Predicting Diabetes and Heart Disease Using Diagnostic Measurements and Supervised Learning Classification Models Predicting Diabetes and Heart Disease Using Diagnostic Measurements and Supervised Learning Classification Models Kunal Sharma CS 4641 Machine Learning Abstract Supervised learning classification algorithms

More information

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables Select Cases GRAPHS The Select Cases command excludes from further analysis all those cases that do not meet specified selection criteria. Select Cases For a subset of the datafile, use Select Cases. In

More information

1/24/2012. Chapter 7 Outline. Chapter 7 Outline (cont d.) CS 440: Database Management Systems

1/24/2012. Chapter 7 Outline. Chapter 7 Outline (cont d.) CS 440: Database Management Systems CS 440: Database Management Systems Chapter 7 Outline Using High-Level Conceptual Data Models for Database Design A Sample Database Application Entity Types, Entity Sets, Attributes, and Keys Relationship

More information

SAS Training Spring 2006

SAS Training Spring 2006 SAS Training Spring 2006 Coxe/Maner/Aiken Introduction to SAS: This is what SAS looks like when you first open it: There is a Log window on top; this will let you know what SAS is doing and if SAS encountered

More information

PHPM 672/677 Lab #2: Variables & Conditionals Due date: Submit by 11:59pm Monday 2/5 with Assignment 2

PHPM 672/677 Lab #2: Variables & Conditionals Due date: Submit by 11:59pm Monday 2/5 with Assignment 2 PHPM 672/677 Lab #2: Variables & Conditionals Due date: Submit by 11:59pm Monday 2/5 with Assignment 2 Overview Most assignments will have a companion lab to help you learn the task and should cover similar

More information

11/27/2011. Derek Chapman, PhD December Data Linkage Techniques: Tricks of the Trade. General data cleaning issue

11/27/2011. Derek Chapman, PhD December Data Linkage Techniques: Tricks of the Trade. General data cleaning issue Derek Chapman, PhD December 2011 Data Linkage Techniques: Tricks of the Trade General data cleaning issue Linkage can create more duplicates Easier to deal with before linkage Accurate counts are important

More information

Lab 1: Introduction to Data

Lab 1: Introduction to Data 1 Lab 1: Introduction to Data Some define Statistics as the field that focuses on turning information into knowledge. The first step in that process is to summarize and describe the raw information the

More information
