EDUCATION IN PROC TABULATE I EDUCATION IN PROC TABULATE II

Size: px
Start display at page:

Download "EDUCATION IN PROC TABULATE I EDUCATION IN PROC TABULATE II"

Transcription

1 EDUCTON N PROC TBULTE Georg Morsing - SS nstitute EDUCTON N PROC TBULTE 258

2 Education in PROC TBULTE ntroduction NECESSRY STTEMENTS CONTENTS PROC TBULTE DT= SSdataset l. ntroduction 2. The TBLE statement 3. Calculations 4. mprovement of output. The table layout 6. The menu system CLSS VR TBLE variable list variable list PRTTON OF VRBLES: variable/expression n PROC TBULTE it is necessary to characterize all variables to the SS System: Bl '" QULFCTONS Basic knowledge of: No knowledge of: - PROC step generally - PROC FORMT - LBEL and FORMT statements - PROC TBULTE CLSS variables: VR variables: - grouped (discrete) variables - numeric or character variables - examples: area code year quarter - analyze (continuous) variables - numeric variables - examples: income level of interest number of unemployed ry""'..i:!r.'...r:..: ":"-'.",-.: ' """",,;,;,,..J.;::..->"' > ;" ""- ':;":-" "p ":'.'<,"::"!" ';;'':'''''"_.'''.:'i.....!.!:''_''':' i;_..,' ':"::',..r",,:.<, ",.Y':"... <.,,:, 'r" ", :.,... :.,,>.-:.-;.:,:::.... ;.;_;,;i-<.;:;;:.w...f:-,.,.'- - '.;.:,.(''''''''''i''''...,..;;... ;-!.>;j>:j''( 'ri-:;.. '.'.,(_.,;).. :'!.-;...;";:..... {.".<... ),,"""'"';\9:fr -lumw6ibed... )N-i:.>iii... lit-h' f'.fti''';"v.,..

3 '"..t:j'$,,,+l.{!.;rp:,:,jjt _,,s'b6ji'j.!rtim!'.1 "'«" ;t_!;e::..7'i'-k ':-"-d:w.,o.l;:,',". -,.,..... ",", '""' The TBLE statement The TBLE statement' VR '{ TBLE x '{ '{, ===> Dimension VR '{ TBLE *B B * ===> concatenating VR '{ TBLE B '{ B '{ VR Y TBLE -, *B B * ===> Concatenating

4 options ps=42 nonumber nodate formchar='-nn HH WU,; The TBLE statement proc print data=sasuser.unemploy; title 'Number of unemployed persons in thousands in Denmark' proc tabulate data=sasuser.unemploy; title 'pply TBULTE to report writing'; class year quarter; var male female: run: table year*quarter male female The TBLE statement pply TBULTE to report writing MLE FEMLE SUM SUM YER Number of unemployed persons in thousands in Denmark OBS YER MLE FEMLE ; , re , ' ! ,"'.;;';;V':d-: :

5 ,:-.t '!'cm!,."{:.tl!"' :&,& jt:j)"et" ''''',-''p-l.."','''',\ H rt'.''''::'< ",,::..-,'">.-';'t"f..r"l/!'"i;-':-:-"'-::-::"?".:"':-;'-':'l"'!"(_'"'l.t;:;,-,,-,-:j;" '. -:-: -,- -, '- Calculations Simple calculations KEYWORDS: CLCULTONS N PROC TBULTE Standard calculation: SUM When nothing is stated, the procedure will compute the sum of the analysis variables given. These two TBLE statements are identical: TBLE B, TBLE B *SUM N NMSS SUM MEN MN, M RNGE the number of observations (- missing values) the number of observations including missing values the sum the mean the minimum the maximum the range between the maximum and the minimum VR Y TBLE, *(MEN MN M) CLCULTONS N PROC TBULTE n the TBLE statement you can decide for yourself which kind of calculation you wish to execute by stating a KEYWORD. KEYWORD can be divided into 4 groups: 1. Simple calculations MEN MN M 2. dditions 3. Calculation of percentages 4. statistical calculations PROC TBULTE DT=sasuser.unemploy; TTLE 'Simple calculations in PROC TBULTE'; CLSS year quarter; VR male female ; RUN; TBLE year, male*(min mean max); TBLE year, (male female)*(mean range)

6 Simple calculations Totals Simple calculations in FRoe TBULTE i TOTLS N PROe TBULTE MLE MN MEN M YER KEYWORD LL - calculations of a total sum, mean, eeqe - can be placed everywhere in a TBLE statement - may figure several times in a TBLE statement PRoe TBULTE; elss B VR Y ; TBLE LL, Y x Y re Simple calculations in PRoe TBULTE MLE FEMLE MEN RNGE MEN RNGE YER LL option ps=84; PRoe TBULTE.DT=sasuser.unemp1oy; TTLE 'Totals in PRoe TBULTE'; elss year quarter; VR male female ; RUN; TBLE year*(quarter LL) LL, (male female)*mean ;>C!.""",.,-."?;",,,. _.-."'<';: f"";:...:r... : :;"r..,!o't".u:-."""d,',"' ::...: ;;;:""",,:y:::',_.;,"' 4. ' >:;!.!,"',,-i-'>!-0'.:.,-.-"'-, -"'";'t-""?-':"'h" " '-''' ''7',:-,,,,;H-,'r-'-'_!.-.'!J.''l ''''''''';''':';;'';;':''''''.r,,,,,,j:...":t::t..,.,_a.;;' ;;'"- '''''$-"-,,,y;.;"l,,",,h'-!hn-''lj?'''mbdu.''''''....

7 ;:'_ f:p.'; _b'»",{\f!.-tf'i-"9,,_,"",!-'::_"_'"'yk-!",-_'_.'._', - ',_.' _:,-F:';- Totals Percent calculation CLCULTON OF PERCENTGES N PROC TBULTE Totals in PROC TBULTE MLE FEMLE KEYWORD MEN YER 1980 l' MEN PCTN PCTSUM percentage in proportion to the number of observations percentage in proportion to the SUM of an analysis variable LL LL (J) '" LL , PCTN < variable > PCTSUM < variable*variable > VR Y ; TBLE LL, *( SUM PCTSUM< LL> ) - - LL SUM PCTSUM LL LL option ps=84; PROC TBULTE RUN; TTLE CLSS VR DT=sasuser.unemploy : 'Percentages in PRoe TBULTE'; year quarter; male female ; TBLE year*(quarter all),male*mean PCTN< LL>; LL LL

8 ;l",..'''':'''-'' ";"'j;...s;::w('.';t.\ ;:.<'-_...'.;:e:;..(. i-' );:..;v.,;.. :-,;,>-;- '!iw:<':'_',""''':';'';;'.-.:,j.:iaw;i.'-:i:'... :".i.: - "\. "i" -. -". -. -_",:,:p:!,r;;.,."'''', :''; ''':'-"7-;,,,,,,,:\_>_,;';'t :>l: '''-h,i..i2.'''; :;;;;:'""_1:liM","i-.;.::...::'t.:t';":'i""r.l"';"l,. - 1'),,< M>: "::'>'"'" _,,""l' p{:. <r-.;. ;......;;;:)U,,"+-\.sov ' Percent calculation Statistical calculations Percentages in PROC TBUlTE MLE - MEN PCTH STTSTCL CLCULTONS N PROC TBULTE YER.lauRTER 1980 l' LL KEYWORD: STD USS the standard deviation the incorrected sum of squares LL ' l LL CSS STDERR cv T PRT VR the sum of squares corrected for the mean the standard t error f of the mean the percentage coefficient of variation student's t-test (population mean = 0) the probability under the null hypothesis of obtaining an absolute value of t greater than the t value observed in this sample. the variance re LL LL ! LL

9 :-;'t_!1?":'''':.''!:'n'i{g'l{_,,_ t,;:k;'-'(c', ',>-.' mprovement of output mprovement output MPROVEMENT OF OUTPUT By means of the FORMT and the LBEL statements, all headlines in a PROC TBULTE can be changed. Furthermore, all data in the table can be printed in a FORMT already defined in the SS System or in formats made by PROC FORMT. 1. Names of variables 2. Values of variables (for CLSS variables) 3. The data part of a PROC TBULTE 4. Names of keywords, ; VR Y; TBLE. ; Quarter first second third fourth FORMT $6. LBEL ='ncome l = 1 Quarter 1 KEYLBEL SUM= 'in 1000 DKK'; ncome in 1000 DKK '" 0'> 0'> VR Y TBLE. LBEL > FORMT -> <- SUM <- < LBEL KEY LBEL FORMT option ps=21; PROC FORMT; VLUE quar l='firstl 2='Second l 3= 1 Third 4=1 Fouth, ; PROC TBULTE DT=sasuser.unemploy TTLE 'mprovement of output'; CLSS year quarter; VR male female run; TBLE quarter all, (male female)*mean; LBEL quarter='quarter l male='male' female='female ' ; KEYLBEL mean='in 1000 persons all='n all'; FORMT quarter quar.;

10 :-: mprovement of output Keylabel problem KEYLBEL PROBLEM Quarter mprovement of output Male Female in 1000 in 1000 persons persons First Second Third Fouth n all VR Y i TBLE, Y interest > Y <+- income procent > SUM! SUM <+- in DKK ( ) f-> re VR Y TBLE =', *MEN=' percentaqe, Y*MEN='in ORR' LBEL ='interest' Y='income' nterest! percentage! ncome in DKK <, '.,".' _,' ',.' _,',,':" :-..!t;":",\.,;..h.;;;:;..;:-.,p.::;:-,i,-i<.'},:;';;"j,h;..i;";::';:).;z;.\oiliwii... ";""-,,':;';'r"_,.;. \. - "' t.,. ;" ",, Ol\iw"..i&i"

11 .lt"mi'>.-qph:hm:"'"''}n::+r!,j'4:. "... M-!.H;:'lLt,... ". ;." ;0".-;"''-'" h"";":;"'l' r -:::'-::':,;?-.''': '-''- c: <,,, :-:-' :-';;::"'7"';"':;-[>';;",.,.,.\,"$." Tabel layout Tabel layout TBEL LYOUT PROC TBULTE applies a cell format of 12.2 and 1/4 of the width of page for the row title space as a standard. 1 ROW TTLE SPCE ' CELL FORMT r - ' 1 option ps=42; PROC FORMT; PCTURE komma LOW-HGH=' ,9 '(mult=10) PROC TBULTE DT=sasuser.unemploy TTLE 'Table layout'; CLSS year quarter; VR male female ; TBLE quarter LL='n all', male='unemployed men' *MEN=' l*f=10.1 female='unemployed women' *MEN= '*f=komma. /RTS=12 BO='in 1000 persons' RUN; FORMT quarter quar. PROC TBULTE FORMT=10.2; <- standard format VR Y TBLE =',, *MEN*F=5. <- format Y*F=15.2 <- format /RTS=l1; Table la!out in 1000 Unemployed Unemployed persons men women First ,8 MEN Y SUM Second ,3 Third ,3 Fouth ,5 n all ,2

12 Page d:imension Page dimension Table layout PGE DMENSON Year 1980 Men Women TBEL, B, Y One table per group First Second Y i Y Y Third Fouth n all B =1982 = Year 1981 Men Women First Second Third Fouth n all option ps=60; PROC TBULTE DT=sasuser.unemploy, TTLE 'Table layout', CLSS year quarter, VR male female ; FORMT quarter quar., LBEL male='men ' female= 'women, year='year'; KEYLBEL all='n all', RUN, TBLE year, quarter all, (male female)*mean=', / box=yage_ condense Year 1982 Men Women First Second Third Fouth n all _.- " _,n..:. _:!.:::.l'';.<!'_;.,;...'_.h:.:h'... ;,i:-:t,;,;.;.;...,._;_... :.;'"''; ".',.,,-;'.7,', ',' "-:"";''''''.:/'':'')m.h..o!;:''','')' i ;';-'ip,j..;'-& > }WSa1:M+"il1t\f!i"%$!lJQ'r;jf?n1i?J.$..

13 !""t:t:-;-s.,,,, :,:?<.';'i,:l'';$t-."ik.-t"''''''':''h-'' :.,..,-,"""<"""!.,".,.,'t.r.:.",!-"r",,-""'"'j"':"_'""i"'"'"l'"1 - '"!.:'i ">';'"!,, T?.t"_''':"''!l.'": :? -.'''.- ;">-_-'C_,-,-! -!.-;.'<!,'","':,,,"'''!,,,,,e',_,... ">.m Page dimension Table layout Year 1983 Men Women First Second Third Fouth n all Year 1984 Men Women! First ' Second Third Fouth n all Year 1985 Men Women First Second Third Fouth n all ,':,:;

Writing Reports with the

Writing Reports with the Writing Reports with the SAS System s TABULATE Procedure or Big Money Proc Tabulate Ben Cochran The Bedford Group bencochran@nc.rr.com Writing Reports with the SAS System s TABULATE Procedure Copyright

More information

Introducing a Colorful Proc Tabulate Ben Cochran, The Bedford Group, Raleigh, NC

Introducing a Colorful Proc Tabulate Ben Cochran, The Bedford Group, Raleigh, NC Paper S1-09-2013 Introducing a Colorful Proc Tabulate Ben Cochran, The Bedford Group, Raleigh, NC ABSTRACT Several years ago, one of my clients was in the business of selling reports to hospitals. He used

More information

Christopher Toppe, Ph.D. Computer Sciences Corporation

Christopher Toppe, Ph.D. Computer Sciences Corporation An Introduction to PROC TABULATE: A Hands-On Workshop Christopher Toppe, Ph.D. Computer Sciences Corporation Abstract The Tabulate procedure is, along with PROC REPORT, one of the most powerful and difficult

More information

M'''~ ft\'''' PRot MEAAJS PROC. Pilot. P2.oc- ME AtJS. Ci+.\.,t.t\ {oy «(0""''" "OUS ) F~EQ. \keel. fo (OM p"te cbsty'.ptl\'t.

M'''~ ft\'''' PRot MEAAJS PROC. Pilot. P2.oc- ME AtJS. Ci+.\.,t.t\ {oy «(0'' OUS ) F~EQ. \keel. fo (OM pte cbsty'.ptl\'t. It),I PRot MEAAJS PROC. Pilot F~EQ \JNI\I~~'An P2.oc- ME AtJS \keel. fo (OM p"te cbsty'.ptl\'t Ci+.\.,t.t\ {oy «(0""''" "OUS ),,\lm,ylt data. ~\.\,\\,(~.I\l,,,.At ~ N. " """9 M'''~ ". ~ \'lo-cla,d tit,

More information

The REPORT Procedure CHAPTER 32

The REPORT Procedure CHAPTER 32 859 CHAPTER 32 The REPORT Procedure Overview 861 Types of Reports 861 A Sampling of Reports 861 Concepts 866 Laying Out a Report 866 Usage of Variables in a Report 867 Display Variables 867 Order Variables

More information

Basic Concepts #6: Introduction to Report Writing

Basic Concepts #6: Introduction to Report Writing Basic Concepts #6: Introduction to Report Writing Using By-line, PROC Report, PROC Means, PROC Freq JC Wang By-Group Processing By-group processing in a procedure step, a BY line identifies each group

More information

It s Proc Tabulate Jim, but not as we know it!

It s Proc Tabulate Jim, but not as we know it! Paper SS02 It s Proc Tabulate Jim, but not as we know it! Robert Walls, PPD, Bellshill, UK ABSTRACT PROC TABULATE has received a very bad press in the last few years. Most SAS Users have come to look on

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

I AlB 1 C 1 D ~~~ I I ; -j-----; ;--i--;--j- ;- j--; AlB

I AlB 1 C 1 D ~~~ I I ; -j-----; ;--i--;--j- ;- j--; AlB PROC TABULATE: CONTROLLNG TABLE APPEARANCE August V. Treff Baltimore City Public Schools Office of Research and Evaluation ABSTRACT Proc Tabulate provides one, two, and three dimensional tables. Tables

More information

Setting the Percentage in PROC TABULATE

Setting the Percentage in PROC TABULATE SESUG Paper 193-2017 Setting the Percentage in PROC TABULATE David Franklin, QuintilesIMS, Cambridge, MA ABSTRACT PROC TABULATE is a very powerful procedure which can do statistics and frequency counts

More information

Chapters 18, 19, 20 Solutions. Page 1 of 14. Demographics from COLLEGE Data Set

Chapters 18, 19, 20 Solutions. Page 1 of 14. Demographics from COLLEGE Data Set 18.2 proc tabulate data=learn.college format=7.; class schoolsize gender scholarship; table schoolsize ALL, gender scholarship ALL; n = ' '; Demographics from COLLEGE Data Set ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ

More information

PROC FORMAT; VALUE RESTF

PROC FORMAT; VALUE RESTF TABULATE Tutorial Alan Eaton and Ken Howell SAS Institute PROC TABULATE displays descriptive statistics arranged in hierarchical tables. This overview illustrates many features by using them in examples.

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

Going Beyond Proc Tabulate Jim Edgington, LabOne, Inc., Lenexa, KS Carole Lindblade, LabOne, Inc., Lenexa, KS

Going Beyond Proc Tabulate Jim Edgington, LabOne, Inc., Lenexa, KS Carole Lindblade, LabOne, Inc., Lenexa, KS Going Beyond Proc Tabulate Jim Edgington, LabOne, Inc., Lenexa, KS Carole Lindblade, LabOne, Inc., Lenexa, KS ABSTRACT PROC Tabulate is one of the most powerful and versatile of the SAS reporting tools.

More information

STAT:5400 Computing in Statistics

STAT:5400 Computing in Statistics STAT:5400 Computing in Statistics Introduction to SAS Lecture 18 Oct 12, 2015 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowaedu SAS SAS is the statistical software package most commonly used in business,

More information

Using PROC TABULATE and ODS Style Options to Make Really Great Tables Wendi L. Wright, CTB / McGraw-Hill, Harrisburg, PA

Using PROC TABULATE and ODS Style Options to Make Really Great Tables Wendi L. Wright, CTB / McGraw-Hill, Harrisburg, PA Using PROC TABULATE and ODS Style Options to Make Really Great Tables Wendi L. Wright, CTB / McGraw-Hill, Harrisburg, PA ABSTRACT We start with an introduction to PROC TABULATE, looking at the basic syntax,

More information

STAT 503 Fall Introduction to SAS

STAT 503 Fall Introduction to SAS Getting Started Introduction to SAS 1) Download all of the files, sas programs (.sas) and data files (.dat) into one of your directories. I would suggest using your H: drive if you are using a computer

More information

IQR = number. summary: largest. = 2. Upper half: Q3 =

IQR = number. summary: largest. = 2. Upper half: Q3 = Step by step box plot Height in centimeters of players on the 003 Women s Worldd Cup soccer team. 157 1611 163 163 164 165 165 165 168 168 168 170 170 170 171 173 173 175 180 180 Determine the 5 number

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

The Utter Simplicity? off The Fina. le TABULATE Procedure - Chapter. Dan Bruns, Tennessee Valley Authority, Chattanooga, TN

The Utter Simplicity? off The Fina. le TABULATE Procedure - Chapter. Dan Bruns, Tennessee Valley Authority, Chattanooga, TN The Utter Simplicity? off The Fina le TABULATE Procedure - Chapter Dan Bruns, Tennessee Valley Authority, Chattanooga, TN n the Beginning Well, here we are again TABULATE fans. believe have exhausted this

More information

EXST3201 Mousefeed01 Page 1

EXST3201 Mousefeed01 Page 1 EXST3201 Mousefeed01 Page 1 3 /* 4 Examine differences among the following 6 treatments 5 N/N85 fed normally before weaning and 85 kcal/wk after 6 N/R40 fed normally before weaning and 40 kcal/wk after

More information

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe

SAS Example A10. Output Delivery System (ODS) Sample Data Set sales.txt. Examples of currently available ODS destinations: Mervyn Marasinghe SAS Example A10 data sales infile U:\Documents\...\sales.txt input Region : $8. State $2. +1 Month monyy5. Headcnt Revenue Expenses format Month monyy5. Revenue dollar12.2 proc sort by Region State Month

More information

1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file

1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file 1 SPSS Guide 2009 Content 1. Basic Steps for Data Analysis. 3 2. Data Editor. 2.4.To create a new SPSS file 3 4 3. Data Analysis/ Frequencies. 5 4. Recoding the variable into classes.. 5 5. Data Analysis/

More information

ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a

ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a ANSWERS -- Prep for Psyc350 Laboratory Final Statistics Part Prep a Put the following data into an spss data set: Be sure to include variable and value labels and missing value specifications for all variables

More information

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall

More information

Salary 9 mo : 9 month salary for faculty member for 2004

Salary 9 mo : 9 month salary for faculty member for 2004 22s:52 Applied Linear Regression DeCook Fall 2008 Lab 3 Friday October 3. The data Set In 2004, a study was done to examine if gender, after controlling for other variables, was a significant predictor

More information

Simulating Multivariate Normal Data

Simulating Multivariate Normal Data Simulating Multivariate Normal Data You have a population correlation matrix and wish to simulate a set of data randomly sampled from a population with that structure. I shall present here code and examples

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

Econ Stata Tutorial I: Reading, Organizing and Describing Data. Sanjaya DeSilva

Econ Stata Tutorial I: Reading, Organizing and Describing Data. Sanjaya DeSilva Econ 329 - Stata Tutorial I: Reading, Organizing and Describing Data Sanjaya DeSilva September 8, 2008 1 Basics When you open Stata, you will see four windows. 1. The Results window list all the commands

More information

4. Descriptive Statistics: Measures of Variability and Central Tendency

4. Descriptive Statistics: Measures of Variability and Central Tendency 4. Descriptive Statistics: Measures of Variability and Central Tendency Objectives Calculate descriptive for continuous and categorical data Edit output tables Although measures of central tendency and

More information

Selected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13.

Selected Introductory Statistical and Data Manipulation Procedures. Gordon & Johnson 2002 Minitab version 13. Minitab@Oneonta.Manual: Selected Introductory Statistical and Data Manipulation Procedures Gordon & Johnson 2002 Minitab version 13.0 Minitab@Oneonta.Manual: Selected Introductory Statistical and Data

More information

PDQ-Notes. Reynolds Farley. PDQ-Note 3 Displaying Your Results in the Expert Query Window

PDQ-Notes. Reynolds Farley. PDQ-Note 3 Displaying Your Results in the Expert Query Window PDQ-Notes Reynolds Farley PDQ-Note 3 Displaying Your Results in the Expert Query Window PDQ-Note 3 Displaying Your Results in the Expert Query Window Most of your options for configuring your query results

More information

EXST SAS Lab Lab #6: More DATA STEP tasks

EXST SAS Lab Lab #6: More DATA STEP tasks EXST SAS Lab Lab #6: More DATA STEP tasks Objectives 1. Working from an current folder 2. Naming the HTML output data file 3. Dealing with multiple observations on an input line 4. Creating two SAS work

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 22: Model Selection In model selection, the idea is to find the smallest set of variables which provides an adequate description of the data. We will consider

More information

Chapter 2. Frequency distribution. Summarizing and Graphing Data

Chapter 2. Frequency distribution. Summarizing and Graphing Data Frequency distribution Chapter 2 Summarizing and Graphing Data Shows how data are partitioned among several categories (or classes) by listing the categories along with the number (frequency) of data values

More information

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening Variables Entered/Removed b Variables Entered GPA in other high school, test, Math test, GPA, High school math GPA a Variables Removed

More information

Hypermarket Retail Analysis Customer Buying Behavior. Reachout Analytics Client Sample Report

Hypermarket Retail Analysis Customer Buying Behavior. Reachout Analytics Client Sample Report Hypermarket Retail Analysis Customer Buying Behavior Report Tools Used: R Python WEKA Techniques Applied: Comparesion Tests Association Tests Requirement 1: All the Store Brand significance to Gender Towards

More information

Epidemiology Principles of Biostatistics Chapter 3. Introduction to SAS. John Koval

Epidemiology Principles of Biostatistics Chapter 3. Introduction to SAS. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 3 John Koval Department of Epidemiology and Biostatistics University of Western Ontario What we will do today We will learn to use use SAS to 1. read

More information

22S:166. Checking Values of Numeric Variables

22S:166. Checking Values of Numeric Variables 22S:1 Computing in Statistics Lecture 24 Nov. 2, 2016 1 Checking Values of Numeric Variables range checks when you know what the range of possible values is for a given quantitative variable internal consistency

More information

Frequency Distributions and Descriptive Statistics in SPS

Frequency Distributions and Descriptive Statistics in SPS 230 Combs Building 859.622.3050 studentcomputing.eku.edu studentcomputing@eku.edu Frequency Distributions and Descriptive Statistics in SPSS In this tutorial, we re going to work through a sample problem

More information

Subject. Creating a diagram. Dataset. Importing the data file. Descriptive statistics with TANAGRA.

Subject. Creating a diagram. Dataset. Importing the data file. Descriptive statistics with TANAGRA. Subject Descriptive statistics with TANAGRA. The aim of descriptive statistics is to describe the main features of a collection of data in quantitative terms 1. The visualization of the whole data table

More information

Lab #1: Introduction to Basic SAS Operations

Lab #1: Introduction to Basic SAS Operations Lab #1: Introduction to Basic SAS Operations Getting Started: OVERVIEW OF SAS (access lab pages at http://www.stat.lsu.edu/exstlab/) There are several ways to open the SAS program. You may have a SAS icon

More information

ANALYSIS OF NOMINAL DATA MULTI-WAY CONTINGENCY TABLE

ANALYSIS OF NOMINAL DATA MULTI-WAY CONTINGENCY TABLE M A T H E M A T I C A L E C O N O M I C S No. 7(14) 2011 ANALYSIS OF NOMINAL DATA MULTI-WAY CONTINGENCY TABLE Abstract. Presented in this paper the method of graphical presentation of the relationship

More information

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball

More information

INTRODUCTION SAS Prepared by A. B. Billings West Virginia University May 1999 (updated August 2006)

INTRODUCTION SAS Prepared by A. B. Billings West Virginia University May 1999 (updated August 2006) INTRODUCTION To SAS Prepared by A. B. Billings West Virginia University May 1999 (updated August 2006) 1 Getting Started with SAS SAS stands for Statistical Analysis System. SAS is a computer software

More information

Opening a Data File in SPSS. Defining Variables in SPSS

Opening a Data File in SPSS. Defining Variables in SPSS Opening a Data File in SPSS To open an existing SPSS file: 1. Click File Open Data. Go to the appropriate directory and find the name of the appropriate file. SPSS defaults to opening SPSS data files with

More information

Tables for Two: An Introduction to the TABULATE Procedure

Tables for Two: An Introduction to the TABULATE Procedure Tables for Two: An Introduction to the TABULATE Procedure Kathy McLeod. Computer Science Corporation ABSTRACT The SASe System includes many powerful tools for summarizing and displaying data. PROe TABULATE

More information

Math 227 EXCEL / MEGASTAT Guide

Math 227 EXCEL / MEGASTAT Guide Math 227 EXCEL / MEGASTAT Guide Introduction Introduction: Ch2: Frequency Distributions and Graphs Construct Frequency Distributions and various types of graphs: Histograms, Polygons, Pie Charts, Stem-and-Leaf

More information

15 Wyner Statistics Fall 2013

15 Wyner Statistics Fall 2013 15 Wyner Statistics Fall 2013 CHAPTER THREE: CENTRAL TENDENCY AND VARIATION Summary, Terms, and Objectives The two most important aspects of a numerical data set are its central tendencies and its variation.

More information

CS1100: Excel Lab 1. Problem 1 (25 Points) Filtering and Summarizing Data

CS1100: Excel Lab 1. Problem 1 (25 Points) Filtering and Summarizing Data CS1100: Excel Lab 1 Filtering and Summarizing Data To complete this assignment you must submit an electronic copy to Blackboard by the due date. Use the data in the starter file. In this lab you are asked

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

1. Make a bar graph in Excel. (1.5 points) Copy the following table into two columns under a blank worksheet in Excel.

1. Make a bar graph in Excel. (1.5 points) Copy the following table into two columns under a blank worksheet in Excel. STAT 243 Lab 3 Rachel Webb 25 points This lab should be done using Microsoft Excel available in all PSU computer labs. A hard copy of your output is to be handed in to during lecture on the due date posted

More information

Quantitative - One Population

Quantitative - One Population Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

Stata versions 12 & 13 Week 4 Practice Problems

Stata versions 12 & 13 Week 4 Practice Problems Stata versions 12 & 13 Week 4 Practice Problems SOLUTIONS 1 Practice Screen Capture a Create a word document Name it using the convention lastname_lab1docx (eg bigelow_lab1docx) b Using your browser, go

More information

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings Statistical Good Practice Guidelines SSC home Using Excel for Statistics - Tips and Warnings On-line version 2 - March 2001 This is one in a series of guides for research and support staff involved in

More information

Week 4: Simple Linear Regression II

Week 4: Simple Linear Regression II Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties

More information

STA 570 Spring Lecture 5 Tuesday, Feb 1

STA 570 Spring Lecture 5 Tuesday, Feb 1 STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row

More information

CHAPTER 2 DESCRIPTIVE STATISTICS

CHAPTER 2 DESCRIPTIVE STATISTICS CHAPTER 2 DESCRIPTIVE STATISTICS 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is how the data is spread or distributed over the range of the data values. This is one of

More information

Multivariate Capability Analysis

Multivariate Capability Analysis Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8

More information

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion

Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion Chapter 2 Ratios, Percents, Simple Equations, and Ratio-Proportion PROBLEM Decimal Fraction Percent Ratio 1. 0.05 2. 3. 45% 4. 1. Complete row 1 in the table above., 5%, 1:20 DIF: Application REF: Ratios

More information

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values

Frequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used

More information

Chapter 1. Looking at Data-Distribution

Chapter 1. Looking at Data-Distribution Chapter 1. Looking at Data-Distribution Statistics is the scientific discipline that provides methods to draw right conclusions: 1)Collecting the data 2)Describing the data 3)Drawing the conclusions Raw

More information

> glucose = c(81, 85, 93, 93, 99, 76, 75, 84, 78, 84, 81, 82, 89, + 81, 96, 82, 74, 70, 84, 86, 80, 70, 131, 75, 88, 102, 115, + 89, 82, 79, 106)

> glucose = c(81, 85, 93, 93, 99, 76, 75, 84, 78, 84, 81, 82, 89, + 81, 96, 82, 74, 70, 84, 86, 80, 70, 131, 75, 88, 102, 115, + 89, 82, 79, 106) This document describes how to use a number of R commands for plotting one variable and for calculating one variable summary statistics Specifically, it describes how to use R to create dotplots, histograms,

More information

Descriptive Statistics: Using Indicator Variables

Descriptive Statistics: Using Indicator Variables Descriptive Statistics: sing ndicator Variables Juliana. a" NC Highway Safety Research Center Tamara R. Fischell, NC Highway Safety Research Center ntroduction ndicator variables have proved useful in

More information

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data Spatial Patterns We will examine methods that are used to analyze patterns in two sorts of spatial data: Point Pattern Analysis - These methods concern themselves with the location information associated

More information

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors

Stat 500 lab notes c Philip M. Dixon, Week 10: Autocorrelated errors Week 10: Autocorrelated errors This week, I have done one possible analysis and provided lots of output for you to consider. Case study: predicting body fat Body fat is an important health measure, but

More information

Laboratory for Two-Way ANOVA: Interactions

Laboratory for Two-Way ANOVA: Interactions Laboratory for Two-Way ANOVA: Interactions For the last lab, we focused on the basics of the Two-Way ANOVA. That is, you learned how to compute a Brown-Forsythe analysis for a Two-Way ANOVA, as well as

More information

Averages and Variation

Averages and Variation Averages and Variation 3 Copyright Cengage Learning. All rights reserved. 3.1-1 Section 3.1 Measures of Central Tendency: Mode, Median, and Mean Copyright Cengage Learning. All rights reserved. 3.1-2 Focus

More information

Week 5: Multiple Linear Regression II

Week 5: Multiple Linear Regression II Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R

More information

SPSS TRAINING SPSS VIEWS

SPSS TRAINING SPSS VIEWS SPSS TRAINING SPSS VIEWS Dataset Data file Data View o Full data set, structured same as excel (variable = column name, row = record) Variable View o Provides details for each variable (column in Data

More information

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users BIOSTATS 640 Spring 2018 Review of Introductory Biostatistics STATA solutions Page 1 of 13 Key Comments begin with an * Commands are in bold black I edited the output so that it appears here in blue Unit

More information

Handling missing values in Analysis

Handling missing values in Analysis Handling missing values in Analysis Before we analyze the data, which includes missing values, we should make sure that all the missing values have been coded as SAS missing values. There are many ways

More information

Factorial ANOVA with SAS

Factorial ANOVA with SAS Factorial ANOVA with SAS /* potato305.sas */ options linesize=79 noovp formdlim='_' ; title 'Rotten potatoes'; title2 ''; proc format; value tfmt 1 = 'Cool' 2 = 'Warm'; data spud; infile 'potato2.data'

More information

Centering and Interactions: The Training Data

Centering and Interactions: The Training Data Centering and Interactions: The Training Data A random sample of 150 technical support workers were first given a test of their technical skill and knowledge, and then randomly assigned to one of three

More information

How to Get Census Data from E-STAT

How to Get Census Data from E-STAT How to Get Census Data from E-STAT Connecting to E-STAT 1) From the Library Home Page, http://library.mcmaster.ca, click on the Databases tab 2) Once on the Databases tab, type in estat and hit the Go

More information

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions) THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination

More information

Chapter One: Getting Started With IBM SPSS for Windows

Chapter One: Getting Started With IBM SPSS for Windows Chapter One: Getting Started With IBM SPSS for Windows Using Windows The Windows start-up screen should look something like Figure 1-1. Several standard desktop icons will always appear on start up. Note

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

Assignment 5.5. Nothing here to hand in

Assignment 5.5. Nothing here to hand in Assignment 5.5 Nothing here to hand in Load the tidyverse before we start: library(tidyverse) ## Loading tidyverse: ggplot2 ## Loading tidyverse: tibble ## Loading tidyverse: tidyr ## Loading tidyverse:

More information

EXAMPLES OF DATA LISTINGS AND CLINICAL SUMMARY TABLES USING PROC REPORT'S BATCH LANGUAGE

EXAMPLES OF DATA LISTINGS AND CLINICAL SUMMARY TABLES USING PROC REPORT'S BATCH LANGUAGE EXAMPLES OF DATA LISTINGS AND CLINICAL SUMMARY TABLES USING PROC REPORT'S BATCH LANGUAGE Rob Hoffman Hoffmann-La Roche, Inc. Abstract PROC REPORT Is a powerful report writing tool which can easily create

More information

Stata v 12 Illustration. First Session

Stata v 12 Illustration. First Session Launch Stata PC Users Stata v 12 Illustration Mac Users START > ALL PROGRAMS > Stata; or Double click on the Stata icon on your desktop APPLICATIONS > STATA folder > Stata; or Double click on the Stata

More information

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016)

CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1. Daphne Skipper, Augusta University (2016) CHAPTER 2: DESCRIPTIVE STATISTICS Lecture Notes for Introductory Statistics 1 Daphne Skipper, Augusta University (2016) 1. Stem-and-Leaf Graphs, Line Graphs, and Bar Graphs The distribution of data is

More information

Compare Linear Regression Lines for the HP-67

Compare Linear Regression Lines for the HP-67 Compare Linear Regression Lines for the HP-67 by Namir Shammas This article presents an HP-67 program that calculates the linear regression statistics for two data sets and then compares their slopes and

More information

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA

LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA LAB 1 INSTRUCTIONS DESCRIBING AND DISPLAYING DATA This lab will assist you in learning how to summarize and display categorical and quantitative data in StatCrunch. In particular, you will learn how to

More information

22s:152 Applied Linear Regression DeCook Fall 2011 Lab 3 Monday October 3

22s:152 Applied Linear Regression DeCook Fall 2011 Lab 3 Monday October 3 s:5 Applied Linear Regression DeCook all 0 Lab onday October The data Set In 004, a study was done to examine if gender, after controlling for other variables, was a significant predictor of salary for

More information

Measures of Position

Measures of Position Measures of Position In this section, we will learn to use fractiles. Fractiles are numbers that partition, or divide, an ordered data set into equal parts (each part has the same number of data entries).

More information

The subject of this chapter is the pivot table, the name given to a special

The subject of this chapter is the pivot table, the name given to a special Chapter 2: Generating Pivot Tables In This Chapter Understanding how to use pivot tables to summarize and analyze your data The many methods for creating pivot tables Pivoting the elements in the data

More information

DATA DEFINITION PHASE

DATA DEFINITION PHASE Twoway Analysis of Variance Unlike previous problems in the manual, the present problem involves two independent variables (gender of juror and type of crime committed by defendant). There are two levels

More information

Repeated Measures Part 4: Blood Flow data

Repeated Measures Part 4: Blood Flow data Repeated Measures Part 4: Blood Flow data /* bloodflow.sas */ options linesize=79 pagesize=100 noovp formdlim='_'; title 'Two within-subjecs factors: Blood flow data (NWK p. 1181)'; proc format; value

More information

Chapter 7 Assignment due Wednesday, May 24

Chapter 7 Assignment due Wednesday, May 24 due Wednesday, May 24 Calculating Probabilities for Normal Distributions Overview What you re going to do in this assignment is use an online applet to calculate: probabilities associated with given -scores

More information

Title. Syntax. svy: tabulate oneway One-way tables for survey data. Basic syntax. svy: tabulate varname. Full syntax

Title. Syntax. svy: tabulate oneway One-way tables for survey data. Basic syntax. svy: tabulate varname. Full syntax Title svy: tabulate oneway One-way tables for survey data Syntax Basic syntax svy: tabulate varname Full syntax svy [ vcetype ] [, svy options ] : tabulate varname [ if ] [ in ] [, tabulate options display

More information

Software Reference Sheet: Inserting and Organizing Data in a Spreadsheet

Software Reference Sheet: Inserting and Organizing Data in a Spreadsheet Inserting and formatting text Software Reference Sheet: Inserting and Organizing Data in a Spreadsheet Column headings are very important to include in your spreadsheet so that you can remember what the

More information

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set.

Measures of Central Tendency. A measure of central tendency is a value used to represent the typical or average value in a data set. Measures of Central Tendency A measure of central tendency is a value used to represent the typical or average value in a data set. The Mean the sum of all data values divided by the number of values in

More information

OUTPUT DELIVERY SYSTEM

OUTPUT DELIVERY SYSTEM Differences in ODS formtting for HTML with Proc Print nd Proc Report Lur L. M. Thornton, USDA-ARS, Animl Improvement Progrms Lortory, Beltsville, MD ABSTRACT While Proc Print is terrific tool for dt checking

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information