Applied Multivariate Analysis
|
|
- Alvin Griffith
- 6 years ago
- Views:
Transcription
1 Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017
2 Choosing Statistical Method
3 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables (Log-linear models) 3 Regression
4 Scales of Measurements Measurement is a process by which numbers or symbols are attached to given characteristics of an object according to predetermined rules. Main scales: Nominal: classification (similar, different) Ordinal: in addition to nominal scale, ordering Interval: in addition to previous, differences between two measurements are meaningful, however no fixed origin (zero point) Ratio: in addition to previous, fixed origin (zero point).
5 Dependent-Independent Variables: Statistical Methods Depending on the scale, multivariate analysis can be applied (roughly) according to the following table in a dependentindependent variable analysis:
6 Dependent-Independent Variables: Statistical Methods Dependent Variable(s) One More than One Metric Nonmetric Metric Nonmetric Indep. vars One Metric Regression Discriminant Canonical Multiple analysis (RA) analysis (DA) correlation DA (MDA) Logistic regression Non- t-test Discrete DA MANOVA Multiple metric (DDA) (MDDA) More Metric Multiple RA DA Canonical MDA correlation Structural equations Non- ANOVA DDA MANOVA MDDA metric Conjoint an.
7 Analysis of Interdependencies Most common methods for analyzing interdependencies (without causal relationships) are: Type of Data No of Variables Metric Nonmetric Two Simple Two-way correlation contingency tables Cluster analysis Loglinear models More Principal Multiway component contingency analysis tables Factor anlaysis Cluster analysis Loglinear models Correspondence analysis
8 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables (Log-linear models) 3 Regression
9 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables (Log-linear models) 3 Regression
10 Analysis of frequency tables Analysis of dependencies between two classification variables can be analyzed using cross-tabulation. X 1 2 c Sum 1 f 11 f 12 f 1c f 1. 2 f 21 f 22 f 2c f 2. Y..... r f r1 f r2 f rc f r. Sum f.1 f.2 f.c n f ij is the number of observations in Y class i and X class j, f.j = r i=1 f ij, f i. = c i=1 f ij, and n = r c i=1 j=1 f ij is the total number of observations.
11 Example 1 (Source: Base SAS Procedures Guide 9.2, Example 3.1): The eye and hair color of children from two different regions of Europe are recorded in the data set Color. Instead of recording one observation per child, the data are recorded as cell counts, where the variable Count contains the number of children exhibiting each of the 15 eye and hair color combinations. The data set does not include missing combinations. data color; input region eyes $ hair $ allows reading several obs per line */ label eyes = Eye Color hair = Hair Color region= Geographic Region ; datalines; 1 blue fair 23 1 blue red 7 1 blue medium 24 1 blue dark 11 1 green fair 19 1 green red 7 1 green medium 18 1 green dark 14 1 brown fair 34 1 brown red 5 1 brown medium 41 1 brown dark 40 1 brown black 3 2 blue fair 46 2 blue red 21 2 blue medium 44 2 blue dark 40 2 blue black 6 2 green fair 50 2 green red 31 2 green medium 37 2 green dark 23 2 brown fair 56 2 brown red 42 2 brown medium 53 2 brown dark 54 2 brown black 13 ;
12 SAS commands The data can be presented in contingency tables Two-way tables: /* depedency of region and eye color */ proc freq data = color; title "Region and Eye Color of European Children"; tables region*eyes /chisq norow nocol nopercent; weight count; /* Note: important to weight by count due to the format of the data */ run;
13 SAS results OUTPUT: Region and Eye Color of European Children ======================================================= Eye Color Geographic Region blue brown green Total Total ====================================================== Statistics for Table of region by eyes ================================================== Statistic DF Value Prob Chi-Square <- Chi-square for independence (not significant) Likelihood Ratio Chi-Square Mantel-Haenszel Chi-Square Phi Coefficient Contingency Coefficient Cramer s V ================================================== Sample Size = 762
14 Exampple: Frequency tables The chi-square value of with 2 degrees of freedom corresponds a p-value of which indicates that there is no convincing empirical evidence of dependence between region and eye color. Similar analysis for hair color (see SAS example on the web page) shows that hair color is related to the region χ 2 (4) = 20.5, with p-value The percentage distributions of the SAS output show that medium hear color is more frequent in region 1 while red is more frequent in region 2. Similar analysis of dependence between hair color and eye color suggest dependence between them, χ 2 (8) = 20.9, p = The major source of the dependence seems to be that dark brown eyes seem to be related to dark hair than other eye colors.
15 Example: Frequency tables Finally we can analyze the relation of eye and hair colors separately in different regions by running three way tables. /* 3-way contingency table */ /* the first variable become a kind of control variable */ proc freq data = color; title "Three-way table of Region, Eye color, and Hair color"; tables region*hair*eyes / chisq norow nocol nopercent; /* Note: tables are formed by region */ weight count; run; The results show that the dependence is in particular in region 2.
16 More advance analysis of frequency tables (Log-linear models) 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables (Log-linear models) 3 Regression
17 More advance analysis of frequency tables (Log-linear models) Intuition of log-linear models Consider two categorial variable A and B and let f ij be the number of observation (frequency) in A s category i and B s category j. Let η ij = E[f ij ] denote the expected value of f ij, i.e., the expected number of observations out of n observations falling to A s class i and B s class j. Denoting further f i. = b j=1 f ij and f.j = a i=1 f ij (a is the number of A categories and b the number of B categories) the marginal totals (frequencies) with expected values η i. and η.j.
18 More advance analysis of frequency tables (Log-linear models) Log linear models Writing η ij = η i. η.j η ij /(η i. η.j ) and taking logarithms, we get log(η ij ) = λ + λ i + λ j + λ ij, (1) where λ is related to average frequency, λ i indicates the marginal contribution of A and λ j indicates B s marginal contribution on the expected frequency, and finally λ ij indicates the joint effect of A and B. In this simple case if λ ij = 0 then variables A and B are independent.
19 More advance analysis of frequency tables (Log-linear models) Log linear models The above generalizes to higher order tables. Consider three variables A, B, and C with f ijk denoting the number observations in cell (i, j, k) (i.e., in the intersection where A s class i, Bs class j, and Cs class k. We are interested on the following models Model A + B + C AB + C AC + B A + BC AB + AC AB + BC AC + BC AB + AC + BC ABC Dependence structure Independence model (only marginal effect) A and B dependent, C independet A and C dependent, B independent A independent, B and C dependent A and B dependent, A and C dependent A and B dependent, B and C dependent A and C dependent, B and C dependent All pairwise dependencies Saturated model
20 More advance analysis of frequency tables (Log-linear models) Example: SAS CATMOD for log linear models See the SAS example on the course web page for an analysis with empirical data of the dependencies in the above table. As a homework, work out also the example on the course web page.
21 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables (Log-linear models) 3 Regression
22 Regression The basic regression model is of the form y i = β 0 + β 1 x i1 + β 2 x i2 + + β p x ip + u i (2) where i is the dependent variable and x ij are explanatory variables, u i is the error term, assumed independently and identically distributed (iid), i = 1,..., n (sample size), j = 1,..., p. Slope coefficient β j indicates the marginal effect of variable x j on the dependent variable (given that the other x-variable do not change (ceteris paribus condition), i.e., if x j changes by one unit y is expected to change by β j units. Note that when transformation are applied on the variables, interpretation of the slope coefficients must be adapted accordingly.
23 Regression Example 2 Using the wage data set referred to on the course web page, estimate we estimate the regression model log(wage) = β 0 +δ 1 female+δ f mfemale+δ mmmale+β 1 educ+β 2 exper+β 3 tenure+u. Questions: 1 Are there wage differences between genders? 2 Is there marriage premiums? 3 Does an additional year of education pay off equally well for women and men?
Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests
Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Example 1: 152 patients were randomly assigned to 4 dose groups in a clinical study. During the course of the study,
More informationFactorial ANOVA. Skipping... Page 1 of 18
Factorial ANOVA The potato data: Batches of potatoes randomly assigned to to be stored at either cool or warm temperature, infected with one of three bacterial types. Then wait a set period. The dependent
More informationDSCI 325: Handout 10 Summarizing Numerical and Categorical Data in SAS Spring 2017
DSCI 325: Handout 10 Summarizing Numerical and Categorical Data in SAS Spring 2017 USING PROC MEANS The routine PROC MEANS can be used to obtain limited summaries for numerical variables (e.g., the mean,
More informationData Mining. 2.4 Data Integration. Fall Instructor: Dr. Masoud Yaghini. Data Integration
Data Mining 2.4 Fall 2008 Instructor: Dr. Masoud Yaghini Data integration: Combines data from multiple databases into a coherent store Denormalization tables (often done to improve performance by avoiding
More informationWeek 4: Simple Linear Regression II
Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties
More informationSPSS Modules Features
SPSS Modules Features Core System Functionality (included in every license) Data access and management Data Prep features: Define Variable properties tool; copy data properties tool, Visual Bander, Identify
More informationSpatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data
Spatial Patterns We will examine methods that are used to analyze patterns in two sorts of spatial data: Point Pattern Analysis - These methods concern themselves with the location information associated
More informationCLUSTER ANALYSIS. V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi
CLUSTER ANALYSIS V. K. Bhatia I.A.S.R.I., Library Avenue, New Delhi-110 012 In multivariate situation, the primary interest of the experimenter is to examine and understand the relationship amongst the
More informationRegression. Dr. G. Bharadwaja Kumar VIT Chennai
Regression Dr. G. Bharadwaja Kumar VIT Chennai Introduction Statistical models normally specify how one set of variables, called dependent variables, functionally depend on another set of variables, called
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationIntroduction to Mixed Models: Multivariate Regression
Introduction to Mixed Models: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #9 March 30, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate
More informationThis code and the crash data set can be found on the course web page.
Homework 2 Solutions, 1. The file crash.dat was obtained from a national data base of automobile crashes.data were selected were from serious accidents in New Jersey in 1999. The data file has one line
More informationWorkload Characterization Techniques
Workload Characterization Techniques Raj Jain Washington University in Saint Louis Saint Louis, MO 63130 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse567-08/
More informationCoding Categorical Variables in Regression: Indicator or Dummy Variables. Professor George S. Easton
Coding Categorical Variables in Regression: Indicator or Dummy Variables Professor George S. Easton DataScienceSource.com This video is embedded on the following web page at DataScienceSource.com: DataScienceSource.com/DummyVariables
More informationWeek 5: Multiple Linear Regression II
Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R
More informationTopic 3: GIS Models 10/2/2017. What is a Model? What is a GIS Model. Geography 38/42:477 Advanced Geomatics
Geography 38/42:477 Advanced Geomatics Topic 3: GIS Models What is a Model? Simplified representation of real world Physical, Schematic, Mathematical Map GIS database Reduce complexity and help us understand
More informationStrategies for Modeling Two Categorical Variables with Multiple Category Choices
003 Joint Statistical Meetings - Section on Survey Research Methods Strategies for Modeling Two Categorical Variables with Multiple Category Choices Christopher R. Bilder Department of Statistics, University
More informationStatistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975.
Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975. SPSS Statistics were designed INTRODUCTION TO SPSS Objective About the
More informationSTA 570 Spring Lecture 5 Tuesday, Feb 1
STA 570 Spring 2011 Lecture 5 Tuesday, Feb 1 Descriptive Statistics Summarizing Univariate Data o Standard Deviation, Empirical Rule, IQR o Boxplots Summarizing Bivariate Data o Contingency Tables o Row
More informationCell means coding and effect coding
Cell means coding and effect coding /* mathregr_3.sas */ %include 'readmath.sas'; title2 ''; /* The data step continues */ if ethnic ne 6; /* Otherwise, throw the case out */ /* Indicator dummy variables
More informationST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.
ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball
More informationLab #3: Probability, Simulations, Distributions:
Lab #3: Probability, Simulations, Distributions: A. Objectives: 1. Reading from an external file 2. Create contingency table 3. Simulate a probability distribution 4. The Uniform Distribution Reading from
More informationBase package The Base subscription includes the following features:
IBM SPSS Statistics Subscription: Base, add-ons and features 1 2 3 Base package The Base subscription includes the following features: Data access and management Compare two data files for compatibility
More informationGeneral Factorial Models
In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 34 It is possible to have many factors in a factorial experiment. In DDD we saw an example of a 3-factor study with ball size, height, and surface
More informationGeneral Factorial Models
In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 1 1 / 31 It is possible to have many factors in a factorial experiment. We saw some three-way factorials earlier in the DDD book (HW 1 with 3 factors:
More informationNuts and Bolts Research Methods Symposium
Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Topics to Discuss: Types of Variables Constructing a Variable Code Book Developing Excel Spreadsheets
More informationOrganizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013
Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing
More informationApplied Multivariate Analysis
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Cluster Analysis Background 1 Cluster analysis Background Distance data Background Example 1 Consider the following data
More informationHierarchical Generalized Linear Models
Generalized Multilevel Linear Models Introduction to Multilevel Models Workshop University of Georgia: Institute for Interdisciplinary Research in Education and Human Development 07 Generalized Multilevel
More informationMultiple Regression White paper
+44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms
More informationLoglinear and Logit Models for Contingency Tables
Chapter 8 Loglinear and Logit Models for Contingency Tables Loglinear models comprise another special case of generalized linear models designed for contingency tables of frequencies. They are most easily
More informationSAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples
Paper SD-03 SAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples Patricia Rodríguez de Gil, Jeanine Romano Thanh Pham, Diep Nguyen, Jeffrey
More informationJMP Book Descriptions
JMP Book Descriptions The collection of JMP documentation is available in the JMP Help > Books menu. This document describes each title to help you decide which book to explore. Each book title is linked
More informationIntermediate SAS: Statistics
Intermediate SAS: Statistics OIT TSS 293-4444 oithelp@mail.wvu.edu oit.wvu.edu/training/classmat/sas/ Table of Contents Procedures... 2 Two-sample t-test:... 2 Paired differences t-test:... 2 Chi Square
More informationWeek 11: Interpretation plus
Week 11: Interpretation plus Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline A bit of a patchwork
More informationIBM SPSS Statistics Traditional License packages and features
IBM SPSS Statistics Traditional License packages and features 1 2 3 The includes the following features: Data access and management Compare two data files for compatibility Data prep features: Define Variable
More informationAnalysis of Complex Survey Data with SAS
ABSTRACT Analysis of Complex Survey Data with SAS Christine R. Wells, Ph.D., UCLA, Los Angeles, CA The differences between data collected via a complex sampling design and data collected via other methods
More informationCorrectly Compute Complex Samples Statistics
SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationTeaching students quantitative methods using resources from the British Birth Cohorts
Centre for Longitudinal Studies, Institute of Education Teaching students quantitative methods using resources from the British Birth Cohorts Assessment of Cognitive Development through Childhood CognitiveExercises.doc:
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationIBM SPSS Categories. Predict outcomes and reveal relationships in categorical data. Highlights. With IBM SPSS Categories you can:
IBM Software IBM SPSS Statistics 19 IBM SPSS Categories Predict outcomes and reveal relationships in categorical data Highlights With IBM SPSS Categories you can: Visualize and explore complex categorical
More informationPredict Outcomes and Reveal Relationships in Categorical Data
PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,
More informationSAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/
STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape
More informationLoglinear Models for Categorical Data. Michael Friendly
Admit?: Yes Sex: Male 1198 1493 Admit?: No Right Eye Grade High 2 3 Unaided distant vision data Brown Hazel Green Blue -3.1 4.4 2.3 7.0 557 1278 Sex: Female Low High 2 3 Left Eye Grade Low -2.2-5.9 Black
More informationLog-linear Models of Contingency Tables: Multidimensional Tables
CSSS/SOC/STAT 536: Logistic Regression and Log-linear Models Log-linear Models of Contingency Tables: Multidimensional Tables Christopher Adolph University of Washington, Seattle March 10, 2005 Assistant
More informationUsing SAS Macros to Extract P-values from PROC FREQ
SESUG 2016 ABSTRACT Paper CC-232 Using SAS Macros to Extract P-values from PROC FREQ Rachel Straney, University of Central Florida This paper shows how to leverage the SAS Macro Facility with PROC FREQ
More informationCorrectly Compute Complex Samples Statistics
PASW Complex Samples 17.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationStat 342 Exam 3 Fall 2014
Stat 34 Exam 3 Fall 04 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed There are questions on the following 6 pages. Do as many of them as you can
More informationWant to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research
Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The
More informationData Mining: Concepts and Techniques. (3 rd ed.) Chapter 3. Chapter 3: Data Preprocessing. Major Tasks in Data Preprocessing
Data Mining: Concepts and Techniques (3 rd ed.) Chapter 3 1 Chapter 3: Data Preprocessing Data Preprocessing: An Overview Data Quality Major Tasks in Data Preprocessing Data Cleaning Data Integration Data
More information186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95
A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical
More informationLecture 1: Statistical Reasoning 2. Lecture 1. Simple Regression, An Overview, and Simple Linear Regression
Lecture Simple Regression, An Overview, and Simple Linear Regression Learning Objectives In this set of lectures we will develop a framework for simple linear, logistic, and Cox Proportional Hazards Regression
More informationInference for loglinear models (contd):
Stat 504, Lecture 25 1 Inference for loglinear models (contd): Loglinear/Logit connection Intro to Graphical Models Stat 504, Lecture 25 2 Loglinear Models no distinction between response and explanatory
More informationMathematics (JUN11MPC201) General Certificate of Education Advanced Subsidiary Examination June Unit Pure Core TOTAL
Centre Number Candidate Number For Examiner s Use Surname Other Names Candidate Signature Examiner s Initials Mathematics Unit Pure Core 2 Wednesday 18 May 2011 General Certificate of Education Advanced
More informationSPSS QM II. SPSS Manual Quantitative methods II (7.5hp) SHORT INSTRUCTIONS BE CAREFUL
SPSS QM II SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform some statistical analyses in SPSS. Details around a certain function/analysis method not covered
More informationFrequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM
Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.
More information8. MINITAB COMMANDS WEEK-BY-WEEK
8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are
More informationLIST OF TABLES. Page Title No.
LIST OF TABLES Table 1.1 Growth in Total Subscriber Base of Telecom Industry 5 1.2 Growth in Tele-density 11 2.1 Top 10 Countries with the number of Mobile Phone Subscribers 2011 23 2.2 World Wide Market
More informationStudy Guide. Module 1. Key Terms
Study Guide Module 1 Key Terms general linear model dummy variable multiple regression model ANOVA model ANCOVA model confounding variable squared multiple correlation adjusted squared multiple correlation
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data so that it is possible to get a general overview of the results. Remember,
More informationMathematics MPC2. General Certificate of Education Advanced Subsidiary Examination. Unit Pure Core 2
Centre Number Candidate Number For Examiner s Use Surname Other Names Candidate Signature Examiner s Initials Mathematics Unit Pure Core 2 General Certificate of Education Advanced Subsidiary Examination
More informationFrequency Tables. Chapter 500. Introduction. Frequency Tables. Types of Categorical Variables. Data Structure. Missing Values
Chapter 500 Introduction This procedure produces tables of frequency counts and percentages for categorical and continuous variables. This procedure serves as a summary reporting tool and is often used
More informationAdvanced Analytics with Enterprise Guide Catherine Truxillo, Ph.D., Stephen McDaniel, and David McNamara, SAS Institute Inc.
Advanced Analytics with Enterprise Guide Catherine Truxillo, Ph.D., Stephen McDaniel, and David McNamara, SAS Institute Inc., Cary, NC ABSTRACT From SAS/STAT to SAS/ETS to SAS/QC to SAS/GRAPH, Enterprise
More informationChapter 2. Introduction to SAS. 2.1 The Four Main File Types
Chapter 2 Introduction to SAS SAS stands for Statistical Analysis System. Even though it runs on linux and Windows PCs as well as on bigger computers, it is truly the last of the great old mainframe statistical
More information1. Solve the following system of equations below. What does the solution represent? 5x + 2y = 10 3x + 5y = 2
1. Solve the following system of equations below. What does the solution represent? 5x + 2y = 10 3x + 5y = 2 2. Given the function: f(x) = a. Find f (6) b. State the domain of this function in interval
More informationSAS Macros for Binning Predictors with a Binary Target
ABSTRACT Paper 969-2017 SAS Macros for Binning Predictors with a Binary Target Bruce Lund, Magnify Analytic Solutions, Detroit MI, Wilmington DE, Charlotte NC Binary logistic regression models are widely
More informationBivariate (Simple) Regression Analysis
Revised July 2018 Bivariate (Simple) Regression Analysis This set of notes shows how to use Stata to estimate a simple (two-variable) regression equation. It assumes that you have set Stata up on your
More informationStatistical graphics in analysis Multivariable data in PCP & scatter plot matrix. Paula Ahonen-Rainio Maa Visual Analysis in GIS
Statistical graphics in analysis Multivariable data in PCP & scatter plot matrix Paula Ahonen-Rainio Maa-123.3530 Visual Analysis in GIS 11.11.2015 Topics today YOUR REPORTS OF A-2 Thematic maps with charts
More informationSAS (Statistical Analysis Software/System)
SAS (Statistical Analysis Software/System) SAS Adv. Analytics or Predictive Modelling:- Class Room: Training Fee & Duration : 30K & 3 Months Online Training Fee & Duration : 33K & 3 Months Learning SAS:
More informationDATA CLASSIFICATORY TECHNIQUES
DATA CLASSIFICATORY TECHNIQUES AMRENDER KUMAR AND V.K.BHATIA Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 akjha@iasri.res.in 1. Introduction Rudimentary, exploratory
More informationProduct Catalog. AcaStat. Software
Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,
More informationMultivariate Capability Analysis
Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8
More informationPSY 9556B (Feb 5) Latent Growth Modeling
PSY 9556B (Feb 5) Latent Growth Modeling Fixed and random word confusion Simplest LGM knowing how to calculate dfs How many time points needed? Power, sample size Nonlinear growth quadratic Nonlinear growth
More informationChapter 1. Using the Cluster Analysis. Background Information
Chapter 1 Using the Cluster Analysis Background Information Cluster analysis is the name of a multivariate technique used to identify similar characteristics in a group of observations. In cluster analysis,
More informationSubset Selection in Multiple Regression
Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that
More informationIntroductory Applied Statistics: A Variable Approach TI Manual
Introductory Applied Statistics: A Variable Approach TI Manual John Gabrosek and Paul Stephenson Department of Statistics Grand Valley State University Allendale, MI USA Version 1.1 August 2014 2 Copyright
More informationCREATING SIMULATED DATASETS Edition by G. David Garson and Statistical Associates Publishing Page 1
Copyright @c 2012 by G. David Garson and Statistical Associates Publishing Page 1 @c 2012 by G. David Garson and Statistical Associates Publishing. All rights reserved worldwide in all media. No permission
More informationStatistical Methods for the Analysis of Repeated Measurements
Charles S. Davis Statistical Methods for the Analysis of Repeated Measurements With 20 Illustrations #j Springer Contents Preface List of Tables List of Figures v xv xxiii 1 Introduction 1 1.1 Repeated
More informationJMP Chong Ho
JMP Interface: ipod of statistical software Chong Ho Yu, Ph.D. (2012) cyu@apu.edu www.creative wisdom.com JMP is software package created by SAS Institute for data visualization and exploratory data analysis.
More informationData analysis using Microsoft Excel
Introduction to Statistics Statistics may be defined as the science of collection, organization presentation analysis and interpretation of numerical data from the logical analysis. 1.Collection of Data
More informationHandling missing values in Analysis
Handling missing values in Analysis Before we analyze the data, which includes missing values, we should make sure that all the missing values have been coded as SAS missing values. There are many ways
More informationModelling Proportions and Count Data
Modelling Proportions and Count Data Rick White May 4, 2016 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:
More informationbook 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition
book 2014/5/6 15:21 page v #3 Contents List of figures List of tables Preface to the second edition Preface to the first edition xvii xix xxi xxiii 1 Data input and output 1 1.1 Input........................................
More informationBluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition
Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created
More informationSTAT 3304/5304 Introduction to Statistical Computing. Introduction to SAS
STAT 3304/5304 Introduction to Statistical Computing Introduction to SAS What is SAS? SAS (originally an acronym for Statistical Analysis System, now it is not an acronym for anything) is a program designed
More informationAnalysis of Variance in R
nalysis of Variance in R Dale arr R Training: University of Glasgow Dale arr (R Training: University of Glasgow) nalysis of Variance in R 1 / 19 When is NOV applicable? When you wish to assess the independent/joint
More information- 1 - Fig. A5.1 Missing value analysis dialog box
WEB APPENDIX Sarstedt, M. & Mooi, E. (2019). A concise guide to market research. The process, data, and methods using SPSS (3 rd ed.). Heidelberg: Springer. Missing Value Analysis and Multiple Imputation
More informationGET A GRIP ON MACROS IN JUST 50 MINUTES! Arthur Li, City of Hope Comprehensive Cancer Center, Duarte, CA
GET A GRIP ON MACROS IN JUST 50 MINUTES! Arthur Li, City of Hope Comprehensive Cancer Center, Duarte, CA ABSTRACT The SAS macro facility, which includes macro variables and macro programs, is the most
More informationAn introduction to SPSS
An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible
More informationUNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS International General Certificate of Secondary Education MATHEMATICS
UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS International General Certificate of Secondary Education MATHEMATICS 0580/04 0581/04 Paper 4 (Extended) October/November 2004 Additional Materials: Answer
More informationSimulation: Solving Dynamic Models ABE 5646 Week 12, Spring 2009
Simulation: Solving Dynamic Models ABE 5646 Week 12, Spring 2009 Week Description Reading Material 12 Mar 23- Mar 27 Uncertainty and Sensitivity Analysis Two forms of crop models Random sampling for stochastic
More informationQuick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018
Quick Start Guide Jacob Stolk PhD Simone Stolk MPH November 2018 Contents Introduction... 1 Start DIONE... 2 Load Data... 3 Missing Values... 5 Explore Data... 6 One Variable... 6 Two Variables... 7 All
More informationModelling Proportions and Count Data
Modelling Proportions and Count Data Rick White May 5, 2015 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:
More informationFirst steps in SPSS. Figure 1
First steps in SPSS Statistical Package for Social Science (SPSS) is a computer program, working with the Windows operating system, and is specialized in the classification, processing and analysis of
More informationData Mining. ❷Chapter 2 Basic Statistics. Asso.Prof.Dr. Xiao-dong Zhu. Business School, University of Shanghai for Science & Technology
❷Chapter 2 Basic Statistics Business School, University of Shanghai for Science & Technology 2016-2017 2nd Semester, Spring2017 Contents of chapter 1 1 recording data using computers 2 3 4 5 6 some famous
More informationLabor Economics with STATA. Estimating the Human Capital Model Using Artificial Data
Labor Economics with STATA Liyousew G. Borga December 2, 2015 Estimating the Human Capital Model Using Artificial Data Liyou Borga Labor Economics with STATA December 2, 2015 84 / 105 Outline 1 The Human
More informationBox-Cox Transformation for Simple Linear Regression
Chapter 192 Box-Cox Transformation for Simple Linear Regression Introduction This procedure finds the appropriate Box-Cox power transformation (1964) for a dataset containing a pair of variables that are
More informationStatistics Lab #7 ANOVA Part 2 & ANCOVA
Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")
More information1. Basic Steps for Data Analysis Data Editor. 2.4.To create a new SPSS file
1 SPSS Guide 2009 Content 1. Basic Steps for Data Analysis. 3 2. Data Editor. 2.4.To create a new SPSS file 3 4 3. Data Analysis/ Frequencies. 5 4. Recoding the variable into classes.. 5 5. Data Analysis/
More informationThe DMSPLIT Procedure
The DMSPLIT Procedure The DMSPLIT Procedure Overview Procedure Syntax PROC DMSPLIT Statement FREQ Statement TARGET Statement VARIABLE Statement WEIGHT Statement Details Examples Example 1: Creating a Decision
More information