Session 8. Statistical analysis Using Gauss Applications
- Jennifer Ward
- 6 years ago
1. Descriptive Statistics
   Example: Frequencies
   Example: Histogram
2. Linear Regression
   Linear regression Options
   Practical Session 8a
3. Quantal Response
   Multinomial logit 1
   Multinomial logit 2
   Options for Ordered (see src directory)
   Example for ordered logit
   Example for ordered probit
   Example for probit
   Example for poisson
   Example for Qtest
   Practical Session 8b
1. Descriptive Statistics

The Descriptive Statistics module is a set of procedures which generate basic sample statistics of the variables in a given GAUSS data set. These statistics describe the numerical characteristics of the random variables, and provide information for further statistical analysis.

In GAUSS you can write functions or procedures which can be accessed by your programs as if they were GAUSS functions. To maintain control over the names, GAUSS uses the library keyword. The gauss and user libraries are always active, while all other libraries must be activated with the library command, as in:

    library dstat;

This statement opens the dstat library, which contains functions for the description of data. Note that library statements are not cumulative, i.e. a subsequent library statement inactivates previously activated libraries (except for gauss and user).

Example: Frequencies

    library dstat;
    dstatset;
    title="fr1.e: without weights";
    dataset = "d:/gauss/examples/freq";
    output file = fr1.out reset;
    { cats,ncats,freqs } = freq(dataset,1|2|3);

Example: Histogram

    library dstat,pgraph;
    dstatset;
    graphset;
    title="fr4.e: Using GETFREQ and HISTF";
    dataset = "d:/gauss/examples/freq";
    output file = fr4.out reset;
    miss = 1;
    { cats,ncats,freqs } = freq(dataset,1|2|3);
    { f,c } = getfreq(1,cats,ncats,freqs);

If miss were 0 and missing were a category with cases, you must run:

    c = packr(c~f);
    f = c[.,2];
    c = c[.,1];

before running histf(f,c):

    histf(f,c);

2. Linear Regression

    library lr;
    dataset = "d:/gauss/examples/scigau";
    datalist ^dataset;
    dep = { pub3 };
    indep = { const,pub1,cit1,cit3 };
    call lreg(dataset,dep,indep,0);
    indep = { pub1,cit1,cit3 };
    end;

Linear regression Options

LREG

Purpose: To compute ordinary least squares coefficients.

Format: Q = LREG(dataset,depvar,indvars,Restrict)

Input:

  dataset -- string, name of GAUSS data set.

  depvar -- character, name of the dependent variable.
    Example: depvar = { consume };

  indvars -- character vector of all independent variable names. If a constant term is needed, specify "CONST" in the indvars list.
    Example: rhs = { const,p,plag,income };

  Restrict -- string, constraint information on parameters to perform restricted estimation. The syntax of Restrict is as follows:

    Restrict="rest1, rest2,..., restn";

More than one restriction is allowed provided each is separated by commas. Each restriction must be written as a linear equation with all variables on the left-hand side and the constant on the right-hand side (e.g., x1+x2=1).
Variables shown in each restriction must be variables on the right-hand side of the model. Restrictions in the RESTRICT argument must be consistent and not redundant, otherwise error messages will be given. Users should note that only the parameters associated with the variables are restricted, and not the variables in the model themselves.

Examples of some VALID restrict arguments:

    1) Restrict="x3+x4+x5=1";
    2) Restrict="constant=0, x4=0, x5=0, x6=0";

Examples of some INVALID restrict arguments:

    1) Restrict="x2-x3=0,x3-x4=0,x2-x4=0";
    2) Restrict="x2-x3=0,x2-x4=0,x3-x4=1";

Example 1 is invalid because its restrictions are redundant, and example 2 because they are inconsistent.

Output:

  Q -- a "COMPACT" output vector containing all calculated statistics. See manual for more details on extracting information from it. Variables contained in Q are:

    nms  -- names of the regressors.
    b    -- regression coefficients.
    vc   -- variance-covariance matrix of b.
    se   -- standard errors of b.
    s2   -- variance of the error.
    cx   -- correlation matrix of b.
    rsq  -- coefficient of determination.
    rbsq -- adjusted R-squared.
    dw   -- Durbin-Watson statistic.
    nobs -- number of observations.
    xtx  -- cross-moment matrix of X.
    sse  -- residual sum of squares.

Globals:

  _lregcol -- scalar. If 1, perform collinearity diagnostics. See manual for more details. Default = 0.

  _lreghc -- scalar. If 1, the heteroskedastic-consistent covariance matrix estimator will be calculated. Default = 0.

  _lregres -- string, a file name to request influence diagnostics. Statistics generated from the diagnostics are saved under this file name. Besides the diagnostic statistics, the predicted values, dependent variable and independent
variables are also saved. They are saved in the following order:

    COL.  NAME       DESCRIPTIONS
    1     RES        Residuals = (observed-predicted)
    2     HAT        Hat Matrix Values
    3     SRES       Standardized Residuals
    4     RSTUDENT   Studentized Residuals
    5     COOK       Cook Influence Statistics
    6     YHAT       Predicted Values
    7     <depname>  Dependent Variable
    8 +   <indname>  Independent Variables

  _lrpcor -- scalar. If 1, print the correlation matrix of coefficients. Default = 0.

  _lrpcov -- scalar. If 1, print the covariance matrix of coefficients. Default = 0.

  range -- a 2 x 1 vector. Specifies the range of the data set to be used in estimation. The first element specifies the beginning observation while the second element specifies the ending observation. Example: range = { 100,200 }. Default is { 0,0 }, which uses the whole data set.

  output -- scalar. If nonzero, results are printed. Default = 2.

  weight -- string, name of the weight variable. By default, unweighted least squares will be calculated. Users should realize that the weights are assumed to be inversely proportional to the error variances and greater than zero. Details are given in the manual.

  title -- string, message printed at the top of the results. Default = "";

Practical Session 8a

1. Repeat the above analysis and include all other variables in the model, with and without the constant. Remove the non-significant ones. By changing the options, impose a restriction on the covariates, perform collinearity diagnostics, calculate the heteroskedastic-consistent covariance matrix estimator, and print the covariance matrix of coefficients.
2. The GAUSS data set solve3.dat contains the time taken to solve four block design problems by 24 fifth-grade children. EFT is the value for the embedded figures test, a measure of difficulty in abstracting the logical structure of a problem from its context, and COR_GRP is the (group) classification by type of problems presented first, i.e. those solved by row (group 1) or formation strategy. The solv3.prg program fits a linear model to these data and tests for heteroskedasticity. Modify this program and fit a linear model with interaction only.

3. Quantal Response

The quantal response models are special regression models in which the dependent variable is qualitative in some way. The common structure of these models is to relate the conditional probability of each response to some exogenous variables. However, using simple regression procedures such as OLS to estimate a quantal response model is inappropriate, because of heteroskedasticity and other statistical problems. The quantal response module is a statistical package which provides a set of procedures for estimating these models. It offers the following procedures for different quantal response model specifications:

1. LOGIT    Estimates the multinomial logit model.
2. ORDERED  Estimates the ordered logit or ordered probit model.
3. PROBIT   Estimates the binomial probit model.
4. PSNREG   Estimates the Poisson regression model.
5. QTEST    Performs linear hypothesis testing for a logit or probit model.

Here are some examples.

Multinomial logit 1

LGTALD2.E: Logit analysis of the Aldrich and Nelson (p. 63) data for a dichotomous dependent variable.

    library quantal;
    quantset;
    output file = lgtald2.out reset;
    _qrcatnm = { GRD=A, GRD=B, GRD=C };
    dsn = "d:/gauss/examples/aldnel";
    datalist ^dsn;
    dv = { abc };
    iv = { gpa, tuce, psi };
    { vnam,b,vc,n,pct,mn,sd,fit,df,tol } = logit(dsn,dv,iv);
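To make the phrase "relate the conditional probability of each response to some exogenous variables" concrete, the following is a minimal Python sketch of the usual multinomial logit probabilities. It is not the LOGIT procedure itself: the function name, the coefficient values and the choice of the last category as the reference (all its coefficients fixed at zero) are assumptions made for illustration only.

```python
import math

def multinomial_logit_probs(x, betas):
    """Return P(y = j | x) for each outcome category.

    x     -- list of regressors, including a leading 1.0 for the constant
    betas -- one coefficient list per non-reference category; the last
             category is assumed to be the reference (coefficients all 0)
    """
    # Linear index for each category; the reference category gets 0.
    scores = [sum(b_k * x_k for b_k, x_k in zip(b, x)) for b in betas] + [0.0]
    # Subtract the maximum before exponentiating to avoid overflow.
    m = max(scores)
    expo = [math.exp(s - m) for s in scores]
    total = sum(expo)
    return [e / total for e in expo]

# Hypothetical regressor values and coefficients for a three-category outcome.
x = [1.0, 3.2, 20.0, 1.0]
betas = [[-2.0, 0.5, 0.02, 0.3], [0.5, -0.1, 0.01, 0.0]]
probs = multinomial_logit_probs(x, betas)
print([round(p, 3) for p in probs])
```

The probabilities always sum to one, and with all coefficients set to zero every category is equally likely, which is a quick sanity check on any fitted coefficient vector.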
Multinomial logit 2

LGTDEL5.E: Logit analysis of the NORC data using DATALOOP to delete the fifth category.

    library quantal;
    quantset;
    datalist d:/gauss/examples/norc;
    dataloop d:/gauss/examples/norc tem;
    delete depvar == 5;
    endata;
    datalist tem;
    output file = lgtdel5.out reset;
    title="lgtdel5.e: NORC data. Deleting fifth category.";
    dsn = "tem";
    dv = 1;
    iv = { 2, 3, 4 };
    _qrcatnm = { Menial, Blue_Col, Craft, Whte_Col, Prof };
    { vnam,b,vc,n,pct,mn,sd,fit,df,tol } = logit(dsn,dv,iv);

Options for Ordered (see src directory)

ordered.src - Ordered Logit and Probit Analysis

(C) Copyright Aptech Systems, Inc. All Rights Reserved. This Software Product is PROPRIETARY SOURCE CODE OF APTECH SYSTEMS, INC. This File Header must accompany all files using any portion, in whole or in part, of this Source Code. In addition, the right to create such files is strictly limited by Section 2.A. of the GAUSS Applications License Agreement accompanying this Software Product. If you wish to distribute any portion of the proprietary Source Code, in whole or in part, you must first obtain written permission from Aptech Systems.

Purpose: To estimate the ordered probit or logit model using a GAUSS data set. By default the ordered probit model is estimated. The ordered logit model is estimated by setting _QRLOGIT to 1.

Format: { vnames,b,vc,ndtran,pct,meanx,sdx,fit,df,tol } = ORDERED(dataset,depvar,indvars);
Input:

  dataset -- string, name of data file.

  depvar -- string, name of dependent variable
    - or -
    scalar, index of dependent variable.
    The value of depvar will be truncated before analysis. Thus, 1.4 is treated as category 1.

  indvars -- Kx1 character vector, names of independent variables
    - or -
    Kx1 numeric vector, indices of independent variables.
    The program adds one variable for the constant term.

Defaults are provided for the following global input variables. They can be ignored unless you need control over the other options provided by this procedure.

WARNING: If you change the defaults in a command file, the new values will apply in the next program you run using ORDERED unless you change them back. This can be done by running QUANTSET.

  altnam -- provides alternative names for the variables.
    if 0 (default), the original names of the variables are used.
    if a ((1+NIVAR)x1) character vector, the first name in this vector will be used to label the dependent variable and the remaining NIVAR names will be used to label the independent variables.

  miss -- global scalar, default 1.
    if 0, there are no missing values (fastest).
    if 1, do listwise deletion: drop an observation if there are any missing values among the independent and dependent variables.

  output -- global scalar, default 1.
    if 1, sends results to the output device (including the screen).
    if 0, no information is sent to output.

  range -- 2x1 vector. The range of records in the data set used for analysis. The first element is the starting row index, the second element is the ending row index. Default is the whole dataset.
  row -- global scalar, default 0.
    if 0, the number of rows to read per iteration of the read loop is calculated by the program.
    if not 0, the specified number of rows will be read.

  _dtsel -- global scalar, default 0.
    if 0, all cases are selected for analysis.
    if Kx3, cases are selected into samples according to specified conditions. See DTRAN for details.

  tol -- global scalar controlling the iterations. tol indicates the maximum difference between estimates of the coefficients in two adjacent iterations.

  _qrcatnm -- NCATx1 character vector of names of outcome categories
    - or -
    default scalar 0, in which case names CAT1, CAT2,... are used.

  _qrfit -- global scalar, default 0.
    if 1, print detailed goodness of fit measures, including table of observed and predicted outcomes.
    if 0, only print chi-square, -2*log-likelihood and percent correctly predicted.

  _qriter -- global scalar, default 0.
    if 0, do not print information on iterations.
    if 1, send detailed information on iterations to the screen but not to the output device.
    if 2, send detailed information on iterations to the output device.

  _qrlogit -- global scalar, default 0.
    if 1, the ordered logit model is estimated.
    if 0, the ordered probit model is estimated.

  _qrpred -- global scalar, default 0.
    if 0, predicted values will not be written to disk.
    if not 0, predicted probabilities for each outcome category are written to file ^_qrpred with NCAT+1 variables. The first NCAT are PRED1,PRED2,...,PREDNCAT. The last variable is the variable
defined by the variable depvar.

  _qrpredn -- string, name of dataset for predicted values. The default name is "_qrpred".

  _qrstart -- global scalar, default 0.
    if 0, do not use user supplied start values.
    if not 0, user should provide a (NCAT-1+NIVAR) vector of start values. First, provide start values for the intercepts, then the slopes.

  _qrstat -- global scalar, default 0.
    if 0, do not print descriptive statistics.
    if 1, print descriptive statistics.

ORDERED uses the method of scoring for estimation, with squeezes. Squeezes are controlled with these globals:

  _qrsqtol -- global scalar, default .01. When the proportional change in the likelihood function is smaller than _qrsqtol or the change in the likelihood function is in the wrong direction, take a squeeze.

  _qrnsqz0 -- global scalar, default 0.
    if 0, squeezes will not be computed until changes in the likelihood function from one iteration to the next become small.
    if not 0, the program will take up to that number of squeezes per iteration starting with the first iteration. Since squeezes take time and are less effective when estimates are far from the converged values, it is generally best to leave this as 0.

  _qrsqz -- global scalar, default 0.
    if 0, don't take squeezes until the change in the likelihood function is small.
    if 1, consider taking squeezes from the first iteration.

  _qrnsqz1 -- global scalar, default 10. When squeezes begin, this is the maximum number of squeezes that will be taken before proceeding to the next iteration.
  _qrmiter -- maximum number of iterations, default =

Output:

  vnames -- a (K+2)x1 character vector containing the names of the variables in the model. The order is: depvar "CONSTANT" indvars.

  b -- an NPARM=(NCAT-1)*(K+1) vector of parameter estimates in the order: intercepts var1 var2...vark. For each variable the parameters are in the order comparing the first category to NCAT, the second to NCAT, ..., NCAT-1 to NCAT. See below for details.

    If errors are encountered a message will be sent to the error log. Also, b will contain a scalar error code. This code appears as missing unless it is translated with the command scalerr(b). The codes are defined as:

      1   data file not found
      2   found undefined variables
      30  system singular
      31  too few nonmissing observations
      71  number of categories of dependent variable is less than 2
      72  one of the outcome categories has no cases
      73  an independent variable has no variation
      74  can't open file for predicted values
      75  out of disk space
      77  all cases were deleted
      78  singular matrix encountered during iterations
      79  wrong number of start values specified

  vc -- NPARMxNPARM variance-covariance matrix for the parameters in b.

  ndtran -- 2x1 vector of observations. Element 1 contains the number of cases read from dataset; element 2 contains the number of cases left after deletion of missing cases controlled by miss, i.e. the number of cases used in the analysis.

  pct -- the percent of cases in each of the outcome categories, arranged in order lowest to highest.

  meanx -- the means based on nused cases of the independent variables in the order in indvars.

  sdx -- the standard deviations based on nused cases of the independent variables in the order in indvars.

  fit -- 4x1 vector of goodness of fit measures. Element 1 is the likelihood ratio chi-square assessing the overall fit of the model; element 2 is -2 times the log
likelihood function evaluated at the estimated values; element 3 is -2 times the log likelihood function evaluated with the slopes fixed to zero; element 4 is the percentage of correct predictions from the model.

  df -- the degrees of freedom associated with lrx2.

  tol -- the tolerance reached. If convergence was obtained, this value must be less than the global tol.

Remarks: See the manual for details on the model.

Library: QUANTAL

See Also: LOGIT, PROBIT, DTRAN

Example for ordered logit

OLPRED.E: Ordinal Logit analysis of the NORC data. Saving predicted values to disk.

    library quantal;
    #include quantal.ext;
    quantset;
    output file = olpred.out reset;
    title="olpred.e: Ordinal Logit Analysis of the NORC";
    _qrlogit = 1;
    _qriter = 0;
    _qrpred = 1;
    _qrpredn = "olpred";
    _qrfit = 1;
    _qrstat = 1;
    dataset = "d:/gauss/examples/norc";
    depvar = { DEPVAR };
    indvars = { EXPER, EDUC, WHITE, FBLUE };
    call ordered(dataset,depvar,indvars);

Example for ordered probit

OPNORC.E: Ordered Probit analysis of the NORC data.

    library quantal;
    quantset;
    output file = opnorc.out reset;
    title="opnorc.e: Ordered Probit analysis of NORC data on occupation";
    dsn = "d:/gauss/examples/norc";
    row = { 0, 100 };    /* read only first 100 cases */
    dv = 1;
    iv = { 2, 3, 4 };
    _qrcatnm = { "Menial", "Blue_Col", "Craft", "Whte_Col", "Prof" };
    _qrlogit = 0;
    call ordered(dsn,dv,iv);

Example for probit

PBTNEWT.E: Probit analysis of Aldrich and Nelson Data (pg. 62) Using Newton-Raphson

    library quantal;
    #include quantal.ext;
    output file = pbtnewt.out reset;
    quantset;
    title="pbtnewt.e: Aldrich and Nelson Data (pg. 62) using Newton-Raphson";
    _qriter = 1;         /* iteration results: 0 no, 1 view, 2 print */
    _qrstat = 1;         /* 0 for no desc stats */
    _qrfit = 1;          /* 1 to print detailed goodness of fit measures */
    _qrcatnm = { B_or_C, A };
    _qrpred = 0;         /* save predicted values */
    _pbtnewt = 1;
    weight = { gpa };
    fnm = "d:/gauss/examples/aldnel";
    dv = { 5 };
    iv = { 2, 3, 4 };
    { vnam,b,vc,n,pct,mn,sd,lrx2,df,tol } = PROBIT(fnm,dv,iv);

Example for Poisson

PSNREG.E: example for PSNREG

    library quantal;
    #include quantal.ext;
    quantset;
    _qrstat=1;
    let dep = wars;
    let ind = age party unem;
    dataset = "d:/gauss/examples/sample";
    output file = psnreg.out reset;
    call psnreg(dataset,dep,ind);
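For readers who want to see what a Poisson regression fit involves, here is a minimal Python sketch of maximum likelihood estimation for the model log E[y] = b0 + b1*x by Newton-Raphson. It illustrates the model only and makes no claim about how PSNREG works internally; the function name, the single-regressor restriction and the data are hypothetical.

```python
import math

def poisson_fit(x, y, iters=25):
    """Fit log E[y] = b0 + b1*x by Newton-Raphson; returns (b0, b1)."""
    # Start from the intercept-only solution.
    b0, b1 = math.log(sum(y) / len(y)), 0.0
    for _ in range(iters):
        mu = [math.exp(b0 + b1 * xi) for xi in x]
        # Score vector: X'(y - mu).
        g0 = sum(yi - mi for yi, mi in zip(y, mu))
        g1 = sum(xi * (yi - mi) for xi, yi, mi in zip(x, y, mu))
        # Information matrix: X' diag(mu) X.
        h00 = sum(mu)
        h01 = sum(xi * mi for xi, mi in zip(x, mu))
        h11 = sum(xi * xi * mi for xi, mi in zip(x, mu))
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system explicitly.
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0, b1 = b0 + d0, b1 + d1
        if abs(d0) + abs(d1) < 1e-10:
            break
    return b0, b1

# Hypothetical count data.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [1, 1, 3, 4, 6, 10]
b0, b1 = poisson_fit(x, y)
print(round(b0, 4), round(b1, 4))
```

At the solution the score vector is zero, so the fitted means sum to the observed total count; checking that is a simple way to confirm convergence.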
Example for Qtest

QRTEST.E: Test of linear hypothesis of logit model

    library quantal;
    #include quantal.ext;
    quantset;
    _qrcatnm = { GRD=A, GRD=B, GRD=C };
    dsn = "d:/gauss/examples/aldnel";
    dv = { abc };
    iv = { gpa, tuce, psi };
    { vnam,b,vc,n,pct,mn,sd,fit,df,tol } = logit(dsn,dv,iv);
    output file = qrtest.out reset;
    test1 = "gpa:2 + tuce:2 = 0";
    { wald1 } = qtest(vnam,b,vc,test1);
    test2 = "gpa:1-2.5tuce:1 = 2, tuce:1 + psi:1 = 0, 3gpa:2 + 2tuce:1 - psi:2 = 2";
    { wald2 } = qtest(vnam,b,vc,test2);

Practical Session 8b

In the data set bronc.dat, RES indicates whether the subject has bronchitis (res = 1) or not (res = 0), CIG is the amount of cigarette consumption and POLL is the pollution level in that household. Do the following:

1. Categorise CIG into four categories: 0, less than or equal to 3, less than or equal to 8, and more than 8.
2. Categorise POLL into (0,55], (55,57.5], (57.5,60], (60,62.5], (62.5,65] and more than 65.
3. Fit a logistic model to these data and find the significant covariates.
4. You may need to consult the bronc.prg program.

The data in file CLAIMS.dat give the number of policyholders PONO of an insurance company who were exposed to risk, and the number CLAIM of car insurance claims made in the third quarter of 1973 by these policyholders, arranged as a contingency table cross-classified by three four-level factors: DIST, the district in which the policyholder lived; CAR, the insurance group into which the car was placed; and AGE, the age of the policyholder. The first 16 observations are in DIST1, the second 16 are in DIST2, and so on. The first four observations are in CAR1, the second four observations in CAR2, and so on. The first observation is in AGE1, the second in AGE2, the third in AGE3, the fourth in AGE4, and the fifth is again in AGE1.
1. Rearrange the data and construct dummy variables for the above categories.
2. Perform a Poisson regression model for number of claims (CLAIM).
3. Use and modify the CLAIM.prg program.
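The observation ordering described for CLAIMS.dat determines each factor level from the row position alone. The Python sketch below shows one way to derive the levels and build 0/1 dummy variables from a 0-based row index. It is an illustration, not the CLAIM.prg program: the function names are hypothetical, and using level 1 of each factor as the reference category is an assumption.

```python
def factor_levels(i):
    """Map a 0-based observation index (0..63) to (DIST, CAR, AGE) levels 1..4."""
    dist = i // 16 + 1        # changes every 16 observations
    car = (i % 16) // 4 + 1   # changes every 4 observations within a district
    age = i % 4 + 1           # cycles through 1, 2, 3, 4
    return dist, car, age

def dummies(level, nlevels=4):
    """0/1 indicators for levels 2..nlevels; level 1 is the assumed reference."""
    return [1 if level == k else 0 for k in range(2, nlevels + 1)]

def design_row(i):
    """Constant followed by the DIST, CAR and AGE dummies for observation i."""
    dist, car, age = factor_levels(i)
    return [1] + dummies(dist) + dummies(car) + dummies(age)

# One row per cell of the 4 x 4 x 4 contingency table.
rows = [design_row(i) for i in range(64)]
print(factor_levels(4), rows[4])
```

Each row has ten columns: the constant plus three dummies per factor. The fifth observation (index 4) falls in DIST1, CAR2 and AGE1, matching the ordering given in the text.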
More informationNCSS Statistical Software
Chapter 327 Geometric Regression Introduction Geometric regression is a special case of negative binomial regression in which the dispersion parameter is set to one. It is similar to regular multiple regression
More informationA User Manual for the Multivariate MLE Tool. Before running the main multivariate program saved in the SAS file Part2-Main.sas,
A User Manual for the Multivariate MLE Tool Before running the main multivariate program saved in the SAS file Part-Main.sas, the user must first compile the macros defined in the SAS file Part-Macros.sas
More informationTHIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010
THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE
More informationExample Using Missing Data 1
Ronald H. Heck and Lynn N. Tabata 1 Example Using Missing Data 1 Creating the Missing Data Variable (Miss) Here is a data set (achieve subset MANOVAmiss.sav) with the actual missing data on the outcomes.
More informationSerial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix
Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090) - Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent
More informationDescriptives. Graph. [DataSet1] C:\Documents and Settings\BuroK\Desktop\Prestige.sav
GET FILE='C:\Documents and Settings\BuroK\Desktop\Prestige.sav'. DESCRIPTIVES VARIABLES=prestige education income women /STATISTICS=MEAN STDDEV MIN MAX. Descriptives Input Missing Value Handling Resources
More informationLinear Methods for Regression and Shrinkage Methods
Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors
More informationZero-Inflated Poisson Regression
Chapter 329 Zero-Inflated Poisson Regression Introduction The zero-inflated Poisson (ZIP) regression is used for count data that exhibit overdispersion and excess zeros. The data distribution combines
More informationIntroduction to the R Statistical Computing Environment R Programming: Exercises
Introduction to the R Statistical Computing Environment R Programming: Exercises John Fox (McMaster University) ICPSR 2014 1. A straightforward problem: Write an R function for linear least-squares regression.
More informationSTATA TUTORIAL B. Rabin with modifications by T. Marsh
STATA TUTORIAL B. Rabin with modifications by T. Marsh 5.2.05 (content also from http://www.ats.ucla.edu/stat/spss/faq/compare_packages.htm) Why choose Stata? Stata has a wide array of pre-defined statistical
More informationUsing HLM for Presenting Meta Analysis Results. R, C, Gardner Department of Psychology
Data_Analysis.calm: dacmeta Using HLM for Presenting Meta Analysis Results R, C, Gardner Department of Psychology The primary purpose of meta analysis is to summarize the effect size results from a number
More informationResources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.
Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department
More informationModel Diagnostic tests
Model Diagnostic tests 1. Multicollinearity a) Pairwise correlation test Quick/Group stats/ correlations b) VIF Step 1. Open the EViews workfile named Fish8.wk1. (FROM DATA FILES- TSIME) Step 2. Select
More informationANNOUNCING THE RELEASE OF LISREL VERSION BACKGROUND 2 COMBINING LISREL AND PRELIS FUNCTIONALITY 2 FIML FOR ORDINAL AND CONTINUOUS VARIABLES 3
ANNOUNCING THE RELEASE OF LISREL VERSION 9.1 2 BACKGROUND 2 COMBINING LISREL AND PRELIS FUNCTIONALITY 2 FIML FOR ORDINAL AND CONTINUOUS VARIABLES 3 THREE-LEVEL MULTILEVEL GENERALIZED LINEAR MODELS 3 FOUR
More informationSTAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.
STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,
More informationModelling Proportions and Count Data
Modelling Proportions and Count Data Rick White May 4, 2016 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:
More informationIntroduction to Mplus
Introduction to Mplus May 12, 2010 SPONSORED BY: Research Data Centre Population and Life Course Studies PLCS Interdisciplinary Development Initiative Piotr Wilk piotr.wilk@schulich.uwo.ca OVERVIEW Mplus
More informationLudwig Fahrmeir Gerhard Tute. Statistical odelling Based on Generalized Linear Model. íecond Edition. . Springer
Ludwig Fahrmeir Gerhard Tute Statistical odelling Based on Generalized Linear Model íecond Edition. Springer Preface to the Second Edition Preface to the First Edition List of Examples List of Figures
More informationSPSS INSTRUCTION CHAPTER 9
SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can
More informationST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.
ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball
More informationTable Of Contents. Table Of Contents
Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store
More informationRegression on SAT Scores of 374 High Schools and K-means on Clustering Schools
Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools Abstract In this project, we study 374 public high schools in New York City. The project seeks to use regression techniques
More informationNAG Toolbox for MATLAB. g02ef.1
G2 Correlation and Regression Analysis g2ef Purpose NAG Toolbox for MATLAB g2ef g2ef calculates a full stepwise selection from p variables by using Clarke s sweep algorithm on the correlation matrix of
More informationModelling Proportions and Count Data
Modelling Proportions and Count Data Rick White May 5, 2015 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:
More informationSAS/STAT 13.1 User s Guide. The NESTED Procedure
SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute
More informationCross-validation and the Bootstrap
Cross-validation and the Bootstrap In the section we discuss two resampling methods: cross-validation and the bootstrap. 1/44 Cross-validation and the Bootstrap In the section we discuss two resampling
More informationResearch Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel
Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement
More informationPredict Outcomes and Reveal Relationships in Categorical Data
PASW Categories 18 Specifications Predict Outcomes and Reveal Relationships in Categorical Data Unleash the full potential of your data through predictive analysis, statistical learning, perceptual mapping,
More informationPackage endogenous. October 29, 2016
Package endogenous October 29, 2016 Type Package Title Classical Simultaneous Equation Models Version 1.0 Date 2016-10-25 Maintainer Andrew J. Spieker Description Likelihood-based
More informationD-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview
Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,
More informationChapter 7: Linear regression
Chapter 7: Linear regression Objective (1) Learn how to model association bet. 2 variables using a straight line (called "linear regression"). (2) Learn to assess the quality of regression models. (3)
More informationbook 2014/5/6 15:21 page v #3 List of figures List of tables Preface to the second edition Preface to the first edition
book 2014/5/6 15:21 page v #3 Contents List of figures List of tables Preface to the second edition Preface to the first edition xvii xix xxi xxiii 1 Data input and output 1 1.1 Input........................................
More informationAn Introductory Guide to Stata
An Introductory Guide to Stata Scott L. Minkoff Assistant Professor Department of Political Science Barnard College sminkoff@barnard.edu Updated: July 9, 2012 1 TABLE OF CONTENTS ABOUT THIS GUIDE... 4
More information- 1 - Fig. A5.1 Missing value analysis dialog box
WEB APPENDIX Sarstedt, M. & Mooi, E. (2019). A concise guide to market research. The process, data, and methods using SPSS (3 rd ed.). Heidelberg: Springer. Missing Value Analysis and Multiple Imputation
More informationAssignments Fill out this form to do the assignments or see your scores.
Assignments Assignment schedule General instructions for online assignments Troubleshooting technical problems Fill out this form to do the assignments or see your scores. Login Course: Statistics W21,
More informationoptimization_machine_probit_bush106.c
optimization_machine_probit_bush106.c. probit ybush black00 south hispanic00 income owner00 dwnom1n dwnom2n Iteration 0: log likelihood = -299.27289 Iteration 1: log likelihood = -154.89847 Iteration 2:
More informationSTATISTICS (STAT) Statistics (STAT) 1
Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).
More informationUsing SPSS with The Fundamentals of Political Science Research
Using SPSS with The Fundamentals of Political Science Research Paul M. Kellstedt and Guy D. Whitten Department of Political Science Texas A&M University c Paul M. Kellstedt and Guy D. Whitten 2009 Contents
More informationA Short Guide to Stata 10 for Windows
A Short Guide to Stata 10 for Windows 1. Introduction 2 2. The Stata Environment 2 3. Where to get help 2 4. Opening and Saving Data 3 5. Importing Data 4 6. Data Manipulation 5 7. Descriptive Statistics
More informationLAMPIRAN. Sampel Penelitian
LAMPIRAN Lampiran 1 Daftar Perusahaan Sampel Penelitian No. Kode Kriteria Perusahaan 1 2 3 4 Sampel 1 ADES 1 2 AISA 2 3 ALTO 4 CEKA 5 DAVO 6 DLTA 3 7 ICBP 4 8 INDF 5 9 MLBI 6 10 MYOR 11 PSDN 7 12 ROTI
More informationSAS Structural Equation Modeling 1.3 for JMP
SAS Structural Equation Modeling 1.3 for JMP SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2012. SAS Structural Equation Modeling 1.3 for JMP. Cary,
More informationWritten by Donna Hiestand-Tupper CCBC - Essex TI 83 TUTORIAL. Version 3.0 to accompany Elementary Statistics by Mario Triola, 9 th edition
TI 83 TUTORIAL Version 3.0 to accompany Elementary Statistics by Mario Triola, 9 th edition Written by Donna Hiestand-Tupper CCBC - Essex 1 2 Math 153 - Introduction to Statistical Methods TI 83 (PLUS)
More informationPackage glmmml. R topics documented: March 25, Encoding UTF-8 Version Date Title Generalized Linear Models with Clustering
Encoding UTF-8 Version 1.0.3 Date 2018-03-25 Title Generalized Linear Models with Clustering Package glmmml March 25, 2018 Binomial and Poisson regression for clustered data, fixed and random effects with
More informationWeek 4: Simple Linear Regression II
Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties
More informationUSER S GUIDE LATENT GOLD 4.0. Innovations. Statistical. Jeroen K. Vermunt & Jay Magidson. Thinking outside the brackets! TM
LATENT GOLD 4.0 USER S GUIDE Jeroen K. Vermunt & Jay Magidson Statistical Innovations Thinking outside the brackets! TM For more information about Statistical Innovations Inc. please visit our website
More informationThe NESTED Procedure (Chapter)
SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual
More informationFrequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS
ABSTRACT Paper 1938-2018 Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS Robert M. Lucas, Robert M. Lucas Consulting, Fort Collins, CO, USA There is confusion
More informationTable of Contents (As covered from textbook)
Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression
More informationDetecting and Circumventing Collinearity or Ill-Conditioning Problems
Chapter 8 Detecting and Circumventing Collinearity or Ill-Conditioning Problems Section 8.1 Introduction Multicollinearity/Collinearity/Ill-Conditioning The terms multicollinearity, collinearity, and ill-conditioning
More informationMultidimensional Latent Regression
Multidimensional Latent Regression Ray Adams and Margaret Wu, 29 August 2010 In tutorial seven, we illustrated how ConQuest can be used to fit multidimensional item response models; and in tutorial five,
More informationBasics of Multivariate Modelling and Data Analysis
Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 9. Linear regression with latent variables 9.1 Principal component regression (PCR) 9.2 Partial least-squares regression (PLS) [ mostly
More information