PROC JACK REG - A SAS~ PROCEDURE FOR JACKKNIFE REGRESSION Panayiotis Hambi, West Virginia University

Size: px
Start display at page:

Download "PROC JACK REG - A SAS~ PROCEDURE FOR JACKKNIFE REGRESSION Panayiotis Hambi, West Virginia University"

Transcription

1 ABSTRACT PROC JACK REG - A SAS~ PROCEDURE FOR JACKKNIFE REGRESSION Panayiotis Hambi, West Virginia University Daniel M. Chl1ko, West Virginia UniverSity Gerry Hobos, West Virginia University PROC JACKREG allows the user to calculate robust estimates of regression coefficients in simple or multiple regression problems by use of the Jackknife method. See Mosteller & Tukey for a good description of the Jackknife. PROC JACKREG may also be used as a validation tool. The Least Squares Regression equation is validated by comparing each observation to the equation generated when that observation is left out. Output includes the individual regression coefficients at each step of the leave-one-out process with their corresponding R-square values. A further table of the final Jackknife coefficients and the ordinary least squares coefficients. with their estimated standard deviations, t-stati8tic8 and P-values are a180 displayed. the sample size. The Jackknife estuastes are the average of the pseudo-values. SPECIFICATIONS The following statements are used with the JACKREG procedure: I I PROC JACKREG options; I MODEL dependent = regressors / options; I OUTPUT OUT: SASdataset keyword~name ; I BY variables; The procedure must always be accompanied by a MODEL statement to specify the regression model. An OUTPUT statement may be specified anywhere after the PROC statement to request an output data set. The purpose of each statement is: Output data sets may be created optionally upon request. One of these data sets may contain the individual Regression coefficients at each step of the leave-one-out proeess (OUTEST=). The other data set may contain the Predicted/Residuals for the ordinary least squares model. Predicted/Residuals for the Jackknife estimates and Predicted/Residuals for the leave-one-out process. INTRODUCTION Suppose that a response variable Y can be predicted by a linear combination of some regressor variables. say Xl and X2. Then you can estimate the parameters in the equation: Y : BO + Bi x Xl + B2 x X 2 + error To fit this model with JACKREG specify: PROC JACKREG; MODEL Y = Xl X2; procedure JACKREG uses the principle of least squares to produce estimates that are the ~Best~ linear unbiased estimates under the classical statistical assumptions. JACKREG calculates also the Jackknife estimates for the model. The Jackknife estimates are defined as follows: Let b be an estimate for a parameter B based on a sample of size N. Now delete one of the observations, say the j-th, re-calculate the Least-squares estimate, and denote the new estimate b-j. Delete, in turn, each of the observations so that there are N estimates each based on N-I observations. Define the pseudo~ values as: The product of the b-j times N-l subtracted from the product of the ordinary least squares estimates. times N.where N is The MODEL statement specifies the dependent and independent variables in the regression model and the options to be used. The OUTPUT statement requests an output data set and names the variables to contain predicted values. residuals and other output values. The BY statement specifies variables to defiue subgroups for the analysis. PROC JACKREG Statement PROC JACKREG options; The following options may be specified on the PROC JACKREG statement: DATA = SASdataset This parameter specifies the input data set the procedure will operate on. If this parameter is omitted the procedure will, by default. use the most recently created SAS data set. OUTEST : SASdataset Requests that the parameter estimates, of the Leave-one-out regression coefficients, be output to this data set. If you want to create a permanent SAS data set, you must specify a twolevel name (see Chapter 12, ISAS Data Sets', in SAS Userls Guide: Basics, 1982, for more information on permanent SAS data sets). NOPRINT Suppresses the normal printed output. 864

2 Using this option. you can create SAS data sets to be used for further analysis in subsequent procedures, without generating any printed output. Using this option is equivalent to specifying NOPRINT on the MODEL statement. MODEL Statement MODEL dependent = regressors I options; After the keyword MODEL, the dependent (response) variable is specified, followed by an equal sign and the regressor (independent) variab les Variables specified in the MODEL statement must be numeric variables in the data set to be analyzed. The following options may appear in the MODEL statement. The slash is used to separate the independent variables from the options. If no options are specified the slash is not needed. GROUP- n Specifies the size of the group of observations to be left out when calculating the pseudo-values. n may be any integer number between 1 and the number of observations in the input data set. If this option is omitted it defaults to one. INVERSE Requests that the inverse of the crossproduct matrix INV(X'X) be printed out. If this option is omitted the inverse is not printed. NOPRINT suppresses the normal output of the procedure. Using this option is the same as specifying the HOPRINT option on the PROC statement. USSCP prints the uncorrected sums of squares and crossproducts matrix X~X. NOINT requp.sts that the intercept parameter not be included in the model. By default the intercept is included. OUTPUT Statement OUTPUT OUT -' SASdataset PREDICTED RESIDUALS JPREDICTED JRESIDUALS VPREDICTED VRESIDUALS variable name = variable_name = variable_name '" variable_name '" variable_name '" variable_name The variable names used must conform to the SAS naming convensions. The OUTPUT statement specifies that JACKREG create a new SAS data set. Predicted and residual values as well as all the variables in the original data set are included in the new data set. If you want to create a permanent SAS data set, you must specify a two-level name (see chapter 12, 'SAS Data Sets' in SAS User's Guide; Basics, 1982, for more information on permanent SAS data sets). If only the keyword OUTPUT is specified without any options, a copy of the input data set is created. The options below may be used OUTPUT statement. in the OUT z gives the name of the new data set. If OUT = is omit ted, the new data set is named using SAS's DATAn convention. PREDICTED =gives the name of the variable whose values are the predicted values of the dependent variable, using the ordinary least-squares technique. RESIDUALS =gives the name of the variable whose values are the residuals, calculated as ACTUAL minus PREDICTED, using the ordinary least-squares technique. JPREDICTED=gives the name of the variable whose values are the predicted values of the dependent variable, using the JACKKNIFE estimates. JRESIDUALS=gives the name of the variable whose values are the residuals, calculated as ACTUAL minus JPREDICTED, using the JACKKNIFE estimates. VPREDICTED=gives the name of the variable whose values are the predicted values of the dependent variable, using the leave-one-out estimates, substituting for the values of independent variables the ones left out. This may be used for validating the model. VRESIDUALS=gives the name of the variable whose values are the residuals calculated as the ACTUAL minus the VPREDICTED. This may be used for validating the model. The above options may be specified in any order and any combination. Any unique abbreviation of the keywords is valid. 865

3 BY St:atement: BY variables; A BY statement: may be used with PROC JACKREG to obtain separate analyses on observat:ions in groups defined by t:he BY variables. When a BY st:at:ement: appears, t:he procedure expect:s the input data set to be sorted in order of t:he BY variables. If your input dat:a set is not: sorted in ascending order, use t:he SORT procedure wit:h a similar BY st:at:ement: to sort the data, or. if appropriate. use the BY statement options NOTSORTED or DESCENDING. For more information, see the discussion of t:he BY st:atemellt in Chapter 8, 'Statements Used in the PRoe Step' in SAS User's Guide: Basics, DETAILS Missing Values If an observation has a missing value for any of the variables in the analysis. that observation is omitted from the analysis. Printed Output The JACKREG procedure produces the following printed output by default : Al. ESTIMATES, the corresponding estimated regression coefficients using the ordinary least squares technique. A2. The names of t:he independent variables included in the model. variable can be accounted for by the model using the ordinary least squares technique. R-SQUARE, which can range from 0 to I, is the ratio of the sum of t:he Squares for the model divided by the tot:al sum of Squares. In general. the larger the value of R-SQUARE, the better the model's fit. A7. The value given in the table for FROB>!T! answers the question 'If the parameter is really equal zero, what is the probability of getting a larger value of T1' Thus, a very small value for this probability indicates that the parameter is not likely to equal zero, and therefore that the independent variable contributes significantly to the model. HI. H2. B3. B4. ESTIMATES, the corresponding Jackknife regression coefficients, calculated as the average of the difference of the weighted least squares coefficients and the psuedo values (coefficients of the regression equation after a group of observat:ions is removed from the analysis). The names of the independent variables included in the equation. STD_ERROR, the standard error of the Jackknife regression coefficients. T_RATIO, which is t:he rat:io of the Jackknife regression coefficients to the standard error of the estimates., f, I ~ i! I A3. STD_ERROR, the standard error of the estimat:ed parameters. A4. T_RATIO, which is the ratio of the estimated regression coefficients to the standard error of the estimates. AS. Th. source of variat:ion OLS SSE, which i. the residual variation that is not accounted for by the model. A6. R-SQUARE, measures how much variat:ion io the dependent 866 B5. JACK SSE. the sum of the square of the residuals when the Jackknife equation is fitted to the data set. B6. R-SQUARE, measures how much variation in the dependent variable can be accounted for by the model using the Jackknife technique. R SQUARE, which can take any values is the ratio of the Sum of the squares for the Jackknife model divided by the corrected total sum of

4 negative). What we learn from this information is that the data set and the fitted model need to be scrutinized further. This suggestion is gived further credance in the second part of the output. There it can be seed that some coefficients change rather markedly when certain observations are left out. For example several of the coefficients change 1n a significant way when observation 2 is left out. It should also be noted that the leave~ne-out R-SQUARE value for observation 2 is much larger than the overall R-SQUARE of This means that the model fits much better if observation 2 is left out and that perhaps we should pay some special attetion to that observation. Note also that if either observations 1 or 24 are left out the value of R-SQUARE is decreased substantially. This suggests that perhaps that those observations are remote in independent variable space. As such they may be 'high leverage~ points. That is, they are situated in such a way as to have a high potential for influencing one or more parameter estimates. In the third part of the output predicted values and residuals are given. By comparing the columns headed ~R_RAIN~ (regular residuals) and 'J_RESID' (jackknife residuals) we can see which observations are better fitted by OLS than the jackknife method and vice-versa. In particular we note that the jackknife method 'misses' the second data point by quite a long way (residual is 9.57). It is sometimes claimed that jackknife methods fit the bulk of the data well but pay scant attention to outliers while OLS methods do the opposite. The 'V RESID' and'v PRED' are particularly useful -as validation-tools. The predicted value corresponds to what we get when each observation is left out of the estimation process and then the values of the independent variables are plugged in to the resulting equation. A couple of things are ~ediately clear. First, observation 2 is most unusual. Second, these residuals are, on average, larger than the ordinary residuals. It is the case that we should expect data points used in the estimation process to fit the derived models better than data not used in that process. Since the validation residuals are formed by taking the difference between a predicted value and an observation which is independent of the predictor set we have a more honest assessment of how well we might expect 'future' observations to fit the given model. Other authors have ~hown the need for transformations on this data set. We, by no means, suggest that either of the models presented here is particularly useful. We simply use it to exhibit the output from PROC JACKREG. In our experience jackknife regressions provide useful diagnostic information (particularly data criticism) which can be incorporated into the overall model Example squares. B7. The value given in the table for PROB>!T! answers the question 'If the parameter is really equal to zero, what is the probability of getting a larger value of T?- Thus, a very small value for this probability indicates that the parameter is not likely to equal to zero. and therefore that the independent variable contributes significantly to the model. 8. The estimated regression coefficients after leaving out a group of observations. 9. R-SQUARE, calculated as the ratio of the sum of the squares for the model divided by the total sum of squares, after a group of observations is removed. The following example has been used many times. See Cook and Weisberg (1982) for a list of the raw data. The observations were collected in 1975 and have to do with weather modification, specifically the use of silver iodide to increase rainfall. Twenty-four days in the summer of 1975 were judged suitable for cloud seeding. On twelve of those days the clouds were seeded and on twelve they were not. The response, Y, is the measure amount of rain that fell in the targeted area in a 6 hour period on a given day. The independent variables are, C % cloud cover in the experimental area P total rainfall 1 hour prior to seeding E = echo motion (8 binary variable having to do with the radar patterns) A = action (seeding; 1, no seeding: 0) Further a suitability measure, S-Ne, is used as an independent variable. Suitable days for seeding are defined as those days on which the suitability measure is greater than or equal to 1.5. In the sample output the standard information concerning OLS estimates is given along with similar information for the jackknife estimates. It may readily be seen that the estimates of the regression coefficients are quite a bit different when we use the jackknife method than they are using OLS. Also the 'R-SQUARE' value associated with the jackknife model is much smaller than the corresponding OLS measure {in fact it is 867

5 building process. be considered as produces 'correct' No regression scheme should a magical black box which results. Jackknifing is a calculation intensive process and for intermediate to large data sets may be impractical even with good hardware. PROC JACKREG contains an option whereby groups of observations can be left out at each step. The procedure leaves out blocks of consecutive observations so the user can arrange the data in such a way as to determine the groups which are left out as part of the jackknife strategy. In most instances the user would rather like that the groups be determined at random. One scheme for accomplishing this task involves using a random number generator and PRoe SORT. Briefly, what needs to be done is to associate a random number with each observation and then sort the dataset by the values of the random number. For a dataset named OLD we can define a new variable called NEWVAR and proceed as follows. DATA NEW; NEWVAR "" PROC SORT; SET OLD; UNIFORM(O); BY NEWVAR; The above statements put the dataset into an essetially 'random order' so that the GROUPS option utilizes random selected groups. ORDINARY LEAST SQUARE DEP VARIABLE : Y ~ I3J VARIABLE ESTIMATES STD ERROR T RATIO PROB>!T! XO ~ A S Q C P E OLS SSE: (jf) R-SQUARE: ~ JACKKNIFE REGRESSION COEFFICENTS DEP VARIABLE : Y i) 31 ".'l '1 VARIABLE ESTIMATES-STD ERROR T_RATIO PROB>!T! XO A S C P )) E JACK SSE: R-SQUARE: ) LEAVE-ONE-OUT REGRESSION COEFFICIENTS LEAVE_OUT XO A S GROUP GROUP GROUP GROUP CROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP i t t l,, I ~ 868

6 LEAVE-ONE-oUT REGRESSION COEFFICIENTS LEAVE-ONE-OUT REGRESSION COEFFICIENTS LEAVE_our c p E LEAVE_OUT R-SQUARE GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP CROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP GROUP RESIDUALS AND PREDICTED VALUES P_RAIN R_RAIN J_PRED J_RESID V_PRED V_RESID J B B BB B _

Two-Stage Least Squares

Two-Stage Least Squares Chapter 316 Two-Stage Least Squares Introduction This procedure calculates the two-stage least squares (2SLS) estimate. This method is used fit models that include instrumental variables. 2SLS includes

More information

Exploratory model analysis

Exploratory model analysis Exploratory model analysis with R and GGobi Hadley Wickham 6--8 Introduction Why do we build models? There are two basic reasons: explanation or prediction [Ripley, 4]. Using large ensembles of models

More information

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA

Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA ECL 290 Statistical Models in Ecology using R Problem set for Week 7 Linear models: Linear regression, multiple linear regression, ANOVA, ANCOVA Datasets in this problem set adapted from those provided

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model

Outline. Topic 16 - Other Remedies. Ridge Regression. Ridge Regression. Ridge Regression. Robust Regression. Regression Trees. Piecewise Linear Model Topic 16 - Other Remedies Ridge Regression Robust Regression Regression Trees Outline - Fall 2013 Piecewise Linear Model Bootstrapping Topic 16 2 Ridge Regression Modification of least squares that addresses

More information

Chapter 7: Linear regression

Chapter 7: Linear regression Chapter 7: Linear regression Objective (1) Learn how to model association bet. 2 variables using a straight line (called "linear regression"). (2) Learn to assess the quality of regression models. (3)

More information

SAS/STAT 13.1 User s Guide. The SCORE Procedure

SAS/STAT 13.1 User s Guide. The SCORE Procedure SAS/STAT 13.1 User s Guide The SCORE Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

Exercise: Graphing and Least Squares Fitting in Quattro Pro

Exercise: Graphing and Least Squares Fitting in Quattro Pro Chapter 5 Exercise: Graphing and Least Squares Fitting in Quattro Pro 5.1 Purpose The purpose of this experiment is to become familiar with using Quattro Pro to produce graphs and analyze graphical data.

More information

SAS/STAT 13.1 User s Guide. The NESTED Procedure

SAS/STAT 13.1 User s Guide. The NESTED Procedure SAS/STAT 13.1 User s Guide The NESTED Procedure This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems RSM Split-Plot Designs & Diagnostics Solve Real-World Problems Shari Kraber Pat Whitcomb Martin Bezener Stat-Ease, Inc. Stat-Ease, Inc. Stat-Ease, Inc. 221 E. Hennepin Ave. 221 E. Hennepin Ave. 221 E.

More information

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball

More information

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data Spatial Patterns We will examine methods that are used to analyze patterns in two sorts of spatial data: Point Pattern Analysis - These methods concern themselves with the location information associated

More information

Example 1 of panel data : Data for 6 airlines (groups) over 15 years (time periods) Example 1

Example 1 of panel data : Data for 6 airlines (groups) over 15 years (time periods) Example 1 Panel data set Consists of n entities or subjects (e.g., firms and states), each of which includes T observations measured at 1 through t time period. total number of observations : nt Panel data have

More information

Analysis of Complex Survey Data with SAS

Analysis of Complex Survey Data with SAS ABSTRACT Analysis of Complex Survey Data with SAS Christine R. Wells, Ph.D., UCLA, Los Angeles, CA The differences between data collected via a complex sampling design and data collected via other methods

More information

NCSS Statistical Software. Robust Regression

NCSS Statistical Software. Robust Regression Chapter 308 Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that chapter for in depth coverage of multiple

More information

The NESTED Procedure (Chapter)

The NESTED Procedure (Chapter) SAS/STAT 9.3 User s Guide The NESTED Procedure (Chapter) SAS Documentation This document is an individual chapter from SAS/STAT 9.3 User s Guide. The correct bibliographic citation for the complete manual

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1

PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation. Simple Linear Regression Software: Stata v 10.1 PubHlth 640 Intermediate Biostatistics Unit 2 - Regression and Correlation Simple Linear Regression Software: Stata v 10.1 Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and

More information

The Bootstrap and Jackknife

The Bootstrap and Jackknife The Bootstrap and Jackknife Summer 2017 Summer Institutes 249 Bootstrap & Jackknife Motivation In scientific research Interest often focuses upon the estimation of some unknown parameter, θ. The parameter

More information

CREATING THE ANALYSIS

CREATING THE ANALYSIS Chapter 14 Multiple Regression Chapter Table of Contents CREATING THE ANALYSIS...214 ModelInformation...217 SummaryofFit...217 AnalysisofVariance...217 TypeIIITests...218 ParameterEstimates...218 Residuals-by-PredictedPlot...219

More information

Serial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix

Serial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix Serial Correlation and Heteroscedasticity in Time series Regressions Econometric (EC3090) - Week 11 Agustín Bénétrix 1 Properties of OLS with serially correlated errors OLS still unbiased and consistent

More information

Subset Selection in Multiple Regression

Subset Selection in Multiple Regression Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that

More information

Bivariate Linear Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Bivariate Linear Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 Bivariate Linear Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 4, 217 PDF file location: http://www.murraylax.org/rtutorials/regression_intro.pdf HTML file location:

More information

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools Abstract In this project, we study 374 public high schools in New York City. The project seeks to use regression techniques

More information

STAT:5201 Applied Statistic II

STAT:5201 Applied Statistic II STAT:5201 Applied Statistic II Two-Factor Experiment (one fixed blocking factor, one fixed factor of interest) Randomized complete block design (RCBD) Primary Factor: Day length (short or long) Blocking

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 327 Geometric Regression Introduction Geometric regression is a special case of negative binomial regression in which the dispersion parameter is set to one. It is similar to regular multiple regression

More information

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions) THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS code SAS (originally Statistical Analysis Software) is a commercial statistical software package based on a powerful programming

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series

More information

DESIGN OF EXPERIMENTS and ROBUST DESIGN

DESIGN OF EXPERIMENTS and ROBUST DESIGN DESIGN OF EXPERIMENTS and ROBUST DESIGN Problems in design and production environments often require experiments to find a solution. Design of experiments are a collection of statistical methods that,

More information

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value.

( ) = Y ˆ. Calibration Definition A model is calibrated if its predictions are right on average: ave(response Predicted value) = Predicted value. Calibration OVERVIEW... 2 INTRODUCTION... 2 CALIBRATION... 3 ANOTHER REASON FOR CALIBRATION... 4 CHECKING THE CALIBRATION OF A REGRESSION... 5 CALIBRATION IN SIMPLE REGRESSION (DISPLAY.JMP)... 5 TESTING

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

Lecture 20: Outliers and Influential Points

Lecture 20: Outliers and Influential Points Lecture 20: Outliers and Influential Points An outlier is a point with a large residual. An influential point is a point that has a large impact on the regression. Surprisingly, these are not the same

More information

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA II. 3 rd Nine Weeks,

STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA II. 3 rd Nine Weeks, STANDARDS OF LEARNING CONTENT REVIEW NOTES ALGEBRA II 3 rd Nine Weeks, 2016-2017 1 OVERVIEW Algebra II Content Review Notes are designed by the High School Mathematics Steering Committee as a resource

More information

Stat 5100 Handout #11.a SAS: Variations on Ordinary Least Squares

Stat 5100 Handout #11.a SAS: Variations on Ordinary Least Squares Stat 5100 Handout #11.a SAS: Variations on Ordinary Least Squares Example 1: (Weighted Least Squares) A health researcher is interested in studying the relationship between diastolic blood pressure (bp)

More information

Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University

Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University 9/1/2005 Beta-Regression with SPSS 1 Beta-Regression with SPSS Michael Smithson School of Psychology, The Australian National University (email: Michael.Smithson@anu.edu.au) SPSS Nonlinear Regression syntax

More information

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS ABSTRACT Paper 1938-2018 Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS Robert M. Lucas, Robert M. Lucas Consulting, Fort Collins, CO, USA There is confusion

More information

For our example, we will look at the following factors and factor levels.

For our example, we will look at the following factors and factor levels. In order to review the calculations that are used to generate the Analysis of Variance, we will use the statapult example. By adjusting various settings on the statapult, you are able to throw the ball

More information

Linear and Quadratic Least Squares

Linear and Quadratic Least Squares Linear and Quadratic Least Squares Prepared by Stephanie Quintal, graduate student Dept. of Mathematical Sciences, UMass Lowell in collaboration with Marvin Stick Dept. of Mathematical Sciences, UMass

More information

MATHEMATICS 191, FALL 2004 MATHEMATICAL PROBABILITY Outline #1 (Countability and Uncountability)

MATHEMATICS 191, FALL 2004 MATHEMATICAL PROBABILITY Outline #1 (Countability and Uncountability) MATHEMATICS 191, FALL 2004 MATHEMATICAL PROBABILITY Outline #1 (Countability and Uncountability) Last modified: September 16, 2004 Reference: Apostol, Calculus, Vol. 2, section 13.19 (attached). The aim

More information

Robust Regression. Robust Data Mining Techniques By Boonyakorn Jantaranuson

Robust Regression. Robust Data Mining Techniques By Boonyakorn Jantaranuson Robust Regression Robust Data Mining Techniques By Boonyakorn Jantaranuson Outline Introduction OLS and important terminology Least Median of Squares (LMedS) M-estimator Penalized least squares What is

More information

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation: Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,

More information

The SAS interface is shown in the following screen shot:

The SAS interface is shown in the following screen shot: The SAS interface is shown in the following screen shot: There are several items of importance shown in the screen shot First there are the usual main menu items, such as File, Edit, etc I seldom use anything

More information

SAS/STAT 15.1 User s Guide The HPREG Procedure

SAS/STAT 15.1 User s Guide The HPREG Procedure SAS/STAT 15.1 User s Guide The HPREG Procedure This document is an individual chapter from SAS/STAT 15.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.

More information

SAS/STAT 14.2 User s Guide. The SURVEYIMPUTE Procedure

SAS/STAT 14.2 User s Guide. The SURVEYIMPUTE Procedure SAS/STAT 14.2 User s Guide The SURVEYIMPUTE Procedure This document is an individual chapter from SAS/STAT 14.2 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

Maths PoS: Year 7 HT1. Students will colour code as they work through the scheme of work. Students will learn about Number and Shape

Maths PoS: Year 7 HT1. Students will colour code as they work through the scheme of work. Students will learn about Number and Shape Maths PoS: Year 7 HT1 Students will learn about Number and Shape Number: Use positive and negative numbers in context and position them on a number line. Recall quickly multiplication facts up to 10 10

More information

Zero-Inflated Poisson Regression

Zero-Inflated Poisson Regression Chapter 329 Zero-Inflated Poisson Regression Introduction The zero-inflated Poisson (ZIP) regression is used for count data that exhibit overdispersion and excess zeros. The data distribution combines

More information

A. Incorrect! This would be the negative of the range. B. Correct! The range is the maximum data value minus the minimum data value.

A. Incorrect! This would be the negative of the range. B. Correct! The range is the maximum data value minus the minimum data value. AP Statistics - Problem Drill 05: Measures of Variation No. 1 of 10 1. The range is calculated as. (A) The minimum data value minus the maximum data value. (B) The maximum data value minus the minimum

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 9. Linear regression with latent variables 9.1 Principal component regression (PCR) 9.2 Partial least-squares regression (PLS) [ mostly

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

CH5: CORR & SIMPLE LINEAR REFRESSION ======================================= STAT 430 SAS Examples SAS5 ===================== ssh xyz@glue.umd.edu, tap sas913 (old sas82), sas https://www.statlab.umd.edu/sasdoc/sashtml/onldoc.htm CH5: CORR & SIMPLE LINEAR REFRESSION =======================================

More information

How to use FSBforecast Excel add in for regression analysis

How to use FSBforecast Excel add in for regression analysis How to use FSBforecast Excel add in for regression analysis FSBforecast is an Excel add in for data analysis and regression that was developed here at the Fuqua School of Business over the last 3 years

More information

SAS/STAT 13.2 User s Guide. The VARCLUS Procedure

SAS/STAT 13.2 User s Guide. The VARCLUS Procedure SAS/STAT 13.2 User s Guide The VARCLUS Procedure This document is an individual chapter from SAS/STAT 13.2 User s Guide. The correct bibliographic citation for the complete manual is as follows: SAS Institute

More information

Lecture 3 Tao Wang 1

Lecture 3 Tao Wang 1 Lecture 3 Tao Wang 1 Objectives In this chapter, you will learn about: Arithmetic operations Variables and declaration statements Program input using the cin object Common programming errors C++ for Engineers

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

C A S I O f x S UNIVERSITY OF SOUTHERN QUEENSLAND. The Learning Centre Learning and Teaching Support Unit

C A S I O f x S UNIVERSITY OF SOUTHERN QUEENSLAND. The Learning Centre Learning and Teaching Support Unit C A S I O f x - 1 0 0 S UNIVERSITY OF SOUTHERN QUEENSLAND The Learning Centre Learning and Teaching Support Unit MASTERING THE CALCULATOR USING THE CASIO fx-100s Learning and Teaching Support Unit (LTSU)

More information

Smoking and Missingness: Computer Syntax 1

Smoking and Missingness: Computer Syntax 1 Smoking and Missingness: Computer Syntax 1 Computer Syntax SAS code is provided for the logistic regression imputation described in this article. This code is listed in parts, with description provided

More information

Year 10 General Mathematics Unit 2

Year 10 General Mathematics Unit 2 Year 11 General Maths Year 10 General Mathematics Unit 2 Bivariate Data Chapter 4 Chapter Four 1 st Edition 2 nd Edition 2013 4A 1, 2, 3, 4, 6, 7, 8, 9, 10, 11 1, 2, 3, 4, 6, 7, 8, 9, 10, 11 2F (FM) 1,

More information

Regression. Dr. G. Bharadwaja Kumar VIT Chennai

Regression. Dr. G. Bharadwaja Kumar VIT Chennai Regression Dr. G. Bharadwaja Kumar VIT Chennai Introduction Statistical models normally specify how one set of variables, called dependent variables, functionally depend on another set of variables, called

More information

Data Management - 50%

Data Management - 50% Exam 1: SAS Big Data Preparation, Statistics, and Visual Exploration Data Management - 50% Navigate within the Data Management Studio Interface Register a new QKB Create and connect to a repository Define

More information

SAS/STAT 14.1 User s Guide. The LOESS Procedure

SAS/STAT 14.1 User s Guide. The LOESS Procedure SAS/STAT 14.1 User s Guide The LOESS Procedure This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute Inc.

More information

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening Variables Entered/Removed b Variables Entered GPA in other high school, test, Math test, GPA, High school math GPA a Variables Removed

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SPSS SPSS (originally Statistical Package for the Social Sciences ) is a commercial statistical software package with an easy-to-use

More information

Data-Analysis Exercise Fitting and Extending the Discrete-Time Survival Analysis Model (ALDA, Chapters 11 & 12, pp )

Data-Analysis Exercise Fitting and Extending the Discrete-Time Survival Analysis Model (ALDA, Chapters 11 & 12, pp ) Applied Longitudinal Data Analysis Page 1 Data-Analysis Exercise Fitting and Extending the Discrete-Time Survival Analysis Model (ALDA, Chapters 11 & 12, pp. 357-467) Purpose of the Exercise This data-analytic

More information

1. What specialist uses information obtained from bones to help police solve crimes?

1. What specialist uses information obtained from bones to help police solve crimes? Mathematics: Modeling Our World Unit 4: PREDICTION HANDOUT VIDEO VIEWING GUIDE H4.1 1. What specialist uses information obtained from bones to help police solve crimes? 2.What are some things that can

More information

CSE446: Linear Regression. Spring 2017

CSE446: Linear Regression. Spring 2017 CSE446: Linear Regression Spring 2017 Ali Farhadi Slides adapted from Carlos Guestrin and Luke Zettlemoyer Prediction of continuous variables Billionaire says: Wait, that s not what I meant! You say: Chill

More information

Compute Blocks in Report

Compute Blocks in Report Compute Blocks in Report Compute Blocks Though it is always possible to compute new variables inside a data step, PROC REPORT allows for similar computations to be done internally as well. Computations

More information

Bootstrapping Method for 14 June 2016 R. Russell Rhinehart. Bootstrapping

Bootstrapping Method for  14 June 2016 R. Russell Rhinehart. Bootstrapping Bootstrapping Method for www.r3eda.com 14 June 2016 R. Russell Rhinehart Bootstrapping This is extracted from the book, Nonlinear Regression Modeling for Engineering Applications: Modeling, Model Validation,

More information

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.

More information

10 Step-drawdown analyses

10 Step-drawdown analyses 10 Step-drawdown analyses Step-drawdown analyses are done for two reasons. In the case of exploration wells, aquifer tests are commonly preceded by step-drawdown tests to determine the proper discharge

More information

Mastery. PRECALCULUS Student Learning Targets

Mastery. PRECALCULUS Student Learning Targets PRECALCULUS Student Learning Targets Big Idea: Sequences and Series 1. I can describe a sequence as a function where the domain is the set of natural numbers. Connections (Pictures, Vocabulary, Definitions,

More information

A Robust Optimum Response Surface Methodology based On MM-estimator

A Robust Optimum Response Surface Methodology based On MM-estimator A Robust Optimum Response Surface Methodology based On MM-estimator, 2 HABSHAH MIDI,, 2 MOHD SHAFIE MUSTAFA,, 2 ANWAR FITRIANTO Department of Mathematics, Faculty Science, University Putra Malaysia, 434,

More information

1 Training/Validation/Testing

1 Training/Validation/Testing CPSC 340 Final (Fall 2015) Name: Student Number: Please enter your information above, turn off cellphones, space yourselves out throughout the room, and wait until the official start of the exam to begin.

More information

Starting with a great calculator... Variables. Comments. Topic 5: Introduction to Programming in Matlab CSSE, UWA

Starting with a great calculator... Variables. Comments. Topic 5: Introduction to Programming in Matlab CSSE, UWA Starting with a great calculator... Topic 5: Introduction to Programming in Matlab CSSE, UWA! MATLAB is a high level language that allows you to perform calculations on numbers, or arrays of numbers, in

More information

Introduction to Exploratory Data Analysis

Introduction to Exploratory Data Analysis Introduction to Exploratory Data Analysis Ref: NIST/SEMATECH e-handbook of Statistical Methods http://www.itl.nist.gov/div898/handbook/index.htm The original work in Exploratory Data Analysis (EDA) was

More information

Table of Laplace Transforms

Table of Laplace Transforms Table of Laplace Transforms 1 1 2 3 4, p > -1 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 Heaviside Function 27 28. Dirac Delta Function 29 30. 31 32. 1 33 34. 35 36. 37 Laplace Transforms

More information

CREATING A SUMMARY TABLE OF NORMALIZED (Z) SCORES

CREATING A SUMMARY TABLE OF NORMALIZED (Z) SCORES CREATING A SUMMARY TABLE OF NORMALIZED (Z) SCORES Walter W. OWen The Biostatistics Center The George Washington University ABSTRACT Data from the behavioral sciences are often analyzed by normalizing the

More information

SAS Workshop. Introduction to SAS Programming. Iowa State University DAY 2 SESSION IV

SAS Workshop. Introduction to SAS Programming. Iowa State University DAY 2 SESSION IV SAS Workshop Introduction to SAS Programming DAY 2 SESSION IV Iowa State University May 10, 2016 Controlling ODS graphical output from a procedure Many SAS procedures produce default plots in ODS graphics

More information

SASEG 9B Regression Assumptions

SASEG 9B Regression Assumptions SASEG 9B Regression Assumptions (Fall 2015) Sources (adapted with permission)- T. P. Cronan, Jeff Mullins, Ron Freeze, and David E. Douglas Course and Classroom Notes Enterprise Systems, Sam M. Walton

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

SAS/STAT 14.1 User s Guide. Special SAS Data Sets

SAS/STAT 14.1 User s Guide. Special SAS Data Sets SAS/STAT 14.1 User s Guide Special SAS Data Sets This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual is as follows: SAS Institute

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

Stat 5100 Handout #19 SAS: Influential Observations and Outliers

Stat 5100 Handout #19 SAS: Influential Observations and Outliers Stat 5100 Handout #19 SAS: Influential Observations and Outliers Example: Data collected on 50 countries relevant to a cross-sectional study of a lifecycle savings hypothesis, which states that the response

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

BACKGROUND INFORMATION ON COMPLEX SAMPLE SURVEYS

BACKGROUND INFORMATION ON COMPLEX SAMPLE SURVEYS Analysis of Complex Sample Survey Data Using the SURVEY PROCEDURES and Macro Coding Patricia A. Berglund, Institute For Social Research-University of Michigan, Ann Arbor, Michigan ABSTRACT The paper presents

More information

S H A R P E L R H UNIVERSITY OF SOUTHERN QUEENSLAND. The Learning Centre Learning and Teaching Support Unit

S H A R P E L R H UNIVERSITY OF SOUTHERN QUEENSLAND. The Learning Centre Learning and Teaching Support Unit S H A R P E L - 5 3 1 R H UNIVERSITY OF SOUTHERN QUEENSLAND The Learning Centre Learning and Teaching Support Unit TABLE OF CONTENTS PAGE Introduction 1 A word about starting out 2 1. Addition and subtraction

More information

4. Feedforward neural networks. 4.1 Feedforward neural network structure

4. Feedforward neural networks. 4.1 Feedforward neural network structure 4. Feedforward neural networks 4.1 Feedforward neural network structure Feedforward neural network is one of the most common network architectures. Its structure and some basic preprocessing issues required

More information

Labor Economics with STATA. Estimating the Human Capital Model Using Artificial Data

Labor Economics with STATA. Estimating the Human Capital Model Using Artificial Data Labor Economics with STATA Liyousew G. Borga December 2, 2015 Estimating the Human Capital Model Using Artificial Data Liyou Borga Labor Economics with STATA December 2, 2015 84 / 105 Outline 1 The Human

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM

CSC 328/428 Summer Session I 2002 Data Analysis for the Experimenter FINAL EXAM options pagesize=53 linesize=76 pageno=1 nodate; proc format; value $stcktyp "1"="Growth" "2"="Combined" "3"="Income"; data invstmnt; input stcktyp $ perform; label stkctyp="type of Stock" perform="overall

More information

Optimization of large simulations using statistical software

Optimization of large simulations using statistical software Computational Statistics & Data Analysis 51 (2007) 2747 2752 www.elsevier.com/locate/csda Optimization of large simulations using statistical software Ilya Novikov, Bernice Oberman Gertner Institute for

More information

The SURVEYREG Procedure

The SURVEYREG Procedure SAS/STAT 9.2 User s Guide The SURVEYREG Procedure (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete

More information

DesignDirector Version 1.0(E)

DesignDirector Version 1.0(E) Statistical Design Support System DesignDirector Version 1.0(E) User s Guide NHK Spring Co.,Ltd. Copyright NHK Spring Co.,Ltd. 1999 All Rights Reserved. Copyright DesignDirector is registered trademarks

More information

Exploring Econometric Model Selection Using Sensitivity Analysis

Exploring Econometric Model Selection Using Sensitivity Analysis Exploring Econometric Model Selection Using Sensitivity Analysis William Becker Paolo Paruolo Andrea Saltelli Nice, 2 nd July 2013 Outline What is the problem we are addressing? Past approaches Hoover

More information

Missing Data Part 1: Overview, Traditional Methods Page 1

Missing Data Part 1: Overview, Traditional Methods Page 1 Missing Data Part 1: Overview, Traditional Methods Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 17, 2015 This discussion borrows heavily from: Applied

More information

EE 511 Linear Regression

EE 511 Linear Regression EE 511 Linear Regression Instructor: Hanna Hajishirzi hannaneh@washington.edu Slides adapted from Ali Farhadi, Mari Ostendorf, Pedro Domingos, Carlos Guestrin, and Luke Zettelmoyer, Announcements Hw1 due

More information