General Factorial Models

Size: px
Start display at page:

Download "General Factorial Models"

Transcription

1 In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 34

2 It is possible to have many factors in a factorial experiment. In DDD we saw an example of a 3-factor study with ball size, height, and surface affecting bounce time (Chapter 2 Example 2.2). 2 / 34

3 The general set-up can be extended to many factors, but higher-order interactions can be bothersome to deal with. Recall that a 3-factor interaction, like (αβγ) ijk, describes how a 2-factor interaction changes depending on the level of the third factor. 3 / 34

4 Three-Factor Factorial Effects Models Full Model (includes interaction): Y ijkl = µ + α i + β j + γ k + (αβ) ij + (αγ) ik + (βγ) jk + (αβγ) ijk + ɛ ijk iid with ɛ ijk N(0, σ 2 ) for i = 1,..., a j = 1,..., b k = 1,..., c and l = 1,..., n for a balanced design. Restriction for estimation of parameters as sum-to-zero constraints: 0 = a α i = i=1 b β j = j=1 c γ k = i=1 a (αβ) ij = i=1 b (αβ) ij =... = j=1 k (αβγ) ijk i=1 4 / 34

5 Three-Factor Factorial Effects Models The degrees of freedom work as before: Source df A a-1 B b-1 C c-1 AB (a-1)(b-1) AC (a-1)(c-1) BC (b-1)(c-1) ABC (a-1)(b-1)(c-1) error [1] abc(n-1) c. total [1] abcn-1 [1] Above df for error and c. total are from a balanced design with n observations in each of the a b c cells. 5 / 34

6 Three-Factor Factorial Effects Models If there is 3-way interaction, then the 2-way interaction for a given level of the 3rd factor differs from the 2-way interaction at a different level of the 3rd factor. 6 / 34

7 As we ve mentioned before, you should check (i.e. test) for higher-order interactions first before considering lower level interactions or main effects. If the higher-order interaction is significant, then you shouldn t look at the tests for lower-order effects because these tests won t necessarily be meaningful. One way to think about it, if there is interaction, then by doing a main effects test, you are essentially pooling things that should not be pooled (are not similar), and you can get some false impressions of what s going on with the effects. 7 / 34

8 If the higher-order interaction is significant, one option is to fit the full model, then perform a kind of slice analysis (a special contrast) which will perform a separate hypothesis test for differing levels of a factor (see graphic below). We saw this in an earlier 2-way interaction example... Or you could also physically partition the data into parts and do separate analyses, but some power is lost because you ll have fewer df for error in each separate analysis compared to when the data are all together. 8 / 34

9 The simplest scenario is when no higher-order interactions are present, and we can just consider a main effects model. In that case, we can fit a model where the interactions terms are removed from the model and are placed in the error term. 9 / 34

10 The next example is a special case of the general factorial design with k factors, all at two levels, or a 2 k design. Graphical representation of a 2 3 design. From Montgomery, D.C. (2014) Applied statistics and probability for engineers. 10 / 34

11 Example (SAS: 2 3 design) An engineer is interested in the effects of the following factors on life (in hours) of a machine tool: cutting angle (0=low, 1=high) tool geometry (0=first shape, 1=second shape) cutting speed (0=low,1=high). Three runs are done for each combination of factor levels, and all runs are done in random order. This is a completely randomized design (CRD). Eight treatment groups with n = 3, so N = 24 From D.C. Montgomery (2005). Design and analysis of experiments. John Wiley & Sons: USA. 11 / 34

12 Example (SAS: 2 3 design) 12 / 34

13 Example (SAS: 2 3 design) 13 / 34

14 Example (SAS: 2 3 design) Diagnostic plots for contant variance and normality. The diagnostic plots look OK, and the 3-way interaction was not significant here (previous slide), so that term could be removed from the model (which places it in the error term). Or we can leave the 3-way interaction term in the model and look at the tests for the 2-way interactions in the ANOVA table. 14 / 34

15 Example (SAS: 2 3 design) According to the ANOVA table, the only significant 2-way interaction is between angle and speed or angle*speed. We will visually look at the marginal 2-way interaction plot (averaged across the 3rd factor) for each combination of factors: angle*speed, angle*geometry, and geometry*speed. These plots average over replicates in a cell and over the levels of the unplotted factor / 34

16 Example (SAS: 2 3 design) Marginal 2-way interaction plot for angle*geometry (not significant) This was not a significant interaction in the model. 16 / 34

17 Example (SAS: 2 3 design) Marginal 2-way interaction plot for geometry*speed (not significant) This was not a significant interaction in the model. 17 / 34

18 Example (SAS: 2 3 design) Marginal 2-way interaction plot for angle*speed (significant) This WAS a significant interaction in the model. 18 / 34

19 Example (SAS: 2 3 design) The type of interaction in the angle*speed plot causes concern for making global statements about the main effects for angle and speed. When angle is low (far left side), speed has a positive effect on life, and when angle is high (far right side), speed has a negative effect on life. The minimal model should include: geometry, angle, speed, angle*speed (following the hierarchy principle). 19 / 34

20 Example (SAS: 2 3 design) Most parsimonious model following hierarchical principle. We will consider the main effects for geometry and the interaction effect between angle*speed with a slice option. 20 / 34

21 Example (SAS: 2 3 design) The geometry factor has a simple a main effect. Averaged over all angles and all speeds, the average lifetime for a tool of shape=0 is 35.2 hours, while the average lifetime of a tool of shape=1 is 46.5 hours. Holding the speed and angle constant, changing from shape=0 to shape=1 is associated with an increased lifetime of about 11.5 hours. 21 / 34

22 Example (SAS: 2 3 design) For each angle level (low and high), speed makes a significant difference on tool lifetime (slices significant). When angle is set to low, a high speed gives a longer lifetime (9.2 hours). When angle is set to high, then a low speed gives a longer lifetime (8.5 hours). 22 / 34

23 What if the 3-way interaction had been significant? How should we proceed? We ll consider two options: 1 Subset the data and do separate analyses. 2 Fit the full model to the complete data set and perform a slice analysis. 23 / 34

24 : Partition data Let s partition the data into two parts by the angle factor (low, high), and do an analysis on the factors of geometry and speed for each part. Example (SAS: subset to angle=0) 24 / 34

25 : Partition data Example (SAS: subset to angle=0) There is no significant interaction, only main effects. 25 / 34

26 : Partition data Example (SAS: subset to angle=0) When angle is set to the low level (angle=0), there is no significant interaction between geometry and speed. There is a significant positive speed effect, and a significant positive geometry effect (both main effects). 26 / 34

27 : Partition data Example (SAS: subset to angle=1) 27 / 34

28 : Partition data Example (SAS: subset to angle=1) There is no significant interaction, only main effects. 28 / 34

29 : Partition data Example (SAS: subset to angle=1) When angle is set to the high level (angle=1), there is no significant interaction between geometry and speed. There is a significant negative speed effect, and a significant positive geometry effect (both main effects). 29 / 34

30 : Use slice option One could get a very similar analysis (with more degrees of freedom for error) by fitting the full model and then slicing by angle. We will approach it that way here. Example (SAS: full model, slice by angle) 30 / 34

31 : Use slice option Example (SAS: full model, slice by angle) If you compare the Mean Squares in the above slice output, they match the Mean Squares for the two models we fit in the two subsetted analyses (with 4 separate means), but the F -statistics are different. Why? 31 / 34

32 : Use slice option Example (SAS: full model, slice by angle) The full model (using all the data and all possible terms) provides ˆσ 2 = with 16 d.f. for the error (output below): When we subsetted the data into the Angle low, we found ˆσ 2 = with 8 d.f. for the error. When we subsetted the data into the Angle high, we found ˆσ 2 = with 8 d.f. for the error. As we have made the assumption that σ 2 is the same across all cell means, the full model estimate of σ 2 is a pooled estimate taken from the two subsetted data sets. They are all estimating the same constant variance σ 2, but we gain in d.f. for the error when we use the pooled estimate. 32 / 34

33 : Use slice option Example (SAS: full model, slice by angle) Test for a difference in the four means where Angle held constant at either low or high with α = 0.05 H 0 : µ a 11 = µ a 12 = µ a 21 = µ a 22 vs. H 1 : not H 0 Using the slice option (i.e. using all the data), the threshold for significance is F (0.05,3,16) = 3.23 Using the subsetted data, the threshold for significance is F (0.05,3,8) = 4.07 The threshold for significance is lower when we have more degrees of freedom for error. 33 / 34

34 : Use slice option Example (SAS: full model, slice by angle) Plot of residuals vs. predicted from full model colored by angle level. 34 / 34

General Factorial Models

General Factorial Models In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 1 1 / 31 It is possible to have many factors in a factorial experiment. We saw some three-way factorials earlier in the DDD book (HW 1 with 3 factors:

More information

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/

SAS data statements and data: /*Factor A: angle Factor B: geometry Factor C: speed*/ STAT:5201 Applied Statistic II (Factorial with 3 factors as 2 3 design) Three-way ANOVA (Factorial with three factors) with replication Factor A: angle (low=0/high=1) Factor B: geometry (shape A=0/shape

More information

Modeling Effects and Additive Two-Factor Models (i.e. without interaction)

Modeling Effects and Additive Two-Factor Models (i.e. without interaction) Modeling Effects and Additive Two-Factor Models (i.e. without interaction) STAT:5201 Week 4: Lecture 3 1 / 16 Modeling & Effects To model the data......to break-down into its component parts....to define

More information

R-Square Coeff Var Root MSE y Mean

R-Square Coeff Var Root MSE y Mean STAT:50 Applied Statistics II Exam - Practice 00 possible points. Consider a -factor study where each of the factors has 3 levels. The factors are Diet (,,3) and Drug (A,B,C) and there are n = 3 observations

More information

Stat 5303 (Oehlert): Unreplicated 2-Series Factorials 1

Stat 5303 (Oehlert): Unreplicated 2-Series Factorials 1 Stat 5303 (Oehlert): Unreplicated 2-Series Factorials 1 Cmd> a

More information

Recall the crossover design set up as a Latin rectangle: Sequence=Subject A B C A B C 3 C A B B C A

Recall the crossover design set up as a Latin rectangle: Sequence=Subject A B C A B C 3 C A B B C A D. More on Crossover Designs: # periods = # trts Recall the crossover design set up as a Latin rectangle: Period Sequence=Subject 1 2 3 4 5 6 1 A B C A B C 2 B C A C A B 3 C A B B C A With one subject

More information

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation:

Recall the expression for the minimum significant difference (w) used in the Tukey fixed-range method for means separation: Topic 11. Unbalanced Designs [ST&D section 9.6, page 219; chapter 18] 11.1 Definition of missing data Accidents often result in loss of data. Crops are destroyed in some plots, plants and animals die,

More information

Stat 5303 (Oehlert): Unbalanced Factorial Examples 1

Stat 5303 (Oehlert): Unbalanced Factorial Examples 1 Stat 5303 (Oehlert): Unbalanced Factorial Examples 1 > section

More information

Source df SS MS F A a-1 [A] [T] SS A. / MS S/A S/A (a)(n-1) [AS] [A] SS S/A. / MS BxS/A A x B (a-1)(b-1) [AB] [A] [B] + [T] SS AxB

Source df SS MS F A a-1 [A] [T] SS A. / MS S/A S/A (a)(n-1) [AS] [A] SS S/A. / MS BxS/A A x B (a-1)(b-1) [AB] [A] [B] + [T] SS AxB Keppel, G. Design and Analysis: Chapter 17: The Mixed Two-Factor Within-Subjects Design: The Overall Analysis and the Analysis of Main Effects and Simple Effects Keppel describes an Ax(BxS) design, which

More information

Lab #9: ANOVA and TUKEY tests

Lab #9: ANOVA and TUKEY tests Lab #9: ANOVA and TUKEY tests Objectives: 1. Column manipulation in SAS 2. Analysis of variance 3. Tukey test 4. Least Significant Difference test 5. Analysis of variance with PROC GLM 6. Levene test for

More information

The same procedure is used for the other factors.

The same procedure is used for the other factors. When DOE Wisdom software is opened for a new experiment, only two folders appear; the message log folder and the design folder. The message log folder includes any error message information that occurs

More information

Section 4 General Factorial Tutorials

Section 4 General Factorial Tutorials Section 4 General Factorial Tutorials General Factorial Part One: Categorical Introduction Design-Ease software version 6 offers a General Factorial option on the Factorial tab. If you completed the One

More information

Lecture 13: Model selection and regularization

Lecture 13: Model selection and regularization Lecture 13: Model selection and regularization Reading: Sections 6.1-6.2.1 STATS 202: Data mining and analysis October 23, 2017 1 / 17 What do we know so far In linear regression, adding predictors always

More information

NCSS Statistical Software. Design Generator

NCSS Statistical Software. Design Generator Chapter 268 Introduction This program generates factorial, repeated measures, and split-plots designs with up to ten factors. The design is placed in the current database. Crossed Factors Two factors are

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION (You re 95% likely to need this information) PAMPHLET NO. # 57 DATE: September 5, 1997 SUBJECT: Interpreting Main Effects when a Two-way Interaction is Present Interpreting the analysis

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

Lecture Notes #4: Randomized Block, Latin Square, and Factorials4-1

Lecture Notes #4: Randomized Block, Latin Square, and Factorials4-1 Lecture Notes #4: Randomized Block, Latin Square, and Factorials4-1 Richard Gonzalez Psych 613 Version 2.5 (Oct 2016) LECTURE NOTES #4: Randomized Block, Latin Square, and Factorial Designs Reading Assignment

More information

Analysis of variance - ANOVA

Analysis of variance - ANOVA Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try

More information

Multi-Factored Experiments

Multi-Factored Experiments Design and Analysis of Multi-Factored Experiments Advanced Designs -Hard to Change Factors- Split-Plot Design and Analysis L. M. Lye DOE Course 1 Hard-to-Change Factors Assume that a factor can be varied,

More information

Statistics Lab #7 ANOVA Part 2 & ANCOVA

Statistics Lab #7 ANOVA Part 2 & ANCOVA Statistics Lab #7 ANOVA Part 2 & ANCOVA PSYCH 710 7 Initialize R Initialize R by entering the following commands at the prompt. You must type the commands exactly as shown. options(contrasts=c("contr.sum","contr.poly")

More information

Statistical Bioinformatics (Biomedical Big Data) Notes 2: Installing and Using R

Statistical Bioinformatics (Biomedical Big Data) Notes 2: Installing and Using R Statistical Bioinformatics (Biomedical Big Data) Notes 2: Installing and Using R In this course we will be using R (for Windows) for most of our work. These notes are to help students install R and then

More information

Model Selection and Inference

Model Selection and Inference Model Selection and Inference Merlise Clyde January 29, 2017 Last Class Model for brain weight as a function of body weight In the model with both response and predictor log transformed, are dinosaurs

More information

Analysis of Two-Level Designs

Analysis of Two-Level Designs Chapter 213 Analysis of Two-Level Designs Introduction Several analysis programs are provided for the analysis of designed experiments. The GLM-ANOVA and the Multiple Regression programs are often used.

More information

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010

THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL. STOR 455 Midterm 1 September 28, 2010 THIS IS NOT REPRESNTATIVE OF CURRENT CLASS MATERIAL STOR 455 Midterm September 8, INSTRUCTIONS: BOTH THE EXAM AND THE BUBBLE SHEET WILL BE COLLECTED. YOU MUST PRINT YOUR NAME AND SIGN THE HONOR PLEDGE

More information

For our example, we will look at the following factors and factor levels.

For our example, we will look at the following factors and factor levels. In order to review the calculations that are used to generate the Analysis of Variance, we will use the statapult example. By adjusting various settings on the statapult, you are able to throw the ball

More information

Mixed Effects Models. Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC.

Mixed Effects Models. Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC. Mixed Effects Models Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC March 6, 2018 Resources for statistical assistance Department of Statistics

More information

Genotype x Environmental Analysis with R for Windows

Genotype x Environmental Analysis with R for Windows Genotype x Environmental Analysis with R for Windows Biometrics and Statistics Unit Angela Pacheco CIMMYT,Int. 23-24 Junio 2015 About GEI In agricultural experimentation, a large number of genotypes are

More information

2014 Stat-Ease, Inc. All Rights Reserved.

2014 Stat-Ease, Inc. All Rights Reserved. What s New in Design-Expert version 9 Factorial split plots (Two-Level, Multilevel, Optimal) Definitive Screening and Single Factor designs Journal Feature Design layout Graph Columns Design Evaluation

More information

An introduction to SPSS

An introduction to SPSS An introduction to SPSS To open the SPSS software using U of Iowa Virtual Desktop... Go to https://virtualdesktop.uiowa.edu and choose SPSS 24. Contents NOTE: Save data files in a drive that is accessible

More information

Instructor: Padraic Bartlett. Lecture 2: Schreier Diagrams

Instructor: Padraic Bartlett. Lecture 2: Schreier Diagrams Algebraic GT Instructor: Padraic Bartlett Lecture 2: Schreier Diagrams Week 5 Mathcamp 2014 This class s lecture continues last s class s discussion of the interplay between groups and graphs. In specific,

More information

Fly wing length data Sokal and Rohlf Box 10.1 Ch13.xls. on chalk board

Fly wing length data Sokal and Rohlf Box 10.1 Ch13.xls. on chalk board Model Based Statistics in Biology. Part IV. The General Linear Model. Multiple Explanatory Variables. Chapter 13.6 Nested Factors (Hierarchical ANOVA ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6,

More information

Lecture 25: Review I

Lecture 25: Review I Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,

More information

STAT 5200 Handout #24: Power Calculation in Mixed Models

STAT 5200 Handout #24: Power Calculation in Mixed Models STAT 5200 Handout #24: Power Calculation in Mixed Models Statistical power is the probability of finding an effect (i.e., calling a model term significant), given that the effect is real. ( Effect here

More information

ITSx: Policy Analysis Using Interrupted Time Series

ITSx: Policy Analysis Using Interrupted Time Series ITSx: Policy Analysis Using Interrupted Time Series Week 5 Slides Michael Law, Ph.D. The University of British Columbia COURSE OVERVIEW Layout of the weeks 1. Introduction, setup, data sources 2. Single

More information

Part I. Hierarchical clustering. Hierarchical Clustering. Hierarchical clustering. Produces a set of nested clusters organized as a

Part I. Hierarchical clustering. Hierarchical Clustering. Hierarchical clustering. Produces a set of nested clusters organized as a Week 9 Based in part on slides from textbook, slides of Susan Holmes Part I December 2, 2012 Hierarchical Clustering 1 / 1 Produces a set of nested clusters organized as a Hierarchical hierarchical clustering

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

Week 6, Week 7 and Week 8 Analyses of Variance

Week 6, Week 7 and Week 8 Analyses of Variance Week 6, Week 7 and Week 8 Analyses of Variance Robyn Crook - 2008 In the next few weeks we will look at analyses of variance. This is an information-heavy handout so take your time reading it, and don

More information

Lecture 5 Finding meaningful clusters in data. 5.1 Kleinberg s axiomatic framework for clustering

Lecture 5 Finding meaningful clusters in data. 5.1 Kleinberg s axiomatic framework for clustering CSE 291: Unsupervised learning Spring 2008 Lecture 5 Finding meaningful clusters in data So far we ve been in the vector quantization mindset, where we want to approximate a data set by a small number

More information

Section 3.4: Diagnostics and Transformations. Jared S. Murray The University of Texas at Austin McCombs School of Business

Section 3.4: Diagnostics and Transformations. Jared S. Murray The University of Texas at Austin McCombs School of Business Section 3.4: Diagnostics and Transformations Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Regression Model Assumptions Y i = β 0 + β 1 X i + ɛ Recall the key assumptions

More information

Stat 602 The Design of Experiments

Stat 602 The Design of Experiments Stat 602 The Design of Experiments Yuqing Xu Department of Statistics University of Wisconsin Madison, WI 53706, USA April 28, 2016 Yuqing Xu (UW-Madison) Stat 602 Week 14 April 28, 2016 1 / 10 Blocking

More information

Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur

Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur Lecture 59 Fractional Factorial Design using MINITAB

More information

STAT 705 Introduction to generalized additive models

STAT 705 Introduction to generalized additive models STAT 705 Introduction to generalized additive models Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 22 Generalized additive models Consider a linear

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 245 Introduction This procedure generates R control charts for variables. The format of the control charts is fully customizable. The data for the subgroups can be in a single column or in multiple

More information

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt )

Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) JMP Output from Chapter 9 Factorial Analysis through JMP Chemical Reaction dataset ( https://stat.wvu.edu/~cjelsema/data/chemicalreaction.txt ) Fitting the Model and checking conditions Analyze > Fit Model

More information

Fall 2012 Points: 35 pts. Consider the following snip it from Section 3.4 of our textbook. Data Description

Fall 2012 Points: 35 pts. Consider the following snip it from Section 3.4 of our textbook. Data Description STAT 360: HW #4 Fall 2012 Points: 35 pts Name: SOLUTION Consider the following snip it from Section 3.4 of our textbook. Data Description The data are haystack measurements taken in Nebraska in 1927 and

More information

Learning Log Title: CHAPTER 6: TRANSFORMATIONS AND SIMILARITY. Date: Lesson: Chapter 6: Transformations and Similarity

Learning Log Title: CHAPTER 6: TRANSFORMATIONS AND SIMILARITY. Date: Lesson: Chapter 6: Transformations and Similarity Chapter 6: Transformations and Similarity CHAPTER 6: TRANSFORMATIONS AND SIMILARITY Date: Lesson: Learning Log Title: Date: Lesson: Learning Log Title: Chapter 6: Transformations and Similarity Date: Lesson:

More information

Week 4: Simple Linear Regression II

Week 4: Simple Linear Regression II Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties

More information

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010

Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Lecture 24: Generalized Additive Models Stat 704: Data Analysis I, Fall 2010 Tim Hanson, Ph.D. University of South Carolina T. Hanson (USC) Stat 704: Data Analysis I, Fall 2010 1 / 26 Additive predictors

More information

Laboratory for Two-Way ANOVA: Interactions

Laboratory for Two-Way ANOVA: Interactions Laboratory for Two-Way ANOVA: Interactions For the last lab, we focused on the basics of the Two-Way ANOVA. That is, you learned how to compute a Brown-Forsythe analysis for a Two-Way ANOVA, as well as

More information

Example 5.25: (page 228) Screenshots from JMP. These examples assume post-hoc analysis using a Protected LSD or Protected Welch strategy.

Example 5.25: (page 228) Screenshots from JMP. These examples assume post-hoc analysis using a Protected LSD or Protected Welch strategy. JMP Output from Chapter 5 Factorial Analysis through JMP Example 5.25: (page 228) Screenshots from JMP. These examples assume post-hoc analysis using a Protected LSD or Protected Welch strategy. Fitting

More information

Machine learning - HT Clustering

Machine learning - HT Clustering Machine learning - HT 2016 10. Clustering Varun Kanade University of Oxford March 4, 2016 Announcements Practical Next Week - No submission Final Exam: Pick up on Monday Material covered next week is not

More information

Problem Set 7 Solutions

Problem Set 7 Solutions 6.42/8.62J Mathematics for Computer Science March 29, 25 Srini Devadas and Eric Lehman Problem Set 7 Solutions Due: Monday, April 4 at 9 PM Problem. Every function has some subset of these properties:

More information

Comparison of Means: The Analysis of Variance: ANOVA

Comparison of Means: The Analysis of Variance: ANOVA Comparison of Means: The Analysis of Variance: ANOVA The Analysis of Variance (ANOVA) is one of the most widely used basic statistical techniques in experimental design and data analysis. In contrast to

More information

Computer Experiments: Space Filling Design and Gaussian Process Modeling

Computer Experiments: Space Filling Design and Gaussian Process Modeling Computer Experiments: Space Filling Design and Gaussian Process Modeling Best Practice Authored by: Cory Natoli Sarah Burke, Ph.D. 30 March 2018 The goal of the STAT COE is to assist in developing rigorous,

More information

Week 5: Multiple Linear Regression II

Week 5: Multiple Linear Regression II Week 5: Multiple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Adjusted R

More information

More on Experimental Designs

More on Experimental Designs Chapter 9 More on Experimental Designs The one and two way Anova designs, completely randomized block design and split plot designs are the building blocks for more complicated designs. Some split plot

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

LL(1) predictive parsing

LL(1) predictive parsing LL(1) predictive parsing Informatics 2A: Lecture 11 John Longley School of Informatics University of Edinburgh jrl@staffmail.ed.ac.uk 13 October, 2011 1 / 12 1 LL(1) grammars and parse tables 2 3 2 / 12

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

nag anova factorial (g04cac)

nag anova factorial (g04cac) g04 Analysis of Variance g04cac 1. Purpose nag anova factorial (g04cac) nag anova factorial (g04cac) computes an analysis of variance table and treatment means for a complete factorial design. 2. Specification

More information

Geometric Modeling. Mesh Decimation. Mesh Decimation. Applications. Copyright 2010 Gotsman, Pauly Page 1. Oversampled 3D scan data

Geometric Modeling. Mesh Decimation. Mesh Decimation. Applications. Copyright 2010 Gotsman, Pauly Page 1. Oversampled 3D scan data Applications Oversampled 3D scan data ~150k triangles ~80k triangles 2 Copyright 2010 Gotsman, Pauly Page 1 Applications Overtessellation: E.g. iso-surface extraction 3 Applications Multi-resolution hierarchies

More information

Section 3.2: Multiple Linear Regression II. Jared S. Murray The University of Texas at Austin McCombs School of Business

Section 3.2: Multiple Linear Regression II. Jared S. Murray The University of Texas at Austin McCombs School of Business Section 3.2: Multiple Linear Regression II Jared S. Murray The University of Texas at Austin McCombs School of Business 1 Multiple Linear Regression: Inference and Understanding We can answer new questions

More information

General Multilevel-Categoric Factorial Tutorial

General Multilevel-Categoric Factorial Tutorial DX10-02-3-Gen2Factor.docx Rev. 1/27/2016 General Multilevel-Categoric Factorial Tutorial Part 1 Categoric Treatment Introduction A Case Study on Battery Life Design-Expert software version 10 offers a

More information

Parsing. Earley Parsing. Laura Kallmeyer. Winter 2017/18. Heinrich-Heine-Universität Düsseldorf 1 / 39

Parsing. Earley Parsing. Laura Kallmeyer. Winter 2017/18. Heinrich-Heine-Universität Düsseldorf 1 / 39 Parsing Earley Parsing Laura Kallmeyer Heinrich-Heine-Universität Düsseldorf Winter 2017/18 1 / 39 Table of contents 1 Idea 2 Algorithm 3 Tabulation 4 Parsing 5 Lookaheads 2 / 39 Idea (1) Goal: overcome

More information

Name Date Class. When the bases are the same and you multiply, you add exponents. When the bases are the same and you divide, you subtract exponents.

Name Date Class. When the bases are the same and you multiply, you add exponents. When the bases are the same and you divide, you subtract exponents. 2-1 Integer Exponents A positive exponent tells you how many times to multiply the base as a factor. A negative exponent tells you how many times to divide by the base. Any number to the 0 power is equal

More information

610 R12 Prof Colleen F. Moore Analysis of variance for Unbalanced Between Groups designs in R For Psychology 610 University of Wisconsin--Madison

610 R12 Prof Colleen F. Moore Analysis of variance for Unbalanced Between Groups designs in R For Psychology 610 University of Wisconsin--Madison 610 R12 Prof Colleen F. Moore Analysis of variance for Unbalanced Between Groups designs in R For Psychology 610 University of Wisconsin--Madison R is very touchy about unbalanced designs, partly because

More information

BART STAT8810, Fall 2017

BART STAT8810, Fall 2017 BART STAT8810, Fall 2017 M.T. Pratola November 1, 2017 Today BART: Bayesian Additive Regression Trees BART: Bayesian Additive Regression Trees Additive model generalizes the single-tree regression model:

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

Subset Selection in Multiple Regression

Subset Selection in Multiple Regression Chapter 307 Subset Selection in Multiple Regression Introduction Multiple regression analysis is documented in Chapter 305 Multiple Regression, so that information will not be repeated here. Refer to that

More information

Equivalence and Simplification of Regular Expressions

Equivalence and Simplification of Regular Expressions Equivalence and Simplification of Regular Expressions Wednesday, September 26, 2007 Reading: Stoughton 3.2 CS235 Languages and Automata Department of Computer Science Wellesley College Goals for Today

More information

Scalar Field Visualization. Some slices used by Prof. Mike Bailey

Scalar Field Visualization. Some slices used by Prof. Mike Bailey Scalar Field Visualization Some slices used by Prof. Mike Bailey Scalar Fields The approximation of certain scalar function in space f(x,y,z). Most of time, they come in as some scalar values defined on

More information

Chapter 6: DESCRIPTIVE STATISTICS

Chapter 6: DESCRIPTIVE STATISTICS Chapter 6: DESCRIPTIVE STATISTICS Random Sampling Numerical Summaries Stem-n-Leaf plots Histograms, and Box plots Time Sequence Plots Normal Probability Plots Sections 6-1 to 6-5, and 6-7 Random Sampling

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 2015 MODULE 4 : Modelling experimental data Time allowed: Three hours Candidates should answer FIVE questions. All questions carry equal

More information

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY

CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 23 CHAPTER 3 AN OVERVIEW OF DESIGN OF EXPERIMENTS AND RESPONSE SURFACE METHODOLOGY 3.1 DESIGN OF EXPERIMENTS Design of experiments is a systematic approach for investigation of a system or process. A series

More information

This is a good time to refresh your memory on double-integration. We will be using this skill in the upcoming lectures.

This is a good time to refresh your memory on double-integration. We will be using this skill in the upcoming lectures. Chapter 5: JOINT PROBABILITY DISTRIBUTIONS Part 1: Sections 5-1.1 to 5-1.4 For both discrete and continuous random variables we will discuss the following... Joint Distributions (for two or more r.v. s)

More information

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM)

Lecture 27, April 24, Reading: See class website. Nonparametric regression and kernel smoothing. Structured sparse additive models (GroupSpAM) School of Computer Science Probabilistic Graphical Models Structured Sparse Additive Models Junming Yin and Eric Xing Lecture 7, April 4, 013 Reading: See class website 1 Outline Nonparametric regression

More information

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References...

Chapter 15 Mixed Models. Chapter Table of Contents. Introduction Split Plot Experiment Clustered Data References... Chapter 15 Mixed Models Chapter Table of Contents Introduction...309 Split Plot Experiment...311 Clustered Data...320 References...326 308 Chapter 15. Mixed Models Chapter 15 Mixed Models Introduction

More information

Chapter 6 Continued: Partitioning Methods

Chapter 6 Continued: Partitioning Methods Chapter 6 Continued: Partitioning Methods Partitioning methods fix the number of clusters k and seek the best possible partition for that k. The goal is to choose the partition which gives the optimal

More information

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics:

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics: UPPSALA UNIVERSITY Department of Mathematics Måns Thulin, thulin@math.uu.se Analysis of regression and variance Fall 2011 COMPUTER EXERCISE 2: One-way ANOVA In this computer exercise we will work with

More information

Contrasts. 1 An experiment with two factors. Chapter 5

Contrasts. 1 An experiment with two factors. Chapter 5 Chapter 5 Contrasts 5 Contrasts 1 1 An experiment with two factors................. 1 2 The data set........................... 2 3 Interpretation of coefficients for one factor........... 5 3.1 Without

More information

Evaluating Classifiers

Evaluating Classifiers Evaluating Classifiers Reading for this topic: T. Fawcett, An introduction to ROC analysis, Sections 1-4, 7 (linked from class website) Evaluating Classifiers What we want: Classifier that best predicts

More information

SLR parsers. LR(0) items

SLR parsers. LR(0) items SLR parsers LR(0) items As we have seen, in order to make shift-reduce parsing practical, we need a reasonable way to identify viable prefixes (and so, possible handles). Up to now, it has not been clear

More information

Applied Multivariate Analysis

Applied Multivariate Analysis Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2017 Choosing Statistical Method 1 Choice an appropriate method 2 Cross-tabulation More advance analysis of frequency tables

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

nag anova random (g04bbc)

nag anova random (g04bbc) g04 Analysis of Variance g04bbc 1. Purpose nag anova random (g04bbc) nag anova random (g04bbc) computes the analysis of variance and treatment means and standard errors for a randomized block or completely

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University

Sta$s$cs & Experimental Design with R. Barbara Kitchenham Keele University Sta$s$cs & Experimental Design with R Barbara Kitchenham Keele University 1 Analysis of Variance Mul$ple groups with Normally distributed data 2 Experimental Design LIST Factors you may be able to control

More information

Introduction to Statistical Analyses in SAS

Introduction to Statistical Analyses in SAS Introduction to Statistical Analyses in SAS Programming Workshop Presented by the Applied Statistics Lab Sarah Janse April 5, 2017 1 Introduction Today we will go over some basic statistical analyses in

More information

Split-Plot General Multilevel-Categoric Factorial Tutorial

Split-Plot General Multilevel-Categoric Factorial Tutorial DX10-04-1-SplitPlotGen Rev. 1/27/2016 Split-Plot General Multilevel-Categoric Factorial Tutorial Introduction In some experiment designs you must restrict the randomization. Otherwise it wouldn t be practical

More information

SAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information

SAS PROC GLM and PROC MIXED. for Recovering Inter-Effect Information SAS PROC GLM and PROC MIXED for Recovering Inter-Effect Information Walter T. Federer Biometrics Unit Cornell University Warren Hall Ithaca, NY -0 biometrics@comell.edu Russell D. Wolfinger SAS Institute

More information

Chapter 8. Interval Estimation

Chapter 8. Interval Estimation Chapter 8 Interval Estimation We know how to get point estimate, so this chapter is really just about how to get the Introduction Move from generating a single point estimate of a parameter to generating

More information

Unit 4 Syllabus: Properties of Triangles & Quadrilaterals

Unit 4 Syllabus: Properties of Triangles & Quadrilaterals ` Date Period Unit 4 Syllabus: Properties of Triangles & Quadrilaterals Day Topic 1 Midsegments of Triangle and Bisectors in Triangles 2 Concurrent Lines, Medians and Altitudes, and Inequalities in Triangles

More information

Use of Extreme Value Statistics in Modeling Biometric Systems

Use of Extreme Value Statistics in Modeling Biometric Systems Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision

More information

Regression Analysis and Linear Regression Models

Regression Analysis and Linear Regression Models Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical

More information

Statistics 202: Data Mining. c Jonathan Taylor. Week 8 Based in part on slides from textbook, slides of Susan Holmes. December 2, / 1

Statistics 202: Data Mining. c Jonathan Taylor. Week 8 Based in part on slides from textbook, slides of Susan Holmes. December 2, / 1 Week 8 Based in part on slides from textbook, slides of Susan Holmes December 2, 2012 1 / 1 Part I Clustering 2 / 1 Clustering Clustering Goal: Finding groups of objects such that the objects in a group

More information

University, Sint-Pietersnieuwstraat 41, B-9000 Gent, Belgium; ABSTRACT

University, Sint-Pietersnieuwstraat 41, B-9000 Gent, Belgium; ABSTRACT Optimizing feature extraction in image analysis using experimented designs, a case study evaluating texture algorithms for describing appearance retention in carpets S. A. Orjuela a,d, R. A. Quinones c,

More information

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov

ECE521: Week 11, Lecture March 2017: HMM learning/inference. With thanks to Russ Salakhutdinov ECE521: Week 11, Lecture 20 27 March 2017: HMM learning/inference With thanks to Russ Salakhutdinov Examples of other perspectives Murphy 17.4 End of Russell & Norvig 15.2 (Artificial Intelligence: A Modern

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Rebecca C. Steorts, Duke University STA 325, Chapter 3 ISL 1 / 49 Agenda How to extend beyond a SLR Multiple Linear Regression (MLR) Relationship Between the Response and Predictors

More information

Getting Correct Results from PROC REG

Getting Correct Results from PROC REG Getting Correct Results from PROC REG Nate Derby Stakana Analytics Seattle, WA, USA SUCCESS 3/12/15 Nate Derby Getting Correct Results from PROC REG 1 / 29 Outline PROC REG 1 PROC REG 2 Nate Derby Getting

More information

The Beauty and Joy of Computing

The Beauty and Joy of Computing The Beauty and Joy of Computing Lecture #7 Algorithmic Complexity UC Berkeley EECS Sr Lecturer SOE Dan Data scientists at Yahoo are using prediction markets along with polls, sentiment analysis on Twitter,

More information