A Note on Model Selection in Mixture Experiments


Journal of Mathematics and Statistics, 93-99, 2007, Science Publications

A Note on Model Selection in Mixture Experiments

Kadri Ulaş Akay
Department of Mathematics, Marmara University, Göztepe Kampüsü, Kadıköy, Istanbul, Turkey

Abstract: In mixture experiments, determination of the best model for modeling the mixture system is significant for both understanding and interpreting the system. Different methods have been used for obtaining the best model in mixture experiments; the most commonly used are the stepwise-type methods. However, the models obtained with these methods are not always the best model with respect to the chosen criteria. Since the models obtained with these methods can be affected by collinearity, an alternative approach is used in this paper for determining the models to be considered in modeling the mixture surface over the experimental region. This approach depends on the examination of all possible subset regression models obtained for the mixture model. To determine the best subset model, the condition numbers of the models and the model control graphs are also taken into account. The proposed approach is then investigated on the flare data set, which is widely known in the literature.

Key words: Mixture model, all possible subset selection, variable selection, model reduction, mixture components, collinearity, condition number

INTRODUCTION

In mixture experiments, the measured response is assumed to depend only on the proportions of the ingredients present in the mixture and not on the amount of the mixture. For example, the response might be the tensile strength of stainless steel, which is a mixture of iron, nickel, copper and chromium, or it might be the octane rating of a blend of gasolines. The purpose of mixture experiments is to build an appropriate model relating the response(s) to the mixture components. All of the work on mixture models has been based on response surface concepts: a model is fitted to data obtained from an experimental design. Various mixture models can be used in the analysis of mixture experiments. However, the determination of the best model for modeling the mixture system is important for both understanding and interpreting the system, since the fitted models are used to screen the components, predict the response(s), determine the effects of the components on the response(s), or optimize the response(s) over the experimental region. In general, computer-based methods for choosing the best subset regression have been suggested for determining the best model in mixture experiments. Cornell [1], Piepel et al. [2-4] and Martin et al. [5] used stepwise regression for choosing the model in mixture experiments. In addition, Draper and St. John [6] used the backward elimination procedure. Apart from these methods, there are various methods which examine all possible subset regression models; one of these is the RSQUARE procedure in SAS. This approach was used in mixture experiments by Khuri [7] and Cornell [1]. Khuri revised the work done by Cornell [1] and gave some collinearity diagnostic measures for each p-parameter submodel with the highest R² value. However, for each submodel with p parameters, the models including different terms should not be ignored as alternatives to the model with the highest R² value. Using the model with the highest R² value may not be suitable for interpreting the mixture model, as it can be affected by collinearity more than the other models.
The purpose of this study is to draw attention to the results obtained by using an alternative approach for choosing a model in mixture experiments. This approach depends on the examination of all possible subset regression models obtained for the mixture model. By comparing all possible subsets, an investigator can not only determine the best reduced models according to selected criteria such as R², but also identify alternatives to the best ones. In addition, extra criteria will also be taken into account for the determination of alternative models.

In this way, with the help of models including different interaction terms, the mixture system can be interpreted much better and the role of the components in the system can be understood much more easily.

Mixture experiments: A mixture experiment involves mixing various proportions of two or more components to make different compositions of an end product. In a q-component mixture in which x_i represents the proportion of the ith component present in the mixture,

0 ≤ x_i ≤ 1, i = 1, 2, ..., q, and Σ_{i=1}^{q} x_i = 1    (1)

The composition space of the components takes the form of a regular (q-1)-dimensional simplex. Physical, theoretical, or economic considerations often impose additional constraints on the individual components,

0 ≤ L_i ≤ x_i ≤ U_i ≤ 1, i = 1, 2, ..., q    (2)

where L_i and U_i denote lower and upper bounds, respectively. In general, the restrictions (2) reduce the constraint region given by (1) to an irregular (q-1)-dimensional hyperpolyhedron. When the component proportions are restricted by lower and upper bounds, collinearity appears all too frequently [8]. It is assumed that the response or property of interest, denoted by η, is to be expressed in terms of a suitable function f of the mixture variables x_i,

η = f(x_1, x_2, ..., x_q)    (3)

A typical model may thus be written

y_i = η_i + ε_i    (4)

where it is assumed that ε_i ~ NID(0, σ²). The mixture model forms most commonly used in fitting data are the canonical polynomials introduced by Scheffé [9], of the form

E(Y) = η = Σ_{i=1}^{q} β_i x_i + ΣΣ_{i<j} β_{ij} x_i x_j    (5)

For modeling well-behaved systems, the Scheffé polynomials are generally adequate. For some situations, however, there are better modeling forms than the Scheffé polynomials. For example, as an alternative to the Scheffé mixture models, models including inverse terms are used in order to model an extreme change in the response behavior of one or more components that are close to the boundary of the simplex region [6]. Accordingly, the following quadratic model including inverse terms has been proposed by Draper and St. John,

E(Y) = Σ_{i=1}^{q} β_i x_i + ΣΣ_{i<j} β_{ij} x_i x_j + Σ_{i=1}^{q} β_{-i} x_i^{-1}    (6)

The Scheffé polynomial models fail to model an additive effect of one component while at the same time accommodating the curvilinear blending effects of the remaining components. To model these effects jointly, Becker developed a set of mixture models which are homogeneous of degree one [10]; they provide alternatives to the Scheffé polynomials. Becker's three second-order models are of the form

H1: η = Σ_{i=1}^{q} β_i x_i + ΣΣ_{i<j} β_{ij} min(x_i, x_j) + ... + β_{12...q} min(x_1, x_2, ..., x_q)

H2: η = Σ_{i=1}^{q} β_i x_i + ΣΣ_{i<j} β_{ij} x_i x_j/(x_i + x_j) + ... + β_{12...q} x_1 x_2 ... x_q/(x_1 + x_2 + ... + x_q)    (7)

H3: η = Σ_{i=1}^{q} β_i x_i + ΣΣ_{i<j} β_{ij} (x_i x_j)^{1/2} + ... + β_{12...q} (x_1 x_2 ... x_q)^{1/q}

In the H2 model, x_i x_j/(x_i + x_j) = 0 whenever x_i + x_j = 0. As usual, we can represent the Scheffé canonical polynomial models, the mixture models with inverse terms and the Becker homogeneous models in matrix form by

Y = Xβ + ε    (8)

where Y is the n×1 vector of observations on the response, X is the n×p model matrix, with p the number of variable terms in the model, β is the p×1 vector of parameters to be estimated and ε is the n×1 vector of errors. It is assumed that the errors have the property

E(ε) = 0,  E(εε′) = σ² I_n    (9)

where I_n is the n×n identity matrix and σ² is the error variance. Hence E(Y) = μ = Xβ, where μ is the n×1 column vector of all expected responses. The least squares estimator for β is b = (X′X)^{-1}X′y and the variance-covariance matrix of b is var(b) = σ²(X′X)^{-1}. A comprehensive reference on the design and analysis of mixture data is given by Cornell [11,12].
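As a small illustration of how a model matrix for (5) and the least squares estimator b = (X′X)^{-1}X′y can be computed in practice, the following sketch (in Python with NumPy; the design and responses are fictitious placeholders, not the flare data analysed later) builds the Scheffé quadratic model matrix and fits it. Columns such as 1/x_i for model (6), or min(x_i, x_j), x_i x_j/(x_i + x_j) and (x_i x_j)^{1/2} for Becker's models (7), could be appended in the same way.

# A minimal sketch (not the author's code) of fitting the Scheffé quadratic
# model (5) by least squares, using NumPy only.  The proportions and
# responses below are made-up placeholders, not the flare data.
import numpy as np
from itertools import combinations

def scheffe_quadratic_matrix(X):
    """Model matrix for (5): the q linear terms x_i plus the q(q-1)/2
    cross-product terms x_i * x_j (no intercept column)."""
    q = X.shape[1]
    cols = [X[:, i] for i in range(q)]
    cols += [X[:, i] * X[:, j] for i, j in combinations(range(q), 2)]
    return np.column_stack(cols)

# placeholder design: each row is a blend, proportions sum to one
X = np.array([[0.6, 0.2, 0.2],
              [0.2, 0.6, 0.2],
              [0.2, 0.2, 0.6],
              [0.4, 0.4, 0.2],
              [0.4, 0.2, 0.4],
              [0.2, 0.4, 0.4],
              [1/3, 1/3, 1/3]])
y = np.array([12.1, 9.8, 7.5, 11.6, 10.9, 8.7, 10.2])   # fictitious responses

M = scheffe_quadratic_matrix(X)
# least squares estimate b = (X'X)^{-1} X'y, computed with a stable solver
b, *_ = np.linalg.lstsq(M, y, rcond=None)
print(b)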
Determination and comparison of mixture models: In mixture experiments, reduction of the model is as important as choosing between the different mixture model forms, since adding all of the terms of the chosen model form is not a very good approach. In such a situation the model may include correlated terms, and it may also be hard to draw conclusions about the mixture system as the parameter estimates may be affected. In mixture experiments, determination of the best model for modeling the mixture system is significant for both understanding and interpreting the system.

There are various methods for choosing a regression model, such as forward selection, backward elimination and stepwise regression, when there are many candidate model terms. In addition, Cornell [1] mentioned that stepwise regression can be investigated for various models in mixture experiments. The objective is to obtain a model form that not only contains an adequate amount of information about the mixture system under investigation but whose form also makes sense. There are serious problems with stepwise-type methods, since they do not give the best model (based on the selected criteria, for example R²). This is because they handle variables one at a time. In addition, only one model is obtained with these methods; therefore, there is a possibility of missing better models.

Mixture problems are particularly prone to ill-conditioning (or collinearity) because of constraint (1). Collinearity between the terms may lead to inconsistent or confusing conclusions when comparing the results of different stepwise regression procedures. This inconsistency makes variable selection a potentially misleading process when collinearity is present. Ill-conditioning may not be a problem if the goal is prediction within the experimental region; however, it can be a serious problem if interpretation of the coefficients is the objective [8]. Standard variable selection methods do not perform well when the data are highly multicollinear. For this reason, in order to obtain the best reduced model, all possible subset regression models should be examined. The sequential model fitting methods proposed by Draper and St. John [6] for mixture experiments can be useful but, if there are many terms, they can require too much labor. A more preferable method is to fit all possible regression models and to evaluate them according to some criterion. In this way a number of best regression models can be selected. In order to find the best subset regression models, the RSEARCH procedure in GENSTAT was used [13]. While using this procedure, three criteria will be taken into account in determining the best models.

First of all, the linear mixture terms (x_1, x_2, ..., x_q) were kept in the model and all possible combinations of the remaining terms were added to the linear mixture terms. The reason for keeping the linear mixture terms in the model is that the model then satisfies the hierarchy principle, which is important for the equivalence of the models [7]. In addition to the linear mixture terms, the number of models with t added terms (1 ≤ t ≤ p) is the binomial coefficient C(p, t); therefore, over the different values of t, a total of C(p, 1) + C(p, 2) + ... + C(p, p) = 2^p - 1 subset regression models is obtained. For the models obtained, the terms which have p-values smaller than 0.05 according to the F-statistics are meaningful. However, this can be affected by collinearity and, as a result, some terms that are important for the mixture system can be missed. In this situation, instead of considering only the models with significant terms, the VIF values of the terms should be taken into account. Secondly, the condition numbers of the models can be used for comparing the reduced models. A useful measure of collinearity is the condition number, κ, defined by

κ = λ_max / λ_min

where λ_max and λ_min denote the largest and the smallest of the eigenvalues of X′X (the columns of X having been scaled to unit length), respectively.
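The computations behind these criteria can be sketched as follows (a minimal illustration, assuming Python with NumPy; this is not the RSEARCH procedure used in the study, and the function names and the cutoff arguments kappa_max and vif_max are placeholders). Every subset of the candidate terms is joined to the linear mixture terms, each fitted model is screened by the condition number κ of the unit-length-scaled X′X and by the VIF values of its terms, and the surviving models are ranked by R².

# Illustrative all-possible-subsets search with collinearity diagnostics.
# Thresholds, names and conventions are assumptions, not the paper's code.
import numpy as np
from itertools import combinations

def condition_number(M):
    """kappa = lambda_max / lambda_min of M'M after scaling the columns of M
    to unit length."""
    S = M / np.linalg.norm(M, axis=0)
    eig = np.linalg.eigvalsh(S.T @ S)
    return eig.max() / eig.min()

def vif(M):
    """VIF of each column, taken as the diagonal of the inverse of the
    unit-length-scaled cross-product matrix (one common convention for
    no-intercept mixture models)."""
    S = M / np.linalg.norm(M, axis=0)
    return np.diag(np.linalg.inv(S.T @ S))

def r_squared(M, y):
    """R^2 of the least squares fit, computed about the mean of y."""
    b, *_ = np.linalg.lstsq(M, y, rcond=None)
    resid = y - M @ b
    return 1.0 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))

def all_subsets(linear, candidates, names, y, kappa_max=30.0, vif_max=100.0):
    """Keep the linear mixture terms in every model (hierarchy principle),
    add each subset of the candidate terms, and retain the models whose
    condition number and VIF values fall below the chosen cutoffs."""
    p = candidates.shape[1]
    kept = []
    for t in range(1, p + 1):
        for idx in combinations(range(p), t):
            M = np.column_stack([linear, candidates[:, list(idx)]])
            kappa = condition_number(M)
            if kappa >= kappa_max or vif(M).max() >= vif_max:
                continue
            kept.append(([names[i] for i in idx], kappa, r_squared(M, y)))
    return sorted(kept, key=lambda model: -model[2])   # best R^2 first

Here `linear` would hold the n×q array of columns x_1, ..., x_q, `candidates` the optional cross-product, ratio or inverse-term columns, and `names` their labels; the surviving models would then be checked further with their model control graphs.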
Smaller condition numbers indicate more stability (better conditioning) in the least squares estimates than larger condition numbers. In this study, subset regression models with a condition number less than 30 will be taken into account. In some situations, the condition number of the model with the highest R² can be greater than those of the other models; this affects the parameter values and may cause misinterpretations about the system. A condition number less than 30 does not guarantee that the VIF values of the terms are less than 100. For this reason, the models with condition numbers less than 30 and terms with VIF values less than 100 will be taken into account. Thirdly, in order to examine which of the models are adequate, model control graphs should be obtained. For the models whose model control graphs are adequate, a final decision can be made by looking at the R² and MSE values of the models. The proposed approach will be examined in the following section on the flare data set.

Flare experiment: McLean and Anderson [14] presented an example to illustrate their extreme-vertices design. A flare is manufactured by mixing magnesium (x_1), sodium nitrate (x_2), strontium nitrate (x_3) and binder (x_4) under the following constraints:

0.40 ≤ x_1 ≤ 0.60, 0.10 ≤ x_2 ≤ 0.50, 0.10 ≤ x_3 ≤ 0.50, 0.03 ≤ x_4 ≤ 0.08

The component proportions for the design points, as well as the measured illumination values, are given in Table 1.

Table 1: Component proportions and illumination response values for the flare experiment (blend number, component proportions x_1-x_4 and illumination in 1000 candles)

Snee [15] used the homogeneous mixture models (7) for the modeling of the flare data set. Draper and St. John [6] made a comparison of the mixture models (6) and (7) for the flare data set. On the other hand, Piepel and Cornell [16] gave a summary of the models proposed for the flare data set to date. When these models are examined, it can be seen that they have three terms in addition to the linear mixture terms and they also have the highest R² values. However, as the experimental region is restricted, the parameter estimates are affected by collinearity. For this reason, comparing the models with respect to their condition numbers and using the models with small condition numbers is more appropriate for interpreting the system. On the other hand, Snee [15] and Draper and St. John [6] considered pseudocomponents for the flare data set. In this paper, subset regression models for the actual components will be given using the Scheffé model, the homogeneous H2 model and the models including inverse terms.

First of all, let us take the second-degree Scheffé model into account. For the Scheffé model, in addition to the linear mixture terms, models with one and with two added cross-product terms and condition numbers less than 30 are obtained. The condition numbers of the models with one added term are .6, and , respectively. The models with two added terms have condition numbers that are equal to each other. However, the model recommended for the second-degree Scheffé model contains two particular cross-product terms [6]; the condition number of this model is .5. In addition, when stepwise regression and forward selection are used for the Scheffé model, a model with only one cross-product term is obtained, but with the backward elimination method a model with two cross-product terms is obtained. Although the model obtained with the backward elimination method has the highest R² value (R² = 7.59), it is the model most affected by collinearity among the models with two added terms.

When the homogeneous H2 model is examined, five models with one added ratio term and condition numbers less than 30 are obtained; these cover all of the ratio terms except one. Among these models, only one has all of its terms with VIF values less than 100. As some of the terms in the models with two added terms have VIF values greater than 100, they are ignored. On the other hand, for the homogeneous H2 model, a model with two ratio terms is obtained by using the backward elimination method, whereas forward selection and stepwise regression give a model with a single ratio term. The condition numbers of these models are 5. and .7, respectively. The model control graphs of the Scheffé and homogeneous H2 subset regression models show that these models are not adequate.

Now let us take the mixture models including inverse terms into account. If the model including inverse terms is of the first degree, the models with condition numbers less than 30 each add a single inverse term; their condition numbers are 6.8, and , respectively. As the VIF value of the inverse term in one of these models exceeds 100, that model was ignored. Although the model with one particular inverse term has the highest R² value (R² = 78.), it is the model most affected by collinearity, with a condition number of 7. The model control graphs of the models with the remaining inverse terms are adequate. Only one model with two added inverse terms is obtained. On the other hand, with backward elimination and stepwise regression a further model is obtained; although this is the model with the highest R², its condition number is large. With forward selection, a model with two terms is obtained. This model, with the highest R² value, was also used by Piepel and Cornell [16]. In this model, the VIF value of one of the terms is large and the condition number is 79.. Therefore, among the first-degree models including inverse terms, only two of the one-term models can be recommended. The summary statistics of these models are R² = 6., MSE = 79 and R² = 60., MSE = 68, respectively.

If the model including inverse terms is of the second degree, there is a linear relation between the terms. For this reason, some of the terms must be left out; in this example, the terms that are linear combinations of the other terms are kept out of the model. In this situation, six models with two added terms and condition numbers less than 30 are obtained. These models include six different term couples, with condition numbers 5.6, 0.9, 0.6, 5.5, 0.6 and 0.9, respectively. The model control graphs of all models except the last one show that the models are adequate. However, as some linear mixture terms have VIF values close to or greater than 100 in two of these models, those models were ignored; the VIF values are equal to 99. for the term in the first of these models and are also high for the second. Therefore, three of the two-term models can be recommended among the second-degree models including inverse terms. The summary statistics of these models are R² = 60.7, MSE = 587; R² = 60., MSE = 6 and R² = 55.8, MSE = 0, respectively.

Fig. 1: Model control graphs of the model including inverse terms

For the models including inverse terms of the second degree, six models with three added terms and condition numbers less than 30 can also be obtained. Only two of these models have adequate control graphs.

However, as the VIF values of one of the terms in these models are greater than 100, these models were also ignored. In addition, although the model with three added terms and the highest R² value could be considered, the VIF values of its terms are too high, and its condition number is also large. Among the models with three added terms, the model with the smallest condition number (κ = .) contains three particular terms. On the other hand, for the model including inverse terms of the second degree, only one model, with a single couple of terms, is obtained by using backward elimination, forward selection and stepwise regression. In this model, the VIF value of one term is 6.6 and the condition number is 7..

Fig. 2: Mixture surfaces obtained for the model including inverse terms

The models including inverse terms are better for the interpretation of the mixture system than the Scheffé and homogeneous H2 mixture models, due to their condition numbers and adequate model control graphs. For this reason, the investigator can choose the best model among the models with one or two added terms. For example, the model control graph of the model with two added terms is given in Fig. 1. The mixture surfaces for x_4 = 0.03 and x_4 = 0.08 over the experimental region for this model are shown in Fig. 2.

CONCLUSION

In this study, the results obtained by different subset selection methods for the flare data set were compared. The models obtained by backward elimination, forward selection and stepwise regression are among the models obtained by all possible subset selection. By using all possible subset selection, we can obtain models according to whatever criteria we choose. On the other hand, because the models with the highest R² value obtained by the RSQUARE procedure are affected by collinearity, extra criteria were taken into account in choosing the models that can be used for interpreting the mixture system. These criteria are the comparison of the condition numbers and the investigation of the model control graphs of the models with small condition numbers. As a result of comparing the condition numbers of the models, models with more consistent parameter values were obtained. Therefore, the condition number of each model should be taken into account when all possible subset regression models are investigated for the determination of the mixture model. In addition, the linear mixture terms were kept in the model in order to satisfy the hierarchy principle. The hierarchy principle in mixture experiments is essential for the models obtained with pseudo-components and with the actual components to be equivalent. Expressing the components in terms of pseudo-components will also alleviate the problems due to the correlations among the coefficients. However, because of the structure of the models including inverse terms and of the homogeneous H2 mixture models, this property has to be investigated for both the pseudo-components and the actual components. In addition, in the presence of collinearity, the ridge trace can be used as an alternative approach for choosing the model in mixture experiments.
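As a rough sketch of the ridge trace idea (assuming a NumPy setting; this is not part of the original analysis), the ridge estimates b(k) = (X′X + kI)^{-1}X′y can be computed over a grid of ridge constants k and each coefficient plotted against k; coefficients that change sign or shrink erratically point to the terms most affected by collinearity.

# Sketch of a ridge trace: ridge estimates over a grid of ridge constants k.
# X is assumed to be a mixture model matrix and y the response vector.
import numpy as np

def ridge_trace(X, y, ks):
    """Return a (len(ks) x p) array whose rows are b(k) = (X'X + kI)^{-1} X'y,
    computed after scaling the columns of X to unit length."""
    S = X / np.linalg.norm(X, axis=0)
    XtX, Xty = S.T @ S, S.T @ y
    p = S.shape[1]
    return np.array([np.linalg.solve(XtX + k * np.eye(p), Xty) for k in ks])

# example usage: trace over k in [0, 1]; plot each coefficient column against ks
# ks = np.linspace(0.0, 1.0, 51); trace = ridge_trace(X, y, ks)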

REFERENCES

1. Cornell, J.A., 2000. Fitting a slack-variable model to mixture data: Some questions raised. J. Quality Tech., 32.
2. Piepel, G.F. and T. Redgate, 1997. Mixture experiment techniques for reducing the number of components applied for modeling waste glass sodium release. J. Am. Ceramic Soc., 80.
3. Piepel, G.F. and T. Redgate, 1998. A mixture experiment analysis of the Hald cement data. The American Statistician, 52.
4. Piepel, G.F., J.M. Szychowski and J.M. Loeppky, 2002. Augmenting Scheffé linear mixture models with squared and/or crossproduct terms. J. Quality Tech., 34.
5. Martin, R.J., L.M. Platts, A.B. Seddon and E.C. Stillman, 2003. The design and analysis of a mixture experiment on glass durability. Australian & New Zealand J. Stat., 45.
6. Draper, N.R. and R.C. St. John, 1977. A mixtures model with inverse terms. Technometrics, 19.
7. Khuri, A.I., 2005. Slack-variable models versus Scheffé's mixture models. J. Applied Stat., 32(9).
8. St. John, R.C., 1984. Experiments with mixtures, ill-conditioning and ridge regression. J. Quality Tech., 16.
9. Scheffé, H., 1958. Experiments with mixtures. J. Royal Stat. Soc. B, 20.
10. Becker, N.G., 1968. Models for the response of a mixture. J. Royal Stat. Soc. B, 30.
11. Cornell, J.A., 2000. Developing mixture models, are we done? J. Statistical Computation and Simulation, 66.
12. Cornell, J.A., 2002. Experiments with Mixtures, 3rd Ed. Wiley-Interscience.
13. GENSTAT, Release 7.1, 2003. The Guide to Genstat Release 7.1: Part 2, Statistics.
14. McLean, R.A. and V.L. Anderson, 1966. Extreme vertices design of mixture experiments. Technometrics, 8.
15. Snee, R.D., 1973. Techniques for the analysis of mixture data. Technometrics, 15.
16. Piepel, G.F. and J.A. Cornell, 1994. Mixture experiment approaches: Examples, discussion and recommendations. J. Quality Tech., 26.
