Statistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010

Size: px
Start display at page:

Download "Statistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010"

Transcription

1 Statistical Models for Management Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon February 24 26, 2010 Graeme Hutcheson, University of Manchester Principal Component and Factor Analysis The lecture notes, exercises and data sets associated with this course are available for download from: Factor Analysis (FA) and Principal Components Analysis (PCA) are very similar techniques in that they both attempt to analyse the structure in a data set and define a small number of components or factors that capture most of the variation in the dataframe. With a large number of variables it may be easier to consider a small number of combinations of the original data rather than the entire data frame. The two techniques differ in that PCA identifies components that capture the variation in the dataframe without attempting to interpret the meaning of these components. Factor analysis identifies the structure in the dataframe (often using the PCA technique), but also tries to explain what the structure in the data frame means. In this session we will use the word component to identify the structure determined by PCA and factor to indicate the structure that has been determined by FA. As we are primarilly concerned with providing explanations of the meaning of the structure in our data, most of the discussion will refer to factors, which can be interpreted as meaningful components Introduction to Factor Analysis Factor analysis assumes that relationships between variables are due to the effects of underlying factors and that observed correlations are the result of variables sharing common factors. Consider the hypothetical correlation matrix in table 1 which shows student performance in a number of different academic disciplines: A visual inspection of table 1 suggests that the six disciplines might usefully be divided into two groups. Maths, Physics and Computing appear to be closely related and constitute one group, whilst Art, Drama and English which also appear to be closely related constitute the other group. For these

2 Table 1: Correlation Matrix Maths Physics Computing Art Drama English Maths 1.00 Physics Computing Art Drama English data, a factor analysis should clearly indicate the presence of two underlying factors which could be interpreted as representing the different types of skills required to succeed in the disciplines. Maths, Physics and Computing could be related as they all require an ability to think logically, whereas English, Art and Drama might require a more abstract style of thought. The way in which the disciplines have grouped together could therefore have been determined by the underlying factors of artistic and logical aptitude. Describing a data set in terms of factors (or latent variables as they are sometimes called) can be useful in the identification of underlying processes which determine correlations among the variables. In the example above, the marks obtained by the children might be better understood as a function of whether the discipline requires logical or creative ability rather than skills which are specific to each individual subject. A description of the children s performance given in terms of two separate factors as opposed to six related variables has resulted in a simpler interpretation. General Principles of Factor Analysis Each variable in factor analysis is expressed as a linear combination of factors which are not actually observed. For example, a person s result in an examination might be influenced by a number of factors, such as the person s aptitude in that particular subject, his or her experience with taking examinations, IQ and writing ability. The score a person gets on a test will be a reflection of a number of different abilities (factors) which affect the test score. A person s test score can be predicted by taking account of these abilities, as shown in Equation 1. Test Score = a(factor 1) + b(factor 2) + c(factor 3) + U test score (1) where a, b and c indicate the extent to which the different factors influence the test score and U represents an unknown component of the test score. Applying this equation to the example above we get: Test Score = aiq + bexperience + cwriting ability + U test score This equation is similar to a multiple regression equation except that IQ, Experience and Writing ability are not single independent variables but are labels for the underlying factors. IQ, Experience and Writing ability are called Common Factors, since all variables are expressed as functions of them. The U in the equation is called a Unique Factor, since it represents the part of the test score that cannot be explained by the common factors. U test score is unique to the test score variable. It should be noted that we do not know what these factors are in advance as their meaning can only be determined by interpreting the results of the analysis. 2

3 Equation 1 showed that a particular variable can be expressed in terms of unobserved factors. It is also possible to define an unobserved factor in terms of the observed variables. Each factor is identified as the correlation between the variables in the analysis and Equation 2 defines a factor in these terms. Factor X = β 1 Var 1 + β 2 Var 2 + β 3 Var β k Var k (2) where Var 1, Var 2,... Var k are variables and β 1, β 2,... β k are standardised regression coefficients Applying this equation to the example above we get: Factor X = β 1 Mathematics + β 2 Physics + β 3 Computing + β 4 Art + β 5 Drama + β 6 English where Mathematics, Physics, Computing, Art, Drama and English are variables and β 1, β 2,... β 6 are standardised regression coefficients Equation 2 calculates one of the factors which might underlie the data set. Additional factors can be calculated to explain the remaining variance in the data. For example, if the first factor to be calculated accounted for 40% of the variability in the data there would remain 60% of the variance which is unaccounted for. The next factor to be computed would account for as much of the remaining variance as possible. Factor 1 accounts for the biggest portion of the variance in the data, factor 2 accounts for the next biggest portion of variance, factor 3 accounts for the third next biggest portion of the variance etc... Successively smaller amounts of variance are accounted for by equating further factors until all of the variance is accounted for. Example Data set The technique of Factor analysis will be demonstrated using a real data set which shows children s performance on a number of tests. These tests were designed to assess a range of different abilities and skills. Table 2 shows the 17 variables included in the analysis. Label Table 2: Variables in data file Description active How active art Articulation atten Attention comp Comprehension coord Coordination draw Drawing lexp Expressive language mat Mathematical ability motsk Motor skills newsit Capability in new situations saint Social interaction 1 sencom Sentence completion sint Social interaction 2 temp Temperament under Understanding of language vocab Vocabulary writ Writing 3

4 Level of measurement FA is based on correlations, so continuous data is required. However, this requirement is often relaxed so that ordered data can be used (see Hutcheson and Sofroniou, 1999, for a full discussion of this issus). Measures of Sampling Adequacy A useful method for determining the appropriateness of running a factor analysis is to compute a measure of sampling adequacy. Such measures have been proposed by Kaiser (1970) and are based on an index which compares correlation and partial correlation coefficients (these measures of sampling adequacy are also known as Kaiser Meyer Olkin, or KMO statistics). KMO statistics can be calculated for individual and multiple variables using Equations. As these measures are not required for the understanding of the factor analysis technique, they will not be covered in detail here. Full explanations are, however, provided in Hutcheson and Sofroniou, An algorithm for computing the KMO statistics in R can be obtained from Graeme Hutcheson on request. Principal Components Analysis (PCA) Once the variables that are to be used in the factor analysis have been selected (based on theoretical considerations, the KMO statistics and the level of measurement of the data) the individual components that define the structure in the data can be determined using PCA. PCA identifies linear combinations of the observed variables with the first principal-component, P C(1), being the linear combination of variables that accounts for the largest amount of variance in the sample. The second principal-component, P C(2), is the linear combination of the variables which is uncorrelated with P C(1) and accounts for the maximum amount of the remaining variation in the data. Successive components explain progressively smaller portions of the total sample variance, and are all uncorrelated with each other. Essentially, principal-component analysis transforms a set of correlated variables into a set of uncorrelated components. The principal-components analysis in Rcmdr has transformed the 17 correlated variables (ACTIV to WRIT) into 17 uncorrelated components (Comp.1 to Comp.17). The component loadings show the correlations between each of the variables and the new components. In this analysis as we have 17 variables represented by 17 components, all of the variation in each variable is accounted for (we have not lost any information by transforming the variables into components). The squared loadings for a variable will sum to 1.0. In this form, the data have just been rearranged. The task now is to see if the data can be represented appropriately using fewer components. Selecting the number of components The principal components analysis provides information about the amount of variance explained by each of the components. In the Rcmdr output the component variances show the Eigenvalues for each of the components. The Eigenvalues just indicate the amount of variance in all the data that is accounted for by the component. As we have 17 variables, the component variances also add up to 17 (add them up). We can see that the first component accounts for the largest amount of variance (8.28 out of 17), followed by the second (2 out of 17) and followed by successively smaller components. If we were to just consider the first two components, these would account for about 60% of the variation in the data (10.28 out of 17). An Eigenvalue of 1.0 indicates the same amount of variance as is explained by a single variable. Although many packages have a default to extract only those components that have Eigenvalues of 1 or more, in practice, a useful solution might be 4

5 Principal Components Analysis Data set: factor.txt (available for download from RGSweb) Rcmdr: commands Statistics Dimensional analysis Principal-components analysis... Principal Components Analysis Variables (pick two or more) Analyze correlation matrix OK select all variables select Rcmdr: output >.PC <- princomp(~active+art+atten+comp+coord+draw+lexp+mat+motsk+newsit+saint+sencom+sint+ TEMP+UNDER+VOCAB+WRIT, cor=true, data=dataset) > unclass(loadings(.pc)) # component loadings Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 Comp.6 Comp.7 ACTIVE ART ATTEN COMP COORD DRAW LEXP MAT MOTSK NEWSIT SAINT SENCOM SINT TEMP UNDER VOCAB WRIT Comp.8 Comp.9 Comp.10 Comp.11 Comp.12 Comp.13 ACTIVE ART ATTEN COMP COORD DRAW LEXP MAT MOTSK NEWSIT SAINT SENCOM SINT TEMP UNDER VOCAB WRIT continued overleaf... 5

6 ...continued from overleaf Comp.14 Comp.15 Comp.16 Comp.17 ACTIVE ART ATTEN COMP COORD DRAW LEXP MAT MOTSK NEWSIT SAINT SENCOM SINT TEMP UNDER VOCAB WRIT >.PC$sd^2 # component variances Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 Comp.6 Comp.7 Comp.8 Comp Comp.10 Comp.11 Comp.12 Comp.13 Comp.14 Comp.15 Comp.16 Comp obtained using fewer or more components than this. The current analysis suggests that a solution of around 4 may be appropriate. An easy way to view the Eigenvalues is to use a scree plot (look this up on the web for a description of why it is called a scree plot and also how it might be best interpreted). The commands to obtain a screeplot in Rcmdr are shown below. Principal Components Analysis: drawing a screeplot Data set: factor.txt (available for download from RGSweb) Rcmdr: commands Statistics Dimensional analysis Principal-components analysis... Principal Components Analysis Variables (pick two or more) Screeplot OK select all variables select The analysis above suggests that nearly 70% (( )/17 *100) of the variation in the data (the 17 correlated variables) can be represented by just 3 principal components. 6

7 Figure 1: A screeplot Interpreting the components This section is included here to illustrate the point that the components, whilst accounting for a large proportion of the variance, may not have easily interpretable meanings assigned to them. In practice, the following statistics would not be computed as a matter of course as a factor analysis would probably be used directly. This section therefore provides a demonstration and is not part of the normal analytic procedure. The main point we are to look at here is what do these three components look like and how do they relate to the original variables. In order to answer this question, we can save the 3 principalcomponents to the data set. This can be achieved very simply in Rcmdr using the commands given below. These commands run the principal-components analysis using the correlation matrix and then save the first three components to the data set as variables PC1, PC2 and PC3. Although of limited use for general analysis, it is useful for the purposes of this lecture to see the relationship between the original variables and the 3 components. This will tell us how much of each variable is explained by the 3 components. We can do this in Rcmdr by correlating the components with the variables using a matrix correlation. From the correlation analysis output (only the important results have been reported here) we can see that PC1 (principal-component 1) accounts for of variable ACTIVE and of variable ART. The amount of the variable ACTIVE that is accounted for by all three principalcomponents is the sum of the squared loadings ( ), which equals We can therefore say that 69.08% of the variance in the variable ACTIVE is accounted for by the three principal-components. These statistics are commonly provided in software and are also known as the communalities. 7

8 Principal Components Analysis: saving the components Data set: factor.txt (available for download from RGSweb) Rcmdr: commands Statistics Dimensional analysis Principal-components analysis... Principal Components Analysis Variables (pick two or more) Analyze correlation matrix Add principal components to data set OK select all variables select select Number of Components Number of components to retain select 3 OK Rcmdr: output > Dataset$PC1 <-.PC$scores[,1] > Dataset$PC2 <-.PC$scores[,2] > Dataset$PC3 <-.PC$scores[,3] What we can note from the output is that all of the variables load most highly on principal-component 1. The variables are therefore related most highly to this component. This creates a problem if we wish to assign a meaning to the component, as at the moment, PC1 seems to represent every variable. Although we have identified a number of components to the data, we cannot assign any meaning to these components. This is where the technique of Factor Analysis is of help. Factor Analysis Factor Analysis is probably most easily understood as a technique that redistributes the loadings of the components (see above) so that they can be interpreted. We saw that the principal-components above all loaded highly on component 1. Factor analysis attempts to re-distribute these loadings so that they load on a number of different factors. The hope is that those variables that share similar underlying causes will load together on a single component. The technique used to re-distribute the loadings is called rotation. After a rotation has been applied to the data, the components are called factors. The Rotation Phase We can see from the analysis above that the principal-components are not always easy to interpret as they are often correlated with many variables. In the example above PC1 shows the highest 8

9 Principal Components Analysis: correlating variables and main components Data set: factor.txt (available for download from RGSweb) Rcmdr: commands Statistics Summaries Correlation matrix... Correlation Matrix Variables (pick two or more) Types of correlation Pearson product-moment OK select all variables select Rcmdr: output PC1 PC2 PC3 ACTIVE e e e-01 ART e e e-01 ATTEN e e e-01 COMP e e e-02 COORD e e e-01 DRAW e e e-01 LEXP e e e-01 MAT e e e-01 MOTSK e e e-01 NEWSIT e e e-01 SAINT e e e-01 SENCOM e e e-02 SINT e e e-01 TEMP e e e-01 UNDER e e e-02 VOCAB e e e-01 WRIT e e e-01 loading for all variables apart from temp. Using this matrix it is not easy to assign any description to the factors. In such cases the technique of rotation can be used which transforms the factors to make them more easily interpretable. Orthogonal and Oblique Rotation There are two general types of rotation which can be carried out, Orthogonal and Oblique. Orthogonal rotation refers to the procedure where the computed factors are uncorrelated to one another and in a two factor model this can be graphically represented by the axes remaining at right angles. The Table shown in Figure 2 shows the factor loadings for four variables for a two factor solution before and after an orthogonal rotation. This information is also shown graphically. It can be seen that after rotation the two axes are still at right angles (and hence uncorrelated), this graph therefore represents an orthogonal rotation of components resulting in orthogonal factors. In this case the rotation has resulted in an easy interpretation for the factors. Rcmdr uses the varimax method which attempts to minimise the number of variables which have a high loading on a factor. This enhances the interpretability of the factors. Although other orthogonal rotation methods are available, we shall just deal with varimax. If we allow for some correlation between the factors, sometimes the factor matrix can be simplified (also assuming that it is theoretically justified to have correlated factors). For example in figure 3, if 9

10 the axes went through the dotted lines a simpler pattern matrix would result than would have with orthogonal rotation (keeping the axes at right angles). A rotation carried out which allows for some correlation between the factors is termed OBLIQUE. Oblique rotation has come into favour recently for several reasons. It is unlikely that influences in nature are uncorrelated. Even if they are uncorrelated in the population, they need not be so in the sample. Thus, oblique rotations have often been found to yield substantively meaningful factors. The method Rcmdr uses for oblique rotation is called promax. The table in Figure 3 shows rotated and unrotated two factor solutions for six variables. This information is also shown graphically. It can be seen that the rotated factors are easier to interpret as the variables load highly on only one factor. A factor analysis in R using an orthogonal rotation technique is shown in the example below. 10

11 Figure 2: Orthogonal rotation of Components Factor Two Factor One Initial Components Rotated Factors Component 1 Component 2 Factor 1 Factor 2 v v v v

12 Figure 3: Oblique Rotation of Components Factor Two Factor One Initial Components Rotated Factors Component 1 Component 2 Factor 1 Factor 2 v v v v v v

13 Factor Analysis Data set: factor.txt (available for download from RGSweb) Rcmdr: commands Statistics Dimensional analysis Factor analysis... Factor Analysis Variables (pick three or more) Factor Rotation: Varimax select Factor Scores: None select OK Number of Factors Number of factors to extract select 3 OK select all variables Rcmdr: output Call: factanal(x = ~ACTIVE + ART + ATTEN + COMP + COORD + DRAW + LEXP + MAT + MOTSK + NEWSIT + SAINT + SENCOM + SINT + TEMP + UNDER + VOCAB + WRIT, factors = 3, data = Dataset, scores = "none", rotation = "varimax") Uniquenesses: ACTIVE ART ATTEN COMP COORD DRAW LEXP MAT MOTSK NEWSIT SAINT SENCOM SINT TEMP UNDER VOCAB WRIT Loadings: Factor1 Factor2 Factor3 ACTIVE ART ATTEN COMP COORD DRAW LEXP MAT MOTSK NEWSIT SAINT SENCOM SINT TEMP UNDER VOCAB WRIT Factor1 Factor2 Factor3 SS loadings Proportion Var Cumulative Var Test of the hypothesis that 3 factors are sufficient. The chi square statistic is on 88 degrees of freedom. The p-value is 5.11e-25 13

14 There are a number of things to note from the factor analysis output shown above: The uniqueness gives an indication of the uniqueness of the variable (the variation in the variable that cannot be attributed to any factor), the U test score element in the equation discussed earlier: Test Score = aiq + bexperience + cwriting ability + U test score From this score, we can see that some variables have very small values (eg., SENCOM, VOCAB and COORD) indicating that the factors represent the variation in the variables well, whereas other variables have much higher values (eg., TEMP and MAT), indicating that a smaller amount of the variation in these variables is represented by the factors. From the output, we can also see that the 3 factors account for 63.8% of the variance in the data (this is shown in the Cumulative Var row). Also, the proportion of variance accounted for by each factor is shown in the Proportion Var row. In this case factor 1 accounts for 26.3%. You may note that these statistics are slightly different to those obtained for the PCA model above as the default method for the FA in R is factanal which performs a maximum likelihood factor analysis rather than the principal components. The interpretation and basic theory, however, remain unchanged. The test of the hypothesis that 3 factors are sufficient gives a highly significant chi-square value. This indicates that 3 factors are not sufficient to represent the data. On the basis of this evidence we would ceratinly wish to look at solutions with more than 3 factors. From the Loadings, we can see which variables load on which factor. This information enables us to define the factors and give them labels. We can see that the variables now load highly on particular factors. Using the components (i.e., before any rotation was applied), the variables all loaded onto PC1. Once a rotation has been applied, the variables load on different factors. The table below shows these loadings arranged into order. Table 3: Factor scores Factor 1 VOCAB.875 vocabulary SENCOM.865 Sentence Completion ART.863 articulation LEXP.809 Expressive language UNDER.708 Understanding of language COMP.667 Comprehension Factor 2 SAINT.847 Social Interaction 1 ACTIVE.811 How active SINT.795 Social Interaction 2 Factor 3 COORD.827 Coordination WRIT.786 Writing DRAW.775 Drawing MOTSK.539 Motor skills 14

15 Interpreting Factors The rotated factor matrix provides a much clearer interpretation of the results as can be seen in table 3. The unrotated component matrix did not enable an easy interpretation of the components, however, once a rotation has been completed the interpretation of the factors becomes clearer. Factor 1 relates to linguistic skills, Factor 2 to Social skills and Factor 3 to practical skills. The data set can now be described in terms of three underlying factors instead of 17 variables. The loadings for the factors can be saved as new variables and entered into other analyses such as regression. A graphical illustration of factor analysis Using similar data to that used above, three factors were extracted but couldn t be easily interpreted as Table 4 shows 1 : Table 4: Factor Loadings for the 3-Factor Model Variable Factor 1 Factor 2 Factor 3 Articulation Attention Comprehension Coordination Drawing Memory Motor Skill Sentence Completion Temperament Writing The factors can be interpreted by identifying the variables they are highly related to. For example, if factor 1 is strongly related to the variables, motor skill, drawing and coordination, this could be interpreted as representing physical dexterity. Such a clear-cut identification of the three factors identified in Table 4 is not possible as they are related to many variables. The difficulty with providing interpretations for the factors in such circumstances is demonstrated graphically in Figure 4, where the first two factors from the 3-factor model identified above are shown in a simple twodimensional scatter plot 2. This graph suggests that there are two distinct factors in the data which are represented clearly as two clusters of points. The factors are represented as the axes of the graph and those points falling on or close to an axis indicate a strong relationship with that factor. We can see from the graph that the variables fall midway between the axes and are therefore related to both factors. The presence of the two factors shown in Figure 4 is not obvious from the factor loadings of the initial factors (see Table 4) as the variables are not exclusively related to one factor or the other. Attempting to interpret the factor loading scores obtained in a principal components analysis directly is therefore not an ideal method to identify distinct factors. It can be seen from Figure 5 that the rotated axes in the graph pass much closer to the clusters of points than do the principal component axes. The precise degree of rotation is determined using one of a number of algorithms available in most common statistical software packages. Popular methods include minimizing the number of variables which have high loadings to enhance the interpretability of the factors, and minimizing the number of factors which provides simpler interpretations of the 1 See Hutcheson and Sofroniou, 1999, for all references. 2 In order to show this information in two dimensions, only the first two factors and those variables which form part of a two-factor solution are shown. The variables attention and temperament which form a third factor are omitted from this demonstration. 15

16 Figure 4: A graphical representation of the principal components Figure 5: A graphical representation of factor rotation 16

17 variables (refer to Kim and Mueller, 1994, for a discussion of rotation techniques). Although there are many rotation techniques available, in practice, the different techniques tend to produce similar results when there is a large sample and the factors are relatively well defined (Fava and Velicer, 1992). The factor loadings for two rotation techniques, orthogonal and oblique, are shown in Table 5. In this example, the factors can be interpreted directly from the rotated factor loadings as variables tend to load highly on only one factor. For example, the variable attention, which correlated with all three of the principal components more or less evenly, only correlates highly with a single rotated factor (factor 3). A similar pattern can be seen for the other variables in Table 5. Table 5: A Component Matrix showing Orthogonal and Oblique Rotation of Factors Initial Components Rotated Factors Orthogonal Oblique Variable Articulation Attention Comprehension Coordination Drawing Memory Motor Skills Sent. Comp Temperament Writing

Scaling Techniques in Political Science

Scaling Techniques in Political Science Scaling Techniques in Political Science Eric Guntermann March 14th, 2014 Eric Guntermann Scaling Techniques in Political Science March 14th, 2014 1 / 19 What you need R RStudio R code file Datasets You

More information

Statistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010

Statistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010 Statistical Models for Management Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon February 24 26, 2010 Graeme Hutcheson, University of Manchester Exploratory regression and model

More information

Chapter 1. Using the Cluster Analysis. Background Information

Chapter 1. Using the Cluster Analysis. Background Information Chapter 1 Using the Cluster Analysis Background Information Cluster analysis is the name of a multivariate technique used to identify similar characteristics in a group of observations. In cluster analysis,

More information

Oblique Factor Rotation Explained

Oblique Factor Rotation Explained Oblique Factor Rotation Explained Grant B. Morgan Baylor University September 6, 2014 A Step-by-Step Look at Promax Factor Rotation For this post, I will continue my attempt to demistify factor rotation

More information

GRAPHS AND STATISTICS Residuals Common Core Standard

GRAPHS AND STATISTICS Residuals Common Core Standard B Graphs and Statistics, Lesson 7, Residuals (r. 2018) GRAPHS AND STATISTICS Residuals Common Core Standard Next Generation Standard S-ID.B.6b Informally assess the fit of a function by plotting and analyzing

More information

SPSS INSTRUCTION CHAPTER 9

SPSS INSTRUCTION CHAPTER 9 SPSS INSTRUCTION CHAPTER 9 Chapter 9 does no more than introduce the repeated-measures ANOVA, the MANOVA, and the ANCOVA, and discriminant analysis. But, you can likely envision how complicated it can

More information

Generalized Procrustes Analysis Example with Annotation

Generalized Procrustes Analysis Example with Annotation Generalized Procrustes Analysis Example with Annotation James W. Grice, Ph.D. Oklahoma State University th February 4, 2007 Generalized Procrustes Analysis (GPA) is particularly useful for analyzing repertory

More information

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview Chapter 888 Introduction This procedure generates D-optimal designs for multi-factor experiments with both quantitative and qualitative factors. The factors can have a mixed number of levels. For example,

More information

Principal Components Analysis with Spatial Data

Principal Components Analysis with Spatial Data Principal Components Analysis with Spatial Data A SpaceStat Software Tutorial Copyright 2013, BioMedware, Inc. (www.biomedware.com). All rights reserved. SpaceStat and BioMedware are trademarks of BioMedware,

More information

Clustering and Visualisation of Data

Clustering and Visualisation of Data Clustering and Visualisation of Data Hiroshi Shimodaira January-March 28 Cluster analysis aims to partition a data set into meaningful or useful groups, based on distances between data points. In some

More information

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Exploratory regression and model selection

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Exploratory regression and model selection Statistical Modelling for Social Scientists Manchester University January 20, 21 and 24, 2011 Graeme Hutcheson, University of Manchester Exploratory regression and model selection The lecture notes, exercises

More information

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables Further Maths Notes Common Mistakes Read the bold words in the exam! Always check data entry Remember to interpret data with the multipliers specified (e.g. in thousands) Write equations in terms of variables

More information

Introduction to Factor Analysis for Marketing

Introduction to Factor Analysis for Marketing Introduction to Factor Analysis for Marketing SKIM/Sawtooth Software Conference 2016, Rome Chris Chapman, Google. April 2016. Special thanks to Josh Lewandowski at Google for helpful feedback (errors are

More information

Week 7 Picturing Network. Vahe and Bethany

Week 7 Picturing Network. Vahe and Bethany Week 7 Picturing Network Vahe and Bethany Freeman (2005) - Graphic Techniques for Exploring Social Network Data The two main goals of analyzing social network data are identification of cohesive groups

More information

Step-by-Step Guide to Advanced Genetic Analysis

Step-by-Step Guide to Advanced Genetic Analysis Step-by-Step Guide to Advanced Genetic Analysis Page 1 Introduction In the previous document, 1 we covered the standard genetic analyses available in JMP Genomics. Here, we cover the more advanced options

More information

Quantitative Methods in Management

Quantitative Methods in Management Quantitative Methods in Management MBA Glasgow University March 20-23, 2009 Luiz Moutinho, University of Glasgow Graeme Hutcheson, University of Manchester Exploratory Regression The lecture notes, exercises

More information

Graphical Analysis of Data using Microsoft Excel [2016 Version]

Graphical Analysis of Data using Microsoft Excel [2016 Version] Graphical Analysis of Data using Microsoft Excel [2016 Version] Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data Using Excel for Graphical Analysis of Data Introduction In several upcoming labs, a primary goal will be to determine the mathematical relationship between two variable physical parameters. Graphs are

More information

Using Excel for Graphical Analysis of Data

Using Excel for Graphical Analysis of Data EXERCISE Using Excel for Graphical Analysis of Data Introduction In several upcoming experiments, a primary goal will be to determine the mathematical relationship between two variable physical parameters.

More information

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

SELECTION OF A MULTIVARIATE CALIBRATION METHOD SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper

More information

Chapter 13 Multivariate Techniques. Chapter Table of Contents

Chapter 13 Multivariate Techniques. Chapter Table of Contents Chapter 13 Multivariate Techniques Chapter Table of Contents Introduction...279 Principal Components Analysis...280 Canonical Correlation...289 References...298 278 Chapter 13. Multivariate Techniques

More information

Multiple Regression White paper

Multiple Regression White paper +44 (0) 333 666 7366 Multiple Regression White paper A tool to determine the impact in analysing the effectiveness of advertising spend. Multiple Regression In order to establish if the advertising mechanisms

More information

Lecture 27: Review. Reading: All chapters in ISLR. STATS 202: Data mining and analysis. December 6, 2017

Lecture 27: Review. Reading: All chapters in ISLR. STATS 202: Data mining and analysis. December 6, 2017 Lecture 27: Review Reading: All chapters in ISLR. STATS 202: Data mining and analysis December 6, 2017 1 / 16 Final exam: Announcements Tuesday, December 12, 8:30-11:30 am, in the following rooms: Last

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

Lecture 25: Review I

Lecture 25: Review I Lecture 25: Review I Reading: Up to chapter 5 in ISLR. STATS 202: Data mining and analysis Jonathan Taylor 1 / 18 Unsupervised learning In unsupervised learning, all the variables are on equal standing,

More information

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9

Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Survey of Math: Excel Spreadsheet Guide (for Excel 2016) Page 1 of 9 Contents 1 Introduction to Using Excel Spreadsheets 2 1.1 A Serious Note About Data Security.................................... 2 1.2

More information

MODERN FACTOR ANALYSIS

MODERN FACTOR ANALYSIS MODERN FACTOR ANALYSIS Harry H. Harman «ö THE pigj UNIVERSITY OF CHICAGO PRESS Contents LIST OF ILLUSTRATIONS GUIDE TO NOTATION xv xvi Parti Foundations of Factor Analysis 1. INTRODUCTION 3 1.1. Brief

More information

A presentation from the Telemetrics Lab

A presentation from the Telemetrics Lab A presentation from the Telemetrics Lab Telemetrics lab Department of Psychology Northwestern University Evanston, Illinois USA November, 2010 Outline 1 Data from a Correlation Matrix Simulated data Real

More information

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract

Chemometrics. Description of Pirouette Algorithms. Technical Note. Abstract 19-1214 Chemometrics Technical Note Description of Pirouette Algorithms Abstract This discussion introduces the three analysis realms available in Pirouette and briefly describes each of the algorithms

More information

Lecture 07 Dimensionality Reduction with PCA

Lecture 07 Dimensionality Reduction with PCA Lecture 07 Dimensionality Reduction with PCA 10 February 2016 Taylor B. Arnold Yale Statistics STAT 365/665 1/9 As we have started to see, the curse of dimensionality stops us from being able to fit arbitrarily

More information

CREATING THE ANALYSIS

CREATING THE ANALYSIS Chapter 14 Multiple Regression Chapter Table of Contents CREATING THE ANALYSIS...214 ModelInformation...217 SummaryofFit...217 AnalysisofVariance...217 TypeIIITests...218 ParameterEstimates...218 Residuals-by-PredictedPlot...219

More information

Study Guide. Module 1. Key Terms

Study Guide. Module 1. Key Terms Study Guide Module 1 Key Terms general linear model dummy variable multiple regression model ANOVA model ANCOVA model confounding variable squared multiple correlation adjusted squared multiple correlation

More information

Analysis and Latent Semantic Indexing

Analysis and Latent Semantic Indexing 18 Principal Component Analysis and Latent Semantic Indexing Understand the basics of principal component analysis and latent semantic index- Lab Objective: ing. Principal Component Analysis Understanding

More information

Excel Core Certification

Excel Core Certification Microsoft Office Specialist 2010 Microsoft Excel Core Certification 2010 Lesson 6: Working with Charts Lesson Objectives This lesson introduces you to working with charts. You will look at how to create

More information

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Your Name: Section: 36-201 INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression Objectives: 1. To learn how to interpret scatterplots. Specifically you will investigate, using

More information

Basics of Multivariate Modelling and Data Analysis

Basics of Multivariate Modelling and Data Analysis Basics of Multivariate Modelling and Data Analysis Kurt-Erik Häggblom 9. Linear regression with latent variables 9.1 Principal component regression (PCR) 9.2 Partial least-squares regression (PLS) [ mostly

More information

General Instructions. Questions

General Instructions. Questions CS246: Mining Massive Data Sets Winter 2018 Problem Set 2 Due 11:59pm February 8, 2018 Only one late period is allowed for this homework (11:59pm 2/13). General Instructions Submission instructions: These

More information

8 th Grade Mathematics Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the

8 th Grade Mathematics Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the 8 th Grade Mathematics Unpacked Content For the new Common Core standards that will be effective in all North Carolina schools in the 2012-13. This document is designed to help North Carolina educators

More information

DESIGN OF EXPERIMENTS and ROBUST DESIGN

DESIGN OF EXPERIMENTS and ROBUST DESIGN DESIGN OF EXPERIMENTS and ROBUST DESIGN Problems in design and production environments often require experiments to find a solution. Design of experiments are a collection of statistical methods that,

More information

Tutorial #1: Using Latent GOLD choice to Estimate Discrete Choice Models

Tutorial #1: Using Latent GOLD choice to Estimate Discrete Choice Models Tutorial #1: Using Latent GOLD choice to Estimate Discrete Choice Models In this tutorial, we analyze data from a simple choice-based conjoint (CBC) experiment designed to estimate market shares (choice

More information

Step-by-Step Guide to Relatedness and Association Mapping Contents

Step-by-Step Guide to Relatedness and Association Mapping Contents Step-by-Step Guide to Relatedness and Association Mapping Contents OBJECTIVES... 2 INTRODUCTION... 2 RELATEDNESS MEASURES... 2 POPULATION STRUCTURE... 6 Q-K ASSOCIATION ANALYSIS... 10 K MATRIX COMPRESSION...

More information

Tutorial 3. Jun Xu, Teaching Asistant csjunxu/ February 16, COMP4134 Biometrics Authentication

Tutorial 3. Jun Xu, Teaching Asistant   csjunxu/ February 16, COMP4134 Biometrics Authentication Tutorial 3 Jun Xu, Teaching Asistant http://www4.comp.polyu.edu.hk/ csjunxu/ COMP4134 Biometrics Authentication February 16, 2017 Table of Contents Problems Problem 1: Answer the questions Problem 2: Pattern

More information

An introduction to plotting data

An introduction to plotting data An introduction to plotting data Eric D. Black California Institute of Technology February 25, 2014 1 Introduction Plotting data is one of the essential skills every scientist must have. We use it on a

More information

Workshop 8: Model selection

Workshop 8: Model selection Workshop 8: Model selection Selecting among candidate models requires a criterion for evaluating and comparing models, and a strategy for searching the possibilities. In this workshop we will explore some

More information

Chapter 2 Basic Structure of High-Dimensional Spaces

Chapter 2 Basic Structure of High-Dimensional Spaces Chapter 2 Basic Structure of High-Dimensional Spaces Data is naturally represented geometrically by associating each record with a point in the space spanned by the attributes. This idea, although simple,

More information

Enduring Understandings: Some basic math skills are required to be reviewed in preparation for the course.

Enduring Understandings: Some basic math skills are required to be reviewed in preparation for the course. Curriculum Map for Functions, Statistics and Trigonometry September 5 Days Targeted NJ Core Curriculum Content Standards: N-Q.1, N-Q.2, N-Q.3, A-CED.1, A-REI.1, A-REI.3 Enduring Understandings: Some basic

More information

8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks

8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks 8 th Grade Pre Algebra Pacing Guide 1 st Nine Weeks MS Objective CCSS Standard I Can Statements Included in MS Framework + Included in Phase 1 infusion Included in Phase 2 infusion 1a. Define, classify,

More information

DI TRANSFORM. The regressive analyses. identify relationships

DI TRANSFORM. The regressive analyses. identify relationships July 2, 2015 DI TRANSFORM MVstats TM Algorithm Overview Summary The DI Transform Multivariate Statistics (MVstats TM ) package includes five algorithm options that operate on most types of geologic, geophysical,

More information

Integers & Absolute Value Properties of Addition Add Integers Subtract Integers. Add & Subtract Like Fractions Add & Subtract Unlike Fractions

Integers & Absolute Value Properties of Addition Add Integers Subtract Integers. Add & Subtract Like Fractions Add & Subtract Unlike Fractions Unit 1: Rational Numbers & Exponents M07.A-N & M08.A-N, M08.B-E Essential Questions Standards Content Skills Vocabulary What happens when you add, subtract, multiply and divide integers? What happens when

More information

Data Mining - Data. Dr. Jean-Michel RICHER Dr. Jean-Michel RICHER Data Mining - Data 1 / 47

Data Mining - Data. Dr. Jean-Michel RICHER Dr. Jean-Michel RICHER Data Mining - Data 1 / 47 Data Mining - Data Dr. Jean-Michel RICHER 2018 jean-michel.richer@univ-angers.fr Dr. Jean-Michel RICHER Data Mining - Data 1 / 47 Outline 1. Introduction 2. Data preprocessing 3. CPA with R 4. Exercise

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Graphical Presentation for Statistical Data (Relevant to AAT Examination Paper 4: Business Economics and Financial Mathematics) Introduction

Graphical Presentation for Statistical Data (Relevant to AAT Examination Paper 4: Business Economics and Financial Mathematics) Introduction Graphical Presentation for Statistical Data (Relevant to AAT Examination Paper 4: Business Economics and Financial Mathematics) Y O Lam, SCOPE, City University of Hong Kong Introduction The most convenient

More information

SYS 6021 Linear Statistical Models

SYS 6021 Linear Statistical Models SYS 6021 Linear Statistical Models Project 2 Spam Filters Jinghe Zhang Summary The spambase data and time indexed counts of spams and hams are studied to develop accurate spam filters. Static models are

More information

SAT Released Test 8 Problem #28

SAT Released Test 8 Problem #28 SAT Released Test 8 Problem #28 28.) The 22 students in a health class conducted an experiment in which they each recorded their pulse rates, in beats per minute, before and after completing a light exercise

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

1 Introduction to Using Excel Spreadsheets

1 Introduction to Using Excel Spreadsheets Survey of Math: Excel Spreadsheet Guide (for Excel 2007) Page 1 of 6 1 Introduction to Using Excel Spreadsheets This section of the guide is based on the file (a faux grade sheet created for messing with)

More information

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability 7 Fractions GRADE 7 FRACTIONS continue to develop proficiency by using fractions in mental strategies and in selecting and justifying use; develop proficiency in adding and subtracting simple fractions;

More information

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018

Performance Estimation and Regularization. Kasthuri Kannan, PhD. Machine Learning, Spring 2018 Performance Estimation and Regularization Kasthuri Kannan, PhD. Machine Learning, Spring 2018 Bias- Variance Tradeoff Fundamental to machine learning approaches Bias- Variance Tradeoff Error due to Bias:

More information

CHAPTER 5. BASIC STEPS FOR MODEL DEVELOPMENT

CHAPTER 5. BASIC STEPS FOR MODEL DEVELOPMENT CHAPTER 5. BASIC STEPS FOR MODEL DEVELOPMENT This chapter provides step by step instructions on how to define and estimate each of the three types of LC models (Cluster, DFactor or Regression) and also

More information

Advanced Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

Advanced Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Advanced Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras Lecture - 35 Quadratic Programming In this lecture, we continue our discussion on

More information

For our example, we will look at the following factors and factor levels.

For our example, we will look at the following factors and factor levels. In order to review the calculations that are used to generate the Analysis of Variance, we will use the statapult example. By adjusting various settings on the statapult, you are able to throw the ball

More information

Cluster Analysis Gets Complicated

Cluster Analysis Gets Complicated Cluster Analysis Gets Complicated Collinearity is a natural problem in clustering. So how can researchers get around it? Cluster analysis is widely used in segmentation studies for several reasons. First

More information

Facial Expression Detection Using Implemented (PCA) Algorithm

Facial Expression Detection Using Implemented (PCA) Algorithm Facial Expression Detection Using Implemented (PCA) Algorithm Dileep Gautam (M.Tech Cse) Iftm University Moradabad Up India Abstract: Facial expression plays very important role in the communication with

More information

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS Toomas Kirt Supervisor: Leo Võhandu Tallinn Technical University Toomas.Kirt@mail.ee Abstract: Key words: For the visualisation

More information

MAT 003 Brian Killough s Instructor Notes Saint Leo University

MAT 003 Brian Killough s Instructor Notes Saint Leo University MAT 003 Brian Killough s Instructor Notes Saint Leo University Success in online courses requires self-motivation and discipline. It is anticipated that students will read the textbook and complete sample

More information

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Statistical Analysis of Metabolomics Data Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte Outline Introduction Data pre-treatment 1. Normalization 2. Centering,

More information

Version 2.4 of Idiogrid

Version 2.4 of Idiogrid Version 2.4 of Idiogrid Structural and Visual Modifications 1. Tab delimited grids in Grid Data window. The most immediately obvious change to this newest version of Idiogrid will be the tab sheets that

More information

5. Compare the volume of a three dimensional figure to surface area.

5. Compare the volume of a three dimensional figure to surface area. 5. Compare the volume of a three dimensional figure to surface area. 1. What are the inferences that can be drawn from sets of data points having a positive association and a negative association. 2. Why

More information

Sections 3-6 have been substantially modified to make the paper more comprehensible. Several figures have been re-plotted and figure captions changed.

Sections 3-6 have been substantially modified to make the paper more comprehensible. Several figures have been re-plotted and figure captions changed. Response to First Referee s Comments General Comments Sections 3-6 have been substantially modified to make the paper more comprehensible. Several figures have been re-plotted and figure captions changed.

More information

MULTIVARIATE ANALYSIS USING R

MULTIVARIATE ANALYSIS USING R MULTIVARIATE ANALYSIS USING R B N Mandal I.A.S.R.I., Library Avenue, New Delhi 110 012 bnmandal @iasri.res.in 1. Introduction This article gives an exposition of how to use the R statistical software for

More information

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM

Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM Frequently Asked Questions Updated 2006 (TRIM version 3.51) PREPARING DATA & RUNNING TRIM * Which directories are used for input files and output files? See menu-item "Options" and page 22 in the manual.

More information

Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2. Xi Wang and Ronald K. Hambleton

Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2. Xi Wang and Ronald K. Hambleton Detecting Polytomous Items That Have Drifted: Using Global Versus Step Difficulty 1,2 Xi Wang and Ronald K. Hambleton University of Massachusetts Amherst Introduction When test forms are administered to

More information

Using Large Data Sets Workbook Version A (MEI)

Using Large Data Sets Workbook Version A (MEI) Using Large Data Sets Workbook Version A (MEI) 1 Index Key Skills Page 3 Becoming familiar with the dataset Page 3 Sorting and filtering the dataset Page 4 Producing a table of summary statistics with

More information

Chapter Two: Descriptive Methods 1/50

Chapter Two: Descriptive Methods 1/50 Chapter Two: Descriptive Methods 1/50 2.1 Introduction 2/50 2.1 Introduction We previously said that descriptive statistics is made up of various techniques used to summarize the information contained

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Grade 8 Unit 1 Congruence and Similarity (4 Weeks)

Grade 8 Unit 1 Congruence and Similarity (4 Weeks) Grade 8 Unit Congruence and Similarity (4 Weeks) Stage Desired Results Established Goals Unit Description In this unit, students will explore and describe the effects of translations, rotations, reflections

More information

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form.

This chapter will show how to organize data and then construct appropriate graphs to represent the data in a concise, easy-to-understand form. CHAPTER 2 Frequency Distributions and Graphs Objectives Organize data using frequency distributions. Represent data in frequency distributions graphically using histograms, frequency polygons, and ogives.

More information

( ) =cov X Y = W PRINCIPAL COMPONENT ANALYSIS. Eigenvectors of the covariance matrix are the principal components

( ) =cov X Y = W PRINCIPAL COMPONENT ANALYSIS. Eigenvectors of the covariance matrix are the principal components Review Lecture 14 ! PRINCIPAL COMPONENT ANALYSIS Eigenvectors of the covariance matrix are the principal components 1. =cov X Top K principal components are the eigenvectors with K largest eigenvalues

More information

Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool.

Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool. Tina Memo No. 2014-004 Internal Report Tutorial: Using Tina Vision s Quantitative Pattern Recognition Tool. P.D.Tar. Last updated 07 / 06 / 2014 ISBE, Medical School, University of Manchester, Stopford

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and

More information

Digital image processing

Digital image processing Digital image processing Image enhancement algorithms: grey scale transformations Any digital image can be represented mathematically in matrix form. The number of lines in the matrix is the number of

More information

Whitepaper Spain SEO Ranking Factors 2012

Whitepaper Spain SEO Ranking Factors 2012 Whitepaper Spain SEO Ranking Factors 2012 Authors: Marcus Tober, Sebastian Weber Searchmetrics GmbH Greifswalder Straße 212 10405 Berlin Phone: +49-30-3229535-0 Fax: +49-30-3229535-99 E-Mail: info@searchmetrics.com

More information

Categorical explanatory variables

Categorical explanatory variables Hutcheson, G. D. (2011). Tutorial: Categorical Explanatory Variables. Journal of Modelling in Management. 6, 2: 225 236. NOTE: this is a slightly updated version of this paper which is distributed to correct

More information

Unit 3: Congruence & Similarity

Unit 3: Congruence & Similarity Approximate Time Frame: 6 weeks Connections to Previous Learning: In previous grades, students made scale drawings of geometric figures and solved problems involving angle measure, surface area, and volume.

More information

Exercise: Graphing and Least Squares Fitting in Quattro Pro

Exercise: Graphing and Least Squares Fitting in Quattro Pro Chapter 5 Exercise: Graphing and Least Squares Fitting in Quattro Pro 5.1 Purpose The purpose of this experiment is to become familiar with using Quattro Pro to produce graphs and analyze graphical data.

More information

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Exploratory data analysis tasks Examine the data, in search of structures

More information

The problem. Needed for contract. 10% Reduction. Resources available

The problem. Needed for contract. 10% Reduction. Resources available As part of May 2010 s P2 paper, a question was asked (Q6) which required candidates to prepare a graph showing the optimum production plan for the organisation in question. In his post examination guide

More information

Mathematics 308 Geometry. Chapter 9. Drawing three dimensional objects

Mathematics 308 Geometry. Chapter 9. Drawing three dimensional objects Mathematics 308 Geometry Chapter 9. Drawing three dimensional objects In this chapter we will see how to draw three dimensional objects with PostScript. The task will be made easier by a package of routines

More information

Interesting Application. Linear Algebra

Interesting Application. Linear Algebra MATH 308A PROJECT: An Interesting Application of Linear Algebra Produced by: Kristen Pilawski Math 308 A Professor James King Fall 2001 Application: Space Shuttle Control Systems Abstract: This report

More information

Year 10 General Mathematics Unit 2

Year 10 General Mathematics Unit 2 Year 11 General Maths Year 10 General Mathematics Unit 2 Bivariate Data Chapter 4 Chapter Four 1 st Edition 2 nd Edition 2013 4A 1, 2, 3, 4, 6, 7, 8, 9, 10, 11 1, 2, 3, 4, 6, 7, 8, 9, 10, 11 2F (FM) 1,

More information

NARROW CORRIDOR. Teacher s Guide Getting Started. Lay Chin Tan Singapore

NARROW CORRIDOR. Teacher s Guide Getting Started. Lay Chin Tan Singapore Teacher s Guide Getting Started Lay Chin Tan Singapore Purpose In this two-day lesson, students are asked to determine whether large, long, and bulky objects fit around the corner of a narrow corridor.

More information

8. MINITAB COMMANDS WEEK-BY-WEEK

8. MINITAB COMMANDS WEEK-BY-WEEK 8. MINITAB COMMANDS WEEK-BY-WEEK In this section of the Study Guide, we give brief information about the Minitab commands that are needed to apply the statistical methods in each week s study. They are

More information

Infographics and Visualisation (or: Beyond the Pie Chart) LSS: ITNPBD4, 1 November 2016

Infographics and Visualisation (or: Beyond the Pie Chart) LSS: ITNPBD4, 1 November 2016 Infographics and Visualisation (or: Beyond the Pie Chart) LSS: ITNPBD4, 1 November 2016 Overview Overview (short: we covered most of this in the tutorial) Why infographics and visualisation What s the

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 245 Introduction This procedure generates R control charts for variables. The format of the control charts is fully customizable. The data for the subgroups can be in a single column or in multiple

More information

SPSS Basics for Probability Distributions

SPSS Basics for Probability Distributions Built-in Statistical Functions in SPSS Begin by defining some variables in the Variable View of a data file, save this file as Probability_Distributions.sav and save the corresponding output file as Probability_Distributions.spo.

More information

Chapter 4: Analyzing Bivariate Data with Fathom

Chapter 4: Analyzing Bivariate Data with Fathom Chapter 4: Analyzing Bivariate Data with Fathom Summary: Building from ideas introduced in Chapter 3, teachers continue to analyze automobile data using Fathom to look for relationships between two quantitative

More information

Whitepaper Italy SEO Ranking Factors 2012

Whitepaper Italy SEO Ranking Factors 2012 Whitepaper Italy SEO Ranking Factors 2012 Authors: Marcus Tober, Sebastian Weber Searchmetrics GmbH Greifswalder Straße 212 10405 Berlin Phone: +49-30-3229535-0 Fax: +49-30-3229535-99 E-Mail: info@searchmetrics.com

More information

JUST THE MATHS UNIT NUMBER STATISTICS 1 (The presentation of data) A.J.Hobson

JUST THE MATHS UNIT NUMBER STATISTICS 1 (The presentation of data) A.J.Hobson JUST THE MATHS UNIT NUMBER 18.1 STATISTICS 1 (The presentation of data) by A.J.Hobson 18.1.1 Introduction 18.1.2 The tabulation of data 18.1.3 The graphical representation of data 18.1.4 Exercises 18.1.5

More information

SAS Graphics Macros for Latent Class Analysis Users Guide

SAS Graphics Macros for Latent Class Analysis Users Guide SAS Graphics Macros for Latent Class Analysis Users Guide Version 2.0.1 John Dziak The Methodology Center Stephanie Lanza The Methodology Center Copyright 2015, Penn State. All rights reserved. Please

More information

Tutorial 1: Welded Frame - Problem Description

Tutorial 1: Welded Frame - Problem Description Tutorial 1: Welded Frame - Problem Description Introduction In this first tutorial, we will analyse a simple frame: firstly as a welded frame, and secondly as a pin jointed truss. In each case, we will

More information