Generalized Procrustes Analysis Example with Annotation

Similar documents
Version 2.4 of Idiogrid

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

Statistical Models for Management. Instituto Superior de Ciências do Trabalho e da Empresa (ISCTE) Lisbon. February 24 26, 2010

SPSS INSTRUCTION CHAPTER 9

Applications Guide 3WayPack TUCKALS3: Analysis of Chopin Data

D-Optimal Designs. Chapter 888. Introduction. D-Optimal Design Overview

If the active datasheet is empty when the StatWizard appears, a dialog box is displayed to assist in entering data.

Chapter 13 Multivariate Techniques. Chapter Table of Contents

StatCalc User Manual. Version 9 for Mac and Windows. Copyright 2018, AcaStat Software. All rights Reserved.

Comparing internal and ideal point preference mapping solutions. Wakeling, Hasted and Macfie

Spatializing GIS Commands with Self-Organizing Maps. Jochen Wendel Barbara P. Buttenfield Roland J. Viger Jeremy M. Smith

Multiple Regression White paper

WebGrid: Knowledge Modeling and Inference through the World Wide Web

Laboratory for Two-Way ANOVA: Interactions

Assignment 2. Classification and Regression using Linear Networks, Multilayer Perceptron Networks, and Radial Basis Functions

Predict Outcomes and Reveal Relationships in Categorical Data

Introduction to Factor Analysis for Marketing

Further Maths Notes. Common Mistakes. Read the bold words in the exam! Always check data entry. Write equations in terms of variables

Statistical Good Practice Guidelines. 1. Introduction. Contents. SSC home Using Excel for Statistics - Tips and Warnings

MSA220 - Statistical Learning for Big Data

Using the DATAMINE Program

The Solution to the Factorial Analysis of Variance

Tips on JMP ing into Mixture Experimentation

What s New in iogas Version 6.3

Supplementary Material

Week 7 Picturing Network. Vahe and Bethany

Chapter 1. Using the Cluster Analysis. Background Information

Rep 5. Conceptual Representation Software. RepGrid Manual for Version 1.0. March 2010

MULTIVARIATE ANALYSIS USING R

Dimensionality Reduction, including by Feature Selection.

TELCOM2125: Network Science and Analysis

New York State Department of Health Medicaid Perinatal Care Quality Improvement Project

Source df SS MS F A a-1 [A] [T] SS A. / MS S/A S/A (a)(n-1) [AS] [A] SS S/A. / MS BxS/A A x B (a-1)(b-1) [AB] [A] [B] + [T] SS AxB

Step-by-Step Guide to Advanced Genetic Analysis

If you are not familiar with IMP you should download and read WhatIsImp and CoordGenManual before proceeding.

Attention modulates spatial priority maps in human occipital, parietal, and frontal cortex

May 1, CODY, Error Backpropagation, Bischop 5.3, and Support Vector Machines (SVM) Bishop Ch 7. May 3, Class HW SVM, PCA, and K-means, Bishop Ch

Sage: Symmetry and Asymmetry in Geometric Data Version 1.21 (compiled 03/1114)

Multi Time Series Rev

7.1 INTRODUCTION Wavelet Transform is a popular multiresolution analysis tool in image processing and

Cognalysis TM Reserving System User Manual

Query Studio Training Guide Cognos 8 February 2010 DRAFT. Arkansas Public School Computer Network 101 East Capitol, Suite 101 Little Rock, AR 72201

The Use of Biplot Analysis and Euclidean Distance with Procrustes Measure for Outliers Detection

0 The PMAnalyzer software and basic theory

ENVI Tutorial: Geologic Hyperspectral Analysis

Package snm. July 20, 2018

Section 1. Introduction. Section 2. Getting Started

PCAGen7a ThreeDPCA7 CVAGen7b ThreeDCVA7 MorphoJ Variation Comparison tpsrelw tpsrelw MorphoJ tpsrelw tpsrelw MorphoJ

Matlab Introduction. Scalar Variables and Arithmetic Operators

Package Funclustering

Minitab 17 commands Prepared by Jeffrey S. Simonoff

For Additional Information...

Ordination (Guerrero Negro)

Multicollinearity and Validation CIVL 7012/8012

Introduction to Software Engineering

Tutorial: RNA-Seq Analysis Part II (Tracks): Non-Specific Matches, Mapping Modes and Expression measures

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

Modelling and Visualization of High Dimensional Data. Sample Examination Paper

Supervision Guide For software version September 2013 and later OCR 2014

Single Subject Demo Data Instructions 1) click "New" and answer "No" to the "spatially preprocess" question.

SOST 201 September 20, Stem-and-leaf display 2. Miscellaneous issues class limits, rounding, and interval width.

Statistics with a Hemacytometer

IBM SPSS Categories. Predict outcomes and reveal relationships in categorical data. Highlights. With IBM SPSS Categories you can:

Clustering K-means. Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, Carlos Guestrin

Importance Sampling Spherical Harmonics

IBM SPSS Categories 23

Evaluation of texture features for image segmentation

8. MINITAB COMMANDS WEEK-BY-WEEK

Excel Primer CH141 Fall, 2017

Spatial Patterns Point Pattern Analysis Geographic Patterns in Areal Data

Frequency, proportional, and percentage distributions.

Differential Item Functioning Analyses with STDIF: User s Guide April L. Zenisky, Frédéric Robin, and Ronald K. Hambleton [Version 6/15/2009]

Genotype x Environmental Analysis with R for Windows

Collaborative Filtering Based on Iterative Principal Component Analysis. Dohyun Kim and Bong-Jin Yum*

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

10-701/15-781, Fall 2006, Final

Product Catalog. AcaStat. Software

Fault detection with principal component pursuit method

Does BRT return a predicted group membership or value for continuous variable? Yes, in the $fit object In this dataset, R 2 =0.346.

A biometric iris recognition system based on principal components analysis, genetic algorithms and cosine-distance

eschoolplus+ Cognos Query Studio Training Guide Version 2.4

Thermal Face Recognition Matching Algorithms Performance

How to use FSBforecast Excel add in for regression analysis

ORACLE USER PRODUCTIVITY KIT USAGE TRACKING ADMINISTRATION & REPORTING RELEASE SERVICE PACK 1 PART NO. E

Machine Learning for Pre-emptive Identification of Performance Problems in UNIX Servers Helen Cunningham

Shape spaces. Shape usually defined explicitly to be residual: independent of size.

A Framework for Explanation of Machine Learning Decisions

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Right-click on whatever it is you are trying to change Get help about the screen you are on Help Help Get help interpreting a table

Basics of Multivariate Modelling and Data Analysis

4. Descriptive Statistics: Measures of Variability and Central Tendency

Performance optimization of a rotary mower using Taguchi method

Calculating a PCA and a MDS on a fingerprint data set

HW Assignment 3 (Due by 9:00am on Mar 6)

Principal motion: PCA-based reconstruction of motion histograms

Select Cases. Select Cases GRAPHS. The Select Cases command excludes from further. selection criteria. Select Use filter variables

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Dimension reduction : PCA and Clustering

Transcription:

Generalized Procrustes Analysis Example with Annotation James W. Grice, Ph.D. Oklahoma State University th February 4, 2007 Generalized Procrustes Analysis (GPA) is particularly useful for analyzing repertory grid data collected from numerous individuals or from the same individual on different occasions. In this brief paper the common steps in GPA will be demonstrated using example grids taken from a recent encyclopedia entry by Grice (2007). Consider the following ratings of six employees by four managers: Manager 1 Manger 2 John Bob Amy Jan Fred Jill John Bob Amy Jan Fred Jill Extraverted 1 1 5 4 5 3 Blunt 2 3 1 3 4 1 Sharing 5 5 2 1 2 3 Patient 5 1 4 2 2 3 Motivated 2 2 4 3 4 1 Creative 5 2 3 4 1 3 Funny 1 2 2 2 4 1 Outgoing 2 4 1 4 3 3 Loud 1 2 2 1 4 1 Manager 3 Manager 4 Outgoing 1 4 2 4 5 3 Easy Going 2 3 1 3 4 3 Carefree 1 5 3 4 4 2 Outgoing 1 3 1 5 5 1 Generous 4 2 5 2 1 5 Nurturing 3 3 5 5 1 5 Trusting 4 2 4 3 1 4 Calm 5 2 4 1 2 5 Organized 5 3 3 2 3 3 Intelligent 5 3 5 2 3 3 Athletic 2 1 1 2 5 1 The managers completed these repertory grids by describing, in their own words (that is, using their own personal constructs), the six employees. Scales (1 to 5 points) were constructed from their descriptions, and each manager then rated each employee on the scales. As can be seen, the managers differed in the number of personal constructs they used to describe the employees. With GPA, such differences are permissible as long as the grids are matched on the elements (that is, the people being rated). More generally, GPA requires either the constructs or the elements to be matched across the grids. Both constructs and elements can be matched as well.

Generalized Procrustes Analysis 2 The four grids are included with the latest version (2.4) of the Idiogrid software ( Manager1.grd, Manager2.grd, etc., files), and they can be loaded and analyzed. Once the grids are loaded, the user selects Analyses > Generalized Procrustes Analysis from the Main Menu in Idiogrid. The GPA options window, with the appropriate options selected will appear as follows: As can be seen, the four grids are selected, and the Matched Figures in Grids option is set to Elements for the current grids. The default tolerance and iteration values are left unchanged, and all of the scaling options are selected. The Centered and Isotropic options are commonly selected and set as the defaults in Idiogrid, but the Dimension scaling option is selected here as well since the grids have different numbers of constructs (that is, the grids differ in their dimensionality). An ANOVA of the results has also been requested along with a randomization test. The consensus grid will also be saved as a new grid for further analysis, and each manager s rescaled and rotated grid will be saved for further analysis as well. Selecting OK to run the analysis, generates the following output:

Generalized Procrustes Analysis 3 Generalized Procrustes Analysis Grids Analyzed (4) Manager 1, Manager 2, Manager 3, Manager 4 Centered Scaling Factors Centering Means Manager 1 Manager 2 Manager 3 Manager 4 Con_1 3.17 2.33 3.33 2.67 Con_2 3.00 2.83 3.17 2.67 Con_3 2.67 3.00 3.00 3.67 Con_4 2.00 2.83 3.00 3.17 Con_5 1.83 0.00 3.00 3.50 Con_6 0.00 0.00 2.50 0.00 Note. Prior to analysis, the grid values were centered about these mean values. Dimensional Scaling Factors Manager 1 : 2.24 Manager 2 : 2.00 Manager 3 : 2.45 Manager 4 : 2.24 Note. Prior to analysis, the values in each grid were divided by their respective dimensional scaling values. Lambda Scaling Factor SS Value : 100 Lambda : 1.67 Note. Prior to anlaysis, the values in each grid were multiplied by Lambda. Isotropic Scaling Factors Manager 1 : 0.75 Manager 2 : 1.01 Manager 3 : 1.48 Manager 4 : 0.94 Note. After the initial Procrustes rotations, the values in each grid were multiplied by their respective isotropic scaling values. Iteration History 1 : -69.87332404583066 2 : -71.14153878390364 3 : -71.15291188773783 4 : -71.15313657697925 5 : -71.15313941934808

[Manager 1 rescaled grid created (see Grid Data window)] [Manager 2 rescaled grid created (see Grid Data window)] [Manager 3 rescaled grid created (see Grid Data window)] [Manager 4 rescaled grid created (see Grid Data window)] Generalized Procrustes Analysis 4 ANOVA Source Table for Matched Figures Figures Consensus Residual Total John 16.71 2.33 19.04 Bob 5.47 6.28 11.76 Amy 9.74 5.87 15.61 Jan 11.01 3.55 14.56 Fred 24.10 1.41 25.51 Jill 11.54 1.99 13.52 Total SS: 78.57 21.43 100.00 ANOVA Source Table for Grids Grids Residual Total Manager 1 9.87 16.09 Manager 2 5.42 24.61 Manager 3 2.64 29.94 Manager 4 3.51 29.36 Total SS: 21.43 100.00 Consensus Proportion: 0.79 Specific ANOVA Residuals Manager 1 Manager 2 Manager 3 Manager 4 John 0.77 0.75 0.18 0.63 Bob 3.65 1.94 0.62 0.08 Amy 3.39 0.86 0.54 1.07 Jan 0.95 1.31 0.13 1.17 Fred 0.72 0.09 0.21 0.39 Jill 0.38 0.47 0.97 0.17 {Consensus Grid created (see Grid Data window)} Randomization Results Observed Consensus Proportion : 0.79 Number of Random Proportions : 100.00 Minimum Random Proportion : 0.65 Maximum Random Proportion : 0.83 Values > Observed Proportion : 5.00 Approximate p-value : 0.05 {Randomization Results Grid created (see Grid Data window)}

Generalized Procrustes Analysis 5 The consensus grid, as well as the specific results from the Randomization test, are saved as new grids in the grid data window: The rows of the consensus grid are labeled Con_1', Con_2', etc. It is important to realize that since the constructs were not matched in the grids, these represent arbitrarily labeled dimensions along which the elements have been ordered in the analysis. Con_1', for instance, is therefore not a construct from a particular manager s grid. It is simply an abstract dimension resulting from the analysis. As will be shown below, the personal constructs of the managers can be correlated with these dimensions, and the elements (that is, the rated employees) can be mapped into spaces formed from these dimensions as well.

Generalized Procrustes Analysis 6 A common strategy is to conduct a Principal Components Analysis on the saved consensus grid. In Idiogrid, the user selects Analyses > Principal Components Analysis. The following options window, with a number of desirable options selected, will open: As can be seen, consensus grid is selected as the Grid to Analyze. The Eigenvalues and Scree Plot option is chosen, as well as the Export Coefficients to Multiple-Group Setup File option. The correlations of the consensus grid will be analyzed, and the first two principal components will be extracted and plotted. Prior to running the analysis, the Graphing Options button should be selected, and the following options chosen:

Generalized Procrustes Analysis 7 As can be seen, the Plot Construct Poles option is set to Neither, and the Include Construct Vectors in Space is not selected. As discussed above, the constructs in this consensus grid are arbitrary dimensions and are therefore uninformative. Once these options are set, OK can be selected, and the OK button of the Principal Components window can be selected to run the PCA. When the OK button is pressed for the PCA, the user will be prompted to save the component score coefficients: Yes should be selected, and then the user will provide a name for file to which the score coefficients will be saved. These score coefficients are essentially the numerical definitions of the principal components that result from the analysis. They will later be used to reconstruct the components when mapping the managers constructs. The following text output, without the annotation, will then be generated for the PCA by Idiogrid: Principal Components Analyses (Correlations) for Consensus Grid Eigenvalues for Unrotated Components Eigenvalue % Variance Cumulative % Scree PC_ 1 3.42 56.94 56.94 ************ PC_ 2 1.24 20.63 77.57 ***** PC_ 3 0.91 15.18 92.74 **** PC_ 4 0.37 6.13 98.87 ** PC_ 5 0.07 1.13 100.00 * Element Loadings (Unrotated) PC_1 PC_2 John -0.97-0.09 Bob 0.47-0.25 Amy -0.61 0.44 Jan 0.60-0.73 Fred 1.09 0.66 Jill -0.57-0.03 Note. Values used for plotting elements in unrotated component space. {Graph Created: Consensus Grid / PC_1 vs. PC_2 (Unrotated)}

Generalized Procrustes Analysis 8 The following graph will also be generated: As can be seen in the graph of the first two components, which explain ~78% of the variance in the consensus grid (see the scree plot in the printed PCA output above), Amy, John, and Jill are similar to one another, and different from Bob and Jan, who are similar to one another. Fred is more similar to Bob and Jan than the other three employees, but he seems to be in a class by himself.

Generalized Procrustes Analysis 9 Now the constructs will be added to the space using the Multiple Groups Components Analysis option in Idiogrid to conduct an extension analysis. The analyses below can be conducted for any single manager s grid or a subset of the manager s grids; however, all of the grids will be analyzed in this example. A concatenated grid is first created by making the Grid Data window visible in Idiogrid, and then selecting Edit > Concatenate Grids > Elements Matched from the Main Menu. The following window opens: As can be seen, the consensus grid and the four managers rescaled grids are moved into the right Selected Grids window. It is extremely important that the consensus grid be listed first when the grids are selected and moved into the Selected Grids window. If one of the manager s rescaled grids is listed first, the extension analysis below will be incorrect. Once the grids are selected, different transformations can be selected (none are selected here), and the OK button pressed to generated the concatenated grid:

Generalized Procrustes Analysis 10 As can be seen in the concatenated grid above the first six rows are comprised of the consensus grid, and the remaining rows are the four managers rescaled grids. Now the components that were created earlier in the PCA must be re-constructed and the managers constructs mapped in the space. This is what methodologists refer to as an extension analysis. Analysis > Multiple Group Components Analysis is selected from the Main Menu opening the following window:

Generalized Procrustes Analysis 11 Multiple Group Components Analysis is essentially a confirmatory type of analysis in which a researcher or practitioner can test the fit of a component model to the grid data. Here the analysis will be used to reconstruct the components extracted previously from the consensus grid so that the managers constructs can be mapped into the space. In the options window above the concatenated grid is selected as the Grid to Analyze, the Number of Components Graphed is set to 2', and the Eigenvalues and Scree Plot and Structure Coefficients options are selected. The Create Components button is then selected to open the following window: In the window above, the Load from File button was pressed, and the component score coefficients that were saved from the PCA above were located and loaded. It can be seen that these coefficients are essentially multiplicative weights that are applied to the Con_1', Con_2', etc. dimensions to re-create the components. The OK button is then pressed to return to the Multiple Groups Components options window. The OK button is then selected to run the Multiple Groups analysis, which generates the following output: Multiple-Group Components Analysis (correlations) for Concatenated Grid Eigenvalues for Constructed Components Eigenvalue % Variance Cumulative % Scree C_ 1 13.87 53.33 53.33 ************ C_ 2 4.44 17.08 70.41 ****

Generalized Procrustes Analysis 12 Structure Coefficients C_1 C_2 Con_1 0.91-0.17 Con_2 0.92 0.08 Con_3-0.68-0.48 Con_4-0.89 0.23 Con_5-0.70 0.16 Con_6 0.07 0.95 extraverted 0.34 0.47 sharing -0.39-0.08 motivated 0.42 0.50 funny 0.82 0.53 loud 0.65 0.75 blunt 0.89-0.02 patient -0.85 0.23 creative -0.68-0.56 outgoing 0.68-0.61 outgoing 0.97 0.00 carefree 0.91-0.08 generous -0.75 0.22 trusting -0.74 0.00 organized -0.60 0.41 athletic 0.35 0.59 easy going 0.77-0.05 outgoing 0.95-0.16 nurturing -0.50-0.48 calm -0.91 0.29 intelligent -0.75 0.47 Element Loadings C_1 C_2 John -1.79 0.60 Bob 0.94-0.32 Amy -1.32 0.42 Jan 1.15-1.33 Fred 2.28 1.39 Jill -1.25-0.75 Note. Values used for plotting elements in component space. {Graph Created: Concatenated Grid / C_1 vs. C_2 (MGCA)} The following graph is also created:

Generalized Procrustes Analysis 13 It can be seen that Fred is funny, extraverted, loud, etc., whereas Jill is creative and nurturing. A common summary strategy in GPA is to examine the structure coefficients and tally constructs for each dimension. For example, the structure coefficients for this analyses are: Structure Coefficients C_1 C_2 Con_1 0.91-0.17 Con_2 0.92 0.08 Con_3-0.68-0.48 Con_4-0.89 0.23 Con_5-0.70 0.16 Con_6 0.07 0.95 extraverted 0.34 0.47 sharing -0.39-0.08 motivated 0.42 0.50 funny 0.82 0.53 loud 0.65 0.75 blunt 0.89-0.02 patient -0.85 0.23 creative -0.68-0.56 outgoing 0.68-0.61 outgoing 0.97 0.00 carefree 0.91-0.08

Generalized Procrustes Analysis 14 generous -0.75 0.22 trusting -0.74 0.00 organized -0.60 0.41 athletic 0.35 0.59 easy going 0.77-0.05 outgoing 0.95-0.16 nurturing -0.50-0.48 calm -0.91 0.29 intelligent -0.75 0.47 A cut-point for salience must be determined; for instance,.70 (common values are 0.30, 0.35, and 0.40). Structure coefficients are correlations, and a cut-point of.70 would indicate at least 2 50% (.70 ) overlap between the constructs and the dimensions. The constructs that equal or exceed 0.70 in absolute value are printed in bold above. The first dimension therefore appears to be characterized primarily by being funny, blunt, outgoing, and carefree compared to being patient, generous, trusting, calm, and intelligent. The second component only has one salient construct, loud. The non-salient constructs can be removed from the graph in Idiogrid. While viewing the graph in the Graphics Output window, select Edit > Change Options from the Main Menu. The following options window will open: As can be seen, the Suppress Construct Labels option has been set to 0.70. Consequently, when the OK button is selected a new graph will be created in which any constructs with structure coefficients less than 0.70 in absolute value will be omitted, as follows:

Generalized Procrustes Analysis 15 As can be seen in this new graph, only the salient constructs are shown. Finally, any graph in Idiogrid can be saved and then inserted into Powerpoint, Word, or most other Microsoft software. Once the graph is inserted in Powerpoint, for instance, the user will need to ungroup the objects in the graph, which can then be edited. In this way, the constructs, elements, vectors, etc. in the 2-dimensional plot can be moved, edited, deleted, etc. Editing the graphs in Microsoft software is particularly useful for deleting the arbitrary Con_1', Con_2', etc. dimension labels that show up as constructs in the extended plots above. In published papers of GPA results, one can often find the output and graphs like those above. With only a little editing outside of Idiogrid, the results and graphs should be ready for presentation or publication.