MULTIDIMENSIONAL SCALING: Using SPSS/PROXSCAL

Similar documents
MULTIDIMENSIONAL SCALING: MULTIDIMENSIONAL SCALING: Using SPSS/PROXSCAL

Modern Multidimensional Scaling

Modern Multidimensional Scaling

Multidimensional Scaling Presentation. Spring Rob Goodman Paul Palisin

Predict Outcomes and Reveal Relationships in Categorical Data

Week 7 Picturing Network. Vahe and Bethany

IBM SPSS Categories 23

Scaling Techniques in Political Science

IBM SPSS Categories. Predict outcomes and reveal relationships in categorical data. Highlights. With IBM SPSS Categories you can:

Vignette: MDS-GUI. Andrew Timm. 1. Introduction. July 24, Multidimensional Scaling

MULTIDIMENSIONAL SCALING: AN INTRODUCTION

Multidimensional Scaling, Social and Behavioral Sciences (Area 4)

A technique for constructing monotonic regression splines to enable non-linear transformation of GIS rasters

Stats fest Multivariate analysis. Multivariate analyses. Aims. Multivariate analyses. Objects. Variables

Unsupervised Learning

The MDS-GUI: A Graphical User Interface for Comprehensive Multidimensional Scaling Applications

Multidimensional Scaling as an Aid for the Analytic Network and Analytic Hierarchy Processes

BiplotGUI. Features Manual. Anthony la Grange Department of Genetics Stellenbosch University South Africa. Version February 2010.

SPSS Modules Features

The Structural Representation of Proximity Matrices With MATLAB

Forestry Applied Multivariate Statistics. Cluster Analysis

ECLT 5810 Clustering

Cluster analysis. Agnieszka Nowak - Brzezinska

1 Introduction Motivation and Aims Functional Imaging Computational Neuroanatomy... 12

ECLT 5810 Clustering

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved.

CS 664 Structure and Motion. Daniel Huttenlocher

Tree Models of Similarity and Association. Clustering and Classification Lecture 5

Generalized least squares (GLS) estimates of the level-2 coefficients,

Multidimensional scaling

9/29/13. Outline Data mining tasks. Clustering algorithms. Applications of clustering in biology

Medoid Partitioning. Chapter 447. Introduction. Dissimilarities. Types of Cluster Variables. Interval Variables. Ordinal Variables.

Local Minima in Regression with Optimal Scaling Transformations

1D Regression. i.i.d. with mean 0. Univariate Linear Regression: fit by least squares. Minimize: to get. The set of all possible functions is...

Dimension reduction : PCA and Clustering

San Jose State University. Math 285: Selected Topics of High Dimensional Data Modeling

General Instructions. Questions

Curve fitting using linear models

Introduction to optimization methods and line search

Unsupervised Learning

Creating a data file and entering data

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Statistical Package for the Social Sciences INTRODUCTION TO SPSS SPSS for Windows Version 16.0: Its first version in 1968 In 1975.

Constrained Clustering with Interactive Similarity Learning

Lecture 3: Camera Calibration, DLT, SVD

Constrained Optimization of the Stress Function for Multidimensional Scaling

Nonparametric Risk Attribution for Factor Models of Portfolios. October 3, 2017 Kellie Ottoboni

Clustering. Bruno Martins. 1 st Semester 2012/2013

Multidimensional Scaling Methods For Many-Object Sets: a. Review. L. Tsogo, M.H. Masson. Universite de Technologie de Compiegne

Logical operators: R provides an extensive list of logical operators. These include

Introduction to Clustering and Classification. Psych 993 Methods for Clustering and Classification Lecture 1

Curve and Surface Fitting with Splines. PAUL DIERCKX Professor, Computer Science Department, Katholieke Universiteit Leuven, Belgium

Introduction (SPSS) Opening SPSS Start All Programs SPSS Inc SPSS 21. SPSS Menus

Contents. 1 Introduction Background Organization Features... 7

3D Computer Vision. Structure from Motion. Prof. Didier Stricker

CSE 547: Machine Learning for Big Data Spring Problem Set 2. Please read the homework submission policies.

SAS/STAT 12.3 User s Guide. The MDS Procedure (Chapter)

Topology and the Analysis of High-Dimensional Data

Latent variable transformation using monotonic B-splines in PLS Path Modeling

CHAPTER 3 WAVELET DECOMPOSITION USING HAAR WAVELET

Computer Vision in a Non-Rigid World

TRICAP Chania. Typical rank when arrays have symmetric slices, and the Carroll & Chang conjecture of equal CP components.

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Computational Physics PHYS 420

Package asymmetry. February 1, 2018

Handout 4 - Interpolation Examples

1.1 calculator viewing window find roots in your calculator 1.2 functions find domain and range (from a graph) may need to review interval notation

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Proximity and Data Pre-processing

SAS/STAT 13.2 User s Guide. The MDS Procedure

Review for Quarter 3 Cumulative Test

Consider functions such that then satisfies these properties: So is represented by the cubic polynomials on on and on.

This lecture: IIR Sections Ranked retrieval Scoring documents Term frequency Collection statistics Weighting schemes Vector space scoring

CE366/ME380 Finite Elements in Applied Mechanics I Fall 2007

Advanced Topics In Machine Learning Project Report : Low Dimensional Embedding of a Pose Collection Fabian Prada

4. Feedforward neural networks. 4.1 Feedforward neural network structure

core engine data processing output URL variable list user interface

Introduction to Mplus

CS 231A Computer Vision (Winter 2015) Problem Set 2

Cluster Analysis. CSE634 Data Mining

Wireless Sensor Networks Localization Methods: Multidimensional Scaling vs. Semidefinite Programming Approach

Mar. 20 Math 2335 sec 001 Spring 2014

Non-linear dimension reduction

Visual Representations for Machine Learning

Justify all your answers and write down all important steps. Unsupported answers will be disregarded.

USING THE MATLAB TOOLSET TO IMPROVE EFFICIENCY IN THE EOBD CALIBRATION PROCESS

Euclidean Reconstruction Independent on Camera Intrinsic Parameters

Data Mining: Concepts and Techniques. Chapter March 8, 2007 Data Mining: Concepts and Techniques 1

Data Analysis using SPSS

Invariant shape similarity. Invariant shape similarity. Invariant similarity. Equivalence. Equivalence. Equivalence. Equal SIMILARITY TRANSFORMATION

Last time... Bias-Variance decomposition. This week

Feature description. IE PŁ M. Strzelecki, P. Strumiłło

Data Preprocessing. Why Data Preprocessing? MIT-652 Data Mining Applications. Chapter 3: Data Preprocessing. Multi-Dimensional Measure of Data Quality

UNIT 4. Research Methods in Business

IBM SPSS Statistics Traditional License packages and features

Evaluation Metrics. (Classifiers) CS229 Section Anand Avati

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Lecture 9: Introduction to Spline Curves

Assignment 4: Mesh Parametrization

Market basket analysis

Transcription:

MULTIDIMENSIONAL SCALING: Using SPSS/PROXSCAL August 2003 : APMC SPSS uses Forrest Young s ALSCAL (Alternating Least Squares Scaling) as its main MDS program. However, ALSCAL has been shown to be sub-optimal giving exaggerated importance to large data dissimilarities (Ramsay). DON T USE IT (or only AYOR!) SPSS10 offers PROXSCAL (PROXimity SCALing) as an alternative to ALSCAL for multidimensional scaling: USE IT!! PROXSCAL performs most Distance Model scaling (for scalar products/vector models, see SPSS Categories). (Pre-SPSS PROXSCAL.pdf Documentation by Busing is available). Data for basic MDS in SPSS10 can be either 1. input directly as a full SSM (square symmetric matrix) of proximities=dis/similarities into SPSS editor. 2. calculate dis/similarity measure within SPSS from a raw datafile (ANALYZE -> CORRELATE -> DISTANCES) ; for metric data, many use Euclidean Distance Measure. 3. OR (in PROXSCAL only) matrix can be OHP 1 of 9

imported as a SSM. SPSS will not accept LT matrices directly, (n.b. universal use of LT matrices in other programs; confusing SPSS documentation suggesting otherwise). 4. procedures for changing a LT into a SSM for SPSS are tedious, but possible (subject of a separate handout). OHP 2 of 9

SETTING UP data-transformation-model EQUIVALENCES IN SPSS PROXSCAL n.b. Use Right Click for supplementary information in SPSS10 PROXSCAL proximities = dis/similarities PROXSCAL Transformed proximities = disparities. 1. DATA Analyze Scale Multidimensional scaling (PROXSCAL) First Window (Data Format) Data Format: ( data are proximities (or create from raw data initiates Create proximities from data procedure) Number of sources: ( One for 2W1M data; Multiple for INDSCAL etc) OHP 3 of 9

Second Window: Transfer Matrix (v1 - v16) across. (Best to keep variables name short at this stage, to avoid over-printing at graphing stage; they can be selectively lengthened later in plotting routine) Note the crucial buttons at the foot of the Transfer Window: MODEL- RESTRICTIONS-OPTIONS-PLOTS-OUTPUTS these are the ones which set the detail of the analysis and run parameters. OHP 4 of 9

SCALING MODEL: Identity means simple Euclidean Weighted Euclidean means INDSCAL Generalised Euclidean means IDIOSCAL (individual rotation and then weighting) Reduced Rank means IDIOSCAL with minimal rank of matrix SHAPE: does not refer to input matrix...which must be SSM Lower triangular means only the lower triangular data are analysed i.e. symmetric Upper triangular: ditto, upper Full matrix: data may be asymmetric, but are symmetrised PROXIMITIES (data) Similarity data (hi means more similar) Dissimilarity (hi means more dissimilar) n.b. this option is default: beware! DIMENSIONS MDS solutions proceed from max (-1) min OHP 5 of 9

PROXSCAL MODEL PARAMETERS (cont.) Proximity TRANSFORMATIONS Ratio (LoM) implies metric analysis Interval (LoM) metric Ordinal (LoM) non-metric. Default is secondary approach to ties, unless... Untie tied values (=primary) Spline (cf Ramsay & MULTISCALE): piece-wise polynomial transformation of the original data. Pieces and shape of transformation are specified by: degree (1=linear; 2= quadratic...), and Number of internal knots. APPLY TRANSFORMATIONS (applies only to INDSCAL and higher models; in effect local versus global application). OHP 6 of 9

RESTRICTIONS = EXTERNAL or CONSTRAINED / CONFIRMATORY ANALYSIS (cf Borg and Groenen 1997, pp181-199). These options allow for: Fixing some (known?) points in a configuration, and estimating the others ( Some co-ordinates fixed) Fitting (regressing) external properties (PRO-FIT) ( Linear combinations...) Additional information is contained in an separate (or integral) SPSS file OHP 7 of 9

PLOTS Common Space = Group or Stimulus Configuration. n.b. The Shepard Diagram is not immediately available in PROXSCAL; it combines (and can in principle be reconstructed from) original vs transformed proximities= disparities ( vs d-hat Monotonic Fit) transformed proximities vs distances of solution (dhat vs d OLS fit) OHP 8 of 9

OUTPUT DISPLAY Common Space (=Stimulus Configuration) Distances (of solution) Transformed proximities (=disparities) Input data (ALWAYS recommended, to ensure the program is working on the data YOU think it is... ) Iteration history (for diagnosis of stress minimization) Multiple stress measures (use Stress1 for comparison with other solutions; not S-STRESS. Note: normalised raw stress raw stress) Stress decomposition = point contribution to stress Save to new file is equivalent of MDSX s PUNCH. Useful for graphic output. OHP 9 of 9