Introductory Concepts for Voxel-Based Statistical Analysis

Similar documents
Linear Models in Medical Imaging. John Kornak MI square February 22, 2011

Linear Models in Medical Imaging. John Kornak MI square February 23, 2010

Linear Models in Medical Imaging. John Kornak MI square February 19, 2013

Linear Models in Medical Imaging. John Kornak MI square February 21, 2012

Multiple Testing and Thresholding

Extending the GLM. Outline. Mixed effects motivation Evaluating mixed effects methods Three methods. Conclusions. Overview

Multiple Testing and Thresholding

Multiple Testing and Thresholding

Multiple comparisons problem and solutions

Medical Image Analysis

New and best-practice approaches to thresholding

CHAPTER 2. Morphometry on rodent brains. A.E.H. Scheenstra J. Dijkstra L. van der Weerd

Correction for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh

Controlling for multiple comparisons in imaging analysis. Wednesday, Lecture 2 Jeanette Mumford University of Wisconsin - Madison

Introduction to Neuroimaging Janaina Mourao-Miranda

Controlling for mul2ple comparisons in imaging analysis. Where we re going. Where we re going 8/15/16

Controlling for mul-ple comparisons in imaging analysis. Wednesday, Lecture 2 Jeane:e Mumford University of Wisconsin - Madison

Contents. comparison. Multiple comparison problem. Recap & Introduction Inference & multiple. «Take home» message. DISCOS SPM course, CRC, Liège, 2009

Advances in FDR for fmri -p.1

7/15/2016 ARE YOUR ANALYSES TOO WHY IS YOUR ANALYSIS PARAMETRIC? PARAMETRIC? That s not Normal!

Basic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy

1 Introduction Motivation and Aims Functional Imaging Computational Neuroanatomy... 12

Computational Neuroanatomy

Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015

fmri pre-processing Juergen Dukart

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy

Statistical Methods in functional MRI. False Discovery Rate. Issues with FWER. Lecture 7.2: Multiple Comparisons ( ) 04/25/13

Supplementary methods

Functional MRI in Clinical Research and Practice Preprocessing

Preprocessing II: Between Subjects John Ashburner

The Effect of Correlation and Error Rate Specification on Thresholding Methods in fmri Analysis

Zurich SPM Course Voxel-Based Morphometry. Ged Ridgway (Oxford & UCL) With thanks to John Ashburner and the FIL Methods Group

Basic fmri Design and Analysis. Preprocessing

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

SPM8 for Basic and Clinical Investigators. Preprocessing. fmri Preprocessing

Methods for data preprocessing

Function-Structure Integration in FreeSurfer

Network statistics and thresholding

Chapter 6: Linear Model Selection and Regularization

EPI Data Are Acquired Serially. EPI Data Are Acquired Serially 10/23/2011. Functional Connectivity Preprocessing. fmri Preprocessing

An independent component analysis based tool for exploring functional connections in the brain

Statistical Analysis of Metabolomics Data. Xiuxia Du Department of Bioinformatics & Genomics University of North Carolina at Charlotte

MultiVariate Bayesian (MVB) decoding of brain images

Cognitive States Detection in fmri Data Analysis using incremental PCA

Journal of Articles in Support of The Null Hypothesis

Statistical Methods in functional MRI. Standard Analysis. Data Processing Pipeline. Multiple Comparisons Problem. Multiple Comparisons Problem

SPM8 for Basic and Clinical Investigators. Preprocessing

NA-MIC National Alliance for Medical Image Computing fmri Data Analysis

Quantitative MRI of the Brain: Investigation of Cerebral Gray and White Matter Diseases

NIH Public Access Author Manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2014 August 15.

Estimating Noise and Dimensionality in BCI Data Sets: Towards Illiteracy Comprehension

Independent Component Analysis of fmri Data

FMRI Pre-Processing and Model- Based Statistics

Voxel-Based Morphometry & DARTEL. Ged Ridgway, London With thanks to John Ashburner and the FIL Methods Group

Computational Methods in NeuroImage Analysis

Neuroimage Processing

Statistical inference on images

A Non-Parametric Approach

The organization of the human cerebral cortex estimated by intrinsic functional connectivity

fmri Basics: Single Subject Analysis

HST.583 Functional Magnetic Resonance Imaging: Data Acquisition and Analysis Fall 2006

First-level fmri modeling

Norbert Schuff VA Medical Center and UCSF

EMPIRICALLY INVESTIGATING THE STATISTICAL VALIDITY OF SPM, FSL AND AFNI FOR SINGLE SUBJECT FMRI ANALYSIS

SGN (4 cr) Chapter 10

Statistical Parametric Maps for Functional MRI Experiments in R: The Package fmri

SnPM is an SPM toolbox developed by Andrew Holmes & Tom Nichols

Statistical Modeling of Neuroimaging Data: Targeting Activation, Task-Related Connectivity, and Prediction

ASAP_2.0 (Automatic Software for ASL Processing) USER S MANUAL

Multivariate pattern classification

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

SPM Introduction. SPM : Overview. SPM: Preprocessing SPM! SPM: Preprocessing. Scott Peltier. FMRI Laboratory University of Michigan

ANALYSIS OF FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA USING SPM99: VOXEL-BASED MORPHOMETRY DONNA ROSE ADDIS

Single Subject Demo Data Instructions 1) click "New" and answer "No" to the "spatially preprocess" question.

CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series

SPM Introduction SPM! Scott Peltier. FMRI Laboratory University of Michigan. Software to perform computation, manipulation and display of imaging data

Correction of Partial Volume Effects in Arterial Spin Labeling MRI

Multivariate Neuroimaging Analysis: An Approach for Multiple Image Modalities

HHS Public Access Author manuscript Med Image Comput Comput Assist Interv. Author manuscript; available in PMC 2018 March 20.

Effect of age and dementia on topology of brain functional networks. Paul McCarthy, Luba Benuskova, Liz Franz University of Otago, New Zealand

Basic principles of MR image analysis. Basic principles of MR image analysis. Basic principles of MR image analysis

Sparse & Functional Principal Components Analysis

Functional MRI data preprocessing. Cyril Pernet, PhD

Robust Realignment of fmri Time Series Data

Part I. Hierarchical clustering. Hierarchical Clustering. Hierarchical clustering. Produces a set of nested clusters organized as a

MR IMAGE SEGMENTATION

Introduction to fmri. Pre-processing

Cluster failure: Why fmri inferences for spatial extent have inflated false positive rates

Dimension Reduction CS534

Lecture 14 Shape. ch. 9, sec. 1-8, of Machine Vision by Wesley E. Snyder & Hairong Qi. Spring (CMU RI) : BioE 2630 (Pitt)

Prostate Detection Using Principal Component Analysis

Statistics 202: Data Mining. c Jonathan Taylor. Outliers Based in part on slides from textbook, slides of Susan Holmes.

Online Statistical Inference for Quantifying Mandible Growth in CT Images

MEDICAL IMAGE ANALYSIS

Thresholding of Statistical Maps in Functional Neuroimaging via Independent Filtering

MULTI-RESOLUTION STATISTICAL ANALYSIS ON GRAPH STRUCTURED DATA IN NEUROIMAGING

Structural Segmentation

Norbert Schuff Professor of Radiology VA Medical Center and UCSF

PRANA. Project Review and Analysis. March (Pre-release and still under development!)

Transcription:

Introductory Concepts for Voxel-Based Statistical Analysis John Kornak University of California, San Francisco Department of Radiology and Biomedical Imaging Department of Epidemiology and Biostatistics ADVANCED STATISTICAL CONCEPTS FOR MULTIMODAL MRI: THEORY AND APPLICATIONS June 19, 2010

Acknowledgement / Disclaimer Many of the slides in this lecture have been adapted from presentations available on the SPM web site.

Overview 1. Motivation 2. Linear modeling 3. Multiple comparison correction 4. Multivariate methods

Motivation Imaging data statistical methods to look for regional effects Tissue differences between groups or over time VBM, TBM (voxel/tensor-based morphometry) PET (positron emmission tomography), fmri (functional MRI) determine activation in the brain due to thought, stimulus or task Diffusion (DWI, DTI, tractography), Bone mineral density etc. etc.

image data kernel design matrix parameter estimates realignment & motion correction smoothing Linear Model model fitting statistic image thresholding & multiple comparisons normalisation anatomical reference Statistical Parametric Map (test statistics) Corrected thresholds & p-values

Software SPM PET, fmri, VBM and TBM, EEG/MEG (http://www.fil.ion.ucl.uk/spm/ needs Matlab) FSL fmri primarily + DTI (http://www.fmrib.ox.ac.uk/fsl/) R AnalyzeFMRI package + linear models in general (http://www.r-project.org/ and then go to your nearest CRAN mirror)

Part 1 Linear Modeling

image data kernel design matrix parameter estimates realignment & motion correction smoothing Linear Model model fitting statistic image thresholding & multiple comparisons normalisation anatomical reference Statistical Parametric Map (test statistics) Corrected thresholds & p-values

Definitions Univariate response variable y ij subject i - at voxel j in Covariates (x i1, x i2,..., x ik ) = (variables of interest and nuisance variables) Continuous covariates: e.g. age, blood pressure, stimulus intensity, etc., (random or controlled) Factors: e.g. diagnosis, gender, drinking level, experimental condition, (low, medium, high) etc. x i T Complete data is: y ij,x T i ;i 1,...,n; j 1,...,m (n subjects, m voxels)

The (General) Linear Model The linear model is applied at each individual voxel: A linear model takes the form: y ij 1 j x i2 2 j x i3 3 j... x im mj ij e.g. y ij mean, j x i,age age, j x i,gender gender, j... x i,diagnosis diagnosis, j ij ij ~ N(0, 2 ), i.i.d. i 1,...,n j 1,...,m i.i.d. = independently and identically distributed

Eg. Hippocampal Volume HCV ~ Age + Diagnosis + Age*Diagnosis Diagnosis can be normal control (NC) or Alzheimer s disease (AD)

Eg. Hippocampal Volume Structural T1 weighted MRI s Volume measures at each voxel in HC (Voxel or Deformation Based Morphometry) Volume measure = response for each subject Disease status encoded 1 for AD and 0 for NC (the term) x diagnosis

y ij 1 x i,age age, j x i,diag. diag., j x i.age x i,diag. inter, j ij HCV Case 1 age, j 0, diag., j 0, inter, j 0 age

y ij 1 x i,age age, j x i,diag. diag., j x i.age x i,diag. inter, j ij HCV Case 2 age, j 0, diag., j 0, inter, j 0 age

y ij 1 x i,age age, j x i,diag. diag., j x i.age x i,diag. inter, j ij HCV Case 3 age, j 0, diag., j 0, inter, j 0 NC AD age

y ij 1 x i,age age, j x i,diag. diag., j x i.age x i,diag. inter, j ij HCV Case 4 age, j 0, diag., j 0, inter, j 0 NC AD age

y ij 1 x i,age age, j x i,diag. diag., j x i.age x i,diag. inter, j ij HCV Case 4 age, j 0, diag., j 0, inter, j 0 NC AD age

F-test for General Linear Hypothesis Goal: detect voxels with significant disease or condition effects y X T N n 0, c.f. simpler model 2 I n H 0 : A c This is the General Linear Hypothesis and uses F-test (based on ratio of models sum of squares residuals) Can answer e.g., does disease status affect HCV? Or does an age by disease interaction exist? Or is condition A equivalent to condition B in an fmri expmt?

Parameter Estimates Same model for all voxels Different parameters for each voxel beta_0001.img ˆ 0.83 0.16 2.98 beta_0002.img ˆ 0.03 0.06 2.04 ˆ 0.68 0.82 2.17 beta_0003.img

Parameter Estimates Same model for all voxels Different parameters for each voxel beta_0001.img Note: This approach estimates parameters at each voxel for conditions that affect the brain ˆ 0.83 0.16 2.98 beta_0002.img But what if the brain affects the condition - e.g. loss of tissue affects cognitive ability - is this still the right model? Then there is only 1 outcome and the brain voxels form a set of predictors. ˆ ˆ 0.68 0.82 2.17 0.03 0.06 2.04 beta_0003.img

Summary of Part 1 Linear model fitted separately to each voxel to estimate effects of interest Hypothesis test statistics are obtained at each voxel and combined to form a statistical parametric map (SPM)

Part 2 Multiple Comparison Correction

image data kernel design matrix parameter estimates realignment & motion correction smoothing Linear Model model fitting statistic image thresholding & multiple comparisons normalisation anatomical reference Statistical Parametric Map (test statistics) Corrected thresholds & p-values

Multiple Comparison Problem Each voxel obtains a test statistic from the linear model, e.g. t or F Forms statistical maps of the statistics (statistical parametric maps, SPMs) E.g., which of 100,000 voxels are significant? =0.05 5,000 false positive voxels

How can we determine a sensible threshold level? Assessing Statistic Images Where s the signal or change? High Threshold Med. Threshold Low Threshold t > 5.5 t > 3.5 t > 0.5 Good Specificity Poor Power (risk of false negatives) Poor Specificity (risk of false positives) Good Power

Multiple Comparison Solutions: Measuring False Positives 1. Familywise Error Rate (FWER) Familywise Error Existence of one or more false positives 2. False Discovery Rate (FDR) FDR = E(V/R) R voxels declared active, V falsely so Realized false discovery rate: V/R 3. Permutation Testing

FWER: Bonferroni Correction FWE, α, for N independent voxels is α = Nv (v = voxelwise error rate) To control FWE set v = α / N Independent Voxels Spatially Correlated Voxels Bonferroni is too conservative for brain images

FWER: Random Field Theory Euler Characteristic u Topological Measure #blobs - #holes At high thresholds = blobs Threshold Random Field See description at http://imaging.mrc-cbu.cam.ac.uk/imaging/principlesrandomfields Suprathreshold Sets

Multiple Comparison Solutions: Measuring False Positives 1. Familywise Error Rate (FWER) Familywise Error Existence of one or more false positives 2. False Discovery Rate (FDR) FDR = E(V/R) R voxels declared active, V falsely so Realized false discovery rate: V/R 3. Permutation Testing

p-value 0 1 FDR: Benjamini & Hochberg Procedure Select desired limit q on FDR Order p-values, p (1) p (2)... p (V) Let r be largest i such that p (i) i/v q/c(v) Reject all hypotheses corresponding to p (1),..., p (r) NB, no spatial consideration Journal of the Royal Statistical Society Series B (1995) 57:289-300 i/v i/v p (i) q/c(v) 0 1

FDR v FWER (GRF) Illustration Noise Signal Signal+Noise

Control of Per Comparison Rate at 10% 11.3% 11.3% 12.5% 10.8% 11.5% 10.0% 10.7% 11.2% 10.2% 9.5% Percentage of Null Pixels that are False Positives Control of Familywise Error Rate at 10% Occurrence of Familywise Error FWE Control of False Discovery Rate at 10% 6.7% 10.4% 14.9% 9.3% 16.2% 13.8% 14.0% 10.5% 12.2% 8.7% Percentage of Observed Above Threshold Pixels that are False Positives

Summary of Part 2 Massive multiple comparison problem Spatial correlation Multiple approaches (FWER, FDR, NP) and levels (voxel, cluster, set) for multiple comparison correction

Part 3 Multivariate Methods

PCA and ICA Principal Components Analysis and Independent Components Analysis Methods to succinctly summarize highdimensional data sets (transform many variables to a few variables) Many variables are summarized in terms of just a few new variables (the components) Projection of data: high- to low-dimension

PCA Finds a series of projections/principal Components (PCs) each with maximal variability The first PC explains as much of the variability in the full data set as possible The second PC explains as much variability as possible after the variability from the first (PC) has been removed (uncorrelated/orthogonal to the first) Constraint: each PC is a linear combination of the original variables

Linear Combinations A linear combination of N variables is defined as: x, x,..., x 1 2 N a x a x... a x 1 1 2 2 N N x, x,..., xn where 1 2 could be voxel intensities or points in time

2D Toy Example Var 1 Var 2

2D Toy Example 1 st PC Var 1 Var 2

2D Toy Example Minimize sum of squares 1 st PC Var 1 Var 2

2D Toy Example 1 st PC Var 1 2 nd PC Var 2

Note: that there is an issue of scaling Var 1 2D Toy Example 1 st PC 2 nd PC Var 2

Voxel-Based Analysis Applications Consider a set of images e.g. repeated brain images in time for a single subject or single images of many subjects The first PC (image) may explain the effect of group or time The second PC image may explain between subject variability of brain shape It is hoped that each of the PCs is meaningful

Independent Components Analysis (ICA) ICA differs from PCA for technical reasons (but can produce very different results) In PCA the PCs have to be uncorrelated however, uncorrelated does not imply independent independence is a stronger requirement (if data are Gaussian then uncorrelated does imply independent)

Uncorrelated but not independent Var 1 Var 2

Independent Components Analysis (ICA) ICA demands that the components are (maximally) independent of each other It provides a different decomposition of the full data than does PCA (i.e. it gives a different set of linear combinations / components) ICA is computationally more challenging and does not have an inherent ordering of components

Summary of Part 3 PCA & ICA attempt to summarize highdimensional datasets in terms of just a few components PCA seeks linear combinations of the data matrix that are of high variance and uncorrelated ICA seeks linear combinations that are independent PCA & ICA both find components of variability that can be explained by linear combinations of input variables