First-level fmri modeling

Similar documents
Percent Change and Power Calculation NITP 2010

FMRI Pre-Processing and Model- Based Statistics

Efficiency and design optimization

Journal of Articles in Support of The Null Hypothesis

Introduction to Neuroimaging Janaina Mourao-Miranda

Basic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy

Quality Checking an fmri Group Result (art_groupcheck)

Function-Structure Integration in FreeSurfer

the PyHRF package P. Ciuciu1,2 and T. Vincent1,2 Methods meeting at Neurospin 1: CEA/NeuroSpin/LNAO

FSL Pre-Processing Pipeline

Serial Correlation and Heteroscedasticity in Time series Regressions. Econometric (EC3090) - Week 11 Agustín Bénétrix

FSL Pre-Processing Pipeline

Extending the GLM. Outline. Mixed effects motivation Evaluating mixed effects methods Three methods. Conclusions. Overview

7/15/2016 ARE YOUR ANALYSES TOO WHY IS YOUR ANALYSIS PARAMETRIC? PARAMETRIC? That s not Normal!

Tutorial BOLD Module

NA-MIC National Alliance for Medical Image Computing fmri Data Analysis

Event-related design efficiency and How to plot fmri time series. Sepideh Sadaghiani NeuroSpin Methods Meeting 15. Sept. 2008

Journal of Statistical Software

EMPIRICALLY INVESTIGATING THE STATISTICAL VALIDITY OF SPM, FSL AND AFNI FOR SINGLE SUBJECT FMRI ANALYSIS

Basic fmri Design and Analysis. Preprocessing

Functional MRI in Clinical Research and Practice Preprocessing

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

HST.583 Functional Magnetic Resonance Imaging: Data Acquisition and Analysis Fall 2006

Intro to ARMA models. FISH 507 Applied Time Series Analysis. Mark Scheuerell 15 Jan 2019

SPM8 for Basic and Clinical Investigators. Preprocessing. fmri Preprocessing

Estimation Efficiency and Statistical Power in Arterial Spin Labeling FMRI.

Introduction to fmri. Pre-processing

Optimal Spline Smoothing of fmri Time Series by Generalized Cross-Validation 1

Experimental Design for fmri OHBM Advanced fmri Educational Course Thomas Liu UCSD Center for Func6onal MRI

Analysis of fmri data within Brainvisa Example with the Saccades database

Independent Component Analysis of fmri Data

Medical Image Analysis

Single Subject Demo Data Instructions 1) click "New" and answer "No" to the "spatially preprocess" question.

Data Visualisation in SPM: An introduction

EPI Data Are Acquired Serially. EPI Data Are Acquired Serially 10/23/2011. Functional Connectivity Preprocessing. fmri Preprocessing

OLS Assumptions and Goodness of Fit

The problem we have now is called variable selection or perhaps model selection. There are several objectives.

MultiVariate Bayesian (MVB) decoding of brain images

Data Visualisation in SPM: An introduction

Multivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems

SPM8 for Basic and Clinical Investigators. Preprocessing

Controlling for multiple comparisons in imaging analysis. Wednesday, Lecture 2 Jeanette Mumford University of Wisconsin - Madison

Version. Getting Started: An fmri-cpca Tutorial

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015

Supplementary Figure 1. Decoding results broken down for different ROIs

FEAT 3 - Advanced FMRI Analysis

White Pixel Artifact. Caused by a noise spike during acquisition Spike in K-space <--> sinusoid in image space

An Empirical Comparison of SPM Preprocessing Parameters to the Analysis of fmri Data

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

Introductory Concepts for Voxel-Based Statistical Analysis

Functional MRI data preprocessing. Cyril Pernet, PhD

Last time... Coryn Bailer-Jones. check and if appropriate remove outliers, errors etc. linear regression

CS/NEUR125 Brains, Minds, and Machines. Due: Wednesday, April 5

Detecting Brain Activations in Functional Magnetic Resonance Imaging (fmri) Experiments with a Maximum Cross-Correlation Statistic

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters

Missing Data Analysis for the Employee Dataset

CDAA No. 4 - Part Two - Multiple Regression - Initial Data Screening

Bias in Resampling-Based Thresholding of Statistical Maps in fmri

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

Multivariate pattern classification

NeuroImaging. (spatial and statistical processing, maybe) Philippe Peigneux, PhD. UR2NF - Neuropsychology and Functional

Adaptive Waveform Inversion: Theory Mike Warner*, Imperial College London, and Lluís Guasch, Sub Salt Solutions Limited

Introduction to Mixed Models: Multivariate Regression

Instance-based Learning

INDEPENDENT COMPONENT ANALYSIS APPLIED TO fmri DATA: A GENERATIVE MODEL FOR VALIDATING RESULTS

Group analysis of fmri data. Why group analysis? What is a population? Reminder: Analysis of data from 1 person. Robert: Roel Willems,

Lab 5: Multi Subject fmri

Residual Analysis for Detecting Mis-modeling in fmri. (Running title: Detecting Mis-modeling in fmri)

Section 2.3: Simple Linear Regression: Predictions and Inference

Missing Data Analysis for the Employee Dataset

Lecture 4: Spatial Domain Transformations

Fitting to a set of data. Lecture on fitting

Linear Methods for Regression and Shrinkage Methods

EE795: Computer Vision and Intelligent Systems

ES-2 Lecture: Fitting models to data

3 Nonlinear Regression

Session scaling; global mean scaling; block effect; mean intensity scaling

Digital Image Processing. Lecture 6

Convolution Product. Change of wave shape as a result of passing through a linear filter

Bayes Net Learning. EECS 474 Fall 2016

SPRITES Moving Two At the Same Using Game State

Dense Image-based Motion Estimation Algorithms & Optical Flow

Linear Models in Medical Imaging. John Kornak MI square February 22, 2011

Cluster failure: Why fmri inferences for spatial extent have inflated false positive rates

1.12 Optimal Filters (Wiener Filters)

CSE 158 Lecture 2. Web Mining and Recommender Systems. Supervised learning Regression

Big Data Methods. Chapter 5: Machine learning. Big Data Methods, Chapter 5, Slide 1

Section 3.2: Multiple Linear Regression II. Jared S. Murray The University of Texas at Austin McCombs School of Business

Regression on SAT Scores of 374 High Schools and K-means on Clustering Schools

Digital Image Processing. Prof. P. K. Biswas. Department of Electronic & Electrical Communication Engineering

SPM Introduction. SPM : Overview. SPM: Preprocessing SPM! SPM: Preprocessing. Scott Peltier. FMRI Laboratory University of Michigan

Lecture on Modeling Tools for Clustering & Regression

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

SPM Introduction SPM! Scott Peltier. FMRI Laboratory University of Michigan. Software to perform computation, manipulation and display of imaging data

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Final Review CMSC 733 Fall 2014

Statistical Methods in functional MRI. False Discovery Rate. Issues with FWER. Lecture 7.2: Multiple Comparisons ( ) 04/25/13

SYDE 575: Introduction to Image Processing

Week 4: Simple Linear Regression II

Transcription:

First-level fmri modeling Monday, Lecture 3 Jeanette Mumford University of Wisconsin - Madison What do we need to remember from the last lecture? What is the general structure of a t- statistic? How about the specific structure for the GLM? What is the residual? What are the assumptions we make with the linear model (Gauss Markov)? Task on Goal in fmri analysis Find voxels with BOLD time series that look like this 1

Task on Goal in fmri analysis Find voxels with BOLD time series that look like this btw, does this remind you of anything?? 2 6 4 Y 1 Y 2 Y 3 Y 4 Y 5 Y 6 Y 7 Y 8. Y 197 Y 198 Y 199 Y 200 3 2 =. 7 6 5 4 1 0 1 0 1 0 0 1 0 1 0 1 1 0 1 0. 1 0 0 1 0 1 0 1 3 apple 7 5 1 2 + Delay of BOLD response Voxel with signal Voxel no signal Voxel signal and drift 2

Voxel with signal Voxel no signal Voxel signal and drift Voxel with signal Voxel no signal Voxel signal and drift Starts off high BOLD issues BOLD response is delayed Convolution FIR modeling BOLD time series suffer from low frequency noise Highpass filtering Prewhitening Precoloring Scaling the data Grand mean scaling Intensity normalization 3

Recall the GLM Single voxel time series Recall the basic structure of a t-statistic How to make a good model Explain as much variability in the data as possible If you miss something it will go into the residual error, e Big residuals Big variance Small t stat 4

Understanding the data Time series drifts down in beginning BOLD response is delayed Simplest Model = Simplest Model = 5

Simplest Model Simplest Model Modeling the delay Hemodynamic response function Real data was used to find good models for the hemodynamic response Stimulus HRF (double gamma) 6

Convolution Combine HRF and expected neural response Typically model derivative of convolved HRF to adjust for small differences in onset (<1s) Different HRF s Too symmetric Basic shape okay, but no post stimulus undershoot Includes post stimulus undershoot Different HRF s Too symmetric Basic shape okay, but no post stimulus undershoot Includes post stimulus undershoot 7

Different HRF s Too symmetric Basic shape okay, but no post stimulus undershoot Includes post stimulus undershoot How convolution is actually done Since timing of trials will not occur on TR and to use a better resolution HRF, we convolve in a finer time resolution Otherwise any trial occurring at any point during the period of the TR will look exactly the same in the model Usampling/downsampling 8

Assumptions of canonical HRF BOLD increases linearly Dale & Buckner, 1997 Assumptions of canonical HRF The width, height and delay are correct Lindquist & Wager (2007) Fit H/W/T separately Works okay-ish Finite impulse response model FIR Make no assumption about the shape of the HRF 9

Constrained basis set Lower the number of regressors in the model by using a basis set Constrained to shapes that are reasonable for HRF shapes Constrained basis set Basis set HRF possibilities FLOBS fmrib Linear Optimal Basis Sets Generates a set of basis sets to model signal Specify ranges for different portions of the hrf 10

Comparison More thoughts about canonical HRF Advantages: Simpler analysis Easily interpretable outcome Simplifies group analysis Disadvantages Biased if canonical HRF is incorrect Unbiased basis sets Advantages Not biased towards a particular shape Allows testing of hypotheses about specific HRF parameters Disadvantages Less powerful Makes group analysis more difficult Tend to overfit the data (i.e., fit noise) 11

We can make a design matrix! Start with task blocks or delta functions Convolve Estimate the GLM and carry out hypothesis! Convolved Boxcar Convolved Boxcar 12

The Noise White noise All frequencies have similar power Not a problem for OLS Colored noise Has structure OLS needs help! More Noise Sources Head motion Cardiac noise What about the drift? Respiratory noise Scanner noise 13

What the noise looks like 1/f structure Power spectra of noise data (Zarahn, Aguirre, D Esposito, NI, 1997) More noise Average spectrum of principal components (Mitra & Pesaran, Biophysical Journal, 1999) More noise Low frequency noise Average spectrum of principal components (Mitra & Pesaran, Biophysical Journal, 1999) 14

More noise Breathing Average spectrum of principal components (Mitra & Pesaran, Biophysical Journal, 1999) More noise Cardiac Average spectrum of principal components (Mitra & Pesaran, Biophysical Journal, 1999) The 1-2 punch Punch 1: Highpass filtering FSL uses gaussian weighted running line smoother SPM fits a DCT basis set Punch 2: Prewhitening We ll get to that later There s also a thing called lowpass filtering (precoloring), but generally it isn t so great and nobody uses it 15

Highpass filtering Simply hack off the low frequency noise SPM: Adds a discrete cosine transform basis set to design matrix Highpass filtering FSL: Gaussian-weighted running line smoother Step 1: Fit a Gaussian weighted running line Highpass filtering FSL: Gaussian-weighted running line smoother Step 1: Fit a Gaussian weighted running line Fit at time t is a weighted average of data around t 16

Highpass filtering Step 2: Subtract Gaussian weighted running line fit IMPORTANT: Must apply filter to both the data and the design. FSL has apply temporal filter box in design setup. Leave it checked! Highpass filtered design (FSL) If it wasn t filtered, this trend wouldn t be here. Highpass filtering Filter below.01 Hz 17

Filter cutoff High, but not higher than paradigm frequency Look at power spectrum of your design and base cutoff on that Block design: Longer than 1 task cycle usually twice the task cycle Event related design: Larger than 66 s (based on the power spectrum of a canonical HRF of a single response) High-pass Filtering Removes the worst of the low frequency trends High-pass From S. Smith Highpass filtering What does it do to the signal?? Signal Power spectrum Lowpass filter Highpass filter Woolrich et al, NI 2001 18

Highpass filtering What does it do to the signal?? Signal Power spectrum Lowpass filter Highpass filter Highpass filtering What does it do to the signal?? Signal Power spectrum Lowpass filter Highpass filter Filtering conclusions Lowpass filtering Idea is to swamp out high frequency noise Easily removes important signal in ER designs Choose cutoff to remove noise, but avoid your signal Highpass filtering Removes low frequency drift We typically avoid designs with low frequencies, so highpass filtering is always used 19

Bandpass filtering High and lowpass filtering Common in functional connectivity analysis since it allows you to focus on a specific frequency Not typically used in standard fmri analyses Model with HP filter Parameter of interest = Model with HP filter Parameter of interest = Use contrast c=(1 0 0 0 0 0 0 0 ) 20

Convolution & HP filter Punch 2: Prewhitening Highpass filtering Analogous to using a roller to paint a wall you can t get the edges very neatly Prewhitening More precise estimate of correlation like using a brush for the edges Prewhitening Remember Gauss Markov? If our errors are distributed with mean 0, constant variance and not temporally autocorrelated then our estimates are unbiased and have the smallest variance of all unbiased estimators. Uh oh, (even after highpass filtering) 21

Find K such that Prewhitening Premultiply GLM by K Find K such that Prewhitening Premultiply GLM by K Awesome! G-M holds for our *new* model Whitening OLS can be used on whitened model 22

Prewhitening Step 1: Fit the linear regression ignoring temporal autocorrelation Step 2: Use residuals from first regression to estimate temporal autocorrelation to obtain K Step 3: Create prewhitened model and estimate in usual way Estimating V We don t know V, so we estimate it There s a bias problem. SPM uses a global covariance estimate to help with this FSL uses a local estimate, but smoothes it Tends to follow 1/f trend Autoregressive (AR) models fit it well fmri noise 23

Whitening SPM Globally estimates correlation Correlation of time series averaged over voxels Structured correlation estimate Scaled AR(1) with correlation 0.2 plus white noise Whitening SPM Only 2 parameters are estimated Convolution, HP filter, Whitening 24

Scaling Grand Mean Scaling Removes intersession variance in global signal due to changes in gain of scanner amplification Allows us to combine data across subjects Whole 4D data set is scaled by a single number Automatically done in software packages Doesn t change variability between time points Scaling Proportional scaling (Intensity normalization) Forces each volume of 4D dataset to have the same mean Also done by modeling the global signal Idea is to remove background activity Problems can occur if true activation is wide spread Negative activations may result 25

Intensity Normalizaton without with Junghofer et al, 2004, NI Signal is lost and negative activation artifacts Other modeling considerations Adding the derivative of the HRF Adding motion parameters to the model Model HRF & Derivative 1. 2 1 0. 8 0. 6 0. 4 0. 2 0-0. 2 0 5 10 15 26

Shifted HRF 1. 2 Blue line is sum of 1 HRF and its derivative. 0. 8 0. 6 0. 4 0. 2 0-0. 2 0 5 10 15 Temporal derivative We model the derivative, but don t study inferences of it Linquist, et al (NI, 2008) suggest this is a bad idea may lead to bias If there s time later, I ll show you how to incorporate the derivative parameter estimate Collinearity When designing your study, you want your tasks to be uncorrelated Correlation between regressors lowers the efficiency of the parameter estimation Parameter estimates are highly variable Can even flip signs 27

Why is it a problem? Why is it a problem? There are an infinite # of solutions for and etc Collinearity illustration Cor= -0.2 X 1 X 2 Cor=0.96 X 1 X 2 28

Highly variable over experiments Intercept X 1 X 2 Correlated Regressors Inflated for correlated case (green) T statistic Bias can go in either direction Highly variable over experiments Intercept X 1 X 2 Correlated Regressors Inflated for correlated case (green) T statistic Bias can go in either direction Highly variable over experiments Intercept X 1 X 2 Correlated Regressors Inflated for correlated case (green) T statistic stats are smaller (due to inflated variance) 29

Residuals don t change The designs explain the same amount of variability Collinearity You can t fix it after the data have been collected You can t tell from the t statistic if you had collinearity FSL has some diagnostics Absolute value of correlation Bad=white off diagonal Eigenvalues from SVD Bad=near 0 Variance Inflation Factor That matrix is okay, but only points out one type of collinearity What if X 1 =X 2 +X 3? Cor(X 1,X 2 ) or Cor(X 1,X 3 ) won t catch this well 30

Variance Inflation Factor That matrix is okay, but only points out one type of collinearity What if X 1 =X 2 +X 3? Cor(X 1,X 2 ) or Cor(X 1,X 3 ) won t catch this well Oh wait, that thing in the previous bullet point looks like a regression equation! Variance Inflation Factor If one regressor is a linear combination of others, we could see that in a linear regression Assume 3 regressors To check for any collinearity with X 1, run the following regression X 1 = 0 + 1 X 2 + 2 X 3 + Variance Inflation Factor If the regression fits well, then we have a problem Assess fit of regression by correlating X 1 with the estimate of X 1 using the model Cor(X 1, ˆ0 + ˆ1X 2 + ˆ2X 3 )=Cor(X 1, c X 1 ) 31

2 6 4 X 1 X 2 X 3 1 1 0 2 2 0 3 3 0 4 4 0 5 5 0 1 0 1 2 0 2 3 0 3 4 0 4 5 0 5 3 Example cor(x 1, X 2 ) =cor(x 1, X 3 ) =.39 X 1 = 1 X 2 + 2 X 3 What s the solution for the betas? 7X 1 = X c 1 =) cor X 1, c X 1 =1 5 2 6 4 X 1 X 2 X 3 1 1 0 2 2 0 3 3 0 4 4 0 5 5 0 1 0 1 2 0 2 3 0 3 4 0 4 5 0 5 3 Example cor(x 1, X 2 ) =cor(x 1, X 3 ) =.39 X 1 = 1 X 2 + 2 X 3 What s the solution for the betas? 7X 1 = X c 1 =) cor X 1, c X 1 =1 5 2 6 4 X 1 X 2 X 3 1 1 0 2 2 0 3 3 0 4 4 0 5 5 0 1 0 1 2 0 2 3 0 3 4 0 4 5 0 5 3 Example cor(x 1, X 2 ) =cor(x 1, X 3 ) =.39 X 1 = 1 X 2 + 2 X 3 What s the solution for the betas? 7X 1 = X c 1 =) cor X 1, c X 1 =1 5 32

2 6 4 X 1 X 2 X 3 1 1 0 2 2 0 3 3 0 4 4 0 5 5 0 1 0 1 2 0 2 3 0 3 4 0 4 5 0 5 3 Example cor(x 1, X 2 ) =cor(x 1, X 3 ) =.39 X 1 = 1 X 2 + 2 X 3 What s the solution for the betas? 7X 1 = X c 1 =) cor X 1, c X 1 =1 5 What is this correlation? Squaring this gives the model s R 2 value R 2 =[Cor(X 1, c X 1 )] 2 Finish VIF VIF uses the model s R 2 1 VIF = 1 R 2 = 1 1 [cor(x 1, X c 1 )] 2 Goal: VIF < 5 As long as it isn t >10, you re probably okay 33

Summary of collinearity Parameter estimates are highly variable when collinearity is present Technically, inferences are still valid! Outliers could possibly cause problems Power takes a hit due to inflated variance Collinearity is directly related to efficiency You are now first level modeling experts! What canonical HRF will you use in SPM? What does adding the derivative do? How can you incorporate the derivative s parameter estimate in the activation magnitude You are now first level modeling experts! How many parameters does SPM use in the whitening matrix? What two things remove low frequency noise in fmri data? Can you trust p-values from models with collinearity? 34

You are now first level modeling experts! Should you ever model the global mean in task fmri data? Break! 35