Can we interpret weight maps in terms of cognitive/clinical neuroscience? Jessica Schrouff

Size: px
Start display at page:

Download "Can we interpret weight maps in terms of cognitive/clinical neuroscience? Jessica Schrouff"

Transcription

1 Can we interpret weight maps in terms of cognitive/clinical neuroscience? Jessica Schrouff Computer Science Department, University College London, UK Laboratory of Behavioral and Cognitive Neuroscience, Stanford University, US

2 Pattern Recognition Models v 1 Linear predictive models (classifier or regression) are parameterized by a weight vector w and a bias term b. w t 1 t 2 t 4 t 3 The weight vector w can be expressed as a linear combination of training examples x i. The general equation for making predictions for a test example x * is: w = N å i=1 a i x i v 2 f ( x*) w x* b 25/06/2017 J. Schrouff - OHBM 2017 PR4NI 2

3 How is w computed? w is estimated by solving an optimization problem consisting of a data fit term E and a penalty/regularization term J. min wîr p { E(w)+ lj(w) } Regularization parameter Data fit term or loss function (E): denotes the price we pay when we make mistakes in the predictions (e.g. squared loss, Hinge loss). Regularization term (J): favours certain properties (e.g. sparsity) and improves the generalisation over unseen examples (e.g. L2-norm, L1-norm). Many machine learning algorithms are particular choices of E and J.

4 Weight map or predictive pattern w has the same dimensionality of the input data and can be plotted as an image. Might provide potential insights into brain function or structure that drives the prediction, but interpretation should be done with care!

5 BOLD signal Pattern recognition analysis labels labels labels labels labels Training Predictive function or Predictive pattern: f New subject Testing Predicted label Very different meaning! Mass-univariate analysis (e.g. General Linear Model, GLM) Time Voxel-wise GLM model estimation Independent statistical test at each Voxel Correction for Multiple comparisons Statistical Parametric Map

6 Using the weights for prediction Weight map (w) Predictive function f (x * ) = w x * +b New example (x*) f ( x * f ( x * ) (5 2) (2 1) ( 6 2) ( 1 1) 0 ) f(x * ) is the predicted score for regression or the distance to the decision boundary for classification models.

7 Thresholding the weight map Weight map (w) Predictive function f (x * ) = w x * +b New example (x*) f ( x * f ( x * ) (5 2) (0 1) ( 6 2) (0 1) 0 ) f(x * ) truncated does not correspond to f(x * )!

8 What do (voxel) weights represent? Signal: - Voxel 1: s n + d(n) - Voxel 2: d(n) Weights: - Voxel 1: w = 1 - Voxel 2: w = -1 s n d(n) c 1 c 2 Haufe et al.,

9 What can affect the magnitude of w? High absolute coefficient/weight in a voxels might be due to: an association between the voxel s signal and the labels multivariate properties (e.g. correlated signal or correlated noise) Zero coefficient/weight in a voxels might be due to: no association between the voxel s signal and the labels a particular type of regularization If the goal is to make local inference about the association between underlying brain signal in a voxel (e.g. activity) and the labels (e.g. experimental condition) further analysis are recommended (e.g. mass univariate GLM) Sato et al and Haufe et al 2014

10 How to interpret the weight vector w? Spatial representation of the predictive function. Shows the contribution of each feature/voxel to the prediction. Multivariate pattern -> All voxels with weights different from zero contribute to the final prediction (no arbitrary threshold should be applied). Mixture of signal of interest and noise

11 Potential strategies to improve interpretability of whole brain predictive patterns A priori 1. Masking 2. Searchlight mapping During model estimation 3. Feature selection 4. Sparse algorithms 5. Atlas based Multiple Kernel Learning (MKL) 6. Using weight stability in model selection A posteriori 7. Atlas based weight summarization 8. Permutation test 9. Transforming weights into activation patterns

12 1. Masking: ROI based on a priori knowledge e.g. Choosing voxels/features based on literature or on previous univariate results (independent dataset). 12

13 1. Masking: ROI based on a priori knowledge Advantages Good accuracy? Anatomically/ functionally relevant Limitations Need an anatomical hypothesis Cannot threshold obtained map Arbitrary choice of region number and size Multiple models = multiple comparisons 13

14 2. Searchlight Mapping Scans the imaged volume with a "searchlight" (e.g. sphere centered at each voxel) and performs a multivariate analysis at each location in the brain (Kriegeskorte et al, 2006) local multivariate pattern

15 2. Searchlight Mapping Advantages Mapping approach -> identifies voxels/neighborhoods that contain predictive or multivariate information Limitations Locally multivariate (does not account for the whole brain network) Arbitrary choice of neighborhood size (e.g. radios of the sphere) Multiple models -> multiple comparison issue Computationally expensive

16 3. Feature selection Selects a subset of informative features according to a specific criterion. Example of feature selection approaches: Univariate t-test Recursive Feature Elimination (RFE, De Martino et al, 2008) RONDINA et al.: SCORS A METHOD BASED ON STABILITY FOR FEATURE SELECTION AND APPING IN NEUROIMAGING 95 Stability selection (SCoRS, Rondina et al, 2014) TABLE V BRAIN REGIONS IMPORTANT TO DISCRIMINATE THE GROUPS FOUND BY SCORS Voxels in white were selected by SCoRS, RFE and t-test (Rondina et al, 2014)

17 3. Feature selection Data driven Advantages Identifies the most relevant voxels for prediction Limitations Computationally expensive (nested cross-validation needs to be implemented to avoid double dipping!)

18 4. Sparse algorithms Choose a regularization term that enforces sparsity or structured sparsity. Examples of sparse and structured sparse approaches: LASSO (Tibshirani, 1996) Elastic-net (Zou and Hastie, 2005) Total Variation (Michel et al, 2011) Graph Laplacian Elastic Net (GraphNET, Grosenick et al, 2011) Sparse Total Variation (Baldassare et al, 2011) Grosenick et al, 2011

19 How regularization affects the weight map? LASSO 86.31% Elastic Net 88.02% Total Variation (TV) 85.79% Weight maps for classifying fmri images during visualization of pleasant vs. unpleasant pictures. All models used a square loss + regularization. Laplacian (LAP) 83.71% Sparse TV 85.86% Sparse LAP 87.05% Baldassarre et al., 2017

20 4. Sparse algorithms Data-driven Advantages Selection embedded in model Limitations Potential instability of the selected features (Baldassarre et al., 2012): different regularization terms = different patterns and similar accuracies. Identifies a subset of predictive voxels

21 5. Atlas based Multiple Kernel Learning (MKL) Parcellation using anatomical atlas Multiple Kernel Learning 116 å f (x * ) = d m f m (x * ) m=1 Region 1 Region 116 The MKL learns and combines different models, represented by different kernels (regions). Sparsity on the kernel combination (e.g. SimpleMKL, Rakotomamonjy, et al. 2008) -> selects a subset of regions. Schrouff, Monteiro et al.(submitted)

22 5. Atlas based Multiple Kernel Learning (MKL) PRoNTo (

23 5. Atlas based Multiple Kernel Learning (MKL) PRoNTo (

24 5. Atlas based Multiple Kernel Learning (MKL) Advantages Limitations Anatomically/ functionally driven L1 regularization -> might not select regions with correlated information Selects a subset of predictive regions Shows the contribution of each region (region weight) and each voxel (voxel weight) for the predictive function Potential instability of the selected regions Atlas dependent

25 6. Using weight stability in model selection Including the weight stability (Baldassarre et al., 2017) or interpretability (reproducibility x representativeness, Kia et al, 2017) in the optimization of hyper-parameters: w 1 w 1 w w 2 2 ~ ς: model performance to optimize δ: model generalization (e.g. accuracy) η: stability or interpretability of weight map 25

26 6. Using weight stability in model selection Advantages Selects model based on both generalization and stability Embedded in model selection Limitations Applies mostly to sparse methods Various parameters to set: how to define η, set w 1 and w 2 Computationally expensive Needs more work to define in which cases it is interesting (e.g. when no trade-off between η and δ)

27 7. Atlas based weight summarization Summarize whole brain weights using a pre-defined atlas (Schrouff et al, 2013). Average the absolute value of the weights within each regions to create a rank according to their contribution to the decision function. NW ROI = å vîroi w v #{v Î ROI} AAL atlas (Tzourio-Mazoyer, 2002)

28 7. Atlas based weight summarization PRoNTo (

29 7. Atlas based weight summarization Advantages Anatomically/ functionally driven Ranks the regions according to their proportional contribution to the predictive function Limitations Not embedded in the model (a posteriori summarization) All regions with weight different from zero contribute to the prediction -> no arbitrary threshold should be applied Atlas dependent

30 8. Permutation test Use permutation to estimate the null distribution of the weights and test the null hypothesis at each voxel to threshold the weights (Mourao-Miranda et al, 2005). Use analytic approximation of the null distribution (Gaonkar and Davatzikos, 2013). p-maps obtained from experimental and analytical permutation tests (Gaonkar and Davatzikos, 2013)

31 8. Permutation test Data driven Advantages Limitations Needs to be corrected for multiple comparison Provides a thresholded weight map -> identifies the most relevant voxels for prediction Computationally expensive

32 9. Deriving activation patterns from weights Derive activation patterns from weights (extraction filters) by transforming backward models into forward models (Haufe et al., 2014) Backward model Forward model (discriminative model) (generative model) W T x(n) = ŝ(n) x(n) = Aŝ(n)+e(n) W : extraction filters A : activation patterns (columns of W) (columns of A) x(n) : data e(n) : noise ŝ(n) : latent factors The activation patterns show in which voxels (channels) the latent factors are reflected.

33 9. Deriving activation patterns from weights Advantages Like in the GLM activation patterns enable neurophysiological interpretation of the coefficients Limitations Depends on the assumptions used to estimate the forward and backward models. The activation patterns do not represent the predictive function (they don t show which voxels/regions are relevant for prediction)

34 Summary Interpretation of machine learning based models can be complex. There are different strategies to improve interpretation, each one has advantages and limitations: Data driven vs. anatomically/functionally driven Multivariate power (local vs. global, sensitivity vs. specificity) Stability of the patterns Computational time

35 Acknowledgements Janaina Mourão-Miranda, UCL, UK Joao M. Monteiro, UCL, UK Maria Joao Rosa, UCL, UK Josef Parvizi and team, Stanford, US PRoNTo team Marie Sklodowska-Curie Actions

36 Useful references Baldassarre, L., Pontil, M. & Mourão-Miranda, J.. «Sparsity is better with stability: combining accuracy and stability for model selection in brain decoding.» Frontiers in Neuroscience: Brain Imaging Methods. (2017) De Martino, F., Valente, G., Staeren, N., Ashburner, J., Goebel, R., & Formisano, E. «Combining multivariate voxel selection and support vector machines for mapping and classification of fmri spatial patterns.» NeuroImage. (2008) 43(1), Gaonkar, B. & Davatzikos, C. «Analytic estimation of statistical significance maps for support vector machine based multivariate image analysis and classification.» NeuroImage. (2013) 78: Grosenick, L., Klingenberg, B., Katovich, K., Knutson, B. & Taylor, J.E. «Interpretable whole-brain prediction analysis with GraphNet.» NeuroImage. (2013) 72, Haufe, S., Meinecke, F., Go rgen, K., Daḧne, S., Haynes, J. D., Blankertz, B. & Bießmann, F. «On the interpretation of weight vectors of linear models in multivariate neuroimaging.» NeuroImage. (2014) 8 7, Kia, S.M., Vega-Pons, S., Weisz, N. & Passerini, A. «Interpretability of Multivariate Brain Maps in Linear Brain Decoding: Definition, and Heuristic Quantification in Multivariate Analysis of MEG Time-Locked Effects.» Frontiers in Neuroscience. (2017) /fnins Kriegeskorte, N., Rainer, G. & Bandettini, P. «Information-based functional brain mapping.» PNAS 103 (2006):

37 Useful references Michel, V., Gramfort, A., Varoquaux, G., Eger, E. & Thirion, B. «Total variation regularization for fmri-based prediction of behavior.» IEEE Trans Med Imaging. (2011) 30, Rakotomamonjy, A., Bach, F., Canu, S. & Grandvalet, Y. «SimpleMKL» Journal of Machine Learning 9 (2008): Rondina J., Hahn T., de Oliveira L., Marquand A., Dresler T., Leitner T., Fallgatter A., Shawe-Taylor J. & Mourao-Miranda J. «SCoRS - a method based on stability for feature selection and mapping in neuroimaging.» IEEE Trans Med Imaging. (2014) Jan:33(1). Sato, J.R., Mourao-Miranda, J., Morais Martin Mda G., Amaro E. Jr., Morettin P.A. & Brammer M.J. «The impact of functional connectivity changes on support vector machines mapping of fmri data.» J Neurosci Methods. (2008) 172(1): Schrouff, J., Cremers, J., Garraux, G., Baldassarre, L., Mourão-Miranda, J. & Phillips, C. «Localizing and comparing weight maps generated from linear kernel machine learning models.» Proceedings of the 3rd workshop on Pattern Recognition in NeuroImaging Tibshirani, R. «Regression shrinkage and selection via the lasso.» Journal of the Royal Statistical Society. 58 (1996): Tzourio-Mazoyer, N., et al. «Automated Anatomical Labeling of activations in SPM using a Macroscopic Anatomical Parcellation of the MNI MRI single-subject brain.» NeuroImage 15 (2002): Zou, H., & Hastie, T. «Regularization and variable selection via the elastic net.» J. R. Statist. Soc. B 67 (2005):

Interpreting predictive models in terms of anatomically labelled regions

Interpreting predictive models in terms of anatomically labelled regions Interpreting predictive models in terms of anatomically labelled regions Data Feature extraction/selection Model specification Accuracy and significance Model interpretation 2 Data Feature extraction/selection

More information

Pa#ern Recogni-on for Neuroimaging Toolbox

Pa#ern Recogni-on for Neuroimaging Toolbox Pa#ern Recogni-on for Neuroimaging Toolbox Pa#ern Recogni-on Methods: Basics João M. Monteiro Based on slides from Jessica Schrouff and Janaina Mourão-Miranda PRoNTo course UCL, London, UK 2017 Outline

More information

Pattern Recognition for Neuroimaging Data

Pattern Recognition for Neuroimaging Data Pattern Recognition for Neuroimaging Data Edinburgh, SPM course April 2013 C. Phillips, Cyclotron Research Centre, ULg, Belgium http://www.cyclotron.ulg.ac.be Overview Introduction Univariate & multivariate

More information

MULTIVARIATE ANALYSES WITH fmri DATA

MULTIVARIATE ANALYSES WITH fmri DATA MULTIVARIATE ANALYSES WITH fmri DATA Sudhir Shankar Raman Translational Neuromodeling Unit (TNU) Institute for Biomedical Engineering University of Zurich & ETH Zurich Motivation Modelling Concepts Learning

More information

Pattern Recognition for Neuroimaging Toolbox: PRoNTo

Pattern Recognition for Neuroimaging Toolbox: PRoNTo Click to edit Master title style Pattern Recognition for Neuroimaging Toolbox: PRoNTo Jessica Schrouff PRNI 2018 June 14 th NUS, Singapore Click Outline to edit Master title style PRoNTo s goals and history

More information

x 1 SpaceNet: Multivariate brain decoding and segmentation Elvis DOHMATOB (Joint work with: M. EICKENBERG, B. THIRION, & G. VAROQUAUX) 75 x y=20

x 1 SpaceNet: Multivariate brain decoding and segmentation Elvis DOHMATOB (Joint work with: M. EICKENBERG, B. THIRION, & G. VAROQUAUX) 75 x y=20 SpaceNet: Multivariate brain decoding and segmentation Elvis DOHMATOB (Joint work with: M. EICKENBERG, B. THIRION, & G. VAROQUAUX) L R 75 x 2 38 0 y=20-38 -75 x 1 1 Introducing the model 2 1 Brain decoding

More information

Voxel selection algorithms for fmri

Voxel selection algorithms for fmri Voxel selection algorithms for fmri Henryk Blasinski December 14, 2012 1 Introduction Functional Magnetic Resonance Imaging (fmri) is a technique to measure and image the Blood- Oxygen Level Dependent

More information

Multivariate pattern classification

Multivariate pattern classification Multivariate pattern classification Thomas Wolbers Space & Ageing Laboratory (www.sal.mvm.ed.ac.uk) Centre for Cognitive and Neural Systems & Centre for Cognitive Ageing and Cognitive Epidemiology Outline

More information

Multi-Voxel Pattern Analysis (MVPA) for fmri

Multi-Voxel Pattern Analysis (MVPA) for fmri MVPA for - and Multi-Voxel Pattern Analysis (MVPA) for 7 th M-BIC Workshop 2012 Maastricht University, Department of Cognitive Neuroscience Maastricht Brain Imaging Center (M-Bic), Maastricht, The Netherlands

More information

Multivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems

Multivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems Multivariate Pattern Classification Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems Outline WHY PATTERN CLASSIFICATION? PROCESSING STREAM PREPROCESSING / FEATURE REDUCTION

More information

Multi-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns

Multi-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns Multi-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns Artwork by Leon Zernitsky Jesse Rissman NITP Summer Program 2012 Part 1 of 2 Goals of Multi-voxel Pattern Analysis Decoding

More information

Correction for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh

Correction for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh Correction for multiple comparisons Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh Overview Multiple comparisons correction procedures Levels of inferences (set, cluster, voxel) Circularity issues

More information

Introduction to Neuroimaging Janaina Mourao-Miranda

Introduction to Neuroimaging Janaina Mourao-Miranda Introduction to Neuroimaging Janaina Mourao-Miranda Neuroimaging techniques have changed the way neuroscientists address questions about functional anatomy, especially in relation to behavior and clinical

More information

Identifying predictive regions from fmri with TV-L1 prior

Identifying predictive regions from fmri with TV-L1 prior Identifying predictive regions from fmri with TV-L1 prior Alexandre Gramfort, Bertrand Thirion, Gaël Varoquaux To cite this version: Alexandre Gramfort, Bertrand Thirion, Gaël Varoquaux. Identifying predictive

More information

7/15/2016 ARE YOUR ANALYSES TOO WHY IS YOUR ANALYSIS PARAMETRIC? PARAMETRIC? That s not Normal!

7/15/2016 ARE YOUR ANALYSES TOO WHY IS YOUR ANALYSIS PARAMETRIC? PARAMETRIC? That s not Normal! ARE YOUR ANALYSES TOO PARAMETRIC? That s not Normal! Martin M Monti http://montilab.psych.ucla.edu WHY IS YOUR ANALYSIS PARAMETRIC? i. Optimal power (defined as the probability to detect a real difference)

More information

Toward a rigorous statistical framework for brain mapping

Toward a rigorous statistical framework for brain mapping Toward a rigorous statistical framework for brain mapping Bertrand Thirion, bertrand.thirion@inria.fr 1 Cognitive neuroscience How are cognitive activities affected or controlled by neural circuits in

More information

Multivariate analyses & decoding

Multivariate analyses & decoding Multivariate analyses & decoding Kay Henning Brodersen Computational Neuroeconomics Group Department of Economics, University of Zurich Machine Learning and Pattern Recognition Group Department of Computer

More information

CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series

CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series Jingyuan Chen //Department of Electrical Engineering, cjy2010@stanford.edu//

More information

MultiVariate Bayesian (MVB) decoding of brain images

MultiVariate Bayesian (MVB) decoding of brain images MultiVariate Bayesian (MVB) decoding of brain images Alexa Morcom Edinburgh SPM course 2015 With thanks to J. Daunizeau, K. Brodersen for slides stimulus behaviour encoding of sensorial or cognitive state?

More information

NeuroImage 43 (2008) Contents lists available at ScienceDirect. NeuroImage. journal homepage:

NeuroImage 43 (2008) Contents lists available at ScienceDirect. NeuroImage. journal homepage: NeuroImage 43 (2008) 44 58 Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Combining multivariate voxel selection and support vector machines for mapping

More information

Speeding-up model-selection in GraphNet via early-stopping and univariate feature-screening

Speeding-up model-selection in GraphNet via early-stopping and univariate feature-screening Speeding-up model-selection in GraphNet via early-stopping and univariate feature-screening Elvis Dohmatob, Michael Eickenberg, Bertrand Thirion, Gaël Varoquaux To cite this version: Elvis Dohmatob, Michael

More information

Methods for data preprocessing

Methods for data preprocessing Methods for data preprocessing John Ashburner Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK. Overview Voxel-Based Morphometry Morphometry in general Volumetrics VBM preprocessing

More information

Bayesian Inference in fmri Will Penny

Bayesian Inference in fmri Will Penny Bayesian Inference in fmri Will Penny Bayesian Approaches in Neuroscience Karolinska Institutet, Stockholm February 2016 Overview Posterior Probability Maps Hemodynamic Response Functions Population

More information

Multiresponse Sparse Regression with Application to Multidimensional Scaling

Multiresponse Sparse Regression with Application to Multidimensional Scaling Multiresponse Sparse Regression with Application to Multidimensional Scaling Timo Similä and Jarkko Tikka Helsinki University of Technology, Laboratory of Computer and Information Science P.O. Box 54,

More information

Learning Tensor-Based Features for Whole-Brain fmri Classification

Learning Tensor-Based Features for Whole-Brain fmri Classification Learning Tensor-Based Features for Whole-Brain fmri Classification Xiaonan Song, Lingnan Meng, Qiquan Shi, and Haiping Lu Department of Computer Science, Hong Kong Baptist University haiping@hkbu.edu.hk

More information

Network Lasso: Clustering and Optimization in Large Graphs

Network Lasso: Clustering and Optimization in Large Graphs Network Lasso: Clustering and Optimization in Large Graphs David Hallac, Jure Leskovec, Stephen Boyd Stanford University September 28, 2015 Convex optimization Convex optimization is everywhere Introduction

More information

Basic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy

Basic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy Basic Introduction to Data Analysis Block Design Demonstration Robert Savoy Sample Block Design Experiment Demonstration Use of Visual and Motor Task Separability of Responses Combined Visual and Motor

More information

Variable Selection 6.783, Biomedical Decision Support

Variable Selection 6.783, Biomedical Decision Support 6.783, Biomedical Decision Support (lrosasco@mit.edu) Department of Brain and Cognitive Science- MIT November 2, 2009 About this class Why selecting variables Approaches to variable selection Sparsity-based

More information

Decoding analyses. Neuroimaging: Pattern Analysis 2017

Decoding analyses. Neuroimaging: Pattern Analysis 2017 Decoding analyses Neuroimaging: Pattern Analysis 2017 Scope Conceptual explanation of decoding/machine learning (ML) and related concepts; Not about mathematical basis of ML algorithms! Learning goals

More information

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate

More information

Machine Learning Practical NITP Summer Course Pamela K. Douglas UCLA Semel Institute

Machine Learning Practical NITP Summer Course Pamela K. Douglas UCLA Semel Institute Machine Learning Practical NITP Summer Course 2013 Pamela K. Douglas UCLA Semel Institute Email: pamelita@g.ucla.edu Topics Covered Part I: WEKA Basics J Part II: MONK Data Set & Feature Selection (from

More information

Deriving statistical significance maps for SVM based image classification and group comparisons

Deriving statistical significance maps for SVM based image classification and group comparisons Deriving statistical significance maps for SVM based image classification and group comparisons Bilwaj Gaonkar, Christos Davatzikos Section for Biomedical Image Analysis, University of Pennsylvania, Philadelphia,

More information

A spatial constrained multi target regression model for human brain activity prediction

A spatial constrained multi target regression model for human brain activity prediction Wen and Li Appl Inform (216) 3:1 DOI 1.1186/s4535-16-26-x RESEARCH Open Access A spatial constrained multi target regression model for human brain activity prediction Zhenfu Wen 1,2 and Yuanqing Li 1,2*

More information

Linear Methods for Regression and Shrinkage Methods

Linear Methods for Regression and Shrinkage Methods Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors

More information

Feature Selection for fmri Classification

Feature Selection for fmri Classification Feature Selection for fmri Classification Chuang Wu Program of Computational Biology Carnegie Mellon University Pittsburgh, PA 15213 chuangw@andrew.cmu.edu Abstract The functional Magnetic Resonance Imaging

More information

Robustness of Sparse Regression Models in fmri Data Analysis

Robustness of Sparse Regression Models in fmri Data Analysis Robustness of Sparse Regression Models in fmri Data Analysis Melissa K. Carroll Department of Computer Science Princeton University Princeton, NJ 854 Guillermo A. Cecchi Yorktown Heights, NY 1598 Irina

More information

Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fmri Data

Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fmri Data Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fmri Data Xiaoliang Sheng,Muhammad Yousefnezhad,Tonglin Xu,Ning Yuan, and Daoqiang Zhang College of Computer Science and

More information

arxiv: v1 [stat.ap] 15 Jun 2018

arxiv: v1 [stat.ap] 15 Jun 2018 Statistical Inference with Ensemble of Clustered Desparsified Lasso Jérôme-Alexis Chevalier 1,2, Joseph Salmon 2, and Bertrand Thirion 1 1 Parietal Team, Inria, CEA, Paris-Saclay University, France 2 Telecom

More information

Practical OmicsFusion

Practical OmicsFusion Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an

More information

Lasso Regression: Regularization for feature selection

Lasso Regression: Regularization for feature selection Lasso Regression: Regularization for feature selection CSE 416: Machine Learning Emily Fox University of Washington April 12, 2018 Symptom of overfitting 2 Often, overfitting associated with very large

More information

Neuroimage Processing

Neuroimage Processing Neuroimage Processing Instructor: Moo K. Chung mkchung@wisc.edu Lecture 2-3. General Linear Models (GLM) Voxel-based Morphometry (VBM) September 11, 2009 What is GLM The general linear model (GLM) is a

More information

Leveling Up as a Data Scientist. ds/2014/10/level-up-ds.jpg

Leveling Up as a Data Scientist.   ds/2014/10/level-up-ds.jpg Model Optimization Leveling Up as a Data Scientist http://shorelinechurch.org/wp-content/uploa ds/2014/10/level-up-ds.jpg Bias and Variance Error = (expected loss of accuracy) 2 + flexibility of model

More information

Understanding multivariate pattern analysis for neuroimaging applications

Understanding multivariate pattern analysis for neuroimaging applications Understanding multivariate pattern analysis for neuroimaging applications Dimitri Van De Ville Medical Image Processing Lab, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne Department

More information

Medical Image Analysis

Medical Image Analysis Medical Image Analysis Instructor: Moo K. Chung mchung@stat.wisc.edu Lecture 10. Multiple Comparisons March 06, 2007 This lecture will show you how to construct P-value maps fmri Multiple Comparisons 4-Dimensional

More information

Second revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math

Second revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math Second revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math Rajeev D. S. Raizada, Feng Ming Tsao, Huei-Mei Liu, Ian D. Holloway,

More information

Segmentation of Brain MR Images via Sparse Patch Representation

Segmentation of Brain MR Images via Sparse Patch Representation Segmentation of Brain MR Images via Sparse Patch Representation Tong Tong 1, Robin Wolz 1, Joseph V. Hajnal 2, and Daniel Rueckert 1 1 Department of Computing, Imperial College London, London, UK 2 MRC

More information

Classification of Whole Brain fmri Activation Patterns. Serdar Kemal Balcı

Classification of Whole Brain fmri Activation Patterns. Serdar Kemal Balcı Classification of Whole Brain fmri Activation Patterns by Serdar Kemal Balcı Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the

More information

Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry

Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry Nivedita Agarwal, MD Nivedita.agarwal@apss.tn.it Nivedita.agarwal@unitn.it Volume and surface morphometry Brain volume White matter

More information

CSC 411: Lecture 02: Linear Regression

CSC 411: Lecture 02: Linear Regression CSC 411: Lecture 02: Linear Regression Raquel Urtasun & Rich Zemel University of Toronto Sep 16, 2015 Urtasun & Zemel (UofT) CSC 411: 02-Regression Sep 16, 2015 1 / 16 Today Linear regression problem continuous

More information

Temporal Feature Selection for fmri Analysis

Temporal Feature Selection for fmri Analysis Temporal Feature Selection for fmri Analysis Mark Palatucci School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania markmp@cmu.edu ABSTRACT Recent work in neuroimaging has shown

More information

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015 Statistical Analysis of Neuroimaging Data Phebe Kemmer BIOS 516 Sept 24, 2015 Review from last time Structural Imaging modalities MRI, CAT, DTI (diffusion tensor imaging) Functional Imaging modalities

More information

RANDOM SUBSPACE METHOD IN CLASSIFICAnON AND MAPPING OF FMRI DATA PATTERNS

RANDOM SUBSPACE METHOD IN CLASSIFICAnON AND MAPPING OF FMRI DATA PATTERNS RANDOM SUBSPACE METHOD IN CLASSIFICAnON AND MAPPING OF FMRI DATA PATTERNS by Tianwen Chen A Dissertation Submitted to the Graduate Faculty of George Mason University in Partial Fulfillment of The Requirements

More information

Machine Learning / Jan 27, 2010

Machine Learning / Jan 27, 2010 Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,

More information

A Functional Connectivity Inspired Approach to Non-Local fmri Analysis

A Functional Connectivity Inspired Approach to Non-Local fmri Analysis A Functional Connectivity Inspired Approach to Non-Local fmri Analysis Anders Eklund, Mats Andersson and Hans Knutsson Linköping University Post Print N.B.: When citing this work, cite the original article.

More information

Fast or furious? - User analysis of SF Express Inc

Fast or furious? - User analysis of SF Express Inc CS 229 PROJECT, DEC. 2017 1 Fast or furious? - User analysis of SF Express Inc Gege Wen@gegewen, Yiyuan Zhang@yiyuan12, Kezhen Zhao@zkz I. MOTIVATION The motivation of this project is to predict the likelihood

More information

Regularized Tensor Factorizations & Higher-Order Principal Components Analysis

Regularized Tensor Factorizations & Higher-Order Principal Components Analysis Regularized Tensor Factorizations & Higher-Order Principal Components Analysis Genevera I. Allen Department of Statistics, Rice University, Department of Pediatrics-Neurology, Baylor College of Medicine,

More information

Preprocessing II: Between Subjects John Ashburner

Preprocessing II: Between Subjects John Ashburner Preprocessing II: Between Subjects John Ashburner Pre-processing Overview Statistics or whatever fmri time-series Anatomical MRI Template Smoothed Estimate Spatial Norm Motion Correct Smooth Coregister

More information

Understanding multivariate pattern analysis for neuroimaging applications

Understanding multivariate pattern analysis for neuroimaging applications Understanding multivariate pattern analysis for neuroimaging applications Maria Giulia Preti Dimitri Van De Ville Medical Image Processing Lab, Institute of Bioengineering, Ecole Polytechnique Fédérale

More information

NeuroImage. Sparse logistic regression for whole-brain classification of fmri data

NeuroImage. Sparse logistic regression for whole-brain classification of fmri data YNIMG-07076; No. of pages: 13; 4C: NeuroImage xxx (2010) xxx xxx Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Sparse logistic regression for whole-brain

More information

Quality Checking an fmri Group Result (art_groupcheck)

Quality Checking an fmri Group Result (art_groupcheck) Quality Checking an fmri Group Result (art_groupcheck) Paul Mazaika, Feb. 24, 2009 A statistical parameter map of fmri group analyses relies on the assumptions of the General Linear Model (GLM). The assumptions

More information

The organization of the human cerebral cortex estimated by intrinsic functional connectivity

The organization of the human cerebral cortex estimated by intrinsic functional connectivity 1 The organization of the human cerebral cortex estimated by intrinsic functional connectivity Journal: Journal of Neurophysiology Author: B. T. Thomas Yeo, et al Link: https://www.ncbi.nlm.nih.gov/pubmed/21653723

More information

Introductory Concepts for Voxel-Based Statistical Analysis

Introductory Concepts for Voxel-Based Statistical Analysis Introductory Concepts for Voxel-Based Statistical Analysis John Kornak University of California, San Francisco Department of Radiology and Biomedical Imaging Department of Epidemiology and Biostatistics

More information

Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients

Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1 Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1,2 Keyue Ding, Ph.D. Nov. 8, 2014 1 NCIC Clinical Trials Group, Kingston, Ontario, Canada 2 Dept. Public

More information

Learning with infinitely many features

Learning with infinitely many features Learning with infinitely many features R. Flamary, Joint work with A. Rakotomamonjy F. Yger, M. Volpi, M. Dalla Mura, D. Tuia Laboratoire Lagrange, Université de Nice Sophia Antipolis December 2012 Example

More information

Predicting the Word from Brain Activity

Predicting the Word from Brain Activity CS771A Semester Project 25th November, 2016 Aman Garg Apurv Gupta Ayushya Agarwal Gopichand Kotana Kushal Kumar 14073 14124 14168 14249 14346 Abstract In this project, our objective was to predict the

More information

Quick Manual for Sparse Logistic Regression ToolBox ver1.2.1

Quick Manual for Sparse Logistic Regression ToolBox ver1.2.1 Quick Manual for Sparse Logistic Regression ToolBox ver1.2.1 By Okito Yamashita Updated in Feb.2011 First version : Sep.2009 The Sparse Logistic Regression toolbox (SLR toolbox hereafter) is a suite of

More information

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging 1 CS 9 Final Project Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging Feiyu Chen Department of Electrical Engineering ABSTRACT Subject motion is a significant

More information

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Introduction There is much that is unknown regarding

More information

CHAPTER 2. Morphometry on rodent brains. A.E.H. Scheenstra J. Dijkstra L. van der Weerd

CHAPTER 2. Morphometry on rodent brains. A.E.H. Scheenstra J. Dijkstra L. van der Weerd CHAPTER 2 Morphometry on rodent brains A.E.H. Scheenstra J. Dijkstra L. van der Weerd This chapter was adapted from: Volumetry and other quantitative measurements to assess the rodent brain, In vivo NMR

More information

Computational Neuroanatomy

Computational Neuroanatomy Computational Neuroanatomy John Ashburner john@fil.ion.ucl.ac.uk Smoothing Motion Correction Between Modality Co-registration Spatial Normalisation Segmentation Morphometry Overview fmri time-series kernel

More information

NeuroImage. Interpretable whole-brain prediction analysis with GraphNet

NeuroImage. Interpretable whole-brain prediction analysis with GraphNet NeuroImage 72 (2013) 304 321 Contents lists available at SciVerse ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Interpretable whole-brain prediction analysis with GraphNet Logan

More information

Machine Learning Feature Creation and Selection

Machine Learning Feature Creation and Selection Machine Learning Feature Creation and Selection Jeff Howbert Introduction to Machine Learning Winter 2012 1 Feature creation Well-conceived new features can sometimes capture the important information

More information

I How does the formulation (5) serve the purpose of the composite parameterization

I How does the formulation (5) serve the purpose of the composite parameterization Supplemental Material to Identifying Alzheimer s Disease-Related Brain Regions from Multi-Modality Neuroimaging Data using Sparse Composite Linear Discrimination Analysis I How does the formulation (5)

More information

SGN (4 cr) Chapter 10

SGN (4 cr) Chapter 10 SGN-41006 (4 cr) Chapter 10 Feature Selection and Extraction Jussi Tohka & Jari Niemi Department of Signal Processing Tampere University of Technology February 18, 2014 J. Tohka & J. Niemi (TUT-SGN) SGN-41006

More information

Graphical Models, Bayesian Method, Sampling, and Variational Inference

Graphical Models, Bayesian Method, Sampling, and Variational Inference Graphical Models, Bayesian Method, Sampling, and Variational Inference With Application in Function MRI Analysis and Other Imaging Problems Wei Liu Scientific Computing and Imaging Institute University

More information

Multivariate fmri Analysis using Canonical Correlation Analysis instead of Classifiers, Comment on Todd et al.

Multivariate fmri Analysis using Canonical Correlation Analysis instead of Classifiers, Comment on Todd et al. Multivariate fmri Analysis using Canonical Correlation Analysis instead of Classifiers, Comment on Todd et al. Anders Eklund a, Hans Knutsson bc a Virginia Tech Carilion Research Institute, Virginia Tech,

More information

Generative and discriminative classification techniques

Generative and discriminative classification techniques Generative and discriminative classification techniques Machine Learning and Category Representation 013-014 Jakob Verbeek, December 13+0, 013 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.13.14

More information

Data mining for neuroimaging data. John Ashburner

Data mining for neuroimaging data. John Ashburner Data mining for neuroimaging data John Ashburner MODELLING The Scientific Process MacKay, David JC. Bayesian interpolation. Neural computation 4, no. 3 (1992): 415-447. Model Selection Search for the best

More information

Effectiveness of Sparse Features: An Application of Sparse PCA

Effectiveness of Sparse Features: An Application of Sparse PCA 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Sparsity Based Regularization

Sparsity Based Regularization 9.520: Statistical Learning Theory and Applications March 8th, 200 Sparsity Based Regularization Lecturer: Lorenzo Rosasco Scribe: Ioannis Gkioulekas Introduction In previous lectures, we saw how regularization

More information

Incorporating Prior Information with Fused Sparse Group Lasso: Application to Prediction of Clinical Measures from Neuroimages

Incorporating Prior Information with Fused Sparse Group Lasso: Application to Prediction of Clinical Measures from Neuroimages arxiv:181.6594v3 [stat.me] 23 Feb 218 Incorporating Prior Information with Fused Sparse Group Lasso: Application to Prediction of Clinical Measures from Neuroimages Joanne C. Beer Department of Biostatistics,

More information

SUPERVISED LEARNING METHODS. Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018

SUPERVISED LEARNING METHODS. Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018 SUPERVISED LEARNING METHODS Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018 2 CHOICE OF ML You cannot know which algorithm will work

More information

Yelp Recommendation System

Yelp Recommendation System Yelp Recommendation System Jason Ting, Swaroop Indra Ramaswamy Institute for Computational and Mathematical Engineering Abstract We apply principles and techniques of recommendation systems to develop

More information

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Xavier Le Faucheur a, Brani Vidakovic b and Allen Tannenbaum a a School of Electrical and Computer Engineering, b Department of Biomedical

More information

Network statistics and thresholding

Network statistics and thresholding Network statistics and thresholding Andrew Zalesky azalesky@unimelb.edu.au HBM Educational Course June 25, 2017 Network thresholding Unthresholded Moderate thresholding Severe thresholding Strong link

More information

Features: representation, normalization, selection. Chapter e-9

Features: representation, normalization, selection. Chapter e-9 Features: representation, normalization, selection Chapter e-9 1 Features Distinguish between instances (e.g. an image that you need to classify), and the features you create for an instance. Features

More information

Predictive Analytics: Demystifying Current and Emerging Methodologies. Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA

Predictive Analytics: Demystifying Current and Emerging Methodologies. Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA Predictive Analytics: Demystifying Current and Emerging Methodologies Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA May 18, 2017 About the Presenters Tom Kolde, FCAS, MAAA Consulting Actuary Chicago,

More information

arxiv: v2 [cs.cv] 7 Jun 2015

arxiv: v2 [cs.cv] 7 Jun 2015 Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification arxiv:1410.4650v2 [cs.cv] 7 Jun 2015 Yilun Wang a,b,c, Junjie Zheng b, Sheng

More information

Predictive Analysis: Evaluation and Experimentation. Heejun Kim

Predictive Analysis: Evaluation and Experimentation. Heejun Kim Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training

More information

Automatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge

Automatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge Automatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge Christian Wasserthal 1, Karin Engel 1, Karsten Rink 1 und André Brechmann

More information

Multiple Testing and Thresholding

Multiple Testing and Thresholding Multiple Testing and Thresholding UCLA Advanced NeuroImaging Summer School, 2007 Thanks for the slides Tom Nichols! Overview Multiple Testing Problem Which of my 100,000 voxels are active? Two methods

More information

Applied Statistics for Neuroscientists Part IIa: Machine Learning

Applied Statistics for Neuroscientists Part IIa: Machine Learning Applied Statistics for Neuroscientists Part IIa: Machine Learning Dr. Seyed-Ahmad Ahmadi 04.04.2017 16.11.2017 Outline Machine Learning Difference between statistics and machine learning Modeling the problem

More information

Group Sta*s*cs in MEG/EEG

Group Sta*s*cs in MEG/EEG Group Sta*s*cs in MEG/EEG Will Woods NIF Fellow Brain and Psychological Sciences Research Centre Swinburne University of Technology A Cau*onary tale. A Cau*onary tale. A Cau*onary tale. Overview Introduc*on

More information

A Reduced-Dimension fmri! Shared Response Model

A Reduced-Dimension fmri! Shared Response Model A Reduced-Dimension fmri! Shared Response Model Po-Hsuan! (Cameron)! Chen 1! Janice! Chen 2! Yaara! Yeshurun 2! Uri! Hasson 2! James! Haxby 3! Peter! Ramadge 1! 1 Department of Electrical Engineering,

More information

Stability of Feature Selection Algorithms

Stability of Feature Selection Algorithms Stability of Feature Selection Algorithms Alexandros Kalousis, Jullien Prados, Phong Nguyen Melanie Hilario Artificial Intelligence Group Department of Computer Science University of Geneva Stability of

More information

Naïve Bayes, Gaussian Distributions, Practical Applications

Naïve Bayes, Gaussian Distributions, Practical Applications Naïve Bayes, Gaussian Distributions, Practical Applications Required reading: Mitchell draft chapter, sections 1 and 2. (available on class website) Machine Learning 10-601 Tom M. Mitchell Machine Learning

More information

An Improved Optimization Method for the Relevance Voxel Machine

An Improved Optimization Method for the Relevance Voxel Machine An Improved Optimization Method for the Relevance Voxel Machine Melanie Ganz 1,, Mert R. Sabuncu 1, and Koen Van Leemput 1,3,4 1 Martinos Center for Biomedical Imaging, MGH, Harvard Medical School, USA

More information

Total Variation meets Sparsity: statistical learning with segmenting penalties

Total Variation meets Sparsity: statistical learning with segmenting penalties Total Variation meets Sparsity: statistical learning with segmenting penalties Michael Eickenberg, Elvis Dohmatob, Bertrand Thirion, Gaël Varoquaux To cite this version: Michael Eickenberg, Elvis Dohmatob,

More information

ABSTRACT 1. INTRODUCTION 2. METHODS

ABSTRACT 1. INTRODUCTION 2. METHODS Finding Seeds for Segmentation Using Statistical Fusion Fangxu Xing *a, Andrew J. Asman b, Jerry L. Prince a,c, Bennett A. Landman b,c,d a Department of Electrical and Computer Engineering, Johns Hopkins

More information

Pixels to Voxels: Modeling Visual Representation in the Human Brain

Pixels to Voxels: Modeling Visual Representation in the Human Brain Pixels to Voxels: Modeling Visual Representation in the Human Brain Authors: Pulkit Agrawal, Dustin Stansbury, Jitendra Malik, Jack L. Gallant Presenters: JunYoung Gwak, Kuan Fang Outlines Background Motivation

More information