Can we interpret weight maps in terms of cognitive/clinical neuroscience? Jessica Schrouff
|
|
- Austen Barker
- 5 years ago
- Views:
Transcription
1 Can we interpret weight maps in terms of cognitive/clinical neuroscience? Jessica Schrouff Computer Science Department, University College London, UK Laboratory of Behavioral and Cognitive Neuroscience, Stanford University, US
2 Pattern Recognition Models v 1 Linear predictive models (classifier or regression) are parameterized by a weight vector w and a bias term b. w t 1 t 2 t 4 t 3 The weight vector w can be expressed as a linear combination of training examples x i. The general equation for making predictions for a test example x * is: w = N å i=1 a i x i v 2 f ( x*) w x* b 25/06/2017 J. Schrouff - OHBM 2017 PR4NI 2
3 How is w computed? w is estimated by solving an optimization problem consisting of a data fit term E and a penalty/regularization term J. min wîr p { E(w)+ lj(w) } Regularization parameter Data fit term or loss function (E): denotes the price we pay when we make mistakes in the predictions (e.g. squared loss, Hinge loss). Regularization term (J): favours certain properties (e.g. sparsity) and improves the generalisation over unseen examples (e.g. L2-norm, L1-norm). Many machine learning algorithms are particular choices of E and J.
4 Weight map or predictive pattern w has the same dimensionality of the input data and can be plotted as an image. Might provide potential insights into brain function or structure that drives the prediction, but interpretation should be done with care!
5 BOLD signal Pattern recognition analysis labels labels labels labels labels Training Predictive function or Predictive pattern: f New subject Testing Predicted label Very different meaning! Mass-univariate analysis (e.g. General Linear Model, GLM) Time Voxel-wise GLM model estimation Independent statistical test at each Voxel Correction for Multiple comparisons Statistical Parametric Map
6 Using the weights for prediction Weight map (w) Predictive function f (x * ) = w x * +b New example (x*) f ( x * f ( x * ) (5 2) (2 1) ( 6 2) ( 1 1) 0 ) f(x * ) is the predicted score for regression or the distance to the decision boundary for classification models.
7 Thresholding the weight map Weight map (w) Predictive function f (x * ) = w x * +b New example (x*) f ( x * f ( x * ) (5 2) (0 1) ( 6 2) (0 1) 0 ) f(x * ) truncated does not correspond to f(x * )!
8 What do (voxel) weights represent? Signal: - Voxel 1: s n + d(n) - Voxel 2: d(n) Weights: - Voxel 1: w = 1 - Voxel 2: w = -1 s n d(n) c 1 c 2 Haufe et al.,
9 What can affect the magnitude of w? High absolute coefficient/weight in a voxels might be due to: an association between the voxel s signal and the labels multivariate properties (e.g. correlated signal or correlated noise) Zero coefficient/weight in a voxels might be due to: no association between the voxel s signal and the labels a particular type of regularization If the goal is to make local inference about the association between underlying brain signal in a voxel (e.g. activity) and the labels (e.g. experimental condition) further analysis are recommended (e.g. mass univariate GLM) Sato et al and Haufe et al 2014
10 How to interpret the weight vector w? Spatial representation of the predictive function. Shows the contribution of each feature/voxel to the prediction. Multivariate pattern -> All voxels with weights different from zero contribute to the final prediction (no arbitrary threshold should be applied). Mixture of signal of interest and noise
11 Potential strategies to improve interpretability of whole brain predictive patterns A priori 1. Masking 2. Searchlight mapping During model estimation 3. Feature selection 4. Sparse algorithms 5. Atlas based Multiple Kernel Learning (MKL) 6. Using weight stability in model selection A posteriori 7. Atlas based weight summarization 8. Permutation test 9. Transforming weights into activation patterns
12 1. Masking: ROI based on a priori knowledge e.g. Choosing voxels/features based on literature or on previous univariate results (independent dataset). 12
13 1. Masking: ROI based on a priori knowledge Advantages Good accuracy? Anatomically/ functionally relevant Limitations Need an anatomical hypothesis Cannot threshold obtained map Arbitrary choice of region number and size Multiple models = multiple comparisons 13
14 2. Searchlight Mapping Scans the imaged volume with a "searchlight" (e.g. sphere centered at each voxel) and performs a multivariate analysis at each location in the brain (Kriegeskorte et al, 2006) local multivariate pattern
15 2. Searchlight Mapping Advantages Mapping approach -> identifies voxels/neighborhoods that contain predictive or multivariate information Limitations Locally multivariate (does not account for the whole brain network) Arbitrary choice of neighborhood size (e.g. radios of the sphere) Multiple models -> multiple comparison issue Computationally expensive
16 3. Feature selection Selects a subset of informative features according to a specific criterion. Example of feature selection approaches: Univariate t-test Recursive Feature Elimination (RFE, De Martino et al, 2008) RONDINA et al.: SCORS A METHOD BASED ON STABILITY FOR FEATURE SELECTION AND APPING IN NEUROIMAGING 95 Stability selection (SCoRS, Rondina et al, 2014) TABLE V BRAIN REGIONS IMPORTANT TO DISCRIMINATE THE GROUPS FOUND BY SCORS Voxels in white were selected by SCoRS, RFE and t-test (Rondina et al, 2014)
17 3. Feature selection Data driven Advantages Identifies the most relevant voxels for prediction Limitations Computationally expensive (nested cross-validation needs to be implemented to avoid double dipping!)
18 4. Sparse algorithms Choose a regularization term that enforces sparsity or structured sparsity. Examples of sparse and structured sparse approaches: LASSO (Tibshirani, 1996) Elastic-net (Zou and Hastie, 2005) Total Variation (Michel et al, 2011) Graph Laplacian Elastic Net (GraphNET, Grosenick et al, 2011) Sparse Total Variation (Baldassare et al, 2011) Grosenick et al, 2011
19 How regularization affects the weight map? LASSO 86.31% Elastic Net 88.02% Total Variation (TV) 85.79% Weight maps for classifying fmri images during visualization of pleasant vs. unpleasant pictures. All models used a square loss + regularization. Laplacian (LAP) 83.71% Sparse TV 85.86% Sparse LAP 87.05% Baldassarre et al., 2017
20 4. Sparse algorithms Data-driven Advantages Selection embedded in model Limitations Potential instability of the selected features (Baldassarre et al., 2012): different regularization terms = different patterns and similar accuracies. Identifies a subset of predictive voxels
21 5. Atlas based Multiple Kernel Learning (MKL) Parcellation using anatomical atlas Multiple Kernel Learning 116 å f (x * ) = d m f m (x * ) m=1 Region 1 Region 116 The MKL learns and combines different models, represented by different kernels (regions). Sparsity on the kernel combination (e.g. SimpleMKL, Rakotomamonjy, et al. 2008) -> selects a subset of regions. Schrouff, Monteiro et al.(submitted)
22 5. Atlas based Multiple Kernel Learning (MKL) PRoNTo (
23 5. Atlas based Multiple Kernel Learning (MKL) PRoNTo (
24 5. Atlas based Multiple Kernel Learning (MKL) Advantages Limitations Anatomically/ functionally driven L1 regularization -> might not select regions with correlated information Selects a subset of predictive regions Shows the contribution of each region (region weight) and each voxel (voxel weight) for the predictive function Potential instability of the selected regions Atlas dependent
25 6. Using weight stability in model selection Including the weight stability (Baldassarre et al., 2017) or interpretability (reproducibility x representativeness, Kia et al, 2017) in the optimization of hyper-parameters: w 1 w 1 w w 2 2 ~ ς: model performance to optimize δ: model generalization (e.g. accuracy) η: stability or interpretability of weight map 25
26 6. Using weight stability in model selection Advantages Selects model based on both generalization and stability Embedded in model selection Limitations Applies mostly to sparse methods Various parameters to set: how to define η, set w 1 and w 2 Computationally expensive Needs more work to define in which cases it is interesting (e.g. when no trade-off between η and δ)
27 7. Atlas based weight summarization Summarize whole brain weights using a pre-defined atlas (Schrouff et al, 2013). Average the absolute value of the weights within each regions to create a rank according to their contribution to the decision function. NW ROI = å vîroi w v #{v Î ROI} AAL atlas (Tzourio-Mazoyer, 2002)
28 7. Atlas based weight summarization PRoNTo (
29 7. Atlas based weight summarization Advantages Anatomically/ functionally driven Ranks the regions according to their proportional contribution to the predictive function Limitations Not embedded in the model (a posteriori summarization) All regions with weight different from zero contribute to the prediction -> no arbitrary threshold should be applied Atlas dependent
30 8. Permutation test Use permutation to estimate the null distribution of the weights and test the null hypothesis at each voxel to threshold the weights (Mourao-Miranda et al, 2005). Use analytic approximation of the null distribution (Gaonkar and Davatzikos, 2013). p-maps obtained from experimental and analytical permutation tests (Gaonkar and Davatzikos, 2013)
31 8. Permutation test Data driven Advantages Limitations Needs to be corrected for multiple comparison Provides a thresholded weight map -> identifies the most relevant voxels for prediction Computationally expensive
32 9. Deriving activation patterns from weights Derive activation patterns from weights (extraction filters) by transforming backward models into forward models (Haufe et al., 2014) Backward model Forward model (discriminative model) (generative model) W T x(n) = ŝ(n) x(n) = Aŝ(n)+e(n) W : extraction filters A : activation patterns (columns of W) (columns of A) x(n) : data e(n) : noise ŝ(n) : latent factors The activation patterns show in which voxels (channels) the latent factors are reflected.
33 9. Deriving activation patterns from weights Advantages Like in the GLM activation patterns enable neurophysiological interpretation of the coefficients Limitations Depends on the assumptions used to estimate the forward and backward models. The activation patterns do not represent the predictive function (they don t show which voxels/regions are relevant for prediction)
34 Summary Interpretation of machine learning based models can be complex. There are different strategies to improve interpretation, each one has advantages and limitations: Data driven vs. anatomically/functionally driven Multivariate power (local vs. global, sensitivity vs. specificity) Stability of the patterns Computational time
35 Acknowledgements Janaina Mourão-Miranda, UCL, UK Joao M. Monteiro, UCL, UK Maria Joao Rosa, UCL, UK Josef Parvizi and team, Stanford, US PRoNTo team Marie Sklodowska-Curie Actions
36 Useful references Baldassarre, L., Pontil, M. & Mourão-Miranda, J.. «Sparsity is better with stability: combining accuracy and stability for model selection in brain decoding.» Frontiers in Neuroscience: Brain Imaging Methods. (2017) De Martino, F., Valente, G., Staeren, N., Ashburner, J., Goebel, R., & Formisano, E. «Combining multivariate voxel selection and support vector machines for mapping and classification of fmri spatial patterns.» NeuroImage. (2008) 43(1), Gaonkar, B. & Davatzikos, C. «Analytic estimation of statistical significance maps for support vector machine based multivariate image analysis and classification.» NeuroImage. (2013) 78: Grosenick, L., Klingenberg, B., Katovich, K., Knutson, B. & Taylor, J.E. «Interpretable whole-brain prediction analysis with GraphNet.» NeuroImage. (2013) 72, Haufe, S., Meinecke, F., Go rgen, K., Daḧne, S., Haynes, J. D., Blankertz, B. & Bießmann, F. «On the interpretation of weight vectors of linear models in multivariate neuroimaging.» NeuroImage. (2014) 8 7, Kia, S.M., Vega-Pons, S., Weisz, N. & Passerini, A. «Interpretability of Multivariate Brain Maps in Linear Brain Decoding: Definition, and Heuristic Quantification in Multivariate Analysis of MEG Time-Locked Effects.» Frontiers in Neuroscience. (2017) /fnins Kriegeskorte, N., Rainer, G. & Bandettini, P. «Information-based functional brain mapping.» PNAS 103 (2006):
37 Useful references Michel, V., Gramfort, A., Varoquaux, G., Eger, E. & Thirion, B. «Total variation regularization for fmri-based prediction of behavior.» IEEE Trans Med Imaging. (2011) 30, Rakotomamonjy, A., Bach, F., Canu, S. & Grandvalet, Y. «SimpleMKL» Journal of Machine Learning 9 (2008): Rondina J., Hahn T., de Oliveira L., Marquand A., Dresler T., Leitner T., Fallgatter A., Shawe-Taylor J. & Mourao-Miranda J. «SCoRS - a method based on stability for feature selection and mapping in neuroimaging.» IEEE Trans Med Imaging. (2014) Jan:33(1). Sato, J.R., Mourao-Miranda, J., Morais Martin Mda G., Amaro E. Jr., Morettin P.A. & Brammer M.J. «The impact of functional connectivity changes on support vector machines mapping of fmri data.» J Neurosci Methods. (2008) 172(1): Schrouff, J., Cremers, J., Garraux, G., Baldassarre, L., Mourão-Miranda, J. & Phillips, C. «Localizing and comparing weight maps generated from linear kernel machine learning models.» Proceedings of the 3rd workshop on Pattern Recognition in NeuroImaging Tibshirani, R. «Regression shrinkage and selection via the lasso.» Journal of the Royal Statistical Society. 58 (1996): Tzourio-Mazoyer, N., et al. «Automated Anatomical Labeling of activations in SPM using a Macroscopic Anatomical Parcellation of the MNI MRI single-subject brain.» NeuroImage 15 (2002): Zou, H., & Hastie, T. «Regularization and variable selection via the elastic net.» J. R. Statist. Soc. B 67 (2005):
Interpreting predictive models in terms of anatomically labelled regions
Interpreting predictive models in terms of anatomically labelled regions Data Feature extraction/selection Model specification Accuracy and significance Model interpretation 2 Data Feature extraction/selection
More informationPa#ern Recogni-on for Neuroimaging Toolbox
Pa#ern Recogni-on for Neuroimaging Toolbox Pa#ern Recogni-on Methods: Basics João M. Monteiro Based on slides from Jessica Schrouff and Janaina Mourão-Miranda PRoNTo course UCL, London, UK 2017 Outline
More informationPattern Recognition for Neuroimaging Data
Pattern Recognition for Neuroimaging Data Edinburgh, SPM course April 2013 C. Phillips, Cyclotron Research Centre, ULg, Belgium http://www.cyclotron.ulg.ac.be Overview Introduction Univariate & multivariate
More informationMULTIVARIATE ANALYSES WITH fmri DATA
MULTIVARIATE ANALYSES WITH fmri DATA Sudhir Shankar Raman Translational Neuromodeling Unit (TNU) Institute for Biomedical Engineering University of Zurich & ETH Zurich Motivation Modelling Concepts Learning
More informationPattern Recognition for Neuroimaging Toolbox: PRoNTo
Click to edit Master title style Pattern Recognition for Neuroimaging Toolbox: PRoNTo Jessica Schrouff PRNI 2018 June 14 th NUS, Singapore Click Outline to edit Master title style PRoNTo s goals and history
More informationx 1 SpaceNet: Multivariate brain decoding and segmentation Elvis DOHMATOB (Joint work with: M. EICKENBERG, B. THIRION, & G. VAROQUAUX) 75 x y=20
SpaceNet: Multivariate brain decoding and segmentation Elvis DOHMATOB (Joint work with: M. EICKENBERG, B. THIRION, & G. VAROQUAUX) L R 75 x 2 38 0 y=20-38 -75 x 1 1 Introducing the model 2 1 Brain decoding
More informationVoxel selection algorithms for fmri
Voxel selection algorithms for fmri Henryk Blasinski December 14, 2012 1 Introduction Functional Magnetic Resonance Imaging (fmri) is a technique to measure and image the Blood- Oxygen Level Dependent
More informationMultivariate pattern classification
Multivariate pattern classification Thomas Wolbers Space & Ageing Laboratory (www.sal.mvm.ed.ac.uk) Centre for Cognitive and Neural Systems & Centre for Cognitive Ageing and Cognitive Epidemiology Outline
More informationMulti-Voxel Pattern Analysis (MVPA) for fmri
MVPA for - and Multi-Voxel Pattern Analysis (MVPA) for 7 th M-BIC Workshop 2012 Maastricht University, Department of Cognitive Neuroscience Maastricht Brain Imaging Center (M-Bic), Maastricht, The Netherlands
More informationMultivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems
Multivariate Pattern Classification Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems Outline WHY PATTERN CLASSIFICATION? PROCESSING STREAM PREPROCESSING / FEATURE REDUCTION
More informationMulti-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns
Multi-voxel pattern analysis: Decoding Mental States from fmri Activity Patterns Artwork by Leon Zernitsky Jesse Rissman NITP Summer Program 2012 Part 1 of 2 Goals of Multi-voxel Pattern Analysis Decoding
More informationCorrection for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh
Correction for multiple comparisons Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh Overview Multiple comparisons correction procedures Levels of inferences (set, cluster, voxel) Circularity issues
More informationIntroduction to Neuroimaging Janaina Mourao-Miranda
Introduction to Neuroimaging Janaina Mourao-Miranda Neuroimaging techniques have changed the way neuroscientists address questions about functional anatomy, especially in relation to behavior and clinical
More informationIdentifying predictive regions from fmri with TV-L1 prior
Identifying predictive regions from fmri with TV-L1 prior Alexandre Gramfort, Bertrand Thirion, Gaël Varoquaux To cite this version: Alexandre Gramfort, Bertrand Thirion, Gaël Varoquaux. Identifying predictive
More information7/15/2016 ARE YOUR ANALYSES TOO WHY IS YOUR ANALYSIS PARAMETRIC? PARAMETRIC? That s not Normal!
ARE YOUR ANALYSES TOO PARAMETRIC? That s not Normal! Martin M Monti http://montilab.psych.ucla.edu WHY IS YOUR ANALYSIS PARAMETRIC? i. Optimal power (defined as the probability to detect a real difference)
More informationToward a rigorous statistical framework for brain mapping
Toward a rigorous statistical framework for brain mapping Bertrand Thirion, bertrand.thirion@inria.fr 1 Cognitive neuroscience How are cognitive activities affected or controlled by neural circuits in
More informationMultivariate analyses & decoding
Multivariate analyses & decoding Kay Henning Brodersen Computational Neuroeconomics Group Department of Economics, University of Zurich Machine Learning and Pattern Recognition Group Department of Computer
More informationCS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series
CS 229 Final Project Report Learning to Decode Cognitive States of Rat using Functional Magnetic Resonance Imaging Time Series Jingyuan Chen //Department of Electrical Engineering, cjy2010@stanford.edu//
More informationMultiVariate Bayesian (MVB) decoding of brain images
MultiVariate Bayesian (MVB) decoding of brain images Alexa Morcom Edinburgh SPM course 2015 With thanks to J. Daunizeau, K. Brodersen for slides stimulus behaviour encoding of sensorial or cognitive state?
More informationNeuroImage 43 (2008) Contents lists available at ScienceDirect. NeuroImage. journal homepage:
NeuroImage 43 (2008) 44 58 Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Combining multivariate voxel selection and support vector machines for mapping
More informationSpeeding-up model-selection in GraphNet via early-stopping and univariate feature-screening
Speeding-up model-selection in GraphNet via early-stopping and univariate feature-screening Elvis Dohmatob, Michael Eickenberg, Bertrand Thirion, Gaël Varoquaux To cite this version: Elvis Dohmatob, Michael
More informationMethods for data preprocessing
Methods for data preprocessing John Ashburner Wellcome Trust Centre for Neuroimaging, 12 Queen Square, London, UK. Overview Voxel-Based Morphometry Morphometry in general Volumetrics VBM preprocessing
More informationBayesian Inference in fmri Will Penny
Bayesian Inference in fmri Will Penny Bayesian Approaches in Neuroscience Karolinska Institutet, Stockholm February 2016 Overview Posterior Probability Maps Hemodynamic Response Functions Population
More informationMultiresponse Sparse Regression with Application to Multidimensional Scaling
Multiresponse Sparse Regression with Application to Multidimensional Scaling Timo Similä and Jarkko Tikka Helsinki University of Technology, Laboratory of Computer and Information Science P.O. Box 54,
More informationLearning Tensor-Based Features for Whole-Brain fmri Classification
Learning Tensor-Based Features for Whole-Brain fmri Classification Xiaonan Song, Lingnan Meng, Qiquan Shi, and Haiping Lu Department of Computer Science, Hong Kong Baptist University haiping@hkbu.edu.hk
More informationNetwork Lasso: Clustering and Optimization in Large Graphs
Network Lasso: Clustering and Optimization in Large Graphs David Hallac, Jure Leskovec, Stephen Boyd Stanford University September 28, 2015 Convex optimization Convex optimization is everywhere Introduction
More informationBasic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy
Basic Introduction to Data Analysis Block Design Demonstration Robert Savoy Sample Block Design Experiment Demonstration Use of Visual and Motor Task Separability of Responses Combined Visual and Motor
More informationVariable Selection 6.783, Biomedical Decision Support
6.783, Biomedical Decision Support (lrosasco@mit.edu) Department of Brain and Cognitive Science- MIT November 2, 2009 About this class Why selecting variables Approaches to variable selection Sparsity-based
More informationDecoding analyses. Neuroimaging: Pattern Analysis 2017
Decoding analyses Neuroimaging: Pattern Analysis 2017 Scope Conceptual explanation of decoding/machine learning (ML) and related concepts; Not about mathematical basis of ML algorithms! Learning goals
More informationFeature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani
Feature Selection CE-725: Statistical Pattern Recognition Sharif University of Technology Spring 2013 Soleymani Outline Dimensionality reduction Feature selection vs. feature extraction Filter univariate
More informationMachine Learning Practical NITP Summer Course Pamela K. Douglas UCLA Semel Institute
Machine Learning Practical NITP Summer Course 2013 Pamela K. Douglas UCLA Semel Institute Email: pamelita@g.ucla.edu Topics Covered Part I: WEKA Basics J Part II: MONK Data Set & Feature Selection (from
More informationDeriving statistical significance maps for SVM based image classification and group comparisons
Deriving statistical significance maps for SVM based image classification and group comparisons Bilwaj Gaonkar, Christos Davatzikos Section for Biomedical Image Analysis, University of Pennsylvania, Philadelphia,
More informationA spatial constrained multi target regression model for human brain activity prediction
Wen and Li Appl Inform (216) 3:1 DOI 1.1186/s4535-16-26-x RESEARCH Open Access A spatial constrained multi target regression model for human brain activity prediction Zhenfu Wen 1,2 and Yuanqing Li 1,2*
More informationLinear Methods for Regression and Shrinkage Methods
Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares Input vectors
More informationFeature Selection for fmri Classification
Feature Selection for fmri Classification Chuang Wu Program of Computational Biology Carnegie Mellon University Pittsburgh, PA 15213 chuangw@andrew.cmu.edu Abstract The functional Magnetic Resonance Imaging
More informationRobustness of Sparse Regression Models in fmri Data Analysis
Robustness of Sparse Regression Models in fmri Data Analysis Melissa K. Carroll Department of Computer Science Princeton University Princeton, NJ 854 Guillermo A. Cecchi Yorktown Heights, NY 1598 Irina
More informationGradient-based Representational Similarity Analysis with Searchlight for Analyzing fmri Data
Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fmri Data Xiaoliang Sheng,Muhammad Yousefnezhad,Tonglin Xu,Ning Yuan, and Daoqiang Zhang College of Computer Science and
More informationarxiv: v1 [stat.ap] 15 Jun 2018
Statistical Inference with Ensemble of Clustered Desparsified Lasso Jérôme-Alexis Chevalier 1,2, Joseph Salmon 2, and Bertrand Thirion 1 1 Parietal Team, Inria, CEA, Paris-Saclay University, France 2 Telecom
More informationPractical OmicsFusion
Practical OmicsFusion Introduction In this practical, we will analyse data, from an experiment which aim was to identify the most important metabolites that are related to potato flesh colour, from an
More informationLasso Regression: Regularization for feature selection
Lasso Regression: Regularization for feature selection CSE 416: Machine Learning Emily Fox University of Washington April 12, 2018 Symptom of overfitting 2 Often, overfitting associated with very large
More informationNeuroimage Processing
Neuroimage Processing Instructor: Moo K. Chung mkchung@wisc.edu Lecture 2-3. General Linear Models (GLM) Voxel-based Morphometry (VBM) September 11, 2009 What is GLM The general linear model (GLM) is a
More informationLeveling Up as a Data Scientist. ds/2014/10/level-up-ds.jpg
Model Optimization Leveling Up as a Data Scientist http://shorelinechurch.org/wp-content/uploa ds/2014/10/level-up-ds.jpg Bias and Variance Error = (expected loss of accuracy) 2 + flexibility of model
More informationUnderstanding multivariate pattern analysis for neuroimaging applications
Understanding multivariate pattern analysis for neuroimaging applications Dimitri Van De Ville Medical Image Processing Lab, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne Department
More informationMedical Image Analysis
Medical Image Analysis Instructor: Moo K. Chung mchung@stat.wisc.edu Lecture 10. Multiple Comparisons March 06, 2007 This lecture will show you how to construct P-value maps fmri Multiple Comparisons 4-Dimensional
More informationSecond revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math
Second revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math Rajeev D. S. Raizada, Feng Ming Tsao, Huei-Mei Liu, Ian D. Holloway,
More informationSegmentation of Brain MR Images via Sparse Patch Representation
Segmentation of Brain MR Images via Sparse Patch Representation Tong Tong 1, Robin Wolz 1, Joseph V. Hajnal 2, and Daniel Rueckert 1 1 Department of Computing, Imperial College London, London, UK 2 MRC
More informationClassification of Whole Brain fmri Activation Patterns. Serdar Kemal Balcı
Classification of Whole Brain fmri Activation Patterns by Serdar Kemal Balcı Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the
More informationNeuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry
Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry Nivedita Agarwal, MD Nivedita.agarwal@apss.tn.it Nivedita.agarwal@unitn.it Volume and surface morphometry Brain volume White matter
More informationCSC 411: Lecture 02: Linear Regression
CSC 411: Lecture 02: Linear Regression Raquel Urtasun & Rich Zemel University of Toronto Sep 16, 2015 Urtasun & Zemel (UofT) CSC 411: 02-Regression Sep 16, 2015 1 / 16 Today Linear regression problem continuous
More informationTemporal Feature Selection for fmri Analysis
Temporal Feature Selection for fmri Analysis Mark Palatucci School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania markmp@cmu.edu ABSTRACT Recent work in neuroimaging has shown
More informationStatistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015
Statistical Analysis of Neuroimaging Data Phebe Kemmer BIOS 516 Sept 24, 2015 Review from last time Structural Imaging modalities MRI, CAT, DTI (diffusion tensor imaging) Functional Imaging modalities
More informationRANDOM SUBSPACE METHOD IN CLASSIFICAnON AND MAPPING OF FMRI DATA PATTERNS
RANDOM SUBSPACE METHOD IN CLASSIFICAnON AND MAPPING OF FMRI DATA PATTERNS by Tianwen Chen A Dissertation Submitted to the Graduate Faculty of George Mason University in Partial Fulfillment of The Requirements
More informationMachine Learning / Jan 27, 2010
Revisiting Logistic Regression & Naïve Bayes Aarti Singh Machine Learning 10-701/15-781 Jan 27, 2010 Generative and Discriminative Classifiers Training classifiers involves learning a mapping f: X -> Y,
More informationA Functional Connectivity Inspired Approach to Non-Local fmri Analysis
A Functional Connectivity Inspired Approach to Non-Local fmri Analysis Anders Eklund, Mats Andersson and Hans Knutsson Linköping University Post Print N.B.: When citing this work, cite the original article.
More informationFast or furious? - User analysis of SF Express Inc
CS 229 PROJECT, DEC. 2017 1 Fast or furious? - User analysis of SF Express Inc Gege Wen@gegewen, Yiyuan Zhang@yiyuan12, Kezhen Zhao@zkz I. MOTIVATION The motivation of this project is to predict the likelihood
More informationRegularized Tensor Factorizations & Higher-Order Principal Components Analysis
Regularized Tensor Factorizations & Higher-Order Principal Components Analysis Genevera I. Allen Department of Statistics, Rice University, Department of Pediatrics-Neurology, Baylor College of Medicine,
More informationPreprocessing II: Between Subjects John Ashburner
Preprocessing II: Between Subjects John Ashburner Pre-processing Overview Statistics or whatever fmri time-series Anatomical MRI Template Smoothed Estimate Spatial Norm Motion Correct Smooth Coregister
More informationUnderstanding multivariate pattern analysis for neuroimaging applications
Understanding multivariate pattern analysis for neuroimaging applications Maria Giulia Preti Dimitri Van De Ville Medical Image Processing Lab, Institute of Bioengineering, Ecole Polytechnique Fédérale
More informationNeuroImage. Sparse logistic regression for whole-brain classification of fmri data
YNIMG-07076; No. of pages: 13; 4C: NeuroImage xxx (2010) xxx xxx Contents lists available at ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Sparse logistic regression for whole-brain
More informationQuality Checking an fmri Group Result (art_groupcheck)
Quality Checking an fmri Group Result (art_groupcheck) Paul Mazaika, Feb. 24, 2009 A statistical parameter map of fmri group analyses relies on the assumptions of the General Linear Model (GLM). The assumptions
More informationThe organization of the human cerebral cortex estimated by intrinsic functional connectivity
1 The organization of the human cerebral cortex estimated by intrinsic functional connectivity Journal: Journal of Neurophysiology Author: B. T. Thomas Yeo, et al Link: https://www.ncbi.nlm.nih.gov/pubmed/21653723
More informationIntroductory Concepts for Voxel-Based Statistical Analysis
Introductory Concepts for Voxel-Based Statistical Analysis John Kornak University of California, San Francisco Department of Radiology and Biomedical Imaging Department of Epidemiology and Biostatistics
More informationGene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients
1 Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1,2 Keyue Ding, Ph.D. Nov. 8, 2014 1 NCIC Clinical Trials Group, Kingston, Ontario, Canada 2 Dept. Public
More informationLearning with infinitely many features
Learning with infinitely many features R. Flamary, Joint work with A. Rakotomamonjy F. Yger, M. Volpi, M. Dalla Mura, D. Tuia Laboratoire Lagrange, Université de Nice Sophia Antipolis December 2012 Example
More informationPredicting the Word from Brain Activity
CS771A Semester Project 25th November, 2016 Aman Garg Apurv Gupta Ayushya Agarwal Gopichand Kotana Kushal Kumar 14073 14124 14168 14249 14346 Abstract In this project, our objective was to predict the
More informationQuick Manual for Sparse Logistic Regression ToolBox ver1.2.1
Quick Manual for Sparse Logistic Regression ToolBox ver1.2.1 By Okito Yamashita Updated in Feb.2011 First version : Sep.2009 The Sparse Logistic Regression toolbox (SLR toolbox hereafter) is a suite of
More informationClassification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging
1 CS 9 Final Project Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging Feiyu Chen Department of Electrical Engineering ABSTRACT Subject motion is a significant
More informationMapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.
Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008. Introduction There is much that is unknown regarding
More informationCHAPTER 2. Morphometry on rodent brains. A.E.H. Scheenstra J. Dijkstra L. van der Weerd
CHAPTER 2 Morphometry on rodent brains A.E.H. Scheenstra J. Dijkstra L. van der Weerd This chapter was adapted from: Volumetry and other quantitative measurements to assess the rodent brain, In vivo NMR
More informationComputational Neuroanatomy
Computational Neuroanatomy John Ashburner john@fil.ion.ucl.ac.uk Smoothing Motion Correction Between Modality Co-registration Spatial Normalisation Segmentation Morphometry Overview fmri time-series kernel
More informationNeuroImage. Interpretable whole-brain prediction analysis with GraphNet
NeuroImage 72 (2013) 304 321 Contents lists available at SciVerse ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Interpretable whole-brain prediction analysis with GraphNet Logan
More informationMachine Learning Feature Creation and Selection
Machine Learning Feature Creation and Selection Jeff Howbert Introduction to Machine Learning Winter 2012 1 Feature creation Well-conceived new features can sometimes capture the important information
More informationI How does the formulation (5) serve the purpose of the composite parameterization
Supplemental Material to Identifying Alzheimer s Disease-Related Brain Regions from Multi-Modality Neuroimaging Data using Sparse Composite Linear Discrimination Analysis I How does the formulation (5)
More informationSGN (4 cr) Chapter 10
SGN-41006 (4 cr) Chapter 10 Feature Selection and Extraction Jussi Tohka & Jari Niemi Department of Signal Processing Tampere University of Technology February 18, 2014 J. Tohka & J. Niemi (TUT-SGN) SGN-41006
More informationGraphical Models, Bayesian Method, Sampling, and Variational Inference
Graphical Models, Bayesian Method, Sampling, and Variational Inference With Application in Function MRI Analysis and Other Imaging Problems Wei Liu Scientific Computing and Imaging Institute University
More informationMultivariate fmri Analysis using Canonical Correlation Analysis instead of Classifiers, Comment on Todd et al.
Multivariate fmri Analysis using Canonical Correlation Analysis instead of Classifiers, Comment on Todd et al. Anders Eklund a, Hans Knutsson bc a Virginia Tech Carilion Research Institute, Virginia Tech,
More informationGenerative and discriminative classification techniques
Generative and discriminative classification techniques Machine Learning and Category Representation 013-014 Jakob Verbeek, December 13+0, 013 Course website: http://lear.inrialpes.fr/~verbeek/mlcr.13.14
More informationData mining for neuroimaging data. John Ashburner
Data mining for neuroimaging data John Ashburner MODELLING The Scientific Process MacKay, David JC. Bayesian interpolation. Neural computation 4, no. 3 (1992): 415-447. Model Selection Search for the best
More informationEffectiveness of Sparse Features: An Application of Sparse PCA
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationSparsity Based Regularization
9.520: Statistical Learning Theory and Applications March 8th, 200 Sparsity Based Regularization Lecturer: Lorenzo Rosasco Scribe: Ioannis Gkioulekas Introduction In previous lectures, we saw how regularization
More informationIncorporating Prior Information with Fused Sparse Group Lasso: Application to Prediction of Clinical Measures from Neuroimages
arxiv:181.6594v3 [stat.me] 23 Feb 218 Incorporating Prior Information with Fused Sparse Group Lasso: Application to Prediction of Clinical Measures from Neuroimages Joanne C. Beer Department of Biostatistics,
More informationSUPERVISED LEARNING METHODS. Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018
SUPERVISED LEARNING METHODS Stanley Liang, PhD Candidate, Lassonde School of Engineering, York University Helix Science Engagement Programs 2018 2 CHOICE OF ML You cannot know which algorithm will work
More informationYelp Recommendation System
Yelp Recommendation System Jason Ting, Swaroop Indra Ramaswamy Institute for Computational and Mathematical Engineering Abstract We apply principles and techniques of recommendation systems to develop
More informationBayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis
Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Xavier Le Faucheur a, Brani Vidakovic b and Allen Tannenbaum a a School of Electrical and Computer Engineering, b Department of Biomedical
More informationNetwork statistics and thresholding
Network statistics and thresholding Andrew Zalesky azalesky@unimelb.edu.au HBM Educational Course June 25, 2017 Network thresholding Unthresholded Moderate thresholding Severe thresholding Strong link
More informationFeatures: representation, normalization, selection. Chapter e-9
Features: representation, normalization, selection Chapter e-9 1 Features Distinguish between instances (e.g. an image that you need to classify), and the features you create for an instance. Features
More informationPredictive Analytics: Demystifying Current and Emerging Methodologies. Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA
Predictive Analytics: Demystifying Current and Emerging Methodologies Tom Kolde, FCAS, MAAA Linda Brobeck, FCAS, MAAA May 18, 2017 About the Presenters Tom Kolde, FCAS, MAAA Consulting Actuary Chicago,
More informationarxiv: v2 [cs.cv] 7 Jun 2015
Randomized Structural Sparsity via Constrained Block Subsampling for Improved Sensitivity of Discriminative Voxel Identification arxiv:1410.4650v2 [cs.cv] 7 Jun 2015 Yilun Wang a,b,c, Junjie Zheng b, Sheng
More informationPredictive Analysis: Evaluation and Experimentation. Heejun Kim
Predictive Analysis: Evaluation and Experimentation Heejun Kim June 19, 2018 Evaluation and Experimentation Evaluation Metrics Cross-Validation Significance Tests Evaluation Predictive analysis: training
More informationAutomatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge
Automatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge Christian Wasserthal 1, Karin Engel 1, Karsten Rink 1 und André Brechmann
More informationMultiple Testing and Thresholding
Multiple Testing and Thresholding UCLA Advanced NeuroImaging Summer School, 2007 Thanks for the slides Tom Nichols! Overview Multiple Testing Problem Which of my 100,000 voxels are active? Two methods
More informationApplied Statistics for Neuroscientists Part IIa: Machine Learning
Applied Statistics for Neuroscientists Part IIa: Machine Learning Dr. Seyed-Ahmad Ahmadi 04.04.2017 16.11.2017 Outline Machine Learning Difference between statistics and machine learning Modeling the problem
More informationGroup Sta*s*cs in MEG/EEG
Group Sta*s*cs in MEG/EEG Will Woods NIF Fellow Brain and Psychological Sciences Research Centre Swinburne University of Technology A Cau*onary tale. A Cau*onary tale. A Cau*onary tale. Overview Introduc*on
More informationA Reduced-Dimension fmri! Shared Response Model
A Reduced-Dimension fmri! Shared Response Model Po-Hsuan! (Cameron)! Chen 1! Janice! Chen 2! Yaara! Yeshurun 2! Uri! Hasson 2! James! Haxby 3! Peter! Ramadge 1! 1 Department of Electrical Engineering,
More informationStability of Feature Selection Algorithms
Stability of Feature Selection Algorithms Alexandros Kalousis, Jullien Prados, Phong Nguyen Melanie Hilario Artificial Intelligence Group Department of Computer Science University of Geneva Stability of
More informationNaïve Bayes, Gaussian Distributions, Practical Applications
Naïve Bayes, Gaussian Distributions, Practical Applications Required reading: Mitchell draft chapter, sections 1 and 2. (available on class website) Machine Learning 10-601 Tom M. Mitchell Machine Learning
More informationAn Improved Optimization Method for the Relevance Voxel Machine
An Improved Optimization Method for the Relevance Voxel Machine Melanie Ganz 1,, Mert R. Sabuncu 1, and Koen Van Leemput 1,3,4 1 Martinos Center for Biomedical Imaging, MGH, Harvard Medical School, USA
More informationTotal Variation meets Sparsity: statistical learning with segmenting penalties
Total Variation meets Sparsity: statistical learning with segmenting penalties Michael Eickenberg, Elvis Dohmatob, Bertrand Thirion, Gaël Varoquaux To cite this version: Michael Eickenberg, Elvis Dohmatob,
More informationABSTRACT 1. INTRODUCTION 2. METHODS
Finding Seeds for Segmentation Using Statistical Fusion Fangxu Xing *a, Andrew J. Asman b, Jerry L. Prince a,c, Bennett A. Landman b,c,d a Department of Electrical and Computer Engineering, Johns Hopkins
More informationPixels to Voxels: Modeling Visual Representation in the Human Brain
Pixels to Voxels: Modeling Visual Representation in the Human Brain Authors: Pulkit Agrawal, Dustin Stansbury, Jitendra Malik, Jack L. Gallant Presenters: JunYoung Gwak, Kuan Fang Outlines Background Motivation
More information