Data mining for neuroimaging data. John Ashburner

Similar documents
Methods for data preprocessing

CHAPTER 2. Morphometry on rodent brains. A.E.H. Scheenstra J. Dijkstra L. van der Weerd

Computational Neuroanatomy

Neuroimaging and mathematical modelling Lesson 2: Voxel Based Morphometry

MULTIVARIATE ANALYSES WITH fmri DATA

Statistical Analysis of Neuroimaging Data. Phebe Kemmer BIOS 516 Sept 24, 2015

Basic fmri Design and Analysis. Preprocessing

Functional MRI in Clinical Research and Practice Preprocessing

Bayesian Inference in fmri Will Penny

SPM8 for Basic and Clinical Investigators. Preprocessing. fmri Preprocessing

Voxel-Based Morphometry & DARTEL. Ged Ridgway, London With thanks to John Ashburner and the FIL Methods Group

1 Introduction Motivation and Aims Functional Imaging Computational Neuroanatomy... 12

EPI Data Are Acquired Serially. EPI Data Are Acquired Serially 10/23/2011. Functional Connectivity Preprocessing. fmri Preprocessing

Structural Segmentation

Structural Segmentation

Preprocessing II: Between Subjects John Ashburner

Image Registration + Other Stuff

MultiVariate Bayesian (MVB) decoding of brain images

Expectation Maximization (EM) and Gaussian Mixture Models

I.e. Sex differences in child appetitive traits and Eating in the Absence of Hunger:

Automatic Registration-Based Segmentation for Neonatal Brains Using ANTs and Atropos

Cognitive States Detection in fmri Data Analysis using incremental PCA

Introduction to Neuroimaging Janaina Mourao-Miranda

Supplementary methods

Machine Learning for Medical Image Analysis. A. Criminisi

Functional MRI data preprocessing. Cyril Pernet, PhD

Methodological progress in image registration for ventilation estimation, segmentation propagation and multi-modal fusion

Pattern Recognition for Neuroimaging Data

SPM8 for Basic and Clinical Investigators. Preprocessing

PATTERN CLASSIFICATION AND SCENE ANALYSIS

Correction for multiple comparisons. Cyril Pernet, PhD SBIRC/SINAPSE University of Edinburgh

Interpreting predictive models in terms of anatomically labelled regions

Quantitative MRI of the Brain: Investigation of Cerebral Gray and White Matter Diseases

Latent Variable Models and Expectation Maximization

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

Computational Methods in NeuroImage Analysis!

Day 3 Lecture 1. Unsupervised Learning

Fundamentals of Digital Image Processing

Computational anatomy with the SPM software

Multivariate Pattern Classification. Thomas Wolbers Space and Aging Laboratory Centre for Cognitive and Neural Systems

Second revision: Supplementary Material Linking brain-wide multivoxel activation patterns to behaviour: examples from language and math

Zurich SPM Course Voxel-Based Morphometry. Ged Ridgway (Oxford & UCL) With thanks to John Ashburner and the FIL Methods Group

Automatic Generation of Training Data for Brain Tissue Classification from MRI

Overview of machine learning

A Nonparametric Bayesian Approach to Detecting Spatial Activation Patterns in fmri Data

UBC SPM Course Voxel-Based Morphometry & DARTEL. Ged Ridgway

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy

Introduction to Medical Image Processing

Introduction to fmri. Pre-processing

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Automatic Generation of Training Data for Brain Tissue Classification from MRI

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Online Statistical Inference for Quantifying Mandible Growth in CT Images

Supervised Learning for Image Segmentation

MEDICAL IMAGE ANALYSIS

Classification using Weka (Brain, Computation, and Neural Learning)

Multivariate pattern classification

Neuroimage Processing

arxiv: v5 [cs.cv] 18 Oct 2018

Statistical Analysis of MRI Data

Keywords- Classification algorithm, Hypertensive, K Nearest Neighbor, Naive Bayesian, Data normalization

What is machine learning?

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

Detecting Changes In Non-Isotropic Images

Statistical Modeling of Neuroimaging Data: Targeting Activation, Task-Related Connectivity, and Prediction

Normalization for clinical data

Nested Sampling: Introduction and Implementation

Machine Learning / Jan 27, 2010

Statistics (STAT) Statistics (STAT) 1. Prerequisites: grade in C- or higher in STAT 1200 or STAT 1300 or STAT 1400

An Introduction To Automatic Tissue Classification Of Brain MRI. Colm Elliott Mar 2014

A Registration-Based Atlas Propagation Framework for Automatic Whole Heart Segmentation

Computational Medical Imaging Analysis Chapter 4: Image Visualization

SOCIAL MEDIA MINING. Data Mining Essentials

Graphical Models, Bayesian Method, Sampling, and Variational Inference

Unsupervised Learning

Measuring longitudinal brain changes in humans and small animal models. Christos Davatzikos

Estimating image bases for visual image reconstruction from human brain activity

Journal of Articles in Support of The Null Hypothesis

Mining di Dati Web. Lezione 3 - Clustering and Classification

Bayes Net Learning. EECS 474 Fall 2016

Whole Body MRI Intensity Standardization

Manifold Learning: Applications in Neuroimaging

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

Cocozza S., et al. : ALTERATIONS OF FUNCTIONAL CONNECTIVITY OF THE MOTOR CORTEX IN FABRY'S DISEASE: AN RS-FMRI STUDY

Evaluation of multiple voxel-based morphometry approaches and applications in the analysis of white matter changes in temporal lobe epilepsy

Understanding multivariate pattern analysis for neuroimaging applications

Clustering and Visualisation of Data

Monocular Human Motion Capture with a Mixture of Regressors. Ankur Agarwal and Bill Triggs GRAVIR-INRIA-CNRS, Grenoble, France

TA Section: Problem Set 4

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review

Graphical Models. David M. Blei Columbia University. September 17, 2014

Knowledge Discovery. Javier Béjar URL - Spring 2019 CS - MIA

Data Mining Classification: Bayesian Decision Theory

Pa#ern Recogni-on for Neuroimaging Toolbox

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

Summarizing Complexity in High Dimensional Spaces. Karl Young University of California, San Francisco SciPy 2008

RADIOMICS: potential role in the clinics and challenges

Digital Volume Correlation for Materials Characterization

A Model-Independent, Multi-Image Approach to MR Inhomogeneity Correction

Basic Introduction to Data Analysis. Block Design Demonstration. Robert Savoy

Transcription:

Data mining for neuroimaging data John Ashburner

MODELLING

The Scientific Process MacKay, David JC. Bayesian interpolation. Neural computation 4, no. 3 (1992): 415-447.

Model Selection Search for the best of a number of models: p(y M 0 ), p(y M 1 ), p(y M 2 ), p(y M 3 ), p(y M 4 ) Cross-validation is essentially hypothesis testing. Learn a hypothesis/model from the training data. Test it on the data that was left out. Other model selection strategies are also possible eg Bayesian Model Selection. The complexity of the best model depends on how much data is available.

Models of brain data Currently in neuroimaging, most models are for single voxels (ie mass-univariate). Lots of separate models Assumes independence among voxels. Simple interpretation of differences The alternative is to model all the data. Multivariate More difficult to interpret Simplifying principles may emerge from more complex models.

UNIVARIATE OR MULTIVARIATE?

Are biological structures multivariate? Eventual shape is a result of growth. Growth is a result of gene expression and other factors. Each gene may be expressed in more than one voxel. We have known for a long time that, eg, left leg length is correlated with right leg length. It is, however, far more necessary to bear in mind that there are many unknown laws of correlation of growth, which, when one part of the organisation is modified through variation, and the modifications are accumulated by natural selection for the good of the being, will cause other modifications, often of the most unexpected nature (C. Darwin, 1859).

Can male-female differences be localised?

SOME MULTIVARIATE METHODS

Fisher s Linear Discriminant

Generative Model for Discrimination Generative: P(t=1 x) = p(x t=1)p(t=1) p(x t=0)p(t=0) + p(x t=1)p(t=1) Where x feature data t prediction Discriminative: Directly learns to give P(t=1 x) We are not normally interested in all the variables needed to represent within-group variability. Only after a discriminative direction.

Linear Discrimination

Probabilistic Approaches

Regression

Gaussian Processes for Machine Learning Book by Rasmussen & Williams available for free online at: http://www.gaussianprocess.org/gpml MATLAB code for regression and classification is also available. Regression is relatively simple, but two approaches to classification are included. Laplace Approximation Expectation Propagation more accurate A Variational Bayes approach to GP classification is described in Bishop s PRML book.

TYPES OF FEATURES

Ugly Duckling Theorem An argument asserting that classification is impossible without some sort of bias. Watanabe, Satosi (1969). Knowing and Guessing: A Quantitative Study of Inference and Information. New York: Wiley. pp. 376 377.

How would the data be preprocessed to reveal useful features?

D Arcy Thompson

2D Example

Diffeomorphic Deformations

Jacobian Determinants (relative volumes)

Scalar Momenta (Jacobian scaled residuals from diffeomorphisms)

Reconstructed from Template & Residuals

Original Examples

Predicting Age

Predicting Sex

MINING HOSPITAL IMAGES

Basic Research Data Good quality images Well controlled experiments Typically one scan per subject Mostly healthy subjects No major pathologies

Hospital scans are a bit crappy

Movement Artifact Patients move in the scanner. Data is very hard to make use of. Disease signature of Parkinsons.

Few fully automated algorithms can make good use of hospital data. Most algorithms expect T1-weighted MRI with 1mm isotropic resolution. They do a poor job with hospital scans. More work is needed.

Same subject, different MRI contrasts, different image orientations. Poor through-plane resolution Good in-plane resolution

Recovering information Several approaches for superresolution. Recover higher resolution signal from several lowresolution images taken from different views. Work so far has assumed that all images are of the same modality. Less straightforward for multimodal data.

Generative Model for Resolution Recovery Noise precision Observed images Means Mixing proportions Probability of observations given the reconstructions Prior probability of MoG parameters Class labels Reconstructed images Precisions Probability of reconstructions given the MoG parameters

Simple 2D simulations with 8mm thick slices

Prototype of resolution recovery attempt.