Learning Algorithms for Medical Image Analysis. Matteo Santoro slipguru

Similar documents
arxiv: v1 [cs.lg] 16 Nov 2010

Sparsity Based Spectral Embedding: Application to Multi-Atlas Echocardiography Segmentation!

Methodological progress in image registration for ventilation estimation, segmentation propagation and multi-modal fusion

Bilevel Sparse Coding

Sparse Models in Image Understanding And Computer Vision

8/3/2017. Contour Assessment for Quality Assurance and Data Mining. Objective. Outline. Tom Purdie, PhD, MCCPM

Preprocessing II: Between Subjects John Ashburner

SUPER RESOLUTION RECONSTRUCTION OF CARDIAC MRI USING COUPLED DICTIONARY LEARNING

Efficient Iterative Semi-supervised Classification on Manifold

ELEG Compressive Sensing and Sparse Signal Representations

Institute of Cardiovascular Science, UCL Centre for Cardiovascular Imaging, London, United Kingdom, 2

SUPER-RESOLUTION RECONSTRUCTION ALGORITHM FOR BASED ON COUPLED DICTIONARY LEARNING CARDIAC MRI RECONSTRUCTION

Image Restoration and Background Separation Using Sparse Representation Framework

Collaborative Sparsity and Compressive MRI

Super-Resolution Rebuilding Of Cardiac MRI Using Coupled Dictionary Analyzing

PADDLE Documentation. Release Curzio Basso

The Pre-Image Problem in Kernel Methods

Novel Lossy Compression Algorithms with Stacked Autoencoders

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

PATCH BASED COUPLED DICTIONARY APPROACH FOR CARDIAC MRI IMAGES USING SR RECONSTRUCTION ALGORITHM

Optimized Super Resolution Reconstruction Framework for Cardiac MRI Images Perception

Supplementary Material Estimating Correspondences of Deformable Objects In-the-wild

A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images

Machine Learning: Think Big and Parallel

arxiv: v1 [cond-mat.dis-nn] 30 Dec 2018

Combine the PA Algorithm with a Proximal Classifier

CSE 573: Artificial Intelligence Autumn 2010

Inter and Intra-Modal Deformable Registration:

Akarsh Pokkunuru EECS Department Contractive Auto-Encoders: Explicit Invariance During Feature Extraction

Dictionary Learning. Intensity Normalization For Generating Synthetic Tissue Contrast In Brain MRI Images. Y.Luktuke. Technische Universiteit Delft

Enhao Gong, PhD Candidate, Electrical Engineering, Stanford University Dr. John Pauly, Professor in Electrical Engineering, Stanford University Dr.

Medical Image Synthesis Methods and Applications

Computational Neuroanatomy

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun

Biomedical Image Analysis Using Markov Random Fields & Efficient Linear Programing

Deep Similarity Learning for Multimodal Medical Images

Learning with infinitely many features

Nonrigid Registration using Free-Form Deformations

Logistic Regression: Probabilistic Interpretation

Tri-modal Human Body Segmentation

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

Using Machine Learning to Optimize Storage Systems

Automatic Generation of Training Data for Brain Tissue Classification from MRI

Dense 3D Modelling and Monocular Reconstruction of Deformable Objects

Towards an Estimation of Acoustic Impedance from Multiple Ultrasound Images

Adaptive Dictionary Learning For Competitive Classification Of Multiple Sclerosis Lesions

Learning Kernels with Random Features

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Learning Feature Hierarchies for Object Recognition

COMPRESSIVE VIDEO SAMPLING

Translation Symmetry Detection: A Repetitive Pattern Analysis Approach

Complex Prediction Problems

Methods for data preprocessing

Iterative CT Reconstruction Using Curvelet-Based Regularization

Learning Convolutional Feature Hierarchies for Visual Recognition

DEEP LEARNING OF COMPRESSED SENSING OPERATORS WITH STRUCTURAL SIMILARITY (SSIM) LOSS

Auto-contouring the Prostate for Online Adaptive Radiotherapy

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Super-resolution Reconstruction of Fetal Brain MRI

Machine Learning Classifiers and Boosting

Effectiveness of Sparse Features: An Application of Sparse PCA

Compressive Sensing MRI with Wavelet Tree Sparsity

A Novel Iterative Thresholding Algorithm for Compressed Sensing Reconstruction of Quantitative MRI Parameters from Insufficient Data

High dynamic range magnetic resonance flow imaging in the abdomen

A supervised strategy for deep kernel machine

Neural Networks and Deep Learning

Medical images, segmentation and analysis

Blind Compressed Sensing Using Sparsifying Transforms

Total Variation Denoising with Overlapping Group Sparsity

A Duality Based Algorithm for TV-L 1 -Optical-Flow Image Registration

arxiv: v1 [cs.cv] 17 Nov 2016

Nonrigid Registration Using a Rigidity Constraint

(University Improving of Montreal) Generative Adversarial Networks with Denoising Feature Matching / 17

12 Classification using Support Vector Machines

Deep Learning for Generic Object Recognition

Stanford University. A Distributed Solver for Kernalized SVM

G Practical Magnetic Resonance Imaging II Sackler Institute of Biomedical Sciences New York University School of Medicine. Compressed Sensing

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks

Learning a Manifold as an Atlas Supplementary Material

Prototype of Silver Corpus Merging Framework

Segmentation of 3-D medical image data sets with a combination of region based initial segmentation and active surfaces

Classification: Feature Vectors

Interactive Differential Segmentation of the Prostate using Graph-Cuts with a Feature Detector-based Boundary Term

Feature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.

Discriminative classifiers for image recognition

STIC AmSud Project. Graph cut based segmentation of cardiac ventricles in MRI: a shape-prior based approach

Efficient Algorithms may not be those we think

Class 6 Large-Scale Image Classification

Image segmentation with a statistical shape prior

Tomographic Image Reconstruction in Noisy and Limited Data Settings.

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

Comparison Study of Clinical 3D MRI Brain Segmentation Evaluation

IMAGE DENOISING USING NL-MEANS VIA SMOOTH PATCH ORDERING

One-class Problems and Outlier Detection. 陶卿 中国科学院自动化研究所

Opinion Mining by Transformation-Based Domain Adaptation

Variable Selection 6.783, Biomedical Decision Support

Kernels and representation

Announcements. CS 188: Artificial Intelligence Spring Classification: Feature Vectors. Classification: Weights. Learning: Binary Perceptron

A fast algorithm for sparse reconstruction based on shrinkage, subspace optimization and continuation [Wen,Yin,Goldfarb,Zhang 2009]

An Empirical Evaluation of Deep Architectures on Problems with Many Factors of Variation

Transcription:

Learning Algorithms for Medical Image Analysis Matteo Santoro slipguru santoro@disi.unige.it June 8, 2010

Outline 1. learning-based strategies for quantitative image analysis 2. automatic annotation of MR images: the example of synovitis assessment 3. data-driven image representation: sparse coding and dictionary learning 4. experiments and results

Our Research in Medical Image Analysis Definition of new imaging biomarkers and design of algorithms for their assessment. A biomarker is a characteristic that is objectively measured and evaluated as an indicator of normal biological processes, pathogenic processes, or pharmacologic responses to a therapeutic intervention An imaging biomarker is, by extension, a biomarker measured from images.

Our Research in Medical Image Analysis Definition of new imaging biomarkers and design of algorithms for their assessment. 1. Annotation and classification of image parts (i.e. single voxels or 2D/3D patches.) 2. Non-rigid image registration, with specific interest for novel approaches to deal with discontinuous deformation fields.

Our Research in Medical Image Analysis Definition of new imaging biomarkers and design of algorithms for their assessment. 1. Annotation and classification of image parts (i.e. single voxels or 2D/3D patches.) 2. Non-rigid image registration, with specific interest for novel approaches to deal with discontinuous deformation fields. Why learning from examples? It is difficult to translate into an explicit algorithm the process of physicians diagnosis making. To ask the physicians to annotate images as positive or negative examples is a viable alternative that may lead to implicit learning-based algorithms.

The Challenges We are Currently Facing quantitative analysis = how to design efficient representation schemes to make the analysis more accurate? large collections of 3D images = how to design efficient and reliable learning algorithm for large scale problems? weakly annotated data = how to combine supervised and unsupervised learning approaches? anatomical constraints = how to exploit prior knowledge about known properties of tissue and organs?

Automatic Assessment of Synovial Volume Setting Patients: children under 16, affected by Juvenile Idiopathic Arthritis. Goal: to measure the volume of the inflamed synovia, and investigate its use as a viable biomarker. Data: 3D MR images acquired before and after the injection of a contrast medium. Context: http://health-e-child.org

Automatic Assessment of Synovial Volume The naïve approach Given n voxels x i R d and the corresponding labels y i {+1, 1}, the optimal solution may be computed as: f (x) = [K(x, x 1 ),..., K(x, x n )] [ ] (K + nτi) 1 y

Automatic Assessment of Synovial Volume Problems nonlinear methods may have storage and performance problems when n becomes large; data representation is often heterogeneous (e.g. measurements coming from different modalities): x φ(x) = {ϕ 1,..., ϕ k }

Multi-cue Voxel Classifier We look for a more flexible discriminant function: f (φ) = (i,j) I α j i K j i (φ) + b.

Multi-cue Voxel Classifier We look for a more flexible discriminant function: Assumptions The k n basis functions f (φ) = (i,j) I K j i (φ) = exp { α j i Kj i (φ) + b. ϕj ϕ j i 2 2σ 2 j measure the similarity between φ and one of the exemplar voxels with respect to a specific cue. },

Multi-cue Voxel Classifier We look for a more flexible discriminant function: f (φ) = (i,j) I α j i K j i (φ) + b. Model Selection The optimal subset I of basis functions, on which f depends, may be inferred directly from the data by means of a suitable feature selection algorithm.

Multi-cue Voxel Classifier We look for a more flexible discriminant function: f (φ) = (i,j) I α j i K j i (φ) + b. Model Selection The optimal subset I of basis functions, on which f depends, may be inferred directly from the data by means of a suitable feature selection algorithm. Learning Algorithm The goal of learning is to to find the optimal affine combination defined by the coefficients α j i and b.

Multi-cue Voxel Classifier We look for a more flexible discriminant function: Results f (φ) = (i,j) I α j i K j i (φ) + b.

Multi-cue Voxel Classifier We look for a more flexible discriminant function: Results f (φ) = (i,j) I α j i K j i (φ) + b. multi-cue classifier is 15+ times sparser than SVM, and approximately 40 times faster than SVM.

Multi-cue Voxel Classifier We look for a more flexible discriminant function: Results f (φ) = (i,j) I α j i K j i (φ) + b. excellent accuracy and agreement with the manual measurements precision close to the one achieved by manual annotation positive preliminary results with both longitudinal and cross-sectional clinical studies.

Data-driven Image Representation There is empirical evidence that adaptive representation schemes may provide effective models.

Data-driven Image Representation There is empirical evidence that adaptive representation schemes may provide effective models. 1. vector quantisation: 2. convolution: u i = j such that x i De j x i De k k j 3. sparse coding: u i = D T x i min u i { xi Du i 2 + λ u i 1 }

Sparse Coding A signal may be conveniently represented as the superposition of elementary signals, or atoms. Over-complete dictionaries (or frames) and sparse coding offer more flexibility and are supported by successful applications. Tight frames ensure that the optimal representation can be recovered by means of inner products of the signal and the dictionary. Learning the dictionary from data We investigated the possibility of learning directly from data a dictionary endowed with properties similar to that of tight frames.

Dictionary Learning Setting X = [x 1,..., x N ] R d N is the input matrix. U = [u 1,..., u N ] R K N is the coding matrix. The goal is to learn: 1. D = [d 1,..., d K ] R d K (the decoding or synthesis operator), whose columns are the atoms of the dictionary, and 2. C = [c 1,..., c K ] T R K d (the encoding or analysis operator), whose rows are filters that can be convolved with an input signal to obtain its encoding.

Dictionary Learning Objective We aim at learning both D and its dual C by minimising: E(D, C, U) = 1 d X DU 2 F + η K U CX 2 F + 2τ K s.t. d i 2, c i 2 1 N u i 1, i=1 where τ > 0 is a regularisation parameter inducing sparsity in U, while η 0 weights the coding error with respect to the reconstruction error.

Dictionary Learning The Proposed Algorithm: PADDLE The functional E is separately convex in each variable, and we adopt a block coordinate descent strategy: 1. sparse coding step: minimise first with respect to the encoding variables U, and then 2. dictionary update step: minimise with respect to the dictionary D and its dual C. Each step is based on proximal methods. The algorithm has been proved empirically successful, and its convergence towards a critical point of E may be proved.

Experimental Assessment Synthetic Data Data have been generated as random superpositions of a small number of elements of a tight frame. The reconstruction error has reached immediately the minimum achievable with the true generating frame. From top to bottom, we show the original dictionary, the recovered dictionary D and recovered dual C T.

Experimental Assessment Benchmark Datasets Berkeley segmentation dataset we have extracted a random sample of 10 5 patches of size 12 12 from the natural images contained in the dataset the reconstruction error achieved at the various level of sparsity have been constantly lower than the reconstruction error achievable with a comparable number of principal components

Experimental Assessment Benchmark Datasets Berkeley segmentation dataset

Experimental Assessment Benchmark Datasets MNIST dataset we have tested the algorithm on the 50, 000 training images consisting of 28 28 quasi binary images of handwritten digits we have trained the dictionary with 200 atoms Initialisation After 1 iteration After 4 iterations At convergence

Experimental Assessment Object Class Identification Caltech101 Dataset we have investigated the discriminative power of the dictionaries D and C when used to represent the visual content of an image

Experimental Assessment Object Class Identification Caltech101 Dataset we have investigated the discriminative power of the dictionaries D and C when used to represent the visual content of an image the performance obtained by using C is essentially the same as the one obtained with D. Coding based on D Coding based on C State-of-the-art results Mean accuracy 0.987 0.984 0.985 Standard deviation 0.008 0.008 0.008

Experimental Assessment Object Class Identification Caltech101 Dataset we have investigated the discriminative power of the dictionaries D and C when used to represent the visual content of an image the performance obtained by using C is essentially the same as the one obtained with D. Coding based on D Coding based on C State-of-the-art results Mean accuracy 0.987 0.984 0.985 Standard deviation 0.008 0.008 0.008 the coding of new input images requires only a matrix-vector multiplication.

Experimental Assessment Image Denoising

Experimental Assessment Image Denoising

References C. Basso, M. Santoro, C. Malattia, M.B. Damasio, G. Chiusano, P. Tomà, A. Martini and A. Verri. Quantitative Synovitis Assessment via Automatic 3D MRI Annotation. Submitted to IEEE Trans Med Imag. C. Basso, M. Santoro, M. B. Damasio, C. Malattia, A. Verri, P. Tomà, and A. Martini. Automatic estimation of inflamed synovial membrane volume in 3D MR images. Poster at ECR 2010 Scientific Exhibit, 4-8 March 2010, Vienna, Austria. C. Basso, M. Ferrante, M. Santoro, and A. Verri. Automatic Annotation of 3D Multi-Modal MR Images on a Desktop Grid. MICCAI-Grid Workshop, held at MICCAI 2009, 20-24 September 2009, London, UK. C. Basso, M. Santoro, A. Verri and M. Esposito. Segmentation of Inflamed Synovia in Multi-Modal MRI. In Proc. of IEEE ISBI 2009, June 28 - July 1 2009. C. Basso, M. Santoro, S. Villa and A. Verri. PADDLE: Proximal Algorithm for Dual Dictionaries LEarning. Submitted to NIPS 2010.