arxiv: v1 [cs.cv] 29 Oct 2016

Similar documents
Deep Learning with Tensorflow AlexNet

Weakly Supervised Fully Convolutional Network for PET Lesion Segmentation

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

RADIOMICS: potential role in the clinics and challenges

Available online Journal of Scientific and Engineering Research, 2019, 6(1): Research Article

REAL-TIME ADAPTIVITY IN HEAD-AND-NECK AND LUNG CANCER RADIOTHERAPY IN A GPU ENVIRONMENT

Content-Based Image Recovery

Perceptron: This is convolution!

arxiv: v1 [cs.cv] 20 Dec 2016

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Training Convolutional Neural Networks for Translational Invariance on SAR ATR

Tumor Detection and classification of Medical MRI UsingAdvance ROIPropANN Algorithm

GPU implementation for rapid iterative image reconstruction algorithm

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Simultaneous Multiple Surface Segmentation Using Deep Learning

Dynamic Routing Between Capsules

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

Real-time Object Detection CS 229 Course Project

Study of Residual Networks for Image Recognition

CSE 559A: Computer Vision

Kaggle Data Science Bowl 2017 Technical Report

Machine Learning for Medical Image Analysis. A. Criminisi

Improvement and Evaluation of a Time-of-Flight-based Patient Positioning System

Convolutional Neural Networks. Computer Vision Jia-Bin Huang, Virginia Tech

Medical images, segmentation and analysis

Norbert Schuff VA Medical Center and UCSF

CHAPTER 3 TUMOR DETECTION BASED ON NEURO-FUZZY TECHNIQUE

Fuzzy Set Theory in Computer Vision: Example 3, Part II

Available Online through

arxiv: v1 [cs.cv] 11 Apr 2018

Deep Learning for Computer Vision II

Skin Lesion Classification and Segmentation for Imbalanced Classes using Deep Learning

Structured Prediction using Convolutional Neural Networks

End-To-End Spam Classification With Neural Networks

Enhao Gong, PhD Candidate, Electrical Engineering, Stanford University Dr. John Pauly, Professor in Electrical Engineering, Stanford University Dr.

MICROARRAY IMAGE SEGMENTATION USING CLUSTERING METHODS

Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning

An MRI-based Attenuation Correction Method for Combined PET/MRI Applications

Global Journal of Engineering Science and Research Management

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

Projection and Reconstruction-Based Noise Filtering Methods in Cone Beam CT

3D model classification using convolutional neural network

arxiv: v1 [cs.cv] 30 Jul 2017

Deep Learning and Its Applications

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

A Survey on Image Segmentation Using Clustering Techniques

Ryerson University CP8208. Soft Computing and Machine Intelligence. Naive Road-Detection using CNNS. Authors: Sarah Asiri - Domenic Curro

Validation of GEANT4 for Accurate Modeling of 111 In SPECT Acquisition

Dynamic digital phantoms

8/3/2017. Contour Assessment for Quality Assurance and Data Mining. Objective. Outline. Tom Purdie, PhD, MCCPM

Improving Positron Emission Tomography Imaging with Machine Learning David Fan-Chung Hsu CS 229 Fall

Neural Network Approach for Automatic Landuse Classification of Satellite Images: One-Against-Rest and Multi-Class Classifiers

Part Localization by Exploiting Deep Convolutional Networks

Automated segmentation methods for liver analysis in oncology applications

Rotation Invariance Neural Network

Color Lesion Boundary Detection Using Live Wire

3D VISUALIZATION OF SEGMENTED CRUCIATE LIGAMENTS 1. INTRODUCTION

Prostate Brachytherapy Seed Segmentation Using Spoke Transform

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

LUNG NODULE DETECTION IN CT USING 3D CONVOLUTIONAL NEURAL NETWORKS. GE Global Research, Niskayuna, NY

Use of MRI in Radiotherapy: Technical Consideration

K-Means Segmentation of Alzheimer s Disease In Pet Scan Datasets An Implementation

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

ImageNet Classification with Deep Convolutional Neural Networks

DEEP LEARNING REVIEW. Yann LeCun, Yoshua Bengio & Geoffrey Hinton Nature Presented by Divya Chitimalla

MULTI-VIEW FEATURE FUSION NETWORK FOR VEHICLE RE- IDENTIFICATION

Deep Learning With Noise

arxiv: v2 [cs.cv] 30 Oct 2018

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK

Multi-Task Self-Supervised Visual Learning

Detection and Identification of Lung Tissue Pattern in Interstitial Lung Diseases using Convolutional Neural Network

DUAL energy X-ray radiography [1] can be used to separate

Bioimage Informatics

arxiv: v1 [cs.cv] 29 Oct 2017

Skull Segmentation of MR images based on texture features for attenuation correction in PET/MR

VEHICLE CLASSIFICATION And License Plate Recognition

Lung Tumor Segmentation via Fully Convolutional Neural Networks

AUTOMATED DETECTION AND CLASSIFICATION OF CANCER METASTASES IN WHOLE-SLIDE HISTOPATHOLOGY IMAGES USING DEEP LEARNING

SIIM 2017 Scientific Session Analytics & Deep Learning Part 2 Friday, June 2 8:00 am 9:30 am

Automatic Segmentation of Parotids from CT Scans Using Multiple Atlases

Dynalog data tool for IMRT plan verification

INTELLIGENT transportation systems have a significant

SEMANTIC COMPUTING. Lecture 8: Introduction to Deep Learning. TU Dresden, 7 December Dagmar Gromann International Center For Computational Logic

MEDICAL IMAGE ANALYSIS

Auto-Segmentation Using Deformable Image Registration. Disclosure. Objectives 8/4/2011

Engineering Problem and Goal

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 2.114

INTRODUCTION TO DEEP LEARNING

Tumor Lesion Segmentation from 3D PET using a Machine Learning driven Active Surface

A new approach to reference point location in fingerprint recognition

Using Machine Learning for Classification of Cancer Cells

Facial Expression Classification with Random Filters Feature Extraction

A Systematic Analysis System for CT Liver Image Classification and Image Segmentation by Local Entropy Method

CT Protocol Review: Practical Tips for the Imaging Physicist Physicist

COMPARATIVE DEEP LEARNING FOR CONTENT- BASED MEDICAL IMAGE RETRIEVAL

Transcription:

Machine learning methods for accurate delineation of tumors in PET images arxiv:1610.09493v1 [cs.cv] 29 Oct 2016 Jakub Czakon 1, Filip Drapejkowski 1, Grzegorz Żurek1, Piotr Giedziun 1, Jacek Żebrowski2, and Witold Dyrka 1 1 Stermedia Sp. z o. o., CancerCenter.eu team, ul. A. Ostrowskiego 13, Wroc law, Poland 2 Lower Silesian Oncology Center, Department of Nuclear Medicine - PET-CT Laboratory, pl. L. Hirszfelda 12, Wroc law, Poland Abstract In oncology, Positron Emission Tomography imaging is widely used in diagnostics of cancer metastases, in monitoring of progress in course of the cancer treatment, and in planning radiotherapeutic interventions. Accurate and reproducible delineation of the tumor in the Positron Emission Tomography scans remains a difficult task, despite being crucial for delivering appropriate radiation dose, minimizing adverse side-effects of the therapy, and reliable evaluation of treatment. In this piece of research we attempt to solve the problem of automated delineation of the tumor using 3d implementations of the spatial distance weighted fuzzy c-means, the deep convolutional neural network and a dictionary model. The methods, in diverse ways, combine intensity and spatial information. 1 Motivation Positron Emission Tomography (PET) is a nuclear imaging technique used to monitor metabolic activity of the body. The PET system detects gamma rays generated when electrons collide with positrons emitted by radiolabelled biologically active molecules, such as fluorodeoxyglucose. In oncology, PET is widely used in diagnostics of cancer metastases, in monitoring of progress in course of the cancer treatment, and also in planning radioterapheutic interventions. In the last case, PET scans, often in combination with Computer Tomography (CT) scans, are used to delineate biologically active parts of tumors, which are Corresponding author: witold.dyrka@stermedia.pl Website: CancerCenter.eu 1

then treated with ionizing radiation. Accurate delineation of the tumor is crucial for delivering appropriate radiation dose and minimizing adverse side-effects of the therapy. However, the problem is not trivial as evidenced by substantial intra- and interobserver variations when delineation is performed manually by experts[1]. Therefore it is not surprising that automated methods for accurate and reproducible delineation are sought. 2 Methods In this piece of research we evaluated 3d implementations of Spatial Distance Weighted Fuzzy C-Means (SDWFCM) of Guo et al. [2], Dictionary-based Model (DICT) of Dahl and Larser [3] and Convolutional Neural Network (CNN) [4], and compared them to classical approaches clustering approaches: K-Means (KM) [6] and Gaussian Mixture Models (GMM) [7]. In order to assure consistency of the input data, images detected to be sharper than an arbitrary threshold (after convolution with the 3d Laplacian kernel) were slightly blurred using the Gaussian filter. The step was omitted in the GMM and CNN case. In order to clean resulting labels from small artifacts, the labels were subject to subsequent morphological opening and closing, except the CNN method. We used sklearn implementations of KM and GMM methods. 2.1 Spatial Distance Weighted Fuzzy C-Means The method is a spatial information-aware extension of the Fuzzy C-Means clustering algorithm (FCM) [6]. Both methods assign to each point a coefficient of belonging to one of c clusters. Similarly to FCM, SDWFCM starts by calculating cluster centers and Euclidean distances based on regular features (such as image intensity). However, in the next step the distance matrix is modified with weight based on spatial distance of each pixel to the cluster center. Then, coefficients of belonging are calculated based on the modified distance matrix. The procedure repeats until convergence or maximum runtime. Finally the tumor label is assigned to f most intense clusters. Parameters for the SDWFCM algorithm (as well as for KM and GMM) were optimized for the training set (see Tabl. 1 and ref. [2]). 2.2 Dictionary-based Model The general idea of the method is to assign each patch of the test image, the label which corresponds to the most similar patch in the dictionary. The dictionary, which consists of a set of patch-label pairs is built in two steps. First, a sample of training patches is used to seed the dictionary. Based on the label similarity threshold, similar patches are clustered and averaged. Then, a version of the vector quantization technique is used to adjust the dictionary patches such that real labels of its matches are as close to binarized dictionary labels (also adjusted 2

in the training). Finally, the labeling algorithm assigns binary labels of the most similar dictionary patch. In our implementation, the labeling window walks pixel by pixel and each pixel gets the label of the best fitting patch or an average label, which is then binarized using the final labeling threshold th. DICT was set up with the following parameters: 3x3x3 patches were utilized, 10% of the training patches were used to seed the dictionary using similarity threshold 0.5; all training patches were used for adjusting the dictionary in 10 iterations with the step coefficient τ 0.05 (see ref. [3]). 2.3 Convolutional Neural Network Deep convolutional neural networks have proved to be one of the most efficient tools in image processing. For the tumor delineation problem we tested many CNN architectures consisting of up to dozen layers, including three or four convolutional layers. The input of the network was a batch of 2d or 3d patches from the PET tumor images. The network was trained using the AdaGrad stochastic gradient descent algorithm [5], which aimed at minimizing the Euclidean distance between the output of the final layer of the network and the ground truth. To account for a relatively small number of training samples, the set of available data was artificially augmented with rotationally transformed samples. The best architecture, selected in the 5-fold cross-validation process, was the 3D convolutional neural network with 3 convolutional layers followed by RELU activations with ascending number of filters (Fig. 1). Figure 1: CNN architecture. The network input are 3d patches (5x10x10 pixels) of the PET scan. The network output are 3d patches (the same dimensions as input) with predicted binary tumor segmentation. Neighbouring output patches are combined to give a binary image of the whole scan (not shown). Notations: CONV - convolutional layer, DROP - drop-out layer with ratio r, ACTIV fun - activation layer with function fun. 3

3 Results The training dataset provided by organizers of the PETSEG challenge consisted of 4 scans from 3 clinical samples, 9 scans from 3 phantoms and 6 simulated scans in 2 sets. We assessed efficiency of the methods on the training set in terms of the Dice coefficient. As the scores were calculated on the training set, they are rather upper estimates of the real performance of the methods. Table 1: Dice coefficients for training data. Notations: KM - K-Means, GMM - Gaussian Mixture Model, SDWFCM - Spatial Distance Weighted Fuzzy C-Means, DICT - Dictionary-based model, CNN - Convolutional Neural Network; k/n/c - number of clusters, f - number of selected most intense clusters, m - degree of fuzzy classification, λ - weight of spatial features, nb - number of neighbours. method parameters avg med. clin. phan. simul. balanced KM k=2, f=1 0.82 0.81 0.77 0.79 0.90 0.82 GMM n=4, f=1 0.83 0.83 0.77 0.80 0.90 0.82 SDWFCM c=2, f=1 m=2, λ=0.5 nb=1 0.82 0.82 0.76 0.81 0.88 0.81 DICT see Sec. 2.2 0.82 0.81 0.77 0.78 0.90 0.82 CNN see Sec. 2.3 0.86 0.89 0.78 0.86 0.91 0.85 The best results were obtained using CNN (Dice score mean: 0.86, median: 0.89). The remaining methods performed slightly worse than CNN and similarly to each other (Dice score mean: 0.82-0.83, median: 0.81-0.83). In general, simulated data proved to be the easiest (mean Dice score 0.88-0.91), while clinical data were the hardest (mean Dice score 0.76-0.78), regardless the method. A single most difficult case was clinical 1 for which none of the algorithms exceeded Dice score of 0.69. It is of note, that the Gaussian blurring had a positive impact on segmentation of this case. In fact clinical 1, clinical 3 and phantom 2.1 were the only cases where CNN did not reach Dice score of 0.80. Interestingly, CNN was always better than other methods for phantom data, with average margin over the second best result of almost 7%. Looking more into details, CNN and DICT achieved the highest sensitivity (0.89 and 0.85, respectively), while SDWFCM achieved the highest precision (0.85). In accordance with these results, CNN and DICT were more prone to overestimate the total volume of the tumor (on average by 10%), while SD- WFCM lined towards underestimated volumes (on average by 3%). All methods overestimated the volume of clinical 1 (by 30-49%), phantom 2 (9-58%) and phantom 3 (8-47%), while the volume of clinical 3 was generally underestimated (12-26%). Interestingly, while CNN was the method which overestimated the volume of clinical samples the most (22%), its volume predictions were closest to the ground truth for the phantoms (+9%). Accordingly, the contour mean 4

distance of CNN predictions was the worst for clinical samples (2.28 in comparison to 1.81 by DICT), and the best for phantom samples (0.72 in comparison to 1.30 by DICT). Overally, our early results suggest that deep convolutional neural networks can enable a step forward also in the PET image segmentation. This claim should be further validated with more testing, especially on clinical samples, for which performance gain using CNN was less impressive than for phantom data. Acknowledgements. This research has been conducted under the Support Programme of the Partnership between Higher Education and Science and Business Activity Sector financed by City of Wroclaw. The authors thank Jakub Szewczyk for his help with preparing the manuscript, and Piotr Krajewski for continuous support of the project. References [1] Njeh. Tumor delineation: The weakest link in the search for accuracy in radiotherapy. Journal of Medical Physics 33:136-140, 2008 [2] Guo, Liu, Wu, Hong and Zhang. A New Spatial Fuzzy C-Means for Spatial Clustering. WSEAS Transactions on Computers 14:369-381, 2015. [3] Dahl and Larsen. Learning Dictionaries of Discriminative Image Patches. In: Proc. British Machine Vision Conference. p.77, 2011. [4] Krizhevsky, Sutskever and Hinton. Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, 2012. [5] Duchi, Hazan and Singer. Adaptive subgradient methods for online learning and stochastic optimization. JMLR 12:2121 2159, 2011. [6] Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell, MA, USA. 1981. [7] Zeng, Huang, Kang and Sang. Image segmentation using spectral clustering of Gaussian mixture models. Neurocomputing 144:346 356, 2014. 5