Sampling-Based Ensemble Segmentation against Inter-operator Variability

Similar documents
ABSTRACT 1. INTRODUCTION 2. METHODS

Multilevel Segmentation and Integrated Bayesian Model Classification with an Application to Brain Tumor Segmentation

Detection and Segmentation of Pathological Structures by the Extended Graph-Shifts Algorithm

Acceleration of Probabilistic Tractography Using Multi-GPU Parallel Processing. Jungsoo Lee, Sun Mi Park, Dae-Shik Kim

Machine Learning for Medical Image Analysis. A. Criminisi

Comparison Study of Clinical 3D MRI Brain Segmentation Evaluation

Supplementary Data. in residuals voxel time-series exhibiting high variance, for example, large sinuses.

Detection and Segmentation of Pathological Structures by the Extended Graph-Shifts Algorithm

Multi-Atlas Segmentation of the Cardiac MR Right Ventricle

BHMSMAfMRI User Manual

Lesion Segmentation and Bias Correction in Breast Ultrasound B-mode Images Including Elastography Information

The framework of the BCLA and its applications

3D Variational Brain Tumor Segmentation using a High Dimensional Feature Set

A Study of Medical Image Analysis System

NIH Public Access Author Manuscript Proc Soc Photo Opt Instrum Eng. Author manuscript; available in PMC 2014 October 07.

3D Variational Brain Tumor Segmentation using a High Dimensional Feature Set

A Clustering-Based Method for. Brain Tumor Segmentation

Attribute Similarity and Mutual-Saliency Weighting for Registration and Label Fusion

Ensemble registration: Combining groupwise registration and segmentation

Automatic Optimization of Segmentation Algorithms Through Simultaneous Truth and Performance Level Estimation (STAPLE)

MIDAS Processing of Segmentation Data for Brain Lesions

Automatic Vascular Tree Formation Using the Mahalanobis Distance

Context-sensitive Classification Forests for Segmentation of Brain Tumor Tissues

Computer-Aided Diagnosis in Abdominal and Cardiac Radiology Using Neural Networks

Hierarchical Multi structure Segmentation Guided by Anatomical Correlations

AN EFFICIENT SKULL STRIPPING ALGORITHM USING CONNECTED REGIONS AND MORPHOLOGICAL OPERATION

Automatic Segmentation of Parotids from CT Scans Using Multiple Atlases

Global Thresholding Techniques to Classify Dead Cells in Diffusion Weighted Magnetic Resonant Images

Whole Body MRI Intensity Standardization

An ICA based Approach for Complex Color Scene Text Binarization

Automatic segmentation of the cortical grey and white matter in MRI using a Region Growing approach based on anatomical knowledge

Medical images, segmentation and analysis

Evaluation of 1D, 2D and 3D nodule size estimation by radiologists for spherical and non-spherical nodules through CT thoracic phantom imaging

Fuzzy-Based Extraction of Vascular Structures from Time-of-Flight MR Images

Ground Truth Estimation by Maximizing Topological Agreements in Electron Microscopy Data

Automated segmentation methods for liver analysis in oncology applications

Segmentation Using a Region Growing Thresholding

Comparison of Vessel Segmentations Using STAPLE

Learning to Identify Fuzzy Regions in Magnetic Resonance Images

Semi-Automatic Detection of Cervical Vertebrae in X-ray Images Using Generalized Hough Transform

OTCYMIST: Otsu-Canny Minimal Spanning Tree for Born-Digital Images

Color Lesion Boundary Detection Using Live Wire

MR IMAGE SEGMENTATION

CITS 4402 Computer Vision

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

Segmenting Glioma in Multi-Modal Images using a Generative-Discriminative Model for Brain Lesion Segmentation

Segmentation of 3-D medical image data sets with a combination of region based initial segmentation and active surfaces

Optimal MAP Parameters Estimation in STAPLE - Learning from Performance Parameters versus Image Similarity Information

Segmenting Lesions in Multiple Sclerosis Patients James Chen, Jason Su

Rare Event Detection Algorithm. User s Guide

Learning-based Neuroimage Registration

3D Surface Reconstruction of the Brain based on Level Set Method

Fully Automatic Model Creation for Object Localization utilizing the Generalized Hough Transform

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

2 Michael E. Leventon and Sarah F. F. Gibson a b c d Fig. 1. (a, b) Two MR scans of a person's knee. Both images have high resolution in-plane, but ha

Adaptive Local Multi-Atlas Segmentation: Application to Heart Segmentation in Chest CT Scans

Performance Evaluation of Basic Segmented Algorithms for Brain Tumor Detection

Medical Image Segmentation

Advances in Natural and Applied Sciences. Efficient Illumination Correction for Camera Captured Image Documents

Semi-automatic Segmentation of Vertebral Bodies in Volumetric MR Images Using a Statistical Shape+Pose Model

Estimating the ground truth from multiple individual segmentations with application to skin lesion segmentation

Measuring longitudinal brain changes in humans and small animal models. Christos Davatzikos

Adaptive Local Thresholding for Fluorescence Cell Micrographs

SEGMENTATION OF STROKE REGIONS FROM DWI AND ADC SEQUENCES USING A MODIFIED WATERSHED METHOD

A Ray-based Approach for Boundary Estimation of Fiber Bundles Derived from Diffusion Tensor Imaging

Comparison of Vessel Segmentations using STAPLE

Segmenting Glioma in Multi-Modal Images using a Generative Model for Brain Lesion Segmentation

Supervised Learning for Image Segmentation

Magnetic resonance image tissue classification using an automatic method

Validation of Image Segmentation and Expert Quality with an Expectation-Maximization Algorithm

Bagging for One-Class Learning

Knowledge-Based Organ Identification from CT Images. Masahara Kobashi and Linda Shapiro Best-Paper Prize in Pattern Recognition Vol. 28, No.

Correction of Partial Volume Effects in Arterial Spin Labeling MRI

Modified Expectation Maximization Method for Automatic Segmentation of MR Brain Images

Image Thresholding using Ultrafuzziness Optimization Based on Type II Fuzzy Sets

A Novel Criterion Function in Feature Evaluation. Application to the Classification of Corks.

CS 231A Computer Vision (Winter 2014) Problem Set 3

Hierarchical Shape Statistical Model for Segmentation of Lung Fields in Chest Radiographs

3D contour based local manual correction of tumor segmentations in CT scans

CHAPTER 3 RETINAL OPTIC DISC SEGMENTATION

ADAPTIVE GRAPH CUTS WITH TISSUE PRIORS FOR BRAIN MRI SEGMENTATION

Robust Ring Detection In Phase Correlation Surfaces

Copyright 2007 Society of Photo Optical Instrumentation Engineers. This paper was published in Proceedings of SPIE, volume 6514, Medical Imaging

Distributed Detection in Sensor Networks: Connectivity Graph and Small World Networks

Automatic Detection and Segmentation of Kidneys in Magnetic Resonance Images Using Image Processing Techniques

A new interface for manual segmentation of dermoscopic images

Prototype of Silver Corpus Merging Framework

Multi-Atlas Brain MRI Segmentation with Multiway Cut

Hybrid Spline-based Multimodal Registration using a Local Measure for Mutual Information

Comparative Study of ROI Extraction of Palmprint

maximum likelihood estimates. The performance of

SSRG International Journal of Computer Science and Engineering (SSRG-IJCSE) volume1 issue7 September 2014

Automatic Extraction of Tissue Form in Brain Image

Processing math: 100% Intensity Normalization

Contrast Enhancement with Dual Energy CT for the Assessment of Atherosclerosis

Rule-Based Ventral Cavity Multi-organ Automatic Segmentation in CT Scans

Norbert Schuff Professor of Radiology VA Medical Center and UCSF

Supplementary Online Content

arxiv: v1 [cs.cv] 6 Jun 2017

Transcription:

Sampling-Based Ensemble Segmentation against Inter-operator Variability Jing Huo 1, Kazunori Okada, Whitney Pope 1, Matthew Brown 1 1 Center for Computer vision and Imaging Biomarkers, Department of Radiological Sciences, David Geffen School of Medicine at UCLA Computer Science Department, San Francisco State University ABSTRACT Inconsistency and a lack of reproducibility are commonly associated with semi-automated segmentation methods. In this study, we developed an ensemble approach to improve reproducibility and applied it to glioblastoma multiforme (GBM) brain tumor segmentation on T1-weigted contrast enhanced MR volumes. The proposed approach combines samplingbased simulations and ensemble segmentation into a single framework; it generates a set of segmentations by perturbing user initialization and user-specified internal parameters, then fuses the set of segmentations into a single consensus result. Three combination algorithms were applied: majority voting, averaging and expectation-maximization (EM). The reproducibility of the proposed framework was evaluated by a controlled experiment on 16 tumor cases from a multicenter drug trial. The ensemble framework had significantly better reproducibility than the individual base Otsu thresholding method (p<.001). Keyword: GBM, ensemble 1. INTRODUCTION In clinical practice, reproducible and repeatable segmentation is an important prerequisite for longitudinal studies of medical images. Semi-automated segmentation methods are often preferred in common radiographic protocols because they allow expert clinicians to control the segmentation quality which plays a critical role in the final diagnostic decision. However, such semi-automated methods are also inconsistent when input is provided by different readers and/or used with different internal parameter values. This is a trade-off between usability and repeatability, posing a serious technical challenge. Our study focuses on segmenting glioblastoma multiforme (GBM) brain tumors in T1-weighted contrast enhanced MR volumes using the Otsu thresholding method. GBM tumor segmentation offers an ideal test case for our study because the contrast-enhancement heterogeneity of the tumors makes the current state-of-the-art methods highly irreproducible. The Otsu method is used in this study and has the advantages of efficiency, simplicity and usability, however it also suffers from poor reproducibility due to its need for user input in the setting of inhomogeneous tumors. Different user interaction will lead to different thresholds and thus inconsistent segmentation results. The algorithm parameter setup, in this case the number of thresholding levels, will also result inconsistent segmentation results. The main purpose of our study is to improve the reproducibility and stability of the semi-automated Otsu thresholding segmentation applied to GBM brain tumors. The final goal is to generate a robust and stable ensemble result given one single manually-drawn user input. The proposed framework combines sampling-based simulations and ensemble segmentation. It generates a set of segmentations by sampling the user interaction space and algorithm internal parameter space, then the set of inconsistent segmentations are ensembled into a single consensus result. The algorithmic sampling of user interaction space is designed to perturb the manual user interaction to simulate the typical inter-operator variability, thus the fusion of all possible segmentations is expected to be stable, reproducible and repeatable.. MATERIALS AND METHODS The brain volumes were pre-processed by skull-stripping using FSL tools 1. Users define a bounding cube as the volume of interest (VOI), and all the segmentation is done within the VOI. The user interaction is to draw a D bounding box that surrounds the target tumor on an arbitrarily chosen D slice. Medical Imaging 011: Computer-Aided Diagnosis, edited by Ronald M. Summers, Bram van Ginneken, Proc. of SPIE Vol. 7963, 796315 011 SPIE CCC code: 077-786X/11/$18 doi: 10.1117/1.878338 Proc. of SPIE Vol. 7963 796315-1

The base segmentation algorithm is Otsu thresholding method. The concept of Otsu s thresholding method is to find the σ threshold that minimizes the weighted within-class variation: w( = q1( σ1 ( + q( σ (, with the class t t t P( i) ip( i) q1 ( = P( i) σ1 ( = [ i μ1( ] μ1( = probability as i= 1, class variance as i= 1 q1( and class mean as i= 1 q1(. Given an initial μ i ( and σ i (, the algorithm will exhaustively search by altering the threshold value to find the optimal value. The approach can be generalized to multiple classes. All the GBM brain tumors were segmented on T1w post contrast images via the base semi-automated algorithm. First, radiologists draw a bounding box around the tumor, intensity values within the box are collected to form a histogram, and Otsu thresholding is applied, with the assumption that the number of classes within the bounding box is or 3. Afterwards, the 3D image volume is thresholded with the highest thresholding value followed by a connected component analysis, morphological opening and closing. The structure element used for morphological operations is circular with radius 1. The final goal is to generate a robust and stable ensemble result given one D manually-drawn user interaction box. There are two steps in the proposed ensemble framework: first, a set of inconsistent segmentations are generated by sampling user interactions and algorithm internal parameters; second, the set of inconsistent segmentations are fused into a single consensus result..1 Sampling user interactions The purpose of this step is to automatically sample a set of D bounding boxes on different slices that simulate typical inter-operator variability given one manual D bounding box drawn by the user manually on one single D slice. In order to maintain the usability, we wish the user to provide the minimal interaction with only a single D bounding box, from which we perform the sampling automatically. First, given the manually-drawn D input box, a 3D segmentation result is generated using the base method. And then, the 3D segmented tumor is uniformly sampled across all slices, yielding N sampled slices. On each of the N slices, one D bounding box of the tumor is generated. In the end, the set of N D bounding boxes is used as the user interaction variations. The pipeline is shown in Figure 1. Figure 1 Pipeline of automated sampling of user inputs to simulate inter-operator variability. Sampling Internal Algorithm Parameters The internal parameter, L, for the Otsu method is the number of intensity thresholding levels within a user input box. The difficulty of this parameter setup comes from the GBM tumor presentation. GBM brain tumors are composed of tumor cells, edema, and necrotic regions, which present different levels of enhancement, and may or may not be present within a single given tumor. Figure shows one tumor showing different intensity thresholding levels on two difference slices. It is not accurate to set a universal number of levels for all D user interaction boxes. Thus, we ran the base method with both L= and L=3 in this study. Proc. of SPIE Vol. 7963 796315-

Figure. Two slices of one tumor: (a) 3-object problem; (b) -object problem.3 Ensemble Segmentation The automatically sampled N user interaction boxes in Section.1 are used to perform independent 3D Otsu segmentation, and intensity levels (L= and L=3) are applied to each user interaction box. As a result, N inconsistent 3D segmentation results are generated accordingly. In this step, the N inconsistent segmentation results will be fused into a final ensemble segmentation result. We compared three ensemble methods: majority voting, averaging and STAPLE-EM algorithm 3, to fuse the N inconsistent segmentation results. Majority voting generates a binary segmentation. The averaging result is defined as N 1 PM( i, j, k) = Segn ( i, j, k) follows: N n = 1. For the details of STAPLE-EM, readers are referred to [3]. Both averaging and EM method generate a probability map (PM), which is thresholded into binary segmentations by 0.3 empirically. In a summary, the ensemble framework takes a single manual D box as the input, and generates one ensemble 3D segmentation as the output. The complete framework is shown in Figure 3. In this study, we set N=8. Figure 3 The complete ensemble segmentation pipeline 3. Experiments and Results We have 16 GBM tumor cases with T1-weighted post-contrast volumetric MR images (voxel size 0.9*0.9*1mm). The ground truth for the segmentation was manually contoured by a neuroradiologist. The reproducibility of the proposed framework was evaluated by a controlled experiment. For each tumor, the user manually drew eight D bounding boxes on 8 uniformly-sampled slices across the whole tumor volume. For each of the manually-drawn boxes, the base method and the ensemble framework were applied respectively for comparison. The overlap ratio of the 8 3D ensemble results and that of 8 base Otsu results were compared (Figure 4). All three ensemble methods showed significant better reproducibility than the base Otsu method with L=3 (p<.001) and L= (p<.001). Even though EM was visually slightly better than averaging and voting (Figure 5), there was no statistically significant difference with this dataset (p>0.05). Proc. of SPIE Vol. 7963 796315-3

The accuracy of the proposed framework was evaluated by calculating F-measure between the ground truth and the semi-automated methods. There were in total 18 boxes, with eight manually-drawn boxes for each of the 16 tumor cases; the accuracy of each box was shown in Figure 6. All three ensemble methods showed significant improvement over the Otsu method with L=3 (p<.001), but no difference from Otsu method with L= (p>.05). There was no statistically significant difference between the three ensemble methods in accuracy (p>.05) as shown in Figure 7. Figure 4. Reproducibility comparison between individual Otsu and averaging ensemble Figure 8 and Figure 9 showed an example comparing the segmentation results using different manually-drawn user interaction boxes between the base Otsu method and the ensemble framework. Different rows show different manuallydrawn user interaction boxes; while columns show different slices of the same tumor. Figure 8 showed that the base Otsu segmentations are inconsistent using different manually-drawn boxes; and Figure 9 showed that the ensemble segmentations are consistent. Figure 5 The reproducibility comparison between three ensemble methods: vote, averaging and EM Proc. of SPIE Vol. 7963 796315-4

Figure 6 F-measure of the base method and the averaging ensemble. Figure 7 F-measure of the three ensemble methods. 4. DISCUSSION AND CONCLUSION We developed an ensemble framework aiming to improve the reproducibility of the semi-automated segmentation, and applied the framework to GBM brain tumor segmentation on T1w post contrast images. First, we invented an automated sampling of user interactions that simulates typical inter-operator variability, in order to allow for minimal user interaction, as well as sampling internal algorithm parameters. Then, we generated the inconsistent segmentation set using the sampled user interaction boxes and ensemble them into a final segmentation. We evaluated the performance on a difficult task of the single-channel GBM brain tumor segmentation. In conclusion, the proposed automated user interaction sampling and ensemble framework tremendously improved the reproducibility compared to the base method on our dataset on GBM brain tumor segmentation. Reproducibility is crucial for semi-automated segmentation methods, the proposed framework shows great potential in improving the consistency and reproducibility. 5. REFERENCES [1] Smith, S.M., Jenkinson, M., Woolrich, M.W., Beckmann, C.F., Behrens, T.E., Johansen-Berg, H., Bannister, P.R., De Luca, M., Drobnjak, I., Flitney, D.E., Niazy, R.K., Saunders, J., Vickers, J., Zhang, Y., De Stefano, N., Brady, J.M. and Matthews, P.M., Advances in Functional and Structural MR Image Analysis and Implementation as FSL, NeuroImage, 3(S1), 08 19 (004). Proc. of SPIE Vol. 7963 796315-5

[] Otsu, N., A threshold selection method from gray level histograms, IEEE Trans. Syst. Man. Cybern. 9, 6 66 (1979). [3] Warfield, S.K., Zou, K.H. and Wells, W.M., Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation. IEEE Trans Med Imaging, 3(7), 903-91 (004). Figure 8 Segmentation results from the base Otsu method using different manually-drawn user interactions: rows are 6 different slices of the same tumor; columns are result from different manually-drawn user interaction boxes. Proc. of SPIE Vol. 7963 796315-6

Figure 9 Segmentation results from the ensemble framework using different manually-drawn user interactions: rows are 6 different slices of the same tumor; columns are result from different manually-drawn user interaction boxes. Proc. of SPIE Vol. 7963 796315-7