arxiv: v1 [cs.cv] 14 Dec 2018

Similar documents
3D Densely Convolutional Networks for Volumetric Segmentation. Toan Duc Bui, Jitae Shin, and Taesup Moon

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

Boundary-aware Fully Convolutional Network for Brain Tumor Segmentation

RSRN: Rich Side-output Residual Network for Medial Axis Detection

Object Detection. CS698N Final Project Presentation AKSHAT AGARWAL SIDDHARTH TANWAR

Lecture 7: Semantic Segmentation

arxiv: v1 [cs.cv] 11 Apr 2018

Isointense infant brain MRI segmentation with a dilated convolutional neural network Moeskops, P.; Pluim, J.P.W.

Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network

Automatic Thoracic CT Image Segmentation using Deep Convolutional Neural Networks. Xiao Han, Ph.D.

Channel Locality Block: A Variant of Squeeze-and-Excitation

A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS. Kuan-Chuan Peng and Tsuhan Chen

Efficient Segmentation-Aided Text Detection For Intelligent Robots

Mask R-CNN. presented by Jiageng Zhang, Jingyao Zhan, Yunhan Ma

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution and Fully Connected CRFs

Kaggle Data Science Bowl 2017 Technical Report

Xiaowei Hu* Lei Zhu* Chi-Wing Fu Jing Qin Pheng-Ann Heng

Deep learning for dense per-pixel prediction. Chunhua Shen The University of Adelaide, Australia

MRI Tumor Segmentation with Densely Connected 3D CNN. Lele Chen, Yue Wu, Adora M. DSouze, Anas Z. Abidin, Axel Wismüller, and Chenliang Xu

arxiv: v1 [stat.ml] 19 Jul 2018

COMPARISON OF OBJECTIVE FUNCTIONS IN CNN-BASED PROSTATE MAGNETIC RESONANCE IMAGE SEGMENTATION

Detecting Bone Lesions in Multiple Myeloma Patients using Transfer Learning

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Multi-View 3D Object Detection Network for Autonomous Driving

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

CAP 6412 Advanced Computer Vision

SSD: Single Shot MultiBox Detector. Author: Wei Liu et al. Presenter: Siyu Jiang

8/3/2017. Contour Assessment for Quality Assurance and Data Mining. Objective. Outline. Tom Purdie, PhD, MCCPM

arxiv: v1 [cs.cv] 16 Nov 2015

arxiv: v2 [cs.cv] 28 Jan 2019

arxiv: v1 [stat.ml] 3 Aug 2017

SEMANTIC SEGMENTATION AVIRAM BAR HAIM & IRIS TAL

arxiv: v1 [cs.cv] 31 Mar 2016

Deep Residual Architecture for Skin Lesion Segmentation

Regionlet Object Detector with Hand-crafted and CNN Feature

IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 26, NO. 10, OCTOBER

Deep Learning in Pulmonary Image Analysis with Incomplete Training Samples

Content-Based Image Recovery

Multi-channel Deep Transfer Learning for Nuclei Segmentation in Glioblastoma Cell Tissue Images

arxiv: v1 [cs.cv] 17 May 2017

arxiv: v1 [cs.cv] 20 Dec 2016

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK. Wenjie Guan, YueXian Zou*, Xiaoqun Zhou

Rich feature hierarchies for accurate object detection and semantic segmentation

Auto-contouring the Prostate for Online Adaptive Radiotherapy

In-Place Activated BatchNorm for Memory- Optimized Training of DNNs

Multi-Label Whole Heart Segmentation Using CNNs and Anatomical Label Configurations

Deep Learning in Visual Recognition. Thanks Da Zhang for the slides

Detecting Anatomical Landmarks from Limited Medical Imaging Data using Two-Stage Task-Oriented Deep Neural Networks

Deformable Segmentation using Sparse Shape Representation. Shaoting Zhang

Ensemble registration: Combining groupwise registration and segmentation

arxiv: v2 [cs.cv] 19 Nov 2018

Places Challenge 2017

YOLO9000: Better, Faster, Stronger

Advanced Video Analysis & Imaging

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

Computer Vision Lecture 16

ECE 6554:Advanced Computer Vision Pose Estimation

Temporal HeartNet: Towards Human-Level Automatic Analysis of Fetal Cardiac Screening Video

Volumetric and Multi-View CNNs for Object Classification on 3D Data Supplementary Material

A Deep Learning Framework for Authorship Classification of Paintings

Deep Similarity Learning for Multimodal Medical Images

Inception and Residual Networks. Hantao Zhang. Deep Learning with Python.

arxiv: v2 [cs.cv] 14 May 2018

Automatic detection of books based on Faster R-CNN

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks

LSTM and its variants for visual recognition. Xiaodan Liang Sun Yat-sen University

LARGE-SCALE PERSON RE-IDENTIFICATION AS RETRIEVAL

JOINT DETECTION AND SEGMENTATION WITH DEEP HIERARCHICAL NETWORKS. Zhao Chen Machine Learning Intern, NVIDIA

LUNG NODULE DETECTION IN CT USING 3D CONVOLUTIONAL NEURAL NETWORKS. GE Global Research, Niskayuna, NY

Automatic Lymphocyte Detection in H&E Images with Deep Neural Networks

CEA LIST s participation to the Scalable Concept Image Annotation task of ImageCLEF 2015

TEXT SEGMENTATION ON PHOTOREALISTIC IMAGES

arxiv: v2 [cs.cv] 30 Sep 2018

arxiv: v1 [cs.cv] 20 Apr 2017

Intraoperative Prostate Tracking with Slice-to-Volume Registration in MR

A Registration-Based Atlas Propagation Framework for Automatic Whole Heart Segmentation

ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems (Supplementary Materials)

Graph Cuts Based Left Atrium Segmentation Refinement and Right Middle Pulmonary Vein Extraction in C-Arm CT

Prototype of Silver Corpus Merging Framework

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Proceedings of the International MultiConference of Engineers and Computer Scientists 2018 Vol I IMECS 2018, March 14-16, 2018, Hong Kong

An efficient face recognition algorithm based on multi-kernel regularization learning

Recurrent Transformer Networks for Semantic Correspondence

Yiqi Yan. May 10, 2017

MULTI-LEVEL 3D CONVOLUTIONAL NEURAL NETWORK FOR OBJECT RECOGNITION SAMBIT GHADAI XIAN LEE ADITYA BALU SOUMIK SARKAR ADARSH KRISHNAMURTHY

H-DenseUNet: Hybrid Densely Connected UNet for Liver and Tumor Segmentation from CT Volumes

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK

LSTM: An Image Classification Model Based on Fashion-MNIST Dataset

Sparsity Based Spectral Embedding: Application to Multi-Atlas Echocardiography Segmentation!

Fully Automatic Multi-organ Segmentation based on Multi-boost Learning and Statistical Shape Model Search

Semantic Context Forests for Learning- Based Knee Cartilage Segmentation in 3D MR Images

Image Classification Using Wavelet Coefficients in Low-pass Bands

U-SEGNET: FULLY CONVOLUTIONAL NEURAL NETWORK BASED AUTOMATED BRAIN TISSUE SEGMENTATION TOOL. Pulkit Kumar Pravin Nagar Chetan Arora Anubha Gupta

Enhao Gong, PhD Candidate, Electrical Engineering, Stanford University Dr. John Pauly, Professor in Electrical Engineering, Stanford University Dr.

Weakly Supervised Fully Convolutional Network for PET Lesion Segmentation

Detecting and Parsing of Visual Objects: Humans and Animals. Alan Yuille (UCLA)

arxiv: v1 [cs.cv] 11 Apr 2018

Learning to generate 3D shapes

Transcription:

Pyramid Network with Online Hard Example Mining for Accurate Left Atrium Segmentation Cheng Bian 1, Xin Yang 2,3, Jianqiang Ma 1, Shen Zheng 1, Yu-An Liu 1, Reza Nezafat 3, Pheng-Ann Heng 2, and Yefeng Zheng 1 arxiv:1812.05802v1 [cs.cv] 14 Dec 2018 1 Tencent YouTu Lab 2 Dept. of Computer Science and Engineering, The Chinese University of Hong Kong xinyang@cse.cuhk.edu.hk 3 Department of Medicine (Cardiovascular Division), Beth Israel Deaconess Medical Center and Harvard Medical School Abstract. Accurately segmenting left atrium in MR volume can benefit the ablation procedure of atrial fibrillation. Traditional automated solutions often fail in relieving experts from the labor-intensive manual labeling. In this paper, we propose a deep neural network based solution for automated left atrium segmentation in gadolinium-enhanced MR volumes with promising performance. We firstly argue that, for this volumetric segmentation task, networks in 2D fashion can present great superiorities in time efficiency and segmentation accuracy than networks with 3D fashion. Considering the highly varying shape of atrium and the branchy structure of associated pulmonary veins, we propose to adopt a pyramid module to collect semantic cues in feature maps from multiple scales for fine-grained segmentation. Also, to promote our network in classifying the hard examples, we propose an Online Hard Negative Example Mining strategy to identify voxels in slices with low classification certainties and penalize the wrong predictions on them. Finally, we devise a competitive training scheme to further boost the generalization ability of networks. Extensively verified on 20 testing volumes, our proposed framework achieves an average Dice of 92.83% in segmenting the left atria and pulmonary veins. 1 Introduction Atrial fibrillation is the most common cardiac electrical disorder, which happens around left atrium and becomes a major cause of stroke [9]. Ablation therapy guided by MR imaging is a popular treatment solution for atrial fibrillation. Segmenting left atrium in MR scan can benefit the preoperative assessment, determine the catheter size in pulmonary veins, optimize the therapy plan and reduce fluoroscopy time. However, confined by low efficiency and reproducibility, manually segmenting the left atrium tends to be intractable [6]. *The work was partly finished when Xin Yang was an internship in the Department of Medicine (Cardiovascular Division), Beth Israel Deaconess Medical Center and Harvard Medical School.

Fig. 1. Challenges in segmenting the left atrium and associated pulmonary veins. Green curve and surface rendering denote segmentation ground truth. As illustrated in Fig. 1, except the poor image quality, automatically segmenting left atrium with pulmonary veins in MR volume is very challenging. Firstly, left atrium only occupies a small ratio when compared with the background, which makes it difficult for algorithm to localize and recognize the boundary details. Secondly, due to the similar intensity with surrounding chambers and the thin myocardial wall with respect to the limited MR resolution, boundary ambiguity and deficiency often occur around the atrium and pulmonary veins. Thirdly, the shape and size of left atrium vary greatly across different subjects and timepoints. Also, the topological arrangement of pulmonary veins, especially the number of veins, presents weak pattern in population [12]. Automated left atrium segmentation retains intensive research interest. Deformable model [16], atlas based label fusion [11] and non-rigid registration [17] are typical methods for atrium included cardiovascular structure segmentation. However, these methods depend heavily on the hand-crafted statistical modeling and lack generalization ability against unseen cases, such as the left atrium with rare number of pulmonary veins. In the deep learning era, this segmentation task is embracing new opportunities. Stacked sparse auto-encoders [13], Fully Convolutional Network [7] and Convolutional LSTM [3] were deployed in a 2D fashion for left atrium segmentation. On the other hand, networks in 3D fashion have also been explored [14]. The superiority between 2D and 3D fashion is arguable and case-by-case [2]. Although networks in both 2D and 3D achieved remarkable performance, preserving segmentation details in varying scales and tackling classification ambiguity around boundary are overlooked in previous work. In this paper, we propose a deep network based solution for automated left atrium segmentation in gadolinium-enhanced MR (GE-MR) volumes with promising performance. With investigations on our task, we argue that, networks with 2D architectures can easily outperform their heavy 3D versions with respect to time efficiency and segmentation accuracy. Left atrium body possesses a relative compact structure, while the associated pulmonary veins present to

be branchy in varying scales. As this regard, we propose to further tailor our 2D network with pyramid pooling modules to enlarge receptive field ranges and perceive semantic cues from multiple scales. This pyramid module greatly promotes the segmentation in fine scales. To avoid the learning of non-informative negative samples and enhance the learning of hard negative examples around boundary, we propose an Online Hard Negative Example Mining strategy to identify voxels in slices with low classification certainties and penalize the wrong predictions on them. Finally, we devise a competitive training scheme in which several competitors need to compete with each other. With the scheme, the generalization ability of all competitors can be further improved. Extensively verified on 20 testing volumes, our proposed framework achieves an average Dice of 92.83% in segmenting the left atria and pulmonary veins. Fig. 2. Schematic view of our proposed framework. 2 Methodology Fig. 2 is the schematic view of our proposed framework. Without extra localization modules, our system takes the whole 2D slice as input, hierarchical features are firstly learned and extracted by consecutive residual blocks. Residual blocks are supervised by an auxiliary branch. Then, the adopted pyramid pooling module distills semantic cues from multiple scales. These semantic information are then merged to infer the final predictions. At the end of the network, we introduce the Online Hard Negative Example Mining loss to guide the network selectively search samples in order to learn more efficiently and effectively. 2.1 Choice between 2D and 3D Architecture For volumetric segmentation tasks, 2D and 3D architectures are two main streams and have their different strengths [2,5]. We investigate several state-of-the-art 2D

and 3D architectures for the left atrium segmentation and choose the 2D fashion for following reasons. Firstly, operating in 2D endows our method with super time efficiency, which is about 4 times faster than the 3D version. Secondly, under limited GPU memory, 2D network can get more spaces to expand and deepen its layout which is crucial for the learning capacity of network. Thirdly, 2D network can get large enough and diverse receptive fields to aggregate the in-plane, multiscale context cues for more detailed segmentation. 2.2 Pyramid Network Design Directly applying vanilla 2D networks to segment the left atrium performs poor. The atrium is very small when compared with the background, strong context information, such as the lung and other heart chambers, are needed to localize it. This requires the network have large receptive field and ability to extract proper feature hierarchy. Also, the deformation of atrium and branchy structure of pulmonary veins demand the network can recognize boundaries from macro to nano. Therefore, as shown in Fig. 2, we propose to adopt the PSPNet network [15] as a workhorse to fit our task. Specifically, for the forehead part, we deploy the ResNet101 pre-trained on ImageNet to extract features. To enhance the network with large receptive fields under parameter limitations, we also utilize the dilated convolution in the ResBlock. For the pyramid pooling module, based on the intermediate feature maps generated by ResBlocks, we use 4 pooling layers to aggregate context information from global to fine scales. The bin size of these four levels are set as 1 1, 2 2, 3 3 and 6 6, respectively. We choose the Atrous Spatial Pyramid Pooling as the pooling kernels. The feature maps produced by the pyramid pooling module are then upsampled and concatenated with the original feature maps from ResBlock. A last convolution layer is applied on the concatenated feature maps to output the final predictions. Noted that, we apply the dropout on this last convolution layer as an ensemble strategy to improve the generalization ability of our network. Dropout ratio is set as 0.1. Same with [15], we inject an auxiliary supervision branch at the last ResBlock to tackle the potential training difficulties faced by deep neural networks. As a way to tackle the class imbalance, we use the weighted cross entropy as the basic form of our objective function. 2.3 Online Hard Negative Example Mining For network training, the definition of objective function directly defines the latent mapping that network needs to fit. The quality of samples counted to minimize the objective function can determine how close the network can achieve its goal. In this work, we find that there is an obvious imbalance between the amount of foreground and background samples within slices, which can bias the loss function. Also, most background samples are easy to be classified and only some voxels around the atrium boundary get high probabilities to be false alarms. Therefore, we propose an Online Hard Negative Example Mining (OHNEM)

strategy to tune the objective function and selectively search informative hard negative samples to significantly improve the training efficacy. Motivated by [10], our OHNEM strategy roots in three facts, 1) most negative samples are becoming non-informative as the training progresses and should be excluded from the loss to alleviate the foreground-background imbalance, 2) hard negative examples are those getting high prediction scores to be foreground, 3) we preserve all positive samples to learn the knowledge of foreground thoroughly. In this regard, our OHNEM consists of three major steps in each training iteration. First, determining the number of hard negative examples to be selected. We adaptively define this number as C hn = max(2 C p, min(5, C n /4)), where C p and C n are the number of foreground and background voxels obtained from the label slice, respectively. Then, all the background samples are ranked in a descending order according to their prediction scores. Finally, our loss function only counts the top C hn background candidates and discards the rest. Our loss function equally considers the losses from all the foreground samples and the selected background samples. Effectiveness of the proposed OHNEM is verified with ablation study in Section 3. Fig. 3. The proposed competitive training scheme with multiple model candidates. Winner model in each stage is denoted with yellow dot. 2.4 Competitive Training Scheme Because of the random conditions in training, networks with the same training setting can have different generalization abilities. An ensemble of multiple networks often surpasses the single network. In this paper, we propose a new training scheme to generate an ensemble of multiple networks in a competitive way, denoted as CTS. As shown in Fig. 3, our CTS training scheme is intuitive in design. The scheme simultaneously trains several competitors with the same initial configurations. Only the winner model with the lowest validation loss can be recorded at the validation point. The winner model in stage t can get the chance to broadcast its parameters to initialize all the competitors in next competitive stage. All the competitors are then be fine-tuned in stage t + 1 until the generation of a new winner at next validation point.

With the CTS training scheme, all competitors are facing with the more and more intensive competition. All candidates are enhanced with the local optimal model after each validation stage. They can only struggle to mine more informative examples for effective gradient update, which in turn improves the performances of all competitors. Also, different from the direct and heavy ensemble of multiple networks for testing, our network is light-weight for testing since only the winner model is kept at the end. 3 Experimental Results Experiment materials: We evaluated our method on the Atrial Segmentation Challenge dataset in STACOM 2018. There are two kinds of image size as 576 576 88 and 640 640 88 with the unified spacing 1.0 1.0 1.0mm. We split the whole dataset (100 volumes with ground truth annotation) into 70/10/20 for training/validation/testing. Training dataset is further augmented with random flipping. Every volume is split into slices along the last dimension. Slices are normalized as zero mean and unit variance before training and testing. Implementation details: We implement all the compared methods with the PyTorch using NVIDIA GeForce GTX TITAN X GPUs. We update the weights of network with an Adam optimizer (batch size=2, learning rate=0.001). All the models are trained for 180 epochs with a validation every 30 epochs. Only the model with the lowest validation loss is saved. For the competitive training, the winner is generated every 30 epochs. For the post-processing in testing, small isolated connected components are removed in the final labeling result. We implemented several state-of-the-art 3D and 2D networks varying with different layouts and kernel designs for extensive comparison. We also conducted ablation study on our proposed modules. Table 1. Quantitative evaluation of our proposed framework Method Metrics Dice[%] Conform[%] Jaccard[%] Adb[mm] Hdb[mm] PSPNet [15] 87.904 72.218 78.532 2.159 23.319 PSPNetD 92.053 82.536 85.387 1.564 21.154 Unet-2D 89.638 76.485 81.404 2.185 22.711 Unet-3D 87.020 68.033 77.785 2.082 25.243 DeepLabV3 [4] 88.456 72.449 79.846 2.427 23.099 SegNet [1] 90.762 78.912 83.422 2.050 23.362 GCN [8] 91.812 82.049 84.926 1.597 18.940 PSPNetD+OHNEM 92.688 84.113 86.434 1.405 17.224 PSPNetD+OHNEM+CTS3 92.834 84.445 86.694 1.496 17.891

Quantitative and Qualitative Analysis: We use 5 metrics to evaluate all the compared solutions, including Dice, Conform, Jaccard, Average Distance of Boundaries (Adb) and Hausdorff Distance of Boundaries (Hdb). We first compare our core network PSPNet with other architectures. PSPNet with dilated convolution and dropout is denoted as PSPNetD. We can observe that, sharing the same spirit in modifying kernels to perceive broad context, both PSPNetD and GCN get better results than other architectures which use vanilla convolutions. PSPNetD performs even better than GCN, which may because of the explicit exploration on multiscale semantic cues. Unet-3D loses power in our task and present poor results than all 2D networks. Fig. 4. Visualization of our segmentation results with the mapping of Hausdorff distance. Color bar is annotated with mean in the center, min and max on the ends. Based on PSPNetD, we conduct ablation study on OHNEM and CTS. There occurs a significant improvement (about 0.6% in Dice) when OHNEM module is involved to guide the learning of PSPNetD (denoted as PSPNetD+OHNEM ). This proves the importance of designing proper objective function and selecting effective samples for training. For the competitive training scheme CTS, we can see that, with 3 candidates for competition, PSPNetD+OHNEM+CTS3 brings another 0.16% improvement in Dice over PSPNetD+OHNEM. The learning of hard negative samples is already enhanced in PSPNetD+OHNEM, while CTS further pushes candidate models for more effective learning. Finally, we visualize the segmentation results of PSPNetD+OHNEM+CTS3 in Fig. 4 with Hausdorff distance [mm] between segmentation and ground truth. Our method presents robustness against the shape and size variations in left atrium and pulmonary veins. Most points on segmentation surface around the

atrium body present low Hausdorff distances (cold color) to the ground truth. Large distance values (hot color) often occur around the end of pulmonary veins which are hard to define since the length of of pulmonary vein varies greatly. 4 Conclusions In this paper, we present a fully automatic framework for the segmentation of left atrium and pulmonary veins in GE-MR volume. We start with network architecture in 2D, and adopt kernels for large receptive field. The involvement of pyramid pooling improves our network in combing global and local cues to recognize detailed boundaries. The proposed Online Hard Negative Example Mining significantly boosts the training efficacy. Competitive training scheme pushes the network for further improvement. Verified on 20 testing cases, our method presents promising performance in segmenting left atrium and pulmonary veins across different subjects and imaging conditions. 5 Acknowledgments: The work in this paper was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region (Project no. GRF 14225616). References 1. Badrinarayanan, V., et al.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE TPAMI 39(12), 2481 2495 (2017) 2. Bernard, O., et al.: Deep learning techniques for automatic mri cardiac multistructures segmentation and diagnosis: Is the problem solved? IEEE TMI (2018) 3. Chen, J., Yang, G., Gao, Z., et al.: Multiview two-task recursive attention model for left atrium and atrial scars segmentation. arxiv preprint arxiv:1806.04597 (2018) 4. Chen, L.C., et al.: Rethinking atrous convolution for semantic image segmentation. arxiv preprint arxiv:1706.05587 (2017) 5. Dou, Q., Yu, L., et al: 3d deeply supervised network for automated segmentation of volumetric medical images. Medical Image Analysis (2017) 6. Karim, R., Housden, R.J., et al.: Evaluation of current algorithms for segmentation of scar tissue from late gadolinium enhancement cardiovascular magnetic resonance of the left atrium: an open-access grand challenge. Journal of Cardiovascular Magnetic Resonance 15(1), 105 (2013) 7. Mortazi, A., Karim, R., et al.: Cardiacnet: Segmentation of left atrium and proximal pulmonary veins from mri using multi-view cnn. In: MICCAI. pp. 377 385. Springer (2017) 8. Peng, C., Zhang, X., et al.: Large kernel matters improve semantic segmentation by global convolutional network. arxiv preprint arxiv:1703.02719 (2017) 9. Peng, P., Lekadir, K., et al.: A review of heart chamber segmentation for structural and functional analysis using cardiac magnetic resonance imaging. Magnetic Resonance Materials in Physics, Biology and Medicine 29(2), 155 195 (2016)

10. Shrivastava, A., Gupta, A., Girshick, R.: Training region-based object detectors with online hard example mining. In: CVPR. pp. 761 769 (2016) 11. Tao, Q., Ipek, E.G., et al.: Fully automatic segmentation of left atrium and pulmonary veins in late gadolinium-enhanced mri: Towards objective atrial scar assessment. Journal of Magnetic Resonance Imaging 44(2), 346 354 (2016) 12. Tobon-Gomez, C., Geers, A.J., et al.: Benchmark for algorithms segmenting the left atrium from 3d ct and mri datasets. IEEE TMI 34(7), 1460 1473 (2015) 13. Yang, G., Zhuang, X., Khan, H., et al.: A fully automatic deep learning method for atrial scarring segmentation from late gadolinium-enhanced mri images. In: ISBI 2017. pp. 844 848. IEEE (2017) 14. Yang, X., Bian, C., Yu, L., Ni, D., Heng, P.A.: Hybrid loss guided convolutional networks for whole heart parsing. In: STACOM. pp. 215 223. Springer (2017) 15. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: CVPR. pp. 2881 2890 (2017) 16. Zheng, Y., et al.: Multi-part modeling and segmentation of left atrium in c-arm ct for image-guided ablation of atrial fibrillation. IEEE TMI 33(2), 318 331 (2014) 17. Zhuang, X., et al: A registration-based propagation framework for automatic whole heart segmentation of cardiac mri. IEEE TMI 29(9), 1612 1625 (2010)