Patient surface model and internal anatomical landmarks embedding

Similar documents
A machine learning pipeline for internal anatomical landmark embedding based on a patient surface model

Comparison of Default Patient Surface Model Estimation Methods

Manifold Learning-based Data Sampling for Model Training

Gradient-Based Differential Approach for Patient Motion Compensation in 2D/3D Overlay

Iterative CT Reconstruction Using Curvelet-Based Regularization

Depth-Layer-Based Patient Motion Compensation for the Overlay of 3D Volumes onto X-Ray Sequences

3D Statistical Shape Model Building using Consistent Parameterization

Sparse Principal Axes Statistical Deformation Models for Respiration Analysis and Classification

Mapping Multi-Modal Routine Imaging Data to a Single Reference via Multiple Templates

Respiratory Motion Compensation for C-arm CT Liver Imaging

Towards full-body X-ray images

Improvement and Evaluation of a Time-of-Flight-based Patient Positioning System

radiotherapy Andrew Godley, Ergun Ahunbay, Cheng Peng, and X. Allen Li NCAAPM Spring Meeting 2010 Madison, WI

Registration of Dynamic Range Images

Guided Image Super-Resolution: A New Technique for Photogeometric Super-Resolution in Hybrid 3-D Range Imaging

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

A Simulation Study and Experimental Verification of Hand-Eye-Calibration using Monocular X-Ray

Discrete Estimation of Data Completeness for 3D Scan Trajectories with Detector Offset

Scaling Calibration in the ATRACT Algorithm

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Convolution-Based Truncation Correction for C-Arm CT using Scattered Radiation

Combination of Markerless Surrogates for Motion Estimation in Radiation Therapy

A Multiple-Layer Flexible Mesh Template Matching Method for Nonrigid Registration between a Pelvis Model and CT Images

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Hierarchical Multi structure Segmentation Guided by Anatomical Correlations

Hybrid Spline-based Multimodal Registration using a Local Measure for Mutual Information

3D Guide Wire Navigation from Single Plane Fluoroscopic Images in Abdominal Catheterizations

Learning a Manifold as an Atlas Supplementary Material

Algorithm research of 3D point cloud registration based on iterative closest point 1

Particle Filtering. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore

The Anatomical Equivalence Class Formulation and its Application to Shape-based Computational Neuroanatomy

Computational Design. Stelian Coros

Landmark Detection on 3D Face Scans by Facial Model Registration

ToF/RGB Sensor Fusion for Augmented 3-D Endoscopy using a Fully Automatic Calibration Scheme

Learning based face hallucination techniques: A survey

Development of 3D Model-based Morphometric Method for Assessment of Human Weight-bearing Joint. Taeho Kim

Utilizing Salient Region Features for 3D Multi-Modality Medical Image Registration

Automatic Patient Pose Estimation Using Pressure Sensing Mattresses

Simultaneous Model-based Segmentation of Multiple Objects

Overview of Proposed TG-132 Recommendations

Spatial-temporal Total Variation Regularization (STTVR) for 4D-CT Reconstruction

SCAPE: Shape Completion and Animation of People

Personal Navigation and Indoor Mapping: Performance Characterization of Kinect Sensor-based Trajectory Recovery

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

3D object recognition used by team robotto

Using Probability Maps for Multi organ Automatic Segmentation

DETC D FACE RECOGNITION UNDER ISOMETRIC EXPRESSION DEFORMATIONS

2D Vessel Segmentation Using Local Adaptive Contrast Enhancement

On-line, Incremental Learning of a Robust Active Shape Model

A Non-Linear Image Registration Scheme for Real-Time Liver Ultrasound Tracking using Normalized Gradient Fields

Selecting Models from Videos for Appearance-Based Face Recognition

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

PROSTATE CANCER DETECTION USING LABEL IMAGE CONSTRAINED MULTIATLAS SELECTION

Total Variation Regularization Method for 3D Rotational Coronary Angiography

Real-time Respiratory Motion Analysis Using Manifold Ray Casting of Volumetrically Fused Multi-View Range Imaging

Instance-level recognition

Total Variation Regularization Method for 3-D Rotational Coronary Angiography

Automatic Vascular Tree Formation Using the Mahalanobis Distance

Topological Mapping. Discrete Bayes Filter

Edge-Preserving Denoising for Segmentation in CT-Images

Interactive Boundary Detection for Automatic Definition of 2D Opacity Transfer Function

Automatic Generation of Shape Models Using Nonrigid Registration with a Single Segmented Template Mesh

Head Frontal-View Identification Using Extended LLE

RIGID IMAGE REGISTRATION

Automatic Ascending Aorta Detection in CTA Datasets

3D Registration based on Normalized Mutual Information

Kinematic Analysis of Lumbar Spine Undergoing Extension and Dynamic Neural Foramina Cross Section Measurement

Iterative Closest Point Algorithm in the Presence of Anisotropic Noise

A new approach to reference point location in fingerprint recognition

Su et al. Shape Descriptors - III

Semi-Automatic Prediction of Landmarks on Human Models in Varying Poses

Human pose estimation using Active Shape Models

Assessing Accuracy Factors in Deformable 2D/3D Medical Image Registration Using a Statistical Pelvis Model

Fully Automatic Multi-organ Segmentation based on Multi-boost Learning and Statistical Shape Model Search

Automatic Lung Surface Registration Using Selective Distance Measure in Temporal CT Scans

Respiratory Motion Estimation using a 3D Diaphragm Model

Artefakt-resistente Bewegungsschätzung für die bewegungskompensierte CT

Is deformable image registration a solved problem?

Automatic Removal of Externally Attached Fiducial Markers in Cone Beam C-arm CT

CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points]

Image-based Compensation for Involuntary Motion in Weight-bearing C-arm CBCT Scanning of Knees

Image Registration. Prof. Dr. Lucas Ferrari de Oliveira UFPR Informatics Department

Comparison of Different Metrics for Appearance-model-based 2D/3D-registration with X-ray Images

Automatic Modelling Image Represented Objects Using a Statistic Based Approach

ROBUST OBJECT TRACKING BY SIMULTANEOUS GENERATION OF AN OBJECT MODEL

Supplementary Material : Partial Sum Minimization of Singular Values in RPCA for Low-Level Vision

Improved Semi-Automatic Basket Catheter Reconstruction from Two X-Ray Views. Xia Zhong Pattern Recognition Lab (CS 5)

VALIDATION OF DIR. Raj Varadhan, PhD, DABMP Minneapolis Radiation Oncology

Pathology Hinting as the Combination of Automatic Segmentation with a Statistical Shape Model

Statistical Learning of Human Body through Feature Wireframe

What s new in. teamplay? Discover the latest features and functionalities. siemens.com/teamplay

Moving Metal Artifact Reduction for Cone-Beam CT (CBCT) Scans of the Thorax Region

Prototype of Silver Corpus Merging Framework

Generation of Triangle Meshes from Time-of-Flight Data for Surface Registration

Quality Guided Image Denoising for Low-Cost Fundus Imaging

Combining Gabor Features: Summing vs.voting in Human Face Recognition *

Instance-level recognition

3D Shape Representation and Analysis of the Human Body and Ontology for Anthropometric Landmarks

A Statistical Approach to Culture Colors Distribution in Video Sensors Angela D Angelo, Jean-Luc Dugelay

Transcription:

Patient surface model and internal anatomical landmarks embedding Xia Zhong 1, Norbert Strobel 2,4, Annette Birkhold 2, Markus Kowarschik 2, Rebecca Fahrig 2, Andreas Maier 1,3 1 Pattern Recognition Lab, FAU Erlangen-Nürnberg 2 Siemens Healthcare GmbH, Forchheim Germany 3 Erlangen Graduate School in Advanced Optical Technologies(SAOT) 4 Fakultät für Elektrotechnik, Hochschule für angewandte Wissenschaften Würzburg-Schweinfurt xia.zhong@fau.de Abstract. The patient surface model has shown to be a useful asset to improve existing diagnostic and interventional tasks in a clinical environment. For example, in combination with RGB-D cameras, a patient surface model can be used to automate and accelerate the diagnostic imaging workflow, manage patient dose, and provide navigation assistance. A shortcoming of today s patient surface models, however, is that, internal anatomical landmarks are not present. In this paper, we introduce a method to estimate internal anatomical landmarks based on the surface model of a patient. Our method relies on two major steps. First, we fit a template surface model is to a segmented surface of a CT dataset with annotated internal landmarks using keypoint and feature descriptor based rigid alignment and atlas-based non-rigid registration. In a second step, we find for each internal landmark a neighborhood on the template surface and learn a generalized linear embedding between neighboring surface vertices in the template and the internal landmark. We trained and evaluated our method using cross-validation in 20 datasets over 50 internal landmarks. We compared the performance of four different generalized linear models. The best mean estimation error over all the landmarks was achieved using the lasso regression method with a mean error of 12.19 ± 6.98 mm. 1 Introduction Patient modeling has shown great potential in medical applications. One reason is that RGB-D cameras have become more easily available. For example, in the diagnostic environment, Singh et al. [1] demonstrated that a patient surface model can be fitted to the patient using a RGB-D camera ahead of CT scans to automate and accelerate the workflow. In the interventional environment, a patient surface model was used to estimate the skin dose [2], monitor the breathing motion, or provide navigation assistance [3]. The patient model used in these applications is a surface model. Related activities described in the literature derive an articulated skeleton [4], model the shape of the surface model

2 Zhong et al. and compare different estimation methods [5]. Unfortunately, clinical relevant internal anatomical landmarks are not included in the surface model, as it only models the outer shape of a patient. In this paper, we propose a method for finding internal landmarks of the human body e.g. vertebra, based on a surface model and evaluate its accuracy. Once internal landmarks are found, we can embed an appropriately adjusted spine model into the surface model. As organs have a fixed position relative to the spine, fitting a spine model to a surface model can, for example, be used for organ-specific positioning of a C-arm system. 2 Materials and Methods Our learning-based method embeds the internal landmarks into a patient surface model. We use CT datasets with annotated landmarks to train our method. For each CT data set, the surface is segmented and represented as a surface mesh x. We register a template surface model ˆx to this segmented surface. Afterwards, we learn the mapping function M i to establish the relation between model ˆx and associated landmark l such that M i = arg min M i l t,i M i ( ˆx t ) 2 2 (1) t 2.1 Surface model registration and fitting We first trained an Atlas for the patient model using the Civilian American and European Surface Anthropometry Resource Project (CAESAR) data. Using an active shape model (ASM), a patient surface model can be described as ˆx t = x + Db t, where the x is the mean surface model in the atlas, D describes a matrix comprising the modes of variation, and b t is the associated weighting vector. Then, for each segmented surface model x t, we need to find the best fitting weighting vector b t and associated rigid transformation T t by minimizing b t, T t = arg min b,t D (x T ( x + Db))2 (2) In Eq. (2), the function D calculates the minimal distance between two meshes. To solve this minimization problem efficiently, we rely on alternating minimization. We first estimate the transformation T based on a given b. This is a well-known surface mesh rigid registration problem and can be solved using feature descriptors e.g. HoG, SPIN, and LDSIFT [6] and feature matching. The challenge in our case is that the two meshes may partially overlap and the deformation between two meshes may be non-rigid. As patients usually are positioned head first, supine, i.e., are on their back, the outer shape of the patient may potentially lack distinctive key points. To establish correspondences between feature points robustly, we first sample the key points as propsed by Sahillioğlu et al. [7], such that the sampling is almost uniform. In this way, we

Patient surface model and internal anatomical landmarks embedding 3 avoid accumulation of feature points in high curvature regions when identifying key points. For each key point, we calculate the LDSIFT feature descriptor, which is robust to non-rigid deformations [6]. We further incorporate the prior knowledge, that the transformation T comprises mainly translations and that rotation is very limited. In our actual implementation, we divided the surface mesh into eight primary districts using the principal axis of the mesh, and rejected any cross-district matching. Due to the non-rigid deformation between two meshes, we may find ambiguity in rigid transformation. To counteract this effect, we consider all feasible results using RANSAC as motion cluster and use mean shift algorithm to search the cluster center of the biggest cluster. This cluster center is used as our motion estimation T t. Knowing the transformation T t, the weighting factor b t can be estimated by minimizing b t = arg min b D (x T t ( x + Db)) 2 + α 1 R (x, ˆx) (3) where the scalar α is a parameter, and the function R calculates the overlap ratio between the segmented and the estimated surface mesh. This function regularizes the minimization problem such that the estimated surface will not be degenerated. After this step, the residual error between estimated and segmented surface is minimized using non-rigid mesh registration, as proposed by Allen et al. [8]. 2.2 Surface and internal landmark embedding After fitting the surface model ˆx to the segmented CT surface x, we look for the mapping M i between ˆx and the annotated internal landmarks l i in CT data sets. For the i th landmark in the t th CT data l t,i, we assume that the landmark position (coordinates) can be expressed as a generalized linear combination of neighboring vertices ˆx t,j N (l i ). The neighborhood is defined using k-nearest neighbor with Mahalanobis metric. We used the covariance matrix of the surface model x to calculate the Mahalanobis distance. We select a large neighborhood such that the residual error due to non-linearity in mapping is minimized. The number and indicies of neighborhood vertices can vary in each data set. We use the unions according to the indicies as the neighborhood for each internal landmark. To describe this more formally, we introduce matrix X N (li) = [X T 1,N (l i),, XT N,N (l i) ]T and matrix L i = [l T 1,i,, lt 1,i ]T where matrix X t,n (li) = [x t,1,, x t,m ] comprises all vertices in an associated neighborhood for the t th data set and the landmark l t,i. The scalar N denotes the number of dataset and the scalar M denotes the number of neighboring vertices. At this point, we can formulate a generalized linear mapping with a cost function for the linear mapping weighting vector w i for landmarks l t,i. w i = arg min w i X N (li)w i L i 2 + λp(w i ) (4) In Eq. (4), the scalar λ is a Lagrange multiplier for the penalty function P. The penalty function P is different for each generalized model. In ridge regression, the

4 Zhong et al. L2 norm of w is used while in lasso, the L1 norm is used respectively. The elastic net regression using a mixture of L1 and L2 norm while in linear embedding the sum of w is used. We investigated the linear embedding [9], ridge regression, lasso regression and elastic net regression method to estimate the landmark position. Noted, that in case of linear embedding regression, the minimization problem can be solved in closed form. 3 Evaluation and Results We trained our algorithm using Anatomy3 from the Visceral dataset [10]. This dataset comprises 20 full-body CT scans with 50 associated internal landmarks in each dataset. An example of the dataset can be found in Fig. 2. We evaluated our algorithm using leave-one-out cross-validation and the estimation error is given in mean ± standard deviation in mm. We compared the estimation result using linear embedding (LLE), ridge regression (Ridge), lasso regression (Lasso) and elastic net regression (Elastic). The results are shown in Fig. 1. As we can see from the results, in most of the cases, the ridge regression has the worst performance with an overall estimation error of 22.19 ± 12.36 mm. The closed-form solution of LLE method outperformed ridge regression and reduced the mean error to 13.43 ± 12.00 mm. Using elastic regression method, the standard deviation was reduced further and the mean estimation error was 12.71 ± 7.32 mm. The best performance was achieved using lasso regression. The overall landmark estimation error then was 12.19 ± 6.98 mm. 60.00 Mean Internal Landmark Estimation Error 50.00 40.00 30.00 20.00 10.00 0.00 C3 C4 C5 C6 C7 L1 L2 L3 L4 L5 Th1 Th10 Th11 Th12 Th2 Th3 Th4 Th5 Th6 Th7 Th8 Th9 aorta bifurcation aortic arch bronchus left bronchus right clavicle left clavicle right crista iliaca left crista iliaca right eye left eye right ischiadicum left ischiadicum right renalpelvis left renalpelvis right sternoclavicular left sternoclavicular right symphysis trachea bifurcation trochanter major left trochanter major right trochanter minor left trochanter minor right tuberculum left tuberculum right vci bifurcation ventricle left Error in mm ventricle right xyphoideus Ridge LLE Elastic Lasso Fig. 1. Mean internal landmark estimation error using linear embedding (LLE), ridge regression (Ridge), elastic net regression (Elastic) and lasso regression (Lasso)

4 Discussion Patient surface model and internal anatomical landmarks embedding 5 The goal of our work has been to position anatomical landmarks inside a surface based on surface vertices close to associated landmarks. We found that our approach based on a linear mapping relative to a template surface yielded good matches between predicted landmark positions and their actual positions. The results also indicate that our template fitting method is robust as the accuracy for the template fitting is crucial for the subsequent internal landmark estimation. We can also see from the results that the ridge and LLE methods have a higher estimation error than Elastic and Lasso regression. One of the reasons can be that by introducing L1 norm as a regularizer, the Elastic and Lasso regression impose the sparsity of weighting vector w. This might be an indication that we could improve our neighborhood selection method and only include vertices which are significant for the estimation. In this work, we do not investigate the mapping using non-linear methods as the number of our datasets is limited. So far, we estimated internal landmarks independently and have not yet considered their joint estimation. As human anatomy follows certain rules there should be a potential to improve our results by exploiting any mutual relationships between them. Therefore, for certain groups of landmarks, e.g., the spine, an active shape model of the spine could potentially further improve the estimation result. We see our work of embedding the internal landmarks to the surface model as adding prior information to our datasets. In Fig. 2, we show an example of fitting an adapted spline model using the estimated internal landmarks. In future work, we will look into the refinement method of this initial estimation when pre-operative image data is available, i.e., we will try to calculate an a posteriori estimate based on our a-priori estimate. Acknowledgments We gratefully acknowledge the support of Siemens Healthineers, Forchheim, Germany. We also thank Siemens Corporate Technology for providing the avatar database. Note that the concepts and information presented in this paper are based on research, and they are not commercially available. References 1. Singh V, Chang Y, Ma K, et al. Estimating a Patient Surface Model for Optimizing the Medical Scanning Workflow. In: Med Image Comput Comput Assist Interv; 2014. p. 472 479. 2. Johnson PB, Borrego D, Balter S, et al. Skin dose mapping for fluoroscopically guided interventions. Med Phys. 2011;38(10):5490 5499. 3. Bauer S, Wasza J, Haase S, et al. Multi-modal Surface Registration for Markerless Initial Patient Setup in Radiation Therapy using Microsoft s Kinect Sensor. In: Proc IEEE Int Conf Comput Vis; 2011. p. 1175 1181. 4. Anguelov D, Koller D, Pang HC, et al. Recovering articulated object models from 3D range data. In: Uncertain Artif Intell; 2004. p. 18 26. 5. Zhong X, Strobel N, Kowarschik M, et al. Comparison of Default Patient Surface Model Estimation Methods. In: Proc BVM; 2017. p. 281 286.

6 Zhong et al. Fig. 2. A sample of the Anatomy3 training set is shown on the left. The annotated internal landmarks are indicated using red crosses and the segmented internal organs are shown in different colors. The middle figure shows the output of our algorithm, i.e., internal landmarks derived based on the surface. The result on the right demonstrates the same idea, but this time a patient surface model was used which was generated based on patient meta data such as height and weight. 6. Darom T, Keller Y. Scale-invariant features for 3-D mesh models. vol. 21; 2012. p. 2758 2769. 7. Sahillioğlu Y, Yemez Y. Minimum-distortion isometric shape correspondence using EM algorithm. IEEE Trans Pattern Anal Mach Intell. 2012;34(11):2203 2215. 8. Allen B, Curless B, Popović Z. The space of human body shapes: reconstruction and parameterization from range scans. In: ACM Trans Graph. vol. 22; 2003. p. 587 594. 9. Roweis ST, Saul LK. Nonlinear dimensionality reduction by locally linear embedding. science. 2000;290(5500):2323 2326. 10. del Toro OAJ, Goksel O, Menze B, et al. VISCERAL VISual Concept Extraction challenge in RAdioLogy: ISBI 2014 challenge organization. Proc VISCERAL Challenge at ISBI. 2014; p. 6 15.