MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM

Similar documents
Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Fast Natural Feature Tracking for Mobile Augmented Reality Applications

Face Tracking : An implementation of the Kanade-Lucas-Tomasi Tracking algorithm

EE795: Computer Vision and Intelligent Systems

Robust Camera Pan and Zoom Change Detection Using Optical Flow

Efficient Block Matching Algorithm for Motion Estimation

CS6670: Computer Vision

NIH Public Access Author Manuscript Proc Int Conf Image Proc. Author manuscript; available in PMC 2013 May 03.

EE795: Computer Vision and Intelligent Systems

Low Cost Motion Capture

SE 263 R. Venkatesh Babu. Object Tracking. R. Venkatesh Babu

Curvature guided surface registration using level sets

Motion Estimation. There are three main types (or applications) of motion estimation:

Marcel Worring Intelligent Sensory Information Systems

Optical flow and tracking

Parallel Non-Rigid Registration on a Cluster of Workstations

Evaluation of Moving Object Tracking Techniques for Video Surveillance Applications

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Dominant plane detection using optical flow and Independent Component Analysis

CS4495 Fall 2014 Computer Vision Problem Set 5: Optic Flow

Dense Image-based Motion Estimation Algorithms & Optical Flow

Image processing and features

Using Optical Flow for Stabilizing Image Sequences. Peter O Donovan

DESIGN AND IMPLEMENTATION OF VISUAL FEEDBACK FOR AN ACTIVE TRACKING

Lecture 28 Intro to Tracking

Recall: Blob Merge/Split Lecture 28

Optical Flow Estimation

arxiv: v1 [cs.cv] 28 Sep 2018

Segmentation and Tracking of Partial Planar Templates

Video Alignment. Final Report. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Image Coding with Active Appearance Models

Feature Tracking and Optical Flow

Feature Tracking and Optical Flow

Automatic Segmentation of Parotids from CT Scans Using Multiple Atlases

William Yang Group 14 Mentor: Dr. Rogerio Richa Visual Tracking of Surgical Tools in Retinal Surgery using Particle Filtering

Multimodal image registration by edge attraction and regularization using a B-spline grid

A MOTION MODEL BASED VIDEO STABILISATION ALGORITHM

Occluded Facial Expression Tracking

Video Processing for Judicial Applications

arxiv: v1 [cs.cv] 2 May 2016

LOCAL-GLOBAL OPTICAL FLOW FOR IMAGE REGISTRATION

Adaptive Multi-Stage 2D Image Motion Field Estimation

Implementation and Comparison of Feature Detection Methods in Image Mosaicing

Using temporal seeding to constrain the disparity search range in stereo matching

An Improvement of the Occlusion Detection Performance in Sequential Images Using Optical Flow

COMPUTER VISION > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE

Optical Flow-Based Motion Estimation. Thanks to Steve Seitz, Simon Baker, Takeo Kanade, and anyone else who helped develop these slides.

Corner Detection. Harvey Rhody Chester F. Carlson Center for Imaging Science Rochester Institute of Technology

2D Video Stabilization for Industrial High-Speed Cameras

Chapter 3 Image Registration. Chapter 3 Image Registration

Fast Image Registration via Joint Gradient Maximization: Application to Multi-Modal Data

Local Image Registration: An Adaptive Filtering Framework

Generation of a Disparity Map Using Piecewise Linear Transformation

Mixture Models and EM

CS201: Computer Vision Introduction to Tracking

Human Motion Detection and Tracking for Video Surveillance

Motion. 1 Introduction. 2 Optical Flow. Sohaib A Khan. 2.1 Brightness Constancy Equation

Capturing, Modeling, Rendering 3D Structures

Texture Image Segmentation using FCM

Visual Tracking. Image Processing Laboratory Dipartimento di Matematica e Informatica Università degli studi di Catania.

AN EFFICIENT BINARY CORNER DETECTOR. P. Saeedi, P. Lawrence and D. Lowe

16720 Computer Vision: Homework 3 Template Tracking and Layered Motion.

Shape Descriptor using Polar Plot for Shape Recognition.

CS4495 Fall 2014 Computer Vision Problem Set 6: Particle Tracking

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Representing Moving Images with Layers. J. Y. Wang and E. H. Adelson MIT Media Lab

International Journal of Advance Engineering and Research Development

Integration of Multiple-baseline Color Stereo Vision with Focus and Defocus Analysis for 3D Shape Measurement

Visual Tracking (1) Pixel-intensity-based methods

An Approach for Reduction of Rain Streaks from a Single Image

A Real-time Algorithm for Atmospheric Turbulence Correction

Cellular Learning Automata-Based Color Image Segmentation using Adaptive Chains

Nonrigid Registration using Free-Form Deformations

Particle Filtering. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore

Improvement and Evaluation of a Time-of-Flight-based Patient Positioning System

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Lucas-Kanade Motion Estimation. Thanks to Steve Seitz, Simon Baker, Takeo Kanade, and anyone else who helped develop these slides.

AN EFFICIENT APPROACH FOR IMPROVING CANNY EDGE DETECTION ALGORITHM

Chapter 9 Object Tracking an Overview

International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015)

Visual Tracking. Antonino Furnari. Image Processing Lab Dipartimento di Matematica e Informatica Università degli Studi di Catania

Robust Model-Free Tracking of Non-Rigid Shape. Abstract

CS 4495 Computer Vision Motion and Optic Flow

A Simple Algorithm for Image Denoising Based on MS Segmentation

Globally Stabilized 3L Curve Fitting

Motion and Optical Flow. Slides from Ce Liu, Steve Seitz, Larry Zitnick, Ali Farhadi

Computer Vision II Lecture 4

AN ADAPTIVE MESH METHOD FOR OBJECT TRACKING

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S

Moving Object Tracking in Video Using MATLAB

A Non-Linear Image Registration Scheme for Real-Time Liver Ultrasound Tracking using Normalized Gradient Fields

An Edge-Based Approach to Motion Detection*

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

ELEC Dr Reji Mathew Electrical Engineering UNSW

Face Tracking. Synonyms. Definition. Main Body Text. Amit K. Roy-Chowdhury and Yilei Xu. Facial Motion Estimation

PRINCIPAL COMPONENT ANALYSIS IMAGE DENOISING USING LOCAL PIXEL GROUPING

On Skeletons Attached to Grey Scale Images. Institute for Studies in Theoretical Physics and Mathematics Tehran, Iran ABSTRACT

Real-Time Scene Reconstruction. Remington Gong Benjamin Harris Iuri Prilepov

Leow Wee Kheng CS4243 Computer Vision and Pattern Recognition. Motion Tracking. CS4243 Motion Tracking 1

Transcription:

MULTIPLE REGION OF INTEREST TRACKING OF NON-RIGID OBJECTS USING DEMON'S ALGORITHM Rohan Pillai 1, Abhishikta Yalavali 2 Saima Mohan 1, and Amol Patil 1 1 igate, Tower 3, Magarpatta City, Pune, India, {rohan.pillai,saima.mohan,amol.vpatil}@igate.com, 2 abhishikthaharsha@gmail.com ABSTRACT In this paper we propose an algorithm for tracking multiple ROI (region of interest) undergoing non-rigid transformations. Demon's algorithm based on the idea of Maxwell's demon, has been applied here to estimate the displacement field for tracking of multiple ROI. This algorithm works on pixel intensities of the sequence of images thus making it suitable for tracking objects/regions undergoing non-rigid transformations. We have incorporated a pyramid-based approach for demon's algorithm computations of displacement field, which leads to significant reduction in the convergence speed and improvement in the accuracy. This algorithm is applied for tracking non-rigid objects in laproscopy videos which would aid surgeons in Minimal Invasive Surgery (MIS). KEYWORDS Computer vision, Maxwell's demon, multi-resolution pyramid, non-rigid deformation, optical flow, ROI tracking 1. INTRODUCTION Tracking a region of interest is well-known in image processing community. In its long history, a number of algorithm have been designed that address this problem quite successfully 1. In some real world situations, objects tend undergo non-rigid transformation and are thus difficult to track.these scenarios especially are prominent in medical applications where usually the region under consideration undergoes non rigid transformation mainly due to camera movement. In this paper we address the problem of tracking multiple region of interest (ROI) whose geometric appearance are changing gradually over time. We demonstrate the application of this algorithm for tracking multiple ROI's in a video recorded during endoscopy. In our previous paper 6, we have successfully demonstrated the use of demon based technique for tracking an ROI in video sequence. In our current paper, we refine our earlier work by considering the entire image and using a multi-resolution approach for calculating the displacement field. This introduction of multi-resolution approach has resulted in improved Jan Zizka (Eds) : CCSIT, SIPP, AISC, PDCTA - 2013 pp. 249 256, 2013. CS & IT-CSCP 2013 DOI : 10.5121/csit.2013.3628

250 Computer Science & Information Technology (CS & IT) computational efficiency and accuracy of the tracking algorithm which will be discussed in detail in subsequent sections of this paper. The overview of demons algorithm along with its use in tracking is presented in section 2. The multi-resolution approach is discussed in section 3. Discussions, results and conclusion appear in section 4, 5 and 6 respectively. 2. DEMON BASED TRACKING Thirion 7 introduced the demons forces for registration of images acquired from different modalities. The concept behind his experiment was to deform one image to match another. The work involved calculation of displacement of a physical point present in two images with the support of gradient based optical flow computation. The expression for displacement field of two images is as given in equation (1) Here is the displacement along x and y direction of a point, p, in the static image S with reference to a movable image M. is the gradient at point p on the image S. If is the image to be deformed then movable image M is constructed using a deformation operator as given in equation (2) and (3). Thirion had re-normalized the optical flow equation by introducing the intensity difference between the images in the denominator. Due to this, the displacement of each point has to be calculated iteratively until the squared error is minimized. The objective of image registration technique is to solve the minimization problem, to find the displacement field equation (??), where, d is the dissimilarity metric to measure the similarity between two images. Furthermore, to improve convergence speed and stability of the registration process, He Wang et al. 8 introduced the forces of the movable image. Here, α is a positive homogenization factor introduced by Cachier et al. 4 displacement field. to bound the

Computer Science & Information Technology (CS & IT) 251 Tracking objects in a video can be accomplished by calculating the interframe displacement of the objects. Assuming that the scene being recorded is not undergoing any rapid movement or change, equation (5) gives us the displacement of points in one image with reference to the next image. The displacement field is Gaussian smoothed for regularization 2. This displacement field is used to track a Region of Interest (ROI) in a video. The metric function, d, used in this paper is Sum of Squared Difference (SSD) of the images. 3. PYRAMIDAL IMPLEMENTATION Bouguet 3 demonstrated the use of pyramidal implementation in computing the optical flow algorithm based on Lucas Kanade method. The pyramid representation used in this paper is similar to that proposed by Bouguet. The method specified by Burt and Adelson 9 is used to construct recursively the pyramidal representation of the images I t-1 and I t as: {I t-1 } m=0,1, L where m is the pyramid Level. The value L is height of the pyramid. This value can be heuristically programmed by incorporating an estimation technique. Depending upon the expected inter frame displacement, L can take values 2, 3 or 4. In the image pyramid, image at m th level is at the resolution of the (m-1) th level. The algorithm starts with the L th level images and the initial displacement field initialized to a zero vector. Fig. 1. Illustration showing hierarchy of pyramid structure of an image The displacement field is iteratively computed for the images at this level until the termination criteria is satisfied. The termination criteria is defined by the iteration count or the rate of change of error. The estimated displacement field at the L th level is then rescaled to the images at the ( L - 1 ) th level in the pyramid and used as the initial displacement field for estimation. The estimation requires that the displacement field be updated with values computed as per equation (5) so as to decrease the error. This process is repeated until the displacement field for the 0 th level (original) images in the pyramid is obtained. Figure 2 summarizes the Pyramidal Implementation of Demon's based algorithm as a flowchart.

252 Computer Science & Information Technology (CS & IT) Fig. 2. Sequential execution of Pyramidal Implementation of Demon's based algorithm 4. DISCUSSION The ROI marked by a surgeon corresponds to an array of underlying pixels in an image. The displacement field calculated by the above algorithm represents the distance of a physical point in a frame being shifted with reference to the corresponding point in the previous frame. This displacement field can be applied to one or more ROI's which will propagate over the temporal sequence without any manual intervention. Using this principle multiple ROI's in the image can be tracked. The total number of computations performed on an n x x n y image would be Kmn x n y, where m computations are performed on each pixel of an image per iteration and the image converges after K iterations. Thus the computation time needed is directly proportional to the size of the image.

Computer Science & Information Technology (CS & IT) 253 With the introduction of pyramidal approach it is observed that the total computation time has reduced significantly. The displacement field calculated at lower resolution images is refined as we work towards the higher resolution images. Hence, resulting in fewer iterations at subsequent levels of the pyramid to converge. The computational advantage gained is illustrated in the performance chart shown in figure 3(a). For successive iteration the model image has a higher resemblance to the static image. Thus the error metric, SSD, that measures the dissimilarity between the two images is expected to decrease with each iteration. However, the SSD metric is image resolution dependent hence can not be used while comparing convergence between different pyramid levels. To establish a benchmark when comparing the displacement field produced at each level in the pyramid, we rescale the displacement field at each iteration to full image size and then use it to compute error with respect to the original image. Figure 3(b) shows the graph of error convergence with respect to iterations for single resolution and pyramidal approach. This confirms that the displacement field calculated at every level in the pyramid facilitates the overall convergence. The two results can be compared as they are operating on the same base images. In Figure 3(b) a sharp drop in error is observed at iteration number 5 and 18 which is caused by transition from one level in the pyramid to another. An Interesting observation about the type of convergence can be noted in figure 3 when plotted with respect to iteration count, the convergence graph looks like a series of decaying exponential. However, when the convergence graph is plotted against time, it looks more like a fast decaying exponential. Pyramidal implementation provides the freedom of making certain parameters like, smoothing kernel, termination criteria and the positive homogenization factor level dependent. The performance of the system can be further improved by selecting suitable values for these parameters. Pyramidal implementation provides the freedom of making certain parameters like, smoothing kernel, termination criteria and the positive homogenization factor level dependent. The performance of the system can be further improved by selecting suitable values for these parameters. (a)

254 Computer Science & Information Technology (CS & IT) (b) Fig. 3. Performance graph. (a) Plot depicting the time taken by single resolution (dotted line) and the pyramidal implementation (dashed line) to converge. (b) Graph of error calculated after every iteration for pyramidal implementation 5. RESULTS The performance of the algorithm was tested on a stream of synthesized images involving the motion of two blobs in arbitrary directions. For the numerical experiment, synthetic images of size 512x512 are generated with objects of interest imposed on a noisy background. To construct a noise model in our synthetic image, we used Gaussian noise (having zero mean and 0.005 variance) along with salt and pepper noise (0.005 probability) generated by MATLAB. For the first experiment figure 4, the objects considered are rigid bodies moving in different directions. Figure 4(a) indicates the initial state of objects with green marker enclosing them and figure 4(b) illustrates the process of tracking for an intermediate frame in the simulated video sequence. Fig. 4. Comparison of single resolution approach (green marker) and multi-resolution approach (red marker) for tracking multiple objects over time. (a) Second frame in the video (b) Result of tracking ROI's after 42 th frame. In medical imaging, regions of interest being filmed tend to undergo affine transformation due to continuous change in the camera position. The applicability of the discussed algorithm to track such non rigid objects is demonstrated in figure 5. It can be observed that the red line continues to

Computer Science & Information Technology (CS & IT) 255 capture the complete object including the gradual change in appearance of the object. The array corresponding to the ROI to accommodate for non-rigid transformation. As the size of the deformable region changes over time the length of the marker array should be made adaptive to accommodate this. Figure 6 shows the results for an endoscope video with a resolution of 640x480 as input. The algorithm works on the green channel of the video. The ROI's shown in blue are marked by the end user for tracking. The region undergoes non-rigid deformation over time which is tracked by the algorithm. Fig. 5. Illustration of experimental setup for tracking an object deforming over time. (a) First frame in the video where red marker indicates the ROI. (b) After 39 th frame, the red marker continues to enclose the region which has undergone deformation Fig. 6. Results of tracking multiple user-defined ROI on the endoscopy stream. The blue marker indicates the output of the multi-resolution approach and the green marker depicts the single-resolution approach. (a) Initial frame in the sequence (b) Result of tracking on the 54 th frame in the sequence. 6. CONCLUSIONS & FUTURE WORK In this paper, we demonstrated the use of pyramidal implementation for estimating the displacement field based on Thirion's demon. The algorithm successfully tracks multiple ROI's that undergo non-rigid transformation. Based on the experiments carried out it clearly indicates the advantage of pyramidal approach in terms of improvement in the convergence speed. Moreover, the result obtained using pyramidal approach generated robust and artifact free results as compared to the single resolution implementation.

256 Computer Science & Information Technology (CS & IT) Though the current approach is well suited for tracking multiple ROI's it is limited to cases of occlusion. Future work will include parallelism of the current implementation using GPU to achieve even faster convergence. Apart from vision based assistance to MIS, the demon based tracking approach can also be applied for pathological cell detection and tracking. A classical clinical condition where cells deform rapidly over time and requires to be tracked for continuous analysis. REFERENCES [1] S.S. Beauchemin and J.L. Barron. The computation of optical flow. ACM Computing Surveys, 1995. [2] P. Anandan. A computational framework and an algorithm for the measurement of visual. Technical report, University of Massachusetts Amherst, 1987. [3] Jean-Yves Bouguet. Pyramidal implementation of the lucas kanade feature tracker description of the algorithm. Technical report, Intel Corporation Microprocessor Research Labs, 2000. [4] Pascal Cachier, Xavier Pennec, and Nicholas Ayache. Fast non rigid matching by gradient descent: Study and improvements of the "demons" algo- rithm. Technical report, INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE, June 1999. [5] Visvanathan Ramesh, Dorin Comaniciu and Peter Meer. Kernel-based object tracking. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 25, NO. 5, 2003. [6] Abhinav Kumar, B Madhusudan Rao, Rajesh Ghole, Amol Patil, and Nilesh Ghatpande. Demons based tracking for non-rigid transformed region of interest. In TENCON 2011-2011 IEEE Region 10 Conference, IEEEXplore, November 2011. [7] J.-P. Thirion. Non-rigid matching using demons. In Proc. IEEE Computer Society Conf. Computer Vision and Pattern Recognition CVPR '96, 1996. [8] He Wang, Lei Dong, Jennifer O'Daniel, Radhe Mohan, Adam S Garden, K Kian Ang, Deborah A Kuban, Mark Bonnen, Joe Y Chang, and Rex Cheung. Validation of an accelerated 'demons' algorithm for deformable image registration in radiation therapy. Physics in Medicine and Biology, 50(12), 2005. [9] Peter J. Burt and Edward H. Adelson. The laplacian pyramid as a compact image code. IEEE Transactions on Communications, April 1983.