Representation and Detection of Deformable Shapes



Pedro F. Felzenszwalb
Artificial Intelligence Laboratory
Massachusetts Institute of Technology
Cambridge, MA

Abstract

We present a new method for detecting deformable shapes in images. The main difficulty with deformable template models is the very large (or infinite) number of possible non-rigid transformations of the templates. This makes the problem of finding an optimal match of a deformable template to an image incredibly hard. Using a new representation for deformable shapes, we show how to efficiently find a globally optimal solution to the non-rigid matching problem. Our matching algorithm can minimize a large class of energy functions, making it applicable to a wide range of problems. We present experimental results of detecting shapes in medical and natural images. Because we don't rely on local search techniques, our method is very robust, yielding good matches even in images with high clutter.

1 Introduction

In this paper we address the problem of detecting deformable shapes in images. Our approach falls within the framework of deformable template matching, where one wants to find a non-rigid transformation that maps a model to an image. In this setting, an energy function associates a cost with each potential transformation of the model, and we want to find a transformation with the lowest possible cost. Typically the energy function is a sum of two terms: the first attracts the deformed model towards salient image features, while the second penalizes large deformations of the model.

Most of the existing non-rigid matching techniques either require initialization near the final solution or are too slow for practical use. This is because the number of possible transformations of a template is very large. In contrast, we present an algorithm that can quickly find a globally optimal non-rigid transformation without any kind of initialization.
The search over transformations is done efficiently by exploiting special properties of a new representation for deformable shapes. We represent shapes using triangulated polygons, as illustrated in Figure 1. The triangles in a shape form a structure that can be used to model shape deformation. In fact, by picking a special triangulation we obtain a representation that is related to Blum's symmetric axis transform [4].

We note that the triangles in any triangulated polygon are connected together in a special way. This allows us to obtain an efficient dynamic programming algorithm to find the best match of a model shape to an image, where we allow each triangle in the model to undergo an arbitrary affine transformation. The quality of a match is given by an energy function that is a sum of terms, one for each triangle. This type of energy function is quite general, and can be used to represent a wide range of deformable template models. In particular, we can represent both the boundary and the internal structure of a deformable object.

Our experimental results illustrate the robustness of our method, showing accurate detection of deformable shapes even in highly cluttered scenes. The quality of the results is a consequence of using a good representation for deformable shapes together with an algorithm that can find a global optimal solution to the non-rigid matching problem. We show results both on medical and natural images, demonstrating the wide applicability of our techniques.

1.1 Related work

The basic idea of matching a deformable model to an image goes back to Fischler and Elschlager [8] and Widrow [16]. More recently, Grenander [10] introduced a framework which provides a very general setting to represent deformable objects. Other influential models were presented in [11] and [5]. A few efficient and provably good matching algorithms have been developed for restricted sets of deformable models.
For example, in [6] a dynamic programming algorithm was used to detect open deformable contours in images. Dynamic programming was also used in [1] to match models consisting of a number of landmarks with positions constrained by a decomposable graphical model. Efficient algorithms also exist for the related problem of computing a non-rigid match between two pre-segmented objects (such as [2] and [14]).

Here we present the first efficient algorithm that can find optimal non-rigid matches between a deformable template and an image, where both boundary and region information of the template can be taken into account. Even when we use an energy function with a data term that depends only on the boundary of a shape, we take into account region information when measuring shape deformation. In this way we obtain more realistic models of deformation than is possible using only boundary models.

Figure 1: Rabbit, pear, and hand shapes represented by triangulated polygons.

2 Triangulated polygons

As mentioned in the introduction, we represent shapes using triangulated polygons. The polygonal boundaries approximate the boundary of each shape, and the triangulations provide a decomposition of the shapes into parts.

We restrict ourselves to simple polygons, which are polygons without holes. A triangulation of a polygon $P$ is a decomposition of $P$ into triangles, defined by diagonals. Each diagonal is a line segment that connects two vertices of $P$ and lies in the interior of the polygon. Moreover, no two diagonals cross. We know (see [7]) that every simple polygon can be triangulated, and any triangulation of a polygon with $n$ vertices consists of exactly $n - 2$ triangles.

Let $T$ be a triangulation of a polygon $P$. By considering $T$ as a decomposition of $P$ into parts, we obtain a simple and intuitive way to non-rigidly deform the polygon. For example, a finger of the hand in Figure 1 can be bent by changing the shape of a single triangle. In practice we use a particular triangulation of the polygon known as the constrained Delaunay triangulation. The constrained Delaunay triangulation can be computed efficiently and yields a decomposition of the polygon into meaningful parts.
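The count of $n - 2$ triangles can be checked on a small example. The following is a minimal sketch that uses ear clipping, a simpler technique than the constrained Delaunay triangulation used in the paper; it is swapped in here only because it fits in a few lines. The function names (`cross`, `point_in_triangle`, `triangulate`) are hypothetical, and the sketch assumes a simple polygon whose vertices are listed counter-clockwise with no three consecutive vertices collinear.

```python
def cross(o, a, b):
    """Twice the signed area of triangle (o, a, b); positive if counter-clockwise."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def point_in_triangle(p, a, b, c):
    """Inclusive test: is p inside or on the counter-clockwise triangle (a, b, c)?"""
    return cross(a, b, p) >= 0 and cross(b, c, p) >= 0 and cross(c, a, p) >= 0

def triangulate(polygon):
    """Ear clipping (assumption: simple, counter-clockwise polygon).

    Returns a list of vertex-index triples, one per triangle.
    """
    idx = list(range(len(polygon)))
    triangles = []
    while len(idx) > 3:
        for pos in range(len(idx)):
            i, j, k = idx[pos - 1], idx[pos], idx[(pos + 1) % len(idx)]
            a, b, c = polygon[i], polygon[j], polygon[k]
            if cross(a, b, c) <= 0:
                continue  # reflex or degenerate corner, cannot be an ear
            others = (polygon[v] for v in idx if v not in (i, j, k))
            if any(point_in_triangle(p, a, b, c) for p in others):
                continue  # another vertex blocks this ear
            triangles.append((i, j, k))
            del idx[pos]  # clip the ear tip j
            break
        else:
            raise ValueError("no ear found; polygon must be simple and counter-clockwise")
    triangles.append(tuple(idx))
    return triangles
```

For an L-shaped polygon with six vertices, `triangulate` returns four triangles whose areas add up to the area of the polygon.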
In fact, the constrained Delaunay triangulation yields a representation that is closely related to the medial axis of a shape (see [13]).

Any triangulation of a simple polygon has an important property which is the key to our efficient matching algorithm. We denote by $G_T$ the dual graph of a triangulation $T$. The nodes of $G_T$ correspond to the triangles in $T$, and two nodes are connected when the corresponding triangles share an edge. Figure 2 illustrates a triangulated polygon and its dual graph. The important property is that $G_T$ is a tree. To see this, note that $G_T$ is connected and that each diagonal in $T$ cuts the polygon into two disjoint parts, so removing any edge from $G_T$ disconnects the graph.

Since $G_T$ is a tree, there is a nice order of elimination for the vertices and triangles of $T$. We use this fact to compute an optimal match of a model to an image using dynamic programming. Note that every tree has a leaf, and a leaf in $G_T$ corresponds to a triangle with some vertex $v$ that is not in any other triangle. If we delete $v$ and its triangle from $T$ we obtain a new triangulated polygon. Repeating this procedure we get an order of elimination for the vertices (and triangles) of $T$. The order is such that when eliminating the $i$-th vertex, it is in exactly one triangle of the current triangulated polygon.

If we consider the graph structure defined by $T$, the ordering described above is a perfect elimination scheme for the vertices of the graph. Graphs which admit perfect elimination schemes are known as decomposable, chordal or triangulated graphs (see [9]). Decomposable graphs are important because many problems which are hard to solve for general graphs can be solved efficiently in this restricted class.

3 Matching

Let $P$ be a simple polygon representing a model shape. An embedding of $P$ in the plane is defined by a continuous function $f : P \to \mathbb{R}^2$. We consider a set of embeddings which are extensions of maps $g : V \to \mathbb{R}^2$, where $V$ are the vertices of $P$. A triangulation of $P$ gives a

natural extension of $g$ to all of the polygon as a piecewise affine map $f$. The function $f$ sends each triangle $(v_1, v_2, v_3) \in T$ to the triangle $(g(v_1), g(v_2), g(v_3))$ using linear interpolation. In this way, the restriction of $f$ to each triangle $t \in T$ is an affine map $f_t$. To see that $f$ is well defined (and continuous) just note that if two triangles $a, b \in T$ touch, then $f_a$ and $f_b$ agree along the intersection of $a$ and $b$. What may seem surprising is that all embeddings which map each triangle according to an affine transformation are extensions of some $g$. This follows from the fact that an affine transformation is defined by the image of three non-collinear points.

Figure 2: Triangulated polygon and dual graph.

We define an energy function which assigns a cost to each map $g$, relative to an image $I$. The matching problem is to find $g$ with minimum energy (which corresponds to the best location for the deformable shape in the image). We consider energy functions with the following structural form:

$$E(g, I) = \sum_{(v_i, v_j, v_k) \in T} c_{ijk}(g(v_i), g(v_j), g(v_k), I). \qquad (1)$$

Each term $c_{ijk}$ should take into account the shape of the embedded triangle and the image data covered by the embedding. Our current implementation uses a simple energy function similar to typical deformable template matching costs. For each triangle $t$, a deformation cost measures how far the corresponding affine map $f_t$ is from a similarity transformation (this makes our shape models invariant to translations, rotations and uniform scalings). A data cost attracts the boundary of the embedded polygon to image locations with high gradient magnitude. More details are given in Section 3.1.

While the implementation described here uses a fairly simple energy function, the formulation can handle richer concepts. For example, the deformation costs could be tuned for individual triangles, taking into account that different parts of the shape may be more flexible than others. In fact, we have developed a method to learn deformation models from training data (this will be described in a longer report). Also, the data costs could take into account the whole area covered by the embedded polygon. For example, if we have a grayscale template associated with a shape we can use the correlation between the deformed grayscale template and the image to obtain a data cost.

3.1 Energy function

In our framework, each triangle in a shape is mapped to the image plane using an affine transformation. In matrix form, we can write the affine transformation as $h(x) = Ax + a$. We restrict our attention to transformations which preserve orientation ($\det(A) > 0$). This ensures that the corresponding embedding $f$ is locally one-to-one. Let $\alpha$ and $\beta$ be the singular values of $A$. The transformation $h$ takes a unit circle to an ellipse with major and minor axes of length $\alpha$ and $\beta$. The value $\log(\alpha/\beta)$ is called the log-anisotropy of $h$ and is commonly used as a measure of how far $h$ is from a similarity transform (see [15]). We use the log-anisotropy measure to assign a deformation cost for each affine map (and let the cost be infinity if the affine map is not orientation preserving). The deformation costs are combined with a data cost that attracts the shape boundary to locations in the image that have high gradient magnitude,

$$E(g, I) = \sum_{t \in T} \log^2(\alpha_t / \beta_t) - \lambda \int_{\partial P} \frac{\|(\nabla I \circ f)(s) \times f'(s)\|}{\|f'(s)\|} \, ds,$$

where $\log(\alpha_t / \beta_t)$ is the log-anisotropy of $f_t$. The term $\|(\nabla I \circ f)(s) \times f'(s)\|$ is the component of the image gradient that is perpendicular to the shape boundary at $f(s)$. We divide the gradient term in the integral by $\|f'(s)\|$ to make the energy scale invariant. Note that the integral can be broken up into an integral for each edge in the polygon. This allows us to write the energy function in the form of equation (1), where the cost for each triangle will be the deformation cost plus one integral term for each boundary edge that belongs to the triangle.
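The deformation term can be made concrete for a single triangle. The sketch below is an illustration under stated assumptions, not the paper's implementation: the function names `affine_matrix` and `deformation_cost` are hypothetical, triangles are given as three `(x, y)` tuples, and the singular values of the $2 \times 2$ matrix $A$ are obtained in closed form from the identities $\alpha^2 + \beta^2 = \|A\|_F^2$ and $\alpha \beta = |\det A|$.

```python
import math

def affine_matrix(src, dst):
    """Linear part A = (a, b, c, d) of the affine map h(x) = Ax + t taking
    the model triangle src to the embedded triangle dst (row-major order)."""
    (s0, s1, s2), (d0, d1, d2) = src, dst
    s1x, s1y = s1[0] - s0[0], s1[1] - s0[1]
    s2x, s2y = s2[0] - s0[0], s2[1] - s0[1]
    d1x, d1y = d1[0] - d0[0], d1[1] - d0[1]
    d2x, d2y = d2[0] - d0[0], d2[1] - d0[1]
    det_s = s1x * s2y - s2x * s1y
    if det_s == 0:
        raise ValueError("model triangle is degenerate")
    # A = D S^{-1}, where the columns of S and D are the triangle edge vectors.
    return ((d1x * s2y - d2x * s1y) / det_s, (d2x * s1x - d1x * s2x) / det_s,
            (d1y * s2y - d2y * s1y) / det_s, (d2y * s1x - d1y * s2x) / det_s)

def deformation_cost(src, dst):
    """Squared log-anisotropy log^2(alpha / beta); infinite if the map does
    not preserve orientation."""
    a, b, c, d = affine_matrix(src, dst)
    det = a * d - b * c
    if det <= 0:
        return math.inf
    e = a * a + b * b + c * c + d * d              # alpha^2 + beta^2
    r = math.sqrt(max(e * e - 4 * det * det, 0.0))  # alpha^2 - beta^2
    if e - r <= 0:                                  # numerically singular map
        return math.inf
    return (0.5 * math.log((e + r) / (e - r))) ** 2
```

A rotated, scaled and translated copy of a triangle has cost zero, stretching one axis by a factor of two gives $\log^2 2$, and a mirrored triangle has infinite cost.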
4 Algorithm

As discussed in the last section, the matching problem is to find a map $g : V \to \mathbb{R}^2$ with lowest possible energy. The only approximation we make is to consider a discrete set of possible locations for each vertex. Let $G \subset \mathbb{R}^2$ be a grid of locations in the image. In the discrete setting $g$ maps each vertex $v_i$ to a location $l_i \in G$. The form of the energy function in equation (1) is quite general, and we depend on the structure of $T$ to be able to find an optimal $g$ efficiently.

As noted in Section 2, there is a nice order of elimination for the vertices and triangles of $T$. The order is such that when eliminating the $i$-th vertex, it is in exactly one triangle of the current triangulated polygon. The matching algorithm works by sequentially eliminating the vertices of $T$, using the nice elimination order. This is an instance of a well known dynamic programming technique (see [3]). After eliminating $v_1, \ldots, v_{i-1}$, vertex $v_i$ is in exactly one triangle, say with nodes $v_j$ and $v_k$. The two nodes $v_j$ and $v_k$ are the parents of $v_i$, which we indicate by letting $p[i].a = j$ and $p[i].b = k$. We compute the cost of the best placement for $v_i$ as a function of the locations for $v_j$ and $v_k$. This cost is stored in $V[j, k](l_j, l_k)$. When we get to the last two vertices we can solve for their best location and trace back to find the best location of the other vertices, as is typical in dynamic programming.

Algorithm Match($I$)
( Find the best embedding of a shape in an image )
1. for $i = 1$ to $n - 2$
2.     ( Eliminate the $i$-th vertex )
3.     $j \leftarrow p[i].a$
4.     $k \leftarrow p[i].b$
5.     for each pair of locations $l_j$ and $l_k$ in $G$
6.         $V[j, k](l_j, l_k) \leftarrow \min_{l_i \in G} c_{ijk}(l_i, l_j, l_k, I) + V[i, j](l_i, l_j) + V[i, k](l_i, l_k)$
7. Pick $l_{n-1}$ and $l_n$ minimizing $V[n-1, n]$ and trace back to obtain the other optimal locations.

This algorithm runs in $O(nm^3)$ time and uses $O(nm^2)$ space, where $n$ is the number of vertices in the polygon and $m$ is the number of possible locations for each vertex.
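The elimination scheme can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions rather than the author's implementation: the names `elimination_order`, `edge_table` and `match` are hypothetical, vertices are integer indices, `grid` is a small list of candidate locations, and `cost` is any user-supplied function taking a dictionary `{vertex: location}` for one triangle. One deliberate difference from line 6 of the algorithm: the sketch adds any cost already stored on edge $(j, k)$ instead of overwriting it, so that it stays correct for any valid elimination order.

```python
from itertools import product

def elimination_order(triangles):
    """Return (i, j, k) triples: vertex i is removed from triangle (i, j, k),
    where i lies in no other remaining triangle and j < k are its parents."""
    remaining = [tuple(t) for t in triangles]
    order = []
    while remaining:
        count = {}
        for t in remaining:
            for v in t:
                count[v] = count.get(v, 0) + 1
        for t in remaining:
            free = [v for v in t if count[v] == 1]
            if free:
                i = free[0]
                j, k = sorted(v for v in t if v != i)
                order.append((i, j, k))
                remaining.remove(t)
                break
        else:
            raise ValueError("input is not a triangulated simple polygon")
    return order

def edge_table(V, a, la, b, lb):
    """Cost accumulated so far on edge (a, b); zero if nothing was stored."""
    if a > b:
        a, la, b, lb = b, lb, a, la
    table = V.get((a, b))
    return 0.0 if table is None else table[(la, lb)]

def match(triangles, grid, cost):
    """Minimise the sum of cost({vertex: location}) over all triangles.
    Returns (minimum energy, {vertex: location})."""
    V, back = {}, []
    for i, j, k in elimination_order(triangles):
        table, argmin = {}, {}
        for lj, lk in product(grid, repeat=2):
            best, best_li = float("inf"), None
            for li in grid:
                c = (cost({i: li, j: lj, k: lk})
                     + edge_table(V, i, li, j, lj)
                     + edge_table(V, i, li, k, lk))
                if c < best:
                    best, best_li = c, li
            # Accumulate onto (j, k) rather than overwrite (sketch-specific choice).
            table[(lj, lk)] = best + edge_table(V, j, lj, k, lk)
            argmin[(lj, lk)] = best_li
        V[(j, k)] = table
        back.append((i, j, k, argmin))
    # Solve for the last two vertices, then trace back through the eliminations.
    _, j, k, _ = back[-1]
    (lj, lk), energy = min(V[(j, k)].items(), key=lambda kv: kv[1])
    loc = {j: lj, k: lk}
    for i, j, k, argmin in reversed(back):
        loc[i] = argmin[(loc[j], loc[k])]
    return energy, loc
```

Each elimination scans all triples of grid locations, which matches the $O(nm^3)$ time bound; the dictionaries keyed by location pairs correspond to the $O(nm^2)$ tables.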
In practice we can speed up the algorithm by noting that given positions $l_j$ and $l_k$ for the parents of the $i$-th vertex there is a unique similarity transformation taking $v_j$ and $v_k$ to the respective locations. This similarity transformation defines an ideal location for $v_i$. We only need to consider locations for $v_i$ that are near this ideal location, because locations that are far introduce too much deformation in the model. With this optimization the running time of the algorithm is essentially $O(nm^2)$.

Note that in line 7 of the matching algorithm each entry in $V[n-1, n]$ corresponds to the quality of an optimal embedding for the deformable shape given particular locations for $v_{n-1}$ and $v_n$. We can detect multiple instances of a shape in a single image by finding peaks in $V[n-1, n]$. We simply trace back from each peak that has a value above some given fixed threshold.

5 Experiments

We present experimental results of our matching algorithm on both medical and natural images. In each case we used a binary picture of the target object to build a model. First we computed a polygonal approximation of the example shape, and then the Delaunay triangulation of the resulting polygon. For the matching results shown here we used a grid of possible locations in the image for the vertices of the models. Our matching algorithm took approximately five minutes in each image when running on a standard workstation.

In Figure 3 we show a model for the corpus callosum generated from a manually segmented shape. The best match of the model to several MR images is shown in Figure 4. Note how these images have very low contrast, and the shape of the corpus callosum varies quite a bit. The quality of our results is similar to the quality of results obtained using the best available methods for model based segmentation of medical images (such as [12]). The main advantage of our method is that it does not require any initialization.
Figure 5 shows a model for maple leaves, constructed from a binary silhouette. The best match of the model to a few images is shown in Figure 6. The leaves in each image are different, and the viewing direction varies. Note how our method can handle the variation in shape even in the presence of occlusion and clutter. In particular, the last image shows how we can hallucinate the location of a large occluded part of the leaf. Techniques that rely on local search to find non-rigid transformations tend to fail on cluttered images because they get stuck on local optimum solutions.

To check the performance of our algorithm on difficult inputs we corrupted one of the leaf images with random Gaussian noise. Figure 7 shows the corrupted images with increasing amounts of noise (corresponding to $\sigma$ = 50, 150 and 250) and the matching results for each input. We can identify the approximate location of the leaf even when it is barely visible.

In Figure 8 we demonstrate how our matching algorithm can be used to detect multiple instances of an object in an image. As discussed in the last section we simply selected peaks in $V[n-1, n]$ with value above a pre-determined threshold to generate each detection.

Figure 3: A model for the corpus callosum generated from a binary picture.

Figure 4: Matching the corpus callosum model to different images.

Figure 5: A model for maple leaves generated from a binary picture.

Figure 6: Matching the leaf model to different cluttered images. Each image is of a different maple leaf.

Figure 7: Matching the leaf model to an image corrupted by increasing amounts of Gaussian noise.

Figure 8: Detection of multiple leaves in one image.

6 Conclusion

We have described a new representation for deformable shapes based on triangulated polygons. Our models are invariant to translation, rotation and scale and can be non-rigidly deformed in an intuitive way. The problem of matching a shape to an image was defined in terms of an energy function to be minimized, as is typical for deformable template models. We concentrated on a particular class of energy functions that can be used to represent both the boundary and the internal structure of a deformable object. In contrast to previous work we can efficiently find a global optimal solution to the matching problem. This allows us to detect deformable objects without any kind of initialization, even in the presence of occlusion and background clutter.

Acknowledgments

We would like to thank Eric Grimson and Dan Huttenlocher for their comments and support. This work was supported in part by NSF-ITR award # and DARPA contract N

References

[1] Y. Amit and A. Kong. Graphical templates for model registration. IEEE PAMI, 18(3).
[2] R. Basri, L. Costa, D. Geiger, and D. Jacobs. Determining the similarity of deformable shapes. Vision Research, 38.
[3] U. Bertele and F. Brioschi. Nonserial Dynamic Programming. Academic Press.
[4] H. Blum. Biological shape and visual science. Theoretical Biology, 38.
[5] T. Cootes, C. Taylor, D. Cooper, and J. Graham. Active shape models: Their training and application. CVIU, 61(1):38-59.
[6] J. Coughlan, A. Yuille, C. English, and D. Snow. Efficient deformable template detection and localization without user initialization. CVIU, 78(3).
[7] M. de Berg, M. van Kreveld, M. Overmars, and O. Schwarzkopf. Computational Geometry. Springer-Verlag.
[8] M. Fischler and R. Elschlager. The representation and matching of pictorial structures. IEEE TC, 22(1):67-92.
[9] M. Golumbic. Algorithmic Graph Theory and Perfect Graphs. Academic Press.
[10] U. Grenander. Elements of Pattern Theory. Johns Hopkins University Press.
[11] A. Jain, Y. Zhong, and S. Lakshmanan. Object matching using deformable templates. IEEE PAMI, 18(3).
[12] M. Leventon, W. Grimson, and O. Faugeras. Statistical shape influence in geodesic active contours. In IEEE CVPR.
[13] L. Prasad. Morphological analysis of shapes. CNLS Newsletter 139, Center for Nonlinear Studies, Los Alamos National Laboratory, July.
[14] T. Sebastian, P. Klein, and B. Kimia. Recognition of shapes by editing shock graphs. In IEEE ICCV.
[15] C. Small. The Statistical Theory of Shapes. Springer-Verlag.
[16] B. Widrow. The rubber mask technique. PR, 5(3), 1973.


More information

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision What Happened Last Time? Human 3D perception (3D cinema) Computational stereo Intuitive explanation of what is meant by disparity Stereo matching

More information

Segmentation with non-linear constraints on appearance, complexity, and geometry

Segmentation with non-linear constraints on appearance, complexity, and geometry IPAM February 2013 Western Univesity Segmentation with non-linear constraints on appearance, complexity, and geometry Yuri Boykov Andrew Delong Lena Gorelick Hossam Isack Anton Osokin Frank Schmidt Olga

More information

Segmentation and Grouping

Segmentation and Grouping Segmentation and Grouping How and what do we see? Fundamental Problems ' Focus of attention, or grouping ' What subsets of pixels do we consider as possible objects? ' All connected subsets? ' Representation

More information

Lecture 2 September 3

Lecture 2 September 3 EE 381V: Large Scale Optimization Fall 2012 Lecture 2 September 3 Lecturer: Caramanis & Sanghavi Scribe: Hongbo Si, Qiaoyang Ye 2.1 Overview of the last Lecture The focus of the last lecture was to give

More information

Image retrieval based on region shape similarity

Image retrieval based on region shape similarity Image retrieval based on region shape similarity Cheng Chang Liu Wenyin Hongjiang Zhang Microsoft Research China, 49 Zhichun Road, Beijing 8, China {wyliu, hjzhang} ABSTRACT This paper presents

More information

Human pose estimation using Active Shape Models

Human pose estimation using Active Shape Models Human pose estimation using Active Shape Models Changhyuk Jang and Keechul Jung Abstract Human pose estimation can be executed using Active Shape Models. The existing techniques for applying to human-body

More information

Using Semi-Regular 4 8 Meshes for Subdivision Surfaces

Using Semi-Regular 4 8 Meshes for Subdivision Surfaces Using Semi-Regular 8 Meshes for Subdivision Surfaces Luiz Velho IMPA Instituto de Matemática Pura e Aplicada Abstract. Semi-regular 8 meshes are refinable triangulated quadrangulations. They provide a

More information


LATEST TRENDS on APPLIED MATHEMATICS, SIMULATION, MODELLING 3D surface reconstruction of objects by using stereoscopic viewing Baki Koyuncu, Kurtuluş Küllü Computer Engineering Department, Ankara University, Ankara,

More information

Stereo Vision. MAN-522 Computer Vision

Stereo Vision. MAN-522 Computer Vision Stereo Vision MAN-522 Computer Vision What is the goal of stereo vision? The recovery of the 3D structure of a scene using two or more images of the 3D scene, each acquired from a different viewpoint in

More information

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006, School of Computer Science and Communication, KTH Danica Kragic EXAM SOLUTIONS Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006, 14.00 19.00 Grade table 0-25 U 26-35 3 36-45

More information


STATISTICS AND ANALYSIS OF SHAPE Control and Cybernetics vol. 36 (2007) No. 2 Book review: STATISTICS AND ANALYSIS OF SHAPE by H. Krim, A. Yezzi, Jr., eds. There are numerous definitions of a notion of shape of an object. These definitions

More information

Shape Classification Using Regional Descriptors and Tangent Function

Shape Classification Using Regional Descriptors and Tangent Function Shape Classification Using Regional Descriptors and Tangent Function Meetal Kalantri Rahul Dhuture Amit Fulsunge Abstract In this paper three novel hybrid regional descriptor

More information

Chapter 9 Object Tracking an Overview

Chapter 9 Object Tracking an Overview Chapter 9 Object Tracking an Overview The output of the background subtraction algorithm, described in the previous chapter, is a classification (segmentation) of pixels into foreground pixels (those belonging

More information

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar. Matching Compare region of image to region of image. We talked about this for stereo. Important for motion. Epipolar constraint unknown. But motion small. Recognition Find object in image. Recognize object.

More information

Lecture 9: Hough Transform and Thresholding base Segmentation

Lecture 9: Hough Transform and Thresholding base Segmentation #1 Lecture 9: Hough Transform and Thresholding base Segmentation Saad Bedros Hough Transform Robust method to find a shape in an image Shape can be described in parametric form A voting

More information

Exploring Curve Fitting for Fingers in Egocentric Images

Exploring Curve Fitting for Fingers in Egocentric Images Exploring Curve Fitting for Fingers in Egocentric Images Akanksha Saran Robotics Institute, Carnegie Mellon University 16-811: Math Fundamentals for Robotics Final Project Report Email:

More information

Lecture 9 Fitting and Matching

Lecture 9 Fitting and Matching Lecture 9 Fitting and Matching Problem formulation Least square methods RANSAC Hough transforms Multi- model fitting Fitting helps matching! Reading: [HZ] Chapter: 4 Estimation 2D projective transformation

More information

Practical Image and Video Processing Using MATLAB

Practical Image and Video Processing Using MATLAB Practical Image and Video Processing Using MATLAB Chapter 14 Edge detection What will we learn? What is edge detection and why is it so important to computer vision? What are the main edge detection techniques

More information

Implicit Active Shape Models for 3D Segmentation in MR Imaging

Implicit Active Shape Models for 3D Segmentation in MR Imaging Implicit Active Shape Models for 3D Segmentation in MR Imaging Mikaël Rousson 1, Nikos Paragios 2, and Rachid Deriche 1 1 I.N.R.I.A. Sophia Antipolis, France E-mail: {Mikael.Rousson,Rachid.Deriche}

More information

Algebraic Geometry of Segmentation and Tracking

Algebraic Geometry of Segmentation and Tracking Ma191b Winter 2017 Geometry of Neuroscience Geometry of lines in 3-space and Segmentation and Tracking This lecture is based on the papers: Reference: Marco Pellegrini, Ray shooting and lines in space.

More information

Morphological Image Processing

Morphological Image Processing Morphological Image Processing Binary image processing In binary images, we conventionally take background as black (0) and foreground objects as white (1 or 255) Morphology Figure 4.1 objects on a conveyor

More information

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava 3D Computer Vision Dense 3D Reconstruction II Prof. Didier Stricker Christiano Gava Kaiserlautern University DFKI Deutsches Forschungszentrum für Künstliche Intelligenz

More information

Normalized cuts and image segmentation

Normalized cuts and image segmentation Normalized cuts and image segmentation Department of EE University of Washington Yeping Su Xiaodan Song Normalized Cuts and Image Segmentation, IEEE Trans. PAMI, August 2000 5/20/2003 1 Outline 1. Image

More information


EE 584 MACHINE VISION EE 584 MACHINE VISION Binary Images Analysis Geometrical & Topological Properties Connectedness Binary Algorithms Morphology Binary Images Binary (two-valued; black/white) images gives better efficiency

More information

Image Coding with Active Appearance Models

Image Coding with Active Appearance Models Image Coding with Active Appearance Models Simon Baker, Iain Matthews, and Jeff Schneider CMU-RI-TR-03-13 The Robotics Institute Carnegie Mellon University Abstract Image coding is the task of representing

More information

Fitting: The Hough transform

Fitting: The Hough transform Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data

More information

Multi-Scale Free-Form Surface Description

Multi-Scale Free-Form Surface Description Multi-Scale Free-Form Surface Description Farzin Mokhtarian, Nasser Khalili and Peter Yuen Centre for Vision Speech and Signal Processing Dept. of Electronic and Electrical Engineering University of Surrey,

More information



More information

Graphs: Introduction. Ali Shokoufandeh, Department of Computer Science, Drexel University

Graphs: Introduction. Ali Shokoufandeh, Department of Computer Science, Drexel University Graphs: Introduction Ali Shokoufandeh, Department of Computer Science, Drexel University Overview of this talk Introduction: Notations and Definitions Graphs and Modeling Algorithmic Graph Theory and Combinatorial

More information

Automatic Logo Detection and Removal

Automatic Logo Detection and Removal Automatic Logo Detection and Removal Miriam Cha, Pooya Khorrami and Matthew Wagner Electrical and Computer Engineering Carnegie Mellon University Pittsburgh, PA 15213 {mcha,pkhorrami,mwagner}

More information

Broad field that includes low-level operations as well as complex high-level algorithms

Broad field that includes low-level operations as well as complex high-level algorithms Image processing About Broad field that includes low-level operations as well as complex high-level algorithms Low-level image processing Computer vision Computational photography Several procedures and

More information

Practical Linear Algebra: A Geometry Toolbox

Practical Linear Algebra: A Geometry Toolbox Practical Linear Algebra: A Geometry Toolbox Third edition Chapter 17: Breaking It Up: Triangles Gerald Farin & Dianne Hansford CRC Press, Taylor & Francis Group, An A K Peters Book

More information

Fitting: The Hough transform

Fitting: The Hough transform Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data

More information

Visibility: Finding the Staircase Kernel in Orthogonal Polygons

Visibility: Finding the Staircase Kernel in Orthogonal Polygons Visibility: Finding the Staircase Kernel in Orthogonal Polygons 8 Visibility: Finding the Staircase Kernel in Orthogonal Polygons Tzvetalin S. Vassilev, Nipissing University, Canada Stefan Pape, Nipissing

More information

3D Pose Estimation of Cactus Leaves using an Active Shape Model

3D Pose Estimation of Cactus Leaves using an Active Shape Model 3D Pose Estimation of Cactus Leaves using an Active Shape Model Thomas B. Moeslund, Michael Aagaard, Dennis Lerche Laboratory of Computer Vision and Media Technology Aalborg University, Denmark E-mail:

More information


DEFORMABLE MATCHING OF HAND SHAPES FOR USER VERIFICATION. Ani1 K. Jain and Nicolae Duta DEFORMABLE MATCHING OF HAND SHAPES FOR USER VERIFICATION Ani1 K. Jain and Nicolae Duta Department of Computer Science and Engineering Michigan State University, East Lansing, MI 48824-1026, USA E-mail:

More information

Robotics Programming Laboratory

Robotics Programming Laboratory Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception car

More information

Shape Analysis. Erkut Erdem. BIL717, April 2012!

Shape Analysis. Erkut Erdem. BIL717, April 2012! Shape Analysis Erkut Erdem BIL717, April 2012! Introduction Shape is the primary source of visual information Objects can be immediately recognized and classified based on their shapes Other visual clues

More information

What have we leaned so far?

What have we leaned so far? What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging What have we learned so far? Image Filtering Image Warping Camera Projection Model Project 2: Panoramic

More information

Raghuraman Gopalan Center for Automation Research University of Maryland, College Park

Raghuraman Gopalan Center for Automation Research University of Maryland, College Park 2D Shape Matching (and Object Recognition) Raghuraman Gopalan Center for Automation Research University of Maryland, College Park 1 Outline What is a shape? Part 1: Matching/ Recognition Shape contexts

More information

Local qualitative shape from stereo. without detailed correspondence. Extended Abstract. Shimon Edelman. Internet:

Local qualitative shape from stereo. without detailed correspondence. Extended Abstract. Shimon Edelman. Internet: Local qualitative shape from stereo without detailed correspondence Extended Abstract Shimon Edelman Center for Biological Information Processing MIT E25-201, Cambridge MA 02139 Internet:

More information

274 Curves on Surfaces, Lecture 5

274 Curves on Surfaces, Lecture 5 274 Curves on Surfaces, Lecture 5 Dylan Thurston Notes by Qiaochu Yuan Fall 2012 5 Ideal polygons Previously we discussed three models of the hyperbolic plane: the Poincaré disk, the upper half-plane,

More information

Using Subspace Constraints to Improve Feature Tracking Presented by Bryan Poling. Based on work by Bryan Poling, Gilad Lerman, and Arthur Szlam

Using Subspace Constraints to Improve Feature Tracking Presented by Bryan Poling. Based on work by Bryan Poling, Gilad Lerman, and Arthur Szlam Presented by Based on work by, Gilad Lerman, and Arthur Szlam What is Tracking? Broad Definition Tracking, or Object tracking, is a general term for following some thing through multiple frames of a video

More information

A Statistical Consistency Check for the Space Carving Algorithm.

A Statistical Consistency Check for the Space Carving Algorithm. A Statistical Consistency Check for the Space Carving Algorithm. A. Broadhurst and R. Cipolla Dept. of Engineering, Univ. of Cambridge, Cambridge, CB2 1PZ aeb29 cipolla Abstract This paper

More information

Local Image preprocessing (cont d)

Local Image preprocessing (cont d) Local Image preprocessing (cont d) 1 Outline - Edge detectors - Corner detectors - Reading: textbook 5.3.1-5.3.5 and 5.3.10 2 What are edges? Edges correspond to relevant features in the image. An edge

More information


COMPUTER VISION > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE COMPUTER VISION 2017-2018 > OPTICAL FLOW UTRECHT UNIVERSITY RONALD POPPE OUTLINE Optical flow Lucas-Kanade Horn-Schunck Applications of optical flow Optical flow tracking Histograms of oriented flow Assignment

More information

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight

coding of various parts showing different features, the possibility of rotation or of hiding covering parts of the object's surface to gain an insight Three-Dimensional Object Reconstruction from Layered Spatial Data Michael Dangl and Robert Sablatnig Vienna University of Technology, Institute of Computer Aided Automation, Pattern Recognition and Image

More information

Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection

Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854, USA Part of the slides

More information

THE description and representation of the shape of an object

THE description and representation of the shape of an object Enhancement of Shape Description and Representation by Slope Ali Salem Bin Samma and Rosalina Abdul Salam Abstract Representation and description of object shapes by the slopes of their contours or borders

More information

Lecture 16: Object recognition: Part-based generative models

Lecture 16: Object recognition: Part-based generative models Lecture 16: Object recognition: Part-based generative models Professor Stanford Vision Lab 1 What we will learn today? Introduction Constellation model Weakly supervised training One-shot learning (Problem

More information

An Adaptive Eigenshape Model

An Adaptive Eigenshape Model An Adaptive Eigenshape Model Adam Baumberg and David Hogg School of Computer Studies University of Leeds, Leeds LS2 9JT, U.K. Abstract There has been a great deal of recent interest

More information

Sparse Shape Registration for Occluded Facial Feature Localization

Sparse Shape Registration for Occluded Facial Feature Localization Shape Registration for Occluded Facial Feature Localization Fei Yang, Junzhou Huang and Dimitris Metaxas Abstract This paper proposes a sparsity driven shape registration method for occluded facial feature

More information

Globally Stabilized 3L Curve Fitting

Globally Stabilized 3L Curve Fitting Globally Stabilized 3L Curve Fitting Turker Sahin and Mustafa Unel Department of Computer Engineering, Gebze Institute of Technology Cayirova Campus 44 Gebze/Kocaeli Turkey {htsahin,munel}

More information

4 Integer Linear Programming (ILP)

4 Integer Linear Programming (ILP) TDA6/DIT37 DISCRETE OPTIMIZATION 17 PERIOD 3 WEEK III 4 Integer Linear Programg (ILP) 14 An integer linear program, ILP for short, has the same form as a linear program (LP). The only difference is that

More information


HUMAN COMPUTER INTERFACE BASED ON HAND TRACKING Proceedings of MUSME 2011, the International Symposium on Multibody Systems and Mechatronics Valencia, Spain, 25-28 October 2011 HUMAN COMPUTER INTERFACE BASED ON HAND TRACKING Pedro Achanccaray, Cristian

More information

Subspace Clustering with Global Dimension Minimization And Application to Motion Segmentation

Subspace Clustering with Global Dimension Minimization And Application to Motion Segmentation Subspace Clustering with Global Dimension Minimization And Application to Motion Segmentation Bryan Poling University of Minnesota Joint work with Gilad Lerman University of Minnesota The Problem of Subspace

More information

Topic 6 Representation and Description

Topic 6 Representation and Description Topic 6 Representation and Description Background Segmentation divides the image into regions Each region should be represented and described in a form suitable for further processing/decision-making Representation

More information

Three applications of Euler s formula. Chapter 10

Three applications of Euler s formula. Chapter 10 Three applications of Euler s formula Chapter 10 A graph is planar if it can be drawn in the plane R without crossing edges (or, equivalently, on the -dimensional sphere S ). We talk of a plane graph if

More information

Combinatorial optimization and its applications in image Processing. Filip Malmberg

Combinatorial optimization and its applications in image Processing. Filip Malmberg Combinatorial optimization and its applications in image Processing Filip Malmberg Part 1: Optimization in image processing Optimization in image processing Many image processing problems can be formulated

More information