Graph Based Image Segmentation

Similar documents
Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

CS 534: Computer Vision Segmentation II Graph Cuts and Image Segmentation

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection.

CS 534: Computer Vision Segmentation and Perceptual Grouping

Supervised texture detection in images

Iterative MAP and ML Estimations for Image Segmentation

Graphs, graph algorithms (for image segmentation),... in progress

Image Segmentation for Image Object Extraction

Maximum flows and minimal cuts. Filip Malmberg

CS4670 / 5670: Computer Vision Noah Snavely

CS 5540 Spring 2013 Assignment 3, v1.0 Due: Apr. 24th 11:59PM

STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES

CAP5415-Computer Vision Lecture 13-Image/Video Segmentation Part II. Dr. Ulas Bagci

Segmentation and Grouping

MULTI-REGION SEGMENTATION

PAPER Graph Cuts Segmentation by Using Local Texture Features of Multiresolution Analysis

Minimum Cost Edge Disjoint Paths

Interactive Differential Segmentation of the Prostate using Graph-Cuts with a Feature Detector-based Boundary Term

Targil 12 : Image Segmentation. Image segmentation. Why do we need it? Image segmentation

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Image segmentation via graph cuts with connectivity priors

Lecture 11: Maximum flow and minimum cut

Mathematical and Algorithmic Foundations Linear Programming and Matchings

Discriminative Clustering for Image Co-segmentation

A Connection between Network Coding and. Convolutional Codes

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

Lecture 11: E-M and MeanShift. CAP 5415 Fall 2007

Multi-view object segmentation in space and time. Abdelaziz Djelouah, Jean Sebastien Franco, Edmond Boyer

Big Data Analytics. Special Topics for Computer Science CSE CSE Feb 11

Design of Orthogonal Graph Wavelet Filter Banks

Fast Approximate Energy Minimization via Graph Cuts

HIGH RESOLUTION REMOTE SENSING IMAGE SEGMENTATION BASED ON GRAPH THEORY AND FRACTAL NET EVOLUTION APPROACH

Spatial-Color Pixel Classification by Spectral Clustering for Color Image Segmentation

CS6670: Computer Vision

Segmentation Computer Vision Spring 2018, Lecture 27

Graph Cuts vs. Level Sets. part I Basics of Graph Cuts

Image Segmentation continued Graph Based Methods. Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book

Image Segmentation continued Graph Based Methods

Graphs and Network Flows IE411. Lecture 21. Dr. Ted Ralphs

Announcements. Image Segmentation. From images to objects. Extracting objects. Status reports next Thursday ~5min presentations in class

Interactive Image Segmentation

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University

6. Lecture notes on matroid intersection

s-t Graph Cuts for Binary Energy Minimization

Contrast adjustment via Bayesian sequential partitioning

Bipartite Graph Partitioning and Content-based Image Clustering

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

USING ONE GRAPH-CUT TO FUSE MULTIPLE CANDIDATE MAPS IN DEPTH ESTIMATION

1. Lecture notes on bipartite matching February 4th,

Ensemble Combination for Solving the Parameter Selection Problem in Image Segmentation

Improving Image Segmentation Quality Via Graph Theory

Some Advanced Topics in Linear Programming

6.801/866. Segmentation and Line Fitting. T. Darrell

CS4670: Computer Vision

Data Analysis 3. Support Vector Machines. Jan Platoš October 30, 2017

arxiv: v2 [cs.dm] 28 Dec 2010

Discriminative Clustering for Image Co-segmentation

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker

3 No-Wait Job Shops with Variable Processing Times

Normalized Graph cuts. by Gopalkrishna Veni School of Computing University of Utah

Introduction to Machine Learning

Norbert Schuff VA Medical Center and UCSF

Flows on surface embedded graphs. Erin Chambers

Computer Vision 5 Segmentation by Clustering

Vertex-Colouring Edge-Weightings

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision

On Structural Parameterizations of the Matching Cut Problem

Theorem 2.9: nearest addition algorithm

GRAPH BASED SEGMENTATION WITH MINIMAL USER INTERACTION. Huaizhong Zhang, Ehab Essa, and Xianghua Xie

Practical Level Planarity Testing and Layout with Embedding Constraints

Image Analysis - Lecture 5

Graph-Based Superpixel Labeling for Enhancement of Online Video Segmentation

General properties of staircase and convex dual feasible functions

Lecture 7: Segmentation. Thursday, Sept 20

Robotics Programming Laboratory

Topic 4 Image Segmentation

COT 6936: Topics in Algorithms! Giri Narasimhan. ECS 254A / EC 2443; Phone: x3748

GRMA: Generalized Range Move Algorithms for the Efficient Optimization of MRFs

Clustering. SC4/SM4 Data Mining and Machine Learning, Hilary Term 2017 Dino Sejdinovic

Experiments with Edge Detection using One-dimensional Surface Fitting

1. Lecture notes on bipartite matching

Combining Top-down and Bottom-up Segmentation

A Graph Theoretic Approach to Image Database Retrieval

BAYESIAN HIGH PRIORITY REGION GROWING FOR CHANGE DETECTION

Paths, Flowers and Vertex Cover

Prof. Feng Liu. Spring /17/2017. With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C.

Parallel Algorithm Design. Parallel Algorithm Design p. 1

Lazy Snapping. A paper from Siggraph04 by Yin Li, Jian Sun, Chi-KeungTang, Heung-Yeung Shum, Microsoft Research Asia. Presented by Gerhard Röthlin

CHAPTER 6 IDENTIFICATION OF CLUSTERS USING VISUAL VALIDATION VAT ALGORITHM

CS 664 Slides #11 Image Segmentation. Prof. Dan Huttenlocher Fall 2003

Multiple cosegmentation

Research on Recognition and Classification of Moving Objects in Mixed Traffic Based on Video Detection

Pairwise Clustering and Graphical Models

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu. Lectures 21 & 22 Segmentation and clustering

Segmentation (continued)

On vertex-coloring edge-weighting of graphs

Digital Image Procesing

Transcription:

AUTOMATYKA 2011 Tom 15 Zeszyt 3 Anna Fabijañska* Graph Based Image Segmentation 1. Introduction Image segmentation is one of the fundamental problems in machine vision. In general it aims at extracting objects of interest from background. Because this step is crucial for almost every application of vision systems, problem of image segmentation has been widely studied over the years and a very large number of segmentation techniques have already been proposed. Recent research on image segmentation has seen an increasing interest in graph based techniques which extract objects from background by partitioning graphs into subgraphs. The most representative methods for graph based segmentation are [1]: spectral graph partitioning and combinatorial graph cuts. The first group of methods use the eigenvectors of the graph Laplacian (or its variants) to partition the graph [2]. Whereas combinatorial graph cuts try to define subgraphs by solving min-cut/max-flow problem [3, 4]. Spectral graph partitioning methods are of NP-complexity therefore they often apply some relaxing techniques to find an approximate cut of a graph. These methods however will not be regarded in this paper. For further details the reader is referred to [1, 2]. The main attention of this paper is focused on these image segmentation methods which use combinatorial graph cuts. Specifically, the efficient method of min-cut/max-flow segmentation proposed by Boykov and Jolly in [3] is investigated. The method divides image into subregions by computing a global optimum among all segmentations satisfying some hard constrains imposed for object and background. The optimum is computed in term of minimization of energy which incorporates some regional and boundary conditions. Although the considered method raised an interest and popularity it has not been fully investigated yet. Incorporating different conditions into energy being minimized leads to different results. Therefore this paper considers what energy kind of can be minimized by graph cuts and how incorporated regional and boundary terms influence properties of segmented image. * Technical University of Lodz, Lodz, Poland 93

94 Anna Fabijañska This following part of this paper is organized as follows. Firstly in Section 2 some basic background on graph theory is given. Next, in Section 3 graph representation of an image is explained. In Section 4 the idea of graph based segmentation is explained. The glance at min-cut/max-flow image segmentation is given in Section 5. The results of image segmentation obtained for various images using different energy functions are presented and discussed in Section 6. Finally, Section 7 concludes the paper. 2. Background on graph theory Graph is a data structure consisting of set of vertices (or nodes) and set of edges connecting the vertices. In more formal sense graph G is often defined as an ordered pair G = (V, E) comprising set V of vertices together with set E of edges. Each edge e ij relates two vertices and is described by a set {v i, v j } of two corresponding elements from set V. If pairs of vertices {v i, v j } are unordered edges have no orientation the graph is called undirected or simple. Additionally, when every node is connected with all other nodes the graph is called fully connected [5]. Exemplary undirected graph is presented in Figure 1a. Set of vertices and set of edges corresponding with the graph equal respectively V = {A, B, C, D, E, F} and E = {{A,B}, {A,C}, {A,D}, {B,C}, {B,E}, {D,E}, {D,F}}. a) b) Fig. 1. Exemplary graphs; (a) simple, undirected graph; (b) weighted undirected graph When weight w ij is associated with every edge e ij, the graph is called a weighted graph. Weights are usually real numbers which express some property of connection between the nodes. Exemplary weighted graph is presented in Figure 1b. A weighted directed graph with two terminal nodes S (source) and T (sink) can be considered as a flow network with edge capacities determined by its weights. An edge can receive flow not exceeding its capacity. A cut (S, T) of a directed graph is a set of edges C@E such that the two terminals become separated on the induced graph G' = (V, E/C). Minimum cut (min-cut) is a cut of minimum total capacity. According to max-flow min-cut theorem the minimal cut is equal to maximum flow that can be passed from the source S to sink T. This concept is presented in Figure 2. The maximum flow which can be send from S to T range 7 and is equal to total capacity of minimal cut C = {{S,A},{S,B}}.

Graph Based Image Segmentation 95 Fig. 2. An idea of min-cut/max-flow theorem 3. Graph based representation of an image Graph based approaches to image segmentation consider image as a weighted graph where pixels are represented by nodes v i V and each edge e ij = {v i, v j } E has the corresponding nonnegative weight w ij. Weights describe similarity (or dissimilarity) between the neighboring pixels. Regarding graph based image representation explained above image segmentation is then simply partitioning V into two disjoint sets A and B where A B = V and A B = Ø. The partitioning is performed according to some criterion and aims at removing edges that connect parts A and B (see Fig. 3). a) b) c) Fig. 3. Idea of graph partitioning; (a) input graph G = (V, E); (b) edges to be removed are denoted by dashed line; (c) disjoint subgraphs G A = (A, E A ) and G B = (B, E B ) of graph G The main problems to be solved during graph-based image segmentation is how to choose edges to be removed in order to divide graph into pieces. Boykov and Jolly proposed method which determines these edges as minimum cut of a planar graph composed from consecutive pixels [3]. The method is briefly characterized in the next section. 4. Glance at min-cut/max-flow image segmentation 4.1. Graph construction In the regarded method an image is represented by a weighted undirected graph G = (V, E) where nodes correspond with pixels p of an image P. Additionally, in the graph there are

96 Anna Fabijañska also two special nodes: an object terminal S (called source) and a background terminal T (called sink). Therefore: V = P { S, T} (1) The set E consists of two types of undirected edges: n-links which connect neighboring pixels and t-links which connect pixels with terminals. Every pixel has up to four n-links to its (spatially) nearest neighbors and two t-links: {p, S} and {p,t} connecting it to source and sink respectively. Weights B {pq} assigned to n-links represent boundary term and describe similarity between the neighboring nodes p and q. Specifically, the higher the weight the higher similarity between pixels; values close to 0 correspond with very low similarities. Weights R p ( ) assigned to t-links represent regional term and define the individual penalties R p (obj) and R p (bkg) for assigning pixel p to object and background respectively. Exemplary graph obtained for a 3 3 image is presented in Figure 4. The weights of edges are given in Table 1. Symbols O and B used in Table 1 denote object and background respectively. Fig. 4. Exemplary graph obtained for a 3 3 image [3] Table 1 Weights for n-links and t-links Edge Weight For {p, q} B {pq} {p, q} N {p, S} {p, T} λ R p (bkg) p P, p O B K p O 0 p B λ R p (obj) p P, p O B 0 p O K p B

Graph Based Image Segmentation 97 Value of K is defined by following equation: K = 1+ max B p P { pq, } (2) q :{ p, q } and λ is a scaling factor indicating the relative importance of the regional term versus the boundary term. 4.2. Image segmentation Having the graph image representation as defined in the previous subsection the border between the object and the background (optimal in terms of properties build into the edge weights) is a minimum cost cut on the graph. The minimum cut is determined via minimization of energy function E given by Equation (3). EA ( ) =λ RA ( ) + BA ( ) (3) where A = (A 1, A 2,, A P ) is segmentation vector specifying the assignment of pixel p in P either to the background or to the object. Specifically, A i = 1 if p belongs to object; otherwise A i = 1. Additionally: RA ( ) = Rp( Ap) (4) p P BA ( ) = B{ pq, } δ( Ap, Aq) (5) { pq, } where: 1 δ ( Ap, Aq) = 0 A A p p A q = A q (6) According to min-cut/max-flow theorem segmentation vector A that minimizes energy function given by Equation (3) is equivalent to maximum flow that can be send via graph defined in Subection 4.1 from terminal S to terminal T. Border between the object and the background is indicated by edges that gets saturated when the maximal flow is send between S and T. To find maximal flow Boykov and Kolmogorov proposed new end efficient algorithm based on augmenting paths [4]. However, they didn t define clear and unequivocal criterions for assigning weights to t-links and n-links. In the next sections results of using different boundary and regional conditions are analyzed.

98 Anna Fabijañska 5. Input data 8-bit monochromatic images of different character and levels of details are regarded in this work. The spatial resolution of each image is 256 256 pixels. The images and their names are presented in Figure 5. (a) starfish (b) plane (c) cars (d) texture (e) cottage (f) pentagon Fig. 5. Test images used to test the performance of graph cut algorithm; (a) starfish; (b) plane; (c) cars; (d) texture; (e) cottage; (f) pentagon 6. Tests and results In this section the influence of boundary and regional conditions on image segmentation using graph cut approach is presented on exemplary images (see Sec. 5). Before the processing, regions belonging to the foreground and the background were indicated manually in order to impose some hard constrains on image segmentation. Input images with frames indicating the foreground and the background are presented in Figure 6a and Figure 7a (first columns). Specifically, red color represents regions which must be included into the background while green color indicates images regions contained in the foreground. Properties of these regions were next used to qualify the remaining pixels into these areas. For describing the boundary conditions two functions given by Equations (7) and (8) were tested. As they are to represent the similarity between the neighboring nodes p and q their values decrease with increasing difference between intensities I p and I q of the neighboring nodes.

Graph Based Image Segmentation 99 B{ pq, } = 255 Ip Iq (7) { pq, } Ip Iq B = e (8) While regarding boundary conditions four cases were investigated. Firstly (Case 1), only difference between the intensity I p of the node p and the average intensities of background I B and foreground I O were considered and weights were determined as described by Equation (9). R(obj) = 1 ( I I )/ I R(bkg) = 1 ( I I )/ I p B B p O O (9) Next (Case 2), the probabilities of each intensity to belong to foreground and background were used for describing boundary conditions in accordance with the following equation: R(obj) = Pr( Ip O) R(bkg) = Pr( Ip B) (10) where: Pr(I p O) and Pr(I p B) denote probabilities of intensity I p to belong to the foreground and the background respectively. The probabilities were determined based on intensity distribution in every region. This was followed by extending the regional terms by gradient properties (Case 3). Specifically, average gradient magnitude and average gradient direction in foreground and background supplemented the average region intensity regarded in the Case 1 (see Eq. (11)). ( Ip IO)( Ip IO)( ϕ( Ip) ϕ( IO)) R(obj) = 1 IO IOϕ( IO) ( Ip IB)( Ip IB)( ϕ( Ip) ϕ( IB)) R(bkg) = 1 IB IBϕ( IB) (11) where: IO, I B average intensity in foreground and background region, I p intensity of node p, I, I average gradient magnitude in foreground and background region, O B I p gradient magnitude in node p, ϕ ( IO), ϕ ( IB) average gradient direction in foreground and background region, ϕ ( I p ) gradient direction in node p.

100 Anna Fabijañska Finally (Case 4), probabilities of each pixel to belong to foreground and background were investigated based on analysis of image intensity distribution, gradient magnitude distribution and gradient direction distribution in accordance with Equation (12). R(obj) = Pr( Ip O) Pr( Ip O) Pr( ϕ( Ip) O) R(bkg) = Pr( Ip B) Pr( Ip B) Pr( ϕ( Ip) B) (12) where: Pr(I p O) and Pr(I p B) denote probabilities of intensity I p to belong to the foreground and the background respectively, Pr( I p O) and Pr( I p B) denote probabilities of gradient magnitude I p to belong to the foreground and the background respectively and Pr(ϕ( I p ) O) and Pr(ϕ( Ip) B) are probabilities of gradient direction ϕ( I p ) to belong to the foreground and the background respectively. The probabilities were determined based on the corresponding property distribution in every region. Results of min-cut/max-flow segmentation applied to exemplary images are shown in Figures 6 and 7. Specifically, Figure 6 corresponds to results obtained using Equation (7) for describing boundary conditions and consecutively Equations (9) (12) (Cases 1 4 respectively) for describing regional conditions. In Figure 7 results obtained using Equation (8) for boundary conditions and Equations (9) (12) for regional conditions are shown. In both Figures the first column presents original images with regional constrains imposed on the background and the foreground. The following columns present results obtained in Cases 1 4. Case id is indicated in a column caption. Results presented in Figures 6 and 7 clearly show that selection of functions describing the boundary and regional relations in the image significantly influences the results of image segmentation. Firstly, it should be stated, that using exponential function for describing boundary conditions penalizes significantly discontinuities between pixels of similar intensities. The effect is clearly seen in Figure 7 when visibly similar neighboring pixels are included into the different regions. This make the function insufficient for segmentation of regions covered by highly varied texture (see Fig. 7, images: starfish, texture). In this case linear function provides significantly better results (see Fig. 6) and successfully segments objects even of very diverse patterned texture. When regarding regional conditions it should be stated that uniform objects can be successfully extracted from the scene via graph cut, based only on mean intensities of regions indicated as the background and the foreground (see Eq. (9)). This was in case of plane image (Fig. 5, Case 1). Investigating probabilities of pixel intensities to belong to the certain regions was satisfactory for segmentation of moderately differentiated textures (see Fig. 6, images: starfish, tanks, pentagon, Case 2). Finally, in order to obtain satisfactory segmentation of objects characterized by extremely diverse texture pattern it was necessary to include gradient distribution information into function describing boundary properties (see Fig. 6, image: texture, Case 4).

Graph Based Image Segmentation 101 Fig. 6. Results of graph min-cut/max-flow segmentation applied to exemplary images. For boundary conditions linear function given by Equation (7) was applied

102 Anna Fabijañska Fig. 7. Results of graph min-cut/max-flow segmentation applied to exemplary images. For boundary conditions exponential function given by Equation (8) was applied

Graph Based Image Segmentation 103 7. Conclusions In this paper graph min-cut/max-flow segmentation algorithm was investigated. Although the method has recently gained an increasing interest of the researchers, it lacks clear and unequivocal information how to describe boundary and regional conditions used during image segmentation. Therefore, problem of boundary and regional penalties selection was investigated in this paper. Functions for describing these both terms were proposed and their influence on image segmentation results were investigated. The obtained results can be then treated as a guidelines for development of new boundary and regional constrains for graph min-cut/ max-flow segmentation. Acknowledgements The presented work is supported by Ministry of Science and Higher Education of Poland from funds for science in years 2010 2012 (project no. N N516 490439). Anna Fabijañska receives financial support from the Foundation for Polish Science in a framework of START fellowship. References [1] Wang J., Graph Based Image Segmentation. A Modern Approach. VDM Verlag, Germany, 2008. [2] Shi J., Malik J., Normalized Cuts and Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, nr 8, t. 22, 2000, 888 905. [3] Boykov Y., Jolly M.-P., Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images. International Conference on Computer Vision, t. 1, 2001, 105 112. [4] Boykov Y., Kolmogorov V., An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision. IEEE Transactions on Pattern Analysis and Machine Intelligence, nr 9, t. 26, 2004, 1124 1137. [5] Diestel R., Graph Theory. Springer, Berlin, Germany, 2010.