Superpixel Optimization using Higher-Order Energy

Size: px
Start display at page:

Download "Superpixel Optimization using Higher-Order Energy"

Transcription

1 JOURNAL OF L A TEX CLASS FILES Superpixel Optimization using Higher-Order Energy Jianteng Peng, Jianbing Shen, Senior Member, IEEE, Angela Yao, and Xuelong Li, Fellow, IEEE Abstract A novel superpixel extraction algorithm using a higher-order energy optimization framework is proposed in this paper. We first adopt the k-means clustering technique to quickly get an initial superpixel result. Then a higherorder energy function is employed to optimize and refine these initial superpixels. We use a more general higher-order energy function which includes a first-order data term, a second-order smoothness term and a higher-order term. The pre-segments are employed to provide the prior information of sufficient edges and segment regions for our higher-order energy term. According to the texture measurement in different local regions, our algorithm adaptively computes the proper ratios of different energy terms to obtain a better superpixel performance. The experimental results demonstrate that our method using the higher-order energy generates better results with well-aligned boundaries and homogeneous effects than the existing superpixel algorithms. Index Terms Superpixel, higher-order energy, optimization. I. INTRODUCTION Superpixels [2] group together pixels of similar characteristics and are widely used in a variety of image processing algorithms such as segmentation [9], [20], [30], [3], [34], synthesis [8], [6], saliency detection [23], [25], [34] and tracking [4]. The grouping of pixels allows for a more compact representation of images, which in turn improves computational efficiency. The challenge of extracting superpixels is to preserve sufficient levels of detail that adhere to image boundaries while being as compact as possible. One class of algorithms for extracting superpixels [2], [7], [22], [32] is based on a graph topology and optimization framework and includes the well known normalized cut [2] and graph-cut [7] superpixel algorithms. The original image is mapped to a graph; superpixels are then extracted from connected subgraphs formed by cutting graph edges. Veksler et al. [7] introduced the use of both a data term and a smoothness term that are typical in graph-cut based optimization. This work was supported in part by the National Basic Research Program of China (973 Program) (No. 203CB328805), the National Natural Science Foundation of China (Nos and 62506), the Program for New Century Excellent Talents in University (NCET--0789), and Shaanxi Key Innovation Team of Science and Technology (No. 202KCT-04). Specialized Fund for Joint Building Program of Beijing Municipal Education Commission. (Corresponding author: Jianbing Shen.) J. Peng and J. Shen are with the Beijing Laboratory of Intelligent Information Technology, School of Computer Science, Beijing Institute of Technology, Beijing 0008, P. R. China. shenjianbing@bit.edu.cn. A. Yao is with the Institute of Computer Science II, University of Bonn, Germany. yao@cs.uni-bonn.de. Xuelong Li is with the Center for OPTical IMagery Analysis and Learning (OPTIMAL), the School of Computer Science, Northwestern Polytechnical University, Xi an 7029, Shaanxi, P. R. China. xuelong li@nwpu.edu.cn Copyright (c) 205 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending an to pubs-permissions@ieee.org. Fig.. An illustration of the proposed superpixel optimization method. The higher-order adaptive energy optimization (step (b)) is applied on the initial superpixel results (I init ), which are obtained by applying the fast k-means clustering (step (a)) on the input image. The pre-segmentation image (I seg ) by mean-shift algorithm is used as a form of prior information to construct our higher-order energy term. Zhang et al. [22] further extended this by using a pseudoboolean optimization with only two labels to achieve squarelike superpixels with similar shape and size. In both works, the ratio of the energy terms remains fixed; our algorithm adapts the ratio based on local image statistics. Non-graph based superpixel algorithms include the TurboPixel method [3] and the SLIC algorithm [5], [33]. The TurboPixel algorithm [3] formulates the superpixel problem as a set of local region-growing processes and is based on a geometric-flow level-set evolution algorithm. The SLIC algorithm et al. [5] represents each image pixel as a feature point and clustering the feature points into superpixels using fast k-means. The fast k-means, while very efficient, extracts superpixels that are not always well-aligned to image boundaries. We adopt a similar scheme to initialize our superpixels but then further refine them using the higher-order energy term to improve superpixel alignment. Graph-cut [5] based energy minimization algorithms have been very popular for solving various image processing problems [3], [7], [24], including that of superpixel extraction. Most algorithms use energy functions with only unary and pairwise terms (e.g. data and smoothness terms) because higher-order functions are difficult to optimize efficiently. Higher-order terms, however, are necessary to capture the complex and rich statistics of natural scenes [9], [4] and several methods have been proposed for efficient optimization of these terms [5], [0], [4], [2], [35]. Inspired by the improvements seen for higher-order modeling in other image

2 JOURNAL OF L A TEX CLASS FILES 2 processing tasks such as segmentation, stereo matching and denoising [26], [9], [0], [2], [28], we propose the use of higher-order terms for superpixel extraction. Our higher-order term makes use of image segmentation results and encourages the superpixels to be well aligned to these segments. Furthermore, we propose an adaptive weighting of the energy terms so as to emphasize the impact of the different energy terms according to local image texture. The resulting superpixels are homogenous in shape and size but also well aligned to major image boundaries. An overview of the method can be found in Fig.. In the context of previous work on image superpixels, our main contributions can be summarized as follows: Proposal of a novel image superpixel algorithm, the first of its kind, which uses a higher-order energy function to extract superpixels well-aligned to object boundaries. Adaptation of the energy function to be texture-aware so as to yield superpixels homogenous in shape and size. An efficient superpixel optimization of the proposed higher-order energy function is developed, where the fast k-means initialization step is used for acceleration.. II. OUR APPROACH A. Energy Optimization Formulation We formulate superpixel extraction as an energy optimization problem. We initialize based on a fast k-means clustering and then optimize the energy function E(L) defined as Fig. 2. An illustration of the relationship between the pixel p and the center p c of initial superpixel S i. The data term measures the difference between p and p c by gradient feature. R pc is a local window (typically 3 3) with its center p c. at p c, where p c O init, or the set of all superpixel centers in I init. dist xy (p, p c ) is the Euclidean distance between pixel p and center pixel p c with the same superpixel label. When distances exceed d max, f defaults to µ max, a very large upper threshold (µ max = in our experiments). Fig. 2 shows the relationship of p to p c visually. The data term E data is then defined as a summation over P, the set of all pixels in image I: E data (L) = p P f(l p ). E(L) = λ u E data (L)+λ p E smooth (L)+λ h E high (L) () where L is the superpixel label which is initialized by I init (I init is generated by fast k-means). Note that each variable in L gets the value from the set L =, 2,..., N init }, where N init is the total number of superpixels in I init, and it is set manually. From an image with P pixels, L then be a vector with length of P, i.e. L L P. E data and E smooth are the standard unary data term and pairwise smoothness term used in other graph-cut based energy functions [5], [3], [7], [24], while E high is a high-order term. λ u, λ p and λ h are the weight parameters for the unary, pairwise and higher-order energy terms respectively. When λ h is set to 0, then Eq.() degenerates to the standard lower-order energy function. B. Lower-order terms E data and E smooth We compute E data according to image gradients rather than the standard approach of using intensity since gradients are less sensitive to lighting variations in the scene. The more similar a pixel p is to the centering pixels of the superpixel, based on its image gradient value, the more likely it is that p will belong to that superpixel. We define such a similarity measure f as f(l p ) = Ip R pc, if dist xy (p, p c ) < d max µ max, where I p is the gradient value of pixel p. R pc denotes the average gradient of the pixels in local window R pc centered (2) The initial superpixel set in I init tend to be irregular in shape as the fast k-means clustering is extremely sensitive to pixel intensity. We handle the shape irregularity with the smoothness term E smooth, where E smooth (L) = (p,q) N V pq ω pq φ ( L init(p), L init(q) ). The smoothness term is computed over N, or the set of all pairs of neighbouring pixels in image I. V is an indicator variable equal to when p and q have differing superpixel labels, i.e. if Lp L V pq = q, (3) 0. while ω measures similarity between neighbouring superpixels p and q according to feature vector f = hue, saturation, intensity}: ω pq = dist xy (p, q) exp f p f q 2 2σ 2 }. φ( ) controls the extent to which the initial clustering I init influences the smoothness term and is defined as if Linit(p) = L φ(l init(p), L init(q) ) = init(q). β, where(0 < β < ). (4) Here, L init(p) and L init(q) are the labels of pixels p and q from L init. The function φ allows for a preference of preserving

3 JOURNAL OF L A TEX CLASS FILES 3 superpixel boundaries from I init between pixels p and q, while β influences the probability that these boundaries will change (we set β as 0.8 in our experiments). C. Higher-order term E high We formulate E high to encourage the superpixel boundaries to respect the edges of objects, while reducing boundaries which cut across objects. To approximate object boundaries, we use mean shift [] to obtain a crude segmentation I seg to act as a prior. We choose mean-shift for its speed and edge preserving capabilities; though other alternatives for generating I seg exist [6], [], [27]. Note that I seg need not be very accurate since the E high can automatically select the right edges from the segments to follow during the optimization process. Let M be the set of all pre-segments in I seg and C M be a specific pre-segment of M. The higher-order energy term E high is then defined as the sum over all the pre-segments in M: E high (L) = C M ψ(l C ). (5) where ψ( ) is a penalty term which encourages alignment of superpixel edges to pre-segmentation boundaries:, N(L C ) > ψ(l C ) = N(L (6) C),. where N(L C ) represents the number of pixels belonging to the set of superpixels intersecting with the boundary of presegment C, whose formula is shown below. ψ( ) is proportional to N(L C ) when N(L C ) is not larger than. is an upper limit on the number of pixels which can influence this term. We set = C /0 in our experiments. C denotes the number of pixels in pre-segment C. N(L C ) = S L C S L C,S L C where S L is the set of pixels belonging to a superpixel from L. As shown in Eq.(6), ψ( ) is set to a maximum value of defined as: = C [ + q(c)], where q(c) is a measure of segmentation quality defined as exp var(c)}, C > S q(c) = L (7) 0, based on var(c), or the variance of the pixel intensity in presegment C. S L represents the average size of superpixels in L. As such, when pre-segment C is too small, then q(c) = 0 and limits and the impact of the higher-order penalty, thereby preventing small and fragmented pre-segments from impacting the superpixels. The data and smoothness energy terms can be converted into st-graph structures [5] which can be then by optimized with graph-cut [5], [24]. To optimize the higher-order energy term in an efficient manner, however, requires some approximations, such as order reduction, to transform the higher-order energy terms into a lower-order term. Furthermore, if the variables of the higher-order energy function has multiple solutions, the variables will need to be simplified into binary variables after the order reduction. Move-making algorithms are one of the most popular ways of minimizing energy functions with multiple discrete variables and we use the α-expansion move algorithm [5]. In α- expansion, each variable is assigned some initial label; for each successive iteration, some label α is selected and all non-α members can either change to α or keep its own label. The algorithm continues to iterate through each possible label of α until it reaches convergence. More formally, let transformation function T be defined as α, if t i = 0 T α (x i, t i ) = x p i, if t (8) i = where x i denotes a given variable in the energy function and x p i is its former label before the expansion move. t i is the operation for x i ; when 0, the label changes from x p i to α and when, it stays the same. We now describe how to apply α-expansion towards minimizing our high-order energy term. We have defined C as a particular pre-segment from I seg, then ψ(x C ) is the corresponding higher-order term. We use the notation ψ(x C ) to replace ψ(l C ) from Eq.(5) here in order to emphasize its variable, and actually these two notations have the same meaning. C sp be the set of all pixels belonging to superpixels which cross the boundary of pre-segment C. Let C r be the intersecting set of pixels C r = C C sp and C c be the complementary set of pixels C c = C C sp. Note that we do not assume C c to be non-empty necessarily during the α-expansion. When C is a very small pre-segment which only contains superpixels crossing its boundaries, then N(L C ) = C and ψ(l C ) = from Eq.(6). Therefore, regardless of how the labels in C change during the expansion, the higher-order penalty will stay the same. After a single expansion move, let the set R contain the previously non-α pixels now assigned to α, with superpixel labels x p C and xn C before and after the expansion respectively. The higher-order energy term can then be reformulated into α-expansion terms as follows: ψ(t α (x i, t i )) = ψ(x n C) = min N(x C), } (9) In traditional α-expansion, the superpixel labels of all pixels belonging to C, have a chance to be assigned as α and expand. However, the expansion move is significantly more complex when the superpixels which cross the boundary of C are also able to expand. As such, we restrict expansion to only labels of superpixels which are entirely present in C. According to the locations of the newly expanded region R, there are two situations in our expansion moves (see Fig. 3): (s.) when R lies entirely in segment C (R C) and (s.2) when R lies on the boundary of segment C (R C

4 JOURNAL OF L A TEX CLASS FILES 4 Fig. 4. The st-graph obtained by the Theorem in Eq.(3). Fig. 3. Different situations in the α-expansion moves. Situation means that the pixels with their labels chosen as α are entirely in segment C. Situation 2 represents that the group of pixels with newly obtained label α is laying on the boundary of segment C., R C ). The energy of an expansion move can then be defined as: ψ(t α (x i, t i )) = More specifically, max min Cr R, }, s. min ( C r + R C sp ), }, s.2 (0) (a) Fig. 5. Comparisons between data-smoothness energy and data-smoothness higher-order energy optimization. (a) is the result by lower-order energy optimization; (b) is the result by our higher-order energy optimization. (b) C r R, if C r R < function: and R C = 0 ψ(t α ) = max [ C r + R C sp ], if C r + R C sp < and R C 0, () Here we denote t C = t i i C}, and let fk m(t C) be the number of variables in segment C taking the operation k (k 0, }). We then define fk m(t C) as follows: fk m f0 m (t C ) = i C (t C ) = ( t i), if k = 0 f m (t C ) = i C t (2) i, if k = We can therefore transform our higher-order energy by the α-expansion move algorithm to: f m (t Cr ), if f m (t Cr ) < and f0 m (t Csp Cr ) = 0 ψ(t α ) = max [f 0 m (t Cc ) + C r ], if f0 m (t Cc ) < C r and f0 m (t Csp Cr ) 0, (3) By introducing the additional binary variables m 0 and, the above higher-order energy function can be approximately transformed into a lower-order pairwise energy function (see Eq. (4)) and allow our higher-order energy function to be solved with graph-cut. Theorem. The α-expansion move energy can be approximately transformed into the second-order pairwise energy ψ(t α (x i, t i )) = min θ t i ( m 0 ) + r 0 m 0 + m 0, i C r θ ( t i ) + r ( ) + δ} (4) where θ = /, r 0 =, r = ( C r ), and δ = ( Cr ). The α-expansion move energy can be upper-bounded by the second-order pairwise energy. By minimizing this upperbound, we can make expansion moves that always decrease the original energy. The detailed proof of this transformation is given in Appendix A. We build the st-graph by the α- expansion move algorithm as shown in Fig. 4. The comparison results between the lower-order energy and our higher-order energy are given in Fig. 5. The quality of superpixels is improved by adding the higher-order term in our energy function. D. Texture-aware adaptive energy optimization By selectively setting the λs in Eq.() to zero, one can determine the impact of each term; Fig. 6 shows an example. The superpixels extracted from using only the data term tend to match object edges better but are less homogeneous in size and shape, while the superpixels extracted from using only the smoothness term (Fig. 6(b)) have smooth boundaries but are less aligned to image edges. Using both the data and smoothness terms, Fig. 6(c) offers some tradeoff between the two but the superpixels lack regularity and some are still fragmented. Including the higher-order energy term, however

5 JOURNAL OF L A TEX CLASS FILES 5 (a) (b) Fig. 8. Strategy of our image patch-based adaptive optimization: (a) calculate G mean and G var for patch p(i); (b) use the piecewise functions in Fig. 9 to compute the corresponding λ u, λ p and λ h for energy function; (c) optimize the energy function and get the superpixel result of patch p(i); (d) select the next patch p(i + ) according to the clustering center points; (e) repeat these steps from (a) to (d) for patch p(i + ). (c) (d) Fig. 6. Impact of the different terms in the energy function: (a) data term only (λ u, λ p, λ h ) = (, 0, 0) (b) smoothness term only, (λ u, λ p, λ h ) = (0,, 0) (c) data and smoothness terms, (λ u, λ p, λ h ) = (,, 0) (d) data, smoothness and higher-order terms, (λ u, λ p, λ h ) = (,, ). (a) (b) Fig. 9. The piecewise functions for computing λ u and λ h. Fig. 7. Image patches (from p() to p(6)) have the same size. The ratio of each energy term in our adaptive energy function is determined by both the mean and variance values of their gradients. Fig. 6(d), results in superpixels regular in size which adhere nicely to objects in the image. We observe that local image regions can be divided into three types according to its local texture: plain regions with little to no texture, e.g. cloudless sky, monochrome backgrounds, highly textured regions, e.g. fur or feathers or grass, and edge regions with some texture that contain boundaries of objects. For plain regions, the smoothness term will have a strong role in the optimization in order to obtain homogenous superpixels and as such, the ratio of λ p should be much higher than textured regions. In regions with object boundaries however, the relevance of the data and higher-order terms will have a much bigger influence and the more complicated these regions are, the higher that the ratios of λ u and λ h should be with respect to λ p. Based on our observations of these three types of image TABLE I patch p() p(2) p(3) p(4) p(5) p(6) G mean G var *G mean and G var are the mean and variance of gradients in image patches. regions, we adjust the λs according to the texture of image patches centered on respective superpixels (see Fig. 8 for an overview). Measuring the mean and variance values of image patch gradients is a good way of classifying image region into one of these three types. Plain regions have low means and variances, while highly textured regions have higher mean gradients but still relatively low variance. Finally, edge regions have both high mean and high variance gradients. We illustrate this in Fig. 7 and Table I. We fix λ p to 0 in our experiments and then adjust λ u and λ h according to gradient mean G mean and gradient variance G var. We use a piecewise linear function based on four parameters: ξ m, ξ m2, ξ v, and ξ v2 (see Fig. 9). To set these parameters, we chose 50 images from the Berkeley segmentation dataset and randomly cut 9 (3 of each texture type) image patches from each. We compute the mean and variance of image gradients in each of these patches and use the computed minima and maxima to set ξ m =.5, ξ m2 = 5.5, ξ v =.0, ξ v2 = 2.5 for our experiments. We summarize the overall texture-adaptive higher-order energy optimization in Algorith. E. Discussion of superpixel absorption In the process of α-expansion, it is possible that some of the superpixels get absorbed by other expanding superpixels. This absorption phenomenon is rare; in most iterations, the superpixels only change shape and become better aligned to the object. However, when a pixel does get absorbed, we still continue using its center from I init for computational simplicity in finding patch. Fig. 0 gives an example of such a case. On the top left, the red and blue dots denote the centers of

6 JOURNAL OF L A TEX CLASS FILES 6 Algorith Texture-adaptive higher-order energy optimization for extracting superpixels initialize: superpixels I init and pre-segments M for each superpixel S i in I init do ) find the center pixel (x i, y i ) of S i and the associated image patch p i (see Fig. 8). 2) compute G mean and G var of p i and the associated λ i u, λ i p and λ i h (see Fig. 9) 3) determine superpixel label L i of p i by minimizing Eq.() 4) I init L i end for output: I final = I init as the new center of the absorbed superpixels. The α-expansion process with superpixel absorption (Fig. 0). the i-th iteration in the red patch:. Choose a as α and expand: (a) (b) 2. Choose b as α and expand: (b) (c) 3. Choose c as α and expand: (c) (d) 4. Label d is absorbed by superpixel c: (d) (e) 5. Choose e as α and expand: (e) (f) 6. Go back to step 2, choose the label a, b, c, e as α to do the α-expansion, until we get the minimum energy. the (i + )-th iteration in the blue patch: 7. Move the optimization patch from p(i) to p(i + ): (f) (g) 8. Choose a as α and expand: (g) (h) 9. Choose f as α and expand: (h) (i) 0. Choose c as α and expand: (i) (j). Choose b as α and expand: (j) (k) 2. Choose e as α and expand: (k) (l) 3. Go back to step 8, choose the label a, f, c, b, e as α to do the α-expansion, until we obtain the minimum energy. (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) (k) (l) Fig. 0. Illustrating the α-expansion process with superpixel absorption and label distribution in the i-th and (i + )-th iterations. The purple dot in (g) is the new center of the newly fused superpixel after the absorption. two initial superpixels before the optimization process. We first optimize the red patch p i in the i-th iteration, which absorbs the blue superpixel, making a newly fused superpixel centered at the purple dot (Fig. 0(g)). However, we will continue using the blue dot as the center of p(i + ) on the (i + )-th patch optimization iteration. This simplification is based on the assumption that the centers of absorbed superpixels will be relatively close to the centers of the newly created ones, thereby having little influence on the local texture measurements and the associated λs. As shown in Fig. 0, the blue superpixel is absorbed in (d), and the purple dot in (g) is the new center of the fused superpixel, which is close to the blue dot. Therefore, the textures around the purple dot and the blue dot are similar. It is an approximate computation if we use the initial center III. EXPERIMENTAL RESULTS We conduct a variety of experiments to evaluate the effectiveness of the proposed algorithm. We use the test images from the Berkeley segmentation dataset (BSD), which includes three hundred images with human-labeled segmentations as ground-truth and compare the results of our proposed algorithm against that of the SLIC [5], graph-cut (GC) [7] and pseudo-boolean optimization (PB) [22] methods. A. ualitative comparison The experimental results by the SLIC, GC and PB are all generated from implementations provided by the authors on their respective websites. To achieve as fair of a comparison as possible, we use the generic parameter settings provided by the authors to run their algorithms. We make a qualitative comparison of the superpixels extracted by our method to that of these three methods in Fig.. The SLIC approach [5] generates superpixels which are well-aligned to the edges of objects, but the superpixels themselves have non-smooth boundaries (see first column of Fig. ). The reason is that the SLIC method only considers the color and location features in the k-means clustering stage, which makes the boundaries of superpixels sensitive to the color of pixels. The GC method [7] uses both a data and smoothness term and generates superpixels with smooth boundaries but are poorly aligned to objects in the scene (see second column of Fig. ). The PB method [22] initializes the superpixels as narrow vertical and horizontal strips of equal width before the pseudo-boolean optimization which strongly restricts the basic shape of the resulting superpixels (see third column of Fig. ). In comparison, our method, by adopting both lower-order and higher-order energy terms, is able to align the superpixel boundaries to object edges adaptively (see last column of Fig. ), resulting in smooth, uniformly

7 JOURNAL OF L A TEX CLASS FILES 7 Fig.. The superpixels by SLIC [5], graph-cut (GC) [7], pseudo-boolean optimization (PB) [22] and our method respectively (from left to right). shaped superpixels in the sky regions and more well-aligned superpixels in regions containing object boundaries. B. uantitative comparison In addition to the qualitative comparison, we evaluate and compare our method to other superpixel approaches quantitatively. We adopt three quantitative measurements [3], [7], [2]: boundary recall (BR), achievable segmentation accuracy (ASA) and undersegmentation error (UE). Let S and G denote the labels in the superpixel result and ground truth respectively and n s and n g be the number of segments in each set. Furthermore, let S k denotes a superpixel in S and G i a segment in G, i.e. S k S = S, S 2,, S ns } and G i G = G, G 2,, G ng }. The boundaries of superpixels and ground truth segments are expressed as δs and δg. Boundary Recall measures the percentage of image segmentation boundary pixels p δg that are within a distance ϵ away from some superpixel boundary pixel q δs: BR G (S) = p δg ϕ(min q δs dist xy (p, q) < ϵ), (5) δg where ϕ( ) is an indicator function equaling when it e- valuates to be true and 0. As such, the better the superpixels are aligned to the ground truth segments, the larger BR will be. We set ϵ = 2 in our experiments. Achievable Segmentation Accuracy measures the percentage of pixels in superpixels S that are achievable to form the ground truth segments G. For every superpixel S k, we can find a segment G i that has the largest overlap; the fraction of the overlap is the achievable part and can be calculated as: k ASA G (S) = max i S k Gi i G (6) i where the denominator i G i is the total number of pixels in the image. Under-segmentation Error measures the number of pixels within a superpixel which are located outside of a segmentation boundary. If a superpixel S k is entirely contained within a ground truth segment G i, it is considered a correct superpixel and does not contribute to the value of UE. Therefore, the smaller UE is, the better superpixel extraction will be. UE is computed as follows: i k:s UE G (S) = S k Gi k G i i G (7) i We plot the quantitative measures of our method in comparison to the SLIC [5], GC [7], PB [22], and TurboPixel (TP) [3] in Fig. 2. The measures are determined by averaging all test image results in the BSD benchmark. Our proposed method outperforms the others according to the BR and ASA measures (see Fig. 2(a) and (b)) across the entire

8 JOURNAL OF L A TEX CLASS FILES Boundary Recall GC SLIC Turbo PB Ours Achievable Segmentation Accuracy GC SLIC Turbo PB Ours Undersegmentation Error GC SLIC Turbo PB Ours (a) (b) (c) Fig. 2. Comparison of boundary recall, achievable segmentaiton accuracy and undersegmentation error between SLIC [5], GC [7], PB [22], TurboPixel (TP) [3] and our method respectively (from left to right) on the BSD benchmark. Higher BR and ASA values indicate a better performance in superpixel alignment; lower UE values represents less error in the superpixelling results. BR Value data and smooth adaptive data, smooth and higher order ASA Value data and smooth adaptive data, smooth and higher order UE Value data and smooth adaptive data, smooth and higher order (a) (b) (c) (d) Fig. 3. An illustration that shows the advantage of our adaptive higher-order optimization. (a) gives the comparison results between the higher-order energy (top) and our adaptive higher-order energy (bottom). The quantitative evaluations (BR, ASA, UE) of the top and bottom images in (a) are (0.8289, 0.946, ) and (0.8255, , ), respectively. (b), (c), and (d) are the BR, ASA, and UE curves between the lower-order energy optimization and our adaptive higher-order energy optimization on the elephant image in (a). range of superpixel densities. The high BR values indicates the superiority of our method in aligning the superpixels, while the high ASA values suggests that our adaptive segmentation can be further extended to other unsupervised image segmentation applications. In terms of UE, our method achieves comparable results as the best-performing TurboPixel for UE curve (see Fig. 2(c)) though we are better at lower superpixel densities. C. Texture-adaptive results By including a higher-order energy term, we ensure that extracted superpixels are well-aligned to object edges and the difference can be seen in the BR, ASA and UE curves in Fig. 3(a), (b) and (c) comparing the results of using only the lower-energy terms and our adaptive higher-order energy. In particular, the texture-adaptation allows for the extraction of superpixels more homogenous in shape and size in highly texture regions. This is due to the use of a smaller of λ h, thereby limiting the contribution of the higher-order term. For example, in Fig. 3(a), the superpixels using non-adaptive and adaptive λs respectively are similar in the smooth and object boundary regions. In highly texture regions, however, the adaptive algorithm is able to generate superpixels more homogeneous in shape and size (see superpixels on body of elephant and in the grass). (a) Fig. 4. Run-time comparison: (a) our approach without k-means clustering step (run-time: seconds); (b) our full approach with k-means clustering initialization (run-time: seconds). The quantitative evaluations (BR, ASA, UE) of (a) and (b) are (0.887,0.9895,0.064) and (0.8858,0.9900,0.054), respectively. (b)

9 JOURNAL OF L A TEX CLASS FILES 9 As stated in the overview, we use a fast k-means clustering initialization step to accelerate computation speed. Fig. 4 compares the superpixels of direct optimization (a) and the fast k-means initialization (b). We use regular image patches as initial superpixels to generate Fig.4(a), where the input image is first divided into regular patches like [7] and then the boundaries of these patches are treated as the boundaries of initial superpixels for computing N(L C ). The two results are virtually identical, even though using k-means requires only seconds of computation time in comparison to seconds by the direct optimization. Finally, our method is insensitive to the pre-segmentation method. We show in Fig. 5 pre-segmentation results from different approaches [], [6], [], [27] as well as the corresponding superpixel results. Note that the main role of the pre-segments is to provide prior information for the higherorder energy term; as such, as long as the segments contain sufficient edges and segment regions, we can extract fairly consistent superpixels even from very different pre-segments. We propose the use of the classic mean-shift algorithm [] due to its computational efficiency. (a) (c) (e) (b) (d) (f) IV. CONCLUSION We have presented a novel superpixel algorithm using higher-order energy optimization. In order to quickly obtain an initial superpixel result, fast k-means clustering is performed on the input image. We then apply our higher-order energy optimization on the initial superpixels to produce the final superpixel results. By using adaptive energy terms according to local texture information, our extracted superpixels are well aligned to object boundaries yet still remain relatively homogenous in shape and size. Our experimental results show that our approach achieves better superpixel performance than the state-of-the-art approaches. For future work, we first intend to design new metrics for supervoxels in alignment and regularity by extending the evaluation metrics in [29]. To make our algorithm perform in real-time, we also plan on exploring parallel acceleration implementations [8]. Finally, it would be interesting to model different forms of higher-order potentials and other image priors for superpixel segmentations. Appendix A (g) Fig. 5. Comparison with different pre-segments. (a), (c), (e) and (g) are the pre-segments by mean-shift [], efficient graph [6], edge-preserving method [], and probabilistic method [27], respectively. (b), (d), (f) and (h) are the corresponding superpixel results by our method using the pre-segments in (a), (c), (e) and (g), respectively. The quantitative evaluations (BR, ASA, UE) of (b), (d), (f), and (h) are (0.794,0.9606,0.3936), (0.7892,0.9609,0.3855), (0.7904,0.9608,0.3862), and (0.794,0.9607,0.3942), respectively. D. Computation speed and Pre-Segmentation (h) Theorem. The α-expansion move energy can be approximately transformed into the pairwise function: ψ(t α (x i, t i )) = min θ t i ( m 0 ) + r 0 m 0 + m 0, i C r θ ( t i ) + r ( ) + δ} (8) where θ = /, r 0 =, r = ( C r ), and δ = ( Cr ). Proof. We first decompose the above energy function as: ψ(t α (x i, t i )) = F 0 (t Cr ) + F (t Cc ) F 0 (t Cr ) = min θ t i ( m 0 ) + r 0 m 0 m 0 i C r F (t Cc ) = min θ ( t i ) + r ( ) + θ C r Then we transform F 0 (t Cr ) and F (t Cc ) as follows: F 0 (t Cr ) = min θ m 0 = min m 0 = min m 0 t i ( m 0 ) + r 0 m 0 i C r t i ( m 0 ) + m 0 i C r f m (t Cr )( m 0 ) + m 0 (9)

10 JOURNAL OF L A TEX CLASS FILES 0 In the same way, we obtain the following equations: F (t Cc ) = min θ ( t i ) + r ( ) + θ C r = min θ ( t i ) + ( C r )( ) + θ C r = min θ ( t i ) + ( θ C r )( ) + θ C r = min θ ( t i ) + θ C r + = min θ[ = min ( t i ) + C r ] + ( ) [f 0 m (t Cc ) + C r ] + ( ) (20) We now define a new set Co = Csp Cr, which contains the pixels in Csp but not in Cr. After making z(f m 0 (t Co )) = minf m 0 (t Co ), } and adding the constraint conditions that relate to Co, the following optimization functions are obtained: Because the constraint conditions f m 0 (t Csp Cr ) = 0 and f m 0 (t Csp Cr ) 0 are exclusive, we then add Eq.(2) and Eq.(22) together: F 0 (t Cr, t Co ) + F (t Cc, t Co ) = + max f m (t Cr ), iff m (t Cr ) < andf0 m (t Csp Cr ) = 0 + max [f 0 m (t Cc ) + C r ], iff0 m (t Cc ) < C r andf0 m (t Csp Cr ) 0 2, (24) Since our higher-order term is built for every segment C and the region Co is not in C, we would like to delete the constraint of Co to derive Eq.(2) and Eq.(22) approximately. Also, investigating Eq.(2) and Eq.(22), we can find that Co is not even in the value of F 0 (t Cr, t Co ) and F (t Cc, t Co ). Co only appears in the constraints of Eq.(2) and Eq.(22). Therefor we then relax the constraint conditions and weaken the constraint of Co as follows: F 0 (t Cr, t Co ) = min m 0 f (t Cr )( m 0 )[ z(f m 0 (t Co ))] + m 0 F (t Cc, t Co ) = min s.t. m 0 + [ z(f m 0 (t Co ))] > 0 [f m 0 (t Cc ) + C r ] z(f m 0 (t Co )) + ( ) s.t. ( ) + z(f m 0 (t Co )) > 0 (2) From the above constraint in F 0 (t Cr, t Co ), we can conclude that m 0 = 0 and z(f0 m (t Co )) = can not be satisfied simultaneously. If m 0 = 0, then there must be z(f0 m (t Co )) =, F 0 (t Cr, t Co ) = f m (t Cr ); if m 0 =, F 0 (t Cr, t Co ) =, and we further have: F 0 (t Cr, t Co ) max f m (t Cr ), if f m (t Cr ) < = and f0 m (t Co ) = 0, max f m (t Cr ), if f m (t Cr ) < = and f0 m (t Csp Cr ) = 0, In the same way, we obtain F (t Cc, t Co ) as follows: (22) F (t Cc, t Co ) max [f 0 m (t Cc ) + C r ], if f0 m (t Cc ) < C r = and f0 m (t Co ) =, max [f 0 m (t Cc ) + C r ], if f0 m (t Cc ) < C r = and f0 m (t Csp Cr ) 0, (23) F 0 (t Cr ) F 0 (t Cr, t Co ), F (t Cc ) F (t Cc, t Co ) (25) Finally, we get the result as follows: ψ(t α (x i, t i )) = F 0 (t Cr ) + F (t Cc ) = f m (t Cr ), iff m (t Cr ) < andf0 m (t Csp Cr ) = 0 [f 0 m (t Cc ) + C r ], iff0 m (t Cc ) < C r (26) andf0 m (t Csp Cr ) 0, This is the solution of Eq.(3) in Section II. C. When the labels in one pre-segment C expand, they will also affect the higher-order values of other neighboring pre-segments. We assume that the penalty of other pre-segment does not increase, when the labels in C expand. That is to say, Eq.(4) in Section II. C is a pairwise transformation of the upper bound of the higher-order expansion energy. Therefore, the α-expansion move energy (Eq.(3)) can be approximately transformed into the pairwise function (Eq.(4)) as its upper bound. REFERENCES [] D. Comaniciu and P. Meer, Mean shift: a robust approach toward feature space analysis, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp , [2] X. Ren and J. Malik, Learning a classification model for segmentation, In: Proceedings of IEEE ICCV, pp. 0-7, [3] C. W. Ngo, Y. F. Ma, and H. J. Zhang, Video summarization and scene detection by graph modeling, IEEE Trans. on Circuits and Systems for Video Technology, vol. 5, no. 2, pp , [4] Y. Yuan, J. Fang,. Wang, Robust superpixel tracking via depth fusion, IEEE Trans. on Circuits and Systems for Video Technology, vol. 24, no., pp. 5-26, 204. [5] V. Kolmogorov, and R. Zabih, What energy functions can be minimized via graph cuts? IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp , [6] P. Felzenszwalb and D. Huttenlocher, Efficient graph-based image segmentation, International Journal of Computer Vision, vol. 59, no. 2, pp. 67-8, 2004.

11 JOURNAL OF L A TEX CLASS FILES [7] J. M. Bioucas-Dias and G. Valadao, Phase unwrapping via graph cuts, IEEE Trans. on Image Processing, vol. 6, no. 3, pp , [8] Y. P. Tsai, C. H. Ko, Y. P. Hung, and Z. C. Shih, Background removal of multiview images by learning shape priors, IEEE Trans. on Image Processing, vol. 6, no. 0, pp , [9] N. Komodakis, N. Paragios, Beyond pairwise energies: efficient optimization for higher-order MRFs, In: Proceedings of IEEE CVPR, pp , [0] S. Ramalingam, P. Kohli, K. Alahari, and P. H. S. Torr, Exact inference in multi-label CRFs with higher order cliques, In: Proceedings of IEEE CVPR, pp. -8, [] G. Sfikas, C. Nikou, and N. Galatsanos, Edge-preserving spatially varying mixtures for image segmentation, In: Proceedings of IEEE CVPR, pp. -7, [2] H. Ishikawa, Higher-order clique reduction in binary graph cut, In Proceedings of IEEE CVPR, pp , [3] A. Levinshtein, A. Stere, K. Kutulakos, D. Fleet, S. Dickinson, and K. Siddiqi, Turbopixels: fast superpixels using geometric flows, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 3, no. 2, pp , [4] P. Kohli, L. Ladicky, and P. Torr, Robust higher order potentials for enforcing label consistency, International Journal of Computer Vision, vol. 82, no. 3, pp , [5] R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, and S. Süsstrunk, SLIC superpixels, EPFL Technical Report no , 200. [6] L. C. Tran, C. J. Pal, and T.. Nguyen, View synthesis based on conditional random fields and graph cuts, In: Proceedings of IEEE ICIP, pp , 200. [7] O. Veksler, Y. Boykov, and P. Mehrani, Superpixels and supervoxels in an energy optimization framework, In: Proceedings of ECCV, pp , 200. [8] P. Strandmark and F. Kahl, Parallel and distributed graph cuts by dual decomposition, In: Proceedings of IEEE CVPR, pp , 200. [9] Y. Mu, B. Zhou, and S. Yan, Information-theoretic analysis of input strokes in visual object cutout, IEEE Trans. on Multimedia, vol. 2, no. 8, pp , 200. [20] M. B. Salah, A. Mitiche, and I. B. Ayed, Multiregion image segmentation by parametric kernel graph cuts, IEEE Trans. on Image Processing, vol. 20, no. 2, pp , 20. [2] M. Y. Liu, O. Tuzel, S. Ramalingam, and R. Chellappa, Entropy rate superpixel segmentation, In: Proceedings of IEEE CVPR, pp , 20. [22] Y. Zhang, R. I. Hartley, J. Mashford, and S. Burn, Superpixels via pseudo-boolean optimization, In: Proceedings of IEEE ICCV, pp , 20. [23] H. Li and K. N. Ngan, A co-saliency model of image pairs, IEEE Trans. on Image Processing, vol. 20, no. 2, pp , 20. [24] E. Bae, J. Shi, and X. C. Tai, Graph cuts for curvature based image denoising, IEEE Trans. on Image Processing, vol. 20, no. 5, pp , 20. [25] Z. Liu, R. Shi, L. Shen, Y. Xue, K. N. Ngan, and Z. Zhang, Unsupervised salient object segmentation based on kernel density estimation and two-phase graph cut, IEEE Trans. on Multimedia, vol. 4, no. 2, pp , 202. [26] K. Park and S. Gould, On learning higher-order consistency potentials for multi-class pixel labeling, In Proceedings of ECCV, pp , 202. [27] B. Andres, J. H. Kappes, T. Beier, U. Kothe, and F. A. Hamprecht, Probabilistic image segmentation with closedness constraints, In: Proceedings of IEEE ICCV, pp , 20. [28] J. Peng, J. Shen, Y. Jia, and X. Li, Saliency cut in stereo images, In Proceedings of IEEE ICCV Workshop, pp , 203. [29] C. Xu and J. J. Corso, Evaluation of super-voxel methods for early video processing, In: Proceedings of IEEE CVPR, pp , 202. [30] P. Arbelaez, M. Maire, C. Fowlkes, and J. Malik, Contour detection and hierarchical image segmentation, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 33, No. 5, pp , 20. [3] J. Shen, Y. Du, and X. Li, Interactive segmentation using constrained Laplacian optimization, IEEE Trans. on Circuits and Systems for Video Technology, vol. 24, no. 7, pp , 204. [32] J. Shen, Y. Du, W. Wang, X. Li, Lazy random walks for superpixel segmentation, IEEE Trans. on Image Processing, vol. 23, no. 4, , 204. [33] R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, and S. Süsstrunk, SLIC superpixels compared to state-of-the-art superpixel methods, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 34, no., pp , 202. [34] W. Wang, J. Shen, F. Porikli, Saliency-aware geodesic video object segmentation, In: Proceedings of IEEE CVPR, 205. [35] A. Fix, A. Gruber, E. Boros, and R. Zabih, A graph cut algorithm for higher-order Markov Random Fields, In: Proceedings of IEEE ICCV, pp , 20. Jianteng Peng is currently working toward the PhD degree in the School of Computer Science, Beijing Institute of Technology, Beijing, China. His current research interests include image and video segmentation using higher-order energy optimization. Jianbing Shen (M -SM 2) is a Full Professor with the School of Computer Science, Beijing Institute of Technology, Beijing, China. He has published about 50 journal and conference papers such as IEEE TIP, IEEE TCSVT, IEEE TCYB, IEEE TMM, IEEE CVPR, IEEE ICCV, IEEE ICME. He has also obtained many flagship honors including the Fok Ying Tung Education Foundation from Ministry of Education, the Program for Beijing Excellent Youth Talents from Beijing Municipal Education Commission, and the Program for New Century Excellent Talents in University from Ministry of Education. His research interests include computer vision and multimedia processing. He is on the editorial boards of Neurocomputing. Angela Yao is currently an Assistant Professor at the Institute of Computer Science at the University of Bonn. She received a BASc degree in Engineering Science from the University of Toronto in 2006 and a PhD in Information Technology and Electrical Engineering from ETH Zurich in 202. Her research interests are in computer vision and machine learning, with special focus on human pose estimation and action recognition. Xuelong Li (M 02-SM 07-F 2) Xuelong Li (M 02-SM 07-F 2) is a full professor with the Center for OPTical IMagery Analysis and Learning (OPTIMAL), the School of Computer Science, Northwestern Polytechnical University, Xi an 7029, Shaanxi, P. R. China.

Superpixel Segmentation using Depth

Superpixel Segmentation using Depth Superpixel Segmentation using Depth Information Superpixel Segmentation using Depth Information David Stutz June 25th, 2014 David Stutz June 25th, 2014 01 Introduction - Table of Contents 1 Introduction

More information

SCALP: Superpixels with Contour Adherence using Linear Path

SCALP: Superpixels with Contour Adherence using Linear Path SCALP: Superpixels with Contour Adherence using Linear Path Rémi Giraud 1,2 remi.giraud@labri.fr with Vinh-Thong Ta 1 and Nicolas Papadakis 2 1 LaBRI CNRS UMR 5800 - University of Bordeaux, FRANCE PICTURA

More information

Saliency Cut in Stereo Images

Saliency Cut in Stereo Images 2013 IEEE International Conference on Computer Vision Workshops Saliency Cut in Stereo Images Jianteng Peng, Jianbing Shen, Yunde Jia Beijing Laboratory of Intelligent Information Technology, School of

More information

DEPTH-ADAPTIVE SUPERVOXELS FOR RGB-D VIDEO SEGMENTATION. Alexander Schick. Fraunhofer IOSB Karlsruhe

DEPTH-ADAPTIVE SUPERVOXELS FOR RGB-D VIDEO SEGMENTATION. Alexander Schick. Fraunhofer IOSB Karlsruhe DEPTH-ADAPTIVE SUPERVOXELS FOR RGB-D VIDEO SEGMENTATION David Weikersdorfer Neuroscientific System Theory Technische Universität München Alexander Schick Fraunhofer IOSB Karlsruhe Daniel Cremers Computer

More information

arxiv: v1 [cs.cv] 14 Sep 2015

arxiv: v1 [cs.cv] 14 Sep 2015 gslicr: SLIC superpixels at over 250Hz Carl Yuheng Ren carl@robots.ox.ac.uk University of Oxford Ian D Reid ian.reid@adelaide.edu.au University of Adelaide September 15, 2015 Victor Adrian Prisacariu victor@robots.ox.ac.uk

More information

Superpixels via Pseudo-Boolean Optimization

Superpixels via Pseudo-Boolean Optimization Superpixels via Pseudo-Boolean Optimization Yuhang Zhang, Richard Hartley The Australian National University {yuhang.zhang, richard.hartley}@anu.edu.au John Mashford, Stewart Burn CSIRO {john.mashford,

More information

SCALP: Superpixels with Contour Adherence using Linear Path

SCALP: Superpixels with Contour Adherence using Linear Path : Superpixels with Contour Adherence using Linear Path Rémi Giraud, Vinh-Thong Ta, Nicolas Papadakis To cite this version: Rémi Giraud, Vinh-Thong Ta, Nicolas Papadakis. : Superpixels with Contour Adherence

More information

Cluster Sensing Superpixel and Grouping

Cluster Sensing Superpixel and Grouping Cluster Sensing Superpixel and Grouping Rui Li Lu Fang Abstract Superpixel algorithms have shown significant potential in computer vision applications since they can be used to accelerate other computationally

More information

SMURFS: Superpixels from Multi-scale Refinement of Super-regions

SMURFS: Superpixels from Multi-scale Refinement of Super-regions LUENGO, BASHAM, FRENCH: SMURFS SUPERPIXELS 1 SMURFS: Superpixels from Multi-scale Refinement of Super-regions Imanol Luengo 1 imanol.luengo@nottingham.ac.uk Mark Basham 2 mark.basham@diamond.ac.uk Andrew

More information

Image Segmentation. Lecture14: Image Segmentation. Sample Segmentation Results. Use of Image Segmentation

Image Segmentation. Lecture14: Image Segmentation. Sample Segmentation Results. Use of Image Segmentation Image Segmentation CSED441:Introduction to Computer Vision (2015S) Lecture14: Image Segmentation What is image segmentation? Process of partitioning an image into multiple homogeneous segments Process

More information

Supervised texture detection in images

Supervised texture detection in images Supervised texture detection in images Branislav Mičušík and Allan Hanbury Pattern Recognition and Image Processing Group, Institute of Computer Aided Automation, Vienna University of Technology Favoritenstraße

More information

Resolution-independent superpixels based on convex constrained meshes without small angles

Resolution-independent superpixels based on convex constrained meshes without small angles 1 2 Resolution-independent superpixels based on convex constrained meshes without small angles 3 4 5 6 7 Jeremy Forsythe 1,2, Vitaliy Kurlin 3, Andrew Fitzgibbon 4 1 Vienna University of Technology, Favoritenstr.

More information

Energy Minimization for Segmentation in Computer Vision

Energy Minimization for Segmentation in Computer Vision S * = arg S min E(S) Energy Minimization for Segmentation in Computer Vision Meng Tang, Dmitrii Marin, Ismail Ben Ayed, Yuri Boykov Outline Clustering/segmentation methods K-means, GrabCut, Normalized

More information

Superpixels Generating from the Pixel-based K-Means Clustering

Superpixels Generating from the Pixel-based K-Means Clustering Superpixels Generating from the Pixel-based K-Means Clustering Shang-Chia Wei, Tso-Jung Yen Institute of Statistical Science Academia Sinica Taipei, Taiwan 11529, R.O.C. wsc@stat.sinica.edu.tw, tjyen@stat.sinica.edu.tw

More information

FOREGROUND SEGMENTATION BASED ON MULTI-RESOLUTION AND MATTING

FOREGROUND SEGMENTATION BASED ON MULTI-RESOLUTION AND MATTING FOREGROUND SEGMENTATION BASED ON MULTI-RESOLUTION AND MATTING Xintong Yu 1,2, Xiaohan Liu 1,2, Yisong Chen 1 1 Graphics Laboratory, EECS Department, Peking University 2 Beijing University of Posts and

More information

Segmentation in electron microscopy images

Segmentation in electron microscopy images Segmentation in electron microscopy images Aurelien Lucchi, Kevin Smith, Yunpeng Li Bohumil Maco, Graham Knott, Pascal Fua. http://cvlab.epfl.ch/research/medical/neurons/ Outline Automated Approach to

More information

Superpixel Segmentation using Linear Spectral Clustering

Superpixel Segmentation using Linear Spectral Clustering Superpixel Segmentation using Linear Spectral Clustering Zhengqin Li Jiansheng Chen Department of Electronic Engineering, Tsinghua University, Beijing, China li-zq12@mails.tsinghua.edu.cn jschenthu@mail.tsinghua.edu.cn

More information

Content-based Image and Video Retrieval. Image Segmentation

Content-based Image and Video Retrieval. Image Segmentation Content-based Image and Video Retrieval Vorlesung, SS 2011 Image Segmentation 2.5.2011 / 9.5.2011 Image Segmentation One of the key problem in computer vision Identification of homogenous region in the

More information

Markov Networks in Computer Vision. Sargur Srihari

Markov Networks in Computer Vision. Sargur Srihari Markov Networks in Computer Vision Sargur srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Important application area for MNs 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

Compact Watershed and Preemptive SLIC: On improving trade-offs of superpixel segmentation algorithms

Compact Watershed and Preemptive SLIC: On improving trade-offs of superpixel segmentation algorithms To appear in Proc. of International Conference on Pattern Recognition (ICPR), 2014. DOI: not yet available c 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained

More information

A Feature Point Matching Based Approach for Video Objects Segmentation

A Feature Point Matching Based Approach for Video Objects Segmentation A Feature Point Matching Based Approach for Video Objects Segmentation Yan Zhang, Zhong Zhou, Wei Wu State Key Laboratory of Virtual Reality Technology and Systems, Beijing, P.R. China School of Computer

More information

CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping CS 534: Computer Vision Segmentation and Perceptual Grouping Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Outlines Mid-level vision What is segmentation Perceptual Grouping Segmentation

More information

Accelerated gslic for Superpixel Generation used in Object Segmentation

Accelerated gslic for Superpixel Generation used in Object Segmentation Accelerated gslic for Superpixel Generation used in Object Segmentation Robert Birkus Supervised by: Ing. Wanda Benesova, PhD. Institute of Applied Informatics Faculty of Informatics and Information Technologies

More information

Markov Networks in Computer Vision

Markov Networks in Computer Vision Markov Networks in Computer Vision Sargur Srihari srihari@cedar.buffalo.edu 1 Markov Networks for Computer Vision Some applications: 1. Image segmentation 2. Removal of blur/noise 3. Stereo reconstruction

More information

SALIENT OBJECT DETECTION FOR RGB-D IMAGE VIA SALIENCY EVOLUTION.

SALIENT OBJECT DETECTION FOR RGB-D IMAGE VIA SALIENCY EVOLUTION. SALIENT OBJECT DETECTION FOR RGB-D IMAGE VIA SALIENCY EVOLUTION Jingfan Guo,2, Tongwei Ren,2,, Jia Bei,2 State Key Laboratory for Novel Software Technology, Nanjing University, China 2 Software Institute,

More information

Improving Image Segmentation Quality Via Graph Theory

Improving Image Segmentation Quality Via Graph Theory International Symposium on Computers & Informatics (ISCI 05) Improving Image Segmentation Quality Via Graph Theory Xiangxiang Li, Songhao Zhu School of Automatic, Nanjing University of Post and Telecommunications,

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

Grouping and Segmentation

Grouping and Segmentation 03/17/15 Grouping and Segmentation Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class Segmentation and grouping Gestalt cues By clustering (mean-shift) By boundaries (watershed)

More information

2.1 Optimized Importance Map

2.1 Optimized Importance Map 3rd International Conference on Multimedia Technology(ICMT 2013) Improved Image Resizing using Seam Carving and scaling Yan Zhang 1, Jonathan Z. Sun, Jingliang Peng Abstract. Seam Carving, the popular

More information

Separating Objects and Clutter in Indoor Scenes

Separating Objects and Clutter in Indoor Scenes Separating Objects and Clutter in Indoor Scenes Salman H. Khan School of Computer Science & Software Engineering, The University of Western Australia Co-authors: Xuming He, Mohammed Bennamoun, Ferdous

More information

Segmentation with non-linear constraints on appearance, complexity, and geometry

Segmentation with non-linear constraints on appearance, complexity, and geometry IPAM February 2013 Western Univesity Segmentation with non-linear constraints on appearance, complexity, and geometry Yuri Boykov Andrew Delong Lena Gorelick Hossam Isack Anton Osokin Frank Schmidt Olga

More information

Image Resizing Based on Gradient Vector Flow Analysis

Image Resizing Based on Gradient Vector Flow Analysis Image Resizing Based on Gradient Vector Flow Analysis Sebastiano Battiato battiato@dmi.unict.it Giovanni Puglisi puglisi@dmi.unict.it Giovanni Maria Farinella gfarinellao@dmi.unict.it Daniele Ravì rav@dmi.unict.it

More information

Robot localization method based on visual features and their geometric relationship

Robot localization method based on visual features and their geometric relationship , pp.46-50 http://dx.doi.org/10.14257/astl.2015.85.11 Robot localization method based on visual features and their geometric relationship Sangyun Lee 1, Changkyung Eem 2, and Hyunki Hong 3 1 Department

More information

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing Tomoyuki Nagahashi 1, Hironobu Fujiyoshi 1, and Takeo Kanade 2 1 Dept. of Computer Science, Chubu University. Matsumoto 1200,

More information

IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS

IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS IMPROVED FINE STRUCTURE MODELING VIA GUIDED STOCHASTIC CLIQUE FORMATION IN FULLY CONNECTED CONDITIONAL RANDOM FIELDS M. J. Shafiee, A. G. Chung, A. Wong, and P. Fieguth Vision & Image Processing Lab, Systems

More information

Using the Kolmogorov-Smirnov Test for Image Segmentation

Using the Kolmogorov-Smirnov Test for Image Segmentation Using the Kolmogorov-Smirnov Test for Image Segmentation Yong Jae Lee CS395T Computational Statistics Final Project Report May 6th, 2009 I. INTRODUCTION Image segmentation is a fundamental task in computer

More information

Lecture 16 Segmentation and Scene understanding

Lecture 16 Segmentation and Scene understanding Lecture 16 Segmentation and Scene understanding Introduction! Mean-shift! Graph-based segmentation! Top-down segmentation! Silvio Savarese Lecture 15 -! 3-Mar-14 Segmentation Silvio Savarese Lecture 15

More information

Graph-Based Superpixel Labeling for Enhancement of Online Video Segmentation

Graph-Based Superpixel Labeling for Enhancement of Online Video Segmentation Graph-Based Superpixel Labeling for Enhancement of Online Video Segmentation Alaa E. Abdel-Hakim Electrical Engineering Department Assiut University Assiut, Egypt alaa.aly@eng.au.edu.eg Mostafa Izz Cairo

More information

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction Marc Pollefeys Joined work with Nikolay Savinov, Christian Haene, Lubor Ladicky 2 Comparison to Volumetric Fusion Higher-order ray

More information

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies Lena Gorelick Yuri Boykov Olga Veksler Computer Science Department University of Western Ontario Abstract Below

More information

Graph based Image Segmentation using improved SLIC Superpixel algorithm

Graph based Image Segmentation using improved SLIC Superpixel algorithm Graph based Image Segmentation using improved SLIC Superpixel algorithm Prasanna Regmi 1, B.J.M. Ravi Kumar 2 1 Computer Science and Systems Engineering, Andhra University, AP, India 2 Computer Science

More information

Superpixels and Supervoxels in an Energy Optimization Framework

Superpixels and Supervoxels in an Energy Optimization Framework Superpixels and Supervoxels in an Energy Optimization Framework Olga Veksler, Yuri Boykov, and Paria Mehrani Computer Science Department, University of Western Ontario London, Canada {olga,yuri,pmehrani}@uwo.ca

More information

Edge-Based Split-and-Merge Superpixel Segmentation

Edge-Based Split-and-Merge Superpixel Segmentation Proceeding of the 2015 IEEE International Conference on Information and Automation Lijing, China, August 2015 Edge-Based Split-and-Merge Superpixel Segmentation Li Li, Jian Yao, Jinge Tu, Xiaohu Lu, Kai

More information

Superpixel Segmentation: An Evaluation

Superpixel Segmentation: An Evaluation Superpixel Segmentation: An Evaluation David Stutz Computer Vision Group, RWTH Aachen University david.stutz@rwth-aachen.de Abstract. In recent years, superpixel algorithms have become a standard tool

More information

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos Image as Markov Random Field and Applications 1 Tomáš Svoboda, svoboda@cmp.felk.cvut.cz Czech Technical University in Prague, Center for Machine Perception http://cmp.felk.cvut.cz Talk Outline Last update:

More information

People Tracking and Segmentation Using Efficient Shape Sequences Matching

People Tracking and Segmentation Using Efficient Shape Sequences Matching People Tracking and Segmentation Using Efficient Shape Sequences Matching Junqiu Wang, Yasushi Yagi, and Yasushi Makihara The Institute of Scientific and Industrial Research, Osaka University 8-1 Mihogaoka,

More information

Saliency Detection in Aerial Imagery

Saliency Detection in Aerial Imagery Saliency Detection in Aerial Imagery using Multi-scale SLIC Segmentation Samir Sahli 1, Daniel A. Lavigne 2 and Yunlong Sheng 1 1- COPL, Image Science group, Laval University, Quebec, Canada 2- Defence

More information

Supplementary Materials for Salient Object Detection: A

Supplementary Materials for Salient Object Detection: A Supplementary Materials for Salient Object Detection: A Discriminative Regional Feature Integration Approach Huaizu Jiang, Zejian Yuan, Ming-Ming Cheng, Yihong Gong Nanning Zheng, and Jingdong Wang Abstract

More information

Markov Random Fields and Segmentation with Graph Cuts

Markov Random Fields and Segmentation with Graph Cuts Markov Random Fields and Segmentation with Graph Cuts Computer Vision Jia-Bin Huang, Virginia Tech Many slides from D. Hoiem Administrative stuffs Final project Proposal due Oct 27 (Thursday) HW 4 is out

More information

Uncertainty Driven Multi-Scale Optimization

Uncertainty Driven Multi-Scale Optimization Uncertainty Driven Multi-Scale Optimization Pushmeet Kohli 1 Victor Lempitsky 2 Carsten Rother 1 1 Microsoft Research Cambridge 2 University of Oxford Abstract. This paper proposes a new multi-scale energy

More information

Superpixel Segmentation Based Gradient Maps on RGB-D Dataset

Superpixel Segmentation Based Gradient Maps on RGB-D Dataset Superpixel Segmentation Based Gradient Maps on RGB-D Dataset Lixing Jiang, Huimin Lu, Vo Duc My, Artur Koch and Andreas Zell Abstract Superpixels aim to group homogenous pixels by a series of characteristics

More information

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu. Lectures 21 & 22 Segmentation and clustering

CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu. Lectures 21 & 22 Segmentation and clustering CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lectures 21 & 22 Segmentation and clustering 1 Schedule Last class We started on segmentation Today Segmentation continued Readings

More information

Color Image Segmentation Using a Spatial K-Means Clustering Algorithm

Color Image Segmentation Using a Spatial K-Means Clustering Algorithm Color Image Segmentation Using a Spatial K-Means Clustering Algorithm Dana Elena Ilea and Paul F. Whelan Vision Systems Group School of Electronic Engineering Dublin City University Dublin 9, Ireland danailea@eeng.dcu.ie

More information

This is an author-deposited version published in : Eprints ID : 15223

This is an author-deposited version published in :   Eprints ID : 15223 Open Archive TOULOUSE Archive Ouverte (OATAO) OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. This is an author-deposited

More information

Video annotation based on adaptive annular spatial partition scheme

Video annotation based on adaptive annular spatial partition scheme Video annotation based on adaptive annular spatial partition scheme Guiguang Ding a), Lu Zhang, and Xiaoxu Li Key Laboratory for Information System Security, Ministry of Education, Tsinghua National Laboratory

More information

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE

OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE OCCLUSION BOUNDARIES ESTIMATION FROM A HIGH-RESOLUTION SAR IMAGE Wenju He, Marc Jäger, and Olaf Hellwich Berlin University of Technology FR3-1, Franklinstr. 28, 10587 Berlin, Germany {wenjuhe, jaeger,

More information

Iterative MAP and ML Estimations for Image Segmentation

Iterative MAP and ML Estimations for Image Segmentation Iterative MAP and ML Estimations for Image Segmentation Shifeng Chen 1, Liangliang Cao 2, Jianzhuang Liu 1, and Xiaoou Tang 1,3 1 Dept. of IE, The Chinese University of Hong Kong {sfchen5, jzliu}@ie.cuhk.edu.hk

More information

Segmentation & Clustering

Segmentation & Clustering EECS 442 Computer vision Segmentation & Clustering Segmentation in human vision K-mean clustering Mean-shift Graph-cut Reading: Chapters 14 [FP] Some slides of this lectures are courtesy of prof F. Li,

More information

Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts

Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts [Yuri Boykov, Olga Veksler, Ramin Zabih, Fast Approximation

More information

A discriminative view of MRF pre-processing algorithms supplementary material

A discriminative view of MRF pre-processing algorithms supplementary material A discriminative view of MRF pre-processing algorithms supplementary material Chen Wang 1,2 Charles Herrmann 2 Ramin Zabih 1,2 1 Google Research 2 Cornell University {chenwang, cih, rdz}@cs.cornell.edu

More information

A Novel Extreme Point Selection Algorithm in SIFT

A Novel Extreme Point Selection Algorithm in SIFT A Novel Extreme Point Selection Algorithm in SIFT Ding Zuchun School of Electronic and Communication, South China University of Technolog Guangzhou, China zucding@gmail.com Abstract. This paper proposes

More information

Detection of Man-made Structures in Natural Images

Detection of Man-made Structures in Natural Images Detection of Man-made Structures in Natural Images Tim Rees December 17, 2004 Abstract Object detection in images is a very active research topic in many disciplines. Probabilistic methods have been applied

More information

Segmentation. Bottom up Segmentation Semantic Segmentation

Segmentation. Bottom up Segmentation Semantic Segmentation Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,

More information

Robust color segmentation algorithms in illumination variation conditions

Robust color segmentation algorithms in illumination variation conditions 286 CHINESE OPTICS LETTERS / Vol. 8, No. / March 10, 2010 Robust color segmentation algorithms in illumination variation conditions Jinhui Lan ( ) and Kai Shen ( Department of Measurement and Control Technologies,

More information

Multiscale Superpixels and Supervoxels Based on Hierarchical Edge-Weighted Centroidal Voronoi Tessellation

Multiscale Superpixels and Supervoxels Based on Hierarchical Edge-Weighted Centroidal Voronoi Tessellation 2015 IEEE Winter Conference on Applications of Computer Vision Multiscale Superpixels and Supervoxels Based on Hierarchical Edge-Weighted Centroidal Voronoi Tessellation Youjie Zhou 1, Lili Ju 2 and Song

More information

Estimating Human Pose in Images. Navraj Singh December 11, 2009

Estimating Human Pose in Images. Navraj Singh December 11, 2009 Estimating Human Pose in Images Navraj Singh December 11, 2009 Introduction This project attempts to improve the performance of an existing method of estimating the pose of humans in still images. Tasks

More information

Iterated Graph Cuts for Image Segmentation

Iterated Graph Cuts for Image Segmentation Iterated Graph Cuts for Image Segmentation Bo Peng 1, Lei Zhang 1, and Jian Yang 2 1 Department of Computing, The Hong Kong Polytechnic University, Kowloon, Hong Kong, China. 2 School of Computer Science

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Image Segmentation Some material for these slides comes from https://www.csd.uwo.ca/courses/cs4487a/

More information

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation ÖGAI Journal 24/1 11 Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation Michael Bleyer, Margrit Gelautz, Christoph Rhemann Vienna University of Technology

More information

Graph Based Image Segmentation

Graph Based Image Segmentation AUTOMATYKA 2011 Tom 15 Zeszyt 3 Anna Fabijañska* Graph Based Image Segmentation 1. Introduction Image segmentation is one of the fundamental problems in machine vision. In general it aims at extracting

More information

Contextual Classification with Functional Max-Margin Markov Networks

Contextual Classification with Functional Max-Margin Markov Networks Contextual Classification with Functional Max-Margin Markov Networks Dan Munoz Nicolas Vandapel Drew Bagnell Martial Hebert Geometry Estimation (Hoiem et al.) Sky Problem 3-D Point Cloud Classification

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University. 3D Computer Vision Structured Light II Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Introduction

More information

Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels

Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels Manifold IC: A Fast Method to Compute Content-Sensitive Superpixels Yong-Jin Liu, Cheng-Chi Yu, Min-Jing Yu Tsinghua University, China {liuyongjin,ycc13,yumj14}@tsinghua.edu.cn Ying He Nanyang Technological

More information

Towards Spatio-Temporally Consistent Semantic Mapping

Towards Spatio-Temporally Consistent Semantic Mapping Towards Spatio-Temporally Consistent Semantic Mapping Zhe Zhao, Xiaoping Chen University of Science and Technology of China, zhaozhe@mail.ustc.edu.cn,xpchen@ustc.edu.cn Abstract. Intelligent robots require

More information

Title: Adaptive Region Merging Segmentation of Airborne Imagery for Roof Condition Assessment. Abstract:

Title: Adaptive Region Merging Segmentation of Airborne Imagery for Roof Condition Assessment. Abstract: Title: Adaptive Region Merging Segmentation of Airborne Imagery for Roof Condition Assessment Abstract: In order to perform residential roof condition assessment with very-high-resolution airborne imagery,

More information

Small Object Segmentation Based on Visual Saliency in Natural Images

Small Object Segmentation Based on Visual Saliency in Natural Images J Inf Process Syst, Vol.9, No.4, pp.592-601, December 2013 http://dx.doi.org/10.3745/jips.2013.9.4.592 pissn 1976-913X eissn 2092-805X Small Object Segmentation Based on Visual Saliency in Natural Images

More information

10708 Graphical Models: Homework 4

10708 Graphical Models: Homework 4 10708 Graphical Models: Homework 4 Due November 12th, beginning of class October 29, 2008 Instructions: There are six questions on this assignment. Each question has the name of one of the TAs beside it,

More information

Robust Higher Order Potentials for Enforcing Label Consistency

Robust Higher Order Potentials for Enforcing Label Consistency Int J Comput Vis (2009) 82: 302 324 DOI 10.1007/s11263-008-0202-0 Robust Higher Order Potentials for Enforcing Label Consistency Pushmeet Kohli L ubor Ladický Philip H.S. Torr Received: 17 August 2008

More information

UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF. Hongkai Yu, Min Xian, and Xiaojun Qi

UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF. Hongkai Yu, Min Xian, and Xiaojun Qi UNSUPERVISED CO-SEGMENTATION BASED ON A NEW GLOBAL GMM CONSTRAINT IN MRF Hongkai Yu, Min Xian, and Xiaojun Qi Department of Computer Science, Utah State University, Logan, UT 84322-4205 hongkai.yu@aggiemail.usu.edu,

More information

STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES

STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES 25-29 JATIT. All rights reserved. STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES DR.S.V.KASMIR RAJA, 2 A.SHAIK ABDUL KHADIR, 3 DR.S.S.RIAZ AHAMED. Dean (Research),

More information

Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing

Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing Tomoyuki Nagahashi 1, Hironobu Fujiyoshi 1, and Takeo Kanade 2 1 Dept. of Computer Science, Chubu University. Matsumoto 1200, Kasugai,

More information

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Structured Light II Johannes Köhler Johannes.koehler@dfki.de Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov Introduction Previous lecture: Structured Light I Active Scanning Camera/emitter

More information

Object Tracking using Superpixel Confidence Map in Centroid Shifting Method

Object Tracking using Superpixel Confidence Map in Centroid Shifting Method Indian Journal of Science and Technology, Vol 9(35), DOI: 10.17485/ijst/2016/v9i35/101783, September 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Object Tracking using Superpixel Confidence

More information

Histogram and watershed based segmentation of color images

Histogram and watershed based segmentation of color images Histogram and watershed based segmentation of color images O. Lezoray H. Cardot LUSAC EA 2607 IUT Saint-Lô, 120 rue de l'exode, 50000 Saint-Lô, FRANCE Abstract A novel method for color image segmentation

More information

Urban Scene Segmentation, Recognition and Remodeling. Part III. Jinglu Wang 11/24/2016 ACCV 2016 TUTORIAL

Urban Scene Segmentation, Recognition and Remodeling. Part III. Jinglu Wang 11/24/2016 ACCV 2016 TUTORIAL Part III Jinglu Wang Urban Scene Segmentation, Recognition and Remodeling 102 Outline Introduction Related work Approaches Conclusion and future work o o - - ) 11/7/16 103 Introduction Motivation Motivation

More information

Efficient Parallel Optimization for Potts Energy with Hierarchical Fusion

Efficient Parallel Optimization for Potts Energy with Hierarchical Fusion Efficient Parallel Optimization for Potts Energy with Hierarchical Fusion Olga Veksler University of Western Ontario London, Canada olga@csd.uwo.ca Abstract Potts frequently occurs in computer vision applications.

More information

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision Segmentation Based Stereo Michael Bleyer LVA Stereo Vision What happened last time? Once again, we have looked at our energy function: E ( D) = m( p, dp) + p I < p, q > We have investigated the matching

More information

Graph Cuts vs. Level Sets. part I Basics of Graph Cuts

Graph Cuts vs. Level Sets. part I Basics of Graph Cuts ECCV 2006 tutorial on Graph Cuts vs. Level Sets part I Basics of Graph Cuts Yuri Boykov University of Western Ontario Daniel Cremers University of Bonn Vladimir Kolmogorov University College London Graph

More information

Segmentation Computer Vision Spring 2018, Lecture 27

Segmentation Computer Vision Spring 2018, Lecture 27 Segmentation http://www.cs.cmu.edu/~16385/ 16-385 Computer Vision Spring 218, Lecture 27 Course announcements Homework 7 is due on Sunday 6 th. - Any questions about homework 7? - How many of you have

More information

CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping CS 534: Computer Vision Segmentation and Perceptual Grouping Spring 2005 Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Where are we? Image Formation Human vision Cameras Geometric Camera

More information

A Benchmark for Interactive Image Segmentation Algorithms

A Benchmark for Interactive Image Segmentation Algorithms A Benchmark for Interactive Image Segmentation Algorithms Yibiao Zhao 1,3, Xiaohan Nie 2,3, Yanbiao Duan 2,3, Yaping Huang 1, Siwei Luo 1 1 Beijing Jiaotong University, 2 Beijing Institute of Technology,

More information

Image Segmentation. Shengnan Wang

Image Segmentation. Shengnan Wang Image Segmentation Shengnan Wang shengnan@cs.wisc.edu Contents I. Introduction to Segmentation II. Mean Shift Theory 1. What is Mean Shift? 2. Density Estimation Methods 3. Deriving the Mean Shift 4. Mean

More information

String distance for automatic image classification

String distance for automatic image classification String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,

More information

Query-Sensitive Similarity Measure for Content-Based Image Retrieval

Query-Sensitive Similarity Measure for Content-Based Image Retrieval Query-Sensitive Similarity Measure for Content-Based Image Retrieval Zhi-Hua Zhou Hong-Bin Dai National Laboratory for Novel Software Technology Nanjing University, Nanjing 2193, China {zhouzh, daihb}@lamda.nju.edu.cn

More information

Co-Saliency Detection Based on Hierarchical Segmentation

Co-Saliency Detection Based on Hierarchical Segmentation 88 IEEE SIGNAL PROCESSING LETTERS, VOL. 21, NO. 1, JANUARY 2014 Co-Saliency Detection Based on Hierarchical Segmentation Zhi Liu, Member, IEEE, Wenbin Zou, Lina Li, Liquan Shen, and Olivier Le Meur Abstract

More information

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image [6] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image Matching Methods, Video and Signal Based Surveillance, 6. AVSS

More information

2 Proposed Methodology

2 Proposed Methodology 3rd International Conference on Multimedia Technology(ICMT 2013) Object Detection in Image with Complex Background Dong Li, Yali Li, Fei He, Shengjin Wang 1 State Key Laboratory of Intelligent Technology

More information

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters Marshall F. Tappen William T. Freeman Computer Science and Artificial Intelligence Laboratory Massachusetts Institute

More information