Graph Cuts vs. Level Sets. part I Basics of Graph Cuts

Similar documents
Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts

From Photohulls to Photoflux Optimization

Segmentation with non-linear constraints on appearance, complexity, and geometry

CS6670: Computer Vision

Maximum flows and minimal cuts. Filip Malmberg

3D Photography: Stereo Matching

Introduction à la vision artificielle X

CAP5415-Computer Vision Lecture 13-Image/Video Segmentation Part II. Dr. Ulas Bagci

Continuous and Discrete Optimization Methods in Computer Vision. Daniel Cremers Department of Computer Science University of Bonn

Multi-Label Moves for Multi-Label Energies

s-t Graph Cuts for Binary Energy Minimization

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

Photoflux Maximizing Shapes

Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Computer Vision : Exercise 4 Labelling Problems

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

How to fit a surface to a point cloud?

Graph Cuts in Vision and Graphics: Theories and Applications

Multi-view Stereo via Volumetric Graph-cuts and Occlusion Robust Photo-Consistency

GRAPH BASED SEGMENTATION WITH MINIMAL USER INTERACTION. Huaizhong Zhang, Ehab Essa, and Xianghua Xie

Models for grids. Computer vision: models, learning and inference. Multi label Denoising. Binary Denoising. Denoising Goal.

What have we leaned so far?

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Mathematics in Image Processing

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

Multi-view stereo. Many slides adapted from S. Seitz

USING ONE GRAPH-CUT TO FUSE MULTIPLE CANDIDATE MAPS IN DEPTH ESTIMATION

Stereo Correspondence with Occlusions using Graph Cuts

Volumetric stereo with silhouette and feature constraints

Segmentation with non-linear regional constraints via line-search cuts

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing

Segmentation in electron microscopy images

Joint 3D-Reconstruction and Background Separation in Multiple Views using Graph Cuts

Dr. Ulas Bagci

Markov Random Fields and Segmentation with Graph Cuts

Multi-view shape reconstruction

Supervised texture detection in images

Lecture 16 Segmentation and Scene understanding

Image Based Reconstruction II

Multi-View Reconstruction using Narrow-Band Graph-Cuts and Surface Normal Optimization

Energy Minimization for Segmentation in Computer Vision

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University

Multi-View 3D-Reconstruction

Announcements. Image Segmentation. From images to objects. Extracting objects. Status reports next Thursday ~5min presentations in class

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Fast Approximate Energy Minimization via Graph Cuts

Shape from Silhouettes

Quadratic Pseudo-Boolean Optimization(QPBO): Theory and Applications At-a-Glance

Multiview Stereo via Volumetric Graph-Cuts and Occlusion Robust Photo-Consistency

CS664 Lecture #21: SIFT, object recognition, dynamic programming

Graph Cut based Continuous Stereo Matching using Locally Shared Labels

Geometric Reconstruction Dense reconstruction of scene geometry

Capacity Scaling for Graph Cuts in Vision

Some books on linear algebra

MRFs and Segmentation with Graph Cuts

Image Segmentation. Marc Pollefeys. ETH Zurich. Slide credits: V. Ferrari, K. Grauman, B. Leibe, S. Lazebnik, S. Seitz,Y Boykov, W. Freeman, P.

Energy Minimization Under Constraints on Label Counts

Super Resolution Using Graph-cut

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Regularization and Markov Random Fields (MRF) CS 664 Spring 2008

A Comparative Study of Energy Minimization Methods for Markov Random Fields

Fast Trust Region for Segmentation

Algorithms for Markov Random Fields in Computer Vision

Brain tumor segmentation with minimal user assistance

Normalized cuts and image segmentation

BIL Computer Vision Apr 16, 2014

Applications of parametric maxflow in computer vision

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Prof. Feng Liu. Spring /17/2017. With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C.

CS4670: Computer Vision

In IEEE conference on Computer Vision and Pattern Recognition (CVPR), New York, June 2006, vol. I, p Active Graph Cuts

In Defense of 3D-Label Stereo

Segmentation. Bottom up Segmentation Semantic Segmentation

Prof. Trevor Darrell Lecture 20: Markov Random Fields

Image Composition. COS 526 Princeton University

EECS 442 Computer vision. Announcements

Image Segmentation continued Graph Based Methods. Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Project Updates Short lecture Volumetric Modeling +2 papers

Figure 1: Standard graph cut segmentation (top) and normalized likelihood of object intensity used in graph edge weights (bottom). Likely regions thro

Targil 10 : Why Mosaic? Why is this a challenge? Exposure differences Scene illumination Miss-registration Moving objects

Chaplin, Modern Times, 1936

Stereo: Disparity and Matching

Computer Vision I - Filtering and Feature detection

Efficient Parallel Optimization for Potts Energy with Hierarchical Fusion

Graph Cuts. Srikumar Ramalingam School of Computing University of Utah

Multi-View Stereo via Graph Cuts on the Dual of an Adaptive Tetrahedral Mesh

Iterative MAP and ML Estimations for Image Segmentation

Collaborative Multi Organ Segmentation by Integrating Deformable and Graphical Models

Shift-Map Image Editing

Statistical and Learning Techniques in Computer Vision Lecture 1: Markov Random Fields Jens Rittscher and Chuck Stewart

Shape Prior Segmentation of Multiple Objects with Graph Cuts

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Markov Random Fields with Efficient Approximations

Stereo vision. Many slides adapted from Steve Seitz

Measuring Uncertainty in Graph Cut Solutions - Efficiently Computing Min-marginal Energies using Dynamic Graph Cuts

Transcription:

ECCV 2006 tutorial on Graph Cuts vs. Level Sets part I Basics of Graph Cuts Yuri Boykov University of Western Ontario Daniel Cremers University of Bonn Vladimir Kolmogorov University College London

Graph Cuts versus Level Sets Part I: Basics of graph cuts Part II: Basics of level-sets Part III: Connecting graph cuts and level-sets Part IV: Global vs. local optimization algorithms

Graph Cuts versus Level Sets Part I: Basics of graph cuts Main idea for min-cut/max-flow methods and applications Implicit and explicit representation of boundaries Graph cut energy (different views) submodularity, geometric functionals, posterior energy (MRF) Extensions to multi-label problems convex and non-convex (robust) interactions, a-expansions

1D Graph cut shortest path on a graph Example: find the shortest closed contour in a given domain of a graph Shortest paths approach p Graph Cuts approach Compute the shortest path p ->p for a point p. Repeat for all points on the gray line. Then choose the optimal contour. Compute the minimum cut that separates red region from blue region

Graph cuts for optimal boundary detection (simple example à la Boykov&Jolly, ICCV 01) hard constraint n-links t a cut Minimum cost cut can be computed in polynomial time (max-flow/min-cut algorithms) w s σ hard constraint I exp 2σ pq pq = 2 I pq

Minimum s-t cuts algorithms Augmenting paths [Ford & Fulkerson, 1962] Push-relabel [Goldberg-Tarjan, 1986]

Augmenting Paths source S sink T Find a path from S to T along non-saturated edges Increase flow along this path until some edge saturates A graph with two terminals

Augmenting Paths source S sink T Find a path from S to T along non-saturated edges Increase flow along this path until some edge saturates A graph with two terminals Find next path Increase flow

Augmenting Paths source S sink T Find a path from S to T along non-saturated edges Increase flow along this path until some edge saturates A graph with two terminals MAX FLOW MIN CUT Iterate until all paths from S to T have at least one saturated edge

Optimal boundary in 2D max-flow = min-cut

Optimal boundary in 3D 3D bone segmentation (real time screen capture)

Graph cuts applied to multi-view reconstruction Calibrated images of Lambertian scene 3D model of scene CVPR 05 slides from Vogiatzis, Torr, Cippola

Graph cuts applied to multi-view reconstruction ρ(x) photoconsistency CVPR 05 slides from Vogiatzis, Torr, Cippola

Estimating photoconsistency in a narrow bad Occlusion 1. Get nearest point on outer surface 2. Use outer surface for occlusions 2. Discard occluded views CVPR 05 slides from Vogiatzis, Torr, Cippola

Graph cuts applied to multi-view reconstruction The cost of the cut integrates photoconsistency over the whole space Source Sink CVPR 05 slides from Vogiatzis, Torr, Cippola

Graph cuts applied to multi-view reconstruction surface of good photoconsistency visual hull (silhouettes) CVPR 05 slides from Vogiatzis, Torr, Cippola

Graph cuts for video textures Graph-cuts video textures (Kwatra, Schodl, Essa, Bobick 2003) a cut 1 2 Short video clip Long video clip 3D generalization of image-quilting (Efros & Freeman, 2001)

Graph cuts for video textures Graph-cuts video textures (Kwatra, Schodl, Essa, Bobick 2003) original short clip synthetic infinite texture

Cuts on directed graphs s Flux of a vector field through hypersurface with orientation (A or B) t Cut on a directed graph Cost of a cut includes only edges from the source to the sink components Cut s cost (on a directed graph) changes if terminals are swapped Swapping terminals s and t is similar to switching surface orientation

Cuts on directed graphs

Segmentation of elongated structures

Simple shape priors preferred surface normals p q α [Funka-Lea et al. 06] blob prior w p > q f Extra penalty ( I) + c (1 cos( α))

Adding regional properties (another segmentation example à la Boykov&Jolly 01) D p (t) n-links t a cut t-link w pq s t-link D p (s) regional bias example s t I and I suppose are given expected intensities of object and background D D p p ( s) ( t) = = const const I I p p I I s t NOTE: hard constrains are not required, in general.

Adding regional properties (another segmentation example à la Boykov&Jolly 01) D p (t) n-links t a cut t-link w pq s t-link D p (s) expected intensities of object and background s s t I I p p t Dp ( t) = const I p I can be re-estimated ( s) = const I NOTE: hard constrains are not required, in general. EM-style optimization of piece-vice constant Mumford-Shah model D I

Adding regional properties (another example à la Boykov&Jolly, ICCV 01)

Adding regional properties (another example à la Boykov&Jolly, ICCV 01) More generally, regional bias can be based on any intensity models of object and background D p (s) t a cut D L ) = lnpr( I L ) ( p p p p Pr( I p t) (t) D p s Pr( I p s) given object and background intensity histograms I p I

Iterative learning of regional models GMMRF cuts (Blake et al., ECCV04) Grab-cut (Rother et al., SIGGRAPH 04) parametric regional model Gaussian Mixture (GM) designed to guarantee convergence

Shrinking bias Minimization of non-negative boundary costs (sum of non-negative n-links) gives regularization/smoothing of segmentation results Choosing n-link costs from local image gradients helps image-adaptive regularization May result in over-smoothing or shrinking Typical for all surface regularization techniques Graph cuts are no different from snakes or level-sets on that

Shrinking bias Optimal cut depending on the size of the hard constrained region

Shrinking bias Image intensities on one scan line actual segment ideal segment Shrinking (or under-segmentation) is allowed by intensity gradient ramp

Shrinking bias object protrusion Surface regularization approach to multi-view stereo CVPR 05 slides from Vogiatzis, Torr, Cippola

Regional term can counter-act shrinking bias All voxels in the center of the scene are connected to the object terminal object protrusion Ballooning force favouring bigger volumes that fill the visual hull L.D. Cohen and I. Cohen. Finite-element methods for active contour models and balloons for 2-d and 3-d images. PAMI, 15(11):1131 1147, November 1993. CVPR 05 slides from Vogiatzis, Torr, Cippola

Shrinking bias CVPR 05 slides from Vogiatzis, Torr, Cippola

Uniform ballooning CVPR 05 slides from Vogiatzis, Torr, Cippola

Regional term based on Laplacian zero-crossings (flux) Basic idea Image intensities on a scan line + + - - Laplacian of intensities Can be seen as intelligent ballooning Vasilevsky and Sidiqqi, 2002 R. Kimmel and A. M. Bruckstein 2003

Integrating Laplacian zero-crossings into graph cuts (Kolmogorov&Boykov 05) graph cuts The image is courtesy of David Fleet University of Toronto Smart regional terms counteract shrinking bias

Implicit vs. Explicit graph cuts Most current graph cuts technique implicitly use surfaces represented via binary (interior/exterior) labeling of pixels 0 0 0 1 0 0 0 1 0 1 1 1

Implicit and Explicit graph cuts Except, a recent explicit surfaces representation method - Kirsanov and Gortler, 2004

Explicit Graph Cuts for multi-view reconstruction Lempitsky et al., ECCV 2006 Explicit surface patches allow local estimation of visibility when computing globally optimal solution local oriented visibility estimate Compare with Vogiatzis et al., CVPR 05 approach for visibility

Explicit Graph Cuts Regularization + uniform ballooning some details are still over-smoothed Lempitsky et al., ECCV 2006

Explicit Graph Cuts Regularization + intelligent ballooning Low noise and no shrinking Boykov and Lempitsky, 2006

Explicit Graph Cuts Space carving Kutulakos and Seitz, 2000

Three ways to look at energy of graph cuts I: Binary submodular energy II: Approximating continuous surface functionals III: Posterior energy (MAP-MRF)

Simple example of energy E( L) = D ( L ) + w δ ( L p Regional term Boundary term p p t-links pq N pq n-links p L q ) L p { s, t} D p (t) n-links t a cut t-link w pq s t-link D p (s) binary object segmentation

Graph cuts for minimization of submodular binary energies E( L) = p E p ( L Tight characterization of binary energies that can be globally minimized by s-t graph cuts is known [survey of Boros and Hummer, 2002, also Kolmogorov&Zabih 2002] E(L) can be minimized by s-t graph cuts Regional term t-links Non-submodular cases can be addressed with some optimality guarantees, e.g. QPBO algorithm reviewed in Boros and Hummer, 2002, nice slides by Kolmogorov at CVPR and Oxford-Brooks 2005 more in PART IV p ) + pq N E( L p, L q I Boundary term n-links ) L p { s, t} E ( s, s) + E( t, t) E( s, t) + E( t, s) submodularity condition for binary energies

Graph cuts for minimization of continuous surface functionals E( C) = C g( ) ds + C r N, v r x ds + f Ω( C) ( x) II dp Geometric length any convex, symmetric metric g e.g. Riemannian Flux any vector field v Regional bias any scalar function f Tight characterization of energies of binary cuts C as functionals of continuous surfaces [Boykov&Kolmogorov, ICCV 2003] [Kolmogorov&Boykov, ICCV 2005] more in PART III

Graph cuts for minimization of posterior energy III Greig at. al. [IJRSS, 1989] Posterior energy (MRF, Ising model) E( L) = lnpr( D ) + p Lp V p Likelihood (data term) pq N pq ( L p, L Spatial prior (regularization) q ) L p { s, t} Example: binary image restoration

Graph cuts algorithms can minimize multi-label energies as well

Multi-scan-line stereo with s-t graph cuts (Roy&Cox 98) y x

Multi-scan-line stereo with s-t graph cuts (Roy&Cox 98) t cut labels cut Disparity labels y s L(p) y p x x

s-t graph-cuts for multi-label energy minimization Ishikawa 1998, 2000, 2003 Modification of construction by Roy&Cox 1998 E ( L ) = D ( L ) + V ( L, L ) 1 Linear interactions p p p pq N p q Convex interactions L p R V(dL) V(dL) dl=lp-lq dl=lp-lq

Pixel interactions V: convex vs. discontinuity-preserving Convex Interactions V Robust discontinuity preserving Interactions V linear model V(dL) dl=lp-lq V(dL) Potts model dl=lp-lq V(dL) V(dL) dl=lp-lq dl=lp-lq

Pixel interactions: convex vs. discontinuity-preserving linear V truncated linear V

Robust interactions NP-hard problem (3 or more labels) two labels can be solved via s-t cuts (Greig at. al., 1989) a-expansion approximation algorithm (Boykov, Veksler, Zabih 1998, 2001) guaranteed approximation quality (Veksler, thesis 2001) within a factor of 2 from the global minima (Potts model) applies to a wide class of energies with robust interactions Potts model (BVZ 1989) Metric interactions (BVZ 2001) Submodular interactions (e.g Boros and Hummer, 2002, KZ 2004)

a-expansion algorithm 1. Start with any initial solution 2. For each label a in any (e.g. random) order 1. Compute optimal a-expansion move (s-t graph cuts) 2. Decline the move if there is no energy decrease 3. Stop when no expansion move would decrease energy

a-expansion move Basic idea: break multi-way cut computation into a sequence of binary s-t cuts a other labels

a-expansion moves In each a-expansion a given label a grabs space from other labels initial solution -expansion -expansion -expansion -expansion -expansion -expansion -expansion For each move we choose expansion that gives the largest decrease in the energy: binary optimization problem

a-expansions: examples of metric interactions noisy noisy shaded diamond diamond V ( α, β ) V ( α, β ) Potts V α β α Truncated linear V β

Multi-way graph cuts Multi-object Extraction Obvious generalization of binary object extraction technique (Boykov, Jolly, Funkalea 2004)

Multi-way graph cuts Stereo/Motion with slanted surfaces (Birchfield &Tomasi 1999) Labels = parameterized surfaces EM based: E step = compute surface boundaries M step = re-estimate surface parameters

Multi-way graph cuts stereo vision ground BVZ KZ 1998 truth 2002 depth map original pair of stereo images

Multi-way graph cuts Graph-cut textures (Kwatra, Schodl, Essa, Bobick 2003) E A H B A B F E F I C C D G J H I J D G similar to image-quilting (Efros & Freeman, 2001)

Multi-way graph cuts Graph-cut textures (Kwatra, Schodl, Essa, Bobick 2003)

a-expansions vs. simulated annealing normalized simulated annealing, correlation, start 19 for hours, annealing, 20.3% 24.7% err err Annealing a-expansions (BVZ 89,01) 90 seconds, 5.8% err Our method Smoothness Energy 100000 80000 60000 40000 20000 0 1 10 100 1000 10000 100000 Time in seconds