Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts

Similar documents
Graph Cuts vs. Level Sets. part I Basics of Graph Cuts

Approximate decoding: ICM, block methods, alpha-beta swap & alpha-expansion

CS6670: Computer Vision

Multi-Label Moves for Multi-Label Energies

intro, applications MRF, labeling... how it can be computed at all? Applications in segmentation: GraphCut, GrabCut, demos

s-t Graph Cuts for Binary Energy Minimization

Markov/Conditional Random Fields, Graph Cut, and applications in Computer Vision

CAP5415-Computer Vision Lecture 13-Image/Video Segmentation Part II. Dr. Ulas Bagci

Segmentation with non-linear constraints on appearance, complexity, and geometry

Computer Vision : Exercise 4 Labelling Problems

Introduction à la vision artificielle X

What have we leaned so far?

MRFs and Segmentation with Graph Cuts

Mathematics in Image Processing

Maximum flows and minimal cuts. Filip Malmberg

Energy Minimization for Segmentation in Computer Vision

Models for grids. Computer vision: models, learning and inference. Multi label Denoising. Binary Denoising. Denoising Goal.

Markov Random Fields and Segmentation with Graph Cuts

Fast Approximate Energy Minimization via Graph Cuts

Continuous and Discrete Optimization Methods in Computer Vision. Daniel Cremers Department of Computer Science University of Bonn

Dr. Ulas Bagci

Segmentation. Bottom up Segmentation Semantic Segmentation

Quadratic Pseudo-Boolean Optimization(QPBO): Theory and Applications At-a-Glance

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Energy Minimization Under Constraints on Label Counts

Segmentation with non-linear regional constraints via line-search cuts

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

Graph Cuts in Vision and Graphics: Theories and Applications

How to fit a surface to a point cloud?

3D Photography: Stereo Matching

Supervised texture detection in images

Applied Bayesian Nonparametrics 5. Spatial Models via Gaussian Processes, not MRFs Tutorial at CVPR 2012 Erik Sudderth Brown University

Stereo Correspondence with Occlusions using Graph Cuts

USING ONE GRAPH-CUT TO FUSE MULTIPLE CANDIDATE MAPS IN DEPTH ESTIMATION

Segmentation in electron microscopy images

Markov Random Fields with Efficient Approximations

Super Resolution Using Graph-cut

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Learning CRFs using Graph Cuts

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Prof. Trevor Darrell Lecture 20: Markov Random Fields

A Comparative Study of Energy Minimization Methods for Markov Random Fields

Efficient Parallel Optimization for Potts Energy with Hierarchical Fusion

Image Segmentation. Marc Pollefeys. ETH Zurich. Slide credits: V. Ferrari, K. Grauman, B. Leibe, S. Lazebnik, S. Seitz,Y Boykov, W. Freeman, P.

Markov Random Fields and Segmentation with Graph Cuts

Capacity Scaling for Graph Cuts in Vision

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

Graph Cuts. Srikumar Ramalingam School of Computing University of Utah

MEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

Announcements. Image Segmentation. From images to objects. Extracting objects. Status reports next Thursday ~5min presentations in class

Markov Networks in Computer Vision. Sargur Srihari

Graph Cut based Continuous Stereo Matching using Locally Shared Labels

GRAPH BASED SEGMENTATION WITH MINIMAL USER INTERACTION. Huaizhong Zhang, Ehab Essa, and Xianghua Xie

Prof. Feng Liu. Spring /17/2017. With slides by F. Durand, Y.Y. Chuang, R. Raskar, and C.

BIL Computer Vision Apr 16, 2014

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Data Term. Michael Bleyer LVA Stereo Vision

Markov Networks in Computer Vision

Fast Trust Region for Segmentation

Supplementary Material: Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Multi-view Stereo via Volumetric Graph-cuts and Occlusion Robust Photo-Consistency

Reduce, Reuse & Recycle: Efficiently Solving Multi-Label MRFs

Image Restoration using Markov Random Fields

Lecture 10: Multi view geometry

From Photohulls to Photoflux Optimization

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Comparison of energy minimization algorithms for highly connected graphs

Multi-view stereo. Many slides adapted from S. Seitz

In IEEE conference on Computer Vision and Pattern Recognition (CVPR), New York, June 2006, vol. I, p Active Graph Cuts

Image Segmentation continued Graph Based Methods. Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book

Joint 3D-Reconstruction and Background Separation in Multiple Views using Graph Cuts

Image Based Reconstruction II

Algorithms for Markov Random Fields in Computer Vision

Image Segmentation Using Iterated Graph Cuts BasedonMulti-scaleSmoothing

Data-driven Depth Inference from a Single Still Image

A Principled Deep Random Field Model for Image Segmentation

Lecture 16 Segmentation and Scene understanding

Segmentation Based Stereo. Michael Bleyer LVA Stereo Vision

Chaplin, Modern Times, 1936

In Defense of 3D-Label Stereo

Graph Cut Matching In Computer Vision

Iteratively Reweighted Graph Cut for Multi-label MRFs with Non-convex Priors

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Stereo. Many slides adapted from Steve Seitz

Volumetric stereo with silhouette and feature constraints

Efficient new-view synthesis using pairwise dictionary priors

Extensions of submodularity and their application in computer vision

Iterative MAP and ML Estimations for Image Segmentation

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Stereo: Disparity and Matching

Minimal Surfaces for Stereo

Multiple View Geometry

Geometric Reconstruction Dense reconstruction of scene geometry

Fast Approximate Energy Minimization via Graph Cuts

An Efficient Graph Cut Algorithm for Computer Vision Problems

CSE 417 Network Flows (pt 4) Min Cost Flows

SPM-BP: Sped-up PatchMatch Belief Propagation for Continuous MRFs. Yu Li, Dongbo Min, Michael S. Brown, Minh N. Do, Jiangbo Lu

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Mesh segmentation. Florent Lafarge Inria Sophia Antipolis - Mediterranee

Transcription:

Discrete Optimization Methods in Computer Vision CSE 6389 Slides by: Boykov Modified and Presented by: Mostafa Parchami Basic overview of graph cuts [Yuri Boykov, Olga Veksler, Ramin Zabih, Fast Approximation Energy Minimization via Graph Cuts, PAMI 01]

Disclaimer Can not possibly cover all discrete optimization methods widely used in computer vision in the last 30 years Mainly concentrate on Discrete energy minimization methods that can be applied to Markov Random Fields with binary or n-labels applicable to a wide spectrum of problems in vision Methods motivated by LP relaxations good bounds on the solutions

Discrete Optimization Methods in Computer Vision Basic overview of graph cuts binary labeling a few basic examples energy optimization submodularity (discrete view) continuous functionals (geometric view) posterior MRF energy (statistical view) extensions to multi-label problems interactions: convex, robust, metric move-based optimization

Markov Random Fields in Computer Vision Node y i : pixel label Edge: constrained pairs Cost to assign a label to each pixel Cost to assign a pair of labels to connected pixels Energy ( y;, data) i 1 ( yi;, data) 2( yi, y j;, data) i, jedges

Solving MRF with Graph Cuts Source (Label 0) Cost to assign to 0 Cost to split nodes Cost to assign to 1 Sink (Label 1) Energy ( y;, data) i 1 ( yi;, data) 2( yi, y j;, data) i, jedges

Solving MRF with Graph Cuts Cont. Source (Label 0) Cost to assign to 0 Cost to split nodes Cost to assign to 1 Sink (Label 1) Energy ( y;, data) i 1 ( yi;, data) 2( yi, y j;, data) i, jedges

2D Graph cut shortest path on a graph Example: find the shortest closed contour in a given domain of a graph Shortest paths approach (live wire, intelligent scissors) p Graph Cuts approach Compute the shortest path p ->p for a point p. Repeat for all points on the gray line. Then choose the optimal contour. Compute the minimum cut that separates red region from blue region

Graph cuts for optimal boundary detection (B&J, ICCV 01) hard constraint n-links t a cut Minimum cost cut can be computed in polynomial time (max-flow/min-cut algorithms) w s pq hard constraint I 2 pq exp 2 I pq

Standard minimum s-t cuts algorithms Augmenting paths [Ford & Fulkerson, 1962] Push-relabel [Goldberg-Tarjan, 1986] adapted to N-D grids used in computer vision Tree recycling (dynamic trees) [B&K, 2004] Flow recycling (dynamic cuts) [Kohli & Torr, 2005] Cut recycling (active cuts) [Juan & Boykov, 2006] Hierarchical methods - in search space [Lombaert et al., CVPR 2005] - in edge weights (capacity scaling) [Juan et al., ICCV07]

Optimal boundary in 2D max-flow = min-cut

Optimal boundary in 3D 3D bone segmentation (real time screen capture)

Graph cuts applied to multi-view reconstruction surface of good photoconsistency visual hull (silhouettes) CVPR 05 slides from Vogiatzis, Torr, Cippola

Adding regional properties (B&J, ICCV 01) D p (t) n-links t a cut w pq regional bias example I s suppose are given expected intensities of object and background s D p (s) t and I s 2 2 D ( s) exp I I / 2 NOTE: hard constrains are not required, in general. D p t 2 2 p( t) exp I p I / 2 p

Adding regional properties (B&J, ICCV 01) D p (t) n-links t a cut w pq s D p (s) expected intensities of object and background I s and can be re-estimated I t NOTE: hard constrains are not required, in general. EM-style optimization of piece-vise constant Mumford-Shah model D D s 2 2 p( s) exp I p I / 2 t 2 2 p( t) exp I p I / 2

Adding regional properties (B&J, ICCV 01) More generally, regional bias can be based on any intensity models of object and background D p (s) t D ( Lp) ln Pr( I p Lp a cut ) p Pr( I p t) D p (t) s Pr( I p s) given object and background intensity histograms I p I

Adding regional properties (B&J, ICCV 01)

Iterative learning of regional color-models GMMRF cuts (Blake et al., ECCV04) Grab-cut (Rother et al., SIGGRAPH 04) parametric regional model Gaussian Mixture (GM) designed to guarantee convergence

At least three ways to look at energy of graph cuts I: Binary submodular energy II: Approximating continuous surface functionals III: Posterior energy (MAP-MRF)

Simple example of energy Regional term Boundary term E( L) D ( L ) w ( L p p p t-links pqn pq n-links p L q ) D p (t) n-links t a cut L p { s, t} w pq s D p (s) binary object segmentation

Graph cuts for minimization of submodular binary energies I E( L) Regional term p E p ( L t-links Characterization of binary energies that can be globally minimized by s-t graph cuts [Boros&Hummer, 2002, K&Z 2004] p ) Boundary term pqn E( L p n-links, L q ) L p { s, t} E(L) can be minimized by s-t graph cuts E( s, s) E( t, t) E( s, t) E( t, s) Submodularity ( convexity ) Non-submodular cases can be addressed with some optimality guarantees, e.g. QPBO algorithm (see Boros&Hummer, 2002, Tavares et al. 06, Rother et al. 07)

Graph cuts for minimization of continuous surface functionals II E( C) C g( ) ds C N, v x ds f ( C) ( x) dp Geometric length any convex, symmetric metric g e.g. Riemannian Flux any vector field v Regional bias any scalar function f Characterization of energies of binary cuts C as functionals of continuous surfaces [B&K, ICCV 2003] [K&B, ICCV 2005]

One extension using parametric max-flow methods optimization of ratio functionals E( C) C C N, v x g( ) ds ds E( C) C ( C) g( ) ds f ( x) dp In 2D can use DP [Cox et al 96, Jermyn&Ishikawa 01] In 3D, [Kolmogorov, Boykov, Rother, ICCV 07]

Graph cuts for minimization of posterior energy Greig at al. [IJRSS, 1989] Posterior energy (MRF, Ising model) E( L) ln Pr( D ) p Lp V p Likelihood (data term) pqn pq ( L p, L Spatial prior (regularization) q ) L p { s, t} III Example: binary image restoration

Graph cuts algorithms can minimize multi-label energies as well

Problem Definition: Stereo Matching Triangulate on two images of the same scene point to recover depth. depth baseline Camera calibration Finding all correspondence Computing depth or surfaces left Right

Disparity labels labels Multi-scan-line stereo with s-t graph cuts (Roy&Cox 98) t cut cut L(p) y y s p x x

s-t graph-cuts for multi-label energy minimization Ishikawa 1998, 2000, 2003 Generalization of construction by Roy&Cox 1998 E L) p D p ( L p ) pqn V ( L Linear interactions Convex interactions p, L q ) L p R ( 1 V(dL) V(dL) dl=lp-lq dl=lp-lq

Pixel interactions V: convex vs. discontinuity-preserving Convex Interactions V Robust or discontinuity preserving Interactions V (weak membrane models, see a book by Blake and Zisserman, 87) linear model V(dL) dl=lp-lq V(dL) Potts model dl=lp-lq V(dL) V(dL) dl=lp-lq dl=lp-lq

Pixel interactions: convex vs. discontinuity-preserving linear V truncated linear V

Fast Approximate Energy Minimization Simple Way to Optimize - ICM 1. Choose a pixel 2. Fix the disparities at the rest of the pixels 3. Set the pixel to the optimal disparity 4. Iterate Advantages Energy always guaranteed to decrease Easy to code Disadvantages Convergence Scalability Make Bigger Moves!

Fast Approx. Energy Minimization Cont. The problem with ICM is that you can only modify one pixel at a time Get stuck in local minima easily We need a way of moving multiple pixels at once Boykov, Veksler and Zabih introduced two types of moves: alpha-beta swap alpha-expansion

Moving Policies ICM One pixel moves (From BKZ-PAMI 01)

Moving Policies Fix all nodes that aren't labelled alpha or beta With remaining nodes, find optimal swap Some alpha nodes change to beta Vice Versa Some stay the same (From BKZ-PAMI 01) alpha-beta swap

Moving Policies Any node can change to alpha (From BKZ-PAMI 01) alpha-expansion

a-expansion & a-b swap overview

alpha-beta graphs

alpha-beta swap properties

alpha-beta swap: proof

a-expansion NP-hard problem (3 or more labels) two labels can be solved via s-t cuts (Greig at. al., 1989) a-expansion approximation algorithm (Boykov, Veksler, Zabih 1998, 2001) guaranteed approximation quality (Veksler, thesis 2001) within a factor of 2 from the global minima (Potts model) applies to a wide class of energies with robust interactions Potts model (BVZ 1989) metric interactions (BVZ 2001) can be extended to arbitrary interactions with weaker guarantees truncation (Kolmogorov et al. 2005) QPBO (Boros and Hummer, 2002) Other move algorithms (e.g. a-b swap, jump-moves)

a-expansion algorithm 1. Start with any initial solution 2. For each label a in any (e.g. random) order 1. Compute optimal a-expansion move (s-t graph cuts) 2. Decline the move if there is no energy decrease 3. Stop when no expansion move would decrease energy

a-expansion move Basic idea: break multi-way cut computation into a sequence of binary s-t cuts a other labels

a-expansion moves In each a-expansion a given label a grabs space from other labels initial solution -expansion -expansion -expansion -expansion -expansion -expansion -expansion For each move we choose expansion that gives the largest decrease in the energy: binary optimization problem

Expansion move The difference between the optimal solution and the solution from the expansion move is bounded

Metric interactions V(a,b)=0 iff a=b V(a,b) = V(b,a) >= 0 V(a,c) <= V(a,b)+V(b,c) Triangular inequality Implies that every expansion move (a binary problem) is submodular V is called a metric if it obeys all 3 Can use Expansion Moves V is called a semi-metric if it obeys the last two Can only use Swap-Moves Subject of recent research.

alpha expansion algorithm

alpha expansion algorithm

2-D alpha expansion

a-expansions: examples of metric interactions noisy noisy shaded diamond diamond Potts V Truncated linear V

Multi-way graph cuts Multi-object Extraction

Multi-way graph cuts Stereo/Motion with slanted surfaces (Birchfield &Tomasi 1999) Labels = parameterized surfaces EM based: E step = compute surface boundaries M step = re-estimate surface parameters

Multi-way graph cuts stereo vision ground BVZ KZ 1998 truth 2002 depth map original pair of stereo images

Multi-way graph cuts Graph-cut textures (Kwatra, Schodl, Essa, Bobick 2003) B A B C D E F G H I J similar to image-quilting (Efros & Freeman, 2001)

a-expansions vs. simulated annealing normalized simulated annealing, correlation, start 19 for hours, annealing, 20.3% 24.7% err err Annealing a-expansions (BVZ 89,01) 90 seconds, 5.8% err Our method Smoothness Energy 100000 80000 60000 40000 20000 0 1 10 100 1000 10000 100000 Time in seconds