Multi-view stereo. Many slides adapted from S. Seitz

Similar documents
Some books on linear algebra

Image Based Reconstruction II

Prof. Trevor Darrell Lecture 18: Multiview and Photometric Stereo

Geometric Reconstruction Dense reconstruction of scene geometry

Multiple View Geometry

Dense 3D Reconstruction. Christiano Gava

Some books on linear algebra

3D Photography: Stereo Matching

Volumetric Scene Reconstruction from Multiple Views

Dense 3D Reconstruction. Christiano Gava

Multiview Reconstruction

BIL Computer Vision Apr 16, 2014

Multi-View Stereo for Static and Dynamic Scenes

Multi-View 3D-Reconstruction

Efficient View-Dependent Sampling of Visual Hulls

Shape from Silhouettes

EECS 442 Computer vision. Announcements

Shape from Silhouettes I

Shape from Silhouettes I

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Shape from Silhouettes II

Shape from Silhouettes I CV book Szelisky

CS5670: Computer Vision

Lecture 8 Active stereo & Volumetric stereo

Chaplin, Modern Times, 1936

Computer Vision Lecture 17

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Computer Vision Lecture 17

What have we leaned so far?

Hybrid Rendering for Collaborative, Immersive Virtual Environments

Image-based rendering using plane-sweeping modelisation

Step-by-Step Model Buidling

Visual Shapes of Silhouette Sets

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

Volumetric stereo with silhouette and feature constraints

Lecture 10: Multi view geometry

Multi-View 3D-Reconstruction

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

An Efficient Visual Hull Computation Algorithm

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Stereo vision. Many slides adapted from Steve Seitz

Stereo Vision. MAN-522 Computer Vision

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Structure from Motion

A Statistical Consistency Check for the Space Carving Algorithm.

Multiple View Geometry

Lecture 10: Multi-view geometry

Lecture 8 Active stereo & Volumetric stereo

Project Updates Short lecture Volumetric Modeling +2 papers

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Announcements. Stereo Vision Wrapup & Intro Recognition

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

Stereo Vision A simple system. Dr. Gerhard Roth Winter 2012

Image-Based Modeling and Rendering. Image-Based Modeling and Rendering. Final projects IBMR. What we have learnt so far. What IBMR is about

Final project bits and pieces

Polyhedral Visual Hulls for Real-Time Rendering

Real-Time Video-Based Rendering from Multiple Cameras

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Talk plan. 3d model. Applications: cultural heritage 5/9/ d shape reconstruction from photographs: a Multi-View Stereo approach

PERFORMANCE CAPTURE FROM SPARSE MULTI-VIEW VIDEO

Structured light 3D reconstruction

Multi-View Reconstruction using Narrow-Band Graph-Cuts and Surface Normal Optimization

Recap from Previous Lecture

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

Epipolar Geometry and Stereo Vision

Lecture 10 Multi-view Stereo (3D Dense Reconstruction) Davide Scaramuzza

Foundations of Image Understanding, L. S. Davis, ed., c Kluwer, Boston, 2001, pp

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

VOLUMETRIC MODEL REFINEMENT BY SHELL CARVING

CS5670: Computer Vision

Photo Tourism: Exploring Photo Collections in 3D

Dense Matching of Multiple Wide-baseline Views

Stereo. Many slides adapted from Steve Seitz

3D Surface Reconstruction from 2D Multiview Images using Voxel Mapping

Lecture 10 Dense 3D Reconstruction

Epipolar Geometry and Stereo Vision

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

View Synthesis using Convex and Visual Hulls. Abstract

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Acquisition and Visualization of Colored 3D Objects

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Projective Surface Refinement for Free-Viewpoint Video

Interactive Free-Viewpoint Video

Hardware-Accelerated Rendering of Photo Hulls

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Multi-view Stereo via Volumetric Graph-cuts and Occlusion Robust Photo-Consistency

Introduction à la vision artificielle X

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

What Do N Photographs Tell Us About 3D Shape?

But First: Multi-View Projective Geometry

SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE

Stereo: Disparity and Matching

IMAGE-BASED RENDERING

Lecture 9 & 10: Stereo Vision

Passive 3D Photography

Supplementary Material: Piecewise Planar and Compact Floorplan Reconstruction from Images

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Transcription:

Multi-view stereo Many slides adapted from S. Seitz

Beyond two-view stereo The third eye can be used for verification

Multiple-baseline stereo Pick a reference image, and slide the corresponding window along the corresponding epipolar lines of all other images, using inverse depth relative to the first image as the search parameter M. Okutomi and T. Kanade, A Multiple-Baseline Stereo System, IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353-363 (1993).

Multiple-baseline stereo For larger baselines, must search larger area in second image width of a pixel 1/z pixel matching score 1/z width of a pixel

Multiple-baseline stereo Use the sum of correlation scores to rank matches

Multiple-baseline stereo results I1 I2 I10 M. Okutomi and T. Kanade, A Multiple-Baseline Stereo System, IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353-363 (1993).

Summary: Multiple-baseline stereo Basic Approach Choose a reference view Plot SSD vs. inverse depth instead of disparity Replace two-view SSD with sum of SSD over all baselines Limitations Must choose a reference view Occlusions become an issue for large baseline Possible solution: use a virtual view

Plane Sweep Stereo Choose a virtual view Sweep family of planes at different depths with respect to the virtual camera input image input image composite virtual camera each plane defines an image composite homography R. Collins. A space-sweep approach to true multi-image matching. CVPR 1996.

Plane Sweep Stereo For each depth plane For each pixel in the composite image, compute the variance For each pixel, select the depth that gives the lowest variance

Plane Sweep Stereo For each depth plane For each pixel in the composite image, compute the variance For each pixel, select the depth that gives the lowest variance Can be accelerated using graphics hardware R. Yang and M. Pollefeys. Multi-Resolution Real-Time Stereo on Commodity Graphics Hardware, CVPR 2003

Volumetric stereo In plane sweep stereo, the sampling of the scene still depends on the reference view We can use a voxel volume to get a viewindependent representation

Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, color of points in V

Discrete formulation: Voxel Coloring Discretized Scene Volume Input Images (Calibrated) Goal: Assign RGB values to voxels in V photo-consistent with images

Photo-consistency A photo-consistent scene is a scene that exactly reproduces your input images from the same camera viewpoints You can t use your input cameras and images to tell the difference between a photo-consistent scene and the true scene True Scene Photo-Consistent Scenes All Scenes

Space Carving Image 1 Image N... Space Carving Algorithm Initialize to a volume V containing the true scene Choose a voxel on the current surface Project to visible input images Carve if not photo-consistent Repeat until convergence K. N. Kutulakos and S. M. Seitz, A Theory of Shape by Space Carving, ICCV 1999

Which shape do you get? V V True Scene Photo Hull The Photo Hull is the UNION of all photo-consistent scenes in V It is a photo-consistent scene reconstruction Tightest possible bound on the true scene Source: S. Seitz

Space Carving Results: African Violet Input Image (1 of 45) Reconstruction Reconstruction Reconstruction Source: S. Seitz

Space Carving Results: Hand Input Image (1 of 100) Views of Reconstruction

Reconstruction from Silhouettes The case of binary images: a voxel is photoconsistent if it lies inside the object s silhouette in all views Binary Images

Reconstruction from Silhouettes The case of binary images: a voxel is photoconsistent if it lies inside the object s silhouette in all views Binary Images Finding the silhouette-consistent shape (visual hull): Backproject each silhouette Intersect backprojected volumes

Volume intersection Reconstruction Contains the True Scene But is generally not the same

Voxel algorithm for volume intersection Color voxel black if on silhouette in every image

Properties of Volume Intersection Pros Easy to implement, fast Cons No concavities

Properties of Volume Intersection Pros Easy to implement, fast Cons No concavities Reconstruction is not photo-consistent if texture information is available Requires silhouette extraction

Polyhedral volume intersection B. Baumgart, Geometric Modeling for Computer Vision, Stanford Artificial Intelligence Laboratory, Memo no. AIM-249, Stanford University, October 1974.

Polyhedral volume intersection: Pros and cons Pros No voxelization artifacts Cons Depends on discretization of outlines Numerical problems when polygons to be intersected are almost coplanar Does not take advantage of epipolar geometry

Image-based visual hulls Pick a pixel in the virtual view Project corresponding visual ray into every other view to get a set of epipolar lines Find intervals where epipolar lines overlap with silhouettes Virtual view Lift intervals back onto the 3D ray and find their intersection Wojciech Matusik, Christopher Buehler, Ramesh Raskar, Steven Gortler, and Leonard McMillan. Image-based Visual Hulls. In SIGGRAPH 2000

Image-based visual hulls Wojciech Matusik, Christopher Buehler, Ramesh Raskar, Steven Gortler, and Leonard McMillan. Image-based Visual Hulls. In SIGGRAPH 2000

Image-based visual hulls: Pros and cons Pros Can work in real time Takes advantage of epipolar geometry Cons Need to recompute the visual hull every time the virtual view is changed

Understanding the shape of visual hulls In principle, we can use epipolar geometry to compute an exact representation of the visual hull in 3D S. Lazebnik, Y. Furukawa, and J. Ponce, Projective Visual Hulls, IJCV 2007.

Review: Multi-view stereo Multiple-baseline stereo Pick one input view as reference Inverse depth instead of disparity Plane sweep stereo Virtual view Volumetric stereo Photo-consistency Space carving Shape from silhouettes Visual hull: intersection of visual cones

Reconstruction from Silhouettes The case of binary images: a voxel is photoconsistent if it lies inside the object s silhouette in all views Binary Images

Reconstruction from Silhouettes The case of binary images: a voxel is photoconsistent if it lies inside the object s silhouette in all views Binary Images Finding the silhouette-consistent shape (visual hull): Backproject each silhouette Intersect backprojected volumes

Photo-consistency vs. silhouette-consistency True Scene Photo Hull Visual Hull

Understanding the shape of visual hulls What part of the visual hull belongs to the surface?

Understanding the shape of visual hulls The visual hull does not correspond to the true surface because the epipolar constraint is not valid for silhouette points X X x Point on x visual hull O O

Understanding the shape of visual hulls The visual hull does not correspond to the true surface because the epipolar constraint is not valid for silhouette points X X O x x Point on visual hull O

Understanding the shape of visual hulls The visual hull does not correspond to the true surface because the epipolar constraint is not valid for silhouette points Exception: frontier points X x x Epipolar tangency O O

Carved visual hulls The visual hull is a good starting point for optimizing photo-consistency Easy to compute Tight outer boundary of the object Parts of the visual hull (rims) already lie on the surface and are already photo-consistent Yasutaka Furukawa and Jean Ponce, Carved Visual Hulls for Image-Based Modeling, ECCV 2006.

Carved visual hulls 1. Compute visual hull 2. Use dynamic programming to find rims and constrain them to be fixed 3. Carve the visual hull to optimize photo-consistency Yasutaka Furukawa and Jean Ponce, Carved Visual Hulls for Image-Based Modeling, ECCV 2006.

Carved visual hulls Yasutaka Furukawa and Jean Ponce, Carved Visual Hulls for Image-Based Modeling, ECCV 2006.

Carved visual hulls: Pros and cons Pros Visual hull gives a reasonable initial mesh that can be iteratively deformed Cons Need silhouette extraction Have to compute a lot of points that don t lie on the object Finding rims is difficult The carving step can get caught in local minima Possible solution: use sparse feature correspondences as initialization

Feature-based stereo matching T. Tuytelaars and L. Van Gool, "Matching Widely Separated Views based on Affine Invariant Regions" Int. Journal on Computer Vision, 59(1), pp. 61-85, 2004.

Feature-based stereo matching Pros Robust to clutter and occlusion Only find matches at reliable points Can use invariant local features to deal with foreshortening, scale change, wide baselines Cons You only get a sparse cloud of points (or oriented patches), not a dense depth map or a complete surface

From feature matching to dense stereo 1. Extract features 2. Get a sparse set of initial matches 3. Iteratively expand matches to nearby locations 4. Use visibility constraints to filter out false matches 5. Perform surface reconstruction Yasutaka Furukawa and Jean Ponce, Accurate, Dense, and Robust Multi-View Stereopsis, CVPR 2007.

From feature matching to dense stereo http://www-cvr.ai.uiuc.edu/~yfurukaw/ Yasutaka Furukawa and Jean Ponce, Accurate, Dense, and Robust Multi-View Stereopsis, CVPR 2007.

Stereo from community photo collections M. Goesele, N. Snavely, B. Curless, H. Hoppe, S. Seitz, Multi-View Stereo for Community Photo Collections, ICCV 2007 http://grail.cs.washington.edu/projects/mvscpc/

Stereo from community photo collections stereo laser scan Comparison: 90% of points within 0.128 m of laser scan (building height 51m) M. Goesele, N. Snavely, B. Curless, H. Hoppe, S. Seitz, Multi-View Stereo for Community Photo Collections, ICCV 2007

Stereo from community photo collections Up to now, we ve always assumed that camera calibration is known For photos taken from the Internet, we need structure from motion techniques to reconstruct both camera positions and 3D points

Multi-view stereo: Summary Multiple-baseline stereo Pick one input view as reference Inverse depth instead of disparity Plane sweep stereo Virtual view Volumetric stereo Photo-consistency Space carving Shape from silhouettes Visual hull: intersection of visual cones Volumetric, polyhedral, image-based Carved visual hulls Feature-based stereo From sparse to dense correspondences