Dense 3D Reconstruction. Christiano Gava

Similar documents
Dense 3D Reconstruction. Christiano Gava

Multiple View Geometry

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

Multi-view stereo. Many slides adapted from S. Seitz

Computer Vision Lecture 17

Computer Vision Lecture 17

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

EE795: Computer Vision and Intelligent Systems

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Step-by-Step Model Buidling

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Lecture 14: Basic Multi-View Geometry

Stereo and Epipolar geometry

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Application questions. Theoretical questions

Two-view geometry Computer Vision Spring 2018, Lecture 10

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Lecture 10 Dense 3D Reconstruction

Stereo Vision. MAN-522 Computer Vision

calibrated coordinates Linear transformation pixel coordinates

Chaplin, Modern Times, 1936

Lecture 10 Multi-view Stereo (3D Dense Reconstruction) Davide Scaramuzza

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Geometric Reconstruction Dense reconstruction of scene geometry

Multi-View Stereo for Static and Dynamic Scenes

Cameras and Stereo CSE 455. Linda Shapiro

Final project bits and pieces

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

BIL Computer Vision Apr 16, 2014

Geometric camera models and calibration

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

Image Based Reconstruction II

Multi-stable Perception. Necker Cube

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Multi-View 3D-Reconstruction

What have we leaned so far?

Rectification and Disparity

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

EE795: Computer Vision and Intelligent Systems

Lecture 9: Epipolar Geometry

CS5670: Computer Vision

Epipolar Geometry and Stereo Vision

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Stereo vision. Many slides adapted from Steve Seitz

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Stereo Vision II: Dense Stereo Matching

CS231A Midterm Review. Friday 5/6/2016

Lecture 14: Computer Vision

Lecture 9 & 10: Stereo Vision

Computer Vision I. Dense Stereo Correspondences. Anita Sellent 1/15/16

arxiv: v1 [cs.cv] 28 Sep 2018

CS5670: Computer Vision

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

3D Photography: Stereo Matching

Epipolar Geometry and Stereo Vision

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

Miniature faking. In close-up photo, the depth of field is limited.

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

Lecture 10: Multi view geometry

Multi-view Stereo. Ivo Boyadzhiev CS7670: September 13, 2011

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Depth from two cameras: stereopsis

A virtual tour of free viewpoint rendering

KinectFusion: Real-Time Dense Surface Mapping and Tracking

Perception II: Pinhole camera and Stereo Vision

Lecture 10: Multi-view geometry

Epipolar Geometry CSE P576. Dr. Matthew Brown

Image processing and features

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Announcements. Stereo Vision Wrapup & Intro Recognition

Project Updates Short lecture Volumetric Modeling +2 papers

International Journal for Research in Applied Science & Engineering Technology (IJRASET) A Review: 3D Image Reconstruction From Multiple Images

Multiple View Geometry

Final Exam Study Guide

Two-View Geometry (Course 23, Lecture D)

Lecture'9'&'10:'' Stereo'Vision'

CSE 252B: Computer Vision II

Depth from two cameras: stereopsis

Direct Methods in Visual Odometry

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 263

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Prof. Trevor Darrell Lecture 18: Multiview and Photometric Stereo

CS6670: Computer Vision

3D Computer Vision. Structure from Motion. Prof. Didier Stricker

Stereo matching. Francesco Isgrò. 3D Reconstruction and Stereo p.1/21

Announcements. Stereo

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC

Reminder: Lecture 20: The Eight-Point Algorithm. Essential/Fundamental Matrix. E/F Matrix Summary. Computing F. Computing F from Point Matches

CEng Computational Vision

Stereo imaging ideal geometry

CS201 Computer Vision Camera Geometry

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Structure from motion

Rectification and Distortion Correction

3D Photography: Active Ranging, Structured Light, ICP

Transcription:

Dense 3D Reconstruction Christiano Gava christiano.gava@dfki.de

Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction The matching problem 2-view reconstruction Multi-view reconstruction Next lecture: structured light 12/11/2012 Lecture 3D Computer Vision 2

Why dense reconstruction? Accurate 3D models cultural heritage! Reconstruction of houses, buildings, famous touristic sites Piazza San Marco Venice 13.703 images 27.707.825 points Interesting! But how can we go from pictures to 3D models? 12/11/2012 Lecture 3D Computer Vision 3

Overview I 12/11/2012 Lecture 3D Computer Vision 4

Overview II 12/11/2012 Lecture 3D Computer Vision 5

Dense 3D reconstruction When we take a picture, a 3D scene is projected onto a 2D image loss of depth information 3D reconstruction is the inverse process: build the 3D scene from a set of 2D images recover depth information How can we do that? MATCHING (difficult) + TRIANGULATION (previous lecture) 12/11/2012 Lecture 3D Computer Vision 6

Two Subproblems In general, 3D reconstruction can be divided into two major problems: Matching finding corresponding elements among the images Triangulation establishing 3D coordinates based on 2D image correspondences Which image entities should we match? 12/11/2012 Lecture 3D Computer Vision 7

Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction The matching problem 2-view reconstruction Multi-view reconstruction Next lecture: structured light 12/11/2012 Lecture 3D Computer Vision 8

Matching Harris corners Lines/edges Feature-based matching SIFT keypoints, etc. Suitable for Structure and Motion But we are interested in dense reconstruction we need dense matching! match as many pixels as possible 12/11/2012 Lecture 3D Computer Vision 9

Dense matching Choice of camera setup Small baseline X wide baseline Matching is relatively easy Higher depth uncertainty Matching is hard Lower depth uncertainty 12/11/2012 Lecture 3D Computer Vision 10

Dense matching For every point on an image, there are many possible matches on the other image(s) Many points look similar high ambiguity What can we do? 12/11/2012 Lecture 3D Computer Vision 11

Dense matching We can use camera geometry finding the correspondence becomes a 1D search problem thanks to the epipolar geometry 12/11/2012 Lecture 3D Computer Vision 12

Dense matching Even using the epipolar constraint, there are many possible matches? Use of pixel neighborhood information correlation-based approaches (also called window-based approaches) 12/11/2012 Lecture 3D Computer Vision 13

Common window-based approaches Normalized Cross-Correlation (NCC) Sum of Squared Differences (SSD) Sum of Absolute Differences (SAD) 12/11/2012 Lecture 3D Computer Vision 14

Recall NCC write regions as vectors region A region B a b vector a vector b 12/11/2012 Lecture 3D Computer Vision 15

Window-based matching Search along the epipolar line for the best match Best match: - Maximum NCC - Minimum SSD or SAD 12/11/2012 Lecture 3D Computer Vision 16

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/11/2012 Lecture 3D Computer Vision 17

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 18

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 19

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 20

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 21

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 22

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 23

Dense matching challenges Scene elements do not always look the same in the images Camera-related problems Image noise Lens distortion Color/chromatic aberration Viewpoint-related problems Perspective distortions Occlusions Specular reflections Scene-related problems Illumination changes Moving objects 12/12/2012 Lecture 3D Computer Vision 24

Dense matching challenges Matching ambiguity Low textured regions Repetitive texture pattern Some assumptions are made 12/11/2012 Lecture 3D Computer Vision 25

Assumptions Appearance The projections of a scene (3D) patch should have similar appearances in all images Uniqueness A point in an image will have only one match in another image Ordering Order of appearance (e.g. left to right) is not altered by camera displacement Smoothness Depth varies smoothly (objects have smooth surfaces) 12/11/2012 Lecture 3D Computer Vision 26

Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction The matching problem 2-view reconstruction Multi-view reconstruction Next lecture: structured light 12/11/2012 Lecture 3D Computer Vision 27

2-view reconstruction Input: a pair of images (usually left and right) Output: a disparity or depth map left image right image depth map disparity 1 depth 12/11/2012 Lecture 3D Computer Vision 28

2-view reconstruction General case: converging cameras Need to use epipolar geometry This geometry can be simplified by rectification One of the images Both images 12/11/2012 Lecture 3D Computer Vision 29

Image rectification Rectifying one image Image mapping is a 2D homography (projective transformation) Rectifying both images Project images onto plane parallel to baseline epipolar plane 12/11/2012 Lecture 3D Computer Vision 30

Example of rectifying both images 12/11/2012 Lecture 3D Computer Vision 31

Parallel cameras Triangulation reduces to a simple geometric problem x Z x d 12/11/2012 Lecture 3D Computer Vision 32

General dense correspondence algorithm For each pixel in the left image compute the neighbourhood cross correlation along the corresponding epipolar line in the right image the corresponding pixel is the one with the highest cross correlation Parameters size (scale) of neighbourhood search disparity Other constraints uniqueness ordering smoothness of depth field Applicability textured scene, largely fronto-parallel 12/11/2012 Lecture 3D Computer Vision 33

Example with NCC epipolar line 12/11/2012 Lecture 3D Computer Vision 34

Example with NCC left image band right image band 1 0.5 cross correlation 0 12/11/2012 Lecture 3D Computer Vision 35 x

Example with NCC target region left image band right image band 1 0.5 cross correlation 0 Why is it not as good as before? 12/11/2012 Lecture 3D Computer Vision 36 x

2-view reconstruction 1.The neighbourhood region does not have a distinctive spatial intensity distribution 2.Foreshortening effects fronto-parallel surface imaged length the same slanting surface imaged lengths differ

Occlusion surface slice 1 2 3 4,5 6 1 2,3 4 5 6 12/11/2012 Lecture 3D Computer Vision 38

Outline Previous lecture: structure and motion II Structure and motion loop Triangulation Wide baseline matching (SIFT) Today: dense 3D reconstruction The matching problem 2-view reconstruction Multi-view reconstruction Next lecture: structured light 12/11/2012 Lecture 3D Computer Vision 39

Multi-view reconstruction Many views of the same scene More images More constraints! x 1 x 2 x l 3 31 l 32 l 31 = F T 13 x 1 l 32 = F T 23 x 2 12/11/2012 Lecture 3D Computer Vision 40

Multi-view reconstruction More constraints Less occlusions More robust Appearance constraint can be relaxed However More complex Strong perspective distortions Propagation of calibration uncertainties Dense matching along many images is still hard Precise calibration is essential! 12/11/2012 Lecture 3D Computer Vision 41

Multi-view triangulation Assuming we have found correct matches along n images, how do we triangulate them? X j Exactly as we did before! Remember the algebraic solution x 1j Stack equations for all camera views: x 3j C 1 x 2j C 2 C 3 12/11/2012 Lecture 3D Computer Vision 42

Multi-view reconstruction MATCHING SCENE REPRESENTATION TRIANGULATION 12/11/2012 Lecture 3D Computer Vision 43

Scene Representation Voxel grid Depth map Mesh Point cloud 12/11/2012 Lecture 3D Computer Vision 44

Divide the scene into cubes Voxel grid Needs to sample the 3D space Voxels are marked as occupied/empty according to a photometric consistency measure (NCC, SSD ) Accuracy x size of voxels No assumption about the scene Memory allocation issues 12/12/2012 Lecture 3D Computer Vision 45

Divide the scene into cubes Voxel grid Needs to sample the 3D space Voxels are marked as occupied/empty according to a photometric consistency measure (NCC, SSD ) Accuracy x size of voxels No assumption about the scene Memory allocation issues 12/11/2012 Lecture 3D Computer Vision 46

Divide the scene into cubes Voxel grid Needs to sample the 3D space Voxels are marked as occupied/empty according to a photometric consistency measure (NCC, SSD ) Accuracy x size of voxels No assumption about the scene Memory allocation issues 12/12/2012 Lecture 3D Computer Vision 47

Divide the scene into cubes Voxel grid Needs to sample the 3D space Voxels are marked as occupied/empty according to a photometric consistency measure (NCC, SSD ) Accuracy x size of voxels No assumption about the scene Memory allocation issues 12/12/2012 Lecture 3D Computer Vision 48

Usually one depth map per view (by matching and triangulating features in small sets of images) Depth map No sampling of 3D space Simple Ideal for multi-core Depth maps have to be merged 12/11/2012 Lecture 3D Computer Vision 49

Usually one depth map per view (by matching and triangulating features in small sets of images) Depth map No sampling of 3D space Simple Ideal for multi-core Depth maps have to be merged 12/12/2012 Lecture 3D Computer Vision 50

Point cloud Flexible Points are independent Global or local Allows divide and conquer Sparse or dense May be noisy 12/11/2012 Lecture 3D Computer Vision 51

Point cloud Flexible Points are independent Global or local Allows divide and conquer Sparse or dense May be noisy 12/12/2012 Lecture 3D Computer Vision 52

Point cloud Flexible Points are independent Global or local Allows divide and conquer Sparse or dense May be noisy 12/12/2012 Lecture 3D Computer Vision 53

Point cloud 12/11/2012 Lecture 3D Computer Vision 54

Mesh Models the surface as a connected set of planar facets Usually triangular meshes Triangles are local good approximations of the surface Less noisy Good for optimization (mesh refinement) May become difficult to handle for complex surfaces 12/11/2012 Lecture 3D Computer Vision 55

Mesh Models the surface as a connected set of planar facets Usually triangular meshes Triangles are local good approximations of the surface Less noisy Good for optimization (mesh refinement) May become difficult to handle for complex surfaces 12/12/2012 Lecture 3D Computer Vision 56

Mesh Models the surface as a connected set of planar facets Usually triangular meshes Triangles are local good approximations of the surface Less noisy Good for optimization (mesh refinement) May become difficult to handle for complex surfaces 12/12/2012 Lecture 3D Computer Vision 57

Mesh 12/11/2012 Lecture 3D Computer Vision 58

Mesh 12/11/2012 Lecture 3D Computer Vision 59

Interesting links Middleburry stereo data set: http://vision.middlebury.edu/stereo/data/ Middlebury multi-view data set: http://vision.middlebury.edu/mview/ PMVS version 2: http://grail.cs.washington.edu/software/pmvs/ CVLab multi-view data sets: http://cvlab.epfl.ch/data/strechamvs/ http://cvlab.epfl.ch/~strecha/multiview/densemvs.html http://cvlab.epfl.ch/research/surface/emvs/ 12/12/2012 Lecture 3D Computer Vision 60

References Furukawa, Y. & Ponce, J. Accurate, Dense and Robust Multi-View Stereopsis, TPAMI, 2008, 01, 1-14 Furukawa, Y.; Curless, B.; Seitz, S. M. & Szeliski, R. Towards Internet- Scale Multi-View Stereo, CVPR, 2010 Hiep, V. H.; Keriven, R.; Labatut, P. & Pons, J.-P. Towards highresolution large-scale multi-view stereo, CVPR, 2009, 1430-1437 Goesele, M.; Snavely, N.; Curless, B.; Hoppe, H. & Seitz, S. M. Multi- View Stereo for Community Photo Collections, ICCV, 2007, 0, 1-8 Hartley, R. I. & Zisserman, A. Multiple View Geometry in Computer Vision, Cambridge University Press, ISBN: 0521540518, 2004 12/12/2012 Lecture 3D Computer Vision 61

Organisational information Oral exams: Qualification: A minimum average score of 60% in the exercises Period: 11.03.2013-15.03.2013 DFKI, room 1.27 Registration: email to Leivy Michelly Kaul (kaul@dfki.uni-kl.de) to get an appointment (mention days where you can t make it). Lectures: 16.01.2013: no lecture; you are expected to work through a scientific paper based on central questions (will be uploaded to the website shortly before). The paper will be the basis for the lecture at 23.01.2013. The last lecture will be a question lecture and can be moved from 30.01.2013 to a date closer to the actual exam period (will be coordinated by Johannes and Tobias in the last exercise session). Lecture 3D Computer Vision 62

Thank you!

JUST A REMINDER 12/11/2012 Lecture 3D Computer Vision 64

Triangulation If images are not rectified X j x 1j x 2j C 1 C 2 12/11/2012 Lecture 3D Computer Vision 65

Triangulation Given: corresponding measured (i.e. noisy) feature points x and x and camera poses, P and P Problem: in the presence of noise, back projected rays do not intersect Rays are skew in space and measured points do not lie on corresponding epipolar lines 12/11/2012 Lecture 3D Computer Vision 66

Triangulation: algebraic solution Equation for one camera view (camera pose P and feature point x) 4 entries, 3 DOF i th row of P (3x4) matrix 3 equations, only 2 linearly independent drop third row 12/11/2012 Lecture 3D Computer Vision 67

Triangulation: algebraic solution Stack equations for all camera views: X has 3 DOF (4 parameters due to homogeneous representation) One camera view gives two linearly independent equations Two camera views are sufficient for a minimal solution Solution for X is eigenvector of A T A corresponding to smallest eigenvalue 68

Triangulation: minimize geometric error Estimate 3D point, which exactly satisfies the supplied camera geometry P and P, so it projects as where and are closest to the actual image measurements Assumes perfect camera poses! 12/11/2012 Lecture 3D Computer Vision 69