CS5670: Computer Vision

Similar documents
Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

Recap from Previous Lecture

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Multiple View Geometry

What have we leaned so far?

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Stereo vision. Many slides adapted from Steve Seitz

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Introduction to 3D Imaging: Perceiving 3D from 2D Images

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Chaplin, Modern Times, 1936

Lecture 14: Computer Vision

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Stereo. Many slides adapted from Steve Seitz

Computer Vision Lecture 17

Computer Vision Lecture 17

Introduction to Computer Vision. Week 10, Winter 2010 Instructor: Prof. Ko Nishino

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

BIL Computer Vision Apr 16, 2014

Image Based Reconstruction II

Stereo: Disparity and Matching

Stereo Matching.

Final project bits and pieces

Lecture 9 & 10: Stereo Vision

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Stereo and structured light

Epipolar Geometry and Stereo Vision

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Epipolar Geometry and Stereo Vision

Stereo and Epipolar geometry

Stereo Vision Computer Vision (Kris Kitani) Carnegie Mellon University

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Lecture 10: Multi view geometry

Cameras and Stereo CSE 455. Linda Shapiro

Lecture'9'&'10:'' Stereo'Vision'

Announcements. Stereo Vision Wrapup & Intro Recognition

Image Rectification (Stereo) (New book: 7.2.1, old book: 11.1)

Epipolar Geometry and Stereo Vision

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Epipolar Constraint. Epipolar Lines. Epipolar Geometry. Another look (with math).

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

6.819 / 6.869: Advances in Computer Vision Antonio Torralba and Bill Freeman. Lecture 11 Geometry, Camera Calibration, and Stereo.

Dense 3D Reconstruction. Christiano Gava

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Lecture 14: Basic Multi-View Geometry

Lecture 6 Stereo Systems Multi-view geometry

EE795: Computer Vision and Intelligent Systems

Epipolar Geometry and Stereo Vision

Stereo Vision A simple system. Dr. Gerhard Roth Winter 2012

Dense 3D Reconstruction. Christiano Gava

A virtual tour of free viewpoint rendering

Introduction à la vision artificielle X

Lecture 10: Multi-view geometry

Stereo Matching. Stereo Matching. Face modeling. Z-keying: mix live and synthetic

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Stereo imaging ideal geometry

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Miniature faking. In close-up photo, the depth of field is limited.

Step-by-Step Model Buidling

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Multiple View Geometry

EE795: Computer Vision and Intelligent Systems

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

CS6670: Computer Vision

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Computer Vision, Lecture 11

Last time: Disparity. Lecture 11: Stereo II. Last time: Triangulation. Last time: Multi-view geometry. Last time: Epipolar geometry

Geometric Reconstruction Dense reconstruction of scene geometry

INFO - H Pattern recognition and image analysis. Vision

Robert Collins CSE486, Penn State. Lecture 09: Stereo Algorithms

Stereo Vision. MAN-522 Computer Vision

Multi-view stereo. Many slides adapted from S. Seitz

Perception II: Pinhole camera and Stereo Vision

Rectification and Disparity

Camera Drones Lecture 3 3D data generation

Complex Sensors: Cameras, Visual Sensing. The Robotics Primer (Ch. 9) ECE 497: Introduction to Mobile Robotics -Visual Sensors

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Multiple Baseline Stereo

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

3D Photography: Stereo Matching

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Rectification and Distortion Correction

Capturing, Modeling, Rendering 3D Structures

CS6670: Computer Vision

Computer Vision I. Dense Stereo Correspondences. Anita Sellent 1/15/16

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Flexible Calibration of a Portable Structured Light System through Surface Plane

CS6670: Computer Vision

CSCI 5980: Assignment #3 Homography

CS201 Computer Vision Camera Geometry

Feature Transfer and Matching in Disparate Stereo Views through the use of Plane Homographies

Transcription:

CS5670: Computer Vision Noah Snavely, Zhengqi Li Stereo Single image stereogram, by Niklas Een

Mark Twain at Pool Table", no date, UCR Museum of Photography

Stereo Given two images from different viewpoints How can we compute the depth of each point in the image? Based on how much each pixel moves between the two images

How do we get 3D from Stereo Images? Perception of depth arises from disparity of a given 3D point in your right and left retinal images 3D point left image right image disparity: the difference in image location of the same 3D point when projected under perspective to two different cameras d = xleft -xright

real image point z E f x i Recall :Perspective Projection O camera lens D image of point B in front image This is the axis of the real image plane. O is the center of projection. This is the axis of the front image plane, which we use. Z x B 3D object point x i f = x z (from similar triangles)

Projection for Stereo Images Simple Model: Optic axes of 2 cameras are parallel image planes z camera baseline L O camera R b f f xl xr Z x-b z f = x xl X z f = x-b z xr f (from similar triangles) = y = yl y yr P=(x,z) Y-axis is perpendicular to the page.

3D from Stereo Images: Triangulation For stereo cameras with parallel optical axes, focal length f, baseline b, corresponding image points (xl,yl) and (xr,yr), the location of the 3D point can be derived from previous slide s equations: Depth z = f*b / (xl - xr) = f*b/d x = xl*z/f or b + xr*z/f y = yl*z/f or yr*z/f This method of determining depth from disparity d is called triangulation. Note that depth is inversely proportional to disparity

Depth z = f*b / (xl - xr) = f*b/d x = xl*z/f or b + xr*z/f y = yl*z/f or yr*z/f Two main problems: 1. Need to know focal length f, baseline b 2. - use prior knowledge or camera calibration (http://www.vision.caltech.edu/bouguetj/calib_doc ) Need to find corresponding point (xr,yr) for each (xl,yl) Correspondence problem

Q: Given a point in the left image, do you need to search the entire right image for the corresponding point?

Epipolar Constraint for Correspondence Epipolar plane = plane connecting C1, P C2, and point P. C1 b y1 P1 C2 P2 z1 y2 x z2 epipolar plane *Epipolar plane cuts through image planes forming an epipolar line in each plane *Match for P1 (or P2) in the other image must lie on epipolar line

Epipolar Constraint for Correspondence Match for P1 in the other image must lie on epipolar line So need search only along this line P P1 C1 b C2 Epipolar line

What if the optical axes of the 2 cameras are not parallel to each other? Does Epipolar constraint still holds?

Epipolar constraint still holds P e1 and e2 are the epipolar lines for point P P1 C1 y1 e1 x1 y2 e2 P2 x2 C2 But the epipolar lines may no longer be horizontal Java demo: http://www.ai.sri.com/~luong/research/meta3dviewer/epipolargeo.html

Example Yellow epipolar lines for the three points shown on the left image (from a slide by Pascal Fua) Given a point P1 in left image on epipolar line e1, can find epipolar line e2 provided we know relative orientations of cameras Requires camera calibration (see lecture 5)

Alternate approach: Stereo image rectification Reproject image planes onto a common plane parallel to the line between optical centers Epipolar line is horizontal after this transformation Two homographies ( 3x3 transforms), one for each input image reprojection, is computed. See: C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

Original stereo pair After rectification

Epipolar geometry epipolar lines (x 1, y 1 ) (x 2, y 1 ) Two images captured by a purely horizontal translating camera (rectified stereo pair) x 1 x 2 = the disparity of pixel (x 1, y 1 )

Your basic stereo algorithm For each epipolar line For each pixel in the left image compare with every pixel on same epipolar line in right image pick pixel with minimum match cost Improvement: match windows

Matching using Sum of Squared Differences (SSD)

Stereo matching based on SSD SSD Best matching disparity d min d

Window size Effect of window size Smaller window + Larger window + W = 3 W = 21 Better results with adaptive window T. Kanade and M. Okutomi, A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment,, Proc. International Conference on Robotics and Automation, 1991. D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2):155-174, July 1998

Problems with window size Input stereo pair W = 3 Effect of window size W Smaller window + Good precision, more detail Sensitive to noise Larger window + Robust to noise Reduced precision, less detail W = 21

Stereo results Data from University of Tsukuba Similar results on other images without ground truth Scene Ground truth

Results with window search Window-based matching (best window size) Ground truth

Better methods exist... State of the Art method Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999. Ground truth For the latest and greatest: http://www.middlebury.edu/stereo/

Stereo as energy minimization What defines a good stereo correspondence? 1. Match quality Want each pixel to find a good match in the other image 2. Smoothness If two pixels are adjacent, they should (usually) move about the same amount

Stereo as energy minimization Find disparity map d that minimizes an energy function Simple pixel / window matching SSD distance between windows I(x, = y) and J(x + d(x,y), y)

Stereo as energy minimization I(x, y) J(x, y) y = 141 d x C(x, y, d); the disparity space image (DSI)

Stereo as energy minimization y = 141 d x Simple pixel / window matching: choose the minimum of each column in the DSI independently:

{ { Stereo as energy minimization Better objective function match cost Want each pixel to find a good match in the other image smoothness cost Adjacent pixels should (usually) move about the same amount

Stereo as energy minimization match cost: smoothness cost: : set of neighboring pixels 4-connected neighborhood 8-connected neighborhood

Smoothness cost How do we choose V? L 1 distance Potts model

Dynamic programming Can minimize this independently per scanline using dynamic programming (DP) : minimum cost of solution such that d(x,y) = d

Dynamic programming y = 141 d x Finds smooth path through DPI from left to right

Dynamic Programming

Depth from disparity X x x z f f C baseline C

Questions?

Real-time stereo Nomad robot searches for meteorites in Antartica http://www.frc.ri.cmu.edu/projects/meteorobot/index.html Used for robot navigation (and other tasks) Several software-based real-time stereo techniques have been developed (most based on simple discrete search)

Stereo reconstruction pipeline Steps Calibrate cameras Rectify images Compute disparity Estimate depth What will cause errors? Camera calibration errors Poor image resolution Occlusions Violations of brightness constancy (specular reflections) Large motions Low-contrast image regions

Active stereo with structured light Li Zhang s one-shot stereo camera 1 camera 1 projector projector camera 2 Project structured light patterns onto the object simplifies the correspondence problem

Laser scanning Digital Michelangelo Project http://graphics.stanford.edu/projects/mich/ Optical triangulation Project a single stripe of laser light Scan it across the surface of the object This is a very precise version of structured light scanning

Laser scanned models The Digital Michelangelo Project, Levoy et al.

Laser scanned models The Digital Michelangelo Project, Levoy et al.

Laser scanned models The Digital Michelangelo Project, Levoy et al.

Laser scanned models The Digital Michelangelo Project, Levoy et al.