Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Similar documents
Stereo. Many slides adapted from Steve Seitz

Stereo vision. Many slides adapted from Steve Seitz

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Stereo: Disparity and Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

BIL Computer Vision Apr 16, 2014

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Epipolar Geometry and Stereo Vision

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

Epipolar Geometry and Stereo Vision

Chaplin, Modern Times, 1936

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Computer Vision Lecture 17

Computer Vision Lecture 17

Miniature faking. In close-up photo, the depth of field is limited.

Epipolar Geometry and Stereo Vision

Final project bits and pieces

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Lecture 9 & 10: Stereo Vision

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Image Based Reconstruction II

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Introduction à la vision artificielle X

Lecture 10: Multi view geometry

Lecture 10: Multi-view geometry

CS5670: Computer Vision

Epipolar Geometry and Stereo Vision

Lecture'9'&'10:'' Stereo'Vision'

Recap from Previous Lecture

Cameras and Stereo CSE 455. Linda Shapiro

Multiple View Geometry

Image Rectification (Stereo) (New book: 7.2.1, old book: 11.1)

Lecture 6 Stereo Systems Multi-view geometry

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

6.819 / 6.869: Advances in Computer Vision Antonio Torralba and Bill Freeman. Lecture 11 Geometry, Camera Calibration, and Stereo.

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Multiple View Geometry

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Recovering structure from a single view Pinhole perspective projection

Kinect Device. How the Kinect Works. Kinect Device. What the Kinect does 4/27/16. Subhransu Maji Slides credit: Derek Hoiem, University of Illinois

Mosaics wrapup & Stereo

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Lecture 14: Computer Vision

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Lecture 14: Basic Multi-View Geometry

Spatial track: range acquisition modeling

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

A virtual tour of free viewpoint rendering

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

What have we leaned so far?

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Dense 3D Reconstruction. Christiano Gava

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Stereo and structured light

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Camera Geometry II. COS 429 Princeton University

Stereo Vision. MAN-522 Computer Vision

Rectification and Disparity

Perception II: Pinhole camera and Stereo Vision

CEng Computational Vision

Stereo and Epipolar geometry

Dense 3D Reconstruction. Christiano Gava

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

Rectification. Dr. Gerhard Roth

Introduction to Computer Vision. Week 10, Winter 2010 Instructor: Prof. Ko Nishino

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

CS201 Computer Vision Camera Geometry

Stereo CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Lecture 9: Epipolar Geometry

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

Undergrad HTAs / TAs. Help me make the course better! HTA deadline today (! sorry) TA deadline March 21 st, opens March 15th

Depth from two cameras: stereopsis

Announcements. Stereo Vision Wrapup & Intro Recognition

Stereo Image Rectification for Simple Panoramic Image Generation

Passive 3D Photography

Stereo Vision Computer Vision (Kris Kitani) Carnegie Mellon University

Reminder: Lecture 20: The Eight-Point Algorithm. Essential/Fundamental Matrix. E/F Matrix Summary. Computing F. Computing F from Point Matches

Two-view geometry Computer Vision Spring 2018, Lecture 10

Depth from two cameras: stereopsis

Ninio, J. and Stevens, K. A. (2000) Variations on the Hermann grid: an extinction illusion. Perception, 29,

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Structure from Motion and Multi- view Geometry. Last lecture

CS 1674: Intro to Computer Vision. Midterm Review. Prof. Adriana Kovashka University of Pittsburgh October 10, 2016

Multi-view stereo. Many slides adapted from S. Seitz

Step-by-Step Model Buidling

Flexible Calibration of a Portable Structured Light System through Surface Plane

EE795: Computer Vision and Intelligent Systems

Epipolar geometry. x x

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

Epipolar geometry contd.

Lecture 8 Active stereo & Volumetric stereo

3D Photography: Stereo Matching

Transcription:

Stereo 11/02/2012 CS129, Brown James Hays Slides by Kristen Grauman

Multiple views Multi-view geometry, matching, invariant features, stereo vision Lowe Hartley and Zisserman

Why multiple views? Structure and depth are inherently ambiguous from single views. Images from Lana Lazebnik

Why multiple views? Structure and depth are inherently ambiguous from single views. P1 P2 P1 =P2 Optical center

What cues help us to perceive 3d shape and depth?

[Figure from Prados & Faugeras 2006] Shading

Focus/defocus Images from same point of view, different camera parameters 3d shape / depth estimates [figs from H. Jin and P. Favaro, 2002]

Texture [From A.M. Loh. The recovery of 3-D structure using visual texture patterns. PhD thesis]

Perspective effects Image credit: S. Seitz

Motion Figures from L. Zhang http://www.brainconnection.com/teasers/?main=illusion/motion-shape

Estimating depth with stereo Stereo: shape from motion between two views We ll need to consider: Info on camera pose ( calibration ) Image point correspondences scene point optical center image plane

Stereo vision Two cameras, simultaneous views Single moving camera and static scene

Camera parameters Camera frame 2 Extrinsic parameters: Camera frame 1 Camera frame 2 Camera frame 1 Intrinsic parameters: Image coordinates relative to camera Pixel coordinates Extrinsic params: rotation matrix and translation vector Intrinsic params: focal length, pixel sizes (mm), image center point, radial distortion parameters We ll assume for now that these parameters are given and fixed.

Geometry for a simple stereo system First, assuming parallel optical axes, known camera parameters (i.e., calibrated cameras):

World point image point (left) Depth of p image point (right) Focal length optical center (left) baseline optical center (right)

Geometry for a simple stereo system Assume parallel optical axes, known camera parameters (i.e., calibrated cameras). What is expression for Z? Similar triangles (p l, P, p r ) and (O l, P, O r ): T x l Z f x r T Z disparity Z f T x r x l

Depth from disparity image I(x,y) Disparity map D(x,y) image I (x,y ) (x,y )=(x+d(x,y), y) So if we could find the corresponding points in two images, we could estimate relative depth

General case, with calibrated cameras The two cameras need not have parallel optical axes. Vs.

Stereo correspondence constraints Given p in left image, where can corresponding point p be?

Stereo correspondence constraints

Epipolar constraint Geometry of two views constrains where the corresponding pixel for some image point in the first view must occur in the second view. It must be on the line carved out by a plane connecting the world point and optical centers.

Epipolar geometry Epipolar Plane Epipolar Line Epipole Baseline Epipole http://www.ai.sri.com/~luong/research/meta3dviewer/epipolargeo.html

Epipolar geometry: terms Baseline: line joining the camera centers Epipole: point of intersection of baseline with image plane Epipolar plane: plane containing baseline and world point Epipolar line: intersection of epipolar plane with the image plane All epipolar lines intersect at the epipole An epipolar plane intersects the left and right image planes in epipolar lines Why is the epipolar constraint useful?

Epipolar constraint This is useful because it reduces the correspondence problem to a 1D search along an epipolar line. Image from Andrew Zisserman

Example

What do the epipolar lines look like? 1. O l O r 2. O l O r

Figure from Hartley & Zisserman Example: converging cameras

Example: parallel cameras Where are the epipoles? Figure from Hartley & Zisserman

Example: Forward motion What would the epipolar lines look like if the camera moves directly forward?

Example: Forward motion e e Epipole has same coordinates in both images. Points move along lines radiating from e: Focus of expansion

Stereo image rectification

Stereo image rectification Reproject image planes onto a common plane parallel to the line between camera centers Pixel motion is horizontal after this transformation Two homographies (3x3 transform), one for each input image reprojection C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. IEEE Conf. Computer Vision and Pattern Recognition, 1999.

Rectification example

The correspondence problem Epipolar geometry constrains our search, but we still have a difficult correspondence problem.

Basic stereo matching algorithm If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find corresponding epipolar scanline in the right image Examine all pixels on the scanline and pick the best match x Compute disparity x-x and set depth(x) = fb/(x-x )

Correspondence search Left Right scanline Matching cost disparity Slide a window along the right scanline and compare contents of that window with the reference window in the left image Matching cost: SSD or normalized correlation

Correspondence search Left Right scanline SSD

Correspondence search Left Right scanline Norm. corr

Effect of window size Smaller window + More detail More noise W = 3 W = 20 Larger window + Smoother disparity maps Less detail

Failures of correspondence search Textureless surfaces Occlusions, repetition Non-Lambertian surfaces, specularities

Results with window search Data Window-based matching Ground truth

Stereo as energy minimization What defines a good stereo correspondence? 1. Match quality Want each pixel to find a good match in the other image 2. Smoothness If two pixels are adjacent, they should (usually) move about the same amount

Scanline stereo Try to coherently match pixels on the entire scanline Different scanlines are still optimized independently Left image Right image

Left occlusion Shortest paths for scan-line stereo Left image I Right image I S left Right occlusion q C occl t s p S right C occl C corr Can be implemented with dynamic programming Ohta & Kanade 85, Cox et al. 96 Slide credit: Y. Boykov

Coherent stereo on 2D grid Scanline stereo generates streaking artifacts Can t use dynamic programming to find spatially coherent disparities/ correspondences on a 2D grid

Stereo matching as energy minimization I 1 I 2 D W 1 (i ) W 2 (i+d(i )) D(i ) 2 W 1 ( i) W2 ( i D( i)) D( i) D( j) E( D) i neighborsi, j data term smoothness term Energy functions of this form can be minimized using graph cuts Y. Boykov, O. Veksler, and R. Zabih, Fast Approximate Energy Minimization via Graph Cuts, PAMI 2001

Many of these constraints can be encoded in an energy function and solved using graph cuts Before Graph cuts Ground truth Y. Boykov, O. Veksler, and R. Zabih, Fast Approximate Energy Minimization via Graph Cuts, PAMI 2001 For the latest and greatest: http://www.middlebury.edu/stereo/

Active stereo with structured light Project structured light patterns onto the object Simplifies the correspondence problem Allows us to use only one camera camera projector L. Zhang, B. Curless, and S. M. Seitz. Rapid Shape Acquisition Using Color Structured Light and Multi-pass Dynamic Programming. 3DPVT 2002

Laser scanning Digital Michelangelo Project http://graphics.stanford.edu/projects/mich/ Optical triangulation Project a single stripe of laser light Scan it across the surface of the object This is a very precise version of structured light scanning

Kinect: Structured infrared light http://bbzippo.wordpress.com/2010/11/28/kinect-in-infrared/

Summary: Key idea: Epipolar constraint X X X x x x x Potential matches for x have to lie on the corresponding line l. Potential matches for x have to lie on the corresponding line l.

Summary Epipolar geometry Epipoles are intersection of baseline with image planes Matching point in second image is on a line passing through its epipole Fundamental matrix maps from a point in one image to a line (its epipolar line) in the other Can solve for F given corresponding points (e.g., interest points) Stereo depth estimation Estimate disparity by finding corresponding points along scanlines Depth is inverse to disparity