Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Similar documents
CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Stereo: Disparity and Matching

Chaplin, Modern Times, 1936

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Final project bits and pieces

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

Stereo vision. Many slides adapted from Steve Seitz

Computer Vision Lecture 17

Computer Vision Lecture 17

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Mosaics wrapup & Stereo

Stereo. Many slides adapted from Steve Seitz

Introduction à la vision artificielle X

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Epipolar Geometry and Stereo Vision

BIL Computer Vision Apr 16, 2014

Last time: Disparity. Lecture 11: Stereo II. Last time: Triangulation. Last time: Multi-view geometry. Last time: Epipolar geometry

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

CEng Computational Vision

Epipolar Geometry and Stereo Vision

Epipolar Geometry and Stereo Vision

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Lecture 10: Multi view geometry

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Lecture 14: Basic Multi-View Geometry

Lecture 10: Multi-view geometry

CS5670: Computer Vision

Image Based Reconstruction II

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Epipolar Geometry and Stereo Vision

Dense 3D Reconstruction. Christiano Gava

Miniature faking. In close-up photo, the depth of field is limited.

Multiple View Geometry

6.819 / 6.869: Advances in Computer Vision Antonio Torralba and Bill Freeman. Lecture 11 Geometry, Camera Calibration, and Stereo.

Ninio, J. and Stevens, K. A. (2000) Variations on the Hermann grid: an extinction illusion. Perception, 29,

Multiple View Geometry

Lecture 9 & 10: Stereo Vision

Lecture 14: Computer Vision

Multi-stable Perception. Necker Cube

Multi-view stereo. Many slides adapted from S. Seitz

Dense 3D Reconstruction. Christiano Gava

What have we leaned so far?

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

EE795: Computer Vision and Intelligent Systems

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Stereo Vision. MAN-522 Computer Vision

Announcements. Stereo Vision Wrapup & Intro Recognition

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Recap from Previous Lecture

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

CS 4495/7495 Computer Vision Frank Dellaert, Fall 07. Dense Stereo Some Slides by Forsyth & Ponce, Jim Rehg, Sing Bing Kang

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Stereo Wrap + Motion. Computer Vision I. CSE252A Lecture 17

Image Rectification (Stereo) (New book: 7.2.1, old book: 11.1)

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Why is computer vision difficult?

CS 1674: Intro to Computer Vision. Midterm Review. Prof. Adriana Kovashka University of Pittsburgh October 10, 2016

A virtual tour of free viewpoint rendering

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

Stereo Matching. Stereo Matching. Face modeling. Z-keying: mix live and synthetic

Passive 3D Photography

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Stereo imaging ideal geometry

Stereo and Epipolar geometry

Assignment 2: Stereo and 3D Reconstruction from Disparity

Lecture 19: Motion. Effect of window size 11/20/2007. Sources of error in correspondences. Review Problem set 3. Tuesday, Nov 20

Lecture 6 Stereo Systems Multi-view geometry

Stereo Matching.

Kinect Device. How the Kinect Works. Kinect Device. What the Kinect does 4/27/16. Subhransu Maji Slides credit: Derek Hoiem, University of Illinois

Final Exam Study Guide

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

3D Photography: Stereo Matching

Step-by-Step Model Buidling

Announcements. Stereo Vision II. Midterm. Example: Helmholtz Stereo Depth + Normals + BRDF. Stereo

EE795: Computer Vision and Intelligent Systems

Local invariant features

Lecture 20: Tracking. Tuesday, Nov 27

Lecture'9'&'10:'' Stereo'Vision'

Depth from two cameras: stereopsis

CPSC 425: Computer Vision

Image Transfer Methods. Satya Prakash Mallick Jan 28 th, 2003

Local features: detection and description. Local invariant features

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Depth from two cameras: stereopsis

Structure from motion

Computer Vision Lecture 20

Lecture 9: Epipolar Geometry

Computer Vision Lecture 20

3D Computer Vision. Dense 3D Reconstruction II. Prof. Didier Stricker. Christiano Gava

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

Camera Drones Lecture 3 3D data generation

Transcription:

Stereo Thurs Mar 30 Kristen Grauman UT Austin Outline Last time: Human stereopsis Epipolar geometry and the epipolar constraint Case example with parallel optical axes General case with calibrated cameras Today: Stereo solutions Correspondences Additional constraints Multiple views Multi-view geometry, matching, invariant features, stereo vision Lowe Hartley and Zisserman Kristen Grauman 1

Estimating depth with stereo Stereo: shape from motion between two views We ll need to consider: Info on camera pose ( calibration ) Image point correspondences scene point optical center image plane Recall: Depth from disparity image I(x,y) Disparity map D(x,y) image I (x,y ) (x,y )=(x+d(x,y), y) So if we could find the corresponding points in two images, we could estimate relative depth Recall: Epipolar constraint Geometry of two views constrains where the corresponding pixel for some image point in the first view must occur in the second view. It must be on the line carved out by a plane connecting the world point and optical centers. 2

Recall: Epipolar geometry Epipolar Plane Epipolar Line Epipole Baseline Epipole http://www.ai.sri.com/~luong/research/meta3dviewer/epipolargeo.html Epipolar constraint This is useful because it reduces the correspondence problem to a 1D search along an epipolar line. Image from Andrew Zisserman Review questions Why perform rectification for stereo? What are the extrinsic camera parameters relating two stereo cameras? 3

What is the relationship between the two cameras? e e Figure from Hartley & Zisserman Review questions Why perform rectification for stereo? What are the extrinsic camera parameters relating two stereo cameras? What s the result of convolving a disparity map with [-1 1]? Depth for segmentation Edges in disparity in conjunction with image edges enhances contours found Danijela Markovic and Margrit Gelautz, Interactive Media Systems Group, Vienna University of Technology 4

Depth for segmentation Danijela Markovic and Margrit Gelautz, Interactive Media Systems Group, Vienna University of Technology Outline Last time: Human stereopsis Epipolar geometry and the epipolar constraint Case example with parallel optical axes General case with calibrated cameras Today: Stereo solutions Correspondences Additional constraints Correspondence problem Multiple match hypotheses satisfy epipolar constraint, but which is correct? Figure from Gee & Cipolla 1999 5

Correspondence problem Beyond the hard constraint of epipolar geometry, there are soft constraints to help identify corresponding points Similarity Uniqueness Ordering Disparity gradient To find matches in the image pair, we will assume Most scene points visible from both views Image regions for the matches are similar in appearance Dense correspondence search For each epipolar line For each pixel / window in the left image compare with every pixel / window on same epipolar line in right image pick position with minimum match cost (e.g., SSD, correlation) Adapted from Li Zhang Correspondence problem Parallel camera example: epipolar lines are corresponding image scanlines 6

Correspondence problem Intensity profiles Correspondence problem Neighborhoods of corresponding points are similar in intensity patterns. Normalized cross correlation 7

Correlation-based window matching Textureless regions Textureless regions are non-distinct; high ambiguity for matches. Effect of window size? 8

Effect of window size W = 3 W = 20 Want window large enough to have sufficient intensity variation, yet small enough to contain only pixels with about the same disparity. Figures from Li Zhang Foreshortening effects Occlusion Slide credit: David Kriegman 9

Sparse correspondence search Restrict search to sparse set of detected features (e.g., corners) Rather than pixel values (or lists of pixel values) use feature descriptor and an associated feature distance Still narrow search further by epipolar geometry Tradeoffs between dense and sparse search? Correspondence problem Beyond the hard constraint of epipolar geometry, there are soft constraints to help identify corresponding points Similarity Uniqueness Disparity gradient Ordering Uniqueness constraint Up to one match in right image for every point in left image for opaque objects Figure from Gee & Cipolla 1999 10

Disparity gradient constraint Assume piecewise continuous surface, so want disparity estimates to be locally smooth Figure from Gee & Cipolla 1999 Ordering constraint Points on same surface will typically be in same order in both views Figure from Gee & Cipolla 1999 Ordering constraint Won t always hold, e.g. consider transparent object, or an occluding surface Figures from Forsyth & Ponce 11

Beyond matched pairs Optimize correspondence assignments jointly Scanline at a time (DP) Full 2D grid (graph cuts) Scanline stereo Try to coherently match pixels on the entire scanline Different scanlines are still optimized independently Left image Right image intensity Shortest paths for scan-line stereo S left Left image I Right image I S right Can be implemented with dynamic programming Ohta & Kanade 85, Cox et al. 96 Slide credit: Y. Boykov 12

Coherent stereo on 2D grid Scanline stereo generates streaking artifacts Can t use dynamic programming to find spatially coherent disparities/ correspondences on a 2D grid Stereo matching as energy minimization I 1 I 2 D W 1 (i) W 2 (i+d(i)) D(i) E E Edata( I1, I 2, D) Esmooth ( D) W 2 1( i) W2 ( i D( i data )) i Esmooth D( i) D( j) neighbors i, j Stereo matching as energy minimization I 1 I 2 D W 1 (i) W 2 (i+d(i)) D(i) E E Edata ( I1, I 2, D) Esmooth ( D) W 2 1( i) W2( i D( i data )) i Energy functions of this form can be minimized using graph cuts Y. Boykov, O. Veksler, and R. Zabih, Fast Approximate Energy Minimization via Graph Cuts, PAMI 2001 Source: Steve Seitz Esmooth D( i) D( j) neighbors i, j 13

Error sources Low-contrast ; textureless image regions Occlusions Camera calibration errors Violations of brightness constancy (e.g., specular reflections) Large motions Virtual viewpoint video C. Zitnick et al, High-quality video view interpolation using a layered representation, SIGGRAPH 2004. Virtual viewpoint video C. Larry Zitnick et al, High-quality video view interpolation using a layered representation, SIGGRAPH 2004. http://research.microsoft.com/ivm/vvv/ 14

Video examples https://www.youtube.com/watch?v=sz0ub HvEttI https://www.youtube.com/watch?v=keiirxr Rb1k https://www.youtube.com/watch?v=1dt9g wx1gvm https://www.youtube.com/watch?v=cizgvz 8rjKA Summary Depth from stereo: main idea is to triangulate from corresponding image points. Epipolar geometry defined by two cameras We ve assumed known extrinsic parameters relating their poses Epipolar constraint limits where points from one view will be imaged in the other Makes search for correspondences quicker To estimate depth Limit search by epipolar constraint Compute correspondences, incorporate matching preferences Coming up Instance recognition Indexing local features efficiently Spatial verification models 15