There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Similar documents
Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

BIL Computer Vision Apr 16, 2014

Stereo vision. Many slides adapted from Steve Seitz

Stereo: Disparity and Matching

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Lecture 6 Stereo Systems Multi-view geometry

Stereo. Many slides adapted from Steve Seitz

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

CS5670: Computer Vision

Lecture 9 & 10: Stereo Vision

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Lecture 10: Multi view geometry

Computer Vision Lecture 17

Computer Vision Lecture 17

Recovering structure from a single view Pinhole perspective projection

Lecture 10: Multi-view geometry

Epipolar Geometry and Stereo Vision

Multiple View Geometry

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Mosaics wrapup & Stereo

Multiple View Geometry

Miniature faking. In close-up photo, the depth of field is limited.

Lecture'9'&'10:'' Stereo'Vision'

Epipolar Geometry and Stereo Vision

6.819 / 6.869: Advances in Computer Vision Antonio Torralba and Bill Freeman. Lecture 11 Geometry, Camera Calibration, and Stereo.

Epipolar Geometry and Stereo Vision

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Recap from Previous Lecture

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Epipolar Geometry and Stereo Vision

Lecture 9: Epipolar Geometry

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Lecture 14: Basic Multi-View Geometry

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Image Rectification (Stereo) (New book: 7.2.1, old book: 11.1)

Image Based Reconstruction II

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Stereo and Epipolar geometry

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Chaplin, Modern Times, 1936

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

Multi-View Geometry (Ch7 New book. Ch 10/11 old book)

Lecture 14: Computer Vision

Final project bits and pieces

Dense 3D Reconstruction. Christiano Gava

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

Rectification and Distortion Correction

EE795: Computer Vision and Intelligent Systems

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

calibrated coordinates Linear transformation pixel coordinates

Stereo and structured light

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Cameras and Stereo CSE 455. Linda Shapiro

Two-view geometry Computer Vision Spring 2018, Lecture 10

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

What have we leaned so far?

Announcements. Stereo

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Dense 3D Reconstruction. Christiano Gava

EE795: Computer Vision and Intelligent Systems

Reminder: Lecture 20: The Eight-Point Algorithm. Essential/Fundamental Matrix. E/F Matrix Summary. Computing F. Computing F from Point Matches

Structure from motion

Rectification and Disparity

Stereo Vision. MAN-522 Computer Vision

CS201 Computer Vision Camera Geometry

Structure from Motion

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

Structure from motion

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Stereo Vision A simple system. Dr. Gerhard Roth Winter 2012

Announcements. Stereo

Depth from two cameras: stereopsis

3D Geometry and Camera Calibration

Homographies and RANSAC

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Stereo CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Depth from two cameras: stereopsis

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

Introduction to Computer Vision. Week 10, Winter 2010 Instructor: Prof. Ko Nishino

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Step-by-Step Model Buidling

Robert Collins CSE486, Penn State Lecture 08: Introduction to Stereo

Computer Vision, Lecture 11

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

Introduction à la vision artificielle X

Lecture 8 Active stereo & Volumetric stereo

Lecture 5 Epipolar Geometry

Machine vision. Summary # 11: Stereo vision and epipolar geometry. u l = λx. v l = λy

Structure from Motion

Structure from Motion CSC 767

Capturing, Modeling, Rendering 3D Structures

Stereo. Shadows: Occlusions: 3D (Depth) from 2D. Depth Cues. Viewing Stereo Stereograms Autostereograms Depth from Stereo

Announcements. Stereo Vision II. Midterm. Example: Helmholtz Stereo Depth + Normals + BRDF. Stereo

Transcription:

STEREO VISION The slides are from several sources through James Hays (Brown); Srinivasa Narasimhan (CMU); Silvio Savarese (U. of Michigan); Bill Freeman and Antonio Torralba (MIT), including their own slides.

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Depth cues: Linear perspective

Depth cues: Aerial perspective

Depth ordering cues: Occlusion Source: J. Koenderink

Shape cues: Texture gradient

the two cameras are calibrated

Triangulation: the geometry behind Find shortest segment connecting the two viewing rays and let X be the midpoint of that segment X K known x x 2 1 K' known O 1 cameras calibrated O 2 R and t also known

Triangulation: is a linear approach λ 1 x1 = 1 2 x 2 = 2 P X λ P X P X 0 1 P X = 0 [x1 ]P1 X = 0 x [x 2 ]P2 X = 0 x 1 x 2 2 = Vector product written as matrix multiplication. The P_1 and P_2 (3x4) are known. Cross product as matrix multiplication: The x_1 and x_2 (3x1) are also known. Two independent 0 equations a aper point, b in terms of three unknown entries of X (the 4th coord. = 1) x

Stereo photography and stereo viewers Take two pictures of the same subject from two slightly different viewpoints and display so that each eye sees only one of the images. Invented by Sir Charles Wheatstone, 1838 a few stereo images blue/red stereo viewer

two images of the Eiffel tower superposed http://www.johnsonshawmuseum.org

Random dot stereograms Julesz 1960. Can we identify without an object a 3D structure from two images? Experiment. Pair of synthetic identical images and moving a small rectangular part in the middle one pixel to the left. When viewed monocularly they appear random, when viewed stereoscopically, is a 3D structure. Human binocular fusion appears early in the central nervous system.

Depth without objects Julesz, 1971 Bela Julesz

Stereo vision P p p O 1 O 2 Goal: estimate the position of P given the observation of P from two view points. Assumptions: known camera parameters and position (K, R, T).

Stereo vision calibrated cameras P p p O 1 O 2 Subgoals: - Solve the correspondence problem between the images; p, p'. - Use corresponding observations to triangulate in 3D.

If two images become parallel... P p p O O when views are parallel these two steps becomes much easier -- correspondence and triangulate.

E matrix with parallel images P v v e 1 p p e 2 y u u K O z x O K` Parallel epipolar lines. Epipoles at infinity. v = v Rectification: making two images parallel.

Making the two images parallel. P H p p H' O H, H' homographies O GOAL: Estimate the perspective transformations that makes images parallel. Impose v =v Leaves degrees of freedom for determining H and H'!!! If not appropriate H (H') is chosen, severe projective distortions are present.

First frame will call the "camera", the second frame the "world". X 2is the inhomogeneous coordinates in the world coordinates X 1is the inhomogeneous coordinates in the camera coordinates R and t take from world coordinate system to the camera c.s. describing the camera with respect the world coordinates. By choice, the zero in the second coor. system is the first origin T of the camera O = [0 0 0 1] and e' is the epipole in second c.s. The origin of the camera center O in second c.s. is related by T X 1= R (X 2 - t) (inverse R, t) to O' measured in the first c.s. Thus, the coordinates of the world origin in the first coor. system T T is the point O' = [ -R t 1] and e is the epipole of O' in first c.s. X 2= [0 0 0] T X -> O' first c.s. 1

P K I 0 P P P K R T P p p K e e K O R,T O Compute epipoles. e K R T T [e 1 sign does not matter T T e = K[I 0] [-R t 1] e' = K' [R t] [0 1] T e 2 1] T e K T

P. p p e e O O Map e to the x-axis at location [1,0,1] T (normalization). T two 3x3 matrix e [e e 1 1 1 2 ] 0 1 T H R T 1 H H

P p p e O O Send epipole to infinity. H 2 1 0 1 0 1 0 0 0 1 e 1 0 1 T 1 0 0 T

P H = H 2 H 1 H' e p p e O O p <==> p' epipolar lines are l = e x p l' = e' x p' can be computed. Matched pair of transformation. Parallel in each image To have v = v' need to find the final separately. transformations.

P H = H 2 H 1 H' p p O O' Align the epipolar lines across the two images based of the matched transformation. Descriped also in [Hartley and Zisserman] Chapter 11 (Sec. 11.12)... more advanced.

IMAGE RECTIFICATION Denote E from image 2 to 1 and is given. If the images are in parallel, and the t is known, the matrix E becomes E = [1 0 0] x = [i] x. The two 3 3 homographies are H and H. Let p and p be the rectified homogeous points p [i] x p = 0 related to the original points by p = Hp p = H p. The fundamental matrix F is F = H [i] x H p Fp = 0. The two homographies have more degrees of freedom then we need to solve the alignment.

Image rectification. Loop and Zhang (2001). The essential matrix is known. Denote E from image 2 to 1. T T p' H' [i] H p = 0 -- 8 DOF each homography H, H'. X A homography is decomposed in (shearing.similarity).projective -- order starts at right! projective: epipolar lines become parallel in each image; similarity: rotate the points into alignment with line maps in the two images simultaneously. Its enough... shearing: minimizing the horizontal distortion in each image. after projective and similarity a b 0 0 1 0 0 0 1 two intervals with known original aspect ratio

Rectification is useful. Slides of Loop, Zhang 2001. Projective transformation.

Rotate, translate, uniformly scale only. Shearing with u(u') parameters only, reduces the projective distortion separately in both images. E.g., to preserve perpendicularity and aspect ratio of two line segments. The two cameras have some distortion.

Geometry for a simple stereo system Assuming by now parallel optical axes, known camera parameters (i.e., calibrated cameras).

3D: X_l; X_r X_l = X_r - T The two cameras at T distance along the x axis. T = X_r - X_l = (Z x_r)/f - (Z x_l)/f Sometime disparity is x_l - x_r.

Depth from disparity image I(x,y) Disparity map D(x,y) image I (x,y ) (x,y )=(x+d(x,y), y) So if we could find the corresponding points in two images, we could estimate relative depth

Example: depth from disparity Venus data pair B = T translation x x B f z x x Stereo pair Disparity map / depth map http://vision.middlebury.edu/stereo/ Disparity map with occlusions

Basic stereo matching algorithm Rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find corresponding epipolar scanline in the right image. Examine all pixels on the scanline and pick the best match x. Compute disparity x-x and set depth(x) = fb/(x-x ).

The epipolar constraint helps, but much ambiguity remains.

Correlation Methods (from 1970's on) p...for normalized correlation Two parallel image. Pick up a window around p(u,v).

Correlation Methods Pick up a window around p(u,v). Build vector w. Slide the window along v line in image 2 and compute w. Keep sliding until w w (inner product) is maximized.

Correlation Methods Normalized correlation has to maximize: (w w)(w (w w)(w w ) w )

Correspondence search with... Left Right scanline...maximization of the normalized correlation in the second image. Norm. corr

Correspondence search with... Left Right scanline Matching cost disparity...the minimum of the sum squared differences (SSD).

Example: Left Right scanline Minimization of the sum squared differences (SSD) in the second image. SSD

Image block matching How do we determine correspondences? block matching d is the disparity (horizontal motion) How big should the neighborhood be? CSE 576, Slide Spring credit: 2008 Rick Szeliski Stereo matching

Neighborhood size Smaller neighborhood: more details. Larger neighborhood: fewer isolated mistakes. w = 3 w = 20 CSE 576, Slide Spring credit: 2008 Rick Szeliski Stereo matching

Issues on stereo matching: T = B! Foreshortening effect large B/z ratio O O Occlusions O O

small B/z ratio depth(x) = fb/(x-x ) O O Small error in measurements implies large error in estimating depth.

Homogeneous regions Hard to match pixels in these regions.

Repetitive patterns Correspondence problem is difficult. Apply nonlocal constraints to help enforce the correspondences.

Nonlocal constraints can have other problems Uniqueness For any point in one image, there should be at most one matching gpoint in the other image Ordering Corresponding points should be in the same order in both views Ordering constraint doesn t hold

http://vision.middlebury.edu/stereo/ Stereo testing and comparisons D. Scharstein and R. Szeliski. "A Taxonomy and Evaluation of Dense Two- Frame Stereo Correspondence Algorithms," International Journal of Computer Vision, 47 (2002), pp. 7-42. Scene Tsukuba laboratory left image Ground truth...154 submissions, July 2013

Stereo best algorithms 94

Example: window correlation Window-based matching (best window size) Ground truth not the best method

A few of the other methods: - based on edges, corners, gradients; - rank statistics; - dynamic programming; - matching as energy minimization (graph cuts); - multiple-baseline stereo; - space carving stereo etc... None of the methods is the ultimate solution. Stereo (like everything else) works only if the matching is robust.

Active stereo (point) P p projector O Replace one of the two cameras by a projector - Single camera. - Projector geometry calibrated too. - What s the advantage of having the projector? Correspondence problem solved!

Active stereo (stripe) projector O - If the projector and camera are parallel... correspondence problem solved. Otherwise, transform it into parallel first.

High precision: Laser scanning Object Laser sheet Direction of travel CCD image plane Laser Cylindrical lens CCD Digital Michelangelo Project Levoy et al. http://graphics.stanford.edu/projects/mich/ Optical triangulation Project a single stripe of laser light. across Scan the it across surfacethe of surface the object of the object. This is a very precise version of structured light scanning. Source: S. Seitz

Example: laser scanning The Digital Michelangelo Project, Levoy et al. Source: S. Seitz

Active stereo (color-coded stripes) L. Zhang, B. Curless, and S. M. Seitz 2002 S. Rusinkiewicz & Levoy 2002 projector - Dense reconstruction. Projector, image are parallel. - Correspondence found by a technique based on the color code. O

KINECT Structured infrared light