Announcements. Stereo Vision Wrapup & Intro Recognition

Similar documents
Announcements. Stereo Vision II. Midterm. Example: Helmholtz Stereo Depth + Normals + BRDF. Stereo

Computer Vision I. Announcement

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Stereo Wrap + Motion. Computer Vision I. CSE252A Lecture 17

Computer Vision, Lecture 11

Stereo (Part 2) Introduction to Computer Vision CSE 152 Lecture 9

Computer Vision I. Announcement. Stereo Vision Outline. Stereo II. CSE252A Lecture 15

Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map?

Stereo Matching.

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

CS5670: Computer Vision

Stereo and Epipolar geometry

Recap from Previous Lecture

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

CS4495/6495 Introduction to Computer Vision. 3B-L3 Stereo correspondence

Stereo. 11/02/2012 CS129, Brown James Hays. Slides by Kristen Grauman

Lecture 14: Computer Vision

EECS 442 Computer vision. Stereo systems. Stereo vision Rectification Correspondence problem Active stereo vision systems

Stereo. Outline. Multiple views 3/29/2017. Thurs Mar 30 Kristen Grauman UT Austin. Multi-view geometry, matching, invariant features, stereo vision

Chaplin, Modern Times, 1936

What have we leaned so far?

Epipolar Constraint. Epipolar Lines. Epipolar Geometry. Another look (with math).

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Project 2 due today Project 3 out today. Readings Szeliski, Chapter 10 (through 10.5)

Multiple View Geometry

Computer Vision Lecture 17

Computer Vision Lecture 17

Fundamentals of Stereo Vision Michael Bleyer LVA Stereo Vision

Dense 3D Reconstruction. Christiano Gava

Lecture 10: Multi view geometry

Multi-view stereo. Many slides adapted from S. Seitz

BIL Computer Vision Apr 16, 2014

Stereo II CSE 576. Ali Farhadi. Several slides from Larry Zitnick and Steve Seitz

Final project bits and pieces

Dense 3D Reconstruction. Christiano Gava

7. The Geometry of Multi Views. Computer Engineering, i Sejong University. Dongil Han

Stereo: Disparity and Matching

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Correspondence and Stereopsis. Original notes by W. Correa. Figures from [Forsyth & Ponce] and [Trucco & Verri]

Multiple View Geometry

EE795: Computer Vision and Intelligent Systems

Announcements. Recognition I. Optical Flow: Where do pixels move to? dy dt. I + y. I = x. di dt. dx dt. = t

Stereo vision. Many slides adapted from Steve Seitz

Step-by-Step Model Buidling

Lecture 10: Multi-view geometry

Recap: Features and filters. Recap: Grouping & fitting. Now: Multiple views 10/29/2008. Epipolar geometry & stereo vision. Why multiple views?

Image Based Reconstruction II

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Introduction à la vision artificielle X

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Announcements. Recognition (Part 3) Model-Based Vision. A Rough Recognition Spectrum. Pose consistency. Recognition by Hypothesize and Test

Recognition (Part 4) Introduction to Computer Vision CSE 152 Lecture 17

Final Exam Study Guide CSE/EE 486 Fall 2007

Introduction to Computer Vision. Week 10, Winter 2010 Instructor: Prof. Ko Nishino

Stereo. Many slides adapted from Steve Seitz

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

A virtual tour of free viewpoint rendering

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Passive 3D Photography

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Stereo matching. Reading: Chapter 11 In Szeliski s book

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Geometric Reconstruction Dense reconstruction of scene geometry

Final Exam Study Guide

Lecture 9 & 10: Stereo Vision

Improved depth map estimation in Stereo Vision

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up:

Stereo Vision. MAN-522 Computer Vision

Some books on linear algebra

Stereo Vision II: Dense Stereo Matching

Epipolar Geometry and Stereo Vision

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Passive 3D Photography

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Lecture 6 Stereo Systems Multi-view geometry

Lecture 14: Basic Multi-View Geometry

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Vision is inferential. (

High Accuracy Depth Measurement using Multi-view Stereo

Outdoor Scene Reconstruction from Multiple Image Sequences Captured by a Hand-held Video Camera

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

CS664 Lecture #21: SIFT, object recognition, dynamic programming

CEng Computational Vision

Human Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview

Depth from two cameras: stereopsis

Epipolar Geometry and Stereo Vision

Some books on linear algebra

Lecture 10 Multi-view Stereo (3D Dense Reconstruction) Davide Scaramuzza

Research on an Adaptive Terrain Reconstruction of Sequence Images in Deep Space Exploration

Depth from two cameras: stereopsis

Multi-Flash Stereopsis: Depth Edge Preserving Stereo with Small Baseline Illumination

Introduction to Computer Vision

Complex Sensors: Cameras, Visual Sensing. The Robotics Primer (Ch. 9) ECE 497: Introduction to Mobile Robotics -Visual Sensors

Depth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth

Stereo Matching. Stereo Matching. Face modeling. Z-keying: mix live and synthetic

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Video Mosaics for Virtual Environments, R. Szeliski. Review by: Christopher Rasmussen

Transcription:

Announcements Stereo Vision Wrapup & Intro Introduction to Computer Vision CSE 152 Lecture 17 HW3 due date postpone to Thursday HW4 to posted by Thursday, due next Friday. Order of material we ll first cover recognition so that you re prepared for assignment. Then return to motion. Final Exam: Tuesday 6/7 8:00-11:00 AM, Here Rectification Rectification Given a pair of images, transform both images so that epipolar lines are scan lines. Using epipolar & constant Brightness constraints for stereo matching Correspondence Search Algorithm For each epipolar line For each pixel in the left image compare with every pixel on same epipolar line in right image pick pixel with minimum match cost This will never work, so: Improvement: match windows For i = 1:nrows for j=1:ncols best(i,j) = -1 for k = mindisparity:maxdisparity c = Match_Metric(I 1 (i,j),i 2 (i,j+k),winsize) if (c > best(i,j)) best(i,j) = c disparities(i,j) = k O(nrows * ncols * disparities * winx * winy) 1

Match Metric Summary Stereo results MATCH METRIC Normalized Cross-Correlation (NCC) DEFINITION Data from University of Tsukuba Sum of Squared Differences (SSD) These two are actually the same Normalized SSD Sum of Absolute Differences (SAD) Zero Mean SAD Rank Census Scene Ground truth Disparity Map Results with window correlation Results with better method Window-based matching (best window size) Ground truth Near State of the art method Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999. Ground truth Window size Window shape Lighting Ambiguity Ordering Some Issues Half occluded regions Effect of window size Window size W = 3 W = 20 Better results with adaptive window T. Kanade and M. Okutomi, A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment,, Proc. International Conference on Robotics and Automation, 1991. D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2): 155-174, July 1998 2

Window Shape and Forshortening Window Shape: Fronto-parallel Configuration W p U 1 U 2 W r W l Lighting Conditions (Photometric Variations) Ambiguity W(P l ) W(P r ) A challenge: Multiple Interpretations Multiple Interpretations 3

Multiple Interpretations Multiple Interpretations Problem of Occlusion CONSTRAINT Stereo Constraints BRIEF DESCRIPTION Monotonic Ordering 1-D Epipolar Search Arbitrary images of the same scene may be rectified based on epipolar geometry such that stereo matches lie along onedimensional scanlines. This reduces the computational complexity and also reduces the likelihood of false matches. Points along an epipolar scanline appear in the same order in both stereo images, assuming that all objects in the scene are approximately the same distance from the cameras. Image Brightness Constancy Match Uniqueness Disparity Continuity Disparity Limit Fronto-Parallel Surfaces Feature Similarity Structural Grouping Assuming Lambertian surfaces, the brightness of corresponding points in stereo images are the same. For every point in one stereo image, there is at most one corresponding point in the other image. Disparities vary smoothly (i.e. disparity gradient is small) over most of the image. This assumption is violated at object boundaries. The search space may be reduced significantly by limiting the disparity range, reducing both computational complexity and the likelihood of false matches. The implicit assumption made by area-based matching is that objects have fronto-parallel surfaces (i.e. depth is constant within the region of local support). This assumption is violated by sloping and creased surfaces. Corresponding features must be similar (e.g. edges must have roughly the same length and orientation). Corresponding feature groupings and their connectivity must be consistent. (From G. Hager) Stereo Matching using Dynamic Programming Stereo matching Similarity measure (SSD or NCC) Optimal path (dynamic programming ) Constraints epipolar ordering uniqueness disparity limit disparity gradient limit Trade-off Matching cost (data) Discontinuities (prior) (Ohta and Kanade, 1985) Reprinted from Stereo by Intra- and Intet-Scanline Search, by Y. Ohta and T. Kanade, IEEE Trans. on Pattern Analysis and Machine CSE152, Spring Intelligence, 2011 7(2):139-154 (1985). 1985 IEEE. (From Pollefeys) (Cox et al. CVGIP 96; Koch 96; Falkenhagen 97; Van Meerbergen,Vergauwen,Pollefeys,VanGool IJCV 02) 4

Some Challenges & Problems Photometric issues: specularities strongly non-lambertian BRDF s Variations on Binocular Stereo 1. Trinocular Stereopsis 2. Helmholtz Reciprocity Stereopsis Surface structure lack of texture repeating texture within horopter bracket Geometric ambiguities as surfaces turn away, difficult to get accurate reconstruction (affine approximate can help) at the occluding contour, likelihood of good match but incorrect reconstruction Trinocular Epipolar Constraints More on stereo The Middleburry Stereo Vision Research Page http://cat.middlebury.edu/stereo/ Recommed reading" D. Scharstein and R. Szeliski. " A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms. IJCV 47(1/2/3):7-42, April-June 2002. PDF file (1.15 MB) - includes current evaluation. Microsoft Research Technical Report MSR-TR-2001-81, November 2001." These constraints are not indepent! Myron Z. Brown, Darius Burschka, and Gregory D. Hager. Advances in Computational Stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(8): 993-1008, 2003. Announcements I Introduction to Computer Vision CSE 152 Lecture 17 HW 4 to be posted later today. It does not require a lot of coding, but does require understanding Order of material changed we ll first cover recognition so that you re prepared for assignment. Then return to motion. 5

Given a database of objects and an image determine what, if any of the objects are present in the image. Given a database of objects and an image determine what, if any of the objects are present in the image. Where are the coral heads and which ones are healthy and which are bleached? Given a database of objects and an image determine what, if any of the objects are present in the image. Bleached Input Image! Healthy Partially Bleached Segmented/labeled Image Object : The Problem Categories vs. Instances Camel Given: A database D of known objects and an image I: Mammal Face 1. Determine which (if any) objects in D appear in I 2. Determine the pose (rotation and translation) of the object Pose Est. (where is it 3D) Barbara Steele Two challenges: Segmentation (where is it 2D) Recognizing instances (what is it) Recognizing categories WHAT AND WHERE!!! 6

Challenges Within-class variability Different objects within the class have different shapes or different material characteristics Deformable Articulated Compositional Pose variability: 2-D Image transformation (translation, rotation, scale) 3-D Pose Variability (perspective, orthographic projection) Lighting Direction (multiple sources & type) Color Shadows Occlusion partial Clutter in background -> false positives Object Issues: How general is the problem? 2D vs. 3D range of viewing conditions available context segmentation cues What sort of data is best suited to the problem? Whole images Local 2D features (color, texture, 3D (range) data What information do we have in the database? Collection of images? 3-D models? Learned representation? Learned classifiers? How many objects are involved? small: brute force search large:?? A Rough Spectrum Appearance-Based (Eigenface, Fisherface) Geometric Invariants Shape Contexts Image Abstractions/ Volumetric Primitives Local Features + Spatial Relations 3-D Model-Based Bags of Features Function Increasing Generality 7