CS231A Midterm Review. Friday 5/6/2016

Similar documents
Structure from motion

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

CS231A Course Notes 4: Stereo Systems and Structure from Motion

Camera Geometry II. COS 429 Princeton University

Structure from motion

3D Computer Vision. Structure from Motion. Prof. Didier Stricker

calibrated coordinates Linear transformation pixel coordinates

Structure from motion

BIL Computer Vision Apr 16, 2014

Structure from Motion

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Computer Vision I - Algorithms and Applications: Multi-View 3D reconstruction

EECS 442 Computer vision. Fitting methods

Final Exam Study Guide

Structure from Motion CSC 767

Structure from Motion

Lecture 8 Fitting and Matching

3D Reconstruction from Two Views

Lecture 9 Fitting and Matching

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Multiple View Geometry in Computer Vision

Fitting. Instructor: Jason Corso (jjcorso)! web.eecs.umich.edu/~jjcorso/t/598f14!! EECS Fall 2014! Foundations of Computer Vision!

Lecture 6 Stereo Systems Multi- view geometry Professor Silvio Savarese Computational Vision and Geometry Lab Silvio Savarese Lecture 6-24-Jan-15

Structure from Motion

CS231M Mobile Computer Vision Structure from motion

Lecture 10: Multi-view geometry

Dense 3D Reconstruction. Christiano Gava

Stereo and Epipolar geometry

Feature Matching and Robust Fitting

Stereo Vision. MAN-522 Computer Vision

CS 231A Computer Vision (Winter 2018) Problem Set 3

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 253

Dense 3D Reconstruction. Christiano Gava

Lecture 9: Epipolar Geometry

Lecture 9: Hough Transform and Thresholding base Segmentation

CS 231A Computer Vision (Winter 2014) Problem Set 3

Feature Detectors and Descriptors: Corners, Lines, etc.

Fitting. Fitting. Slides S. Lazebnik Harris Corners Pkwy, Charlotte, NC

Epipolar Geometry CSE P576. Dr. Matthew Brown

UNIVERSITY OF TORONTO Faculty of Applied Science and Engineering. ROB501H1 F: Computer Vision for Robotics. Midterm Examination.

EECS 442: Final Project

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 263

PS3 Review Session. Kuan Fang CS231A 02/16/2018

1 Projective Geometry

Lecture 6 Stereo Systems Multi-view geometry

Object Recognition with Invariant Features

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

Unit 3 Multiple View Geometry

C280, Computer Vision

Final Exam Study Guide CSE/EE 486 Fall 2007

CS 664 Structure and Motion. Daniel Huttenlocher

Lecture 10: Multi view geometry

CS 532: 3D Computer Vision 7 th Set of Notes

Contents. 1 Introduction Background Organization Features... 7

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

CSE 252B: Computer Vision II

EE795: Computer Vision and Intelligent Systems

Week 2: Two-View Geometry. Padua Summer 08 Frank Dellaert

Computer Vision Projective Geometry and Calibration. Pinhole cameras

Introduction to Computer Vision

Camera Registration in a 3D City Model. Min Ding CS294-6 Final Presentation Dec 13, 2006

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

Agenda. Rotations. Camera models. Camera calibration. Homographies

Two-view geometry Computer Vision Spring 2018, Lecture 10

Multiple View Geometry

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

Geometric camera models and calibration

Homography estimation

Visual Odometry. Features, Tracking, Essential Matrix, and RANSAC. Stephan Weiss Computer Vision Group NASA-JPL / CalTech

1 (5 max) 2 (10 max) 3 (20 max) 4 (30 max) 5 (10 max) 6 (15 extra max) total (75 max + 15 extra)

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

Vision par ordinateur

RANSAC and some HOUGH transform

Computer Vision cmput 428/615

The end of affine cameras

3D Modeling using multiple images Exam January 2008

Projective geometry for 3D Computer Vision

3D Reconstruction of a Hopkins Landmark

Agenda. Rotations. Camera calibration. Homography. Ransac

Motion Tracking and Event Understanding in Video Sequences

Application questions. Theoretical questions

Multiview Stereo COSC450. Lecture 8

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Instance-level recognition II.

CS223b Midterm Exam, Computer Vision. Monday February 25th, Winter 2008, Prof. Jana Kosecka

HISTOGRAMS OF ORIENTATIO N GRADIENTS

Announcements. Hough Transform [ Patented 1962 ] Generalized Hough Transform, line fitting. Assignment 2: Due today Midterm: Thursday, May 5 in class

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Lecture 8: Fitting. Tuesday, Sept 25

Last lecture. Passive Stereo Spacetime Stereo

Epipolar Geometry and Stereo Vision

But First: Multi-View Projective Geometry

Midterm Exam Solutions

Lecture 5 Epipolar Geometry

Combining Appearance and Topology for Wide

Model Fitting, RANSAC. Jana Kosecka

Structure from Motion

11/28/17. Midterm Review. Magritte, Homesickness. Computational Photography Derek Hoiem, University of Illinois

3D Photography: Epipolar geometry

Recovering structure from a single view Pinhole perspective projection

Transcription:

CS231A Midterm Review Friday 5/6/2016

Outline General Logistics Camera Models Non-perspective cameras Calibration Single View Metrology Epipolar Geometry Structure from Motion Active Stereo and Volumetric Stereo Fitting and Matching RANSAC Hough Transform Detectors and Descriptors

Midterm Logistics In class midterm at Skilling Aud. 3:00pm-4:20pm on Monday 5/9/2016 SCPD students not taking exam at Stanford should have already set up a proctor. Exam covers material from lecture 1-10 (through detectors and descriptors) Approximately 10 TF, 5 MC, 8 Short Answer. Open book, open notes, open computer without network access

Camera Models

Transformations in 2D Isometric Similarity Affine Projective

Properties of projective transformations points project to points lines project to lines distant objects look smaller angles are not preserved parallel lines in 3D meet in the image

Non-perspective cameras Orthographic Weak Perspective Affine

Calibration Problem

Calibration Problem

Calibration Problem

Calibration Problem

Homogeneous MxN Linear System

Quick SVD recap Any real mxn matrix A can be decomposed uniquely as A = U V T where U & V are orthogonal and is diagonal. If A has rank r, A = U r r V T r = rx i=1 iu i v T i r = diag( 1, 2,..., r) U =[u 1,u 2,...,u r ] V =[v 1,v 2,...,v r ] Be familiar with how to use SVD to solve Ax=0 and Ax=b

Single View Metrology Vanishing points Vanishing lines Construction of lines from points Directions and normals of vanishing points and vanishing lines.

Lines in a 2D plane

Points at infinity Intersection of two lines x = l l 0 Line through two points l = x x 0 Point at infinity x 1 Line at infinity l 1 Vanishing point: the projective projection of a point at infinity into the image plane

Vanishing points and directions

Vanishing lines

Single View Calibration As we saw in problem set 1, we can calculate the internal camera matrix K provided a single view and knowledge of the the scene geometry.

Multi-view Geometry Camera Geometry: given corresponding points in two images, find camera matrices, position and pose. Scene Geometry: Find correspondences of 3D points from its projection into 2 or more images. Correspondence: Given a point p in one image, how can I find the corresponding point p in another?

Epipolar Geometry

Epipolar Geometry

Fundamental Matrix F = K1 T [T x ] RK2 1 p T 1 Fp 2 =0 l 1 = Fp 2 l 2 = Fp 1 F is 3x3 matrix; 7 DOF F is singular (rank 2)

Epipolar Geometry Only need F to establish a relationship between the two corresponding points in the image No knowledge of position of P in 3D, or intrinsic/extrinsic parameters.

8-point algorithm

8-point algorithm

Solving the Correspondence issues: occlusion illumination foreshortening homogeneous regions repetitive patterns

Structure From Motion Structure = 3D points Motion = projection matrices

Affine Structure From Motion Problem: estimate the m matrices Ai, m matrices bi, and the n positions Xj from the mxn observations xij. Affine camera x ij = A i X j + b i ˆx ij = A i ˆXj normalized

Affine Structure From Motion Factorization method Affine camera x ij = A i X j + b i ˆx ij = A i ˆXj normalized

Factorization method use SVD! D should be rank 3

Structure From Motion Structure = 3D points Motion = projection matrices

Structure From Motion 1. Recover structure and motion up to perspective ambiguity Algebraic approach using the Fundamental matrix Factorization method (by SVD) Bundle adjustment 2. Resolving the perspective ambiguity Structure = 3D points Motion = projection matrices

Structure From Motion Limitations: 1. Algebraic approach using the Fundamental matrix Yields pairwise solutions. 2. Factorization method (by SVD) Assumes that all points are visible in every image, doesn t handle occlusions. Bundle adjustment via optimization Attempts to minimize the re-projection error (pixel distance between the projection of a reconstructed point into the estimated cameras for all the cameras and all the points).

Active Stereo and Volumetric Stereo Active Stereo Structured lighting Depth Sensing Volumetric Stereo Space carving Shadow carving Voxel coloring

Active Stereo

Volumetric Stereo

Volumetric Stereo

Volumetric Stereo Space Carving remove voxels via multiple views.

Fitting and Matching Least Squares methods RANSAC Hough transform

Fitting Goal: Find the best model parameters to fit the data. Things we might want to fit include Lines Curves Homographic transformations Fundamental matrices Issues with Fitting Noisy data Outliers Missing data

Least squares methods E = nx (y i mx i b) 2 i=1 Goal is the find the line that minimizes the residuals.

Least squares methods E = nx (ax i + by i d) 2 i=1 Ah =0 h = data parameters 2 3 a 4b5 d Robustness estimator to handle outliers

RANSAC Randomized iterative method to fit a parametric model to data based on random sampling of data. The typical approach 1. Sample the minimum number of points required to fit the model. 2. Fit a model to the sample. 3. Compute inliers. 4. Refine model based on all points.

RANSAC: fitting a circle 1. Select 3 random points. Find center and radius of the circumscribed circle to the triangle formed by the 3 points. 2. Compute distances of all the points to the center of the circle, and calculate inliers and outliers 3. Repeat at most N times

RANSAC Randomized iterative method to fit a parametric model to data based on random sampling of data. Pros: General method suited for a wide range of model fitting problems Easy to implement Cons: Only handles a moderate percentage of outliers lots of parameters to tune

Hough Transform A voting scheme to fit a parametric model to data by selecting the model with the most votes.

Hough Transform A voting scheme to fit a parametric model to data by selecting the model with the most votes.

Hough Transform to fit a Circle with known radius R y b x = a + R cos( ) y = b + R sin( ) x a Each point in the geometric space (left) generates a circle in parametric space (right). The circles in parametric space intersect at the (a,b) that is the center in the geometric space.

Hough Transform to fit a Multiple Circles with known R Each point in the geometric space (left) generates a circle in parametric space (right). The circles in parametric space intersect at the (a,b) that is the center in the geometric space.

Hough transform Pros: All points are processed independently, so can handle occlusions Some robustness to noise: noisy points unlikely to contribute consistently to any single bin. Can detect multipole instances of a model in a single pass Cons: Complexity of search time increases exponentially with the number of model parameters Spurious peaks due to uniform noise Quantization: hard to pick a good grid size.

Detectors and Descriptors Properties of detectors Edge detectors (Canny edge detector) Harris Corner detector Blob detectors Difference of Gaussians (DoG) Properties of descriptors SIFT HOG

Good Luck!