Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Similar documents
Stereo and Epipolar geometry

Structure from motion

Step-by-Step Model Buidling

Multiview Stereo COSC450. Lecture 8

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Structure from motion

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Two-view geometry Computer Vision Spring 2018, Lecture 10

CS231A Midterm Review. Friday 5/6/2016

Epipolar Geometry CSE P576. Dr. Matthew Brown

3D Modeling using multiple images Exam January 2008

Last lecture. Passive Stereo Spacetime Stereo

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Structure from Motion

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 263

Computational Optical Imaging - Optique Numerique. -- Multiple View Geometry and Stereo --

Epipolar geometry. x x

Lecture 8.2 Structure from Motion. Thomas Opsahl

Structure from Motion CSC 767

Introduction to Computer Vision

Recovering structure from a single view Pinhole perspective projection

Lecture 9: Epipolar Geometry

Lecture 14: Basic Multi-View Geometry

Announcements. Stereo

Multiple View Geometry

calibrated coordinates Linear transformation pixel coordinates

Two-View Geometry (Course 23, Lecture D)

BIL Computer Vision Apr 16, 2014

Computational Optical Imaging - Optique Numerique. -- Single and Multiple View Geometry, Stereo matching --

Computer Vision Lecture 17

Computer Vision Lecture 17

Dense 3D Reconstruction. Christiano Gava

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Structure from Motion and Multi- view Geometry. Last lecture

Epipolar Geometry in Stereo, Motion and Object Recognition

Camera Drones Lecture 3 3D data generation

Computer Vision I. Announcements. Random Dot Stereograms. Stereo III. CSE252A Lecture 16

Unit 3 Multiple View Geometry

arxiv: v1 [cs.cv] 28 Sep 2018

Image correspondences and structure from motion

Announcements. Stereo

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 253

C280, Computer Vision

Homographies and RANSAC

CS 532: 3D Computer Vision 7 th Set of Notes

3D Reconstruction from Two Views

Rectification and Distortion Correction

Augmenting Reality, Naturally:

Humanoid Robotics. Projective Geometry, Homogeneous Coordinates. (brief introduction) Maren Bennewitz

CSCI 5980/8980: Assignment #4. Fundamental Matrix

Multiple Views Geometry

arxiv: v1 [cs.cv] 28 Sep 2018

Contents. 1 Introduction Background Organization Features... 7

The end of affine cameras

3D Computer Vision. Structure from Motion. Prof. Didier Stricker

3D Reconstruction of a Hopkins Landmark

Week 2: Two-View Geometry. Padua Summer 08 Frank Dellaert

Stereo Vision. MAN-522 Computer Vision

Structure from motion

3D Reconstruction from Scene Knowledge

Structure from Motion

Structure from Motion

Computer Vision I - Algorithms and Applications: Multi-View 3D reconstruction

CSE 252B: Computer Vision II

Epipolar Geometry and Stereo Vision

Visual Pose Estimation System for Autonomous Rendezvous of Spacecraft

CS664 Lecture #19: Layers, RANSAC, panoramas, epipolar geometry

Robust Geometry Estimation from two Images

Multiple View Geometry in Computer Vision Second Edition

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Improving Initial Estimations for Structure from Motion Methods

EECS 442: Final Project

Reminder: Lecture 20: The Eight-Point Algorithm. Essential/Fundamental Matrix. E/F Matrix Summary. Computing F. Computing F from Point Matches

Lecture 10: Multi-view geometry

Epipolar Geometry and Stereo Vision

Vision par ordinateur

CS231M Mobile Computer Vision Structure from motion

CS 231A Computer Vision (Winter 2015) Problem Set 2

A General Expression of the Fundamental Matrix for Both Perspective and Affine Cameras

Lecture 5 Epipolar Geometry

Invariant Features from Interest Point Groups

Dense 3D Reconstruction. Christiano Gava

Epipolar Geometry and Stereo Vision

The Projective Vision Toolkit

Instance-level recognition II.

3D reconstruction class 11

Computer Vision CSCI-GA Assignment 1.

Structure from Motion. Prof. Marco Marcon

1 Projective Geometry

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Geometric camera models and calibration

CS231A Course Notes 4: Stereo Systems and Structure from Motion

Passive 3D Photography

Geometry of Multiple views

Towards a visual perception system for LNG pipe inspection

Estimation of common groundplane based on co-motion statistics

Visual Pose Estimation and Identification for Satellite Rendezvous Operations

Application questions. Theoretical questions

Transcription:

Structure from Motion CSE 152 Lecture 10

Announcements Homework 3 is due May 9, 11:59 PM Reading: Chapter 8: Structure from Motion Optional: Multiple View Geometry in Computer Vision, 2nd edition, Hartley and Zisserman

Motion

Motion Some problems of motion Correspondence: Where have elements of the image moved between image frames? Reconstruction: Given correspondences, what is 3D geometry of scene? Motion segmentation: What are regions of image corresponding to different moving objects? Tracking: Where have objects moved in the image? (related to correspondence and segmentation)

Structure from Motion The change in rotation and translation between the two cameras (the motion ) Locations of points on the object (the structure ) MOVING CAMERAS ARE LIKE STEREO

Structure from Motion (SFM) Objective Given two or more images (or video frames), without knowledge of the camera poses (rotations and translations), estimate the camera poses and 3D structure of scene. Considerations Discrete motion (wide baseline) vs. continuous (infinitesimal) motion Calibrated vs. uncalibrated Two views vs. multiple views Orthographic (affine) vs. perspective

Discrete Motion, Calibrated Consider m images of n points, how many unknowns? Unknowns 3D Structure: 3n First normalized camera P ^ = [I 0] Rotations: 3(m 1) Translations (to scale): 3(m 1) 1 Total: 3n + 6(m 1) 1 Measurements 2nm Solution when 3n + 6(m 1) 1 2nm

Discrete Motion, Uncalibrated Consider m images of n points, how many unknowns? Unknowns 3D Structure: 3n Cameras (to 3D projective transformation): 11m 15 Total: 3n + 11m 15 Measurements 2nm Solution when 3n + 11m 15 2nm

Two Views, Calibrated Input: Two images (or video frames) Detect feature points Determine feature correspondences Compute the essential matrix Retrieve the relative camera rotation and translation (to scale) from the essential matrix Optional: Perform dense stereo matching using recovered epipolar geometry Reconstruct corresponding 3D scene points (to scale)

Two Views, Uncalibrated Input: Two images (or video frames) Detect feature points Determine feature correspondences Compute the fundamental matrix Retrieve the relative camera 3D projective transformation from the fundamental matrix Optional: Perform dense stereo matching using recovered epipolar geometry Reconstruct corresponding 3D scene points (to 3D projective transformation)

Essential Matrix (Calibrated) Number of point correspondences and solutions 5 point correspondences, up to 10 (real) solutions 6 point correspondences, 1 solution 7 point correspondences, 1 or 3 real solutions (and 2 or 0 complex ones) 8 or more point correspondences, 1 solution

Fundamental Matrix (Uncalibrated) Number of point correspondences and solutions 7 point correspondences, 1 or 3 real solutions (and 2 or 0 complex ones) 8 or more point correspondences, 1 solution

Fundamental matrix Mathematics Linear estimation (8 or more correspondences) Retrieval of camera projection matrices and 3D projective transformation Essential matrix Linear estimation (8 or more correspondences) Retrieval of normalized camera projection matrices and 3D rotation and translation (to scale)

Feature detection Select strongest features (e.g. 1000/image)

Feature matching Evaluate normalized cross correlation (or sum of squared differences) for all features with similar coordinates e.g. w w h h ( x, y ) [ x, x + ] [ y y + ] 10 10 10, 10 Keep mutual best matches Still many wrong matches!?

Greedy Algorithm: Comments Given feature in one image, find best match in second image irrespective of other matches Suitable for small motions, little rotation, small search window Otherwise Must compare descriptor over rotation Cannot consider all potential pairings (way too many), so Manual correspondence (e.g., photogrammetry) Use robust outlier rejection (e.g., RANSAC) More descriptive features (line segments, SIFT, larger regions, color) Use video sequence to track, but perform SFM w/ first and last image

Reconstruction N-View Geometry Bundle adjustment Simultaneous adjustment of parameters for all cameras and all 3D scene points Minimize reprojection error in all images Reconstruction of cameras and 3D scene points to similarity (calibrated) or projective (uncalibrated) ambiguity Factorization (see text)

Direct Reconstruction Projective to Euclidean 5 (or more) points correspondences

N-View Geometry

N-View Geometry

Example results N-View Geometry

Mid-level vision Next Lecture Grouping and model fitting Reading: Chapter 10: Grouping and Model Fitting