Computer Vision I - Algorithms and Applications: Multi-View 3D reconstruction

Similar documents
Structure from motion

Structure from motion

3D reconstruction class 11

Structure from Motion CSC 767

Structure from Motion

calibrated coordinates Linear transformation pixel coordinates

BIL Computer Vision Apr 16, 2014

Structure from Motion

More on single-view geometry class 10

Invariance of l and the Conic Dual to Circular Points C

Unit 3 Multiple View Geometry

Multiple View Geometry in computer vision

Projective geometry for Computer Vision

Structure from Motion

Lecture 9: Epipolar Geometry

Multiple View Geometry in Computer Vision Second Edition

Robust Geometry Estimation from two Images

Computer Vision I - Appearance-based Matching and Projective Geometry

CS231M Mobile Computer Vision Structure from motion

Structure from motion

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Camera models and calibration

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Lecture 10: Multi-view geometry

Multiple Views Geometry

1 Projective Geometry

Epipolar Geometry class 11

Computer Vision I - Appearance-based Matching and Projective Geometry

Auto-calibration. Computer Vision II CSE 252B

CS231A Midterm Review. Friday 5/6/2016

CS 532: 3D Computer Vision 7 th Set of Notes

Two-view geometry Computer Vision Spring 2018, Lecture 10

The end of affine cameras

Multiple View Geometry

Projective geometry for 3D Computer Vision

Stereo Vision. MAN-522 Computer Vision

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Lecture 10: Multi view geometry

C / 35. C18 Computer Vision. David Murray. dwm/courses/4cv.

C280, Computer Vision

Multiple View Geometry in Computer Vision

Stereo and Epipolar geometry

Computer Vision Projective Geometry and Calibration. Pinhole cameras

Refining Single View Calibration With the Aid of Metric Scene Properties

Epipolar Geometry Prof. D. Stricker. With slides from A. Zisserman, S. Lazebnik, Seitz

CS231A Course Notes 4: Stereo Systems and Structure from Motion

Geometry of Multiple views

Epipolar Geometry CSE P576. Dr. Matthew Brown

CS-9645 Introduction to Computer Vision Techniques Winter 2019

Projective geometry, camera models and calibration

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

Camera Geometry II. COS 429 Princeton University

Multiview Stereo COSC450. Lecture 8

Step-by-Step Model Buidling

Elements of Computer Vision: Multiple View Geometry. 1 Introduction. 2 Elements of Geometry. Andrea Fusiello

Machine vision. Summary # 11: Stereo vision and epipolar geometry. u l = λx. v l = λy

Rectification and Distortion Correction

3D Reconstruction from Two Views

Lecture 5 Epipolar Geometry

Computer Vision I - Robust Geometry Estimation from two Cameras

Computer Vision cmput 428/615

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

CS201 Computer Vision Camera Geometry

Epipolar Geometry and Stereo Vision

Reminder: Lecture 20: The Eight-Point Algorithm. Essential/Fundamental Matrix. E/F Matrix Summary. Computing F. Computing F from Point Matches

3D Photography: Epipolar geometry

Homogeneous Coordinates. Lecture18: Camera Models. Representation of Line and Point in 2D. Cross Product. Overall scaling is NOT important.

Lecture 6 Stereo Systems Multi-view geometry

CS 664 Structure and Motion. Daniel Huttenlocher

3D Modeling using multiple images Exam January 2008

3D Sensing and Reconstruction Readings: Ch 12: , Ch 13: ,

Camera Drones Lecture 3 3D data generation

CSE 252B: Computer Vision II

Vision 3D articielle Multiple view geometry

Lecture 3: Camera Calibration, DLT, SVD

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

CS 231A Computer Vision (Winter 2015) Problem Set 2

Dense 3D Reconstruction. Christiano Gava

CS 664 Slides #9 Multi-Camera Geometry. Prof. Dan Huttenlocher Fall 2003

Structure from Motion. Prof. Marco Marcon

A Stratified Approach for Camera Calibration Using Spheres

Visual Recognition: Image Formation

Automatic Estimation of Epipolar Geometry

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Projective 2D Geometry

Last lecture. Passive Stereo Spacetime Stereo

Vision par ordinateur

Geometric camera models and calibration

Metric Rectification for Perspective Images of Planes

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

55:148 Digital Image Processing Chapter 11 3D Vision, Geometry

Camera Model and Calibration

3D Geometric Computer Vision. Martin Jagersand Univ. Alberta Edmonton, Alberta, Canada

Index. 3D reconstruction, point algorithm, point algorithm, point algorithm, point algorithm, 263

Today. Stereo (two view) reconstruction. Multiview geometry. Today. Multiview geometry. Computational Photography

Week 2: Two-View Geometry. Padua Summer 08 Frank Dellaert

Humanoid Robotics. Projective Geometry, Homogeneous Coordinates. (brief introduction) Maren Bennewitz

But First: Multi-View Projective Geometry

Computer Vision I Name : CSE 252A, Fall 2012 Student ID : David Kriegman Assignment #1. (Due date: 10/23/2012) x P. = z

Metric Structure from Motion

Transcription:

Computer Vision I - Algorithms and Applications: Multi-View 3D reconstruction Carsten Rother 09/12/2013 Computer Vision I: Multi-View 3D reconstruction

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 2 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

3D reconstruction - Definitions Computer Vision I: Multi-View 3D reconstruction 09/12/2013 3 Sparse Structure from Motion (SfM) SLAM in robotics: Simultaneous Localization and Mapping: Place a robot in an unknown location and in an unknown environment and have the robot incrementally build a map of this environment while simultaneously using this map to compute its location Dense Multi-view reconstruction

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 4 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

First: Two-View reconstruction Computer Vision I: Multi-View 3D reconstruction 09/12/2013 5

Epipolar Geometry (Reminder) Computer Vision I: Robust Two-View Geometry 09/12/2013 6 Fundamental matrix F: p 2 T Fp 1 = 0

2-view reconstruction Computer Vision I: Multi-View 3D reconstruction 09/12/2013 7 We have x T Fx = 0 Can we get P, P such that: x = PX ; x = P X and x T Fx = 0 for all 3D points X Derivation (blackboard) see HZ page 256 P = I 3 3 0 ]; P = e F e ] 3 4 3 4

Derivation Computer Vision I: Multi-View 3D reconstruction 09/12/2013 8 T

Derivation Computer Vision I: Multi-View 3D reconstruction 09/12/2013 9

Triangulation - algebraic Computer Vision I: Multi-View 3D reconstruction 09/12/2013 10 Input: x, x, P, P Output: X s Triangulation is also called intersection Simple algebraic solution: 1) λx = P X and λ x = P X 2) Eliminate λ, λ by taking ratios. This gives 4 linear independent equations for 3 unknowns: X = (X 1, X 2, X 3, X 4 ) where X = 1. An example ratio is: x 1 x 2 = p 1 X 1 +p 2 X 2 +p 3 X 3 +p 4 X 4 p 5 X 1 +p 6 X 2 +p 7 X 3 +p 8 X 4 3) This gives (as usual) a least square optimization problem: A X = 0 with X = 1 where A is of size 4 4. This can be solved in closed-form using SVD.

Triangulation - geometric Computer Vision I: Multi-View 3D reconstruction 09/12/2013 11 Minize re-projection error with Fixed fundamental matrix. ^ ^ ^ ^ ^ ^ x, x = argmin d x, x 2 + d x, x 2 subject to x Fx = 0 x, ^ x ^

Triangulation - geometric Computer Vision I: Multi-View 3D reconstruction 09/12/2013 12 Minimize re-projection error with fixed fundamental matrix. ^ ^ ^ ^ ^ ^ x, x = argmin d x, x 2 + d x, x 2 subject to x Fx = 0 x, ^ x ^ Solution can be expressed as a 6-degree polynomial in t This has up to 6 solutions and can computed (roots of polynomial)

Triangulation - uncertainty Computer Vision I: Multi-View 3D reconstruction 09/12/2013 13 Large baseline Smaller area smaller baseline Larger area Very small baseline Very larger area

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 14 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

3D reconstruction Computer Vision I: Multi-View 3D reconstruction 09/12/2013 15 Given: m cameras and n static 3D points Formally: x ij = P j X i for j = 1 m; i = 1 n Important in practice we do not have all points visible in all views, i.e. Number x ij mn Goal: find all P j s and X i s Example: Visibility matrix

Iterative multi-view reconstruction Method 1 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 16 Three views of an un-calibrated or calibrated camera

Iterative multi-view reconstruction Method 1 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 17 Three views of an un-calibrated or calibrated camera Similiar problem as intersection (and same as camera calibration): λx = P X and λ x = PX with 6 points you get 12 constraints and we have 11 unknowns.

Iterative multi-view reconstruction Method 1 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 18 Three views of an un-calibrated or calibrated camera Similiar problem as intersection (and same as camera calibration): λx = P X and λ x = PX with 6 points you get 12 constraints and we have 11 unknowns. You can iterate between: intersection and re-sectioning to get all points and cameras reconstructed (in projective or metric space)

Iterative multi-view reconstruction Method 2 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 19 Three views of an un-calibrated or calibrated camera Reconstruct Points and Camera 1 and 2

Iterative multi-view reconstruction Method 2 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 20 Three views of an un-calibrated or calibrated camera Zipp the 2 reconstructions together. 1. They share a camera and 7+ points (needed for F-matrix). 2. Get H 4 4 from X 1 7 = HX 1 7 (here points X 1 7 are in second reconstruction and X 1 7 in first reconstruction) Reconstruct Points and Camera 2 and 3 3. Zipp them together: P 3 = P 3 H 1 ; X i = HX i (Here P 3 is camera in second reconstruction) Note:

The full pipeline Computer Vision I: Multi-View 3D reconstruction 09/12/2013 21 Comment: Between 3 and 4 views there exists Trifocal and Quadrifocal tensor (as Fundamental matrix for 2 views) not discussed in this lecture [See page 453 HZ]

Bundle adjustment Computer Vision I: Multi-View 3D reconstruction 09/12/2013 22 Global refinement of jointly structure (points) and cameras Minimize geometric error: argmin {P s,x s} α ij d(p j X i, x ij ) j i here α ij is 1 if X j visible in view P j (otherwise 0) Non-linear optimization with e.g. Levenberg-Marquard

Example Computer Vision I: Multi-View 3D reconstruction 09/12/2013 23

Main Problem of iterative methods is Drift Computer Vision I: Multi-View 3D reconstruction 09/12/2013 24 Solution: 1) look for Loop closure if possible 2) global methods (next)

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 25 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

Reminder: affine cameras (from lecture 4) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 26 Affine camera has 8 DoF: x y 1 = a b c d e f g h 0 0 0 1 X Y Z 1 In short: x = MX + t ~ ~ 2 3 2 1 2 1

Reminder: affine cameras (from lecture 4) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 27 (normal focal length) Parallel 3D lines map to parallel 2D lines (since points stay at infinity) (very large focal length) Close to parallel projection

Multi-View Reconstruction for affine cameras Computer Vision I: Multi-View 3D reconstruction 09/12/2013 29 (derivation on blackboard)

Multi-View Reconstruction for affine cameras Computer Vision I: Multi-View 3D reconstruction 09/12/2013 30 (derivation on blackboard) W MX = 0 W = MX

Comments / Extensions Computer Vision I: Multi-View 3D reconstruction 09/12/2013 31 Main restriction is that all points have to be visible in all views. (can be used for a subset of views and then zipping subviews together) Extensions to missing data have been done (see HZ chapter 18) Extensions to projective cameras have been done (ch. 18.4) Extensions to non-rigidly moving scenes (ch. 18.3)

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 32 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

Direct reference plane approach (DRP) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 33 H = KR is called infinity homography since it is the mapping from plane at infinity to image: x y x x x = KR (I C) ~ z =KR y z = H y z 0 Idea: simply define an arbitrary plane as π = 0,0,0,1 T (this can be done in projective space) [Rother PhD Thesis 2003]

Direct reference plane approach (DRP) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 34 Derivation on blackboard [Rother PhD Thesis 2003]

Result Computer Vision I: Multi-View 3D reconstruction 09/12/2013 35

How to get infinite Homographies Computer Vision I: Multi-View 3D reconstruction 09/12/2013 36 Real Plane in the scene: Fixed / known K and R, e.g. translating camera with fixed camera intrinsics Orthogonal scene directions and a square pixel camera see above we get out: K, R (up to a small ambiguity)

Result: University Stockholm Computer Vision I: Multi-View 3D reconstruction 09/12/2013 37 (Show video)

Half-way slide Computer Vision I: Multi-View 3D reconstruction 09/12/2013 38 3 Minutes break

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 39 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion

Scale ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 40 Is the pumpkin 5 meters or 30 cm high?

Scale ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 41

Structure from Motion Ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 42 We can always write: x = P k k X = 1 k P (kx) It is impossible to recover the absolute scale of the scene!

Projective ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 43 We can write (most general): x ij = P j X i = P j Q 1 QX i = P j X i Q has 15 DoF (projective ambiguity) If we do not have any additional information about the cameras or points then we cannot recover Q. Possible information (we will see details later) known calibration matrix calibration matrix is same for all cameras external constraints: given 3D points

Projective ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 44 3D points map to image points This is a protectively correct reconstruction but not a nice looking one

Affine ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 45 We can write (most general): x ij = P j X i = P j Q 1 QX i = P j X i Q has now 12 DoF (affine ambiguity) With affine cameras we get only affine ambiguity Q leaves the plane at infinity π = 0,0,0,1 T in place: any point on π moves like: Q a, b, c, 0 T = a, b, c, 0 Therefore parallel 3D lines stay parallel for any Q

From Projective to Affine Computer Vision I: Multi-View 3D reconstruction 09/12/2013 46 X 3 X 2 The red directions point to places where the projection of a vanishing point lies. X 1 Step: Take points X 1 3 = x 1 3, y 1 3, z 1 3, 1 T and move to a point at infinity: x 1 3, y 1 3, z 1 3, 0 T. Same as having plane at infinity in its canonical position π = 0,0,0,1 T

Affine ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 47 3D Points at infinity stay at infinity

Similarity Ambiguity (Metric space) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 48 We can write (most general): x ij = P j X i = P j Q 1 QX i = P j X i Q has now 7 DoF (similarity ambiguity) Q preserves angles, ratios of lengths, etc. For visualization purpose this ambiguity is sufficient. We don t need to know what 1m, 1cm, 1mm, etc. means Note, if we don t care about the choice of Q we can set for instance the camera center of first camera to 0.

From Projective/Affine to Metric Computer Vision I: Multi-View 3D reconstruction 09/12/2013 49 Essentially, we need the true camera calibration matrices K (see details later) Known K Steps: (see HZ page 277) One option: with known K operate with essential matrix and get out R,C ~ of all cameras

Similarity Ambiguity Computer Vision I: Multi-View 3D reconstruction 09/12/2013 50

How to upgrade a reconstruction Computer Vision I: Multi-View 3D reconstruction 09/12/2013 51 Illustrating some ways to upgrade from Projective to Affine and Metric (see details in HZ page 270ff and chapter 19) Camera is calibrated Calibration from external constraints Calibration from a mix of in- and external constraints x ij = P j X i = P j Q 1 QX i = P j X i Find plane at infinity and move in canonical position: One of the cameras is affine see HZ page 271) 3 non-collinear 3D vanishing points Calibration from internal constraints, called self/auto calibration Translational motion (HZ page 268)

Projective to Affine Three 3D vanishing points Computer Vision I: Multi-View 3D reconstruction 09/12/2013 52 Goal: Find plane at infinity and move to π = 0,0,0,1 T Possible method 1. Identify three pairs of lines in the 3D reconstruction that have to be parallel, respectively. 2. Compute the three 3D intersection points X 1 3 3. Move X 1 3 somewhere on the plane at infinity π = 0,0,0,1 T. Define the equation: X 1 3 = Q X 1 3 where X 1 3 = 1,0,0,0 T, 0,1,0,0 T, 0,0,1,0 T The red directions point to places where the projection of a vanishing point lies. 4. Compute an Q using SVD 5. Update all points and cameras P j = Q 1 P j ; X i = QX i

Projective to Metric Direct Method Computer Vision I: Multi-View 3D reconstruction 09/12/2013 53 Given: five known 3D points Compute Q: 1) QX i = X i (each 3D point gives 3 linear independent equations) 2) 5 points give 15 equations, enough to compute Q using SVD Upgrade cameras and points: P j = P j Q 1 and X i = QX i

But without external knowledge? Computer Vision I: Multi-View 3D reconstruction 09/12/2013 54 For a camera P = K [I 0] the ray outwards is: x = P X hence X = K 1 x The angle Θ is computed as the normalized rays d 1, d 2 : cos Θ = d 1 T d 2 d 1 T d 1 d 2 T d 2 = K 1 x 1 T K 1 x 2 K 1 x 1 T K 1 x 1 K 1 x 2 T K 1 x 2 = x 1 T ω x 2 x 1 T ωx 1 x 2 T ωx 2 We define the matrix: ω = K T K 1 Comment: (K 1 ) T = (K T ) 1 =: K T

But without external knowledge? Computer Vision I: Multi-View 3D reconstruction 09/12/2013 55 We have: cos Θ = x 1 T ω x 2 x 1 T ωx 1 x 2 T ωx 2 If we were to know ω then we can compute angle Θ K can be derived from ω = K T K 1 using Cholesky decomposition (see HZ page 582) Comment, if Θ = 90 o then we have x 1 T ω x 2 = 0 How do we get ω?

ω is the Image of the absolute Conic (IAC) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 56 ω is called the image of the absolute conic Ω = I 3 3 on the plane at infinity π = 0,0,0,1 T Remember a conic C is defined as: x T C x = 0. Here it is: x, y, 1 Ω x, y, 1 T = 0 That is: x 2 + y 2 = 1 an imaginary circle with radius i the points (i, 0) and (0, i) lie on the conic. H ω image

ω is the Image of the absolute Conic (IAC) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 57 ω is called the image of the absolute conic Ω = I 3 3 on the plane at infinity π = 0,0,0,1 T Proof. H 1. The homography H = KR is the mapping from the pane at infinity to the image plane. Since: ~ x = KR I C] x, y, z, 0 T is x = KR x, y, z T ω image 2. The conic C = Ω = I 3 3 maps from the plane at infinity to π to the image as: ω = H T CH 1 = KR T I KR 1 = K T R T R 1 K 1 = K T K 1 Note, ω depends on K only and not on R, C. Hence it plays a central role in auto-calibration. ~

Degrees of Freedom of ω Computer Vision I: Multi-View 3D reconstruction 09/12/2013 58 We have: K = f s p x 0 mf p y 0 0 1 then K 1 = a b c 0 d e 0 0 1 where a, b, c, d, e are some values that depend on: f, m, s, p x, p y Then it is: ω = (K 1 ) T K 1 = a 0 0 b d 0 c e 1 a b c 0 d e 0 0 1 = a 2 ab ac ab b 2 + d 2 bc + de ac bc + de c 2 + e 2 + 1 = ω 1 ω 2 ω 3 ω 2 ω 4 ω 5 ω 3 ω 5 ω 6 This means that ω has 5 DoF (scale is not unique)

How to compute the IAC? Computer Vision I: Multi-View 3D reconstruction 09/12/2013 59 External constraints: orthogonal directions, etc. f s p x K = 0 mf p y 0 0 1 camera matrix Internal constraints: Square pixels (i.e. m = 1, s = 0 in K) Camera matrix is the same in two or more views (e.g. video sequence with no zooming) If only internal constraints are used then it is called auto/self-calibration.

How to compute the IAC? Computer Vision I: Multi-View 3D reconstruction 09/12/2013 60 We need 5 constraints on ω to determine it uniquely

Example: internal + external constraints Computer Vision I: Multi-View 3D reconstruction 09/12/2013 61 Square pixel cameras (i.e. m = 1, s = 0 in K) gives two constraints: ω 1 = ω 4 and ω 2 = 0 f s p x K = 0 mf p y 0 0 1 camera matrix Then ω = ω 1 0 ω 3 0 ω 1 ω 5 ω 3 ω 5 w 6 with only 3 DoF v 1 Given 3 image points v 1 3 that point to orthogonal directions, respectively: v 1 T ω v 2 = 0; v 1 T ω v 3 = 0; v 2 T ω v 3 = 0 v 2 v 3 This gives an equation system Aω = 0 with A of size 3 4. Hence ω can be obtained with SVD K can be derived from ω using Cholesky decomposition

Example: internal constraints (practically important) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 62 Assume two cameras with: s = 0, and m, p x, p y known The only unknowns are the two focal lengths: f 0, f 1 Then you get the so-called Kruppa equations: u T 1 ω 1 0 u 1 = u 0 T ω 1 0 u 1 = u 0 T ω 1 0 u 0 σ 2 0 v T 0 ω 1 1 v 0 σ 0 σ 1 v T 0 ω 1 1 v 1 σ 2 1 v T 1 ω 1 1 v 1 f s p x K = 0 mf p y 0 0 1 camera matrix where SVD of F = u 0 u 1 e 1 0 σ 1 0 σ 0 0 0 0 0 0 v 0 T v 1 T e 0 T 1 and ω i 1 = (K i T K i 1 ) = K i K i T = diag(f i 2, f i 2, 1) This can be solved in closed form (see next slide)

Sketch of the solution for f 0, f 1 Computer Vision I: Multi-View 3D reconstruction 09/12/2013 63

Example: internal constraints (sketch) Computer Vision I: Multi-View 3D reconstruction 09/12/2013 64 Assume K is constant over 3+ frames then K can be computed We know (lecture 6) we can get K, R, C from P = K R (I 3 3 C) ~ ~ We have already P 1, P 2, P 3 and it is x i1 = P 1 X i = P 1 Q 1 QX i = P 1 X i x i2 = P 2 X i = P 2 Q 1 QX i = P 2 X i x i3 = P 3 X i = P 3 Q 1 QX i = P 3 X i Try to find a Q such that all P 1, P 2, P 3 have the same K but different ~ R 1 3 and C 1 3 See details in chapter 19 HZ

How to upgrade a reconstruction Computer Vision I: Multi-View 3D reconstruction 09/12/2013 65 Illustrating some ways to upgrade from Projective to Affine and Metric (see details in HZ page 270ff and chapter 19) Camera is calibrated Calibration from external constraints (example: 5 known 3D points) Calibration from a mix of in- and external constraints (example: single camera: 3 orthogonal vanishing points and a square pixel camera) Calibration from internal constraints, called self/auto calibration (example: 3+ views with same K; or two cameras with unknown focal length) Find plane at infinity and move in canonical position: One of the cameras is affine. See HZ page 271) 3 non-collinear 3D vanishing points Translational motion (HZ page 268)

Roadmap this lecture Computer Vision I: Multi-View 3D reconstruction 09/12/2013 66 Two-view reconstruction From projective to metric space (e.g. self-calibration) Multi-view reconstruction Iterative projective cameras Closed form: affine cameras Closed form: reference plane Next lecture: dense labeling problems in computer vision: stereo matching ICP, KinectFusion