Advanced Digital Photography and Geometry Capture. Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Similar documents
Advanced Digital Photography and Geometry Capture. Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Advanced Digital Sensors/Photography Geometry Capture Lecture #10 September 21, 2017 Donald P. Greenberg

Amateur photography was once largely a personal RECONSTRUCTING ROME

Chaplin, Modern Times, 1936

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Camera Calibration. COS 429 Princeton University

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Visual Imaging in the Electronic Age Assignment #3 Draft Geometry Capture

Homographies and RANSAC

Image correspondences and structure from motion

Large Scale 3D Reconstruction by Structure from Motion

Ninio, J. and Stevens, K. A. (2000) Variations on the Hermann grid: an extinction illusion. Perception, 29,

Introduction to 3D Machine Vision

CS6670: Computer Vision

Final project bits and pieces

Epipolar Geometry and Stereo Vision

Depth Sensors Kinect V2 A. Fornaser

Multi-view stereo. Many slides adapted from S. Seitz

CS4670: Computer Vision

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

CS5670: Computer Vision

3D Photography: Active Ranging, Structured Light, ICP

A Systems View of Large- Scale 3D Reconstruction

3D Photography: Stereo

What have we leaned so far?

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

CS5670: Computer Vision

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

Lecture 10: Multi view geometry

1 Projective Geometry

COMPARISON OF LASER SCANNING, PHOTOGRAMMETRY AND SfM-MVS PIPELINE APPLIED IN STRUCTURES AND ARTIFICIAL SURFACES

arxiv: v1 [cs.cv] 28 Sep 2018

Image Based Reconstruction II

Miniature faking. In close-up photo, the depth of field is limited.

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

CS223b Midterm Exam, Computer Vision. Monday February 25th, Winter 2008, Prof. Jana Kosecka

Noah Snavely Steven M. Seitz. Richard Szeliski. University of Washington. Microsoft Research. Modified from authors slides

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE

Multiple View Geometry

Advanced Digital Photography Geometry and Motion Capture Lecture #11 September 29, 2015 Donald P. Greenberg

International Journal for Research in Applied Science & Engineering Technology (IJRASET) A Review: 3D Image Reconstruction From Multiple Images

Photo Tourism: Exploring Photo Collections in 3D

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

Dense 3D Reconstruction. Christiano Gava

Image stitching. Digital Visual Effects Yung-Yu Chuang. with slides by Richard Szeliski, Steve Seitz, Matthew Brown and Vaclav Hlavac

Photo Tourism: Exploring Photo Collections in 3D

BIL Computer Vision Apr 16, 2014

Photo Tourism: Exploring Photo Collections in 3D

Structure from motion

Structure from Motion

The raycloud A Vision Beyond the Point Cloud

CS 4758: Automated Semantic Mapping of Environment

I N T R O D U C T I O N T O C O M P U T E R G R A P H I C S

High Definition Modeling of Calw, Badstrasse and its Google Earth Integration

Step-by-Step Model Buidling

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

Scene Modeling for a Single View

Stereo and Epipolar geometry

3D Fusion of Infrared Images with Dense RGB Reconstruction from Multiple Views - with Application to Fire-fighting Robots

Dense 3D Reconstruction. Christiano Gava

Why is computer vision difficult?

Models and The Viewing Pipeline. Jian Huang CS456

Scene Modeling for a Single View

Lecture 8.2 Structure from Motion. Thomas Opsahl

L2 Data Acquisition. Mechanical measurement (CMM) Structured light Range images Shape from shading Other methods

CS4670: Computer Vision

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Local Features: Detection, Description & Matching

CS4495/6495 Introduction to Computer Vision

Structure from motion

3D shape from the structure of pencils of planes and geometric constraints

Image-Based Rendering

3D Object Representations. COS 526, Fall 2016 Princeton University

Structure from Motion

Single-view 3D Reconstruction

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Scene Modeling for a Single View

Computer Vision Lecture 17

Lecture 10: Multi-view geometry

Computer Vision I - Appearance-based Matching and Projective Geometry

Shadows in the graphics pipeline

Computer Vision Lecture 17

Structure from Motion CSC 767

Today s lecture. Image Alignment and Stitching. Readings. Motion models

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

Instance-level recognition

lecture 10 - depth from blur, binocular stereo

Sensing Deforming and Moving Objects with Commercial Off the Shelf Hardware

Stereo vision. Many slides adapted from Steve Seitz

Overview of Active Vision Techniques

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Local Feature Detectors

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Modeling 3D viewpoint for part-based object recognition of rigid objects

Ceilbot vision and mapping system

A Qualitative Analysis of 3D Display Technology

Transcription:

Advanced Digital Photography and Geometry Capture Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Eye of a Fly AWARE-2 Duke University http://www.nanowerk.com/spotlight/spotid=3744.php

AWARE-2 http://mosaic.disp.duke.edu:90/aware/image_list/image_list/public

AWARE-2 online demo http://www.gigapan.com/galleries/10940/gigapans/140282

Insect-Eye Camera MAVs http://spectrum.ieee.org/robotics/robotics-hardware/insecteye-camera-offers-wideangle-vision-for-tiny-drones

Sony s Curved Sensor 2014

Sony s Curved Sensor 2014

Additional Uses of Depth Imaging Gestural Interfaces Cheap 3D scanners Motion capture Bio-metrics Manipulating 3D models

Canon s 250-megapixel camera sensor 09/08/15 Can read letters on an airplane 11 miles away!

Al Molnar, Cornell University John Wallace, LaserFocusWorld, 4/3/2012.

Al Molnar, Cornell University

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods Time of Flight

Camera Definition Camera View direction Model The camera location, view direction, and frustum must be defined relative to the object.

Viewing Frustum - User Parameters Location and Orientation (vectors): Eye location of eye point At a point along view direction vector Up a vector defining the Y axis on the hither plane fov Up Shape (scalars): Eye fov field of view angle aspect view plane aspect ratio hither perpendicular distance to hither plane yon perpendicular distance to yon plane hither yon At aspect = width / height

In general, what are the unknown variables? Unknown observe position- Xe, Ye, Ze Unknown viewer direction- Φ, θ, Ψ Unknown focal length- f

Simple case Known camera positions (x e, y e, z e ), camera optics, Identify corresponding points in each image. Jeremiah Fairbank. View dependent perspective images. Master's thesis, Cornell University, August 2005.

Early Work - 1975

Sagan House

Autodesk 123 Catch

1 2 3 Catch Autodesk

Capturing Geometry from Photographs Can we reconstruct the 3D geometry from an arbitrary set of photographs?

Reconstructing Rome 1 The advent of digital photography and the recent growth of photo-sharing websites ( ) have brought about the seismic change in photography and the use of photo collections. 1 A search for the word Rome on returns two million photos. This collection, or others like it, capture every popular site, facade, statue, fountain, interior, café, etc.

Characteristics of Typical Photo Sets The photos are unstructured No particular order or distribution of camera viewpoints The photos are uncalibrated Nothing is known about the camera settings (exposure, focal length, etc.) The scale is enormous (millions, not thousands of photos) and We need to do this fast!

Correspondence and 3D Structure from Different Camera Positions Note: The pictures are in correspondence 2D dots with same color correspond to the same 3D points.

3D Structure from Different Camera Positions Camera 1 Camera 2 Camera 3

3D Structure from Different Camera Positions Camera 3 Camera 1 Camera 2 Assuming the position of the red dot is known, there is reprojection error in Camera 3.

Change the Problem to an optimization problem Minimize the sum of the squares of the reprojection errors. This non-linear least squares problem is difficult to solve due to local minima and maxima. Authors selectively started with a few choice cameras and points and grew scenes incrementally (a process known as bundle adjustment).

Trevi Fountain, Rome Italy http://en.wikipedia.org/wiki/trevi_fountain

Feature Detection and Matching The position and orientation of scale-invariant feature transform (SIFT) features on an image of the Trevi Fountain. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Feature Detection and Matching A track corresponding to a point on the face of the central statue of Oceanus at the Trevi Fountain, the embodiment of a river encircling the world in Greek mythology. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Colosseum The Colosseum (Rome) Reconstructed dense 3D point models. For places with many available images, reconstruction quality is very high. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Cornell Campus, McGraw Hall - Noah Snavely

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods

Cyberware Scanner Diagram

Cyberware Scanner

Uncle Don

Cyberware vs. 1 2 3 Catch

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods

Microsoft s Kinect Making Things See, Greg Borenstein

Structured Light Imaging (Kinect) Kinect uses a spatial pseudo random neighborhood pattern with unique coding with different sized dots.

Kinect speckle pattern Near region (0.8 1.2m) Small dots Middle region (1.2 2.0m) Far region (2.0 3.5 m) Large dots

Point Cloud Drawn from the Kinect s 3D data

Depth Image The gray scale indicates the depth of the object for each pixel. Black indicates no depth information because closer objects obscure those that are further away.

Kinect: Depth Image and Real Image

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods Time of Flight

Pulsed Modulation

Matterport

Matterport

Matterport

Floored http://labs.floored.com/clients/mrp-realty/900-g-roof/

end