Advanced Digital Photography and Geometry Capture. Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Similar documents
Advanced Digital Photography and Geometry Capture. Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Advanced Digital Sensors/Photography Geometry Capture Lecture #10 September 21, 2017 Donald P. Greenberg

Amateur photography was once largely a personal RECONSTRUCTING ROME

Chaplin, Modern Times, 1936

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Visual Imaging in the Electronic Age Assignment #3 Draft Geometry Capture

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Fundamental matrix. Let p be a point in left image, p in right image. Epipolar relation. Epipolar mapping described by a 3x3 matrix F

Camera Calibration. COS 429 Princeton University

Image correspondences and structure from motion

Homographies and RANSAC

Depth Sensors Kinect V2 A. Fornaser

Ninio, J. and Stevens, K. A. (2000) Variations on the Hermann grid: an extinction illusion. Perception, 29,

Introduction to 3D Machine Vision

Final project bits and pieces

CS4670: Computer Vision

Large Scale 3D Reconstruction by Structure from Motion

CS6670: Computer Vision

Epipolar Geometry and Stereo Vision

Introduction to Computer Vision. Srikumar Ramalingam School of Computing University of Utah

Multi-view stereo. Many slides adapted from S. Seitz

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

CS5670: Computer Vision

A Systems View of Large- Scale 3D Reconstruction

Lecture 10: Multi view geometry

3D Photography: Active Ranging, Structured Light, ICP

3D Fusion of Infrared Images with Dense RGB Reconstruction from Multiple Views - with Application to Fire-fighting Robots

3D Photography: Stereo

Image Based Reconstruction II

Miniature faking. In close-up photo, the depth of field is limited.

What have we leaned so far?

arxiv: v1 [cs.cv] 28 Sep 2018

There are many cues in monocular vision which suggests that vision in stereo starts very early from two similar 2D images. Lets see a few...

Advanced Digital Photography Geometry and Motion Capture Lecture #11 September 29, 2015 Donald P. Greenberg

SIMPLE ROOM SHAPE MODELING WITH SPARSE 3D POINT INFORMATION USING PHOTOGRAMMETRY AND APPLICATION SOFTWARE

Multiple View Geometry

International Journal for Research in Applied Science & Engineering Technology (IJRASET) A Review: 3D Image Reconstruction From Multiple Images

CS5670: Computer Vision

1 Projective Geometry

COMPARISON OF LASER SCANNING, PHOTOGRAMMETRY AND SfM-MVS PIPELINE APPLIED IN STRUCTURES AND ARTIFICIAL SURFACES

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

CS223b Midterm Exam, Computer Vision. Monday February 25th, Winter 2008, Prof. Jana Kosecka

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

BIL Computer Vision Apr 16, 2014

CS 4758: Automated Semantic Mapping of Environment

Step-by-Step Model Buidling

Why is computer vision difficult?

Stereo and Epipolar geometry

Camera Calibration. Schedule. Jesus J Caban. Note: You have until next Monday to let me know. ! Today:! Camera calibration

Photo Tourism: Exploring Photo Collections in 3D

Dense 3D Reconstruction. Christiano Gava

Image stitching. Digital Visual Effects Yung-Yu Chuang. with slides by Richard Szeliski, Steve Seitz, Matthew Brown and Vaclav Hlavac

Structure from motion

Structure from Motion

Noah Snavely Steven M. Seitz. Richard Szeliski. University of Washington. Microsoft Research. Modified from authors slides

The raycloud A Vision Beyond the Point Cloud

I N T R O D U C T I O N T O C O M P U T E R G R A P H I C S

High Definition Modeling of Calw, Badstrasse and its Google Earth Integration

3D Computer Vision. Depth Cameras. Prof. Didier Stricker. Oliver Wasenmüller

Local Features: Detection, Description & Matching

Dense 3D Reconstruction. Christiano Gava

All human beings desire to know. [...] sight, more than any other senses, gives us knowledge of things and clarifies many differences among them.

Models and The Viewing Pipeline. Jian Huang CS456

Photo Tourism: Exploring Photo Collections in 3D

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Computer Vision Lecture 17

Photo Tourism: Exploring Photo Collections in 3D

Computer Vision Lecture 17

Instance-level recognition

L2 Data Acquisition. Mechanical measurement (CMM) Structured light Range images Shape from shading Other methods

Lecture 8.2 Structure from Motion. Thomas Opsahl

CS4670: Computer Vision

Stereo Epipolar Geometry for General Cameras. Sanja Fidler CSC420: Intro to Image Understanding 1 / 33

Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) Readings Szeliski, Chapter 10 (through 10.5)

Stereo vision. Many slides adapted from Steve Seitz

CS4495/6495 Introduction to Computer Vision

Structure from motion

Image-Based Rendering

3D Object Representations. COS 526, Fall 2016 Princeton University

3D shape from the structure of pencils of planes and geometric constraints

Structure from Motion

CSE 4392/5369. Dr. Gian Luca Mariottini, Ph.D.

Single-view 3D Reconstruction

Scene Modeling for a Single View

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Lecture 10: Multi-view geometry

Shadows in the graphics pipeline

Structure from Motion CSC 767

Modeling 3D viewpoint for part-based object recognition of rigid objects

Computer Vision I - Appearance-based Matching and Projective Geometry

Instance-level recognition

Think-Pair-Share. What visual or physiological cues help us to perceive 3D shape and depth?

Scene Modeling for a Single View

CS 2770: Intro to Computer Vision. Multiple Views. Prof. Adriana Kovashka University of Pittsburgh March 14, 2017

Binocular stereo. Given a calibrated binocular stereo pair, fuse it to produce a depth image. Where does the depth information come from?

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

lecture 10 - depth from blur, binocular stereo

Sensing Deforming and Moving Objects with Commercial Off the Shelf Hardware

Overview of Active Vision Techniques

Project 4 Results. Representation. Data. Learning. Zachary, Hung-I, Paul, Emanuel. SIFT and HoG are popular and successful.

Transcription:

Advanced Digital Photography and Geometry Capture Visual Imaging in the Electronic Age Lecture #10 Donald P. Greenberg September 24, 2015

Eye of a Fly AWARE-2 Duke University http://www.nanowerk.com/spotlight/spotid=3744.php

Insect-Eye Camera MAVs http://spectrum.ieee.org/robotics/robotics-hardware/insecteye-camera-offers-wideangle-vision-for-tiny-drones

Sony s Curved Sensor 2014

Canon s 250-megapixel camera sensor 09/08/15 Can read letters on an airplane 11 miles away!

Al Molnar, Cornell University John Wallace, LaserFocusWorld, 4/3/2012.

Al Molnar, Cornell University

Depth Imaging Original Goals Uses for: Military Defense Security and surveillance

Additional Uses of Depth Imaging Gestural Interfaces Cheap 3D scanners Motion capture Bio-metrics Face recognition Gait analysis Skeletonization Manipulating 3D models

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods Time of Flight

Camera Definition Camera View direction Model The camera location, view direction, and frustum must be defined relative to the object.

Viewing Frustum - User Parameters Location and Orientation (vectors): Eye location of eye point At a point along view direction vector Up a vector defining the Y axis on the hither plane fov Up Shape (scalars): Eye fov field of view angle aspect view plane aspect ratio hither perpendicular distance to hither plane yon perpendicular distance to yon plane hither yon At aspect = width / height

In general, what are the unknown variables? Unknown observe position- Xe, Ye, Ze Unknown viewer direction- Φ, θ, Ψ Unknown focal length- f

Simple case Known camera positions (x e, y e, z e ), camera optics, Identify corresponding points in each image. Jeremiah Fairbank. View dependent perspective images. Master's thesis, Cornell University, August 2005.

Early Work - 1975

Sagan House

1 2 3 Catch Autodesk

Autodesk s Remake 2016 Daan Claessen

Capturing Geometry from Photographs Can we reconstruct the 3D geometry from an arbitrary set of photographs?

Reconstructing Rome 1 The advent of digital photography and the recent growth of photo-sharing websites ( ) have brought about the seismic change in photography and the use of photo collections. 1 A search for the word Rome on returns two million photos. This collection, or others like it, capture every popular site, facade, statue, fountain, interior, café, etc.

Characteristics of Typical Photo Sets The photos are unstructured No particular order or distribution of camera viewpoints The photos are uncalibrated Nothing is known about the camera settings (exposure, focal length, etc.) The scale is enormous (millions, not thousands of photos) and We need to do this fast!

Correspondence and 3D Structure from Different Camera Positions Note: The pictures are in correspondence 2D dots with same color correspond to the same 3D points.

3D Structure from Different Camera Positions Camera 1 Camera 2 Camera 3

3D Structure from Different Camera Positions Camera 3 Camera 1 Camera 2 Assuming the position of the red dot is known, there is reprojection error in Camera 3.

Change the Problem to an optimization problem Minimize the sum of the squares of the reprojection errors. This non-linear least squares problem is difficult to solve due to local minima and maxima. Authors selectively started with a few choice cameras and points and grew scenes incrementally (a process known as bundle adjustment).

Trevi Fountain, Rome Italy http://en.wikipedia.org/wiki/trevi_fountain

Feature Detection and Matching The position and orientation of scale-invariant feature transform (SIFT) features on an image of the Trevi Fountain. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Feature Detection and Matching A track corresponding to a point on the face of the central statue of Oceanus at the Trevi Fountain, the embodiment of a river encircling the world in Greek mythology. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Colosseum The Colosseum (Rome) Reconstructed dense 3D point models. For places with many available images, reconstruction quality is very high. Sameer Agarwal, Yasutaka Furukawa, Naoh Snavely, Brian Curless, Steve M. Seitz, Richard Szeliski. Reconstructing Rome, IEEE Computer, June 2010.

Cornell Campus, McGraw Hall - Noah Snavely

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods

Cyberware Scanner Diagram

Cyberware Scanner

Uncle Don

Cyberware vs. 1 2 3 Catch

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods

Microsoft s Kinect Making Things See, Greg Borenstein

Structured Light Imaging (Kinect) Kinect uses a spatial pseudo random neighborhood pattern with unique coding with different sized dots.

Kinect speckle pattern Near region (0.8 1.2m) Small dots Middle region (1.2 2.0m) Far region (2.0 3.5 m) Large dots

Point Cloud Drawn from the Kinect s 3D data

Depth Image The gray scale indicates the depth of the object for each pixel. Black indicates no depth information because closer objects obscure those that are further away.

Kinect: Depth Image and Real Image

Digital Geometry Capture Photographic methods Laser scanning Pattern projection methods Time of Flight

Pulsed Modulation

Matterport

Matterport

Matterport 2016

Matterport

Floored http://labs.floored.com/clients/mrp-realty/900-g-roof/

end