Augmenting Reality, Naturally:

Similar documents
Object Recognition with Invariant Features

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Scene Modelling, Recognition and Tracking with Invariant Image Features

Stereo and Epipolar geometry

Invariant Features from Interest Point Groups

Augmenting Reality, Naturally

arxiv: v1 [cs.cv] 28 Sep 2018

3D Reconstruction of a Hopkins Landmark

Computational Optical Imaging - Optique Numerique. -- Multiple View Geometry and Stereo --

Instance-level recognition

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Instance-level recognition part 2

Image Features: Detection, Description, and Matching and their Applications

Computational Optical Imaging - Optique Numerique. -- Single and Multiple View Geometry, Stereo matching --

arxiv: v1 [cs.cv] 28 Sep 2018

Instance-level recognition

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

URBAN STRUCTURE ESTIMATION USING PARALLEL AND ORTHOGONAL LINES

Outline. Introduction System Overview Camera Calibration Marker Tracking Pose Estimation of Markers Conclusion. Media IC & System Lab Po-Chen Wu 2

Dense 3D Reconstruction. Christiano Gava

Specular 3D Object Tracking by View Generative Learning

/10/$ IEEE 4048

Global localization from a single feature correspondence

3D object recognition used by team robotto

Dense 3D Reconstruction. Christiano Gava

Feature Based Registration - Image Alignment

Motion Estimation and Optical Flow Tracking

Homographies and RANSAC

Instance-level recognition II.

Viewpoint Invariant Features from Single Images Using 3D Geometry

Eppur si muove ( And yet it moves )

Chapter 3 Image Registration. Chapter 3 Image Registration

Local Features Tutorial: Nov. 8, 04

Structured light 3D reconstruction

Last lecture. Passive Stereo Spacetime Stereo

An Algorithm for Seamless Image Stitching and Its Application

Structure from motion

Stereo Vision. MAN-522 Computer Vision

Fast Natural Feature Tracking for Mobile Augmented Reality Applications

Nonlinear Mean Shift for Robust Pose Estimation

Geometry for Computer Vision

Video Google: A Text Retrieval Approach to Object Matching in Videos

Structure from motion

An Image Based 3D Reconstruction System for Large Indoor Scenes

CS231A Midterm Review. Friday 5/6/2016

Step-by-Step Model Buidling

Patch-based Object Recognition. Basic Idea

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

EECS 442: Final Project

Feature descriptors and matching

Adaptive Zoom Distance Measuring System of Camera Based on the Ranging of Binocular Vision

Multiple View Geometry in Computer Vision Second Edition

CSCI 5980/8980: Assignment #4. Fundamental Matrix

Image correspondences and structure from motion

Improving Initial Estimations for Structure from Motion Methods

CPSC 425: Computer Vision

Stable Structure from Motion for Unordered Image Collections

Camera Drones Lecture 3 3D data generation

3D Reconstruction From Multiple Views Based on Scale-Invariant Feature Transform. Wenqi Zhu

Multiple View Geometry

Factorization Method Using Interpolated Feature Tracking via Projective Geometry

Lecture 10: Multi view geometry

Region Graphs for Organizing Image Collections

3D reconstruction how accurate can it be?

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

3D from Images - Assisted Modeling, Photogrammetry. Marco Callieri ISTI-CNR, Pisa, Italy

Object Reconstruction

Keypoint-based Recognition and Object Search

The SIFT (Scale Invariant Feature

A Systems View of Large- Scale 3D Reconstruction

Image processing and features

Vision-based endoscope tracking for 3D ultrasound image-guided surgical navigation [Yang et al. 2014, Comp Med Imaging and Graphics]

Overview. Augmented reality and applications Marker-based augmented reality. Camera model. Binary markers Textured planar markers

Quasiconvex Optimization for Robust Geometric Reconstruction

BIL Computer Vision Apr 16, 2014

Structure from Motion CSC 767

Intrinsic and Extrinsic Camera Parameter Estimation with Zoomable Camera for Augmented Reality

Tecnologie per la ricostruzione di modelli 3D da immagini. Marco Callieri ISTI-CNR, Pisa, Italy

CS 231A Computer Vision (Winter 2014) Problem Set 3

Estimating Geospatial Trajectory of a Moving Camera

Fundamental Matrices from Moving Objects Using Line Motion Barcodes

Scale Invariant Feature Transform by David Lowe

Absolute Scale Structure from Motion Using a Refractive Plate

Accurate and Dense Wide-Baseline Stereo Matching Using SW-POC

3D FACE RECONSTRUCTION BASED ON EPIPOLAR GEOMETRY

CS 231A Computer Vision (Fall 2012) Problem Set 3

UNIK 4690 Maskinsyn Introduction

Chaplin, Modern Times, 1936

Multiple View Geometry

An Overview of Matchmoving using Structure from Motion Methods

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

Accurate Motion Estimation and High-Precision 3D Reconstruction by Sensor Fusion

HISTOGRAMS OF ORIENTATIO N GRADIENTS

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Epipolar Geometry and Stereo Vision

CS 223B Computer Vision Problem Set 3

Computer Vision I - Appearance-based Matching and Projective Geometry

What have we leaned so far?

Transcription:

Augmenting Reality, Naturally: Scene Modelling, Recognition and Tracking with Invariant Image Features by Iryna Gordon in collaboration with David G. Lowe Laboratory for Computational Intelligence Department of Computer Science University of British Columbia, Canada 1

the highlights automation: acquisition of scene representation camera auto-calibration computer vision scene recognition from arbitrary viewpoints versatility: easy setup unconstrained scene geometry unconstrained camera motion distinctive natural features 2

natural features Scale Invariant Feature Transform (SIFT) characterized by image location, scale, orientation and a descriptor vector invariant to image scale and orientation partially invariant to illumination & viewpoint changes robust to image noise highly distinctive and plentiful David G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004. 3

what the system needs computer + off-the-shelf video camera set of reference images: + - unordered - acquired with a handheld camera - unknown viewpoints - at least 2 images 4

what the system does 5

modelling reality: feature matching best match smallest Euclidean distance between descriptor vectors 2-view matches found via Best-Bin-First (BBF) search on a k-d tree epipolar constraints computed for N -1 image pairs with RANSAC image pairs selected by constructing a spanning tree on the image set: F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets, or How do I organize my holiday snaps?. ECCV, 2002. 6

7 v'j'ju'j'japzyafpzxft Π Π ' j ' j ' j i j i ij Z,,Y X R t X x Euclidean 3D structure & auto-calibration from multi-view matches via direct bundle adjustment: 2 min i j ij ij w j ij x x a ~ modelling reality: scene structure R. Szeliski and Sing Bing Kang. Recovering 3D shape and motion from image streams using non-linear least squares. Cambridge Research, 1993.

modelling reality: an improvement Problem: computation time increases exponentially with the number of unknown parameters trouble converging if the cameras are too far apart (> 90 degrees) Solution: select a subset of images to construct a partial model incrementally update the model by resectioning and triangulation images processed in order automatically determined by the spanning tree 8

This image cannot currently be displayed. modelling reality: object placement initial placement in 2D determining relative depth adjusting size and pose rendered object in reference images 9

camera pose estimation model points appearances in reference images are stored in a k-d tree x ~, 2D-to-3D matches found with RANSAC for each video frame t tj X j camera pose computed via non-linear optimization: x x 2 min w ~ tj tj p t j tj p 2 W p t 2 1t we regularize the solution to reduce virtual jitter iteratively adjusted for each video frame: 2 p p 1tN 2 2 W t 2 N 2 W p 1tt p 2 10

video examples 11

in the future... optimize online computations for real-time performance: SIFT recognition with a frame-to-frame feature tracker introduce multiple feature types: SIFT features with edge-based image descriptors perform further testing: scalability to large environments multiple objects: real and virtual 12

13

thank you! questions? http://www.cs.ubc.ca/~skrypnyk/arproject/ 14

modelling reality: an example 20 input images 010 50 20 iterations: error = = 62.5 4.2 0.2 1.7 pixels 15

registration accuracy ground truth: ARToolKit marker measurement: virtual square stationary camera moving camera moving camera 16