Object Reconstruction

Similar documents
3D Environment Reconstruction

Local Features Tutorial: Nov. 8, 04

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

Mobile Point Fusion. Real-time 3d surface reconstruction out of depth images on a mobile platform

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

CS 231A Computer Vision (Winter 2018) Problem Set 3

Robotics Programming Laboratory

SIFT: Scale Invariant Feature Transform

CAP 5415 Computer Vision Fall 2012

Eppur si muove ( And yet it moves )

Panoramic Image Stitching

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

arxiv: v1 [cs.cv] 28 Sep 2018

Pictures at an Exhibition

The SIFT (Scale Invariant Feature

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Scale Invariant Feature Transform by David Lowe

3D Line Segment Based Model Generation by RGB-D Camera for Camera Pose Estimation

Object Recognition with Invariant Features

10/03/11. Model Fitting. Computer Vision CS 143, Brown. James Hays. Slides from Silvio Savarese, Svetlana Lazebnik, and Derek Hoiem

A New Approach For 3D Image Reconstruction From Multiple Images

Computational Optical Imaging - Optique Numerique. -- Multiple View Geometry and Stereo --

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

Feature Based Registration - Image Alignment

Obtaining Feature Correspondences

CS231A Section 6: Problem Set 3

Image correspondences and structure from motion

Introduction to SLAM Part II. Paul Robertson

Photo by Carl Warner

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

User-assisted Segmentation and 3D Motion Tracking. Michael Fleder Sudeep Pillai Jeremy Scott

CSE 527: Introduction to Computer Vision

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Instance-level recognition

Augmenting Reality, Naturally:

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

arxiv: v1 [cs.cv] 28 Sep 2018

Motion Estimation and Optical Flow Tracking

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

Homographies and RANSAC

Visualization of Temperature Change using RGB-D Camera and Thermal Camera

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

KinectFusion: Real-Time Dense Surface Mapping and Tracking

Mesh from Depth Images Using GR 2 T

Clustering in Registration of 3D Point Clouds

Implementing the Scale Invariant Feature Transform(SIFT) Method

School of Computing University of Utah

Automatic Image Alignment (feature-based)

A Comparison of SIFT and SURF

Comparison of Feature Detection and Matching Approaches: SIFT and SURF

Dynamic 2D/3D Registration for the Kinect

Local Feature Detectors

Instance-level recognition

Personal Navigation and Indoor Mapping: Performance Characterization of Kinect Sensor-based Trajectory Recovery

Key properties of local features

CS 231A Computer Vision (Winter 2014) Problem Set 3

Scale Invariant Feature Transform

Memory Management Method for 3D Scanner Using GPGPU

Srikumar Ramalingam. Review. 3D Reconstruction. Pose Estimation Revisited. School of Computing University of Utah

High-speed Three-dimensional Mapping by Direct Estimation of a Small Motion Using Range Images

3D Reconstruction From Multiple Views Based on Scale-Invariant Feature Transform. Wenqi Zhu

Local Features: Detection, Description & Matching

Scale Invariant Feature Transform

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

Overview. Augmented reality and applications Marker-based augmented reality. Camera model. Binary markers Textured planar markers

3D Visualization through Planar Pattern Based Augmented Reality

Edge and corner detection

Multi-modal Registration of Visual Data. Massimiliano Corsini Visual Computing Lab, ISTI - CNR - Italy

Large Scale 3D Reconstruction by Structure from Motion

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Visualization of Temperature Change using RGB-D Camera and Thermal Camera

A Systems View of Large- Scale 3D Reconstruction

CS 558: Computer Vision 4 th Set of Notes

CS 378: Autonomous Intelligent Robotics. Instructor: Jivko Sinapov

A Comparison of SIFT, PCA-SIFT and SURF

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

Dual Back-to-Back Kinects for 3-D Reconstruction

3D PLANE-BASED MAPS SIMPLIFICATION FOR RGB-D SLAM SYSTEMS

Lecture 10 Detectors and descriptors

Rigid ICP registration with Kinect

Stereo and Epipolar geometry

Outline 7/2/201011/6/

Lecture: RANSAC and feature detectors

A New Representation for Video Inspection. Fabio Viola

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Methods for Representing and Recognizing 3D objects

ICP and 3D-Reconstruction

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Outline. 1 Why we re interested in Real-Time tracking and mapping. 3 Kinect Fusion System Overview. 4 Real-time Surface Mapping

Visual Tracking (1) Tracking of Feature Points and Planar Rigid Objects

TA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999)

Geometric Reconstruction Dense reconstruction of scene geometry

3D object recognition used by team robotto

A Real-Time RGB-D Registration and Mapping Approach by Heuristically Switching Between Photometric And Geometric Information

FOREGROUND DETECTION ON DEPTH MAPS USING SKELETAL REPRESENTATION OF OBJECT SILHOUETTES

Patch-based Object Recognition. Basic Idea

Local features and image matching. Prof. Xin Yang HUST

Accurate Motion Estimation and High-Precision 3D Reconstruction by Sensor Fusion

Computer Vision for HCI. Topics of This Lecture

Transcription:

B. Scholz Object Reconstruction 1 / 39 MIN-Fakultät Fachbereich Informatik Object Reconstruction Benjamin Scholz Universität Hamburg Fakultät für Mathematik, Informatik und Naturwissenschaften Fachbereich Informatik Technische Aspekte Multimodaler Systeme 11.05.2017

B. Scholz Object Reconstruction 2 / 39 Table of Contents 1. Introduction Motivation 2. Object Reconstruction using a Camera Structure from Motion Collecting Images Feature Detection (SIFT) Matching Features from Different Images Bundle Adjustment 3. Object Reconstruction using RGB-D Iterative Closest Point (ICP) 4. 3D Object Representations

B. Scholz Object Reconstruction 3 / 39 Motivation Reconstructing a real-world object Using vision-sensors to achieve this Fields where this is used: Robotics Medicine Movies/Video Games Archaeology

B. Scholz Object Reconstruction 4 / 39 Motivation From Multi-View Normal Field Integration for 3D Reconstruction of Mirroring Objects by Michael Weinmann, Aljosa Osep, Roland Ruiters, and Reinhard Klein

B. Scholz Object Reconstruction 5 / 39 Structure from Motion Multiple images Different angles (motion) Using RGB-camera

B. Scholz Object Reconstruction 6 / 39 Steps 1. Collecting images 2. Feature detection (SIFT) 3. Feature Matching 4. Filtering (RANSAC) 5. Metric reconstruction (point cloud) 6. Final object reconstruction

B. Scholz Object Reconstruction 7 / 39 Collecting Images Taking pictures from different angles by hand Gathering images from the Internet Robotic arm Drones

B. Scholz Object Reconstruction 8 / 39 Feature Detection (SIFT) SIFT feature matching. From CAP 5415 Computer Vision Fall 2012 - Lecture 5 by Dr. Mubarak Shah

B. Scholz Object Reconstruction 9 / 39 Feature Detection (SIFT) Scale Invariant Feature Transform Invariance to scale and rotation Produces highly distinctive features Robust to occlusion, clutter or noise

B. Scholz Object Reconstruction 10 / 39 SIFT - Scale-Space Extrema Detection Scale space representation at different scales. Retrieved from https://en.wikipedia.org/wiki/scale_space

B. Scholz Object Reconstruction 11 / 39 SIFT - Scale-Space Extrema Detection Using Gaussian Filter L(x, y, σ) = G(x, y, σ) I(x, y) Different values of σ result in different scales Difference of scales D(x, y, σ) = L(x, y, kσ) L(x, y, σ)

B. Scholz Object Reconstruction 12 / 39 SIFT - Scale-Space Extrema Detection Building octaves. From Distinctive Image Features from Scale-Invariant Keypoints by David G. Lowe

B. Scholz Object Reconstruction 13 / 39 SIFT - Scale-Space Extrema Detection Local extrema detection. From Distinctive Image Features from Scale-Invariant Keypoints by David G. Lowe

B. Scholz Object Reconstruction 14 / 39 SIFT - Keypoint Localization and Orientation Assignment Removing low-contrast keypoints Removing unstable features on edges Using gradient to detect edges Orientation assignment Gradient magnitude and orientation are computed using pixel differences 36 bins for orientations from 0 to 360

B. Scholz Object Reconstruction 15 / 39 SIFT - The Local Image Descriptor Sample gradient magnitude and orientations around keypoint Achieving scale invariance Achieving rotation invariance Gaussian weighting function to assign importance

B. Scholz Object Reconstruction 16 / 39 SIFT - The Local Image Descriptor From Distinctive Image Features from Scale-Invariant Keypoints by David G. Lowe

B. Scholz Object Reconstruction 17 / 39 Matching Features from Different Images Nearest neighbor matching Creates mismatches Method needed for finding correct correspondences

B. Scholz Object Reconstruction 18 / 39 RANSAC Random Sample Consensus 1. Select random subset of data 2. Fit model to subset 3. Count inliers 4. Repeat Highest number of inliers

B. Scholz Object Reconstruction 19 / 39 RANSAC - Example From Stanford CS231A, Computer Vision: from 3D reconstruction to recognition, Winter 2015 lecture 9 by Prof. Silvio Savarese

B. Scholz Object Reconstruction 20 / 39 RANSAC - Example From Stanford CS231A, Computer Vision: from 3D reconstruction to recognition, Winter 2015 lecture 9 by Prof. Silvio Savarese

B. Scholz Object Reconstruction 21 / 39 RANSAC - Example From CS 390: ST in Computer Science: Image and Video Understanding, Matching SIFT by Hao Jiang

B. Scholz Object Reconstruction 22 / 39 RANSAC - Example From CS 390: ST in Computer Science: Image and Video Understanding, Matching SIFT by Hao Jiang

B. Scholz Object Reconstruction 23 / 39 RANSAC - Example From CS 390: ST in Computer Science: Image and Video Understanding, Matching SIFT by Hao Jiang

B. Scholz Object Reconstruction 24 / 39 Matching Features from Different Images Stitching images together Still no 3D structure Rough estimation of camera location Using robots advantageous

B. Scholz Object Reconstruction 25 / 39 Bundle Adjustment Features form point-cloud Refining 3D structure, camera motion Complex problem

B. Scholz Object Reconstruction 26 / 39 Bundle Adjustment Minimizing reprojection error From 3D Models from the Black Box: Investigating the Current State of Image-Based Modeling by Hoang Minh Nguyen, Burkhard Wünsche, Patrice Delmas and Christof Lutteroth

B. Scholz Object Reconstruction 27 / 39 Bundle Adjustment Considers all images/cameras From Lecture 6: Multi-view Stereo & Structure from Motion by Prof. Rob Fergus. Retrieved from http://cs.nyu.edu/ fergus/teaching/vision/11_12_multiview.pdf

B. Scholz Object Reconstruction 28 / 39 Bundle Adjustment From Photogrammetric Computer Vision, Lecture Notes by Volker Rodehorst

B. Scholz Object Reconstruction 29 / 39 Object Reconstruction using RGB-D RGB-D cameras like Kinect provide depth-data, so it does not have to be computed Produces Point-Clouds We can omit the step of feature detection Approximating camera location becomes easier

B. Scholz Object Reconstruction 30 / 39 Iterative Closest Point (ICP) Matching two corresponding point clouds P = p 1,..., p n and Q = q 1,..., q n Minimizing sum of squared error E(R, t) = 1 n n i=1 p i Rq i t 2 Here R is the rotation matrix and t is the translation vector

B. Scholz Object Reconstruction 31 / 39 Iterative Closest Point (ICP) If correspondences of points are known we can compute the solution in closed form From Iterative Closest Point, presented by M.T. Hajiaghayi. Retrieved from http://groups.csail.mit.edu/graphics/classes/6.838/f01/lectures/iterativealgs/icp/align.html

B. Scholz Object Reconstruction 32 / 39 Iterative Closest Point (ICP) Calculate center of mass of both point clouds and subtract it from each point: p i = p i µ p and q i = q i µ q W = N p i=1 p i q T i Using single value decomposition on W gives us the singular values σ 1, σ 2 and σ 3 of W as well as U and V The optimal solution is now: R = UV T, t = µ p Rµ q With minimal error of E(R, t) = N p i=1 ( p i 2 + q i 2 ) 2(σ 1 + σ 2 + σ 3 )

B. Scholz Object Reconstruction 33 / 39 Iterative Closest Point (ICP) Problem: We usually do not know the correspondences Assume correspondences Using method iteratively

https://taylorwang.wordpress.com/2012/04/06/iterative-closest-point-algorithm-point-cloudmesh-registration/ B. Scholz Object Reconstruction 34 / 39 Iterative Closest Point (ICP) From Iterative Closest Point algorithm-point cloud/mesh registration by Taylor Wang. Retrieved from

B. Scholz Object Reconstruction 35 / 39 Iterative Closest Point (ICP) Result: large point cloud Representation of object Inferring camera locations Applications working in real-time are feasible (KinectFusion)

B. Scholz Object Reconstruction 36 / 39 3D Object Representations Point cloud 3D mesh Truncated Signed Distance Function (TSDF)

B. Scholz Object Reconstruction 37 / 39 3D Object Representations - 3D Mesh Retrieved from http://www.cmap.polytechnique.fr/ peyre/geodesic_computations/

B. Scholz Object Reconstruction 38 / 39 References [1] M. A. Fischler and R. C. Bolles. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6):381 395, 1981. [2] K. Häming and G. Peters. The structure-from-motion reconstruction pipeline a survey with focus on short image sequences. Kybernetika, 46(5):926 937, 2010. [3] D. G. Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2):91 110, 2004.

B. Scholz Object Reconstruction 39 / 39 References (cont.) [4] R. A. Newcombe, S. Izadi, O. Hilliges, D. Molyneaux, D. Kim, A. J. Davison, P. Kohi, J. Shotton, S. Hodges, and A. Fitzgibbon. Kinectfusion: Real-time dense surface mapping and tracking. In Mixed and augmented reality (ISMAR), 2011 10th IEEE international symposium on, pages 127 136. IEEE, 2011. [5] O. Özyesil, V. Voroninski, R. Basri, and A. Singer. A survey on structure from motion. CoRR, abs/1701.08493, 2017. [6] S. Rusinkiewicz and M. Levoy. Efficient variants of the icp algorithm. In 3-D Digital Imaging and Modeling, 2001. Proceedings. Third International Conference on, pages 145 152. IEEE, 2001.