Feature-based methods for image matching

Similar documents
SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

SIFT: Scale Invariant Feature Transform

SIFT - scale-invariant feature transform Konrad Schindler

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

Feature Based Registration - Image Alignment

Keypoint detection. (image registration, panorama stitching, motion estimation + tracking, recognition )

The SIFT (Scale Invariant Feature

Local Features: Detection, Description & Matching

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Scale Invariant Feature Transform

Local features and image matching. Prof. Xin Yang HUST

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

Scale Invariant Feature Transform

CS 558: Computer Vision 4 th Set of Notes

Scott Smith Advanced Image Processing March 15, Speeded-Up Robust Features SURF

Comparison of Feature Detection and Matching Approaches: SIFT and SURF

Obtaining Feature Correspondences

Implementing the Scale Invariant Feature Transform(SIFT) Method

Implementation and Comparison of Feature Detection Methods in Image Mosaicing

CAP 5415 Computer Vision Fall 2012

Multi-modal Registration of Visual Data. Massimiliano Corsini Visual Computing Lab, ISTI - CNR - Italy

3D Object Recognition using Multiclass SVM-KNN

A Comparison of SIFT, PCA-SIFT and SURF

Local features: detection and description May 12 th, 2015

Instance-level recognition part 2

Instance-level recognition

Automatic Image Alignment (feature-based)

Instance-level recognition II.

Outline 7/2/201011/6/

Local Image Features

AK Computer Vision Feature Point Detectors and Descriptors

Perception IV: Place Recognition, Line Extraction

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Motion illusion, rotating snakes

Instance-level recognition

Stitching and Blending

Edge detection. Gradient-based edge operators

Feature Detection and Matching

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Motion Estimation and Optical Flow Tracking

Key properties of local features

Chapter 3 Image Registration. Chapter 3 Image Registration

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

Feature Detectors and Descriptors: Corners, Lines, etc.

Computer Vision for HCI. Topics of This Lecture

Local features: detection and description. Local invariant features

Lecture 24: Parallel Databases. Wednesday, November 24, 2010

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

EE368/CS232 Digital Image Processing Winter

CS 231A Computer Vision (Winter 2018) Problem Set 3

Keypoint-based Recognition and Object Search

Image Features: Detection, Description, and Matching and their Applications

Requirements for region detection

Local Feature Detectors

Image Features. Work on project 1. All is Vanity, by C. Allan Gilbert,

Image Stitching. Slides from Rick Szeliski, Steve Seitz, Derek Hoiem, Ira Kemelmacher, Ali Farhadi

Lecture 12 Recognition

Harder case. Image matching. Even harder case. Harder still? by Diva Sian. by swashford

Lecture 12 Recognition

3D Environment Reconstruction

Object Recognition with Invariant Features

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Indexing local features and instance recognition May 14 th, 2015

Local Patch Descriptors

Visual Tracking (1) Tracking of Feature Points and Planar Rigid Objects

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Lecture 10 Detectors and descriptors

From Pixels to Information Recent Advances in Visual Search

Object Detection by Point Feature Matching using Matlab

CS4670: Computer Vision

Modern-era mobile phones and tablets

CSE 527: Introduction to Computer Vision

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Computer Vision. Recap: Smoothing with a Gaussian. Recap: Effect of σ on derivatives. Computer Science Tripos Part II. Dr Christopher Town

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich.

CS101 Lecture 12: Image Compression. What You ll Learn Today

2D Image Processing Feature Descriptors

RANSAC and some HOUGH transform

Local Image Features

Local Features Tutorial: Nov. 8, 04

3D Visualization through Planar Pattern Based Augmented Reality

Faster-than-SIFT Object Detection

Local invariant features

Hough Transform and RANSAC

Viewpoint Invariant Features from Single Images Using 3D Geometry

A Comparison of SIFT and SURF

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

Lecture 12 Recognition. Davide Scaramuzza

K-Means Based Matching Algorithm for Multi-Resolution Feature Descriptors

Pictures at an Exhibition

Visual Odometry. Features, Tracking, Essential Matrix, and RANSAC. Stephan Weiss Computer Vision Group NASA-JPL / CalTech

Automatic Panoramic Image Stitching. Dr. Matthew Brown, University of Bath

Eppur si muove ( And yet it moves )

A Novel Algorithm for Color Image matching using Wavelet-SIFT

Transcription:

Feature-based methods for image matching Bag of Visual Words approach Feature descriptors SIFT descriptor SURF descriptor Geometric consistency check Vocabulary tree Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 1

A Bag of Words self-evident Liberty truths happiness endowed inalienable Creator pursuit Life Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 2

Representing a Text as a Bag of Words We hold these truths to be self-evident, that all men are created equal, that they are endowed by their Creator with certain unalienable Rights, that among these are Life, Liberty and the pursuit of Happiness. That to secure these rights, Governments are instituted among Men, deriving their just powers from the consent of the governed, That whenever any Form of Government becomes destructive of these ends, it is the Right of the People to alter or to abolish it, and to institute new Government, laying its foundation on such principles and organizing its powers in such form, as to them shall seem most likely to effect their Safety and Happiness. Prudence, indeed, will dictate that Governments long established should not be changed for light and transient causes; and accordingly all experience hath shewn, that mankind are more disposed to suffer, while evils are sufferable, than to right themselves by abolishing the forms to which they are accustomed. But when a long train of abuses and usurpations, pursuing invariably the same Object evinces a design to reduce them under absolute Despotism, it is their right, it is their duty, to throw off such Government, and to provide new Guards for their future security. self-evident Liberty truths happiness endowed inalienable Creator pursuit Life Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 3

Representing an Image as a Bag of Visual Words Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 4

Feature descriptors Represent local pattern around a keypoint by a vector ( feature descriptor ) Establish feature correspondences by finding the nearest neighbor in descriptor space Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 5

Scale/rotation invariant feature descriptors Scale invariance: extract features at scale provided by keypoint detection Rotation invariance: Detect dominant orientation Average gradient direction Peak detection in gradient direction histogram Rotate coordinate system to dominant orientation Multiple strong orientation peaks: generate second feature point Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 6

SIFT descriptors SIFT - Scale-Invariant Feature Transform [Lowe,1999, 2004] Sample thresholded image gradients at 16x16 locations in scale space (in local coordinate system for rotation and scale invariance) For each of 4x4 subregion, generate orientation histogram with 8 directions each; each observation weighted with magnitude of image gradient and a window function 128-dimensional feature vector Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 7

SURF descriptors n SURF Speeded Up Robust Features [Bay et al. 2006]! n n n Compute horizontal and vertical pixel differences, dx, dy (in local coordinate system for rotation and scale invariance, window size 20σ x 20σ, where σ 2 is feature scale) Sum dx, dy, and dx, dy over 4x4 subregions (SURF-64) or 3x3 subregions (SURF-36)" Normalize vector for gain invariance, but distinguish bright blobs and dark blobs based on sign of Laplacian (trace of Hessian matrix)" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 8

Computing feature descriptors Gray" D xx D xy D xx D yy - (0.9D xy ) 2 D yy Filters" Blob Response" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 9

Computing feature descriptors Gray" D xx D xy D xx D yy - (0.9D xy ) 2 D yy Filters" Blob Response" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 10

Computing feature descriptors Gray" D xx D xy D xx D yy - (0.9D xy ) 2 D yy Filters" Blob Response" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 11

Computing feature descriptors Gray" Dxx Dxy DxxDyy(0.9Dxy)2 Dyy Filters" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 12 Blob Response"

Computing feature descriptors Maxima" y" x" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 13

Computing feature descriptors Orient Along " Dominant Gradient" Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 14 Gradient Field for Oriented Patch"

Computing feature descriptors SURF Descriptor" Σ d x Σ d y Σ d x Σ d y Σ d x Σ d y Σ d x Σ d y " SIFT Descriptor" Σ Σ Σ Σ Σ Σ Σ Σ " d y d x Σ d x Σ d y Σ d x Σ d y Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 15

Bag of Visual Words Matching Pairwise Comparison Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 10

Geometric mapping Notation: Homogeneous coordinates; reference image x = x y 1 Inhomogeneous coordinates; target image x = x y Translation ( ) T ( ) T Euclidean transformation (rotation and translation) cosθ sinθ tx x= sinθ cosθ t x y Scaled rotation (similarity transform) s cosθ s sinθ tx x= s sinθ s cosθ t x y Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 12

Geometric mapping Affine transformation a00 a01 a02 x= a10 a11 a x 12 Motion of planar surface in 3d under orthographic projection Parallel lines are preserved Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 13

Geometric mapping Motion of planar surface in 3d under perspective projection Homography x Inhomogeneous coordinates (after normalization) Straight lines are preserved h h h h h h h h h 00 01 02 10 11 12 20 21 22 x h x+ h y+ h h x+ h y+ h x = y = h x+ h y+ h h x+ h y+ h 00 01 02 10 11 12 20 21 22 20 21 22 Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 14

RANSAC RANdom Sample Consensus [Fischer, Bolles, 1981] Randomly select subset of k correspondences Compute geometric mapping parameters by linear regression Apply geometric mapping to all keypoints Count no. of inliers (closer than from the corresponding keypoint, typical = 1 3 pixels) Repeat process S times, keep geometric maping with largest no. of inliers Required number of trials Total probability of success Probability of valid correspondence Use small number of correspondences Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 15

RANSAC with Affine Model Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 16

RANSAC with Homography Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 17

SURF features & affine RANSAC Pairwise Comparison Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 18

Comparing Feature Histograms Speed up by comparing histograms of features: pairwise image comparison only for similar histograms Histogram intersection Query histogram Histogram of database entry ρ = n min( Q i, D i ) i=1 n D i=1 i [Swain, Ballard 1991] Equivalent to mean absolute difference, if both histograms contain same number of samples Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 20

Growing Vocabulary Tree Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 21 [Nistér and Stewenius, 2006]

Growing Vocabulary Tree Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 22 [Nistér and Stewenius, 2006]

Growing Vocabulary Tree Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 23 [Nistér and Stewenius, 2006]

Growing Vocabulary Tree k = 3 Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 24 [Nistér and Stewenius, 2006]

Growing Vocabulary Tree k = 3 Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 25 [Nistér and Stewenius, 2006]

Querying Vocabulary Tree Query Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 26

Querying: Hard Binning vs. Soft Binning node 1 node 1 d 1 d 1 node 2 query descriptor d d 3 2 node 3 node 2 query d d 3 2 node 3 Hard Binning [Nistér and Stewenius, CVPR 2006] Soft Binning [Philbin et al., CVPR 2008] Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 27

Stanford Mobile Visual Search Dataset Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 28

Stanford Mobile Visual Search Dataset Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 29

Querying: Hard Binning vs. Soft Binning SURF features 6-level tree 1M leaf nodes 3269 query images 100 top tree results Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 30

Digital Image Processing: Bernd Girod, 2013 Stanford University -- Image Matching 31