Vision and Image Processing Lab., CRV Tutorial day- May 30, 2010 Ottawa, Canada

Similar documents
Local invariant features

IMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION. Maral Mesmakhosroshahi, Joohee Kim

Local Image Features

Computer Vision for HCI. Topics of This Lecture

Outline 7/2/201011/6/

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions

Local Image Features

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

Scale Invariant Feature Transform

Prof. Feng Liu. Spring /26/2017

Evaluation and comparison of interest points/regions

Scale Invariant Feature Transform

Motion illusion, rotating snakes

SIFT - scale-invariant feature transform Konrad Schindler

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Evaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition

Local features: detection and description. Local invariant features

An Evaluation of Volumetric Interest Points

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

Feature Based Registration - Image Alignment

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari

The SIFT (Scale Invariant Feature

Motion Estimation and Optical Flow Tracking

Corner Detection. GV12/3072 Image Processing.

Problems with template matching

Bag of Optical Flow Volumes for Image Sequence Recognition 1

CS229: Action Recognition in Tennis

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

Image Features: Detection, Description, and Matching and their Applications

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

Image Features. Work on project 1. All is Vanity, by C. Allan Gilbert,

Lecture 10 Detectors and descriptors

AK Computer Vision Feature Point Detectors and Descriptors

DETECTORS AND DESCRIPTORS FOR PHOTOGRAMMETRIC APPLICATIONS

Video Google: A Text Retrieval Approach to Object Matching in Videos

Velocity adaptation of space-time interest points

Multi-modal Registration of Visual Data. Massimiliano Corsini Visual Computing Lab, ISTI - CNR - Italy

CS 556: Computer Vision. Lecture 3

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

Local features and image matching. Prof. Xin Yang HUST

Patch-based Object Recognition. Basic Idea

MoSIFT: Recognizing Human Actions in Surveillance Videos

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

A Tutorial on VLFeat

Local features: detection and description May 12 th, 2015

Local Image Features

Scale Invariant Feature Transform by David Lowe

CAP 5415 Computer Vision Fall 2012

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Chapter 2 Action Representation

Local Features: Detection, Description & Matching

ECE Digital Image Processing and Introduction to Computer Vision

Computer Vision. Recap: Smoothing with a Gaussian. Recap: Effect of σ on derivatives. Computer Science Tripos Part II. Dr Christopher Town

Advanced Video Content Analysis and Video Compression (5LSH0), Module 4

Implementation and Comparison of Feature Detection Methods in Image Mosaicing

Lecture 18: Human Motion Recognition

A NEW FEATURE BASED IMAGE REGISTRATION ALGORITHM INTRODUCTION

Structure Guided Salient Region Detector

CS 556: Computer Vision. Lecture 3

Image Matching. AKA: Image registration, the correspondence problem, Tracking,

A Probabilistic Framework for Recognizing Similar Actions using Spatio-Temporal Features

CS 558: Computer Vision 4 th Set of Notes

Evaluation of local descriptors for action recognition in videos

Lecture: RANSAC and feature detectors

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

Action Recognition & Categories via Spatial-Temporal Features

Requirements for region detection

Harder case. Image matching. Even harder case. Harder still? by Diva Sian. by swashford

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

3D Vision. Viktor Larsson. Spring 2019

Discriminant Interest Points are Stable

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Automatic Image Alignment (feature-based)

Click to edit title style

Action Recognition using Randomised Ferns

Lecture 4.1 Feature descriptors. Trym Vegard Haavardsholm

Comparison of Local Feature Descriptors

Image matching. Announcements. Harder case. Even harder case. Project 1 Out today Help session at the end of class. by Diva Sian.

Lecture 6: Finding Features (part 1/2)

Wikipedia - Mysid

Feature Matching and Robust Fitting

Feature descriptors and matching

Today. Main questions 10/30/2008. Bag of words models. Last time: Local invariant features. Harris corner detector: rotation invariant detection

FESID: Finite Element Scale Invariant Detector

CS231A Section 6: Problem Set 3

Learning realistic human actions from movies

Key properties of local features

Evaluation of local spatio-temporal features for action recognition

2D Image Processing Feature Descriptors

Image Segmentation and Registration

Computer Vision I - Basics of Image Processing Part 2

Performance Evaluation of Scale-Interpolated Hessian-Laplace and Haar Descriptors for Feature Matching

Image Processing. Image Features

School of Computing University of Utah

Transcription:

Spatio-Temporal Salient Features Amir H. Shabani Vision and Image Processing Lab., University of Waterloo, ON CRV Tutorial day- May 30, 2010 Ottawa, Canada 1

Applications Automated surveillance for scene analysis, Elderly home monitoring for assisted living, Content-based video retrieval, Video analysis Human-computer interaction (HCI), 2

Application Human Action Recognition (Weizmann data set) 3

Motivations Conventional video analysis methods require tracking the objects or features and motion estimation. segmentation is hard in presence of non-stationary background or sensory. appearance/ motion model varies from one object to another. 4

Motivations (2) Conventional video analysis methods require tracking the objects or features and motion estimation. segmentation is hard in presence of non-stationary background or sensory. appearance/ motion model varies from one object to another. Q: is there any way to bypass these tasks or perform them implicitly? there are key frames and key places in the video that carry most information of what is happening in the video. The key space-time points correspond to the video events. Salient features characterize the video events. 5

Motivation (3): from image to video Spatial Salient Features Applications: Tracking, Object Recognition,... 6

Salient Feature Extraction Salient feature extraction consists of three steps: Video filtering at different spatio-temporal scales Key point detection Key point description using the characteristic of the point s surrounding volume. Key point detection: (1) Saliency map construction (2) non-max suppression (and thresholding) 7

(1) compute gradient distribution matrix (M) in a local neighbourhood of a point. (2) find points for which both curvature are significant. Corners are stable in arbitrary lighting condition. 8 ) (x, ) (x, ) (x, ) (x, ) ( 2 y 2 x 2 D D y x D y x D I D I I I I I I g M (M) k trace (M) C 2 det 2D Harris-Affine Corner Detector 2 2 1 2 1 ) ( k

Spatio-temporal temporal Harris Corners Q: Interest? high variation in both space and time. => Extend the Harris corner function into 3D spatio-temporal domain. Spatial and temporal Gaussian filtering in the computation of the autocorrelation matrix. 9

Spatio-temporal temporal Harris Corners Q: Interest? high variation in both space and time. => Extend the Harris corner function into 3D spatio-temporal domain. Spatial and temporal Gaussian filtering in the computation of the autocorrelation matrix. (1) Compute the Harris saliency map. (2) Points with maximum saliency in a local volume => 3D corners. 10

Spatio-temporal temporal Harris Corners Hand clapping Walking 11

Why Multi-scale Salient Features? Spatial and temporal scale events require salient features at different spatial and temporal scales. 12

Spatio-temporal temporal Hessian Blobs Extension of the Hessian blob detector into 3D spatio-temporal domain: Spatial and temporal Gaussian filtering in the computation of the Hessian matrix. (1) Compute the determinant of Hessian matrix as the saliency map. (2) Points with maximum saliency in a local volume => center of 3D blobs. 13

Cuboids Spatial Gaussian along with temporal Gabor filtering. (1) Compute the energy of the filter response (2) Points with maximum energy in a local volume => center of cuboids. 14

Other Spatio-temporal temporal key Points Entropy-based interest points Dense sampling interest points 3D Scale-Invariant Feature Transform Salient opponent-based motion features (our paper in CRV 2010)... 15

Spatio-temporal temporal Feature Description So far: For event detection in video, we utilize the salient features. Video filtering at different spatio-temporal scales Key point detection Key point description using the characteristic of the point s surrounding volume. 16

2 Dimensional Scale-Invariant Feature Transform Gaussians (F) Difference of Gaussian (DoG~LoG) 2D SIFT features Oriented gradients 17

3D SIFT Descriptor

Other Feature Descriptors 3D SIFT 3D HOG HOG + HOF Local jets,... Histogram of oriented spatial grad. (HOG) Histogram of optical flow (HOF)

Summary For video analysis we can study the events characterised by spatiotemporal salient features. Spatial and temporal scale events require salient features at different spatial and temporal scales. Salient feature extraction requires scale-space filtering, interest point detection, and feature description. 20

A Challenge: Time Causality Temporal Gaussian/Gabor filter requires both prior and posterior frames. Past Present Future Biological vision promotes causal filtering for the motion perception. Q: how we can address the time causality? A: come to our paper presentation on Wednesday 21

References Evaluation of local spatio-temporal features for action recognition, Wang et al., BMCV, 2009 Space time interest points, Laptev and Lindeberg, ICCV 2003. Behavior recognition via sparse spatio-temporal filters, Dollar et al., IEEE Workshop VS-PETS, 2005. An efficient dense and scale-invariant spatio-temporal interest point detector, Willems et al., ECCV 2008. A comparison of affine covariant region detectors, IJCV 2006, Mikolajczyk et al. 22

Thank you 23

Definition Interest point: Distinctive in a local region rich in local image structure, with its surrounding Repeatable : reproducible under different views Stable to noise, geometric/photometric deformation Has well-defined position in the image space 24

Covariant to Scale Change 25

Covariant to Scale Change 26

Some Requirements Scale change Geometric transformation Corner: significant changes in all directions Photometric transformation 27

Salient Features Scale-Invariant feature Transform (SIFT) Harris-affine corner detection Hessian-affine corner detection Edge-based Regions (EBR) Intensity-based Regions (IBR) Maximally Stable Extremal Regions (MSER) Entropy-based salient regions 28

Scale- Invariant Feature Transform (SIFT) Scale-space Theorem: a local (3D) maximum of NLoG in (x,y,σ) is a point that can be identified with different size in images ---> it is a scale-invariant keypoint. 29

Harris-Affine Corner Detector (1) compute gradient distribution matrix (M) in a local neighbourhood of a point. M 2 D g( ) I I I 2 x x (x, I y D ) (x, D ) I x I I y 2 y (x, (x, D D ) ) (2) find points for which both curvature are significant. C 2 det (M) k trace (M) 2 k ( 1 2 1 2) Corners are stable in arbitrary lighting condition. 30