Detecting and Tracking Moving Objects for Video Surveillance. Isaac Cohen and Gerard Medioni University of Southern California

Similar documents
Event Detection and Analysis from Video Streams

Representing Moving Images with Layers. J. Y. Wang and E. H. Adelson MIT Media Lab

Motion Tracking and Event Understanding in Video Sequences

Outline. Data Association Scenarios. Data Association Scenarios. Data Association Scenarios

Dense Image-based Motion Estimation Algorithms & Optical Flow

Global Flow Estimation. Lecture 9

EE795: Computer Vision and Intelligent Systems

Map-Enhanced UAV Image Sequence Registration and Synchronization of Multiple Image Sequences

EE795: Computer Vision and Intelligent Systems

Motion and Target Tracking (Overview) Suya You. Integrated Media Systems Center Computer Science Department University of Southern California

Optical Flow-Based Motion Estimation. Thanks to Steve Seitz, Simon Baker, Takeo Kanade, and anyone else who helped develop these slides.

2 Algorithm Description Active contours are initialized using the output of the SUSAN edge detector [10]. Edge runs that contain a reasonable number (

Mixture Models and EM

Vehicle and Person Tracking in UAV Videos

Global Flow Estimation. Lecture 9

EE795: Computer Vision and Intelligent Systems

Motion in 2D image sequences

Lecture 28 Intro to Tracking

Recall: Blob Merge/Split Lecture 28

Marcel Worring Intelligent Sensory Information Systems

Review for the Final

Motion Estimation. There are three main types (or applications) of motion estimation:

VEHICLE QUEUE DETECTION IN SATELLITE IMAGES OF URBAN AREAS

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Classification of Subject Motion for Improved Reconstruction of Dynamic Magnetic Resonance Imaging

Efficient Acquisition of Human Existence Priors from Motion Trajectories

Performance study on point target detection using super-resolution reconstruction

University of Florida CISE department Gator Engineering. Clustering Part 2

Aircraft Tracking Based on KLT Feature Tracker and Image Modeling

Motion. 1 Introduction. 2 Optical Flow. Sohaib A Khan. 2.1 Brightness Constancy Equation

Leow Wee Kheng CS4243 Computer Vision and Pattern Recognition. Motion Tracking. CS4243 Motion Tracking 1

CSE 527: Introduction to Computer Vision

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Visual Tracking (1) Feature Point Tracking and Block Matching

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Motion-based obstacle detection and tracking for car driving assistance

16720 Computer Vision: Homework 3 Template Tracking and Layered Motion.

Feature Matching and Robust Fitting

BCC Optical Stabilizer Filter

CS 664 Segmentation. Daniel Huttenlocher

Prof. Fanny Ficuciello Robotics for Bioengineering Visual Servoing

Presented at the FIG Congress 2018, May 6-11, 2018 in Istanbul, Turkey

Continuous Multi-Views Tracking using Tensor Voting

Bus Detection and recognition for visually impaired people

CS 664 Structure and Motion. Daniel Huttenlocher

CS664 Lecture #18: Motion

CHAPTER 5 MOTION DETECTION AND ANALYSIS

CS-465 Computer Vision

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Hierarchical Clustering 4/5/17

Clustering. Informal goal. General types of clustering. Applications: Clustering in information search and analysis. Example applications in search

Optical flow and tracking

CV: 3D to 2D mathematics. Perspective transformation; camera calibration; stereo computation; and more

Autonomous Navigation for Flying Robots

Journal of Industrial Engineering Research

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University

Displacement estimation

Using 3D Scene Structure to Improve Tracking

Flexible Visual Inspection. IAS-13 Industrial Forum Horizon 2020 Dr. Eng. Stefano Tonello - CEO

Two-Frames Accurate Motion Segmentation Using Tensor Voting and Graph-Cuts

Traffic Surveillance using Aerial Video

Detection of Moving Objects in Colour based and Graph s axis Change method

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

CSE 158. Web Mining and Recommender Systems. Midterm recap

A GPU-based implementation of Motion Detection from a Moving Platform

Fitting D.A. Forsyth, CS 543

AUTOMATIC OBJECT DETECTION IN VIDEO SEQUENCES WITH CAMERA IN MOTION. Ninad Thakoor, Jean Gao and Huamei Chen

DT-Binarize: A Hybrid Binarization Method using Decision Tree for Protein Crystallization Images

CS 4495 Computer Vision Motion and Optic Flow

INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

Lecture: k-means & mean-shift clustering

Lecture: k-means & mean-shift clustering

Locally Weighted Learning for Control. Alexander Skoglund Machine Learning Course AASS, June 2005

Idle Object Detection in Video for Banking ATM Applications

CSE 5243 INTRO. TO DATA MINING

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

Center for Automation Research, University of Maryland. The independence measure is the residual normal

Laser sensors. Transmitter. Receiver. Basilio Bona ROBOTICA 03CFIOR

Moving Object Segmentation using Scene Understanding

Surface Registration. Gianpaolo Palma

Spatio temporal Segmentation using Laserscanner and Video Sequences

Chapter 3 Image Registration. Chapter 3 Image Registration

k-means A classical clustering algorithm

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

Information Retrieval and Web Search Engines

SURVEILLANCE cameras have been widely deployed to

Unsupervised Learning

LOCAL-GLOBAL OPTICAL FLOW FOR IMAGE REGISTRATION

Automated Video Analysis of Crowd Behavior

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York

Tracking-Learning-Detection

Unsupervised Camera Motion Estimation and Moving Object Detection in Videos

Computer Vision I - Filtering and Feature detection

VIDEO STABILIZATION WITH L1-L2 OPTIMIZATION. Hui Qu, Li Song

Registration: Rigid vs. Deformable

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Gesture based PTZ camera control

Transcription:

Detecting and Tracking Moving Objects for Video Surveillance Isaac Cohen and Gerard Medioni University of Southern California

Their application sounds familiar. Video surveillance Sensors with pan-tilt and zoom Sensors mounted on moving airborne platforms Requirement for detection and tracking of moving objects and the relationship of their trajectories Requirement for high-level description of the whole video sequence

Approach First find optical flow and perform motion compensation to stabilize. Find large numbers of moving regions. Use the residual flow field and its normal component to detect errors. Define a attributed graph whose nodes are detected regions and edges are possible matches between two regions detected in two different frames. Use the graph as a dynamic template for tracking moving objects.

Their Idea Detection after stabilization doesn t work well. So integrate the detection into the stabilization algorithm by locating regions of the image where a residual motion occurs using the normal component of the optical flow field.

What is Normal Flow? The optical flow equation constrains the image velocity in the direction of the local image gradient, but not the tangential velocity. Normal flow corresponds to the image velocity along the image gradient. It is computed from both image gradients and temporal gradients of the stabilized sequence. The amplitude is large near moving regions. The amplitude is near zero near stationary regions.

Computation of Normal Flow Let τ ij denote the warping of image to reference frame j. The stabilized image sequence is defined by I i (τ ij ). Given reference image I 0 and target I 1, image stabilization consists of registering the two images and computing the geometric transform τ that warps I 1 so it aligns with I 0.

Parameter Estimation of the Geometric Transform Minimize the least squares equation Detect and remove outliers through an iterative process.

Definition of Normal Flow The warping function is integrated into the formula so that the image gradients are computed on the original image grids and not the warped ones. This simplifies the computation and allows for a more accurate estimation of the residual normal flow.

Finding Moving Objects Given a pair of image frames, find the moving objects by thresholding the normal flow. Does this overcome problems of other approaches?

Graph Representation of Moving Objects Nodes are the moving regions. Edges are the relationships between 2 moving regions detected in 2 separate frames. Each new frame generates a set of regions corresponding to the moving objects. We want to know which moving objects in the new frame are the same as those in the old.

Why is this Difficult? Little information about the objects is known. Objects tend to be of small size in aerial imagery. Large changes in object size are possible.

Sample Detection

Detection Videos seq1.avi seq1_det.avi seq6.avi seq6_det.avi

Example Graph: What s going on?

Attributes of a Node

Keeping Track of Moving Regions Among the detected regions, some small ones should be merged into a larger one. They cluster the detected regions in the graph, instead of using single images. Use a median shape template to keep track of the different moving regions.

Moving objects that don t appear in some frames can be hypothesized. hypothesized

Extraction of Object Trajectories The graph representation changes as new frames are acquired and processed. The goal is to find the full trajectory of each moving object. But we don t know where each one starts or where it ends. So we have to consider each node with no predecessor a possible start and each with no successor a possible end.

Optimal Path Assign each edge of the graph a cost, which is the similarity between the connected nodes. i c ij = C ij / (1 + d ij2 ) j C ij is the gray-level and shape correlation between i and j. d ij is the distance between their centroids.

Optimal Path This formulation does not lead to the optimal solution. Instead they define the length l j of node j as the maximal length of the path starting at that node. Then the modified cost function C ij =l j c ij is used to find the optimal paths from each node without predecessor, using a greedy search method.

Quantitative Evaluation TP = true positives of moving objects FP = false positives of moving objects FN = false negatives (not detected) Metrics: DR = TP / (TP + FN) Detection Ratio FAR = FP / (TP + FP) False Alarm Ratio

Evaluation Results on 5 Shots

Tracking Demos seq1_mos_track.avi seq6_track.avi

Questions/Comments Have they solved our problem? Has anyone done better? No one seems to use the metrics. But a 2005 paper by Nicolescu and Medioni compares results of 4 methods on fake sequences (and beats them, of course) Another recent paper by Xiao and Shah says they compared their results to those of Ke and Kanade, Wang and Adelson, and Ayer and Sawhney, but no numbers are given. Can we beat these people?