Fast trajectory matching using small binary images

Similar documents
An Approach for Real Time Moving Object Extraction based on Edge Region Determination

An ICA based Approach for Complex Color Scene Text Binarization

Graph Matching Iris Image Blocks with Local Binary Pattern

Photometric Stereo with Auto-Radiometric Calibration

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Robust Shape Retrieval Using Maximum Likelihood Theory

Efficient ERP Distance Measure for Searching of Similar Time Series Trajectories

Factorization with Missing and Noisy Data

Research on Evaluation Method of Video Stabilization

A Novel Extreme Point Selection Algorithm in SIFT

An Adaptive Threshold LBP Algorithm for Face Recognition

arxiv: v1 [cs.cv] 28 Sep 2018

An Introduction to Content Based Image Retrieval

String distance for automatic image classification

ROBUST LINE-BASED CALIBRATION OF LENS DISTORTION FROM A SINGLE VIEW

Dynamic Obstacle Detection Based on Background Compensation in Robot s Movement Space

A Fast Personal Palm print Authentication based on 3D-Multi Wavelet Transformation

Human Motion Detection and Tracking for Video Surveillance

CS 543: Final Project Report Texture Classification using 2-D Noncausal HMMs

A METHOD FOR CONTENT-BASED SEARCHING OF 3D MODEL DATABASES

An efficient face recognition algorithm based on multi-kernel regularization learning

Random spatial sampling and majority voting based image thresholding

FACIAL MOVEMENT BASED PERSON AUTHENTICATION

A NEW METHOD FOR FINDING SIMILAR PATTERNS IN MOVING BODIES

Video Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin

Affine-invariant shape matching and recognition under partial occlusion

Fundamental Matrices from Moving Objects Using Line Motion Barcodes

Package SimilarityMeasures

A Real-time Detection for Traffic Surveillance Video Shaking

Object detection using non-redundant local Binary Patterns

ADVANCED IMAGE PROCESSING METHODS FOR ULTRASONIC NDE RESEARCH C. H. Chen, University of Massachusetts Dartmouth, N.

Image retrieval based on region shape similarity

Accelerometer Gesture Recognition

Linear Discriminant Analysis for 3D Face Recognition System

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

A Modified Mean Shift Algorithm for Visual Object Tracking

WATERMARKING FOR LIGHT FIELD RENDERING 1

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

CHAPTER 5 MOTION DETECTION AND ANALYSIS

arxiv: v1 [cs.cv] 2 May 2016

1. INTRODUCTION. AMS Subject Classification. 68U10 Image Processing

Adaptive Zoom Distance Measuring System of Camera Based on the Ranging of Binocular Vision

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis

An Integrated Face Recognition Algorithm Based on Wavelet Subspace

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

Recovering light directions and camera poses from a single sphere.

Multi-Camera Occlusion and Sudden-Appearance-Change Detection Using Hidden Markovian Chains

Local Image Registration: An Adaptive Filtering Framework

FACE RECOGNITION USING SUPPORT VECTOR MACHINES

Feature-level Fusion for Effective Palmprint Authentication

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Detecting motion by means of 2D and 3D information

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

Component-based Face Recognition with 3D Morphable Models

A Comparison of Still-Image Compression Standards Using Different Image Quality Metrics and Proposed Methods for Improving Lossy Image Quality

Robot localization method based on visual features and their geometric relationship

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

A New Feature Local Binary Patterns (FLBP) Method

2 Proposed Methodology

Fingerprint Recognition using Texture Features

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

Time Series Clustering Ensemble Algorithm Based on Locality Preserving Projection

Chapter 7. Conclusions and Future Work

Clustering Spatio-Temporal Patterns using Levelwise Search

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

Toward Part-based Document Image Decoding

Research on Recognition and Classification of Moving Objects in Mixed Traffic Based on Video Detection

Real-time Vehicle Matching for Multi-camera Tunnel Surveillance

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds

Visual Hulls from Single Uncalibrated Snapshots Using Two Planar Mirrors

Multi-Camera Calibration, Object Tracking and Query Generation

A Factorization Method for Structure from Planar Motion

A Graph Theoretic Approach to Image Database Retrieval

Sign Language Recognition using Dynamic Time Warping and Hand Shape Distance Based on Histogram of Oriented Gradient Features

An Adaptive Eigenshape Model

Estimating Human Pose in Images. Navraj Singh December 11, 2009

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Finger Vein Biometric Approach for Personal Identification Using IRT Feature and Gabor Filter Implementation

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

Accelerating Pattern Matching or HowMuchCanYouSlide?

Short Survey on Static Hand Gesture Recognition

Fabric Image Retrieval Using Combined Feature Set and SVM

Research and application of volleyball target tracking algorithm based on surf corner detection

CS 231A Computer Vision (Fall 2012) Problem Set 3

Notes 9: Optical Flow

A CURVELET-BASED DISTANCE MEASURE FOR SEISMIC IMAGES. Yazeed Alaudah and Ghassan AlRegib

VARIABLE RATE STEGANOGRAPHY IN DIGITAL IMAGES USING TWO, THREE AND FOUR NEIGHBOR PIXELS

Video shot segmentation using late fusion technique

Water-Filling: A Novel Way for Image Structural Feature Extraction

An Image Based Approach to Compute Object Distance

Final Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm

FACE DETECTION AND LOCALIZATION USING DATASET OF TINY IMAGES

High Capacity Reversible Watermarking Scheme for 2D Vector Maps

IJSER. Real Time Object Visual Inspection Based On Template Matching Using FPGA

Performance Evaluation Metrics and Statistics for Positional Tracker Evaluation

Gait analysis for person recognition using principal component analysis and support vector machines

Defining a Better Vehicle Trajectory With GMM

Texture Analysis using Homomorphic Based Completed Local Binary Pattern

Transcription:

Title Fast trajectory matching using small binary images Author(s) Zhuo, W; Schnieders, D; Wong, KKY Citation The 3rd International Conference on Multimedia Technology (ICMT 2013), Guangzhou, China, 29 November-1 December 2013. Issued Date 2013 URL http://hdl.handle.net/10722/189619 Rights This work is licensed under a Creative Commons Attribution- NonCommercial-NoDerivatives 4.0 International License.

Fast Trajectory Matching Using Small Binary Images Wei Zhuo 1,2, Dirk Schnieders 1, and Kwan-Yee K. Wong 1 1 Department of Computer Science, The University of Hong Kong, Hong Kong 2 School of Electric and Information Engineering, Xian Jiaotong University, China Abstract. This paper proposes a new trajectory matching method using logic operations on binary images. By using small binary images we are able to effectively utilize the large word size offered in modern CPU architectures, resulting in a very efficient evaluation of similarities between trajectories. The efficiency is caused by the fact that all bits in the same word are processed in parallel. Representing trajectories as small binary images has other advantages, such as a low space requirement and good noise resistance. The proposed method is evaluated on a publicly available dataset, and is compared to the more sophisticated Longest Common Subsequence (LCSS) method. In addition, synthetic experiments show the good efficiency and accuracy of the proposed method, enabling real time trajectory retrieval on databases with millions of trajectories. Keywords: Trajectory Matching, Video Surveillance, Image Processing 1 Introduction Governments and private institutions capture and store large quantities of video surveillance footage for later retrieval. Video surveillance has received great attention in past decades and robust object segmentation and tracking methods are available. The motion path that an object takes in time, i.e. its trajectory, can be accurately determined from video surveillance footage. Trajectories provide a rich source of information and can be used to classify an object s behavior and discover objects that followed a certain motion characteristic or moved in a similar way. For instance, by comparing trajectories of newly tracked objects with those trajectories previously obtained, abnormal behavior can be detected. In these types of applications it is crucial to query large quantities of tracking data both quickly and accurately. In this paper, we propose a novel trajectory matching method using logic operations on binary images. Instead of using raw object trajectory data, i.e. a sequence of image points, we approximate trajectories with small binary images. By doing so we are able to save significant cost in both time and space at the expense of negligible loss in accuracy. The cost saving in time is achieved because binary images allow us to design an effective cost function for comparison of trajectories which effectively utilizes a large word size in modern hardware

2 Fast Trajectory Matching architectures. On current hardware architectures, 64 pixels in the binary images are compared to a query binary image in parallel. As a result, we are able to quickly estimate similarities between trajectories. The cost saving in space follows from the fact that the raw trajectory representation typically requires significantly more space compared to our small binary image representation. The rest of this paper is organized as follows. Sect. 2 reviews the existing literature on trajectory matching and discusses the differences between existing work nd the proposed method. In Sect. 3, we introduce the trajectory representation and in Sect. 4 we propose the similarity measure. In Sect. 5 we present experimental results on synthetic and real data, followed by conclusions in Sect. 6. 2 Literature review The most widespread prior trajectory representations include chain codes, piecewise linear approximations, polynomials and splines. Chain code [1] is a simple representation of trajectories, consisting of piecewise linear segments encoded with 8 basic directions. Other representations, like splines and polynomial interpolation, are more sophisticated, however often require a more complex computation and are therefore less efficient. Others attempt to represent motion trajectory in a feature space. For example, Bashir et al. [2] segmented each trajectory into multiple atomic sub-trajectories and characterized each sub-trajectory by its principle component. Zhou et al. [3] extracted features using tensor subspace analysis and then performed standard mean shift to cluster the trajectory features. Chen et al. [4] introduced null-space representation which is invariant to viewpoints. Chen and Chang [5] proposed a wavelet-based approach that decomposed a trajectory into sub-trajectories which were described by a feature vector that includes the acceleration, initial velocity, arc length, order and multi scale edge points. In contrast to the existing work above, we represent a trajectory as a small binary image. To retrieve similar trajectories, a proper matching technique and similarity metric should be developed. The similarity metric will depend on the trajectory representation. Several types of trajectory matching methods have been proposed in the literature, and we will briefly discuss the following: Dynamic Time Warping (DTW), Longest Common Subsequence (LCSS), vector matching and statistical matching. DTW [6] is a dynamic matching technique that uses dynamic programming and has the advantage of being conceptually very simple, but unfortunately is quite sensitive to noise. LCSS [7] has been shown to achieve a very high accuracy and we will use it for experimental comparison in this paper. Vector matching techniques perform matching using a similarity metric. Commonly used distances include Euler-distance, city-distance and Minkowski-distance. Other methods, such as [8] and [9], employ statistical matching techniques to find appropriate trajectories. The existing methods in the literature achieve quite remarkable trajectory matching accuracy and are suitable to obtain similar (scale, translation and rotation invariance) trajectories on small databases. However with the increasing

Fast Trajectory Matching 3 amount of tracking information, comparing millions of database items quickly is very desirable. To achieve such a high performance we assume scale, translation and rotation variance in our approach, which is of particular interest in video surveillance where the camera is stationary. A strong robustness to noise is achieved by utilizing relatively small binary images. 3 Trajectory Representation Let a raw trajectory be given as a sequence of absolute 2D points in an image of size m n Γ = [ x 0 y 0 ], [ x 1 y 1 ],..., [ x e y e ]. (1) We represent this trajectory as an 8-adjacent path in a small m n binary image I 0 I 1... I n 1 I n I n+1... I 2n 1 I =.... (2) I (m 1)n I (m 1)n+1... I mn 1 To consistently transfer a trajectory from the modality in Eq. (1) to our modality in Eq. (2), we assume a constant aspect ratio, i.e. m/n = m /n. In our approach, hardware bit-level parallelism is effectively exploited by representing I as a bit vector T = [ w 0 w 1... w k ], (3) which consists of k words of length w given by w i = [ I wi... I w(i+1) 1 ] 0 < i < k. (4) To avoid internal fragmentation we let mn be a multiple of w. This representation is motivated by the fact that on modern hardware w is usually quite large, i.e. on 64-bit architectures w = 64, resulting in the following key benefits of our approach: 1. a relative small number of words per trajectory, and 2. an efficient computation by processing w bits in parallel. 4 Trajectory Matching A simple approach for matching a query trajectory T to a target trajectory T is to maximize the overlapping trajectory segments. This can be achieved by formulating a similarity measure s(t, T ) = T T (5) = w 0 w 0 +... + w k w k,

4 Fast Trajectory Matching where w represents the population count operation, i.e. the number of set bits in the word w, and represents the bitwise AND operator. This equation is motivated by the fact that both bitwise AND and population count are available on modern microprocessors enabling us to evaluate Eq. (5) quickly. We find the trajectory that maximizes this similarity measure as ˆT = max s(t, T ). (6) T Unfortunately, the dissimilarity measure in Eq. (5) is not optimal because it will only measure the length of overlapping trajectory segments but it completely ignores non-overlapping trajectory segments. Alternatively, we may formulate a dissimilarity function where d(t, T ) = T T (7) = w 0 w 0 +... + w k w k, w i w i = (w i w i) (w i w i) (8) represents bitwise exclusive disjunction. Note that Eq. (7) measures how long the non-overlapping trajectory segments are, i.e. it measures the absolute length of those paths which appear in T or T but not in both. This measurement is also known as the Hamming distance and determines the minimum number of bit substitutions required to transform T to T or vice versa. To find the best matched trajectory we minimize this dissimilarity function ˆT = min d(t, T ). (9) T Unfortunately, the similarity measure in Eq. (7) is also not optimal because it measures the absolute number of bit substitutions required. It does not directly take into account overlapping path segments and two paths that are nonoverlapping may achieve a lower dissimilarity score than two paths that are partially overlapping. To overcome this problem we propose to indirectly take the overlapping path segments into account. Note that the dissimilarity function in Eq. (7) satisfies the following inequality 0 d(t, T ) T T, (10) where represents bitwise OR. In this paper, we propose to optimize for the largest ratio between non-overlapping and all paths and find the best matched trajectory as follows d(t, T ˆT ) = max T T T. (11)

Fast Trajectory Matching 5 240x320 96x128 48x64 24x32 12x16 (a) operations / s x 10 6 12 10 8 6 4 2 Eq. (7) Eq. (11) number of words 200 150 100 50 12x16 24x32 48x64 96x128 image size (b) 0 12x16 24x32 48x64 96x128 image size (c) Fig. 1. Efficiency of the proposed method (a) Raw trajectory rendered on image obtained from a surveillance camera [10] and four binary images used in the proposed method (plotted to scale). (b) Theoretical efficiency of Eq. (7) and Eq. (11) using the binary images shown in (a) as the query. (c) Number of words used per binary image against image size. 5 Experimental Evaluation 5.1 Synthetic Data We first measure the efficiency of the proposed method by measuring the number of times the matching function can be executed per second. This theoretical efficiency is expected to be very high due to the simplicity of the matching function and the availability of hardware implementations. We have implemented Eq. (5), Eq. (7), Eq. (11) and measured their efficiency on an Intel i7 CPU system that natively supported population count. We measured the performance in number of evaluations per second and plot results for different image sizes in Fig. 1. Since we found the performance of Eq. (5) and Eq. (7) to be equivalent we have only plotted the performance of Eq. (7). As expected, the efficiency of Eq. (11) is slightly less than Eq. (7) due to the additional operations performed. Nevertheless, it can be seen that Eq. (11) can be evaluated around 10 million times per second for small binary images of size 12 16. This large number follows from the fact that a binary image of size 12 16 just required k = 3 words. In a second experiment we synthetically generated 30 original trajectories and generated 50 noisy variations for each of these trajectories. To generate noisy trajectory from a given raw trajectory, we first add uniformly distributed noise to the points of the trajectory. The magnitude of this noise was 20% of the standard deviation of all the trajectory points. A bezier-spline was subsequently fitted to these noisy trajectory points. Fig. 2 shows an example of an original trajectory and a synthetically generated noisy trajectory. The generated binary

y 6 Fast Trajectory Matching 250 Original trajectory Generated noisy trajectory 200 150 100 50 180 230 280 x (a) 12 16 24 32 48 64 96 128 (b) 12 16 24 32 48 64 96 128 (c) Fig. 2. Noisy trajectory generated for the synthetic experiment (a) An example of an original and noisy raw trajectory from the synthetic experiment. (b) The original trajectory represented as binary images. (c) The noisy trajectory represented as binary images. 12 16 24 32 48 64 96 128 Eq. (5) 87% 91% 92% 89% Eq. (7) 88% 86% 67% 41% Eq. (11) 94% 97% 96% 95% (a) 12 16 24 32 48 64 96 128 Eq. (5) 89% 95% 99% 98% Eq. (7) 98% 97% 82% 54% Eq. (11) 98% 100% 100% 100% (b) Fig. 3. Accuracy of the proposed method (a) Percentage of top 50 noisy retrievals that belong to the same set as the query trajectory. (b) Percentage of top 10 noisy retrievals that belong to the same set as the query trajectory.

Fast Trajectory Matching 7 images are also shown for reference. There are a total of 1500 noisy trajectories that are clustered into 30 groups. We randomly selected a trajectory from the set of original trajectories and query the database to select the top 50 retrievals based on Eq. (5), Eq. (7) or Eq. (11). To test the accuracy of the proposed method, we determined how many retrievals belong to the same cluster as the query trajectory. We repeated this experiment for different sizes of the binary images. The results are shown in Fig. 3. The experiment has also been repeated using just the top 10 retrievals. It can be seen that reasonable good results can be obtained with a very small 12 16 binary image size and that results obtain by Eq. (11) are consistently better than those obtained by Eq. (7) and Eq. (5). This follows from the fact that neither Eq. (7) nor Eq. (5) perform normalization and only compare absolute similarity and dissimilarity respectively. We conclude that a small binary image of size 12 16 results in high accuracy and quick computation. 5.2 Real Data We evaluated the proposed method on 3206 public available trajectories from the database reported in [10]. Given the high efficiency and accuracy, we generated small binary images of size 12 16 for all the trajectories in the database. Fig. 4 shows the top 10 retrieval results using Eq. (7), Eq. (5) and Eq. (11) for a specific query. To query the whole database took just 0.5 milliseconds when using Eq. (11), whereas LCSS required several seconds. It can be seen that the results obtained by Eq. (7) and Eq. (11) are good and the latter performs superior because overlapping as well as non-overlapping trajectories are considered. Results retrieved by Eq. (5) are not as good because just overlapping trajectory segments are considered. Similarly, LCSS focuses on finding the longest common subsequence and does not always return desirable results. 6 Conclusions In this paper, we presented a novel trajectory matching method using logic operations on small binary images. By approximating the trajectories using this novel representation, we were able to save significant cost in both time and space at the expense of retrieval accuracy that is negligible in many applications. References 1. Kaneko, T., Okudaira, M.: Encoding of arbitrary curves based on the chain code representation. Transactions on Communications 33 (1985) 697 707 2. Bashir, F., Khokhar, A., Schonfeld, D.: Segmented trajectory based indexing and retrieval of video data. In: ICIP. (2003) 3. Zhou, H., Tao, D., Yuan, Y., Li, X.: Object trajectory clustering via tensor analysis. In: ICIP. (2009)

8 Fast Trajectory Matching (a) Rank Eq. (5) Eq. (7) Eq. (11) LCSS[7] 1 2 3 4 5 6 7 8 9 10 (b) Fig. 4. Experimental results on real data: Retrieval results obtained on a query trajectory (a), and top 10 retrieval results using binary images of size 12 16 and Eq. (5), Eq. (7), Eq. (11) and LCSS[7] (b).

Fast Trajectory Matching 9 4. Chen, X., Schonfeld, D., Khokhar, A.: Robust null space representation and sampling for view-invariant motion trajectory analysis. In: CVPR. (2008) 5. Chen, W., Chang, S.: Motion trajectory matching of video objects. In: Electronic Imaging, International Society for Optics and Photonics (1999) 544 553 6. Hu, W., Tan, T., Wang, L., Maybank, S.: A survey on visual surveillance of object motion and behaviors. IEEE Transactions on Systems, Man, and Cybernetics 34 (2004) 334 352 7. Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: International Conference on Data Engineering. (2002) 8. Johnson, N., Hogg, D.: Learning the distribution of object trajectories for event recognition. Image and Vision Computing 14 (1996) 583 592 9. Porikli, F.: Trajectory distance metric using hidden markov model based representation. In: ECCV Workshops. (2004) 10. Basharat, A., Gritai, A., Shah, M.: Learning object motion patterns for anomaly detection and improved object detection. In: CVPR. (2008)