Tracking. Hao Guan( 管皓 ) School of Computer Science Fudan University

Similar documents
Automatic Initialization of the TLD Object Tracker: Milestone Update

Struck: Structured Output Tracking with Kernels. Presented by Mike Liu, Yuhang Ming, and Jing Wang May 24, 2017

Robustifying the Flock of Trackers

Tracking-Learning-Detection

2 Cascade detection and tracking

IN computer vision develop mathematical techniques in

A System for Real-time Detection and Tracking of Vehicles from a Single Car-mounted Camera

REAL-TIME, LONG-TERM HAND TRACKING WITH UNSUPERVISED INITIALIZATION

Tri-modal Human Body Segmentation

Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach

Available online at ScienceDirect. Procedia Computer Science 54 (2015 )

P-N Learning: Bootstrapping Binary Classifiers by Structural Constraints

Evaluation and comparison of interest points/regions

Prof. Feng Liu. Spring /26/2017

Postprint.

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Selection of Scale-Invariant Parts for Object Class Recognition

Feature Based Registration - Image Alignment

Aircraft Tracking Based on KLT Feature Tracker and Image Modeling

HOG-based Pedestriant Detector Training

Augmented Reality VU. Computer Vision 3D Registration (2) Prof. Vincent Lepetit

Real-time Object Tracking via Online Discriminative Feature Selection

Part-based Face Recognition Using Near Infrared Images

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Part-based Face Recognition Using Near Infrared Images

OBJECT tracking has been extensively studied in computer

Online Spatial-temporal Data Fusion for Robust Adaptive Tracking

Detecting and Identifying Moving Objects in Real-Time

Robotics Programming Laboratory

Computer Vision. Exercise Session 10 Image Categorization

Keywords:- Object tracking, multiple instance learning, supervised learning, online boosting, ODFS tracker, classifier. IJSER

The Kinect Sensor. Luís Carriço FCUL 2014/15

Object Tracking using HOG and SVM

Learning to Recognize Faces in Realistic Conditions

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Babu Madhav Institute of Information Technology Years Integrated M.Sc.(IT)(Semester - 7)

Multiple-Person Tracking by Detection

Slides for Data Mining by I. H. Witten and E. Frank

Segmentation and Tracking of Partial Planar Templates

Multi-Object Tracking Based on Tracking-Learning-Detection Framework

Robert Collins CSE598G. Intro to Template Matching and the Lucas-Kanade Method

Towards Robust Visual Object Tracking: Proposal Selection & Occlusion Reasoning Yang HUA

PROBLEM FORMULATION AND RESEARCH METHODOLOGY

Pairwise Threshold for Gaussian Mixture Classification and its Application on Human Tracking Enhancement

Detection and Classification of Vehicles

A Novel Target Algorithm based on TLD Combining with SLBP

Object Detection by 3D Aspectlets and Occlusion Reasoning

Dynamic Shape Tracking via Region Matching

Pixel-Pair Features Selection for Vehicle Tracking

Weighted Multi-scale Local Binary Pattern Histograms for Face Recognition

Automatic Tracking of Moving Objects in Video for Surveillance Applications

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Category vs. instance recognition

ONLINE SELECTION OF DISCRIMINATIVE PIXEL-PAIR FEATURE FOR TRACKING

Local Image Features

Beyond Bags of Features

Non-Parametric Modeling

Announcements. CS 188: Artificial Intelligence Spring Generative vs. Discriminative. Classification: Feature Vectors. Project 4: due Friday.

Lecture 18: Human Motion Recognition

Machine Learning Techniques for Data Mining

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction

Describable Visual Attributes for Face Verification and Image Search

Face Detection and Recognition in an Image Sequence using Eigenedginess

A Novel Extreme Point Selection Algorithm in SIFT

Online Object Tracking with Proposal Selection. Yang Hua Karteek Alahari Cordelia Schmid Inria Grenoble Rhône-Alpes, France

Vehicle Tracking using Optical Features

Computer Vision with MATLAB

Computer Vision for HCI. Topics of This Lecture

Computer Vision with MATLAB MATLAB Expo 2012 Steve Kuznicki

Structured Light II. Thanks to Ronen Gvili, Szymon Rusinkiewicz and Maks Ovsjanikov

arxiv: v1 [cs.cv] 19 May 2018

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Chamfer matching. More on template matching. Distance transform example. Computing the distance transform. Shape based matching.

Learning based face hallucination techniques: A survey

Detection III: Analyzing and Debugging Detection Methods

Pedestrian Detection with Occlusion Handling

Patch-Based Image Classification Using Image Epitomes

Supervised texture detection in images

International Journal of Modern Engineering and Research Technology

Patch Descriptors. EE/CSE 576 Linda Shapiro

Object Category Detection: Sliding Windows

Utilizing Graphics Processing Units for Rapid Facial Recognition using Video Input

Motion Estimation for Video Coding Standards

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Generic Face Alignment Using an Improved Active Shape Model

Real Time Face Tracking By Using Naïve Bayes Classifier

INTELLIGENT AUTONOMOUS SYSTEMS LAB

Object Recognition with Invariant Features

Applications Video Surveillance (On-line or off-line)

Stereo and Epipolar geometry

A Real Time Facial Expression Classification System Using Local Binary Patterns

SE 263 R. Venkatesh Babu. Object Tracking. R. Venkatesh Babu

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Feature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

BOP: Benchmark for 6D Object Pose Estimation

Local features: detection and description. Local invariant features

Digital Image Restoration

Pedestrian Detection with Improved LBP and Hog Algorithm

Transcription:

Tracking Hao Guan( 管皓 ) School of Computer Science Fudan University 2014-09-29

Multimedia Video Audio

Use your eyes Video Tracking

Use your ears Audio Tracking

Tracking Video Tracking

Definition Given a bounding box defining the initial position of an object in a single frame, automatically determine the object s bounding box in the following frames or indicate the object is not visible.

Why important? An important Mid-level of a vision system

Why important? One of the most practical areas of CV

Why difficult? Illumination Occlusion

Why difficult? Pose variants Clutter

Scale variant Why difficult?

Fast Motion Why difficult?

Single camera Multiple camera Re -identification Categories

Categories Static camera moving camera

Categories Single Object Multiple Object

Visible Infrared Categories

Categories Rigid Object Non-rigid Object

Evaluation Center Location Error average Euclidean distance between the center of the tracked target and the ground truth in all the frames of one video.

Evaluation Success Rate The success rate is the radio of the frames whose scores are larger than a given threshold.

The State-of-the-art trackers Tracking by detection is becoming popular. This stems directly from the development of powerful discriminative methods in machine learning and their application to detection with offline training. The discriminative trackers try to differentiate the target from the background by taking tracking as a binary classification problem.

Real-Time Compressive Tracking(CT) Core idea Facilitate an efficient project from the image feature space to a low-dimensional compressed space. Theoretical basis Compressive sensing theories A small number of randomly generated linear measurements can preserve most of the salient information and almost perfect reconstruct the signal

Dimension reduction Feature Extraction

Updating the classifier at the t-th frame Positive and negative samples are used to train a Naïve Bayes Classifier

Tracking at the (t+1)-th frame The sample which has the highest score will be the tracked position.

Tracking at the (t+1)-th frame

Experiments

Experiments

Experiments

Tracking-Learning-Detection(TLD) Core idea Combination of motion tracking and object detection. Focused on long-term tracking. Use detector to relocate the tracker.

Framework Overview

Tracking Module of TLD Frame difference Background subtraction Optical flow

Optical Flow A classical, common, successful method

Optical Flow A classical, common, successful method

Median Flow for TLD Assumption A good tracker should have forward-backward consistency.

Median Flow for TLD Features can be simple Handle scale variants Simple and efficient Sensitive to illumination variants Lack of self-learning and update

Median Flow for TLD

Median Flow for TLD

Median Flow for TLD

Detection Module of TLD

Variance Filter Remove patches that are smooth. Effective for the images that have smooth background

Variance Filter When background is more complicated, the effect will get worse.

Variance Filter When object is similar to the background, the effect will get worse.

Random Ferns Classifier The core part of the detector of TLD. Different from Random Forests same criterion for every layer, become linear

Feature Extraction LBP features Select two points A and B randomly from one patch, compare their intensity, if I(A) > I(B), then the feature value is 1, else is 0. Example: one patch passes a fern with 5 nodes.

Random Ferns Classifier If one fern has s nodes, then there will be 1+2 s feature values The samples of one class pass the fern, we can get the histogram of the prior probability.

Random Ferns Classifier Samples of different classes pass the fern, we can get corresponding histograms.

Random Ferns Classifier When a new patch passes the fern, if its feature is 00011( 3 ) for example, then find the max posterior probability from the given distribution.

Random Ferns Classifier We usually use a few ferns to form a random ferns classifier. Each fern has a vote.

Random Ferns Classifier In TLD, 13 nodes each fern, 10 ferns.

Nearest Neighborhood Compute the similarity with object models. Model

Integration

Conclusions & Future Directions There is not a perfect tracker. Select the most suitable one according to the application. New discriminative features. Dynamic and motion analysis.

Conclusions & Future Directions Depth information from multi-views. Re-identification. Integration of Video & Audio tracking

References Kaihua Zhang, Lei Zhang, Ming-Hsuan Yang. 20 Real-Time Compressive Tracking. ECCV, 2012. Zdenek Kalal, Krystian Mikolajczyk, Jiri Matas. Tracking-Learning-Detection. In PAMI, 2010. Kalal Z, Matas J, Mikolajczyk K. P-N learning: Bootstrapping binary classifiers by structural constraints. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010.