Development in Object Detection. Junyuan Lin May 4th

Similar documents
Object Detection with Discriminatively Trained Part Based Models

A Discriminatively Trained, Multiscale, Deformable Part Model

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Pedestrian Detection Using Structured SVM

Category-level localization

Find that! Visual Object Detection Primer

Modern Object Detection. Most slides from Ali Farhadi

Deformable Part Models


Part-Based Models for Object Class Recognition Part 3

Deformable Part Models

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Category vs. instance recognition

Object Category Detection. Slides mostly from Derek Hoiem

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

Object Recognition with Deformable Models

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

A Discriminatively Trained, Multiscale, Deformable Part Model

Object Recognition II

Detection III: Analyzing and Debugging Detection Methods

Combining ROI-base and Superpixel Segmentation for Pedestrian Detection Ji Ma1,2, a, Jingjiao Li1, Zhenni Li1 and Li Ma2

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Object Category Detection: Sliding Windows

Part 5: Structured Support Vector Machines

Object Detection Design challenges

DPM Score Regressor for Detecting Occluded Humans from Depth Images

Self-Paced Learning for Semisupervised Image Classification

Spatial Localization and Detection. Lecture 8-1

Multiple-Person Tracking by Detection

Pedestrian Detection and Tracking in Images and Videos

Recap Image Classification with Bags of Local Features

Learning to Localize Objects with Structured Output Regression

Structured Models in. Dan Huttenlocher. June 2010

Support Vector Machine Learning for Interdependent and Structured Output Spaces

Every Picture Tells a Story: Generating Sentences from Images

Object Category Detection: Sliding Windows

Part based models for recognition. Kristen Grauman

Window based detectors

Part 5: Structured Support Vector Machines

Action recognition in videos

Learning to Localize Objects with Structured Output Regression

Selective Search for Object Recognition

Recognizing Human Actions from Still Images with Latent Poses

Object Detection by 3D Aspectlets and Occlusion Reasoning

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Modeling 3D viewpoint for part-based object recognition of rigid objects

Structured output regression for detection with partial truncation

Graphical Models for Computer Vision

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Linear combinations of simple classifiers for the PASCAL challenge

Rich feature hierarchies for accurate object detection and semantic segmentation

PARTIALLY OCCLUDED OBJECT DETECTION. Di Sun. B.S. in Electrical Engineering, Beijing Jiaotong University, Beijing, China, 2014

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds

Optimized Pedestrian Detection for Multiple and Occluded People

A novel template matching method for human detection

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

Discriminative models for multi-class object layout

SVMs for Structured Output. Andrea Vedaldi

HOG-based Pedestriant Detector Training

Structured Learning. Jun Zhu

Multiresolution models for object detection

Beyond Actions: Discriminative Models for Contextual Group Activities

18 October, 2013 MVA ENS Cachan. Lecture 6: Introduction to graphical models Iasonas Kokkinos

Coupling-and-Decoupling: A Hierarchical Model for Occlusion-Free Car Detection

Templates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser

Object Detection. Computer Vision Yuliang Zou, Virginia Tech. Many slides from D. Hoiem, J. Hays, J. Johnson, R. Girshick

Object recognition. Methods for classification and image representation

Visual Recognition: Examples of Graphical Models

Feature Sets and Dimensionality Reduction for Visual Object Detection

A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM

Semantic Pooling for Image Categorization using Multiple Kernel Learning

Lecture 15: Detecting Objects by Parts

Pedestrian Detection with Occlusion Handling

Histograms of Oriented Gradients for Human Detection p. 1/1

Bilinear classifiers for visual recognition

Detection and Localization with Multi-scale Models

arxiv: v2 [cs.cv] 25 Aug 2014

DPM Configurations for Human Interaction Detection

Deep Learning for Object detection & localization

High Level Computer Vision

Human detection based on Sliding Window Approach

Active View Selection for Object and Pose Recognition

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method

Multiple Kernel Learning for Emotion Recognition in the Wild

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b

arxiv: v1 [cs.cv] 10 Jul 2014

Support Vector Machines. James McInerney Adapted from slides by Nakul Verma

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017

CS 179 Lecture 16. Logistic Regression & Parallel SGD

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Deep learning for object detection. Slides from Svetlana Lazebnik and many others

Generative and discriminative classification techniques

Recognizing human actions in still images: a study of bag-of-features and part-based representations

The Pennsylvania State University. The Graduate School. College of Engineering ONLINE LIVESTREAM CAMERA CALIBRATION FROM CROWD SCENE VIDEOS

Facial Landmark Tracking by Tree-based Deformable Part Model Based Detector

Transcription:

Development in Object Detection Junyuan Lin May 4th

Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb, D. McAllester, and D. Ramanan. A discriminatively trained, multiscale, deformable part model, CVPR 2008. Deformable Part Model [3] C. Desai, D. Ramanan, C. Fowlkes. Discriminative Models for Multi-Class Object Layout, ICCV 2009. Multi-class Object Relationship

Max-margin Formulation Linear SVM HOG Feature template Latent SVM Deformable Part Model Structural SVM Y structured output Multi-class Object Relationship

Dalal & Triggs Detector Concatenate HOG Features in a search window into feature vector Formulate the object detection as a binary linear classifier problem Sliding window scanning over the HOG feature pyramid Output location with highest score f(x) Possible post process (Non-maxima suppression)

Linear SVM classifier

Learned weighted Filter Training sample Positive weights Negative weights *

Deformable Part Model Multiscale model captures features at two-resolution relative to p 0 Slide by P. Felzenszwalb

Score of a hypothesis relative to anchor position Latent variable Slide by P. Felzenszwalb

Latent SVM Training Learn Semi-convexity: Hinge loss Not convex in general! is convex for negative examples is concave for positive examples For positive examples make convex by fixing the latent variable for each positive training example.

Latent SVM Training cont d Coordinate descent optimization approach: Initialize and iterate: Fix pick best for each positive example. Fix optimize over by quadratic programming or gradient descent Model training procedure: Initialization: -Root Filter : train an initial root filter F 0 using linear SVM without latent variable. -Part Filter : Initialize six parts from root filter F 0 with the same shape, place at anchor positions with most positive energy in F 0. The size of each part filter is determined to take 80% of the area of F 0. Train the latent SVM model

Example Results Slide by P. Felzenszwalb

Quantitative Result

Multi-class Object Layout Pascal Dataset: Multiple objects of multi-class in a single image. Problems with traditional binary classification + sliding window approach: Intra-class: Require ad-hoc non-maxima suppression as post processing. Inter-class: multiple class models are searched independently over images. Heuristically forcing mutual exclusion. Need a principal way to model spatial statistics between objects

Formulation Formulate as a structured prediction problem. X: a collection of overlapping windows at various scales covering the entire image Y: the entire label vector for the set of all windows in an image. spatial relationship between prediction

Output score of local detectors The parameter learning and inference can be formulated as a structural SVM framework. Spatial context descriptor

Structural SVM Multi-class SVM Joint feature map Exponential Parameters Exponential Constraints represent Y using features extract from X,Y Learn weights so that is max for correct y

Structural SVM cont d n-slack Formulation: (margin rescaling) 1-Slack Formulation: Slide by T. Joachims

Cutting-Plane Algorithm Input: REPEAT FOR Compute ENDFOR IF _ Find most violated constraint Violated by more than ε? optimize StructSVM over ENDIF UNTIL has not changed during iteration Add constraint to working set [Jo06] [JoFinYu08]

Theoretical Bound Theorem: Given any S is bounded by where log, the number of constraints added to working set 2 2 4R C + 16 2 R C ε so that is a feasible solution. Number of constraints is independent of training examples. Linear time training algorithm.

Summary of Structural SVM Training: (cutting plane algorithm) Prediction: Application Specific Design of Model Loss function: Representation: joint feature map Algorithm to compute:

Component Formulation 0-1 loss function joint feature map Greedy forward search algorithm

Slide by D. Ramanan

Slide by D. Ramanan

Slide by D. Ramanan

Slide by D. Ramanan