Object Detection with Discriminatively Trained Part Based Models

Similar documents
Find that! Visual Object Detection Primer

Development in Object Detection. Junyuan Lin May 4th

A Discriminatively Trained, Multiscale, Deformable Part Model

Category-level localization

Object Recognition with Deformable Models

Deformable Part Models

Deformable Part Models

Detection III: Analyzing and Debugging Detection Methods

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Object Recognition II

Pedestrian Detection and Tracking in Images and Videos

Modern Object Detection. Most slides from Ali Farhadi

A Discriminatively Trained, Multiscale, Deformable Part Model

Pedestrian Detection Using Structured SVM

Graphical Models for Computer Vision

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Object Category Detection. Slides mostly from Derek Hoiem


Part-Based Models for Object Class Recognition Part 3

Category vs. instance recognition

Modeling 3D viewpoint for part-based object recognition of rigid objects

18 October, 2013 MVA ENS Cachan. Lecture 6: Introduction to graphical models Iasonas Kokkinos

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b

Rich feature hierarchies for accurate object detection and semantic segmentation

Linear combinations of simple classifiers for the PASCAL challenge

Object Category Detection: Sliding Windows

arxiv: v1 [cs.cv] 10 Jul 2014

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

arxiv: v2 [cs.cv] 25 Aug 2014

Lecture 15: Detecting Objects by Parts

Category-level Localization

Spatial Localization and Detection. Lecture 8-1

Object Detection Design challenges

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Object Category Detection: Sliding Windows

Object Detection Based on Deep Learning

Deep Learning for Object detection & localization

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

Multiple-Person Tracking by Detection

Part based models for recognition. Kristen Grauman

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Deformable Part Models with Individual Part Scaling

Recap Image Classification with Bags of Local Features

The Pennsylvania State University. The Graduate School. College of Engineering ONLINE LIVESTREAM CAMERA CALIBRATION FROM CROWD SCENE VIDEOS

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

Data Mining: Concepts and Techniques. Chapter 9 Classification: Support Vector Machines. Support Vector Machines (SVMs)

c 2011 by Pedro Moises Crisostomo Romero. All rights reserved.

CS5670: Computer Vision

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Object Detection with Heuristic Coarse-to-Fine Search. Ross Girshick

Using k-poselets for detecting people and localizing their keypoints

Object recognition. Methods for classification and image representation

CS6716 Pattern Recognition

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

Pedestrian Detection with Occlusion Handling

HOG-based Pedestriant Detector Training

Generative and discriminative classification techniques

TA Section: Problem Set 4

Tri-modal Human Body Segmentation

Structured Models in. Dan Huttenlocher. June 2010

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Selective Search for Object Recognition

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

Semantic Pooling for Image Categorization using Multiple Kernel Learning

Classification of objects from Video Data (Group 30)

Object Detection with YOLO on Artwork Dataset

Detection and Localization with Multi-scale Models

Rich feature hierarchies for accurate object detection and semant

Classifiers and Detection. D.A. Forsyth

Constrained Convolutional Neural Networks for Weakly Supervised Segmentation. Deepak Pathak, Philipp Krähenbühl and Trevor Darrell

FMA901F: Machine Learning Lecture 3: Linear Models for Regression. Cristian Sminchisescu

CS570: Introduction to Data Mining

Learning to Localize Objects with Structured Output Regression

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Support vector machines

Bag-of-features. Cordelia Schmid

Histograms of Sparse Codes for Object Detection

Detecting Objects using Deformation Dictionaries

Histograms of Oriented Gradients for Human Detection p. 1/1

Object Detection. Computer Vision Yuliang Zou, Virginia Tech. Many slides from D. Hoiem, J. Hays, J. Johnson, R. Girshick

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Structured output regression for detection with partial truncation

Part-based models. Lecture 10

convolution shift invariant linear system Fourier Transform Aliasing and sampling scale representation edge detection corner detection

CPSC 340: Machine Learning and Data Mining. More Linear Classifiers Fall 2017

Discriminative classifiers for image recognition

Rich feature hierarchies for accurate object detection and semantic segmentation

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

CS 229 Midterm Review

DPM Score Regressor for Detecting Occluded Humans from Depth Images

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Object Detection by 3D Aspectlets and Occlusion Reasoning

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Multiresolution models for object detection

Object Recognition and Detection

Transcription:

Object Detection with Discriminatively Trained Part Based Models Pedro F. Felzenszwelb, Ross B. Girshick, David McAllester and Deva Ramanan Presented by Fabricio Santolin da Silva Kaustav Basu Some slides were taken from Prof. Trevor Darrell lecture

Contents 1. Introduction 2. Model Overview 3. SVM 4. Latent SVM 5. Initialization 6. Features 7. Sample Models 8. Bounding Box Predition 9. Results

Introduction Problem: Recognition of generic objects with different shapes, colors or poses Cars, People, Bycicles, Animals

Introduction Detection System that represents highly variable objects using mixture of multiscale deformable part models. Performace gap among simpler models (rigid templates and bag-of-features) and richer models (pictorial structures)

Introduction Dalal-Triggs detector HOG filter Sliding box

Problem with bag-of-words All have equal probability for bag-of-words methods Location information is important

Model Overview Mixture of deformable part models (pictorial structures) Each component has global template + deformable parts Fully trained from bounding boxes alone

2 component bicycle model root filters coarse resolution part filters finer resolution deformation models

Object Hypothesis Score of filter is dot product of filter with HOG features underneath it Score of object hypothesis is sum of filter scores minus deformation costs plus the bias Multiscale model captures features at two resolutions

Model

Mixture Models

SVM Review Separable by a hyperplane in 2-d: x2 x1

Which one? x2 x1

Maximum Margin: x2 x1

Linear SVM Mathematically Goal: 1) Correctly classify all training data wxi +b 1 if yi = +1 wxi +b 1 if y = -1 i yi ( wxi +b) 1 for all i 2) Maximize the Margin same as minimize 2 M = w 1 t ww 2 We can formulate a Quadratic Optimization Problem and solve for w and b Minimize subject to 1 t Φ( w) = w w 2 yi ( wxi +b) 1 i

Latent SVM Linear in w if z is fixed Regularizatio n Hinge loss

Latent SVM training Non-convex optimization Huge number of negative examples Convex if we fix z for positive examples Optimization: - Initialize w and iterate: - Pick best z for each positive example - Optimize w via gradient descent with data mining

Initializing w For k component mixture model: Split examples into k sets based on bounding box aspect ratio Learn k root filters using standard SVM - Training data: warped positive examples and random windows from negative images (Dalal & Triggs) Initialize parts by selecting patches from root filters - Subwindows with strong coefficients - Interpolate to get higher resolution filters - Initialize spatial model using fixed spring constants

Histogram of Gradient (HOG) features Dalal & Triggs: - Histogram gradient orientations in 8x8 pixel blocks (9 bins) Normalize with respect to 4 different neighborhoods and truncate 9 orientations * 4 normalizations = 36 features per block PCA gives ~10 features that capture all information - Fewer parameters, speeds up convolution, but costly projection at runtime Analytic projection: spans PCA subspace and easy to compute - 9 orientations + 4 normalizations = 13 features

Car model root filters part filters coarse resolution finer resolution deformation models

Person model root filters part filters coarse resolution finer resolution deformation models

Bottle model root filters part filters coarse resolution finer resolution deformation models

Bounding box prediction (x1, y1) (x2, y2) predict (x1, y1) and (x2, y2) from part locations linear function trained using least-squares regression

Context rescoring Rescore a detection using context defined by all detections Let v be the max score of detector for class i in the image Let s be the highest score of a particular detection Let (x,y ), (x,y ) be normalized bounding box coordinates f = (s, x, y, x, y, v, v..., v ) Train class specific classifier - f is positive example if true positive detection - f is negative example if false positive detection i 1 1 1 2 1 2 2 2 1 2 20

PASCAL Challenge ~10,000 images, with ~25,000 target objects. - Objects from 20 categories (person, car, bicycle, cow, table...). - Objects are annotated with labeled bounding boxes.

Bicycle detection

More bicycles False positives

Car

Person, Bottle and Horse

Other information For other information and the implementantion of their code http://people.cs.uchicago.edu/~pff/latent/

Thank you!