Modern Object Detection. Most slides from Ali Farhadi

Similar documents
Object Category Detection. Slides mostly from Derek Hoiem

Category vs. instance recognition


Object Category Detection: Sliding Windows

Object Recognition II

Object Detection Design challenges

Deformable Part Models

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Part based models for recognition. Kristen Grauman

Object Detection. Computer Vision Yuliang Zou, Virginia Tech. Many slides from D. Hoiem, J. Hays, J. Johnson, R. Girshick

Beyond Bags of features Spatial information & Shape models

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Development in Object Detection. Junyuan Lin May 4th

2D Image Processing Feature Descriptors

Object Detection with Discriminatively Trained Part Based Models

Detection III: Analyzing and Debugging Detection Methods

Find that! Visual Object Detection Primer

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Part-based models. Lecture 10

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Object recognition. Methods for classification and image representation

Recap Image Classification with Bags of Local Features

An Implementation on Histogram of Oriented Gradients for Human Detection

Histograms of Oriented Gradients for Human Detection p. 1/1

Part-based and local feature models for generic object recognition

A Discriminatively Trained, Multiscale, Deformable Part Model

Templates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Category-level localization

Object Category Detection: Sliding Windows

Part-Based Models for Object Class Recognition Part 3

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Deformable Part Models

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Tri-modal Human Body Segmentation

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Object Recognition and Detection

Window based detectors

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

Histogram of Oriented Gradients for Human Detection

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Classification of objects from Video Data (Group 30)

Sketchable Histograms of Oriented Gradients for Object Detection

Edge Detection. CSE 576 Ali Farhadi. Many slides from Steve Seitz and Larry Zitnick

Car Detecting Method using high Resolution images

Every Picture Tells a Story: Generating Sentences from Images

Segmentation. Bottom up Segmentation Semantic Segmentation

High Level Computer Vision

Human detection based on Sliding Window Approach

Discriminative classifiers for image recognition

Human detections using Beagle board-xm

Modeling 3D viewpoint for part-based object recognition of rigid objects

Multiple-Person Tracking by Detection

Linear combinations of simple classifiers for the PASCAL challenge

Pedestrian Detection and Tracking in Images and Videos

Object Recognition with Deformable Models

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin

Classification and Detection in Images. D.A. Forsyth

Histogram of Oriented Gradients (HOG) for Object Detection

Object Classification for Video Surveillance

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:

Classifiers and Detection. D.A. Forsyth

Lecture 18: Human Motion Recognition

Structured Models in. Dan Huttenlocher. June 2010

Learning and Recognizing Visual Object Categories Without First Detecting Features

A novel template matching method for human detection

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Loose Shape Model for Discriminative Learning of Object Categories

A Discriminatively Trained, Multiscale, Deformable Part Model

HISTOGRAMS OF ORIENTATIO N GRADIENTS

The Pennsylvania State University. The Graduate School. College of Engineering ONLINE LIVESTREAM CAMERA CALIBRATION FROM CROWD SCENE VIDEOS

Local Features based Object Categories and Object Instances Recognition

Generative and discriminative classification techniques

Object Detection by 3D Aspectlets and Occlusion Reasoning

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

CS 231A Computer Vision (Fall 2011) Problem Set 4

Implementation of Human detection system using DM3730

Anno accademico 2006/2007. Davide Migliore

arxiv: v1 [cs.cv] 10 Jul 2014

Parameter Sensitive Detectors

Multiple Pose Context Trees for estimating Human Pose in Object Context

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Histograms of Oriented Gradients

Combining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation

Pedestrian Detection Using Structured SVM

Nested Pictorial Structures

Automatic Discovery of Meaningful Object Parts with Latent CRFs

CS 221: Object Recognition and Tracking

arxiv: v2 [cs.cv] 25 Aug 2014

TA Section: Problem Set 4

3 October, 2013 MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos

Robust PDF Table Locator

Feature Sets and Dimensionality Reduction for Visual Object Detection

Transcription:

Modern Object Detection Most slides from Ali Farhadi

Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij 0 i j θ kj = i δ ( x = 1 y = k) i ij δ ( y = k) + Kr i i + r θ T 1 x + θ whereθ T 0 1 j θ 0 j ( 1 x) > 0 P ( x j = 1 y = 1 = log ), P( x j = 1 y = 0) P( x j = 0 y = 1) = log P( x = 0 y = 0) j Logistic Regression maximize where P ( P( y x, θ) ) log i + λ θ i T ( y x, θ) = 1/ ( 1+ exp( y θ x ) i i Gradient ascent θ T x > t Linear SVM minimize such that 1 λ ξi + θ i 2 T y θ x 1 ξ i i i, ξ 0 i Quadratic programming or subgradient opt. θ T x > t Kernelized SVM complicated to write Quadratic programming ( xˆ, x) yiα ik i > 0 i Nearest Neighbor most similar features same label Record data y i where i = argmin i K ( x ˆ,x) i

Image Categorization Training Images Training Image Features Training Labels Classifier Training Trained Classifier Test Image Image Features Testing Trained Classifier Prediction Outdoor

Example: Dalal-Triggs pedestrian detector 1. Extract fixed-sized (64x128 pixel) window at each position and scale 2. Compute HOG (histogram of gradient) features within each window 3. Score the window with a linear SVM classifier 4. Perform non-maxima suppression to remove overlapping detections with lower scores Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Tested with RGB LAB Grayscale Slightly better performance vs. grayscale

Outperforms centered diagonal uncentered cubic-corrected Sobel Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Histogram of gradient orientations Orientation: 9 bins (for unsigned angles) Histograms in 8x8 pixel cells Votes weighted by magnitude Bilinear interpolation between cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Normalize with respect to surrounding cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

# orientations X= # features = 15 x 7 x 9 x 4 = 3780 # cells # normalizations by neighboring cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Training set

SVM pos w neg w Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

pedestrian Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Detection examples

Each window is separately classified

What about this one? Can the model we trained for pedestrians detect the person in this image?

Specifying an object model Statistical Template in Bounding Box Object is some (x,y,w,h) in image Features defined wrt bounding box coordinates Image Template Visualization Images from Felzenszwalb

When do statistical templates make sense? Caltech 101 Average Object Images

Deformable objects Images from Caltech-256 Slide Credit: Duan Tran

Deformable objects Images from D. Ramanan s dataset Slide Credit: Duan Tran

Parts-based Models Define objects by collection of parts modeled by 1. Appearance 2. Spatial configuration Slide credit: Rob Fergus

Explicit Models Hybrid template/parts model Detections Template Visualization Felzenszwalb et al. 2008

How to model spatial relations? Many others... O(N 6 ) O(N 2 ) O(N 3 ) O(N 2 ) Fergus et al. 03 Fei-Fei et al. 03 Leibe et al. 04, 08 Crandall et al. 05 Fergus et al. 05 Crandall et al. 05 Felzenszwalb & Huttenlocher 05 Csurka 04 Vasconcelos 00 Bouchard & Triggs 05 Carneiro & Lowe 06 from [Carneiro & Lowe, ECCV 06]

Tree-shaped model

Pictorial Structures Model Part = oriented rectangle Spatial model = relative size/orientation Felzenszwalb and Huttenlocher 2005

Pictorial Structures Model Appearance likelihood Geometry likelihood

Part representation Background subtraction

Pictorial Structures

Results for person matching 34

Results for person matching 35

Enhanced pictorial structures BMVC 2009

Deformable Latent Parts Model Useful parts discovered during training Detections Template Visualization Felzenszwalb et al. 2008

Score = F 0.Φ(p 0,H) + Σ F i.φ(p i,h) - Σ d i.φ d (x,y) 38

State-of-the-art Detector: Deformable Parts Model (DPM) Lifetime Achievement 1. Strong low-level features based on HOG 2. Efficient matching algorithms for deformable part-based models (pictorial structures) 3. Discriminative learning with latent variables (latent SVM) Felzenszwalb 39 et al., 2008, 2010, 2011, 2012

40

41

Car 42

43

Cat 44

45

Person riding horse

Person riding bicycle

48

Structure Recognition using Visual Phrases, CVPR 2011