Similar documents
Object Detection Design challenges

Object Category Detection: Sliding Windows

Category vs. instance recognition

Object Category Detection: Sliding Windows

Object Category Detection. Slides mostly from Derek Hoiem

Modern Object Detection. Most slides from Ali Farhadi

Object Detection. Computer Vision Yuliang Zou, Virginia Tech. Many slides from D. Hoiem, J. Hays, J. Johnson, R. Girshick

Deformable Part Models

Object Recognition II

Window based detectors

Recap Image Classification with Bags of Local Features

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Category-level localization

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Find that! Visual Object Detection Primer

2D Image Processing Feature Descriptors

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

Histograms of Oriented Gradients for Human Detection p. 1/1

Generic Object-Face detection

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Detection III: Analyzing and Debugging Detection Methods

Templates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser

Histogram of Oriented Gradients for Human Detection

Face Detection and Alignment. Prof. Xin Yang HUST

Development in Object Detection. Junyuan Lin May 4th

Object recognition. Methods for classification and image representation

Classifier Case Study: Viola-Jones Face Detector

Face and Nose Detection in Digital Images using Local Binary Patterns

High Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG)

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Previously. Window-based models for generic object detection 4/11/2011

An Implementation on Histogram of Oriented Gradients for Human Detection

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

A Cascade of Feed-Forward Classifiers for Fast Pedestrian Detection

Human detection based on Sliding Window Approach

Study of Viola-Jones Real Time Face Detector

Discriminative classifiers for image recognition

Detecting and Reading Text in Natural Scenes

Learning to Detect Faces. A Large-Scale Application of Machine Learning

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Object Recognition and Detection

Fast Human Detection With Cascaded Ensembles On The GPU

Beyond Bags of features Spatial information & Shape models

Histogram of Oriented Gradients (HOG) for Object Detection

Object recognition (part 1)

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Skin and Face Detection

A Discriminatively Trained, Multiscale, Deformable Part Model

Object Detection with Discriminatively Trained Part Based Models

Human-Robot Interaction

Human detections using Beagle board-xm

Object detection. Asbjørn Berge. INF 5300 Advanced Topic: Video Content Analysis. Technology for a better society

Multiple-Person Tracking by Detection

Pedestrian Detection and Tracking in Images and Videos

Fast Human Detection with Cascaded Ensembles. Berkin Bilgiç

Fast Human Detection Using a Cascade of Histograms of Oriented Gradients

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Deformable Part Models

Classification and Detection in Images. D.A. Forsyth

Histograms of Oriented Gradients

Active learning for visual object recognition

Machine Learning for Signal Processing Detecting faces (& other objects) in images

Face detection and recognition. Detection Recognition Sally

Final Project Face Detection and Recognition

International Journal Of Global Innovations -Vol.4, Issue.I Paper Id: SP-V4-I1-P17 ISSN Online:

Haar Wavelets and Edge Orientation Histograms for On Board Pedestrian Detection

RGBD Face Detection with Kinect Sensor. ZhongJie Bi

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Parameter Sensitive Detectors

Image Analysis. Window-based face detection: The Viola-Jones algorithm. iphoto decides that this is a face. It can be trained to recognize pets!

Spatial Localization and Detection. Lecture 8-1

Traffic Sign Localization and Classification Methods: An Overview

DPM Score Regressor for Detecting Occluded Humans from Depth Images

CS 221: Object Recognition and Tracking

A New Strategy of Pedestrian Detection Based on Pseudo- Wavelet Transform and SVM

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Visual Detection and Species Classification of Orchid Flowers

Car Detecting Method using high Resolution images

DEPARTMENT OF INFORMATICS

Harder case. Image matching. Even harder case. Harder still? by Diva Sian. by swashford

Automatic Initialization of the TLD Object Tracker: Milestone Update

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Ensemble Methods, Decision Trees

Designing Applications that See Lecture 7: Object Recognition

Face/Flesh Detection and Face Recognition

HISTOGRAMS OF ORIENTATIO N GRADIENTS

Boosting Object Detection Performance in Crowded Surveillance Videos

Human Motion Detection and Tracking for Video Surveillance

Linear combinations of simple classifiers for the PASCAL challenge

Implementation of Human detection system using DM3730

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

Classification of objects from Video Data (Group 30)

AN HARDWARE ALGORITHM FOR REAL TIME IMAGE IDENTIFICATION 1

PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE

Classifiers and Detection. D.A. Forsyth

CS4670: Computer Vision

Introduction. How? Rapid Object Detection using a Boosted Cascade of Simple Features. Features. By Paul Viola & Michael Jones

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

Object detection as supervised classification

Transcription:

https://en.wikipedia.org/wiki/the_dress

Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola and Jones. Rapid Object Detection using a Boosted Cascade of Simple Features (2001).

Cascade for Fast Detection Stage 1 H 1 (x) > t 1? Yes Stage 2 H 2 (x) > t 2? Stage N H N (x) > t N? Yes Pass Examples No No No Reject Reject Reject Choose threshold for low false negative rate Fast classifiers early in cascade Slow classifiers later, but most examples don t get there

Features that are fast to compute Haar-like features Differences of sums of intensity Thousands, computed at various positions and scales within detection window -1 +1 Two-rectangle features Three-rectangle features Etc.

Integral Images ii = cumsum(cumsum(im, 1), 2) x, y ii(x,y) = Sum of the values in the grey region SUM within Rectangle D is ii(4) - ii(2) - ii(3) + ii(1)

Feature selection with Adaboost Create a large pool of features (180K) Select features that are discriminative and work well together Weak learner = feature + threshold + parity Choose weak learner that minimizes error on the weighted training set Reweight

Viola Jones Results Speed = 15 FPS (in 2001) MIT + CMU face dataset

Object Detection Overview Viola-Jones Dalal-Triggs Deformable models Deep learning

Statistical Template Object model = sum of scores of features at fixed positions +3 +2-2 -1-2.5 = -0.5? > 7.5 Non-object +4 +1 +0.5 +3 +0.5 = 10.5? > 7.5 Object

Example: Dalal-Triggs pedestrian detector 1. Extract fixed-sized (64x128 pixel) window at each position and scale 2. Compute HOG (histogram of gradient) features within each window 3. Score the window with a linear SVM classifier 4. Perform non-maxima suppression to remove overlapping detections with lower scores Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Tested with RGB LAB Grayscale Gamma Normalization and Compression Square root Log Slightly better performance vs. grayscale Very slightly better performance vs. no adjustment

Outperforms centered diagonal uncentered cubic-corrected Sobel Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Histogram of gradient orientations Orientation: 9 bins (for unsigned angles 0-180) Histograms in k x k pixel cells Votes weighted by magnitude Bilinear interpolation between cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Normalize with respect to surrounding cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Original Formulation # orientations X= # features = 15 x 7 x 9 x 4 = 3780 # cells # normalizations by neighboring cells Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

pos w neg w Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

pedestrian Slides by Pete Barnum Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR05

Pedestrian detection with HOG Train a pedestrian template using a linear support vector machine At test time, convolve feature map with template Find local maxima of response For multi-scale detection, repeat over multiple levels of a HOG pyramid HOG feature map Template Detector response map N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection, CVPR 2005

Something to think about Sliding window detectors work very well for faces fairly well for cars and pedestrians badly for cats and dogs Why are some classes easier than others?

Strengths and Weaknesses of Statistical Template Approach Strengths Works very well for non-deformable objects with canonical orientations: faces, cars, pedestrians Fast detection Weaknesses Not so well for highly deformable objects or stuff Not robust to occlusion Requires lots of training data

Tricks of the trade Details in feature computation really matter E.g., normalization in Dalal-Triggs improves detection rate by 27% at fixed false positive rate Template size Typical choice is size of smallest detectable object Jittering to create synthetic positive examples Create slightly rotated, translated, scaled, mirrored versions as extra positive examples Bootstrapping to get hard negative examples 1. Randomly sample negative examples 2. Train detector 3. Sample negative examples that score > -1 4. Repeat until all high-scoring negative examples fit in memory

Things to remember Sliding window for search Features based on differences of intensity (gradient, wavelet, etc.) Excellent results require careful feature design Boosting for feature selection Integral images, cascade for speed Stage 1 H 1 (x) > t 1? Yes Stage 2 H 2 (x) > t 2? Stage N H N (x) > t N? Yes Pass Bootstrapping to deal with many, many negative examples Examples Reject No Reject No Reject No

Discriminative part-based models Root filter Part filters Deformation weights P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010

Car model Component 1 Component 2

Person model

Bottle model

More detections

Twenty object categories (aeroplane to TV/monitor) Three challenges: Classification challenge (is there an X in this image?) Detection challenge (draw a box around every X) Segmentation challenge Slides from Noah Snavely

is is there a cat?

Chance essentially 0