Detection III: Analyzing and Debugging Detection Methods

Similar documents
Deformable Part Models

Diagnosing Error in Object Detectors

Category-level localization

Find that! Visual Object Detection Primer

Object Category Detection. Slides mostly from Derek Hoiem

Object Detection with Discriminatively Trained Part Based Models

Category vs. instance recognition

Object Category Detection: Sliding Windows

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018


Part based models for recognition. Kristen Grauman

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Object Recognition and Detection

Object Recognition II

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b

Object Detection by 3D Aspectlets and Occlusion Reasoning

Category-level Localization

Modern Object Detection. Most slides from Ali Farhadi

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Linear combinations of simple classifiers for the PASCAL challenge

Part-Based Models for Object Class Recognition Part 3

Part-based and local feature models for generic object recognition

Beyond Bags of features Spatial information & Shape models

Object Detection Design challenges

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Development in Object Detection. Junyuan Lin May 4th

Window based detectors

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

Recap Image Classification with Bags of Local Features

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Supplementary Material: Pixelwise Instance Segmentation with a Dynamically Instantiated Network

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016

Learning Collections of Part Models for Object Recognition

Object Category Detection: Sliding Windows

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

24 hours of Photo Sharing. installation by Erik Kessels

Object Recognition with Deformable Models

A Discriminatively Trained, Multiscale, Deformable Part Model

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Computer Vision: Summary and Discussion

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

Lecture 15: Detecting Objects by Parts

Immediate, scalable object category detection

Histograms of Oriented Gradients

HOGgles: Visualizing Object Detection Features

c 2011 by Pedro Moises Crisostomo Romero. All rights reserved.

Bias-Variance Trade-off (cont d) + Image Representations

Semantic Pooling for Image Categorization using Multiple Kernel Learning

Learning Spatial Context: Using Stuff to Find Things

Bag-of-features. Cordelia Schmid

Object Detection Based on Deep Learning

Detection and Localization with Multi-scale Models

Segmentation and Grouping

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Modeling 3D viewpoint for part-based object recognition of rigid objects

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Segmenting Objects in Weakly Labeled Videos

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Object Detection with YOLO on Artwork Dataset

Object Detection. Computer Vision Yuliang Zou, Virginia Tech. Many slides from D. Hoiem, J. Hays, J. Johnson, R. Girshick

DPM Score Regressor for Detecting Occluded Humans from Depth Images

Detecting Object Instances Without Discriminative Features

Learning Representations for Visual Object Class Recognition

Detecting Objects using Deformation Dictionaries

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

Deformable Part Models with Individual Part Scaling

Previously. Window-based models for generic object detection 4/11/2011

Selective Search for Object Recognition

Multiple-Person Tracking by Detection

Structured Models in. Dan Huttenlocher. June 2010

Segmentation as Selective Search for Object Recognition in ILSVRC2011

Object Detection. Sanja Fidler CSC420: Intro to Image Understanding 1/ 1

Deep condolence to Professor Mark Everingham

Lecture 16: Object recognition: Part-based generative models

Training Deformable Object Models for Human Detection based on Alignment and Clustering

Finding Tiny Faces Supplementary Materials

Ranking Figure-Ground Hypotheses for Object Segmentation

Apprenticeship Learning: Transfer of Knowledge via Dataset Augmentation

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Every Picture Tells a Story: Generating Sentences from Images

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients

Learning Realistic Human Actions from Movies

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Indexing Ensembles of Exemplar-SVMs with Rejecting Taxonomies for Fast Evaluation

High Level Computer Vision

Part-based models. Lecture 10

Fitting: The Hough transform

Selection of Scale-Invariant Parts for Object Class Recognition

Accurate Object Detection with Location Relaxation and Regionlets Re-localization

Pedestrian Detection Using Structured SVM

Segmentation. Bottom up Segmentation Semantic Segmentation

HISTOGRAMS OF ORIENTATIO N GRADIENTS

Deformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Transcription:

CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015

Today Review: Deformable part models How can we speed up detection? In what ways does detection fail? How can we visualize features and models?

Parts-based Models Define object by collection of parts modeled by 1. Appearance 2. Spatial configuration Rob Fergus

How to model spatial relations? Star-shaped model X = X Part Part Part X Root Part Part Derek Hoiem

Implicit shape models: Training 1. Build vocabulary of patches around extracted interest points using clustering 2. Map the patch around each interest point to closest word 3. For each word, store all positions it was found, relative to object center Lana Lazebnik

Implicit shape models: Testing 1. Given new test image, extract patches, match to vocabulary words 2. Cast votes for possible positions of object center 3. Search for maxima in voting space Lana Lazebnik

Histograms of oriented gradients (HOG) Bin gradients from 8x8 pixel neighborhoods into 9 orientations (Dalal & Triggs CVPR 05)

Discriminative part-based models Root filter Part filters Deformation weights P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan, Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010 Lana Lazebnik

Scoring an object hypothesis The score of a hypothesis is the sum of appearance scores minus the sum of deformation costs Subwindow features n n 2 2 0 i ( dxi, dyi, dxi, dyi i 0 i 1 score( p,..., pn) Fi H( pi ) D Displacements ) Appearance weights Deformation weights Adapted from Lana Lazebnik

What is an Object? B. Alexe, T. Deselaers, and V. Ferrari Computer Vision and Pattern Recognition (CVPR) 2010

Speeding up detection: Restrict set of windows we pass through SVM to those w/ high objectness Alexe et al., CVPR 2010

Alexe et al., CVPR 2010 Objectness cue #1: Where people look

Objectness cue #2: color contrast at boundary Alexe et al., CVPR 2010

Objectness cue #3: no segments straddling the object box Alexe et al., CVPR 2010

Boxes found to have high objectness Cyan = ground truth bounding boxes, yellow = correct and red = incorrect predictions for objectness Only run the sheep / horse / chair etc. classifier on the yellow/red boxes. Alexe et al., CVPR 2010

Today Review: Deformable part models How can we speed up detection? In what ways does detection fail? How can we visualize features and detections?

Diagnosing Error in Object Detectors D. Hoiem, Y. Chodpathumwan and Q. Dai European Conference on Computer Vision (ECCV) 2012

Object detection is a collection of problems Intra-class Variation for Airplane Occlusion Shape Viewpoint Distance Hoiem et al., ECCV 2012

Object detection is a collection of problems Confusing Distractors for Airplane Background Similar Categories Dissimilar Categories Localization Error Hoiem et al., ECCV 2012

Top false positives: Airplane (DPM) AP = 0.36 3 1 5 4 27 37 Background 27% Localization 29% 30 33 Other Objects 11% Similar Objects 33% Bird, Boat, Car 2 6 7 Hoiem et al., ECCV 2012

Top false positives: Dog (DPM) AP = 0.03 1 6 16 Background 23% Localization 17% 2 4 5 Other Objects 10% Similar Objects 50% Person, Cat, Horse 8 22 3 9 10 Hoiem et al., ECCV 2012

Analysis of object characteristics Additional annotations for seven categories: occlusion level, parts visible, sides visible Hoiem et al., ECCV 2012

Object characteristics: Aeroplane Occlusion: poor robustness to occlusion, but little impact on overall performance Easier (None) Hoiem et al., ECCV 2012 Harder (Heavy)

Object characteristics: Aeroplane Size: strong preference for average to above average sized airplanes Large Medium X-Large Small X-Small Easier Hoiem et al., ECCV 2012 Harder

Object characteristics: Aeroplane Aspect Ratio: 2-3x better at detecting wide (side) views than tall views X-Wide Wide Medium X-Tall Tall Easier (Wide) Hoiem et al., ECCV 2012 Harder (Tall)

Object characteristics: Aeroplane Sides/Parts: best performance = direct side view with all parts visible Easier (Side) Hoiem et al., ECCV 2012 Harder (Non-Side)

Conclusions Most errors that detectors make are reasonable Localization error and confusion with similar objects Misdetection of occluded or small objects Detectors have different sensitivity to different factors E.g. less sensitive to truncation than to size differences Code and annotations are available online http://web.engr.illinois.edu/~dhoiem/projects/detectionanalysis/ Adapted from Hoiem et al., ECCV 2012

Today Review: Deformable part models How can we speed up detection? In what ways does detection fail? How can we visualize features and detections?

HOGgles: Visualizing Object Detection Features C. Vondrick, A. Khosla, T. Malisiewicz, and A. Torralba International Conference on Computer Vision (ICCV) 2013

Why did the detector fail? Car Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013 What information is lost?

Vondrick et al., ICCV 2013 What information is lost?

Recovering image from neighbors Image HOG Top detections Vondrick et al., ICCV 2013

Recovering image from neighbors Image HOG Top detections Vondrick et al., ICCV 2013

Recovering image from neighbors Image HOG Top detections Vondrick et al., ICCV 2013

Recovering image from neighbors Image HOG Top detections Vondrick et al., ICCV 2013

Better recovery using paired dictionary Vondrick et al., ICCV 2013

A microscope to view HOG 2x more intuitive Vondrick et al., ICCV 2013

vs Human Vision HOG Vision Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013

The HOGgles Challenge Humans detect & DPMs detect Vondrick et al., ICCV 2013

The HOGgles Challenge Humans miss & DPM miss Vondrick et al., ICCV 2013

Vondrick et al., ICCV 2013 Chair Detections

Vondrick et al., ICCV 2013 Chair Detections

Vondrick et al., ICCV 2013 Car Detections

Vondrick et al., ICCV 2013 Car Detections

Precision HOG+Human Human performance with HOG is poor despite perfect learning Detector 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.1 Chair Loss due to RGB -> HOG HOG+DPM HOG+Human RGB+Human 0 0 0.2 0.4 0.6 0.8 1 Recall Vondrick et al., ICCV 2013

Why did the detector fail? Car Vondrick et al., ICCV 2013

Why did the detector fail? Car Vondrick et al., ICCV 2013

Why did the detector fail? Car Vondrick et al., ICCV 2013

Visualizing Learned Models Car Person Bottle Bicycle Motorbike Chair TV Horse Vondrick et al., ICCV 2013

http://web.mit.edu/vondrick/ihog/ What is this?

http://web.mit.edu/vondrick/ihog/ What is this?

http://web.mit.edu/vondrick/ihog/ What is this?

http://web.mit.edu/vondrick/ihog/ What is this?

http://web.mit.edu/vondrick/ihog/ What is this?

http://web.mit.edu/vondrick/ihog/ What is this?

Summary We can speed up object detection by using the notion of objectness to prune windows unlikely to contain any object Some failure modes are more important than others and fixing them could increase the overall detection performance Even humans cannot produce correct classifications with imperfect features