Object Recognition with Deformable Models

Similar documents
A Discriminatively Trained, Multiscale, Deformable Part Model

Object Detection with Discriminatively Trained Part Based Models

Find that! Visual Object Detection Primer

Graphical Models for Computer Vision

Category-level localization

A Discriminatively Trained, Multiscale, Deformable Part Model

Part-Based Models for Object Class Recognition Part 3

Development in Object Detection. Junyuan Lin May 4th

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Structured Models in. Dan Huttenlocher. June 2010

Deformable Part Models

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Detection III: Analyzing and Debugging Detection Methods

Modeling 3D viewpoint for part-based object recognition of rigid objects

Category-level Localization

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

High Level Computer Vision

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

Learning and Recognizing Visual Object Categories Without First Detecting Features

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Object Detection with Heuristic Coarse-to-Fine Search. Ross Girshick

Part based models for recognition. Kristen Grauman

Part-based models. Lecture 10

Every Picture Tells a Story: Generating Sentences from Images

Modern Object Detection. Most slides from Ali Farhadi

Pictorial Structures for Object Recognition

Object Category Detection. Slides mostly from Derek Hoiem

Object Recognition II

Deformable Part Models with Individual Part Scaling

DPM Configurations for Human Interaction Detection

Semantic Pooling for Image Categorization using Multiple Kernel Learning

2/15/2009. Part-Based Models. Andrew Harp. Part Based Models. Detect object from physical arrangement of individual features


Beyond Bags of features Spatial information & Shape models

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

c 2011 by Pedro Moises Crisostomo Romero. All rights reserved.

Computer Vision Lecture 16

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision

CS 1674: Intro to Computer Vision. Object Recognition. Prof. Adriana Kovashka University of Pittsburgh April 3, 5, 2018

Lecture 15: Detecting Objects by Parts

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

Pedestrian Detection and Tracking in Images and Videos

Robotics Programming Laboratory

Pedestrian Detection Using Structured SVM

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function

Detecting Objects using Deformation Dictionaries

CS 231A Computer Vision (Fall 2011) Problem Set 4

Detection and Localization with Multi-scale Models

Object Detection by 3D Aspectlets and Occlusion Reasoning

Learning to Localize Objects with Structured Output Regression

Multiple-Person Tracking by Detection

Deformable Part Models

Category vs. instance recognition

Nested Pictorial Structures

18 October, 2013 MVA ENS Cachan. Lecture 6: Introduction to graphical models Iasonas Kokkinos

Recent Progress on Object Classification and Detection

Comparison of Object Detection Algorithms on Maritime Vessels Mark Chua 1, David W. Aha 2, Bryan Auslander 3, Kalyan Gupta 3, and Brendan Morris 1

CS 231A Computer Vision (Fall 2012) Problem Set 4

Course Administration

The Caltech-UCSD Birds Dataset

P-CNN: Pose-based CNN Features for Action Recognition. Iman Rezazadeh

CS5670: Intro to Computer Vision

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

arxiv: v1 [cs.cv] 10 Jul 2014

Lecture 12 Recognition. Davide Scaramuzza

Object Category Detection: Sliding Windows

Boosted Local Structured HOG-LBP for Object Localization

In Good Shape: Robust People Detection based on Appearance and Shape

Recap Image Classification with Bags of Local Features

Histograms of Oriented Gradients for Human Detection p. 1/1

arxiv: v2 [cs.cv] 25 Aug 2014

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Object Recognition Using Junctions

Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation

Multiresolution models for object detection

Recognizing human actions in still images: a study of bag-of-features and part-based representations

Articulated Pose Estimation using Discriminative Armlet Classifiers

Feature Sets and Dimensionality Reduction for Visual Object Detection

Unified, real-time object detection

Window based detectors

Lecture 12 Recognition

Finding people in repeated shots of the same scene

Segmenting Objects in Weakly Labeled Videos

CS 664 Flexible Templates. Daniel Huttenlocher

Previous Lecture - Coded aperture photography

Recognizing Human Actions from Still Images with Latent Poses

CSE373: Data Structures & Algorithms Lecture 23: Applications. Linda Shapiro Spring 2016

Object Detection. TA : Young-geun Kim. Biostatistics Lab., Seoul National University. March-June, 2018

Developing Open Source code for Pyramidal Histogram Feature Sets

DPM Score Regressor for Detecting Occluded Humans from Depth Images

Representation and Detection of Deformable Shapes

Parallelized deformable part models with effective hypothesis pruning

arxiv: v2 [cs.cv] 1 Oct 2014

Learning Representations for Visual Object Class Recognition

24 hours of Photo Sharing. installation by Erik Kessels

Exact Acceleration of Linear Object Detectors

Transcription:

Object Recognition with Deformable Models Pedro F. Felzenszwalb Department of Computer Science University of Chicago Joint work with: Dan Huttenlocher, Joshua Schwartz, David McAllester, Deva Ramanan.

Example Problems Detecting rigid objects PASCAL challenge Detecting non-rigid objects Medical image analysis Segmenting cells

Deformable Models Significant challenge: - Handling variation in appearance within object classes - Non-rigid objects, generic categories, etc. Deformable models approach: - Consider each object as a deformed version of a template - Compact representation - Leads to interesting modeling and algorithmic problems

Overview Part I: Pictorial Structures - Deformable part models - Highly efficient matching algorithms Part II: Deformable Shapes - Triangulated polygons - Hierarchical models Part III: The PASCAL Challenge - Recognizing 20 object categories in realistic scenes - Discriminatively trained, multiscale, deformable part models

Part I: Pictorial Structures Introduced by Fischler and Elschlager in 1973 Part-based models: - Each part represents local visual properties - Springs capture spatial relationships Matching model to image involves joint optimization of part locations stretch and fit

Local Evidence + Global Decision Parts have a match quality at each image location Local evidence is noisy - Parts are detected in the context of the whole model part test image match quality

Matching Problem Model is represented by a graph G = (V, E) - V = {v1,...,vn} are the parts - (vi,vj) E indicates a connection between parts mi(li) is a cost for placing part i at location li dij(li,lj) is a deformation cost Optimal configuration for the object is L = (l1,...,ln) minimizing n E(L) = mi(li) + dij(li,lj) i=1 (vi,vj) E

Matching Problem n E(L) = mi(li) + dij(li,lj) i=1 (vi,vj) E Assume n parts, k possible locations for each part - There are k n configurations L If graph is a tree we can use dynamic programming - O(nk 2 ) algorithm If dij(li,lj) = g(li-lj) we can use min-convolutions - O(nk) algorithm - As fast as matching each part separately!

Human Pose Estimation

Human Tracking Ramanan, Forsyth, Zisserman, Tracking People by Learning their Appearance IEEE Pattern Analysis and Machine Intelligence (PAMI). Jan 2007

Part III: PASCAL Challenge ~10,000 images, with ~25,000 target objects - Objects from 20 categories (person, car, bicycle, cow, table...) - Objects are annotated with labeled bounding boxes

Model Overview detection root filter part filters deformation models Model has a root filter plus deformable parts

Histogram of Gradient (HOG) Features Image is partitioned into 8x8 pixel blocks In each block we compute a histogram of gradient orientations - Invariant to changes in lighting, small deformations, etc. We compute features at different resolutions (pyramid)

Filters Filters are rectangular templates defining weights for features Score is dot product of filter and subwindow of HOG pyramid W H Score of H at this location is H W HOG pyramid

Object Hypothesis Score is sum of filter scores plus deformation scores Image pyramid HOG feature pyramid Multiscale model captures features at two-resolutions

Training Training data consists of images with labeled bounding boxes Need to learn the model structure, filters and deformation costs Training

Learned Models Bicycle Sofa Car Bottle

Example Results

More Results

Overall Results 9 systems competed in the 2007 challenge Out of 20 classes we get: - First place in 10 classes - Second place in 6 classes Some statistics: - It takes ~2 seconds to evaluate a model in one image - It takes ~3 hours to train a model - MUCH faster than most systems

Summary Deformable models provide an elegant framework for object detection and recognition - Efficient algorithms for matching models to images - Applications: pose estimation, medical image analysis, object recognition, etc. We can learn models from partially labeled data - Generalized standard ideas from machine learning - Leads to state-of-the-art results in PASCAL challenge Future work: hierarchical models, grammars, 3D objects