High Level Computer Vision

Size: px
Start display at page:

Download "High Level Computer Vision"

Transcription

1 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de

2 Please Note No lecture next week (June 25) Next lecture July 2nd 2

3 Class of Object Models: Part-Based Models / Pictorial Structures Pictorial Structures [Fischler & Elschlager 1973] Model has two components - parts (2D image fragments) - structure (configuration of parts) 3

4 State-of-the-Art in Object Class Representations Bag of Words Models (BoW) object model = histogram of local features e.g. local feature around interest points Global Object Models object model = global feature object feature e.g. HOG (Histogram of Oriented Gradients) Part-Based Object Models object model = models of parts & spatial topology model e.g. constellation model or ISM (Implicit Shape Model) But: What is the Ideal Notion of Parts here? And: Should those Parts be Semantic? BoW: no spatial relationships e.g. HOG: fixed spatial relationships e.g. ISM: flexible spatial relationships 4

5 Constellation of Parts Fully connected shape model x 1 x 6 x 2 x 5 x 3 x 4 Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 5

6 Constellation Model Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 Joint model for appearance and structure (=shape) X: positions, A: part appearance, S: scale h: Hypothesis = assignment of features (in the image) to parts (of the model) Gaussian part appearance pdf Gaussian shape pdf Gaussian relative scale pdf Prob. of detection Log(scale) 6

7 Object Detection: ISM (Implicit Shape Model) Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] x o 7

8 Spatial Models for Categorization Fully connected shape model Star shape model x 1 x 1 x 6 x 2 x 6 x 2 x 5 x 3 x 5 x 3 x 4 x 4 e.g. Constellation Model Parts fully connected Recognition complexity: O(N P ) Method: Exhaustive search e.g. ISM (Implicit Shape Model) Parts mutually independent Recognition complexity: O(NP) Method: Generalized Hough Transform 8

9 Implicit Shape Model: What are Good Parts? Parts of the Implicit Shape Model parts = feature clusters lots of parts (in the order of codebook entries) the more the better parts are mostly non-semantic parts = (mostly non semantic) feature clusters also true for bag of words models constellation model (much fewer parts - but still feature clusters)... 9

10 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 10

11 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 11

12 People Detection: partism Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] x o 12

13 People Detection: partism Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] Part decomposition and inference: Pictorial structures model [Felzenszwalb & Huttenlocher, IJCV 2005] x 8 x 7 x o x 3 x 6 p(l D) p(d L)p(L) x 2 x 1 x 5 x 4 Body-part positions Image evidence 13

14 Can we make pictorial structures model effective for these tasks? [Fischler&Elschlager, 1973] 14

15 Can we make pictorial structures model effective for these tasks? Yes... if the model components are chosen right. 15

16 Can we make pictorial structures model effective for these tasks? So... what are the right components? 16

17 Pictorial Structures Model Body-part positions Two Components p(l D) = p(d L)p(L) p(d) Image evidence Prior (capturing possible part configurations): Likelihood of Parts (capturing part appearance): p(d L)p(L) p(l) p(d L) 17

18 Pictorial Structures: Model Components Body is represented as flexible configuration of body parts posterior over body poses p(l D) likelihood of observations p(d L)p(L) prior on body poses orientation K likelihood of part N estimated pose part posteriors 20 Local Features.... AdaBoost gure 2. (left) Kine

19 Kinematic Tree Prior (modeling the structure) l 9 l 7 l 5 l 6 l 8 l 10 Represent pairwise part relations [Felzenszwalb & Huttenlocher, IJCV 05] l 2 l 1 l 3 l 4 l 2 50 p(l) =p(l 0 ) 40 p(l i l j ), l (i,j) E p(l 2 l 1 )=N(T 12 (l 2 ) T 21 (l 1 ), ) part locations relative to the joint transformed part locations l 2 l

20 Kinematic Tree Prior Prior parameters: {T ij, ij } Parameters of the prior are estimated with maximum likelihood mean pose several independent samples gure Kinematic prior learned on the multi-view a 20

21 Pictorial Structures: Model Components Body is represented as flexible configuration of body parts posterior over body poses p(l D) likelihood of observations p(d L)p(L) prior on body poses orientation K likelihood of part N estimated pose part posteriors 20 Local Features.... AdaBoost gure 2. (left) Kine

22 Likelihood Model Assumption: evidence (image features) for each part independent of all other parts: p(d L) = assumption clearly not correct, but allows efficient computation NY p(d i l i ) i=0 works rather well in practice training data for different body parts should cover all appearances 22

23 Likelihood Model Build on recent advances in object detection: state-of-the-art image descriptor: Shape Context [Belongie et al., PAMI 02; Mikolajczyk&Schmid, PAMI 05] dense representation discriminative model: AdaBoost classifier for each body part - Shape Context: 96 dimensions (4 angular, 3 radial, 8 gradient orientations) - Feature Vector: concatenate the descriptors inside part bounding box - head: 4032 dimensions - torso: 8448 dimensions 23

24 AdaBoost (Freund & Shapire 95) f ( x) tht ( x) t Initial uniform weight on training examples weak classifier 1 errort t 0.5log 1 errort w e i yi tht ( xi ) i t 1 w t i yi tht ( xi ) wt 1e i Incorrect classifications re-weighted more heavily weak classifier 2 weak classifier 3 Final classifier is weighted combination of weak classifiers Viola and Jones, Robust object detection using a boosted cascade of simple features, CVPR 2001

25 Likelihood Model Part likelihood derived from the boosting score: decision stump weight p(d i l i ) = max t i,th t (x(l i )) t decision stump output i,t, 0 part location small constant to deal with part occlusions 25

26 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 26

27 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 27

28 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 28

29 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 29

30 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 30

31 Part-Based Model: 2D Human Pose Estimation 31

32 Scenario 1: Quantitative Results Method Torso Upper legs Lower legs Upper arm Forearm Head Total [Ramanan, NIPS 06] 2nd parse Our inference, edge features from [Ramanan, NIPS 06] Our part detectors (SC) Our prior, our part detectors (SC) Our prior, our part detectors (SIFT)

33 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 33

34 Estimated upper-body poses 34

35 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model generic model - prior and appearance learned on the People dataset

36 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model [Ferrari et al. CVPR 09] generic model - prior and appearance learned on the People dataset

37 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model [Ferrari et al. CVPR 09] full model, Buffy pose prior specialized upper body prior - appearance learned on the People dataset

38 Typical Failure Cases Foreshortening Part occlusion Detections on other body parts 38

39 Pictorial Structures for Human Pose Estimation: What are Good Parts? Parts of the Pictorial Structures Model parts = semantic body parts pose estimation = estimation of body part configuration semantic body parts allow to use motion capture data, etc. to improve kinematic tree prior non-semantic parts (e.g. in the ISM-model) are more difficult to generalize across human body poses 39

40 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 40

41 Standard Models seen as Graphical Models Monolithic (global) HOG [Dalal & Triggs, 2005] Part-based (local) Implicit Shape Model [Leibe&Schiele, 2003] also: Spatial Pyramid Kernel [Lazebnik et al., 2006] Pictorial Structures [Felzenszwalb et al., 2005] 41

42 Hierarchical Graphical Model Generalization of Standard Models New hierarchical structure Short and longer range dependencies Joint training of local and global representations - Cyclic graph structure - Bottom-up top-down propagation 42

43 Hierarchical CRF Latent Model Goal: discover meaningful parts Labels of nodes not known Now: part labels Allow for ambiguities of part assignments Combine with structure learning Keep hierarchical representation 43

44 Experiments on PASCAL VOC 2007: Hierarchical CRF Latent Model Consistently outperforms DPM [Felzenszwalb et al., 2008] MKL [Vedaldi et al., 2009] better due to more features 44

45 Hierarchical Latent CRF-Model: What are Good Parts? Parts of the Latent CRF Model: parts have three main properties - parts = hierarchical appearance abstractions - parts = local appearance shared across objects - parts = discriminat local appearance of the objects parts enable effective learning - by finding correspondences between discriminative local object appearance semantics of parts is a secondary effect and not necessary [Schnitzspan,Roth,Schiele@cvpr10] similar observations hold for DPM [Felzenszwalb et al@cvpr08] 45

46 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 46

47 Back to the Future: Learning Shape Models from 3D CAD Data 3D Computer Aided Design (CAD) Models Computer graphics, game design Polygonal meshes + texture descriptions semantic part annotations (may) exist [Stark,Goesele,Schiele@bmvc10] Can we learn Object Class Models directly from 3D CAD data? Issue: Transition between 3D CAD models and 2D real-world images 47

48 Shape-based Appearance Abstraction Non-Photorealistic Rendering Learn shape models from rendered images we do NOT render photo-realistically / texture But focus on 3D CAD model edges (mimic real-world image edges) Part boundaries Mesh creases Silhouette } Final edges (hidden edges removed) [Stark,Goesele,Schiele@bmvc10] 48

49 Shape - Local Shape + Global Geometry [Stark,Goesele,Schiele@bmvc10] Part-based object class representation Semantic parts from 3D CAD models: left front wheel, left front door, etc. View-points Left Front left Front Parts Local shape Global geometry 49

50 Constellation Model Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 Joint model for appearance and structure (=shape) X: positions, A: part appearance, S: scale h: Hypothesis = assignment of features (in the image) to parts (of the model) Gaussian part appearance pdf Gaussian shape pdf Gaussian relative scale pdf Prob. of detection Log(scale) 50

51 Experimental Evaluation - Test Data Set [Stark,Goesele,Schiele@bmvc10] 3D Object Classes Cars [Savarese and Fei-Fei ICCV 07] 8 azimuth angles... back back-left left front-left 2 elevation angles low high 5 cars 3 distances 240 images near medium far 51

52 Qualitative Results Three strongest true positive detections per viewpoint model Right Back-right Observations Accurate part localization Predicted viewpoints do match 52

53 Quantitative Results - Multi-View Detection [Stark,Goesele,Schiele@bmvc10] Evaluation protocol Precision / recall PASCAL overlap criterion Observations 4.3% AP improvement ( ) [Liebelt et al. CVPR 10] ( ) We can further improve performance by 8.9% AP ( ) recall bank of 8 (AP = 81.0) 10 degree intervals (AP = 89.9) bank of 8 (AP = 81.0) 10 degree intervals (AP = 89.9) Su et al. ICCV09 (AP = 55.3) Su et al. ICCV09 (AP = 55.3) Gill et al. DAGM09 (AP = 72.6) Gill et al. DAGM09 (AP = 72.6) Liebelt Liebelt et al. et CVPR10 al. (AP = 76.7) = 76.7) precision 53

54 Shape Model learned from 3D CAD-Data What are Good Parts? Parts of the Shape Model: parts = just means to enable correspondence across 3D-models semantics of parts: - in our case: yes - because of the employed 3D models - but: semantics neither necessary nor important Left Front left Front

55 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 55

56 What are Ideal Parts for Part-Based Object Models? Parts can/may be semantic body parts - e.g. for articulated human body pose estimation be feature clusters (typically many clusters) (e,g, ISM, constellation model, BoW) support learnability of discriminant appearance enable correspondence across 3D models... in all those cases: the most important property is that parts facilitate correspondence across object instances 56

57 What are Ideal Parts for Part-Based Object Models? Multiple motivations for part based models exist: intuitiveness: semantic meaning of parts/attributes is attractive (e.g. enables use of language sources) learnability: sharing of parts/attributes across instances/classes scalability: transferability of parts/attributes across classes... in general, parts support learnability and scalability when they facilitate correspondence across object instances across object classes across modalities (e.g. from language to visual appearance) and semantics is only a secondary concern (for intuitiveness ) 57

58 What are Ideal Parts for Part-Based Object Models? the Good, the Bad, and the Ugly thanks to: Micha Andriluka, Bastian Leibe, Sandra Ebert, Mario Fritz, Diane Larlus, Marcus Rohrbach, Paul Schnitzspan, Stefan Roth, Michael Stark, Michael Goesele

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

Part-Based Models for Object Class Recognition Part 3

Part-Based Models for Object Class Recognition Part 3 High Level Computer Vision! Part-Based Models for Object Class Recognition Part 3 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de! http://www.d2.mpi-inf.mpg.de/cv ! State-of-the-Art

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

Part based models for recognition. Kristen Grauman

Part based models for recognition. Kristen Grauman Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

Object Category Detection. Slides mostly from Derek Hoiem

Object Category Detection. Slides mostly from Derek Hoiem Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models

More information

Automatic Discovery of Meaningful Object Parts with Latent CRFs

Automatic Discovery of Meaningful Object Parts with Latent CRFs Automatic Discovery of Meaningful Object Parts with Latent CRFs Paul Schnitzspan 1 Stefan Roth 1 Bernt Schiele 1,2 1 Department of Computer Science, TU Darmstadt 2 MPI Informatics, Saarbrücken Abstract

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

Lecture 16: Object recognition: Part-based generative models

Lecture 16: Object recognition: Part-based generative models Lecture 16: Object recognition: Part-based generative models Professor Stanford Vision Lab 1 What we will learn today? Introduction Constellation model Weakly supervised training One-shot learning (Problem

More information

In Good Shape: Robust People Detection based on Appearance and Shape

In Good Shape: Robust People Detection based on Appearance and Shape PISHCHULIN et al.: PEOPLE DETECTION USING APPEARANCE AND SHAPE 1 In Good Shape: Robust People Detection based on Appearance and Shape Leonid Pishchulin leonid@mpi-inf.mpg.de Arjun Jain ajain@mpi-inf.mpg.de

More information

Combining PGMs and Discriminative Models for Upper Body Pose Detection

Combining PGMs and Discriminative Models for Upper Body Pose Detection Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative

More information

Part-based models. Lecture 10

Part-based models. Lecture 10 Part-based models Lecture 10 Overview Representation Location Appearance Generative interpretation Learning Distance transforms Other approaches using parts Felzenszwalb, Girshick, McAllester, Ramanan

More information

Detection III: Analyzing and Debugging Detection Methods

Detection III: Analyzing and Debugging Detection Methods CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can

More information

Object Detection by 3D Aspectlets and Occlusion Reasoning

Object Detection by 3D Aspectlets and Occlusion Reasoning Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Action recognition in videos

Action recognition in videos Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit

More information

Local Features based Object Categories and Object Instances Recognition

Local Features based Object Categories and Object Instances Recognition Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March, 2008 1 Thesis in Computer Vision Computer vision is the science and technology of

More information

Modern Object Detection. Most slides from Ali Farhadi

Modern Object Detection. Most slides from Ali Farhadi Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij

More information

Revisiting 3D Geometric Models for Accurate Object Shape and Pose

Revisiting 3D Geometric Models for Accurate Object Shape and Pose Revisiting 3D Geometric Models for Accurate Object Shape and Pose M. 1 Michael Stark 2,3 Bernt Schiele 3 Konrad Schindler 1 1 Photogrammetry and Remote Sensing Laboratory Swiss Federal Institute of Technology

More information

Beyond Bags of features Spatial information & Shape models

Beyond Bags of features Spatial information & Shape models Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features

More information

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Visuelle Perzeption für Mensch- Maschine Schnittstellen Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de

More information

People-Tracking-by-Detection and People-Detection-by-Tracking

People-Tracking-by-Detection and People-Detection-by-Tracking People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth Bernt Schiele Computer Science Department TU Darmstadt, Germany {andriluka, sroth, schiele}@cs.tu-darmstadt.de

More information

Detecting and Segmenting Humans in Crowded Scenes

Detecting and Segmenting Humans in Crowded Scenes Detecting and Segmenting Humans in Crowded Scenes Mikel D. Rodriguez University of Central Florida 4000 Central Florida Blvd Orlando, Florida, 32816 mikel@cs.ucf.edu Mubarak Shah University of Central

More information

Category-level Localization

Category-level Localization Category-level Localization Andrew Zisserman Visual Geometry Group University of Oxford http://www.robots.ox.ac.uk/~vgg Includes slides from: Ondra Chum, Alyosha Efros, Mark Everingham, Pedro Felzenszwalb,

More information

Development in Object Detection. Junyuan Lin May 4th

Development in Object Detection. Junyuan Lin May 4th Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,

More information

Recap Image Classification with Bags of Local Features

Recap Image Classification with Bags of Local Features Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval

More information

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Visuelle Perzeption für Mensch- Maschine Schnittstellen Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de

More information

Object Recognition with Deformable Models

Object Recognition with Deformable Models Object Recognition with Deformable Models Pedro F. Felzenszwalb Department of Computer Science University of Chicago Joint work with: Dan Huttenlocher, Joshua Schwartz, David McAllester, Deva Ramanan.

More information

Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation

Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation JOHNSON, EVERINGHAM: CLUSTERED MODELS FOR HUMAN POSE ESTIMATION 1 Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation Sam Johnson s.a.johnson04@leeds.ac.uk Mark Everingham m.everingham@leeds.ac.uk

More information

High Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG)

High Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) High Level Computer Vision Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv

More information

Multiple-Person Tracking by Detection

Multiple-Person Tracking by Detection http://excel.fit.vutbr.cz Multiple-Person Tracking by Detection Jakub Vojvoda* Abstract Detection and tracking of multiple person is challenging problem mainly due to complexity of scene and large intra-class

More information

Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections

Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections Nima Razavi 1, Juergen Gall 1, and Luc Van Gool 1,2 1 Computer Vision Laboratory, ETH Zurich 2 ESAT-PSI/IBBT,

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Object Category Detection: Sliding Windows

Object Category Detection: Sliding Windows 04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical

More information

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS

More information

Category vs. instance recognition

Category vs. instance recognition Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building

More information

Combining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation

Combining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation Combining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation Sam Johnson and Mark Everingham School of Computing University of Leeds {mat4saj m.everingham}@leeds.ac.uk

More information

Computer Vision Lecture 16

Computer Vision Lecture 16 Computer Vision Lecture 16 Deep Learning for Object Categorization 14.01.2016 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period

More information

Course Administration

Course Administration Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics

More information

Selective Search for Object Recognition

Selective Search for Object Recognition Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where

More information

A novel template matching method for human detection

A novel template matching method for human detection University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 A novel template matching method for human detection Duc Thanh Nguyen

More information

Lecture 15: Detecting Objects by Parts

Lecture 15: Detecting Objects by Parts Lecture 15: Detecting Objects by Parts David R. Morales, Austin O. Narcomey, Minh-An Quinn, Guilherme Reis, Omar Solis Department of Computer Science Stanford University Stanford, CA 94305 {mrlsdvd, aon2,

More information

Window based detectors

Window based detectors Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection

More information

Object detection using non-redundant local Binary Patterns

Object detection using non-redundant local Binary Patterns University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh

More information

TA Section: Problem Set 4

TA Section: Problem Set 4 TA Section: Problem Set 4 Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

Lecture 15 Visual recognition

Lecture 15 Visual recognition Lecture 15 Visual recognition 3D object detection Introduction Single instance 3D object detectors Generic 3D object detectors Silvio Savarese Lecture 15-26-Feb-14 3D object detection Object: Building

More information

Strong Appearance and Expressive Spatial Models for Human Pose Estimation

Strong Appearance and Expressive Spatial Models for Human Pose Estimation 2013 IEEE International Conference on Computer Vision Strong Appearance and Expressive Spatial Models for Human Pose Estimation Leonid Pishchulin 1 Mykhaylo Andriluka 1 Peter Gehler 2 Bernt Schiele 1 1

More information

Multiple Pose Context Trees for estimating Human Pose in Object Context

Multiple Pose Context Trees for estimating Human Pose in Object Context Multiple Pose Context Trees for estimating Human Pose in Object Context Vivek Kumar Singh Furqan Muhammad Khan Ram Nevatia University of Southern California Los Angeles, CA {viveksin furqankh nevatia}@usc.edu

More information

Occlusion Patterns for Object Class Detection

Occlusion Patterns for Object Class Detection Occlusion Patterns for Object Class Detection Bojan Pepik1 Michael Stark1,2 Peter Gehler3 Bernt Schiele1 Max Planck Institute for Informatics, 2Stanford University, 3Max Planck Institute for Intelligent

More information

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets

More information

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models

Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)

More information

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model

Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model Johnson Hsieh (johnsonhsieh@gmail.com), Alexander Chia (alexchia@stanford.edu) Abstract -- Object occlusion presents a major

More information

Revisiting 3D Geometric Models for Accurate Object Shape and Pose

Revisiting 3D Geometric Models for Accurate Object Shape and Pose Revisiting 3D Geometric Models for Accurate Object Shape and Pose M. Zeeshan Zia 1, Michael Stark 2, Bernt Schiele 2, and Konrad Schindler 1 1 Photogrammetry and Remote Sensing Laboratory, ETH Zürich,

More information

Fuzzy based Multiple Dictionary Bag of Words for Image Classification

Fuzzy based Multiple Dictionary Bag of Words for Image Classification Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image

More information

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Problem Definition Viewpoint estimation: Given an image, predicting viewpoint for object of

More information

Segmentation. Bottom up Segmentation Semantic Segmentation

Segmentation. Bottom up Segmentation Semantic Segmentation Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Human detection using local shape and nonredundant

Human detection using local shape and nonredundant University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Human detection using local shape and nonredundant binary patterns

More information

Find that! Visual Object Detection Primer

Find that! Visual Object Detection Primer Find that! Visual Object Detection Primer SkTech/MIT Innovation Workshop August 16, 2012 Dr. Tomasz Malisiewicz tomasz@csail.mit.edu Find that! Your Goals...imagine one such system that drives information

More information

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin Object detection Wed Feb 24 Kristen Grauman UT Austin Announcements Reminder: Assignment 2 is due Mar 9 and Mar 10 Be ready to run your code again on a new test set on Mar 10 Vision talk next Tuesday 11

More information

A Comparison on Different 2D Human Pose Estimation Method

A Comparison on Different 2D Human Pose Estimation Method A Comparison on Different 2D Human Pose Estimation Method Santhanu P.Mohan 1, Athira Sugathan 2, Sumesh Sekharan 3 1 Assistant Professor, IT Department, VJCET, Kerala, India 2 Student, CVIP, ASE, Tamilnadu,

More information

Repositorio Institucional de la Universidad Autónoma de Madrid.

Repositorio Institucional de la Universidad Autónoma de Madrid. Repositorio Institucional de la Universidad Autónoma de Madrid https://repositorio.uam.es Esta es la versión de autor de la comunicación de congreso publicada en: This is an author produced version of

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering Department of Computer Science 3129D,

More information

Multi-View Object Class Detection with a 3D Geometric Model

Multi-View Object Class Detection with a 3D Geometric Model Multi-View Object Class Detection with a 3D Geometric Model Jörg Liebelt, Cordelia Schmid To cite this version: Jörg Liebelt, Cordelia Schmid. Multi-View Object Class Detection with a 3D Geometric Model.

More information

https://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola

More information

Occlusion Patterns for Object Class Detection

Occlusion Patterns for Object Class Detection 23 IEEE Conference on Computer Vision and Pattern Recognition Occlusion Patterns for Object Class Detection Bojan Pepik Michael Stark,2 Peter Gehler 3 Bernt Schiele Max Planck Institute for Informatics,

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering Lab

More information

People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features

People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features M. Siala 1, N. Khlifa 1, F. Bremond 2, K. Hamrouni 1 1. Research Unit in Signal Processing, Image Processing

More information

Feature Selection using Constellation Models

Feature Selection using Constellation Models Feature Selection using Constellation Models Heydar Maboudi Afkham, Carl Herik Ek and Stefan Calsson Computer Vision and Active Perception Lab. Royal Institute of Technology (KTH) Stockholm, Sweden {heydarma,chek,stefanc}@csc.kth.se

More information

Poselet Conditioned Pictorial Structures

Poselet Conditioned Pictorial Structures Poselet Conditioned Pictorial Structures Leonid Pishchulin 1 Mykhaylo Andriluka 1 Peter Gehler 2 Bernt Schiele 1 1 Max Planck Institute for Informatics, Saarbrücken, Germany 2 Max Planck Institute for

More information

Histograms of Oriented Gradients for Human Detection p. 1/1

Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,

More information

PEDESTRIAN DETECTION IN CROWDED SCENES VIA SCALE AND OCCLUSION ANALYSIS

PEDESTRIAN DETECTION IN CROWDED SCENES VIA SCALE AND OCCLUSION ANALYSIS PEDESTRIAN DETECTION IN CROWDED SCENES VIA SCALE AND OCCLUSION ANALYSIS Lu Wang Lisheng Xu Ming-Hsuan Yang Northeastern University, China University of California at Merced, USA ABSTRACT Despite significant

More information

Monocular 3D Pose Estimation and Tracking by Detection

Monocular 3D Pose Estimation and Tracking by Detection Monocular 3D Pose Estimation and Tracking by Detection Mykhaylo Andriluka1,2 Stefan Roth1 Bernt Schiele1,2 1 2 Department of Computer Science, TU Darmstadt MPI Informatics, Saarbru cken Abstract Automatic

More information

Lecture 18: Human Motion Recognition

Lecture 18: Human Motion Recognition Lecture 18: Human Motion Recognition Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Introduction Motion classification using template matching Motion classification i using spatio

More information

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Bangpeng Yao 1, Juan Carlos Niebles 2,3, Li Fei-Fei 1 1 Department of Computer Science, Princeton University, NJ 08540, USA

More information

Computer Vision. Exercise Session 10 Image Categorization

Computer Vision. Exercise Session 10 Image Categorization Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category

More information

Test-time Adaptation for 3D Human Pose Estimation

Test-time Adaptation for 3D Human Pose Estimation Test-time Adaptation for 3D Human Pose Estimation Sikandar Amin,2, Philipp Müller 2, Andreas Bulling 2, and Mykhaylo Andriluka 2,3 Technische Universität München, Germany 2 Max Planck Institute for Informatics,

More information

Histogram of Oriented Gradients (HOG) for Object Detection

Histogram of Oriented Gradients (HOG) for Object Detection Histogram of Oriented Gradients (HOG) for Object Detection Navneet DALAL Joint work with Bill TRIGGS and Cordelia SCHMID Goal & Challenges Goal: Detect and localise people in images and videos n Wide variety

More information

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human

More information

Object Recognition and Detection

Object Recognition and Detection CS 2770: Computer Vision Object Recognition and Detection Prof. Adriana Kovashka University of Pittsburgh March 16, 21, 23, 2017 Plan for the next few lectures Recognizing the category in the image as

More information

Multiple Kernel Learning for Emotion Recognition in the Wild

Multiple Kernel Learning for Emotion Recognition in the Wild Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,

More information

Fusing shape and appearance information for object category detection

Fusing shape and appearance information for object category detection 1 Fusing shape and appearance information for object category detection Andreas Opelt, Axel Pinz Graz University of Technology, Austria Andrew Zisserman Dept. of Engineering Science, University of Oxford,

More information

Object Recognition II

Object Recognition II Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based

More information

CS 231A Computer Vision (Fall 2011) Problem Set 4

CS 231A Computer Vision (Fall 2011) Problem Set 4 CS 231A Computer Vision (Fall 2011) Problem Set 4 Due: Nov. 30 th, 2011 (9:30am) 1 Part-based models for Object Recognition (50 points) One approach to object recognition is to use a deformable part-based

More information

CS6670: Computer Vision

CS6670: Computer Vision CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:

More information

Spatial Localization and Detection. Lecture 8-1

Spatial Localization and Detection. Lecture 8-1 Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday

More information

Object recognition. Methods for classification and image representation

Object recognition. Methods for classification and image representation Object recognition Methods for classification and image representation Credits Slides by Pete Barnum Slides by FeiFei Li Paul Viola, Michael Jones, Robust Realtime Object Detection, IJCV 04 Navneet Dalal

More information

Segmentation as Selective Search for Object Recognition in ILSVRC2011

Segmentation as Selective Search for Object Recognition in ILSVRC2011 Segmentation as Selective Search for Object Recognition in ILSVRC2011 Koen van de Sande Jasper Uijlings Arnold Smeulders Theo Gevers Nicu Sebe Cees Snoek University of Amsterdam, University of Trento ILSVRC2011

More information

Deformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation

Deformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation Deformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation Roberto J. Lo pez-sastre Tinne Tuytelaars2 Silvio Savarese3 GRAM, Dept. Signal Theory and Communications,

More information

Local cues and global constraints in image understanding

Local cues and global constraints in image understanding Local cues and global constraints in image understanding Olga Barinova Lomonosov Moscow State University *Many slides adopted from the courses of Anton Konushin Image understanding «To see means to know

More information

Object Detection Design challenges

Object Detection Design challenges Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should

More information

Modeling 3D viewpoint for part-based object recognition of rigid objects

Modeling 3D viewpoint for part-based object recognition of rigid objects Modeling 3D viewpoint for part-based object recognition of rigid objects Joshua Schwartz Department of Computer Science Cornell University jdvs@cs.cornell.edu Abstract Part-based object models based on

More information

A Hierarchical Compositional System for Rapid Object Detection

A Hierarchical Compositional System for Rapid Object Detection A Hierarchical Compositional System for Rapid Object Detection Long Zhu and Alan Yuille Department of Statistics University of California at Los Angeles Los Angeles, CA 90095 {lzhu,yuille}@stat.ucla.edu

More information

Methods for Representing and Recognizing 3D objects

Methods for Representing and Recognizing 3D objects Methods for Representing and Recognizing 3D objects part 1 Silvio Savarese University of Michigan at Ann Arbor Object: Building, 45º pose, 8-10 meters away Object: Person, back; 1-2 meters away Object:

More information

High-Level Fusion of Depth and Intensity for Pedestrian Classification

High-Level Fusion of Depth and Intensity for Pedestrian Classification High-Level Fusion of Depth and Intensity for Pedestrian Classification Marcus Rohrbach 1,3, Markus Enzweiler 2 and Dariu M. Gavrila 1,4 1 Environment Perception, Group Research, Daimler AG, Ulm, Germany

More information

Boosted Local Structured HOG-LBP for Object Localization

Boosted Local Structured HOG-LBP for Object Localization Boosted Local Structured HOG-LBP for Object Localization Junge Zhang, Kaiqi Huang, Yinan Yu and Tieniu Tan National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences

More information