High Level Computer Vision
|
|
- Louise Wilcox
- 5 years ago
- Views:
Transcription
1 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de
2 Please Note No lecture next week (June 25) Next lecture July 2nd 2
3 Class of Object Models: Part-Based Models / Pictorial Structures Pictorial Structures [Fischler & Elschlager 1973] Model has two components - parts (2D image fragments) - structure (configuration of parts) 3
4 State-of-the-Art in Object Class Representations Bag of Words Models (BoW) object model = histogram of local features e.g. local feature around interest points Global Object Models object model = global feature object feature e.g. HOG (Histogram of Oriented Gradients) Part-Based Object Models object model = models of parts & spatial topology model e.g. constellation model or ISM (Implicit Shape Model) But: What is the Ideal Notion of Parts here? And: Should those Parts be Semantic? BoW: no spatial relationships e.g. HOG: fixed spatial relationships e.g. ISM: flexible spatial relationships 4
5 Constellation of Parts Fully connected shape model x 1 x 6 x 2 x 5 x 3 x 4 Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 5
6 Constellation Model Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 Joint model for appearance and structure (=shape) X: positions, A: part appearance, S: scale h: Hypothesis = assignment of features (in the image) to parts (of the model) Gaussian part appearance pdf Gaussian shape pdf Gaussian relative scale pdf Prob. of detection Log(scale) 6
7 Object Detection: ISM (Implicit Shape Model) Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] x o 7
8 Spatial Models for Categorization Fully connected shape model Star shape model x 1 x 1 x 6 x 2 x 6 x 2 x 5 x 3 x 5 x 3 x 4 x 4 e.g. Constellation Model Parts fully connected Recognition complexity: O(N P ) Method: Exhaustive search e.g. ISM (Implicit Shape Model) Parts mutually independent Recognition complexity: O(NP) Method: Generalized Hough Transform 8
9 Implicit Shape Model: What are Good Parts? Parts of the Implicit Shape Model parts = feature clusters lots of parts (in the order of codebook entries) the more the better parts are mostly non-semantic parts = (mostly non semantic) feature clusters also true for bag of words models constellation model (much fewer parts - but still feature clusters)... 9
10 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 10
11 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 11
12 People Detection: partism Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] x o 12
13 People Detection: partism Appearance of parts: Implicit Shape Model (ISM) [Leibe, Seemann & Schiele, CVPR 2005] Part decomposition and inference: Pictorial structures model [Felzenszwalb & Huttenlocher, IJCV 2005] x 8 x 7 x o x 3 x 6 p(l D) p(d L)p(L) x 2 x 1 x 5 x 4 Body-part positions Image evidence 13
14 Can we make pictorial structures model effective for these tasks? [Fischler&Elschlager, 1973] 14
15 Can we make pictorial structures model effective for these tasks? Yes... if the model components are chosen right. 15
16 Can we make pictorial structures model effective for these tasks? So... what are the right components? 16
17 Pictorial Structures Model Body-part positions Two Components p(l D) = p(d L)p(L) p(d) Image evidence Prior (capturing possible part configurations): Likelihood of Parts (capturing part appearance): p(d L)p(L) p(l) p(d L) 17
18 Pictorial Structures: Model Components Body is represented as flexible configuration of body parts posterior over body poses p(l D) likelihood of observations p(d L)p(L) prior on body poses orientation K likelihood of part N estimated pose part posteriors 20 Local Features.... AdaBoost gure 2. (left) Kine
19 Kinematic Tree Prior (modeling the structure) l 9 l 7 l 5 l 6 l 8 l 10 Represent pairwise part relations [Felzenszwalb & Huttenlocher, IJCV 05] l 2 l 1 l 3 l 4 l 2 50 p(l) =p(l 0 ) 40 p(l i l j ), l (i,j) E p(l 2 l 1 )=N(T 12 (l 2 ) T 21 (l 1 ), ) part locations relative to the joint transformed part locations l 2 l
20 Kinematic Tree Prior Prior parameters: {T ij, ij } Parameters of the prior are estimated with maximum likelihood mean pose several independent samples gure Kinematic prior learned on the multi-view a 20
21 Pictorial Structures: Model Components Body is represented as flexible configuration of body parts posterior over body poses p(l D) likelihood of observations p(d L)p(L) prior on body poses orientation K likelihood of part N estimated pose part posteriors 20 Local Features.... AdaBoost gure 2. (left) Kine
22 Likelihood Model Assumption: evidence (image features) for each part independent of all other parts: p(d L) = assumption clearly not correct, but allows efficient computation NY p(d i l i ) i=0 works rather well in practice training data for different body parts should cover all appearances 22
23 Likelihood Model Build on recent advances in object detection: state-of-the-art image descriptor: Shape Context [Belongie et al., PAMI 02; Mikolajczyk&Schmid, PAMI 05] dense representation discriminative model: AdaBoost classifier for each body part - Shape Context: 96 dimensions (4 angular, 3 radial, 8 gradient orientations) - Feature Vector: concatenate the descriptors inside part bounding box - head: 4032 dimensions - torso: 8448 dimensions 23
24 AdaBoost (Freund & Shapire 95) f ( x) tht ( x) t Initial uniform weight on training examples weak classifier 1 errort t 0.5log 1 errort w e i yi tht ( xi ) i t 1 w t i yi tht ( xi ) wt 1e i Incorrect classifications re-weighted more heavily weak classifier 2 weak classifier 3 Final classifier is weighted combination of weak classifiers Viola and Jones, Robust object detection using a boosted cascade of simple features, CVPR 2001
25 Likelihood Model Part likelihood derived from the boosting score: decision stump weight p(d i l i ) = max t i,th t (x(l i )) t decision stump output i,t, 0 part location small constant to deal with part occlusions 25
26 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 26
27 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 27
28 Likelihood Model Input image Head Torso Upper leg Our part likelihoods.... [Ramanan, NIPS 06] 28
29 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 29
30 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 30
31 Part-Based Model: 2D Human Pose Estimation 31
32 Scenario 1: Quantitative Results Method Torso Upper legs Lower legs Upper arm Forearm Head Total [Ramanan, NIPS 06] 2nd parse Our inference, edge features from [Ramanan, NIPS 06] Our part detectors (SC) Our prior, our part detectors (SC) Our prior, our part detectors (SIFT)
33 Evaluation Scenarios 1. Human Pose Estimation People dataset [Ramanan, NIPS 06] 2. Upper-body Pose Estimation Buffy dataset [Ferrari et al., CVPR 08] 3. Pedestrian Detection TUD Pedestrians dataset [Andriluka et al., CVPR 08] 33
34 Estimated upper-body poses 34
35 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model generic model - prior and appearance learned on the People dataset
36 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model [Ferrari et al. CVPR 09] generic model - prior and appearance learned on the People dataset
37 Quantitative Results Method Torso Upper arm Lower arm Head Total [Ferrari et al. CVPR 08] detectors only full model [Ferrari et al. CVPR 09] full model, Buffy pose prior specialized upper body prior - appearance learned on the People dataset
38 Typical Failure Cases Foreshortening Part occlusion Detections on other body parts 38
39 Pictorial Structures for Human Pose Estimation: What are Good Parts? Parts of the Pictorial Structures Model parts = semantic body parts pose estimation = estimation of body part configuration semantic body parts allow to use motion capture data, etc. to improve kinematic tree prior non-semantic parts (e.g. in the ISM-model) are more difficult to generalize across human body poses 39
40 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 40
41 Standard Models seen as Graphical Models Monolithic (global) HOG [Dalal & Triggs, 2005] Part-based (local) Implicit Shape Model [Leibe&Schiele, 2003] also: Spatial Pyramid Kernel [Lazebnik et al., 2006] Pictorial Structures [Felzenszwalb et al., 2005] 41
42 Hierarchical Graphical Model Generalization of Standard Models New hierarchical structure Short and longer range dependencies Joint training of local and global representations - Cyclic graph structure - Bottom-up top-down propagation 42
43 Hierarchical CRF Latent Model Goal: discover meaningful parts Labels of nodes not known Now: part labels Allow for ambiguities of part assignments Combine with structure learning Keep hierarchical representation 43
44 Experiments on PASCAL VOC 2007: Hierarchical CRF Latent Model Consistently outperforms DPM [Felzenszwalb et al., 2008] MKL [Vedaldi et al., 2009] better due to more features 44
45 Hierarchical Latent CRF-Model: What are Good Parts? Parts of the Latent CRF Model: parts have three main properties - parts = hierarchical appearance abstractions - parts = local appearance shared across objects - parts = discriminat local appearance of the objects parts enable effective learning - by finding correspondences between discriminative local object appearance semantics of parts is a secondary effect and not necessary [Schnitzspan,Roth,Schiele@cvpr10] similar observations hold for DPM [Felzenszwalb et al@cvpr08] 45
46 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 46
47 Back to the Future: Learning Shape Models from 3D CAD Data 3D Computer Aided Design (CAD) Models Computer graphics, game design Polygonal meshes + texture descriptions semantic part annotations (may) exist [Stark,Goesele,Schiele@bmvc10] Can we learn Object Class Models directly from 3D CAD data? Issue: Transition between 3D CAD models and 2D real-world images 47
48 Shape-based Appearance Abstraction Non-Photorealistic Rendering Learn shape models from rendered images we do NOT render photo-realistically / texture But focus on 3D CAD model edges (mimic real-world image edges) Part boundaries Mesh creases Silhouette } Final edges (hidden edges removed) [Stark,Goesele,Schiele@bmvc10] 48
49 Shape - Local Shape + Global Geometry [Stark,Goesele,Schiele@bmvc10] Part-based object class representation Semantic parts from 3D CAD models: left front wheel, left front door, etc. View-points Left Front left Front Parts Local shape Global geometry 49
50 Constellation Model Weber, Welling, Perona, 00; Fergus, Zisserman, Perona, 03 Joint model for appearance and structure (=shape) X: positions, A: part appearance, S: scale h: Hypothesis = assignment of features (in the image) to parts (of the model) Gaussian part appearance pdf Gaussian shape pdf Gaussian relative scale pdf Prob. of detection Log(scale) 50
51 Experimental Evaluation - Test Data Set [Stark,Goesele,Schiele@bmvc10] 3D Object Classes Cars [Savarese and Fei-Fei ICCV 07] 8 azimuth angles... back back-left left front-left 2 elevation angles low high 5 cars 3 distances 240 images near medium far 51
52 Qualitative Results Three strongest true positive detections per viewpoint model Right Back-right Observations Accurate part localization Predicted viewpoints do match 52
53 Quantitative Results - Multi-View Detection [Stark,Goesele,Schiele@bmvc10] Evaluation protocol Precision / recall PASCAL overlap criterion Observations 4.3% AP improvement ( ) [Liebelt et al. CVPR 10] ( ) We can further improve performance by 8.9% AP ( ) recall bank of 8 (AP = 81.0) 10 degree intervals (AP = 89.9) bank of 8 (AP = 81.0) 10 degree intervals (AP = 89.9) Su et al. ICCV09 (AP = 55.3) Su et al. ICCV09 (AP = 55.3) Gill et al. DAGM09 (AP = 72.6) Gill et al. DAGM09 (AP = 72.6) Liebelt Liebelt et al. et CVPR10 al. (AP = 76.7) = 76.7) precision 53
54 Shape Model learned from 3D CAD-Data What are Good Parts? Parts of the Shape Model: parts = just means to enable correspondence across 3D-models semantics of parts: - in our case: yes - because of the employed 3D models - but: semantics neither necessary nor important Left Front left Front
55 Part-Based Models - Overview Today (more next week) Last Week: Part-Based based on Manual Labeling of Parts - Detection by Components, Multi-Scale Parts The Constellation Model - automatic discovery of parts and part-structure The Implicit Shape Model (ISM) - star-model of part configurations, parts obtained by clustering interest-points Today: Pictorial Structures Model Hierarchical Graphical Model Learning Object Model from CAD Data Discussion Semantic Parts vs. Discriminative Parts 55
56 What are Ideal Parts for Part-Based Object Models? Parts can/may be semantic body parts - e.g. for articulated human body pose estimation be feature clusters (typically many clusters) (e,g, ISM, constellation model, BoW) support learnability of discriminant appearance enable correspondence across 3D models... in all those cases: the most important property is that parts facilitate correspondence across object instances 56
57 What are Ideal Parts for Part-Based Object Models? Multiple motivations for part based models exist: intuitiveness: semantic meaning of parts/attributes is attractive (e.g. enables use of language sources) learnability: sharing of parts/attributes across instances/classes scalability: transferability of parts/attributes across classes... in general, parts support learnability and scalability when they facilitate correspondence across object instances across object classes across modalities (e.g. from language to visual appearance) and semantics is only a secondary concern (for intuitiveness ) 57
58 What are Ideal Parts for Part-Based Object Models? the Good, the Bad, and the Ugly thanks to: Micha Andriluka, Bastian Leibe, Sandra Ebert, Mario Fritz, Diane Larlus, Marcus Rohrbach, Paul Schnitzspan, Stefan Roth, Michael Stark, Michael Goesele
Part-Based Models for Object Class Recognition Part 2
High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object
More informationPart-Based Models for Object Class Recognition Part 2
High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object
More informationPart-Based Models for Object Class Recognition Part 3
High Level Computer Vision! Part-Based Models for Object Class Recognition Part 3 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de! http://www.d2.mpi-inf.mpg.de/cv ! State-of-the-Art
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationPart based models for recognition. Kristen Grauman
Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily
More informationPreviously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationStructured Models in. Dan Huttenlocher. June 2010
Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies
More informationObject Category Detection. Slides mostly from Derek Hoiem
Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models
More informationAutomatic Discovery of Meaningful Object Parts with Latent CRFs
Automatic Discovery of Meaningful Object Parts with Latent CRFs Paul Schnitzspan 1 Stefan Roth 1 Bernt Schiele 1,2 1 Department of Computer Science, TU Darmstadt 2 MPI Informatics, Saarbrücken Abstract
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationLecture 16: Object recognition: Part-based generative models
Lecture 16: Object recognition: Part-based generative models Professor Stanford Vision Lab 1 What we will learn today? Introduction Constellation model Weakly supervised training One-shot learning (Problem
More informationIn Good Shape: Robust People Detection based on Appearance and Shape
PISHCHULIN et al.: PEOPLE DETECTION USING APPEARANCE AND SHAPE 1 In Good Shape: Robust People Detection based on Appearance and Shape Leonid Pishchulin leonid@mpi-inf.mpg.de Arjun Jain ajain@mpi-inf.mpg.de
More informationCombining PGMs and Discriminative Models for Upper Body Pose Detection
Combining PGMs and Discriminative Models for Upper Body Pose Detection Gedas Bertasius May 30, 2014 1 Introduction In this project, I utilized probabilistic graphical models together with discriminative
More informationPart-based models. Lecture 10
Part-based models Lecture 10 Overview Representation Location Appearance Generative interpretation Learning Distance transforms Other approaches using parts Felzenszwalb, Girshick, McAllester, Ramanan
More informationDetection III: Analyzing and Debugging Detection Methods
CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can
More informationObject Detection by 3D Aspectlets and Occlusion Reasoning
Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationAction recognition in videos
Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit
More informationLocal Features based Object Categories and Object Instances Recognition
Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March, 2008 1 Thesis in Computer Vision Computer vision is the science and technology of
More informationModern Object Detection. Most slides from Ali Farhadi
Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij
More informationRevisiting 3D Geometric Models for Accurate Object Shape and Pose
Revisiting 3D Geometric Models for Accurate Object Shape and Pose M. 1 Michael Stark 2,3 Bernt Schiele 3 Konrad Schindler 1 1 Photogrammetry and Remote Sensing Laboratory Swiss Federal Institute of Technology
More informationBeyond Bags of features Spatial information & Shape models
Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationPeople-Tracking-by-Detection and People-Detection-by-Tracking
People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth Bernt Schiele Computer Science Department TU Darmstadt, Germany {andriluka, sroth, schiele}@cs.tu-darmstadt.de
More informationDetecting and Segmenting Humans in Crowded Scenes
Detecting and Segmenting Humans in Crowded Scenes Mikel D. Rodriguez University of Central Florida 4000 Central Florida Blvd Orlando, Florida, 32816 mikel@cs.ucf.edu Mubarak Shah University of Central
More informationCategory-level Localization
Category-level Localization Andrew Zisserman Visual Geometry Group University of Oxford http://www.robots.ox.ac.uk/~vgg Includes slides from: Ondra Chum, Alyosha Efros, Mark Everingham, Pedro Felzenszwalb,
More informationDevelopment in Object Detection. Junyuan Lin May 4th
Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,
More informationRecap Image Classification with Bags of Local Features
Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationObject Recognition with Deformable Models
Object Recognition with Deformable Models Pedro F. Felzenszwalb Department of Computer Science University of Chicago Joint work with: Dan Huttenlocher, Joshua Schwartz, David McAllester, Deva Ramanan.
More informationClustered Pose and Nonlinear Appearance Models for Human Pose Estimation
JOHNSON, EVERINGHAM: CLUSTERED MODELS FOR HUMAN POSE ESTIMATION 1 Clustered Pose and Nonlinear Appearance Models for Human Pose Estimation Sam Johnson s.a.johnson04@leeds.ac.uk Mark Everingham m.everingham@leeds.ac.uk
More informationHigh Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG)
High Level Computer Vision Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv
More informationMultiple-Person Tracking by Detection
http://excel.fit.vutbr.cz Multiple-Person Tracking by Detection Jakub Vojvoda* Abstract Detection and tracking of multiple person is challenging problem mainly due to complexity of scene and large intra-class
More informationBackprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections
Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections Nima Razavi 1, Juergen Gall 1, and Luc Van Gool 1,2 1 Computer Vision Laboratory, ETH Zurich 2 ESAT-PSI/IBBT,
More informationSupervised learning. y = f(x) function
Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the
More informationObject Category Detection: Sliding Windows
04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical
More informationObject Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce
Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS
More informationCategory vs. instance recognition
Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building
More informationCombining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation
Combining Discriminative Appearance and Segmentation Cues for Articulated Human Pose Estimation Sam Johnson and Mark Everingham School of Computing University of Leeds {mat4saj m.everingham}@leeds.ac.uk
More informationComputer Vision Lecture 16
Computer Vision Lecture 16 Deep Learning for Object Categorization 14.01.2016 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de leibe@vision.rwth-aachen.de Announcements Seminar registration period
More informationCourse Administration
Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics
More informationSelective Search for Object Recognition
Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where
More informationA novel template matching method for human detection
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 A novel template matching method for human detection Duc Thanh Nguyen
More informationLecture 15: Detecting Objects by Parts
Lecture 15: Detecting Objects by Parts David R. Morales, Austin O. Narcomey, Minh-An Quinn, Guilherme Reis, Omar Solis Department of Computer Science Stanford University Stanford, CA 94305 {mrlsdvd, aon2,
More informationWindow based detectors
Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection
More informationObject detection using non-redundant local Binary Patterns
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Object detection using non-redundant local Binary Patterns Duc Thanh
More informationTA Section: Problem Set 4
TA Section: Problem Set 4 Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model
More informationSupervised learning. y = f(x) function
Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the
More informationSelection of Scale-Invariant Parts for Object Class Recognition
Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract
More informationLecture 15 Visual recognition
Lecture 15 Visual recognition 3D object detection Introduction Single instance 3D object detectors Generic 3D object detectors Silvio Savarese Lecture 15-26-Feb-14 3D object detection Object: Building
More informationStrong Appearance and Expressive Spatial Models for Human Pose Estimation
2013 IEEE International Conference on Computer Vision Strong Appearance and Expressive Spatial Models for Human Pose Estimation Leonid Pishchulin 1 Mykhaylo Andriluka 1 Peter Gehler 2 Bernt Schiele 1 1
More informationMultiple Pose Context Trees for estimating Human Pose in Object Context
Multiple Pose Context Trees for estimating Human Pose in Object Context Vivek Kumar Singh Furqan Muhammad Khan Ram Nevatia University of Southern California Los Angeles, CA {viveksin furqankh nevatia}@usc.edu
More informationOcclusion Patterns for Object Class Detection
Occlusion Patterns for Object Class Detection Bojan Pepik1 Michael Stark1,2 Peter Gehler3 Bernt Schiele1 Max Planck Institute for Informatics, 2Stanford University, 3Max Planck Institute for Intelligent
More informationBeyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets
More informationSeeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models
Seeing 3D chairs: Exemplar part-based 2D-3D alignment using a large dataset of CAD models Mathieu Aubry (INRIA) Daniel Maturana (CMU) Alexei Efros (UC Berkeley) Bryan Russell (Intel) Josef Sivic (INRIA)
More informationObject Detection with Partial Occlusion Based on a Deformable Parts-Based Model
Object Detection with Partial Occlusion Based on a Deformable Parts-Based Model Johnson Hsieh (johnsonhsieh@gmail.com), Alexander Chia (alexchia@stanford.edu) Abstract -- Object occlusion presents a major
More informationRevisiting 3D Geometric Models for Accurate Object Shape and Pose
Revisiting 3D Geometric Models for Accurate Object Shape and Pose M. Zeeshan Zia 1, Michael Stark 2, Bernt Schiele 2, and Konrad Schindler 1 1 Photogrammetry and Remote Sensing Laboratory, ETH Zürich,
More informationFuzzy based Multiple Dictionary Bag of Words for Image Classification
Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image
More informationIs 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014
Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Problem Definition Viewpoint estimation: Given an image, predicting viewpoint for object of
More informationSegmentation. Bottom up Segmentation Semantic Segmentation
Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationHuman detection using local shape and nonredundant
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Human detection using local shape and nonredundant binary patterns
More informationFind that! Visual Object Detection Primer
Find that! Visual Object Detection Primer SkTech/MIT Innovation Workshop August 16, 2012 Dr. Tomasz Malisiewicz tomasz@csail.mit.edu Find that! Your Goals...imagine one such system that drives information
More informationObject detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin
Object detection Wed Feb 24 Kristen Grauman UT Austin Announcements Reminder: Assignment 2 is due Mar 9 and Mar 10 Be ready to run your code again on a new test set on Mar 10 Vision talk next Tuesday 11
More informationA Comparison on Different 2D Human Pose Estimation Method
A Comparison on Different 2D Human Pose Estimation Method Santhanu P.Mohan 1, Athira Sugathan 2, Sumesh Sekharan 3 1 Assistant Professor, IT Department, VJCET, Kerala, India 2 Student, CVIP, ASE, Tamilnadu,
More informationRepositorio Institucional de la Universidad Autónoma de Madrid.
Repositorio Institucional de la Universidad Autónoma de Madrid https://repositorio.uam.es Esta es la versión de autor de la comunicación de congreso publicada en: This is an author produced version of
More informationCEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.
CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering Department of Computer Science 3129D,
More informationMulti-View Object Class Detection with a 3D Geometric Model
Multi-View Object Class Detection with a 3D Geometric Model Jörg Liebelt, Cordelia Schmid To cite this version: Jörg Liebelt, Cordelia Schmid. Multi-View Object Class Detection with a 3D Geometric Model.
More informationhttps://en.wikipedia.org/wiki/the_dress Recap: Viola-Jones sliding window detector Fast detection through two mechanisms Quickly eliminate unlikely windows Use features that are fast to compute Viola
More informationOcclusion Patterns for Object Class Detection
23 IEEE Conference on Computer Vision and Pattern Recognition Occlusion Patterns for Object Class Detection Bojan Pepik Michael Stark,2 Peter Gehler 3 Bernt Schiele Max Planck Institute for Informatics,
More informationCEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.
CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering Lab
More informationPeople detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features
People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features M. Siala 1, N. Khlifa 1, F. Bremond 2, K. Hamrouni 1 1. Research Unit in Signal Processing, Image Processing
More informationFeature Selection using Constellation Models
Feature Selection using Constellation Models Heydar Maboudi Afkham, Carl Herik Ek and Stefan Calsson Computer Vision and Active Perception Lab. Royal Institute of Technology (KTH) Stockholm, Sweden {heydarma,chek,stefanc}@csc.kth.se
More informationPoselet Conditioned Pictorial Structures
Poselet Conditioned Pictorial Structures Leonid Pishchulin 1 Mykhaylo Andriluka 1 Peter Gehler 2 Bernt Schiele 1 1 Max Planck Institute for Informatics, Saarbrücken, Germany 2 Max Planck Institute for
More informationHistograms of Oriented Gradients for Human Detection p. 1/1
Histograms of Oriented Gradients for Human Detection p. 1/1 Histograms of Oriented Gradients for Human Detection Navneet Dalal and Bill Triggs INRIA Rhône-Alpes Grenoble, France Funding: acemedia, LAVA,
More informationPEDESTRIAN DETECTION IN CROWDED SCENES VIA SCALE AND OCCLUSION ANALYSIS
PEDESTRIAN DETECTION IN CROWDED SCENES VIA SCALE AND OCCLUSION ANALYSIS Lu Wang Lisheng Xu Ming-Hsuan Yang Northeastern University, China University of California at Merced, USA ABSTRACT Despite significant
More informationMonocular 3D Pose Estimation and Tracking by Detection
Monocular 3D Pose Estimation and Tracking by Detection Mykhaylo Andriluka1,2 Stefan Roth1 Bernt Schiele1,2 1 2 Department of Computer Science, TU Darmstadt MPI Informatics, Saarbru cken Abstract Automatic
More informationLecture 18: Human Motion Recognition
Lecture 18: Human Motion Recognition Professor Fei Fei Li Stanford Vision Lab 1 What we will learn today? Introduction Motion classification using template matching Motion classification i using spatio
More informationMining Discriminative Adjectives and Prepositions for Natural Scene Recognition
Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Bangpeng Yao 1, Juan Carlos Niebles 2,3, Li Fei-Fei 1 1 Department of Computer Science, Princeton University, NJ 08540, USA
More informationComputer Vision. Exercise Session 10 Image Categorization
Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category
More informationTest-time Adaptation for 3D Human Pose Estimation
Test-time Adaptation for 3D Human Pose Estimation Sikandar Amin,2, Philipp Müller 2, Andreas Bulling 2, and Mykhaylo Andriluka 2,3 Technische Universität München, Germany 2 Max Planck Institute for Informatics,
More informationHistogram of Oriented Gradients (HOG) for Object Detection
Histogram of Oriented Gradients (HOG) for Object Detection Navneet DALAL Joint work with Bill TRIGGS and Cordelia SCHMID Goal & Challenges Goal: Detect and localise people in images and videos n Wide variety
More informationPerson Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab
Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human
More informationObject Recognition and Detection
CS 2770: Computer Vision Object Recognition and Detection Prof. Adriana Kovashka University of Pittsburgh March 16, 21, 23, 2017 Plan for the next few lectures Recognizing the category in the image as
More informationMultiple Kernel Learning for Emotion Recognition in the Wild
Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,
More informationFusing shape and appearance information for object category detection
1 Fusing shape and appearance information for object category detection Andreas Opelt, Axel Pinz Graz University of Technology, Austria Andrew Zisserman Dept. of Engineering Science, University of Oxford,
More informationObject Recognition II
Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based
More informationCS 231A Computer Vision (Fall 2011) Problem Set 4
CS 231A Computer Vision (Fall 2011) Problem Set 4 Due: Nov. 30 th, 2011 (9:30am) 1 Part-based models for Object Recognition (50 points) One approach to object recognition is to use a deformable part-based
More informationCS6670: Computer Vision
CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:
More informationSpatial Localization and Detection. Lecture 8-1
Lecture 8: Spatial Localization and Detection Lecture 8-1 Administrative - Project Proposals were due on Saturday Homework 2 due Friday 2/5 Homework 1 grades out this week Midterm will be in-class on Wednesday
More informationObject recognition. Methods for classification and image representation
Object recognition Methods for classification and image representation Credits Slides by Pete Barnum Slides by FeiFei Li Paul Viola, Michael Jones, Robust Realtime Object Detection, IJCV 04 Navneet Dalal
More informationSegmentation as Selective Search for Object Recognition in ILSVRC2011
Segmentation as Selective Search for Object Recognition in ILSVRC2011 Koen van de Sande Jasper Uijlings Arnold Smeulders Theo Gevers Nicu Sebe Cees Snoek University of Amsterdam, University of Trento ILSVRC2011
More informationDeformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation
Deformable Part Models Revisited: A Performance Evaluation for Object Category Pose Estimation Roberto J. Lo pez-sastre Tinne Tuytelaars2 Silvio Savarese3 GRAM, Dept. Signal Theory and Communications,
More informationLocal cues and global constraints in image understanding
Local cues and global constraints in image understanding Olga Barinova Lomonosov Moscow State University *Many slides adopted from the courses of Anton Konushin Image understanding «To see means to know
More informationObject Detection Design challenges
Object Detection Design challenges How to efficiently search for likely objects Even simple models require searching hundreds of thousands of positions and scales Feature design and scoring How should
More informationModeling 3D viewpoint for part-based object recognition of rigid objects
Modeling 3D viewpoint for part-based object recognition of rigid objects Joshua Schwartz Department of Computer Science Cornell University jdvs@cs.cornell.edu Abstract Part-based object models based on
More informationA Hierarchical Compositional System for Rapid Object Detection
A Hierarchical Compositional System for Rapid Object Detection Long Zhu and Alan Yuille Department of Statistics University of California at Los Angeles Los Angeles, CA 90095 {lzhu,yuille}@stat.ucla.edu
More informationMethods for Representing and Recognizing 3D objects
Methods for Representing and Recognizing 3D objects part 1 Silvio Savarese University of Michigan at Ann Arbor Object: Building, 45º pose, 8-10 meters away Object: Person, back; 1-2 meters away Object:
More informationHigh-Level Fusion of Depth and Intensity for Pedestrian Classification
High-Level Fusion of Depth and Intensity for Pedestrian Classification Marcus Rohrbach 1,3, Markus Enzweiler 2 and Dariu M. Gavrila 1,4 1 Environment Perception, Group Research, Daimler AG, Ulm, Germany
More informationBoosted Local Structured HOG-LBP for Object Localization
Boosted Local Structured HOG-LBP for Object Localization Junge Zhang, Kaiqi Huang, Yinan Yu and Tieniu Tan National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences
More information