Object recognition (part 2)

Size: px
Start display at page:

Download "Object recognition (part 2)"

Transcription

1 Object recognition (part 2) CSE P 576 Larry Zitnick (larryz@microsoft.com) 1

2 2

3 3

4 Support Vector Machines Modified from the slides by Dr. Andrew W. Moore Linear Classifiers denotes +1 denotes -1 x a f f(x,w,b) = sign(w. x - b) How would you classify this data? y est Nov 23rd, 2001 Support Vector Machines: Slide 14 Linear Classifiers x a f y est Linear Classifiers x a f y est denotes +1 f(x,w,b) = sign(w. x - b) denotes +1 f(x,w,b) = sign(w. x - b) denotes -1 denotes -1 How would you classify this data? How would you classify this data? Support Vector Machines: Slide 15 Support Vector Machines: Slide 16 4

5 Linear Classifiers x a f y est Linear Classifiers x a f y est denotes +1 f(x,w,b) = sign(w. x - b) denotes +1 f(x,w,b) = sign(w. x - b) denotes -1 denotes -1 How would you classify this data? Any of these would be fine....but which is best? Support Vector Machines: Slide 17 Support Vector Machines: Slide 18 Classifier Margin x a f y est Maximum Margin x a f y est denotes +1 denotes -1 f(x,w,b) = sign(w. x - b) Define the margin of a linear classifier as the width that the boundary could be increased by before hitting a datapoint. denotes +1 denotes -1 Linear SVM f(x,w,b) = sign(w. x - b) The maximum margin linear classifier is the linear classifier with the, um, maximum margin. This is the simplest kind of SVM (Called an LSVM) Support Vector Machines: Slide 19 Support Vector Machines: Slide 20 5

6 Maximum Margin denotes +1 denotes -1 Support Vectors are those datapoints that the margin pushes up against x Linear SVM a f f(x,w,b) = sign(w. x - b) y est The maximum margin linear classifier is the linear classifier with the, um, maximum margin. This is the simplest kind of SVM (Called an LSVM) Support Vector Machines: Slide 21 Why Maximum Margin? denotes +1 denotes -1 Support Vectors are those datapoints that the margin pushes up against 1. Intuitively this feels safest. 2. If we ve made f(x,w,b) a small = sign(w. error in x the - b) location of the boundary (it s been jolted in its perpendicular The maximum direction) this gives us least margin chance linear of causing a misclassification. classifier is the 3. LOOCV is easy since linear the classifier model is immune to removal with of any the, nonsupport-vector datapoints. um, maximum margin. 4. There s some theory (using VC dimension) that is This related is the to (but not the same as) the simplest proposition kind that of this is a good thing. SVM (Called an 5. Empirically it works LSVM) very very well. Support Vector Machines: Slide 22 Nonlinear Kernel (I) Nonlinear Kernel (II) Support Vector Machines: Slide 23 Support Vector Machines: Slide 24 6

7 Support Vector Machines: Slide 25 Caltech-101: Drawbacks Smallest category size is 31 images: N train 30 Too easy? left-right aligned Rotation artifacts Soon will saturate performance 7

8 Antonio Torralba generated these average images of the Caltech 101 categories 5/23/2011 Jump to Nicolas Pinto s slides. (page 29) Objects in Context R. Gokberk Cinbis MIT Object Recognition and Scene Understanding 32 8

9 Papers A. Torralba. Contextual priming for object detection. IJCV Object presence at a particular location/scal e Object Detection Probabilistic Framework (Single Object Likelihood) A. Rabinovich, A. Vedaldi, C. Galleguillos, E. Wiewiora and S. Belongie. Objects in Context. ICCV 2007 Given all image features (local/object and scene/context) v = v Local + v Contextual Contextual Reasoning Preview: Contextual Priming for Object Detection Scene Centered 2D Reasoning Object Centered 2,5D / 3D Reasoning Surface orientations w.r.t. camera SKY? VERTICAL SUPPORT VERTIC AL Input test image Contextual priming for object detection Objects in Context Geometric context from a single image

10 Preview: Contextual Priming for Object Detection Preview: Contextual Priming for Object Detection Using previously collected statistics about filter output predict information about objects Correlate with many filters Preview: Contextual Priming for Object Detection Contextual Priming for Object Detection: Probabilistic Framework Predict information about objects Local measurements (a lot in the literature) Contextual features Where I can find the objects easily? people chair car Which objects do I expect to see? How large objects do I expect to see?

11 Contextual Priming for Object Detection: Contextual Features Contextual Priming for Object Detection: Object Priming Results Gabor filters at 4 scales and 6 orientations Use PCA on filter output images to reduce the number of features (< 64) Use Mixture of Gaussians to model the probabilities. (Other alternatives include KNN, parzen window, logistic regression, etc) 41 (o 1 =people, o 2 =furniture, o 3 =vehicles and o 4 =trees 42 Contextual Priming for Object Detection: Focus of Attention Results Contextual Priming for Object Detection: Conclusions Proves the relation btw low level features and scene/context Can be seen as a computational evidence for the (possible) existence of low-level feature based biological attention mechanisms Also a warning: Whether an object recognition system understands the object or works by lots bg features. Heads

12 Preview: Objects in Context Preview: Objects in Context Input test image Do segmentation on the image Preview: Objects in Context Preview: Objects in Context Building, boat, person Building, boat, person Building Water, sky Road Building, boat, motorbike Do classification (find label probabilities) in each segment only with local info 47 Water, sky Road Building, boat, motorbike Road Boat Water Most consistent labeling according to object cooccurrences & local label probabilities

13 Objects in Context: Local Categorization Objects in Context: Contextual Refinement Building, boat, person Water, sky Road Building, boat, motorbike Extract random patches on zero-padded segments Calculate SIFT descriptors Use BoF: Training: - Cluster patches in training (Hier. K-means, K=10x3) - Histogram of words in each segment - NN classifier (returns a sorted list of categories) Water Building Boat Road Contextual model based on co-occurrences Try to find the most consistent labeling with high posterior probability and high mean pairwise interaction. Use CRF for this purpose. Each segment is classified independently 49 Mean interaction of all label pairs Φ(i,j) is basically the observed label cooccurrences in training set. Independent segment classification 50 Objects in Context: Learning Context Objects in Context: Results Using labeled image datasets (MSRC, PASCAL) Using labeled text based data (Google Sets): Contains list of related items A large set turns out to be useless! (anything is related)

14 Objects in Context Limitations: Context modeling Objects in Context Limitations: Context modeling With co-occurrence Categorization without With co-occurrence Categorization without context Segmentation Segmentation context context context P(person,horse) > Local information only Means: P(person,dog) Local information only P(person, dog) > P(person, cow) But why? Isn t it only a dataset bias? We have seen in the previous example that P(person,dog) is common too. (Bonus Q: How did it handle the background?) Stuff-like Stuff Labels with high co-occurrences with other labels Objects in Context Object-Object or Stuff-Object? Looks like background stuff object (such as water-boat) does help rather than foreground object cooccurrences (such as person-horse) [but still car-person-motorbike is useful in PASCAL] Objects in Context Limitations: Segmentation Too good: A few or many? How to select a good segmentation in multiple segmentations? Can make object recognition & contextual reasoning (due to stuff detection) much easier

15 Objects in Context - Limitations Contextual Priming vs. Objects in Context No cue by unknown objects No spatial relationship reasoning Object detection part heavily depends on good segmentations Improvements using object co-occurrences are demonstrated with images where many labels are already correct. How good is the model? 57 Scene->Object Simpler training data (only target object s labels are enough) Scene information is view-dependent (due to gist) Object detector independent {Object,Stuff} <-> {Object,Stuff} May need huge amount of labeled data Can be more generic than scene->object with a very good model Contextual model is object detector independent, in theory. But: + use segmentation easier to detect stuff - uses segmentation can be unreliable 58 Object recognition We ve come a long way Finding the weakest link in person detectors Devi Parikh TTI, Chicago Larry Zitnick Microsoft Research Fischler and Elschlager, 1973 Dollar et al., BMVC

16 Still a ways to go Dollar et al., BMVC 2009 Dollar et al., BMVC 2009 Still a ways to go Part-based person detector 4 main components: Feature selection Part detection Spatial model NMS / context Color Intensity Edges Dollar et al., BMVC 2009 Felzenszwalb Hoiem et al., et al.,

17 How can we help? Human debugging Humans supply training data 100,000s labeled images We design the algorithms. Going on 40 years. Help me! Feature selection Feature selection Part detection Spatial model Spatial model NMS / context NMS / context Can we use humans to debug? Feature selection Part detection NMS / context Amazon Mechanical Turk Feature selection Part detection Human performance Humans ~90% average precision Machines ~46% average precision Feature selection Human debugging Spatial model NMS / context Is it a head, torso, arm, leg, foot, hand, or nothing? Head Leg Nothing Low resolution 20x20 pixels PASCAL VOC dataset Feet Head? 17

18 Part detections Head Torso Arm Hand Leg Foot Person Part detections Machine Machine Humans Humans Part detections Part detections High res Humans Machine Low res Machine 18

19 AP results Spatial model Feature selection Part detection NMS / context Person High res Low res Not a person Spatial model Context/NMS vs. NMS / context 19

20 Conclusion 7:00min 20

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009 Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context

More information

Bag-of-features. Cordelia Schmid

Bag-of-features. Cordelia Schmid Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image

More information

CS 8520: Artificial Intelligence. Machine Learning 2. Paula Matuszek Fall, CSC 8520 Fall Paula Matuszek

CS 8520: Artificial Intelligence. Machine Learning 2. Paula Matuszek Fall, CSC 8520 Fall Paula Matuszek CS 8520: Artificial Intelligence Machine Learning 2 Paula Matuszek Fall, 2015!1 Regression Classifiers We said earlier that the task of a supervised learning system can be viewed as learning a function

More information

CAP5415-Computer Vision Lecture 13-Support Vector Machines for Computer Vision Applica=ons

CAP5415-Computer Vision Lecture 13-Support Vector Machines for Computer Vision Applica=ons CAP5415-Computer Vision Lecture 13-Support Vector Machines for Computer Vision Applica=ons Guest Lecturer: Dr. Boqing Gong Dr. Ulas Bagci bagci@ucf.edu 1 October 14 Reminders Choose your mini-projects

More information

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search Beyond Sliding Windows: Object Localization by Efficient Subwindow Search Christoph H. Lampert, Matthew B. Blaschko, & Thomas Hofmann Max Planck Institute for Biological Cybernetics Tübingen, Germany Google,

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

Decomposing a Scene into Geometric and Semantically Consistent Regions

Decomposing a Scene into Geometric and Semantically Consistent Regions Decomposing a Scene into Geometric and Semantically Consistent Regions Stephen Gould sgould@stanford.edu Richard Fulton rafulton@cs.stanford.edu Daphne Koller koller@cs.stanford.edu IEEE International

More information

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Introduction to object recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Overview Basic recognition tasks A statistical learning approach Traditional or shallow recognition

More information

Learning and Recognizing Visual Object Categories Without First Detecting Features

Learning and Recognizing Visual Object Categories Without First Detecting Features Learning and Recognizing Visual Object Categories Without First Detecting Features Daniel Huttenlocher 2007 Joint work with D. Crandall and P. Felzenszwalb Object Category Recognition Generic classes rather

More information

Recognition Tools: Support Vector Machines

Recognition Tools: Support Vector Machines CS 2770: Computer Vision Recognition Tools: Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh January 12, 2017 Announcement TA office hours: Tuesday 4pm-6pm Wednesday 10am-12pm Matlab

More information

Context. CS 554 Computer Vision Pinar Duygulu Bilkent University. (Source:Antonio Torralba, James Hays)

Context. CS 554 Computer Vision Pinar Duygulu Bilkent University. (Source:Antonio Torralba, James Hays) Context CS 554 Computer Vision Pinar Duygulu Bilkent University (Source:Antonio Torralba, James Hays) A computer vision goal Recognize many different objects under many viewing conditions in unconstrained

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Probabilistic Graphical Models Part III: Example Applications

Probabilistic Graphical Models Part III: Example Applications Probabilistic Graphical Models Part III: Example Applications Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2014 CS 551, Fall 2014 c 2014, Selim

More information

Computer Vision: Summary and Discussion

Computer Vision: Summary and Discussion 12/05/2011 Computer Vision: Summary and Discussion Computer Vision CS 143, Brown James Hays Many slides from Derek Hoiem Announcements Today is last day of regular class Second quiz on Wednesday (Dec 7

More information

CS 8520: Artificial Intelligence

CS 8520: Artificial Intelligence CS 8520: Artificial Intelligence Machine Learning 2 Paula Matuszek Spring, 2013 1 Regression Classifiers We said earlier that the task of a supervised learning system can be viewed as learning a function

More information

Object recognition (part 1)

Object recognition (part 1) Recognition Object recognition (part 1) CSE P 576 Larry Zitnick (larryz@microsoft.com) The Margaret Thatcher Illusion, by Peter Thompson Readings Szeliski Chapter 14 Recognition What do we mean by object

More information

Part based models for recognition. Kristen Grauman

Part based models for recognition. Kristen Grauman Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily

More information

Learning Representations for Visual Object Class Recognition

Learning Representations for Visual Object Class Recognition Learning Representations for Visual Object Class Recognition Marcin Marszałek Cordelia Schmid Hedi Harzallah Joost van de Weijer LEAR, INRIA Grenoble, Rhône-Alpes, France October 15th, 2007 Bag-of-Features

More information

Semantic Pooling for Image Categorization using Multiple Kernel Learning

Semantic Pooling for Image Categorization using Multiple Kernel Learning Semantic Pooling for Image Categorization using Multiple Kernel Learning Thibaut Durand (1,2), Nicolas Thome (1), Matthieu Cord (1), David Picard (2) (1) Sorbonne Universités, UPMC Univ Paris 06, UMR 7606,

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould Tianshi Gao Daphne Koller Presented at NIPS 2009 Discussion and Slides by Eric Wang April 23, 2010 Outline Introduction Model Overview Model

More information

Segmentation. Bottom up Segmentation Semantic Segmentation

Segmentation. Bottom up Segmentation Semantic Segmentation Segmentation Bottom up Segmentation Semantic Segmentation Semantic Labeling of Street Scenes Ground Truth Labels 11 classes, almost all occur simultaneously, large changes in viewpoint, scale sky, road,

More information

Effect of Salient Features in Object Recognition

Effect of Salient Features in Object Recognition Effect of Salient Features in Object Recognition Kashif Ahmad University of Engineering & Technology, Peshawar, Pakistan Nasir Ahmad University of Engineering & Technology, Peshawar, Pakistan Muhammad

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Context by Region Ancestry

Context by Region Ancestry Context by Region Ancestry Joseph J. Lim, Pablo Arbeláez, Chunhui Gu, and Jitendra Malik University of California, Berkeley - Berkeley, CA 94720 {lim,arbelaez,chunhui,malik}@eecs.berkeley.edu Abstract

More information

Detection III: Analyzing and Debugging Detection Methods

Detection III: Analyzing and Debugging Detection Methods CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can

More information

A Framework for Encoding Object-level Image Priors

A Framework for Encoding Object-level Image Priors Microsoft Research Technical Report MSR-TR-2011-99 A Framework for Encoding Object-level Image Priors Jenny Yuen 1 C. Lawrence Zitnick 2 Ce Liu 2 Antonio Torralba 1 {jenny, torralba}@csail.mit.edu {larryz,

More information

570 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 2, FEBRUARY 2011

570 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 2, FEBRUARY 2011 570 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 2, FEBRUARY 2011 Contextual Object Localization With Multiple Kernel Nearest Neighbor Brian McFee, Student Member, IEEE, Carolina Galleguillos, Student

More information

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem Many slides from Derek Hoiem Attributes Computer Vision James Hays Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed. Human-in-the-loop recognition:

More information

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped

More information

Modeling Image Context using Object Centered Grid

Modeling Image Context using Object Centered Grid Modeling Image Context using Object Centered Grid Sobhan Naderi Parizi, Ivan Laptev, Alireza Tavakoli Targhi Computer Vision and Active Perception Laboratory Royal Institute of Technology (KTH) SE-100

More information

Multiple-Choice Questionnaire Group C

Multiple-Choice Questionnaire Group C Family name: Vision and Machine-Learning Given name: 1/28/2011 Multiple-Choice naire Group C No documents authorized. There can be several right answers to a question. Marking-scheme: 2 points if all right

More information

Find that! Visual Object Detection Primer

Find that! Visual Object Detection Primer Find that! Visual Object Detection Primer SkTech/MIT Innovation Workshop August 16, 2012 Dr. Tomasz Malisiewicz tomasz@csail.mit.edu Find that! Your Goals...imagine one such system that drives information

More information

Mixtures of Gaussians and Advanced Feature Encoding

Mixtures of Gaussians and Advanced Feature Encoding Mixtures of Gaussians and Advanced Feature Encoding Computer Vision Ali Borji UWM Many slides from James Hayes, Derek Hoiem, Florent Perronnin, and Hervé Why do good recognition systems go bad? E.g. Why

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Local Features based Object Categories and Object Instances Recognition

Local Features based Object Categories and Object Instances Recognition Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March, 2008 1 Thesis in Computer Vision Computer vision is the science and technology of

More information

Using the Forest to See the Trees: Context-based Object Recognition

Using the Forest to See the Trees: Context-based Object Recognition Using the Forest to See the Trees: Context-based Object Recognition Bill Freeman Joint work with Antonio Torralba and Kevin Murphy Computer Science and Artificial Intelligence Laboratory MIT A computer

More information

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS

More information

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and

More information

Determining Patch Saliency Using Low-Level Context

Determining Patch Saliency Using Low-Level Context Determining Patch Saliency Using Low-Level Context Devi Parikh 1, C. Lawrence Zitnick 2, and Tsuhan Chen 1 1 Carnegie Mellon University, Pittsburgh, PA, USA 2 Microsoft Research, Redmond, WA, USA Abstract.

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

Classification and Detection in Images. D.A. Forsyth

Classification and Detection in Images. D.A. Forsyth Classification and Detection in Images D.A. Forsyth Classifying Images Motivating problems detecting explicit images classifying materials classifying scenes Strategy build appropriate image features train

More information

24 hours of Photo Sharing. installation by Erik Kessels

24 hours of Photo Sharing. installation by Erik Kessels 24 hours of Photo Sharing installation by Erik Kessels And sometimes Internet photos have useful labels Im2gps. Hays and Efros. CVPR 2008 But what if we want more? Image Categorization Training Images

More information

Contextual Co-occurrence Information for Object Representation and Categorization

Contextual Co-occurrence Information for Object Representation and Categorization Vol.8, No.1 (2015), pp.95-104 http://dx.doi.org/10.14257/ijdta.2015.8.1.11 Contextual Co-occurrence Information for Object Representation and Categorization 1 Soheila Sheikhbahaei and 2 Zahra Sadeghi 1

More information

Object Detection by 3D Aspectlets and Occlusion Reasoning

Object Detection by 3D Aspectlets and Occlusion Reasoning Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition

More information

High Level Computer Vision

High Level Computer Vision High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv Please Note No

More information

CS 231A Computer Vision (Fall 2011) Problem Set 4

CS 231A Computer Vision (Fall 2011) Problem Set 4 CS 231A Computer Vision (Fall 2011) Problem Set 4 Due: Nov. 30 th, 2011 (9:30am) 1 Part-based models for Object Recognition (50 points) One approach to object recognition is to use a deformable part-based

More information

TA Section: Problem Set 4

TA Section: Problem Set 4 TA Section: Problem Set 4 Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

1 Learning Spatial Context: 2 Using Stuff to Find Things

1 Learning Spatial Context: 2 Using Stuff to Find Things Learning Spatial Context: 2 Using Stuff to Find Things 2 Geremy Heitz Daphne Koller 3 3 4 Department of Computer Science 4 5 Stanford University 5 6 Stanford, CA 94305 6 7 Abstract. The sliding window

More information

Learning Spatial Context: Using Stuff to Find Things

Learning Spatial Context: Using Stuff to Find Things Learning Spatial Context: Using Stuff to Find Things Wei-Cheng Su Motivation 2 Leverage contextual information to enhance detection Some context objects are non-rigid and are more naturally classified

More information

Object Category Detection. Slides mostly from Derek Hoiem

Object Category Detection. Slides mostly from Derek Hoiem Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

Supervised Learning (contd) Linear Separation. Mausam (based on slides by UW-AI faculty)

Supervised Learning (contd) Linear Separation. Mausam (based on slides by UW-AI faculty) Supervised Learning (contd) Linear Separation Mausam (based on slides by UW-AI faculty) Images as Vectors Binary handwritten characters Treat an image as a highdimensional vector (e.g., by reading pixel

More information

Statistics of Natural Image Categories

Statistics of Natural Image Categories Statistics of Natural Image Categories Authors: Antonio Torralba and Aude Oliva Presented by: Sebastian Scherer Experiment Please estimate the average depth from the camera viewpoint to all locations(pixels)

More information

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study J. Zhang 1 M. Marszałek 1 S. Lazebnik 2 C. Schmid 1 1 INRIA Rhône-Alpes, LEAR - GRAVIR Montbonnot, France

More information

Beyond Bags of features Spatial information & Shape models

Beyond Bags of features Spatial information & Shape models Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features

More information

Local Features and Bag of Words Models

Local Features and Bag of Words Models 10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering

More information

3D Shape Analysis with Multi-view Convolutional Networks. Evangelos Kalogerakis

3D Shape Analysis with Multi-view Convolutional Networks. Evangelos Kalogerakis 3D Shape Analysis with Multi-view Convolutional Networks Evangelos Kalogerakis 3D model repositories [3D Warehouse - video] 3D geometry acquisition [KinectFusion - video] 3D shapes come in various flavors

More information

Stealing Objects With Computer Vision

Stealing Objects With Computer Vision Stealing Objects With Computer Vision Learning Based Methods in Vision Analysis Project #4: Mar 4, 2009 Presented by: Brian C. Becker Carnegie Mellon University Motivation Goal: Detect objects in the photo

More information

Social Media Computing

Social Media Computing Social Media Computing Lecture 4: Introduction to Information Retrieval and Classification Lecturer: Aleksandr Farseev E-mail: farseev@u.nus.edu Slides: http://farseev.com/ainlfruct.html At the beginning,

More information

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Visuelle Perzeption für Mensch- Maschine Schnittstellen Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de

More information

Attributes and More Crowdsourcing

Attributes and More Crowdsourcing Attributes and More Crowdsourcing Computer Vision CS 143, Brown James Hays Many slides from Derek Hoiem Recap: Human Computation Active Learning: Let the classifier tell you where more annotation is needed.

More information

Bias-Variance Trade-off + Other Models and Problems

Bias-Variance Trade-off + Other Models and Problems CS 1699: Intro to Computer Vision Bias-Variance Trade-off + Other Models and Problems Prof. Adriana Kovashka University of Pittsburgh November 3, 2015 Outline Support Vector Machines (review + other uses)

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin Object detection Wed Feb 24 Kristen Grauman UT Austin Announcements Reminder: Assignment 2 is due Mar 9 and Mar 10 Be ready to run your code again on a new test set on Mar 10 Vision talk next Tuesday 11

More information

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah

Human detection using histogram of oriented gradients. Srikumar Ramalingam School of Computing University of Utah Human detection using histogram of oriented gradients Srikumar Ramalingam School of Computing University of Utah Reference Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection,

More information

Hierarchical Image-Region Labeling via Structured Learning

Hierarchical Image-Region Labeling via Structured Learning Hierarchical Image-Region Labeling via Structured Learning Julian McAuley, Teo de Campos, Gabriela Csurka, Florent Perronin XRCE September 14, 2009 McAuley et al (XRCE) Hierarchical Image-Region Labeling

More information

Learning Object Representations for Visual Object Class Recognition

Learning Object Representations for Visual Object Class Recognition Learning Object Representations for Visual Object Class Recognition Marcin Marszalek, Cordelia Schmid, Hedi Harzallah, Joost Van de Weijer To cite this version: Marcin Marszalek, Cordelia Schmid, Hedi

More information

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016 CS 1674: Intro to Computer Vision Attributes Prof. Adriana Kovashka University of Pittsburgh November 2, 2016 Plan for today What are attributes and why are they useful? (paper 1) Attributes for zero-shot

More information

Efficient Kernels for Identifying Unbounded-Order Spatial Features

Efficient Kernels for Identifying Unbounded-Order Spatial Features Efficient Kernels for Identifying Unbounded-Order Spatial Features Yimeng Zhang Carnegie Mellon University yimengz@andrew.cmu.edu Tsuhan Chen Cornell University tsuhan@ece.cornell.edu Abstract Higher order

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Deformable Part Models

Deformable Part Models Deformable Part Models References: Felzenszwalb, Girshick, McAllester and Ramanan, Object Detec@on with Discrimina@vely Trained Part Based Models, PAMI 2010 Code available at hkp://www.cs.berkeley.edu/~rbg/latent/

More information

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

Object Recognition II

Object Recognition II Object Recognition II Linda Shapiro EE/CSE 576 with CNN slides from Ross Girshick 1 Outline Object detection the task, evaluation, datasets Convolutional Neural Networks (CNNs) overview and history Region-based

More information

Local Image Features

Local Image Features Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3

More information

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular

More information

Scenes vs. Objects: a Comparative Study of Two Approaches to Context Based Recognition

Scenes vs. Objects: a Comparative Study of Two Approaches to Context Based Recognition Scenes vs. Objects: a Comparative Study of Two Approaches to Context Based Recognition Andrew Rabinovich Google Inc. amrabino@google.com Serge Belongie University of California, San Diego sjb@cs.ucsd.edu

More information

Evaluation of GIST descriptors for web scale image search

Evaluation of GIST descriptors for web scale image search Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for

More information

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets

More information

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds Sudheendra Vijayanarasimhan Kristen Grauman Department of Computer Science University of Texas at Austin Austin,

More information

Sparse coding for image classification

Sparse coding for image classification Sparse coding for image classification Columbia University Electrical Engineering: Kun Rong(kr2496@columbia.edu) Yongzhou Xiang(yx2211@columbia.edu) Yin Cui(yc2776@columbia.edu) Outline Background Introduction

More information

Robotics Programming Laboratory

Robotics Programming Laboratory Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car

More information

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION

REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION REGION AVERAGE POOLING FOR CONTEXT-AWARE OBJECT DETECTION Kingsley Kuan 1, Gaurav Manek 1, Jie Lin 1, Yuan Fang 1, Vijay Chandrasekhar 1,2 Institute for Infocomm Research, A*STAR, Singapore 1 Nanyang Technological

More information

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010

Hierarchical Learning for Object Detection. Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010 Hierarchical Learning for Object Detection Long Zhu, Yuanhao Chen, William Freeman, Alan Yuille, Antonio Torralba MIT and UCLA, 2010 Background I: Our prior work Our work for the Pascal Challenge is based

More information

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b

An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b 5th International Conference on Advanced Materials and Computer Science (ICAMCS 2016) An Object Detection Algorithm based on Deformable Part Models with Bing Features Chunwei Li1, a and Youjun Bu1, b 1

More information

Automatic Discovery of Groups of Objects for Scene Understanding

Automatic Discovery of Groups of Objects for Scene Understanding Automatic Discovery of Groups of Objects for Scene Understanding Congcong Li Cornell University cl758@cornell.edu Devi Parikh Toyota Technological Institute (Chicago) dparikh@ttic.edu Tsuhan Chen Cornell

More information

Learning Deep Structured Models for Semantic Segmentation. Guosheng Lin

Learning Deep Structured Models for Semantic Segmentation. Guosheng Lin Learning Deep Structured Models for Semantic Segmentation Guosheng Lin Semantic Segmentation Outline Exploring Context with Deep Structured Models Guosheng Lin, Chunhua Shen, Ian Reid, Anton van dan Hengel;

More information

Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. UW EE

Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. UW EE Recognizing Deformable Shapes Salvador Ruiz Correa Ph.D. UW EE Input 3-D Object Goal We are interested in developing algorithms for recognizing and classifying deformable object shapes from range data.

More information

Deep condolence to Professor Mark Everingham

Deep condolence to Professor Mark Everingham Deep condolence to Professor Mark Everingham Towards VOC2012 Object Classification Challenge Generalized Hierarchical Matching for Sub-category Aware Object Classification National University of Singapore

More information

Region-based Segmentation and Object Detection

Region-based Segmentation and Object Detection Region-based Segmentation and Object Detection Stephen Gould 1 Tianshi Gao 1 Daphne Koller 2 1 Department of Electrical Engineering, Stanford University 2 Department of Computer Science, Stanford University

More information

Spatially Constrained Location Prior for Scene Parsing

Spatially Constrained Location Prior for Scene Parsing Spatially Constrained Location Prior for Scene Parsing Ligang Zhang, Brijesh Verma, David Stockwell, Sujan Chowdhury Centre for Intelligent Systems School of Engineering and Technology, Central Queensland

More information

CS 229 Midterm Review

CS 229 Midterm Review CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask

More information

Improving Recognition through Object Sub-categorization

Improving Recognition through Object Sub-categorization Improving Recognition through Object Sub-categorization Al Mansur and Yoshinori Kuno Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama 338-8570,

More information

Visual Object Recognition

Visual Object Recognition Visual Object Recognition -67777 Instructor: Daphna Weinshall, daphna@cs.huji.ac.il Office: Ross 211 Office hours: Sunday 12:00-13:00 1 Sources Recognizing and Learning Object Categories ICCV 2005 short

More information

Texton Clustering for Local Classification using Scene-Context Scale

Texton Clustering for Local Classification using Scene-Context Scale Texton Clustering for Local Classification using Scene-Context Scale Yousun Kang Tokyo Polytechnic University Atsugi, Kanakawa, Japan 243-0297 Email: yskang@cs.t-kougei.ac.jp Sugimoto Akihiro National

More information

Recognition problems. Face Recognition and Detection. Readings. What is recognition?

Recognition problems. Face Recognition and Detection. Readings. What is recognition? Face Recognition and Detection Recognition problems The Margaret Thatcher Illusion, by Peter Thompson Computer Vision CSE576, Spring 2008 Richard Szeliski CSE 576, Spring 2008 Face Recognition and Detection

More information

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space. Visual words Map high-dimensional descriptors to tokens/words by quantizing the feature space. Quantize via clustering; cluster centers are the visual words Word #2 Descriptor feature space Assign word

More information

A Taxonomy of Semi-Supervised Learning Algorithms

A Taxonomy of Semi-Supervised Learning Algorithms A Taxonomy of Semi-Supervised Learning Algorithms Olivier Chapelle Max Planck Institute for Biological Cybernetics December 2005 Outline 1 Introduction 2 Generative models 3 Low density separation 4 Graph

More information

Recognizing Deformable Shapes. Salvador Ruiz Correa (CSE/EE576 Computer Vision I)

Recognizing Deformable Shapes. Salvador Ruiz Correa (CSE/EE576 Computer Vision I) Recognizing Deformable Shapes Salvador Ruiz Correa (CSE/EE576 Computer Vision I) Input 3-D Object Goal We are interested in developing algorithms for recognizing and classifying deformable object shapes

More information