EE795: Computer Vision and Intelligent Systems

Similar documents
Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

What do we mean by recognition?

EE795: Computer Vision and Intelligent Systems

Object recognition (part 1)

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Recognition problems. Face Recognition and Detection. Readings. What is recognition?

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

CS5670: Intro to Computer Vision

Object and Class Recognition I:

Visual Object Recognition

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Object detection as supervised classification

Computer Vision. Exercise Session 10 Image Categorization

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Snakes, level sets and graphcuts. (Deformable models)

Object Category Detection: Sliding Windows

Local cues and global constraints in image understanding

Object Category Detection. Slides mostly from Derek Hoiem

Lecture 16: Object recognition: Part-based generative models

Category vs. instance recognition

2%34 #5 +,,% ! # %& ()% #% +,,%. & /%0%)( 1 ! # %&# (&)# +, %& ./01 /1&2% # /&

Supervised Sementation: Pixel Classification

Image classification Computer Vision Spring 2018, Lecture 18

Recognizing and Learning Object Categories. Li Fei-Fei, Stanford Rob Fergus, NYU Antonio Torralba, MIT

Beyond Bags of Features

Course Administration

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Local Patch Descriptors

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function

Generative and discriminative classification techniques

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Lecture 12 Recognition

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 223B Computer Vision Problem Set 3

Today. Introduction to recognition Alignment based approaches 11/4/2008

Tracking Computer Vision Spring 2018, Lecture 24

Local Feature Detectors

Lecture 9: Hough Transform and Thresholding base Segmentation

Recap Image Classification with Bags of Local Features

Part-based and local feature models for generic object recognition

Local Features and Bag of Words Models

Object Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision

Image Classification pipeline. Lecture 2-1

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

EECS 442 Computer Vision fall 2011

Image processing & Computer vision Xử lí ảnh và thị giác máy tính

Image Classification pipeline. Lecture 2-1

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Computer Vision for HCI. Topics of This Lecture

Eppur si muove ( And yet it moves )

Lecture 12 Recognition. Davide Scaramuzza

Homework. Gaussian, Bishop 2.3 Non-parametric, Bishop 2.5 Linear regression Pod-cast lecture on-line. Next lectures:

Bayes Classifiers and Generative Methods

Modern Object Detection. Most slides from Ali Farhadi

Classification Algorithms in Data Mining

9.913 Pattern Recognition for Vision. Class I - Overview. Instructors: B. Heisele, Y. Ivanov, T. Poggio

Local features: detection and description. Local invariant features

Machine Learning Lecture 3

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

Recap: Gaussian (or Normal) Distribution. Recap: Minimizing the Expected Loss. Topics of This Lecture. Recap: Maximum Likelihood Approach

Instance-level recognition

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

Computer Vision with MATLAB MATLAB Expo 2012 Steve Kuznicki

Bayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

Instance-level recognition

TA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999)

Classifiers for Recognition Reading: Chapter 22 (skip 22.3)

Part based models for recognition. Kristen Grauman

Face/Flesh Detection and Face Recognition

Image Analysis. Window-based face detection: The Viola-Jones algorithm. iphoto decides that this is a face. It can be trained to recognize pets!

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Automatic Image Alignment (feature-based)

Applications Video Surveillance (On-line or off-line)

Pattern recognition (3)

Local Image Features

Machine Learning Lecture 3

Local Features: Detection, Description & Matching

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Bias-Variance Trade-off (cont d) + Image Representations

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

PART VI High-Level Vision: Probabilistic and Inferential Methods

Patch-based Object Recognition. Basic Idea

Automatic Tracking of Moving Objects in Video for Surveillance Applications

Classification and Detection in Images. D.A. Forsyth

Designing Applications that See Lecture 7: Object Recognition

Lecture 7: Segmentation. Thursday, Sept 20

Why is computer vision difficult?

Object Detection. Sanja Fidler CSC420: Intro to Image Understanding 1/ 1

Window based detectors

Machine Learning. Classification

Local features: detection and description May 12 th, 2015

Local Features Tutorial: Nov. 8, 04

Segmentation and Tracking of Partial Planar Templates

EE795: Computer Vision and Intelligent Systems

Transcription:

EE795: Computer Vision and Intelligent Systems Spring 2013 TTh 17:30-18:45 FDH 204 Lecture 18 130404 http://www.ee.unlv.edu/~b1morris/ecg795/

2 Outline Object Recognition Intro (Chapter 14) Slides from Steve Seitz, Washington Excellent References http://people.csail.mit.edu/torralba/shortcourserloc/index.html http://web.eecs.umich.edu/~silvio/teaching/lectures/lecture19.pdf

3 Object Recognition Introduction The Margaret Thatcher Illusion, by Peter Thompson Steve Seitz

4 Object Recognition Introduction The Margaret Thatcher Illusion, by Peter Thompson Steve Seitz

What do we mean by object recognition? Next 15 slides adapted from Li, Fergus, & Torralba s excellent short course on category and object recognition 5

Verification: is that a lamp? 6

Detection: are there people? 7

Identification: is that Potala Palace? 8

Object categorization mountain tree banner building street lamp people vendor 9

Scene and context categorization outdoor city 10

Object recognition Is it really so hard? Find the chair in this image Output of normalized correlation This is a chair 11

Object recognition Is it really so hard? Find the chair in this image Pretty much garbage Simple template matching is not going to make it 12

Object recognition Is it really so hard? Find the chair in this image A popular method is that of template matching, by point to point correlation of a model pattern with the image pattern. These techniques are inadequate for three-dimensional scene analysis for many reasons, such as occlusion, changes in viewing angle, and articulation of parts. Nivatia & Binford, 1977. 13

Why not use SIFT matching for everything? Works well for object instances Not great for generic object categories 14

Applications: Computational photography 15

meters Applications: Assisted driving Pedestrian and car detection Ped Ped Car meters Lane detection Collision warning systems with adaptive cruise control, Lane departure warning systems, Rear object detection 16 systems,

Applications: image search 17

Challenges: viewpoint variation Michelangelo 1475-1564 18

Challenges: illumination variation 19 slide credit: S. Ullman

Challenges: occlusion Magritte, 1957 20

Challenges: scale 21

Challenges: deformation 22 Xu, Beihong 1943

Challenges: background clutter Klimt, 1913 23

Challenges: intra-class variation 24

25 Recognition problems What is it? Object and scene recognition Who is it? Identity recognition Where is it? Object detection What are they doing? Activities All of these are classification problems Choose one class from a list of possible candidates Steve Seitz

26 What is recognition? A different taxonomy from [Csurka et al. 2006]: Recognition Where is this particular object? Categorization What kind of object(s) is(are) present? Content-based image retrieval Find me something that looks similar Detection Locate all instances of a given class Steve Seitz

Face detection How to tell if a face is present? 27

One simple method: skin detection skin Skin pixels have a distinctive range of colors Corresponds to region(s) in RGB color space Skin classifier for visualization, only R and G components are shown above A pixel X = (R,G,B) is skin if it is in the skin region But how to find this region? 28

Skin detection Learn the skin region from examples Manually label pixels in one or more training images as skin or not skin Plot the training data in RGB space skin pixels shown in orange, non-skin pixels shown in blue some skin pixels may be outside the region, non-skin pixels inside. Why? Skin classifier Given X = (R,G,B): how to determine if it is skin or not? 29

Skin classification techniques Skin classifier Given X = (R,G,B): how to determine if it is skin or not? Nearest neighbor find labeled pixel closest to X choose the label for that pixel Data modeling fit a model (curve, surface, or volume) to each class Probabilistic data modeling fit a probability model to each class 30

Probability Basic probability X is a random variable P(X) is the probability that X achieves a certain value called a PDF -probability distribution/density function -a 2D PDF is a surface, 3D PDF is a volume or continuous X discrete X Conditional probability: P(X Y) probability of X given that we already know Y 31

Probabilistic skin classification Now we can model uncertainty Each pixel has a probability of being skin or not skin Skin classifier Given X = (R,G,B): how to determine if it is skin or not? Choose interpretation of highest probability set X to be a skin pixel if and only if Where do we get and? 32

Learning conditional PDF s We can calculate P(R skin) from a set of training images It is simply a histogram over the pixels in the training images each bin R i contains the proportion of skin pixels with color R i This doesn t work as well in higher-dimensional spaces. Why not? Approach: fit parametric PDF functions common choice is rotated Gaussian center covariance 33» orientation, size defined by eigenvecs, eigenvals

Learning conditional PDF s We can calculate P(R skin) from a set of training images It is simply a histogram over the pixels in the training images each bin R i contains the proportion of skin pixels with color R i But this isn t quite what we want Why not? How to determine if a pixel is skin? We want P(skin R) not P(R skin) How can we get it? 34

Bayes rule In terms of our problem: what we measure (likelihood) domain knowledge (prior) what we want (posterior) normalization term The prior: P(skin) Could use domain knowledge P(skin) may be larger if we know the image contains a person for a portrait, P(skin) may be higher for pixels in the center Could learn the prior from the training set. How? P(skin) may be proportion of skin pixels in training set 35

Bayesian estimation likelihood posterior (unnormalized) Bayesian estimation = minimize probability of misclassification Goal is to choose the label (skin or ~skin) that maximizes the posterior this is called Maximum A Posteriori (MAP) estimation Suppose the prior is uniform: P(skin) = P(~skin) = 0.5 in this case, maximizing the posterior is equivalent to maximizing the likelihood» if and only if this is called Maximum Likelihood (ML) estimation 36

Skin detection results 37

General classification This same procedure applies in more general circumstances More than two classes More than one dimension Example: face detection Here, X is an image region dimension = # pixels each face can be thought of as a point in a high dimensional space H. Schneiderman, T. Kanade. "A Statistical Method for 3D Object Detection Applied to Faces and Cars". IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2000) http://www-2.cs.cmu.edu/afs/cs.cmu.edu/user/hws/www/cvpr00.pdf H. Schneiderman and T.Kanade 38