TA Section: Problem Set 4

Similar documents
Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Part-based and local feature models for generic object recognition

Part based models for recognition. Kristen Grauman

CS 231A Computer Vision (Fall 2011) Problem Set 4

Object Classification Problem

Beyond Bags of Features

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013

Bag-of-features. Cordelia Schmid

CS 231A Computer Vision (Fall 2012) Problem Set 4

CLASSIFICATION Experiments

CS6670: Computer Vision

Sparse coding for image classification

Multiple Kernel Learning for Emotion Recognition in the Wild

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials

Local Features and Bag of Words Models

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Fisher vector image representation

Supervised learning. y = f(x) function

Visual Object Recognition

Lecture 16: Object recognition: Part-based generative models

Generative and discriminative classification techniques

Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction

Deformable Part Models

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Rich feature hierarchies for accurate object detection and semantic segmentation

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei)

CS 231A CA Session: Problem Set 4 Review. Kevin Chen May 13, 2016

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study

Descriptors for CV. Introduc)on:

Bias-Variance Trade-off (cont d) + Image Representations

Object Detection with Discriminatively Trained Part Based Models

Action recognition in videos

Large-Scale Live Active Learning: Training Object Detectors with Crawled Data and Crowds

Object Classification for Video Surveillance

Image classification Computer Vision Spring 2018, Lecture 18

Distances and Kernels. Motivation

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Contents I IMAGE FORMATION 1

Beyond Bags of features Spatial information & Shape models

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition

Applying Supervised Learning

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Naïve Bayes, Gaussian Distributions, Practical Applications

Recap Image Classification with Bags of Local Features

Object Category Detection. Slides mostly from Derek Hoiem

Discriminative classifiers for image recognition

Classification and Detection in Images. D.A. Forsyth

CS6716 Pattern Recognition

Efficient Kernels for Identifying Unbounded-Order Spatial Features

Improved Spatial Pyramid Matching for Image Classification

Object recognition. Methods for classification and image representation

Beyond bags of Features

arxiv: v3 [cs.cv] 3 Oct 2012

Modern Object Detection. Most slides from Ali Farhadi

Category-level localization

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

Shape Matching. Brandon Smith and Shengnan Wang Computer Vision CS766 Fall 2007

Exploring Bag of Words Architectures in the Facial Expression Domain

High Level Computer Vision

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.

Artistic ideation based on computer vision methods

Patch Descriptors. CSE 455 Linda Shapiro

Bias-Variance Trade-off + Other Models and Problems

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015

Supervised learning. y = f(x) function

String distance for automatic image classification

Recognition. Topics that we will try to cover:

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning Compact Visual Attributes for Large-scale Image Classification

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

Large-scale visual recognition The bag-of-words representation

Multiple-Choice Questionnaire Group C

Object recognition (part 1)

Part-Based Models for Object Class Recognition Part 3

Metric learning approaches! for image annotation! and face recognition!

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari

Object Detection Based on Deep Learning

Category vs. instance recognition

Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding Feature

Structured Learning. Jun Zhu

Machine Learning Crash Course

What do we mean by recognition?

Automatic Colorization of Grayscale Images

SEMANTIC SEGMENTATION AS IMAGE REPRESENTATION FOR SCENE RECOGNITION. Ahmed Bassiouny, Motaz El-Saban. Microsoft Advanced Technology Labs, Cairo, Egypt

Machine Learning. Chao Lan

Using Machine Learning for Classification of Cancer Cells

Recognition Tools: Support Vector Machines

Lecture 15: Object recognition: Bag of Words models & Part based generative models

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Deep Learning for Object detection & localization

Patch Descriptors. EE/CSE 576 Linda Shapiro

CSE 158. Web Mining and Recommender Systems. Midterm recap

Computer Vision Lecture 16

Transcription:

TA Section: Problem Set 4

Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model Spatial Pyramid Matching (SPM) ObjectBank

Discriminative vs Generative Classifiers Training classifiers involves estimating f: X Y, or P(Y X) Discriminative classifiers (e.g. logistic regression, SVM): We want to model P(Y X) Assume some functional form for P(Y X) Estimate parameters of P(Y X) directly from training data Generative classifiers (e.g. naïve bayes): We want to model P(X, Y) Assume some functional form for P(X Y), P(X) Estimate parameters of P(X Y), P(X) directly from training data Use Bayes rule to calculate P(Y X=x i )

Discriminative vs Generative Classifiers Advantages of discriminative classifiers: Typically faster at making predictions Tend to have better performance Direct modeling of what we want to optimize Advantage of generative classifiers: Can handle missing/partially labeled data A new class (Y+1) can be added incrementally without training the complete model Can generate samples from the training distribution [Ulusoy & Bishop, 2005]

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Bag-of-words Representation Clustering (e.g. K-means) 19 Local features (e.g. SIFT features) Using minimum Euclidean distance! Dictionary or Code book Feature generation 19 Normalization 19 (Why?) Count Bag-of-words! Histogram Assignment Assign feature to a dictionary word

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Generat ive probabilist ic model ( 2) Foreground model Gaussian shape pdf Gaussian part appearance pdf Gaussian relative scale pdf log(scale) Prob. of detection Clutter model Uniform shape pdf Gaussian background appearance pdf 0.8 0.75 0.9 Uniform relative scale pdf log(scale) Poission pdf on # detections

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Pictorial Structures Basic idea: We would like to represent an object by a collection of parts arranged in a deformable configuration Parts sharing similar appearance Examples of detected models

Pictorial Structures Local model of appearance with non-local geometric or spatial constraints Simultaneous use of appearance and spatial information Simple part models alone are not discriminative The model needs to solve the tasks: determine whether an object is visible in an image determine where an object is in the image

Pictorial Structures Model is represented as an undirected graph structure G= (V, E), where V are the vertices and E are the edges Matching score of individual parts Sum over all edges Optimal parts configuration Sum over all parts Deformation score of connected parts

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Start with Pyramid Matching Kernel for BoW Models Sets of features [Grauman & Darrell, 2005]

Pyramid Matching Kernel How do we build a discriminative classifier using the set representation? Kernel-based methods (e.g. SVM) are appealing for efficiency and generalization power. But what is an appropriate kernel? Each instance is an unordered set of vectors Varying number of vectors per instance

Pyramid Matching Kernel We can compare sets by computing a partial matching between their features Number of newly matched pairs at level i Approximate partial match similarity Measure of difficulty of a match at level i

Pyramid Matching Kernel (Example) Level 0

Pyramid Matching Kernel (Example) Level 1

Pyramid Matching Kernel optimal partial matching between sets of features difficulty of a match at level i number of new matches at level i

Spatial Pyramid Matching Pyramid Match Kernel (Grauman & Darrell) Pyramid in feature space, ignore location Spatial Pyramid (Lazebnik et al) Pyramid in image space, quantize features Features: OR Weak (edge orientations) Strong (SIFT)

Spatial Pyramid Matching

Spatial Pyramid Matching

Image Representation Bag of Words Weakly Spatial Models Part-based Spatial Pyramid Matching Object- Bank Constellation Model Pictorial Structure No spatial info. Spatial Specificity of Parts Very strong but sparse

Li et al. 2010 Image classification Event: Sailing High level tasks Object Bank High Level Objects based Sailboat, water, sky, tree, Image representation Semantic Gap Low level feature HoG, Gist, SIFT, Color, Texture, Bag of Words (BoW), Spatial Pyramid (SPM)

Li et al. 2010 Object Bank representation Sailboat response Object Filters Spatial pooling

Li et al. 2010 Object Bank representation Sailboat response Object Filters Spatial pooling Object size variance Small Median Large

Li et al. 2010 Object Bank representation Sailboat response Bear response Water response Object Filters Spatial pooling Implementation details ~ 200 object detectors Felzenswalb et al. 2008 Hoeim et al. 2005 3-level spatial pyramid for each grid: max of each object

Li et al. 2010 Object Bank representation Object Filters Spatial pooling Sailboat response Bear response Water response N N N N M x N M x N N: ngridsperscale, M: nscale, O: nobject M x N x O= thousands E.g., In our setting: 12 x 21x 177=44604

A word about Q2 in PS4 We d like you to understand the differences between BoW, SPM, and ObjectBank We d like you to use what you ve learned so far, be creative, and come up with interesting ways of encoding image information for an image recognition task Extra credits are given especially to innovation and good performances