Visual Object Recognition

Similar documents
Object and Class Recognition I:

Today. Introduction to recognition Alignment based approaches 11/4/2008

Visual Object Recognition

EE795: Computer Vision and Intelligent Systems

Evaluation and comparison of interest points/regions

12/3/2009. What is Computer Vision? Applications. Application: Assisted driving Pedestrian and car detection. Application: Improving online search

Computer Vision for HCI. Topics of This Lecture

Object recognition (part 1)

Local invariant features

Object Category Detection. Slides mostly from Derek Hoiem

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

Object Recognition with Invariant Features

Lecture 16: Object recognition: Part-based generative models

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

What do we mean by recognition?

Category-level localization

Snakes, level sets and graphcuts. (Deformable models)

Prof. Feng Liu. Spring /26/2017

Recognizing Deformable Shapes. Salvador Ruiz Correa UW Ph.D. Graduate Researcher at Children s Hospital

Part based models for recognition. Kristen Grauman

Lecture 12 Recognition

Image Features: Detection, Description, and Matching and their Applications

Lecture 14: Indexing with local features. Thursday, Nov 1 Prof. Kristen Grauman. Outline

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

Local features: detection and description. Local invariant features

Basic Problem Addressed. The Approach I: Training. Main Idea. The Approach II: Testing. Why a set of vocabularies?

Comparison of Local Feature Descriptors

CS4670: Computer Vision

Local Image Features

Lecture: RANSAC and feature detectors

Lecture 12 Recognition. Davide Scaramuzza

Video Google faces. Josef Sivic, Mark Everingham, Andrew Zisserman. Visual Geometry Group University of Oxford

Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. UW EE

Category vs. instance recognition

Announcements. Recognition (Part 3) Model-Based Vision. A Rough Recognition Spectrum. Pose consistency. Recognition by Hypothesize and Test

High-Level Computer Vision

EE795: Computer Vision and Intelligent Systems

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Part-based and local feature models for generic object recognition

Automatic Image Alignment (feature-based)

Recognition (Part 4) Introduction to Computer Vision CSE 152 Lecture 17

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Lecture 10 Detectors and descriptors

CS5670: Intro to Computer Vision

Beyond Bags of Features

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

Recognizing Deformable Shapes. Salvador Ruiz Correa (CSE/EE576 Computer Vision I)

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Patch Descriptors. EE/CSE 576 Linda Shapiro

Shape recognition with edge-based features

Feature Based Registration - Image Alignment

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

School of Computing University of Utah

Patch Descriptors. CSE 455 Linda Shapiro

Patch-based Object Recognition. Basic Idea

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Local features: detection and description May 12 th, 2015

Outline 7/2/201011/6/

Image Features. Work on project 1. All is Vanity, by C. Allan Gilbert,

Local Image Features

Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. Thesis, Electrical Engineering

Image matching. Announcements. Harder case. Even harder case. Project 1 Out today Help session at the end of class. by Diva Sian.

Deformation Invariant Image Matching

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Local features and image matching. Prof. Xin Yang HUST

2D Image Processing Feature Descriptors

Automatic Image Alignment

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

Object Detection. Sanja Fidler CSC420: Intro to Image Understanding 1/ 1

A Tale of Two Object Recognition Methods for Mobile Robots

Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map?

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Local Feature Detectors

Recognition with Bag-ofWords. (Borrowing heavily from Tutorial Slides by Li Fei-fei)

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

Human Detection and Action Recognition. in Video Sequences

Wikipedia - Mysid

Local Features: Detection, Description & Matching

Local cues and global constraints in image understanding

Motion Estimation and Optical Flow Tracking

Motion illusion, rotating snakes

Evaluation of GIST descriptors for web scale image search

Find that! Visual Object Detection Primer

Today. Main questions 10/30/2008. Bag of words models. Last time: Local invariant features. Harris corner detector: rotation invariant detection

Beyond Bags of features Spatial information & Shape models

Harder case. Image matching. Even harder case. Harder still? by Diva Sian. by swashford

Object Category Detection: Sliding Windows

Lecture 6: Finding Features (part 1/2)

Local Features based Object Categories and Object Instances Recognition

CLASSIFICATION Experiments

Introduction. Introduction. Related Research. SIFT method. SIFT method. Distinctive Image Features from Scale-Invariant. Scale.

Harder case. Image matching. Even harder case. Harder still? by Diva Sian. by swashford

3-D Object Recognition Using 2-D Views Wenjing Li, Member, IEEE, George Bebis, Member, IEEE, and Nikolaos G. Bourbakis, Fellow, IEEE

Supervised learning. y = f(x) function

Deformable Part Models

Object Classification for Video Surveillance

The SIFT (Scale Invariant Feature

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Transcription:

Visual Object Recognition -67777 Instructor: Daphna Weinshall, daphna@cs.huji.ac.il Office: Ross 211 Office hours: Sunday 12:00-13:00 1

Sources Recognizing and Learning Object Categories ICCV 2005 short courses Li Fei-Fei(UIUC), Rob Fergus (Oxford-MIT), Antonio Torralba(MIT) http://people.csail.mit.edu/torralba/iccv2005 Object Recognition (UPenn, CSE399b Spring 2007) Jianbo Shi (UPenn) Recognition and Matching based on local invariant features Cordelia Schmid(INRIA), David Lowe (UBC) Visual Object Recognition AAAI tutorial, July 2008 Bastian Leibe(ETH Zurich), Kristen Grauman(U Texas, Austin) 2

What is Visual Recognition About Finding and recognizing objects in a picture o This is a car (class recognition) o Look, there is Obama (object identification) o Shmuel is the person on the left (localization) Problems: the same object may appear very differently o Change of viewpoint (facial profile vs. fronto-parallel) o Change of illumination o Object articulation (a person lifting up his hand) o Intra-class variability (there are many types of cars) 3

Object Identification???? 4 Visual Object Recognition - 67777

Object Categorization How to recognize ANY car How to recognize ANY cow 5

What could be done with recognition algorithms? There is a wide range of applications, including Autonomous robots Navigation, driver safety Content-based retrieval and analysis for images and videos Situated search Medical image analysis 6

Object Categorization Task Description o Given a small number of training images of a category, recognize a-priori unknown instances of that category and assign the correct category label. Which categories are feasible visually? o Extensively studied in Cognitive Psychology, e.g. [Brown 58] Fido German dog animal living 7 shepherd being

Visual Object Categories Basic Level Categories in human categorization [Rosch 76, Lakoff 87] o The highest level at which category members have similar perceived shape o The highest level at which a single mental image reflects the entire category o The level at which human subjects are usually fastest at identifying category members o The first level named and understood by children o The highest level at which a person uses similar motor actions for interaction with category members 8

Visual Object Categories Basic-level categories in humans seem to be defined predominantly visually. There is evidence that humans (usually) start with basic-level categorization before doing identification. Basic-level categorization is easier and faster for humans than object identification! Most promising starting point for visual classification Individual level recognition is easier for most automatic methods Basic level Individual level Abstract levels dog German shepherd Fido animal quadruped cat cow Doberman 9

Other Types of Categories Functional Categories e.g., chairs = something you can sit on 10

Other Types of Categories Ad-hoc categories e.g., something you can find in an office environment 11

Levels of Object Categorization cow car motorbike Different levels of recognition o Which object class is in the image? o Where is it in the image? o Where exactly which pixels? Obj/Img classification Detection/Localization Figure/Ground segmentation 12

What makes visual object recognition difficult? Geometrical transformations (viewpoint translation, rotation, scale) Photometric transformations (Illumination) Deformations Intra-class variations Scene obstructions: occlusion, clutter 13

pose Michelangelo 1475-1564 14

scale 15 Visual Object Recognition - 67777

illumination 16 slide credit: S. Ullman Visual Object Recognition - 67777

deformations Xu, Beihong 1943 17

intra-class variation 18 Visual Object Recognition - 67777

occlusion Magritte, 1957 19 Visual Object Recognition - 67777

background clutter Klimt, 1913 20 Visual Object Recognition - 67777

Context 21 Visual Object Recognition - 67777

Context 22 Image credit: D. Hoeim Visual Object Recognition - 67777

Context 23 Visual Object Recognition - 67777

Example: Detection in Crowded Scenes Learn object variability o Changes in appearance, scale, and articulation Compensate for clutter, overlap, and occlusion 24

Teachers are scarce how much supervision? Less More 25 Visual Object Recognition - 67777

Rough evolution of focus in recognition research 1980s rigid planar objects 1990s to early 2000s Currently higher level categories 26

Recognition system Feature extraction: use a set of basic features Learning: learn representation of objects from training images (sometimes of minimal size or larger) Recognition: decide whether the object exists in the image odetection o Localization o Segmentation 27

Object or image representation A picture is a function (array) of grey level values I 10 250 I(x,y) 20 200 30 150 Visual Object Recognition - 67777 y 40 50 60 70 10 20 30 40 50 60 70 80 90 x First challenge: obtain informative representation o Global representation o Local features representation 100 50 0 80 60 40 y 20 0 0 20 40 x 60 80 100 28

Global representations: limitations All parts of the image or window impact the description -> sensitive to occlusion, clutter, viewpoint 29

Local representations Describe component regions or patches separately: SIFT [Lowe 99] Salient regions [Kadir 01] Shape context [Belongie 02] Harris-Affine [Mikolajczyk 04] Superpixels [Ren et al.] Spin images [Johnson 99] Maximally Stable Extremal Regions [Matas 02] Geometric Blur [Berg 05] 30

Main issues with feature detection: 1. Where features are detected: oa single feature for the whole object (global representation) oon a grid o Random sampling o At interest points 2. How feature regions are described o Fixed window (suitable for global representation) ovary size of window ovary shape of window 31

Learning representation: When number of features is not prohibitive (e.g., number of pixels, quantized color values), suitable for global representation o Vector (ordered list) of all features o Histogram (distribution) of feature values Group of possible features is very large o Set of features in object o Set of object features, related to each other by a Graphical model o Set of object features, where the image is some projection transformation of those features This representation is learned, or computed, from a set of training images 32

Recognition Global representation: sliding window Car/non-car Classifier 33

Discussion Global representations have major limitations Alternatively, describe and match only local regions Increased robustness to o Occlusions o Articulation o Intra-category variations d φ θ θ q φ d q 34

Recognition Representation with local features: o Compute local features in the image o Intersect set of image features with set of object features or o Compute probability for image features based on object s graphical model or o Check whether the image is a permissible transformation of the object: invariants, indexing + verification, geometric hashing suitable for recognition at individual level 35

Organizaiton of the course About 4 classes of frontal lectures, discussing: o Features: detectors (part1) and descriptors (part2) orepresentation of objects: vectors, histograms, bag of visual words, constellation models, 3D models (learning) orecognition of objects: probabilistic approaches, geometrical approaches (inference) Remaining classes: student presentations of recent papers on object recognition (seminar format), 2-3 papers per class Final project 36