Object Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing

Similar documents
Bayes Risk. Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Discriminative vs Generative Models. Loss functions in classifiers

Classifiers for Recognition Reading: Chapter 22 (skip 22.3)

Lecture 12 Recognition

Robotics Programming Laboratory

Supervised Sementation: Pixel Classification

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

CS 343H: Honors AI. Lecture 23: Kernels and clustering 4/15/2014. Kristen Grauman UT Austin

Machine Learning Classifiers and Boosting

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

Designing Applications that See Lecture 7: Object Recognition

Announcements. CS 188: Artificial Intelligence Spring Generative vs. Discriminative. Classification: Feature Vectors. Project 4: due Friday.

Day 3 Lecture 1. Unsupervised Learning

Discriminative classifiers for image recognition

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Lecture 12 Recognition. Davide Scaramuzza

2. Basic Task of Pattern Classification

Case-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.

CS 188: Artificial Intelligence Fall 2008

CS5670: Computer Vision

Support Vector Machines

Instance-based Learning CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2015

ECG782: Multidimensional Digital Signal Processing

CSE 573: Artificial Intelligence Autumn 2010

Part-Based Models for Object Class Recognition Part 3

Local features: detection and description. Local invariant features

Local Features and Bag of Words Models

Local features: detection and description May 12 th, 2015

Classification and Detection in Images. D.A. Forsyth

Part-based and local feature models for generic object recognition

Announcements. CS 188: Artificial Intelligence Spring Classification: Feature Vectors. Classification: Weights. Learning: Binary Perceptron

Feature Extractors. CS 188: Artificial Intelligence Fall Some (Vague) Biology. The Binary Perceptron. Binary Decision Rule.

Classification: Feature Vectors

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

9.913 Pattern Recognition for Vision. Class I - Overview. Instructors: B. Heisele, Y. Ivanov, T. Poggio

Machine Learning. Classification

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Local Features Tutorial: Nov. 8, 04

Image classification Computer Vision Spring 2018, Lecture 18

Automatic Image Alignment (feature-based)

Class 5: Attributes and Semantic Features

Local Feature Detectors

Local Image Features

Generative and discriminative classification techniques

Local Image Features

Content Based Image Retrieval

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Lecture 3. Oct

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

Graphical Models for Computer Vision

Object and Action Detection from a Single Example

Automatic Image Alignment

Computer Vision. Exercise Session 10 Image Categorization

Feature Matching and Robust Fitting

2D Image Processing Feature Descriptors

PATTERN CLASSIFICATION AND SCENE ANALYSIS

CS 343: Artificial Intelligence

Perceptron as a graph

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

CS4670: Computer Vision

VECTOR SPACE CLASSIFICATION

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Learning to Recognize Faces in Realistic Conditions

Introduction to SLAM Part II. Paul Robertson

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

The Kinect Sensor. Luís Carriço FCUL 2014/15

Image Processing, Analysis and Machine Vision

More Learning. Ensembles Bayes Rule Neural Nets K-means Clustering EM Clustering WEKA

Recognition Tools: Support Vector Machines

The Boundary Graph Supervised Learning Algorithm for Regression and Classification

Patch-based Object Recognition. Basic Idea

Face/Flesh Detection and Face Recognition

COS Lecture 13 Autonomous Robot Navigation

CS5670: Computer Vision

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

CS6670: Computer Vision

Instance-based Learning

Keypoint-based Recognition and Object Search

An Introduction to Pattern Recognition

Digital Image Processing

Data Mining Classification: Alternative Techniques. Lecture Notes for Chapter 4. Instance-Based Learning. Introduction to Data Mining, 2 nd Edition

CS 340 Lec. 4: K-Nearest Neighbors

Pattern Recognition in Image Analysis

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

Feature Extractors. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. The Perceptron Update Rule.

Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map?

Content-Based Image Classification: A Non-Parametric Approach

CAP 5415 Computer Vision Fall 2012

ECE 172A: Introduction to Intelligent Systems: Machine Vision, Fall Midterm Examination

Kernels + K-Means Introduction to Machine Learning. Matt Gormley Lecture 29 April 25, 2018

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Local invariant features

Local features and image matching. Prof. Xin Yang HUST

Window based detectors

Final Exam Study Guide

Generative and discriminative classification techniques

Machine Learning. Chao Lan

Transcription:

Object Recognition Lecture 11, April 21 st, 2008 Lexing Xie EE4830 Digital Image Processing http://www.ee.columbia.edu/~xlx/ee4830/ 1

Announcements 2 HW#5 due today HW#6 last HW of the semester Due May 5 th

Roadmap to Date 3 Spatial Domain processing and enhancement Image Transform and Filtering Morphological Processing Image Segmentation Image Descriptors Applicatoions: Image Restoration, Object recognition, Image Compression, Indexing and Retrieval, Reconstruction

Lecture Outline 4 Problem: object recognition from images. What and why Pattern recognition primer Object recognition in controlled environments State of the art object recognition systems

What is Object Recognition? 5 perceptible material thing Courtesy of http://people.csail.mit.edu/torralba/iccv2005/

What is Object Recognition? 6 Sensory data Color, texture, shape, motion, size, weight, smell, touch, sound, Descriptions toy, stuffed Pooh, a frontal, close-up shot of stuffed Pooh, ToysRus #345812, One of the fundamental problems of computer vision: + Apple

Why? 7 Science How do we recognize objects? Practice Robot navigation Medical diagnosis Security Industrial inspection and automation Human-computer interface Information retrieval

Applications of Object Recognition 8 Some images from http://www.cs.utexas.edu/~grauman/research/research.html

9

Lecture Outline 10 Object recognition: what and why Object recognition in controlled environments Distance-based classifiers generalized linear classifiers Neural networks Bayes classifiers Object recognition in practice General object recognition systems Summary

Objects as Vectors 11

image vector representation 12

pattern classifier from examples 13 goal: given x, infer y learning from examples: supervised learning given (x i, y i =f(x i )), i=1,,n for some unknown function f find a good approximation to f rules versus data encode human knowledge as rules e.g. the petal length and width of iris appropriate scenarios for supervised learning no human expert (predict strength to cure AIDS given new molecule structure) human can perform task but can t describe how they do it (e.g. handwriting recognition, object recognition) the desired function is changing constantly w.r.t. time, or user (stock trading decisions, user-specific spam filtering)

minimum distance classifier 14 step 1: calculate class prototypes as the means step 2: use the prototypes to classify a new example discriminant function f:

nearest neighbor classifier 15 steps: store all training examples classify a new example x? by finding the training example (x i, y i ) that s nearest to x? according to Euclidean distance, and copying the labels

nearest neighbor classifier 16 discriminant function f: gray area -1; white area +1 (implicit) decision boundaries form a subset of the Voronoi diagram of the training data each line segment is equidistant between two points comments prone to noisy, poorly scaled features conditioned on the distance metric smooth the decision by looking at K-neighbors and vote good news: knn is universally consistent

linear discriminant 17 two desirables explicit (linear) decision boundary use many training examples/prototypes but do no need to remember all!

the perceptron algorithm 18 learning a linear classifier given training data (x_i, y_i) and loss function find: weight vector [w; b] that minimizes expected loss on training data use hinge loss: start from initial weights w 0 compute gradient update repeat until convergence η : learning rate

computing the gradient 19 given compute gradient η must decrease to zero in order to guarantee convergence. some algorithms (Newton s) can automatically select η. local minimum is the global minimum for hinge loss

Support Vector Machines 20 Two key ideas: The best separating hyperplane has the largest margin. Class boundary can be linear in a higherdimensional space, e.g., generalized linear discriminant weighted inner product with support vectors

Neural Networks 21

Digit Recognition with Neural Net 22 0 1 LeCun et al, 1992, 1998, http://yann.lecun.com/exdb/mnist/

probabilistic classifiers 23 what about probabilities p(x y) is usually easy to obtain from training data can we estimate p(y x)?

Bayes classifier 24

Bayes classifier for Gaussian classes 25

26

Bayes classifier example 27

28

Lecture Outline 29 object recognition: what and why object recognition as pattern classification general object recognition systems object recognition: a systems view current commercial systems real-world challenges survey of state-of the art demo websites

Object Recognition End-to-End 30 training testing Recognition Target Sensor(s) images Pre-processing Feature Extraction train classifier segments features Sensor(s) Pre-processing Feature Extraction Classification window parts or patches salient points pixel vector descriptors classes Postprocessing face? car? statue of liberty? spatial/temporal smoothing context, etc.

Object Recognition in Practice 31 Commercial object recognition Currently a $4 billion/year industry for inspection and assembly Almost entirely based on template matching Upcoming applications Mobile robots, toys, user interfaces Location recognition Digital camera panoramas, 3D scene modeling courtesy of David Lowe, website and CVPR 2003 Tutorial

Industrial Applications 32 http://www.cs.ubc.ca/spider/lowe/vision.html

http://www.appian-tech.com/ 33

http://www.sportvision.com/ 34

http://www.dipix.com/ 35

What to Recognize 36 Specific Wild card Tower Bridge The Stata Center Categories butterfly butterfly building building Kristen Grauman, http://www.cs.utexas.edu/~grauman/research/

Challenges of object recognition 37

Recognize Specific Objects (1) 38 Appearance Matching [Nayar, Murase et. al.]

39 PCA on the training set. Estimate parameters of a low-dimensional pose manifold with splines. Match new image to the closest point on the manifold.

Recognize Specific Objects (2) 40 Part-based approach Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters SIFT Features David Lowe, CVPR 2003 Tutorial

SIFT Descriptor 41 Thresholded image gradients are sampled over 16x16 array of locations in scale space (Gaussian-weighted). Create array of orientation histograms 8 orientations x 4x4 histogram array = 128 dimensions David Lowe, CVPR 2003 Tutorial

Object Category Recognition 42

Overview of object category recognition see iccv tutorial 43

Demos 44 Pittpatt http://demo.pittpatt.com/

It s not just vision Integrate with mobile sensor information (GPS, time, nearby object or people), calendar, schedule Infer semantically rich meta-data labels from joint sources. 10am 7 Sep 05 Australian park Jim, Jill nearby 45 two koalas seen on nat. park trip with Jim and Jill Jill and koala on nat. park trip John and his new car 4pm 8 Sep 05 Sydney 8pm 10 Oct 05 London 3pm 10 Sep 05 downloaded from http://... office parking lot car to consider purchasing http://www.cs.utexas.edu/~grauman/research/research.html

Summary 46 The object recognition problem Pattern classification primer Object recognition grown up Readings: G&W 12.1-12.2 Reference: Duda, Hart, Stork, Pattern Classification, 2 nd Ed. Next time: Image Compression Additional acknowledgements: Dan Ellis, EE6820 Slides; Duda, Hart& Stork, Pattern classificaion 2 nd Ed., David Claus and Christoph F. Eick: Nearest Neighbor Editing and Condensing Techniques