Local Features based Object Categories and Object Instances Recognition

Size: px
Start display at page:

Download "Local Features based Object Categories and Object Instances Recognition"

Transcription

1 Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March,

2 Thesis in Computer Vision Computer vision is the science and technology of machines that see and think (Wikipedia) Object recognition is the subfield of computer vision whose goal is to recognize objects from image data Algorithm BIKE 2

3 What we want to achieve Local Features based Object Categories and Object Instances Recognition YES NO YES NO YES NO NO NO NO 3

4 What we want to achieve Local Features based Object Categories and Object Instances Recognition NO YES YES YES NO 4

5 What we want to achieve Local Features based Object Categories and Object Instances Recognition 5

6 What we want to achieve Categorization YES Localization Segmentation Different tasks 6

7 Applications of Object Recognition Look for all faces or for Mr. Smith s face in my personal collection of 20,000 pictures Send an alert automatically when any tank or an AMX-30B2 is seen by a surveillance camera Drive autonomous vehicles: detect pedestrians, other vehicles, street tracks, Look for bike images on the Internet 7

8 Difficulties of object recognition Reference object YES Same pixels, Same object NO Different pixels, Different objects YES Different pixels, Same object 8

9 Difficulties of object recognition Reference object YES NO DIFFICULTY = SIGNAL VARIATION Same pixels, Same object Different pixels, Different objects YES Different pixels, Same object 9

10 Difficulties of object recognition View point change: orientation, translation, scale 10

11 Difficulties of object recognition Illumination modification: global brightness and contrast, light sources, shadows, 11

12 Difficulties of object recognition Clutter 12

13 Difficulties of object recognition Occlusions 13

14 Difficulties of object recognition Intra-class variations Specific for object categories, not instances 14

15 Difficulties of object recognition All together: view point variations, illumination variations, clutter, occlusions, intra-class variations Use local representations for local invariance 15

16 To Re-cognize From Old French reconoistre and from Latin recognoscere: to know again Two step process: Knowledge acquisition: what characterizes or discriminates such and such object? Provided by an expert Inferred from a data set Machine Learning Knowledge usage: Is what I see similar to what I learned? 16

17 State of the art Object instance recognition Geometric alignment Fischler & Bolles 81, Goad 83, Lowe 87, Stockman 87 Global methods Turk & Pentland 91, Murase & Nayar 95, Pontil & Verri 98, Schiele & Crowley 2000 Local methods Schmid & Mohr 97, Lowe 99 Object class recognition Global methods Idem above Local methods Rigid geometry Schneiderman & Kanade 00, Viola & Jones 01, Torralba et al 04, Dalal & Triggs 05 No geometry Csurka et al 04, Opelt et al 04, Zhang et al 05 Flexible geometry Fischler & Elschlager 73 (!!), Burl et al 98, Weber et al 00, Agarwal & Roth 02, Leibe & Schiele 03, Fergus et al 03, Berg et al 05, Felzenszwalb & Huttenlocher 05, Fei-Fei et al 06 17

18 State of the art Object instance recognition Geometric alignment Fischler & Bolles 81, Goad 83, Lowe 87, Stockman 87 Global methods Turk & Pentland 91, Murase & Nayar 95, Pontil & Verri 98, Schiele & Crowley 2000 Local methods Schmid & Mohr 97, Lowe 99 Object class recognition Triumph of local methods! Global methods Idem above Local methods Rigid geometry Schneiderman & Kanade 00, Viola & Jones 01, Torralba et al 04, Dalal & Triggs 05 No geometry Csurka et al 04, Opelt et al 04, Zhang et al 05 Flexible geometry Fischler & Elschlager 73 (!!), Burl et al 98, Weber et al 00, Agarwal & Roth 02, Leibe & Schiele 03, Fergus et al 03, Berg et al 05, Felzenszwalb & Huttenlocher 05, Fei-Fei et al 06 But how to select, describe and combine local regions to take a decision? 18

19 t+10 Overview of our works ECCV06 DGA06 VS05 CVPR07 (detailed presentation) 19

20 The Bag of Words Algorithm ECCV06

21 The Bag of Words Algorithm Csurka et al 2004 Object Bag of words X=[ ] X=[ ] (Credit: Li Fei-Fei) Visual Vocabulary 21

22 The Bag of Words Algorithm Issues: Local regions sampling (sampler, min scale, samples number, ) Local regions description (normalization, descriptor, ) Codebook computation (size, algorithm, specificity, ) Image description (encoding [1nn vs thr.], normalization, ) Image classification (discriminative/generative, kernel, ) Many claims in literature, but settings not comparable! Study the relative importance of the different steps 22

23 The Bag of Words Algorithm Graz01: 667 images, 3 cat. + bg: bikes, cars, humans Xerox7: 1776 images, 7 cat. [bike, book, building, ] Pascal Voc 2005: 1373 images, 4 cat + bg: cars, bikes, motorbikes, humans KTH-TIPS: 810 images, 10 categories UIUCTex: 1000 images, 25 categories Brodatz: 112 images, 112 categories Graz 01 KTH-TIPS 23

24 The Bag of Words Algorithm Perf = average of mean multiclass accuracy over 6 runs of 2 folds cross validation SIFT vs GrayLevel Multiclass accuracy Pascal 05 dataset 24

25 The Bag of Words Algorithm t+21 Our contributions Measure of the relative importance of algorithmic choices Most important parameter: number of regions => Rand Few regions: HL, LoG >> Rand Many regions: Rand >> HL, LoG HL, LoG: matching Rand: categorization Other parameters (little influence alone, make the difference combined) Codebook size Codebook source Codebook construction algorithm Non linear SVM Kernel Minimum sampling scale Histogram normalization Best accuracy on Pascal

26 Bag of Words and Infrared Images DGA06

27 Bag of Words and Infrared Images Infrared specificities Noise Low resolution images Distance, humidity Intraclass variations due to local warming + Target detection Do the previous conclusions hold? [sampling, CB size, ] Which operation conditions matter more? [distance, noise, ] 27

28 Bag of Words and Infrared Images 7 objects [tanks, light fighting vehicles, light transport vehicles], 10,000 images Hybrid dataset: Zoom, histogram, noise modification Inner contrast, outer contrast, atmosphere transmission Joint work with DGA Parameters studied (learn A,B,C ; test D) Codebook size Feature selection methods Standard patch size Sampling size [max size, min size, offset, ] Activation values for codewords assignment Nb of training images required 28

29 Bag of Words and Infrared Images 29

30 Bag of Words and Infrared Images Operation parameters Camera / target distance Atmosphere transmission Occlusion rate RSC (target to background contrast ratio) RSS (target contrast) ROI extraction precision Class grouping Two situations Already seen : same data distribution No figures Results are classified Never seen : learn on easy data, test on difficult data 30

31 Bag of Words and Infrared Images t+27 Our contributions Conclusions about algorithms Same behavior as visible state of the art datasets All the parameters can be tuned from a validation set Conclusions about operation conditions In depth analysis for DGA already seen : good, harder with occlusions never seen : good, harder with occlusions or clutter + bad target extraction 31

32 Time / Accuracy trade-off VS05

33 Time / Accuracy trade-off Problem: too many local regions and codewords A solution: feature selection (Int. pts not a solution for IR) A better solution: feature selection for HBC 33

34 Time / Accuracy trade-off 2 datasets: infrared images [1000 im. vehicles, 1500 im. bg]+ Xerox7 [350 im, 400 bg] fewer patches => faster computation 34

35 Time / Accuracy trade-off t+32 A feature selection scheme adapted to Hierarchical Binary Classifiers Our contributions It performs much better with very few features, as well with few features, worse with all the features 35

36 Learning Visual Similarity Measures for Comparing Never Seen Objects CVPR07

37 Motivation Learning Visual Similarity Measures for Comparing Never Seen Objects 37

38 Our Goal Computing the visual similarity of two never seen objects Based on training pairs labeled Same or Different (equivalence constraints) Despite occlusions, changes in pose, light, background Learning Visual Similarity Measures for Comparing Never Seen Objects 38

39 Equivalence Constraints? Same Different Car A Car A Class A Car A Car A Car A Car A Car B Car B Car B Car B Class B Car B Car B Learning Visual Similarity Measures for Comparing Never Seen Objects Car B 39

40 Equivalence Constraints Less informative than Class Labels car model X and car model Y same/different car model Cheaper to obtain e.g. space of class labels too large Deal with new objects. Which model? CANNOT answer Same or Different? CAN answer Learning Visual Similarity Measures for Comparing Never Seen Objects 40

41 How to be robust to variabilities? Consider local representations Get corresponding patch pairs Learning Visual Similarity Measures for Comparing Never Seen Objects 41

42 Vocabulary for Local Representations Text vocabulary of words car, wheel, glass, motor, Image vocabulary of visual words Image pair vocabulary of visual differences HOW do the patches differ? => Characterize local differences Learning Visual Similarity Measures for Comparing Never Seen Objects 42

43 Characterizing local differences (Ferencz et al, ICCV 05) d1 d2 d3 D(I1,I2) = f(d1,d2,d3) d1, d2, d3: weak characterization of the differences Learning Visual Similarity Measures for Comparing Never Seen Objects 43

44 Characterizing local differences: our approach Characteristic Difference!!! Patch Pair Space (ND) Much more information than a simple distance HOW TO COMPUTE THIS QUANTIZATION? Learning Visual Similarity Measures for Comparing Never Seen Objects 44

45 Patch pair quantization algorithm Thr=0.19 Thr=0.03 Thr= SIFT descrip. Both larger than 0.19? False left child True right child Learning Visual Similarity Measures for Comparing Never Seen Objects 45

46 Patch pair quantization algorithm Patch Pair Space (ND) Quantizer / Clusterer Defined by the trees Cluster centers (characteristic differences) defined by the leaves Learning Visual Similarity Measures for Comparing Never Seen Objects 46

47 How to learn the trees? Classical decision trees For each node select the best feature [which SIFT dimension] and the best threshold Extremely Randomized Decision Trees (Geurts 06) Ensemble of decision trees + combination rule Each node is suboptimal Variance is small Fast to learn Good for clustering (Moosman, Triggs and Jurie 06) Learning Visual Similarity Measures for Comparing Never Seen Objects 47

48 An image pair descriptor a) sample corresponding patch pairs b) cluster them with the forest c) Update a global image pair descriptor Learning Visual Similarity Measures for Comparing Never Seen Objects 48

49 Similarity Measure Computation X=[ ] X=[ ] Our Goal! S( I1, I 2) = S( x) = ω x Learning Visual Similarity Measures for Comparing Never Seen Objects t 49

50 Datasets Ferencz et al: cars distortions, tiny details, crop Our dataset: toycars view point, light, background Jain et al: faces in the news light, expression, pose, quality, annotation errors Fleuret et al: COIL 100 Learning Visual Similarity Measures for Comparing Never Seen Objects full rotation, heterogeneous 50

51 Comparison with State of the Art: Equal Error Rate of Precision Recall Never Seen Ferencz Toycars Left: Faces Right: COIL 100 Learning Visual Similarity Measures for Comparing Never Seen Objects 51

52 Visualizations Learning Visual Similarity Measures for Comparing Never Seen Objects 52

53 Visualizations Multi dimensional scaling (2D): L2 distance in 2D as close as possible to the pairwise similarity matrix Below: simple bag of words representation Next page: our similarity measure Learning Visual Similarity Measures for Comparing Never Seen Objects 53

54 Learning Visual Similarity Measures for Comparing Never Seen Objects 54

55 Method Summary Consider corresp. local regions Quantize patch pair differences Extremely Randomized Clustering Forest Get global image pair descriptor Similarity measure is a weighted sum Learning Visual Similarity Measures for Comparing Never Seen Objects 55

56 Learning Visual Similarity Measures for Comparing Never Seen Objects Our contributions A new concept: local differences informative characterization Comparison of never seen objects Very significant improvement over the state of the art In depth study of all the parameters of the algorithm Learning Visual Similarity Measures for Comparing Never Seen Objects 56

57 Conclusions Summary of the contributions ECCV06 DGA06 VS05 CVPR07 57

58 Open issues What is a good local representation? How to integrate geometry (and improve the results)? Sampling: how to bias the random detector (but not too much)? What is the role of the context? When is it more useful? How to use it? How to integrate physical object constraints (local warming) to statistical models? Should the same algorithm be used for all the problems? HOG for large categories [some geometry, no details] BoW for classes recognition [no geometry, texture] kas for classes recognition [geometry, no texture] Sim. Measure for instance recognition [fine differences between local corresponding regions] How does the algorithms scale wrt. the number of classes? 10,000! Overlapping classes Unsupervised learning 58

59 Thank you for your attention

Part based models for recognition. Kristen Grauman

Part based models for recognition. Kristen Grauman Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Object Category Detection. Slides mostly from Derek Hoiem

Object Category Detection. Slides mostly from Derek Hoiem Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

Course Administration

Course Administration Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics

More information

Supervised learning. y = f(x) function

Supervised learning. y = f(x) function Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the

More information

Deformable Part Models

Deformable Part Models CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Lecture 16: Object recognition: Part-based generative models

Lecture 16: Object recognition: Part-based generative models Lecture 16: Object recognition: Part-based generative models Professor Stanford Vision Lab 1 What we will learn today? Introduction Constellation model Weakly supervised training One-shot learning (Problem

More information

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study

Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study Local Features and Kernels for Classifcation of Texture and Object Categories: A Comprehensive Study J. Zhang 1 M. Marszałek 1 S. Lazebnik 2 C. Schmid 1 1 INRIA Rhône-Alpes, LEAR - GRAVIR Montbonnot, France

More information

Learning Visual Similarity Measures for Comparing Never Seen Objects

Learning Visual Similarity Measures for Comparing Never Seen Objects Learning Visual Similarity Measures for Comparing Never Seen Objects Eric Nowak INRIA - INPG - Bertin Technologies 155 r Louis Armand 13290 Aix en Provence - France eric.nowak@inrialpes.fr Frédéric Jurie

More information

Object Category Detection: Sliding Windows

Object Category Detection: Sliding Windows 04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical

More information

Category vs. instance recognition

Category vs. instance recognition Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building

More information

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Visuelle Perzeption für Mensch- Maschine Schnittstellen Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets

More information

Methods for Representing and Recognizing 3D objects

Methods for Representing and Recognizing 3D objects Methods for Representing and Recognizing 3D objects part 1 Silvio Savarese University of Michigan at Ann Arbor Object: Building, 45º pose, 8-10 meters away Object: Person, back; 1-2 meters away Object:

More information

Efficient Kernels for Identifying Unbounded-Order Spatial Features

Efficient Kernels for Identifying Unbounded-Order Spatial Features Efficient Kernels for Identifying Unbounded-Order Spatial Features Yimeng Zhang Carnegie Mellon University yimengz@andrew.cmu.edu Tsuhan Chen Cornell University tsuhan@ece.cornell.edu Abstract Higher order

More information

High Level Computer Vision

High Level Computer Vision High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv Please Note No

More information

Patch Descriptors. CSE 455 Linda Shapiro

Patch Descriptors. CSE 455 Linda Shapiro Patch Descriptors CSE 455 Linda Shapiro How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

Selective Search for Object Recognition

Selective Search for Object Recognition Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where

More information

Bertin Technologies / CNRS LEAR Group INRIA - France

Bertin Technologies / CNRS LEAR Group INRIA - France Learning Visual Distance Function for Identification from one Example. Ei Eric Nowak and dfrederic Jurie Bertin Technologies / CNRS LEAR Group INRIA - France This is an object you've never seen before

More information

Loose Shape Model for Discriminative Learning of Object Categories

Loose Shape Model for Discriminative Learning of Object Categories Loose Shape Model for Discriminative Learning of Object Categories Margarita Osadchy and Elran Morash Computer Science Department University of Haifa Mount Carmel, Haifa 31905, Israel rita@cs.haifa.ac.il

More information

Window based detectors

Window based detectors Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection

More information

CV as making bank. Intel buys Mobileye! $15 billion. Mobileye:

CV as making bank. Intel buys Mobileye! $15 billion. Mobileye: CV as making bank Intel buys Mobileye! $15 billion Mobileye: Spin-off from Hebrew University, Israel 450 engineers 15 million cars installed 313 car models June 2016 - Tesla left Mobileye Fatal crash car

More information

Beyond Bags of features Spatial information & Shape models

Beyond Bags of features Spatial information & Shape models Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features

More information

Recap Image Classification with Bags of Local Features

Recap Image Classification with Bags of Local Features Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval

More information

Local Features and Bag of Words Models

Local Features and Bag of Words Models 10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering

More information

Basic Problem Addressed. The Approach I: Training. Main Idea. The Approach II: Testing. Why a set of vocabularies?

Basic Problem Addressed. The Approach I: Training. Main Idea. The Approach II: Testing. Why a set of vocabularies? Visual Categorization With Bags of Keypoints. ECCV,. G. Csurka, C. Bray, C. Dance, and L. Fan. Shilpa Gulati //7 Basic Problem Addressed Find a method for Generic Visual Categorization Visual Categorization:

More information

Descriptors for CV. Introduc)on:

Descriptors for CV. Introduc)on: Descriptors for CV Content 2014 1.Introduction 2.Histograms 3.HOG 4.LBP 5.Haar Wavelets 6.Video based descriptor 7.How to compare descriptors 8.BoW paradigm 1 2 1 2 Color RGB histogram Introduc)on: Image

More information

Detection III: Analyzing and Debugging Detection Methods

Detection III: Analyzing and Debugging Detection Methods CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can

More information

Part-based models. Lecture 10

Part-based models. Lecture 10 Part-based models Lecture 10 Overview Representation Location Appearance Generative interpretation Learning Distance transforms Other approaches using parts Felzenszwalb, Girshick, McAllester, Ramanan

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

Local cues and global constraints in image understanding

Local cues and global constraints in image understanding Local cues and global constraints in image understanding Olga Barinova Lomonosov Moscow State University *Many slides adopted from the courses of Anton Konushin Image understanding «To see means to know

More information

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering Department of Computer Science 3129D,

More information

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

Multiple-Person Tracking by Detection

Multiple-Person Tracking by Detection http://excel.fit.vutbr.cz Multiple-Person Tracking by Detection Jakub Vojvoda* Abstract Detection and tracking of multiple person is challenging problem mainly due to complexity of scene and large intra-class

More information

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2 High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering Lab

More information

Object Classification for Video Surveillance

Object Classification for Video Surveillance Object Classification for Video Surveillance Rogerio Feris IBM TJ Watson Research Center rsferis@us.ibm.com http://rogerioferis.com 1 Outline Part I: Object Classification in Far-field Video Part II: Large

More information

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin

Object detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin Object detection Wed Feb 24 Kristen Grauman UT Austin Announcements Reminder: Assignment 2 is due Mar 9 and Mar 10 Be ready to run your code again on a new test set on Mar 10 Vision talk next Tuesday 11

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Action recognition in videos

Action recognition in videos Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit

More information

CS6670: Computer Vision

CS6670: Computer Vision CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:

More information

Object and Class Recognition I:

Object and Class Recognition I: Object and Class Recognition I: Object Recognition Lectures 10 Sources ICCV 2005 short courses Li Fei-Fei (UIUC), Rob Fergus (Oxford-MIT), Antonio Torralba (MIT) http://people.csail.mit.edu/torralba/iccv2005

More information

Object Category Detection: Sliding Windows

Object Category Detection: Sliding Windows 03/18/10 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Goal: Detect all instances of objects Influential Works in Detection Sung-Poggio

More information

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Problem Definition Viewpoint estimation: Given an image, predicting viewpoint for object of

More information

String distance for automatic image classification

String distance for automatic image classification String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,

More information

The Caltech-UCSD Birds Dataset

The Caltech-UCSD Birds Dataset The Caltech-UCSD Birds-200-2011 Dataset Catherine Wah 1, Steve Branson 1, Peter Welinder 2, Pietro Perona 2, Serge Belongie 1 1 University of California, San Diego 2 California Institute of Technology

More information

Learning to Recognize Faces in Realistic Conditions

Learning to Recognize Faces in Realistic Conditions 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Machine Learning Crash Course

Machine Learning Crash Course Machine Learning Crash Course Photo: CMU Machine Learning Department protests G20 Computer Vision James Hays Slides: Isabelle Guyon, Erik Sudderth, Mark Johnson, Derek Hoiem The machine learning framework

More information

Evaluation and comparison of interest points/regions

Evaluation and comparison of interest points/regions Introduction Evaluation and comparison of interest points/regions Quantitative evaluation of interest point/region detectors points / regions at the same relative location and area Repeatability rate :

More information

Patch Descriptors. EE/CSE 576 Linda Shapiro

Patch Descriptors. EE/CSE 576 Linda Shapiro Patch Descriptors EE/CSE 576 Linda Shapiro 1 How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

Sampling strategies for bag-of-features image classification

Sampling strategies for bag-of-features image classification Sampling strategies for bag-of-features image classification Emmanuel Nowak, Frédéric Jurie, Bill Triggs To cite this version: Emmanuel Nowak, Frédéric Jurie, Bill Triggs. Sampling strategies for bag-of-features

More information

Bag-of-features. Cordelia Schmid

Bag-of-features. Cordelia Schmid Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image

More information

CS5670: Intro to Computer Vision

CS5670: Intro to Computer Vision CS5670: Intro to Computer Vision Noah Snavely Introduction to Recognition mountain tree banner building street lamp people vendor Announcements Final exam, in-class, last day of lecture (5/9/2018, 12:30

More information

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013 CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition

More information

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213) Recognition of Animal Skin Texture Attributes in the Wild Amey Dharwadker (aap2174) Kai Zhang (kz2213) Motivation Patterns and textures are have an important role in object description and understanding

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Modern Object Detection. Most slides from Ali Farhadi

Modern Object Detection. Most slides from Ali Farhadi Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij

More information

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14 Announcements Computer Vision I CSE 152 Lecture 14 Homework 3 is due May 18, 11:59 PM Reading: Chapter 15: Learning to Classify Chapter 16: Classifying Images Chapter 17: Detecting Objects in Images Given

More information

Bias-Variance Trade-off (cont d) + Image Representations

Bias-Variance Trade-off (cont d) + Image Representations CS 275: Machine Learning Bias-Variance Trade-off (cont d) + Image Representations Prof. Adriana Kovashka University of Pittsburgh January 2, 26 Announcement Homework now due Feb. Generalization Training

More information

CLASSIFICATION Experiments

CLASSIFICATION Experiments CLASSIFICATION Experiments January 27,2015 CS3710: Visual Recognition Bhavin Modi Bag of features Object Bag of words 1. Extract features 2. Learn visual vocabulary Bag of features: outline 3. Quantize

More information

Visual Object Recognition

Visual Object Recognition Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe Computer Vision Laboratory ETH Zurich Chicago, 14.07.2008 & Kristen Grauman Department

More information

Fusing shape and appearance information for object category detection

Fusing shape and appearance information for object category detection 1 Fusing shape and appearance information for object category detection Andreas Opelt, Axel Pinz Graz University of Technology, Austria Andrew Zisserman Dept. of Engineering Science, University of Oxford,

More information

Visual Object Recognition

Visual Object Recognition Visual Object Recognition -67777 Instructor: Daphna Weinshall, daphna@cs.huji.ac.il Office: Ross 211 Office hours: Sunday 12:00-13:00 1 Sources Recognizing and Learning Object Categories ICCV 2005 short

More information

2D Image Processing Feature Descriptors

2D Image Processing Feature Descriptors 2D Image Processing Feature Descriptors Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Overview

More information

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and

More information

Oriented Filters for Object Recognition: an empirical study

Oriented Filters for Object Recognition: an empirical study Oriented Filters for Object Recognition: an empirical study Jerry Jun Yokono Tomaso Poggio Center for Biological and Computational Learning, M.I.T. E5-0, 45 Carleton St., Cambridge, MA 04, USA Sony Corporation,

More information

Object Detection by 3D Aspectlets and Occlusion Reasoning

Object Detection by 3D Aspectlets and Occlusion Reasoning Object Detection by 3D Aspectlets and Occlusion Reasoning Yu Xiang University of Michigan Silvio Savarese Stanford University In the 4th International IEEE Workshop on 3D Representation and Recognition

More information

Fuzzy based Multiple Dictionary Bag of Words for Image Classification

Fuzzy based Multiple Dictionary Bag of Words for Image Classification Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image

More information

Templates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser

Templates and Background Subtraction. Prof. D. Stricker Doz. G. Bleser Templates and Background Subtraction Prof. D. Stricker Doz. G. Bleser 1 Surveillance Video: Example of multiple people tracking http://www.youtube.com/watch?v=inqv34bchem&feature=player_embedded As for

More information

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 18 Object Recognition I Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering Lab

More information

High Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG)

High Level Computer Vision. Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) High Level Computer Vision Sliding Window Detection: Viola-Jones-Detector & Histogram of Oriented Gradients (HOG) Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv

More information

OBJECT CATEGORIZATION

OBJECT CATEGORIZATION OBJECT CATEGORIZATION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it Slides: Ing. Lamberto Ballan November 18th, 2009 What is an Object? Merriam-Webster Definition: Something material that may be

More information

Today. Introduction to recognition Alignment based approaches 11/4/2008

Today. Introduction to recognition Alignment based approaches 11/4/2008 Today Introduction to recognition Alignment based approaches Tuesday, Nov 4 Brief recap of visual words Introduction to recognition problem Recognition by alignment, pose clustering Kristen Grauman UT

More information

Category-level Localization

Category-level Localization Category-level Localization Andrew Zisserman Visual Geometry Group University of Oxford http://www.robots.ox.ac.uk/~vgg Includes slides from: Ondra Chum, Alyosha Efros, Mark Everingham, Pedro Felzenszwalb,

More information

Effective Classifiers for Detecting Objects

Effective Classifiers for Detecting Objects Effective Classifiers for Detecting Objects Michael Mayo Dept. of Computer Science University of Waikato Private Bag 3105, Hamilton, New Zealand mmayo@cs.waikato.ac.nz Abstract Several state-of-the-art

More information

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Akitsugu Noguchi and Keiji Yanai Department of Computer Science, The University of Electro-Communications, 1-5-1 Chofugaoka,

More information

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab Person Detection in Images using HoG + Gentleboost Rahul Rajan June 1st July 15th CMU Q Robotics Lab 1 Introduction One of the goals of computer vision Object class detection car, animal, humans Human

More information

CS229: Action Recognition in Tennis

CS229: Action Recognition in Tennis CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active

More information

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,

More information

Incremental Learning of Object Detectors Using a Visual Shape Alphabet

Incremental Learning of Object Detectors Using a Visual Shape Alphabet Incremental Learning of Object Detectors Using a Visual Shape Alphabet A. Opelt, A. Pinz & A. Zisserman CVPR 06 Presented by Medha Bhargava* * Several slides adapted from authors presentation, CVPR 06

More information

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular

More information

Categorization by Learning and Combining Object Parts

Categorization by Learning and Combining Object Parts Categorization by Learning and Combining Object Parts Bernd Heisele yz Thomas Serre y Massimiliano Pontil x Thomas Vetter Λ Tomaso Poggio y y Center for Biological and Computational Learning, M.I.T., Cambridge,

More information

Object recognition (part 2)

Object recognition (part 2) Object recognition (part 2) CSE P 576 Larry Zitnick (larryz@microsoft.com) 1 2 3 Support Vector Machines Modified from the slides by Dr. Andrew W. Moore http://www.cs.cmu.edu/~awm/tutorials Linear Classifiers

More information

Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study Jianguo Zhang, Marcin Marszalek, Svetlana Lazebnik, Cordelia Schmid To cite this version: Jianguo Zhang,

More information

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009 Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features

People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features People detection in complex scene using a cascade of Boosted classifiers based on Haar-like-features M. Siala 1, N. Khlifa 1, F. Bremond 2, K. Hamrouni 1 1. Research Unit in Signal Processing, Image Processing

More information

Learning Tree-structured Descriptor Quantizers for Image Categorization

Learning Tree-structured Descriptor Quantizers for Image Categorization KRAPAC, VERBEEK, JURIE: TREES AS QUANTIZERS FOR IMAGE CATEGORIZATION 1 Learning Tree-structured Descriptor Quantizers for Image Categorization Josip Krapac 1 http://lear.inrialpes.fr/~krapac Jakob Verbeek

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Human Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg

Human Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg Human Detection A state-of-the-art survey Mohammad Dorgham University of Hamburg Presentation outline Motivation Applications Overview of approaches (categorized) Approaches details References Motivation

More information

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection

Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection Hyunghoon Cho and David Wu December 10, 2010 1 Introduction Given its performance in recent years' PASCAL Visual

More information

Find that! Visual Object Detection Primer

Find that! Visual Object Detection Primer Find that! Visual Object Detection Primer SkTech/MIT Innovation Workshop August 16, 2012 Dr. Tomasz Malisiewicz tomasz@csail.mit.edu Find that! Your Goals...imagine one such system that drives information

More information

Face Detection and Alignment. Prof. Xin Yang HUST

Face Detection and Alignment. Prof. Xin Yang HUST Face Detection and Alignment Prof. Xin Yang HUST Many slides adapted from P. Viola Face detection Face detection Basic idea: slide a window across image and evaluate a face model at every location Challenges

More information