Part based models for recognition. Kristen Grauman
|
|
- Abner Pitts
- 5 years ago
- Views:
Transcription
1 Part based models for recognition Kristen Grauman UT Austin
2 Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily conform to rigid layout Occlusion sensitivity Sliding window expensive Kristen Grauman
3 Part-based and local feature models for recognition Main idea: Rather than a representation based on holistic appearance, decompose the image into: local parts or patches, and their relative spatial relationships Kristen Grauman
4 Origins of the part-based model Fischler & Elschlager 1973 Model has two components parts (2D image fragments) structure structure (configuration of parts)
5 Part-based and local feature models for recognition We ll look at three forms: 1. Bag of words (no geometry) 2. Implicit shape model (star graph for spatial model) 3. Constellation model (fully connected graph for spatial model) Kristen Grauman
6 Spatial models: Connectivity and structure O(N P ) O(NP) Fergus et al. 03 Fei-Fei et al. 03 Leibe et al. 04, 08 Crandall et al. 05 Fergus et al. 05 Crandall et al. 05 Felzenszwalb & Huttenlocher 05 Csurka 04 Vasconcelos 00 Bouchard & Triggs 05 Carneiro & Lowe 06 from [Carneiro & Lowe, ECCV 06]
7 Bag-of-words model Summarize entire image based on its distribution (histogram) of word occurrences. Total freedom on spatial positions, relative geometry. Vector representation easily usable by most classifiers. Kristen Grauman
8 Bag-of-words model Csurka et al. Visual Categorization with Bags of Keypoints, 2004
9 Words as parts Csurka et al All local features Local features from two selected clusters occurring in this image
10 Naïve Bayes model for classification c arg max c p ( c w ) p ( c ) p ( w c ) N p ( c ) p ( w n c ) n 1 Object class decision Prior prob. of the object classes Image likelihood given the class patches th What assumptions does the model make, and what are their significance?
11 Confusion matrix Example bag of words + Naïve Bayes classification results for generic categorization of objects Kristen Grauman Csurka et al. 2004
12 Clutter or context? Kristen Grauman
13 Sampling strategies Kristen Grauman Specific object Category
14 Sampling strategies Sparse, at interest points Dense, uniformly Randomly To find specific, textured objects, sparse sampling from interest points more reliable. Multiple complementary interest operators offer more image coverage. Multiple l interest t operators Image credits: F-F. Li, E. Nowak, J. Sivic For object categorization, dense sampling offers better coverage. [See Nowak, Jurie & Triggs, ECCV 2006]
15 Local feature correspondence for generic object categories Kristen Grauman
16 Local feature correspondence for generic object categories Comparing bags of words histograms coarsely reflects agreement between local parts (patches, words). But choice of quantization directly determines what we consider to be similar Kristen Grauman
17 Partially matching sets of features Optimal match: O(m 3 ) Greedy match: O(m 2 log m) Pyramid match: O(m) (m=num pts) We introduce an approximate matching kernel that makes it practical to compare large sets of features based on their partial correspondences. Kristen Grauman [Previous work: Indyk & Thaper, Bartal, Charikar, Agarwal & Varadarajan, ]
18 Pyramid match: main idea Feature space partitions serve to match the local descriptors within successively wider regions. descriptor space Kristen Grauman
19 Pyramid match: main idea Kristen Grauman Histogram intersection counts number of possible matches at a given partitioning.
20 Pyramid match kernel measures difficulty of a match at level l number of newly matched pairs at level For similarity, weights inversely proportional p to bin size (or may be learned) Normalize these kernel values to avoid favoring large sets [Grauman & Darrell, ICCV 2005]
21 Pyramid match kernel Optimal match: O(m 3 ) Pyramid match: O(mL) optimal partial matching Kristen Grauman
22 Highlights of the pyramid match Linear time complexity Formal bounds on expected error Mercer kernel Data-driven partitions allow accurate matches even in high-dim. h feature spaces Strong performance on benchmark object recognition datasets Kristen Grauman
23 Example recognition results: Caltech-101 dataset 101 categories images per class Data provided by Fei-Fei, Fergus, and Perona
24 Recognition results: Caltech-101 dataset Jain, Huynh, & Grauman (2007) Kristen Grauman
25 Example recognition results: Caltech-101 dataset Combination of pyramid match and correspondence kernels uracy Accu [Kapoor et al. IJCV 2009] Number of training examples Kristen Grauman
26 Unordered sets of local features: No spatial layout preserved! Too much? Too little?
27 Spatial pyramid match Make a pyramid of bag-of-words histograms. Provides some loose (global) spatial layout information Sum over PMKs computed in image coordinate space, one per word. [Lazebnik, Schmid & Ponce, CVPR 2006] Kristen Grauman
28 Spatial pyramid match Captures scene categories well---texture-like patterns but with some variability in the positions of all the local pieces. Confusion matrix Kristen Grauman
29 Spatial pyramid match Captures scene categories well---texture-like patterns but with some variability in the positions of all the local pieces. Kristen Grauman
30 Part-based and local feature models for recognition We ll look at three forms: 1. Bag of words (no geometry) 2. Implicit shape model (star graph for spatial model) 3. Constellation model (fully connected graph for spatial model) Kristen Grauman
31 Shape representation in part-based models Star shape model x 1 x 6 x 2 x 5 x 3 x 4 e.g. implicit shape model Parts mutually independent N image features, P parts in the model Kristen Grauman
32 Implicit shape models Visual vocabulary is used to index votes for object position [a visual word = part ] training image annotated with object localization info visual codeword with displacement vectors B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004
33 Implicit shape models Visual vocabulary is used to index votes for object position [a visual word = part ] test image B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004
34 Implicit shape models: Training 1. Build vocabulary of patches around extracted interest points using clustering How will this differ from a vocabulary built How will this differ from a vocabulary built from, e.g., all frames in a movie?
35 Implicit shape models: Training 1. Build vocabulary of patches around extracted interest points using clustering 2. Map the patch around each interest point to closest word
36 Implicit shape models: Training 1. Build vocabulary of patches around extracted interest points using clustering 2. Map the patch around each interest point to closest word 3. For each word, store all positions it was found, relative to object center
37 Implicit shape models: Testing 1. Given new test image, extract patches, match to vocabulary words 2. Cast votes for possible positions of object center 3. Search for maxima in voting space 4. (Extract weighted segmentation mask based on stored masks for the codebook occurrences) What is the dimension of the Hough space?
38 Implicit shape models: Testing
39 Example: Results on Cows Original image K. Grauman, B. Leibe Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
40 Example: Results on Cows Original image Interest points K. Grauman, B. Leibe Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
41 Example: Results on Cows Matched Interest Original points patches image K. Grauman, B. Leibe Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
42 Example: Results on Cows Matched Interest Original Votes patches points image K. Grauman, B. Leibe 45 Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
43 Example: Results on Cows 1 st hypothesis K. Grauman, B. Leibe 46 Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
44 Example: Results on Cows 2 nd hypothesis K. Grauman, B. Leibe 47 Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
45 Example: Results on Cows 3 rd hypothesis K. Grauman, B. Leibe 48 Visual PercepObject ptual and ReScognition ensory Augmented Tutorial Computing
46 Detection Results Qualitative Performance Recognizes different kinds of objects Robust to clutter, occlusion, noise, low contrast K. Grauman, B. Leibe 49 Visual PercepObject ptual and ReScognition ensory Augmented Tutorial C omputing
47 Shape representation in part-based models Star shape model x 1 x 6 x 2 x 5 x 3 x 4 e.g. implicit shape model Parts mutually independent N image features, P parts in the model
48 Deformable part model Felzenszwalb et al A hybrid window + part-based model vs Viola & Jones Dalal & Triggs Felzenszwalb et al.
49 Deformable part model Felzenszwalb et al Mixture of deformable part models Each component has global template + deformable parts Fully trained from bounding boxes alone Adapted from Felzenszwalb s slides at
50 Deformable part model Training: Training data consists of images with bounding boxes. Need to learn the model structure, filters, deformation costs Latent SVM: score by root + max over part placements Alternate between optimizing latent t values and model parameters. Adapted from Felzenszwalb s slides at
51 Example: Person model
52 Deformable part model Matching: Define an overall score for each root location - Based on best placement of parts High scoring root locations define detections - sliding window approach Adapted from Felzenszwalb s slides at
53 Results: car detections
54 Results: person detections
55 Results: horse detections
56 Results: cat detections
57 Shape representation in part-based models Star shape model Fully connected constellation model x 6 x 1 x 1 x 2 x 6 x 2 x 5 x 3 x 5 x 3 x 4 x 4 e.g. implicit shape model Parts mutually independent e.g. Constellation ti Model Parts fully connected N image features, P parts in the model Slide credit: Rob Fergus
58 Probabilistic constellation model P( image object) P( appearance, shape object) max h P( appearance h, object) p( shape h, object) p( h object) h: assignment Part of features to Part parts descriptors locations Candidate parts Source: Lana Lazebnik
59 Probabilistic constellation model P( image object) P( appearance, shape object) max h P( appearance h, object) p( shape h, object) p( h object) h: assignment of features to parts Part 1 Part 3 Part 2 Source: Lana Lazebnik
60 Probabilistic constellation model P( image object) P( appearance, shape object) max h P( appearance h, object) p( shape h, object) p( h object) h: assignment of features to parts Part 1 Part 3 Part 2 Source: Lana Lazebnik
61 Probabilistic constellation model P( image object) P( appearance, shape object) max h P( appearance h, object) p( shape h, object) p( h object) Distribution over patch descriptors High-dimensional appearance space Source: Lana Lazebnik
62 Probabilistic constellation model P( image object) P( appearance, shape object) max h P( appearance h, object) p( shape h, object) p( h object) Distribution tion over joint part positions 2D image space Source: Lana Lazebnik
63 Example results from constellation model: data from four categories Slide from Li Fei-Fei
64 Face model Appearance: 10 patches closest to mean for each part Recognition results Fergus et al. CVPR 2003
65 Face model Appearance: 10 patches closest to mean for each part Recognition results Test images: size of circles indicates score of hypothesis Fergus et al. CVPR 2003
66 Motorbike model Appearance: 10 patches closest to mean for each part Recognition results Fergus et al. CVPR 2003
67 Spotted cat model Appearance: 10 patches closest to mean for each part Recognition results Fergus et al. CVPR 2003
68 Comparison bag of features bag of features Part-based model Source: Lana Lazebnik
69 Shape representation in part-based models Star shape model Fully connected constellation model x 6 x 1 x 1 x 2 x 6 x 2 x 5 x 3 x 5 x 3 x 4 x 4 e.g. implicit shape model Parts mutually independent Recognition complexity: O(NP) Method: Gen. Hough Transform e.g. Constellation ti Model Parts fully connected Recognition complexity: O(N P ) Method: Exhaustive search N image features, P parts in the model Slide credit: Rob Fergus
70 Part-based models: issues and choices Invariance of the structure representation Part (appearance) representation Learning cost Cost of fitting to new examples Generative vs. discriminative Supervision required for training examples Data-driven vs. knowledge-driven model construction Kristen Grauman
71 Summary: part-based and local feature models for generic object recognition Histograms of visual words to capture global or local layout in the bag-of-words framework Pyramid match kernels Powerful in practice for image recognition Part-based models encode category s part appearance together with 2d layout; allow detection in cluttered image implicit shape model and LSVM model : shape based on layout of all parts relative to a reference part; Generalized Hough for detection constellation model : explicitly model mutual spatial layout between all pairs of parts; exhaustive search for best fit of features to parts
72 Recognition models Instances: recognition by alignment Categories: Holistic appearance models (and sliding window detection) Categories: Local feature and part based models Kristen Grauman
73 OTHER ONGOING CHALLENGES
74 Context Kristen Grauman
75 View invariant invariant models Kristen Grauman
76 Attributes Kristen Grauman
77 Use of video eg e.g., activities + objects Kristen Grauman
78 Annotation Annotator [tiny image montage by Torralba et al.] The complex space of visual objects, activities, and scenes. Kristen Grauman
79 Annotation Labeled data Annotator Accuracy? Active request? Unlabeled Num labels added Current classifiers data active passive Kristen Grauman
80 Very large multi class problems
Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationBeyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets
More informationDeformable Part Models
CS 1674: Intro to Computer Vision Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 9, 2016 Today: Object category detection Window-based approaches: Last time: Viola-Jones
More informationObject Category Detection. Slides mostly from Derek Hoiem
Object Category Detection Slides mostly from Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical template matching with sliding window Part-based Models
More informationBeyond Bags of features Spatial information & Shape models
Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features
More informationVisual Object Recognition
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe Computer Vision Laboratory ETH Zurich Chicago, 14.07.2008 & Kristen Grauman Department
More informationObject detection. Announcements. Last time: Mid-level cues 2/23/2016. Wed Feb 24 Kristen Grauman UT Austin
Object detection Wed Feb 24 Kristen Grauman UT Austin Announcements Reminder: Assignment 2 is due Mar 9 and Mar 10 Be ready to run your code again on a new test set on Mar 10 Vision talk next Tuesday 11
More informationPart-based models. Lecture 10
Part-based models Lecture 10 Overview Representation Location Appearance Generative interpretation Learning Distance transforms Other approaches using parts Felzenszwalb, Girshick, McAllester, Ramanan
More informationObject Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce
Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS
More informationObject Detection. Sanja Fidler CSC420: Intro to Image Understanding 1/ 1
Object Detection Sanja Fidler CSC420: Intro to Image Understanding 1/ 1 Object Detection The goal of object detection is to localize objects in an image and tell their class Localization: place a tight
More informationModern Object Detection. Most slides from Ali Farhadi
Modern Object Detection Most slides from Ali Farhadi Comparison of Classifiers assuming x in {0 1} Learning Objective Training Inference Naïve Bayes maximize j i logp + logp ( x y ; θ ) ( y ; θ ) i ij
More informationBy Suren Manvelyan,
By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan,
More informationCS6670: Computer Vision
CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:
More informationLocal Features based Object Categories and Object Instances Recognition
Local Features based Object Categories and Object Instances Recognition Eric Nowak Ph.D. thesis defense 17th of March, 2008 1 Thesis in Computer Vision Computer vision is the science and technology of
More informationFitting: The Hough transform
Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data
More informationSupervised learning. y = f(x) function
Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the
More informationDetection III: Analyzing and Debugging Detection Methods
CS 1699: Intro to Computer Vision Detection III: Analyzing and Debugging Detection Methods Prof. Adriana Kovashka University of Pittsburgh November 17, 2015 Today Review: Deformable part models How can
More informationBag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013
CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition
More informationEECS 442 Computer vision. Object Recognition
EECS 442 Computer vision Object Recognition Intro Recognition of 3D objects Recognition of object categories: Bag of world models Part based models 3D object categorization Challenges: intraclass variation
More informationCategory-level localization
Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationBag-of-features. Cordelia Schmid
Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image
More informationFitting: The Hough transform
Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data
More informationCEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.
CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering 3129D, Newmark Civil Engineering Lab
More informationEfficient Kernels for Identifying Unbounded-Order Spatial Features
Efficient Kernels for Identifying Unbounded-Order Spatial Features Yimeng Zhang Carnegie Mellon University yimengz@andrew.cmu.edu Tsuhan Chen Cornell University tsuhan@ece.cornell.edu Abstract Higher order
More informationCEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.
CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt. Session 20 Object Recognition 3 Mani Golparvar-Fard Department of Civil and Environmental Engineering Department of Computer Science 3129D,
More informationCategory-level Localization
Category-level Localization Andrew Zisserman Visual Geometry Group University of Oxford http://www.robots.ox.ac.uk/~vgg Includes slides from: Ondra Chum, Alyosha Efros, Mark Everingham, Pedro Felzenszwalb,
More informationBeyond Bags of Features
: for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015
More informationLoose Shape Model for Discriminative Learning of Object Categories
Loose Shape Model for Discriminative Learning of Object Categories Margarita Osadchy and Elran Morash Computer Science Department University of Haifa Mount Carmel, Haifa 31905, Israel rita@cs.haifa.ac.il
More informationPatch Descriptors. CSE 455 Linda Shapiro
Patch Descriptors CSE 455 Linda Shapiro How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar
More informationTA Section: Problem Set 4
TA Section: Problem Set 4 Outline Discriminative vs. Generative Classifiers Image representation and recognition models Bag of Words Model Part-based Model Constellation Model Pictorial Structures Model
More informationLecture 16: Object recognition: Part-based generative models
Lecture 16: Object recognition: Part-based generative models Professor Stanford Vision Lab 1 What we will learn today? Introduction Constellation model Weakly supervised training One-shot learning (Problem
More informationHigh Level Computer Vision
High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de http://www.d2.mpi-inf.mpg.de/cv Please Note No
More informationPatch-based Object Recognition. Basic Idea
Patch-based Object Recognition 1! Basic Idea Determine interest points in image Determine local image properties around interest points Use local image properties for object classification Example: Interest
More informationPart-Based Models for Object Class Recognition Part 2
High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object
More informationPart-Based Models for Object Class Recognition Part 2
High Level Computer Vision Part-Based Models for Object Class Recognition Part 2 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/hlcv Class of Object
More informationFitting: The Hough transform
Fitting: The Hough transform Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not vote consistently for any single model Missing data
More informationIntroduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others
Introduction to object recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Overview Basic recognition tasks A statistical learning approach Traditional or shallow recognition
More informationModel Fitting: The Hough transform II
Model Fitting: The Hough transform II Guido Gerig, CS6640 Image Processing, Utah Theory: See handwritten notes GG: HT-notes-GG-II.pdf Credits: S. Narasimhan, CMU, Spring 2006 15-385,-685, Link Svetlana
More informationToday. Introduction to recognition Alignment based approaches 11/4/2008
Today Introduction to recognition Alignment based approaches Tuesday, Nov 4 Brief recap of visual words Introduction to recognition problem Recognition by alignment, pose clustering Kristen Grauman UT
More informationMethods for Representing and Recognizing 3D objects
Methods for Representing and Recognizing 3D objects part 1 Silvio Savarese University of Michigan at Ann Arbor Object: Building, 45º pose, 8-10 meters away Object: Person, back; 1-2 meters away Object:
More informationInstance-level recognition II.
Reconnaissance d objets et vision artificielle 2010 Instance-level recognition II. Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique, Ecole Normale
More informationCS 4495 Computer Vision Classification 3: Bag of Words. Aaron Bobick School of Interactive Computing
CS 4495 Computer Vision Classification 3: Bag of Words Aaron Bobick School of Interactive Computing Administrivia PS 6 is out. Due Tues Nov 25th, 11:55pm. One more assignment after that Mea culpa This
More informationCategory vs. instance recognition
Category vs. instance recognition Category: Find all the people Find all the buildings Often within a single image Often sliding window Instance: Is this face James? Find this specific famous building
More informationObject Recognition and Detection
CS 2770: Computer Vision Object Recognition and Detection Prof. Adriana Kovashka University of Pittsburgh March 16, 21, 23, 2017 Plan for the next few lectures Recognizing the category in the image as
More informationPatch Descriptors. EE/CSE 576 Linda Shapiro
Patch Descriptors EE/CSE 576 Linda Shapiro 1 How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar
More informationCourse Administration
Course Administration Project 2 results are online Project 3 is out today The first quiz is a week from today (don t panic!) Covers all material up to the quiz Emphasizes lecture material NOT project topics
More informationLocal Features and Bag of Words Models
10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering
More informationLocal features, distances and kernels. Plan for today
Local features, distances and kernels February 5, 2009 Kristen Grauman UT Austin Plan for today Lecture : local features and matchi Papers: Video Google [Sivic & Zisserman] Pyramid match [Grauman & Darrell]
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationBackprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections
Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections Nima Razavi 1, Juergen Gall 1, and Luc Van Gool 1,2 1 Computer Vision Laboratory, ETH Zurich 2 ESAT-PSI/IBBT,
More informationDiscriminative classifiers for image recognition
Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study
More informationVisual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.
Visual words Map high-dimensional descriptors to tokens/words by quantizing the feature space. Quantize via clustering; cluster centers are the visual words Word #2 Descriptor feature space Assign word
More informationMining Discriminative Adjectives and Prepositions for Natural Scene Recognition
Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition Bangpeng Yao 1, Juan Carlos Niebles 2,3, Li Fei-Fei 1 1 Department of Computer Science, Princeton University, NJ 08540, USA
More informationObject Classification Problem
HIERARCHICAL OBJECT CATEGORIZATION" Gregory Griffin and Pietro Perona. Learning and Using Taxonomies For Fast Visual Categorization. CVPR 2008 Marcin Marszalek and Cordelia Schmid. Constructing Category
More informationSelection of Scale-Invariant Parts for Object Class Recognition
Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract
More informationInstance-level recognition part 2
Visual Recognition and Machine Learning Summer School Paris 2011 Instance-level recognition part 2 Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique,
More informationDistances and Kernels. Motivation
Distances and Kernels Amirshahed Mehrtash Motivation How similar? 1 Problem Definition Designing a fast system to measure the similarity il it of two images. Used to categorize images based on appearance.
More informationToday. Main questions 10/30/2008. Bag of words models. Last time: Local invariant features. Harris corner detector: rotation invariant detection
Today Indexing with local features, Bag of words models Matching local features Indexing features Bag of words model Thursday, Oct 30 Kristen Grauman UT-Austin Main questions Where will the interest points
More informationSupervised learning. y = f(x) function
Supervised learning y = f(x) output prediction function Image feature Training: given a training set of labeled examples {(x 1,y 1 ),, (x N,y N )}, estimate the prediction function f by minimizing the
More informationRecap Image Classification with Bags of Local Features
Recap Image Classification with Bags of Local Features Bag of Feature models were the state of the art for image classification for a decade BoF may still be the state of the art for instance retrieval
More informationOBJECT CATEGORIZATION
OBJECT CATEGORIZATION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it Slides: Ing. Lamberto Ballan November 18th, 2009 What is an Object? Merriam-Webster Definition: Something material that may be
More informationImproved Spatial Pyramid Matching for Image Classification
Improved Spatial Pyramid Matching for Image Classification Mohammad Shahiduzzaman, Dengsheng Zhang, and Guojun Lu Gippsland School of IT, Monash University, Australia {Shahid.Zaman,Dengsheng.Zhang,Guojun.Lu}@monash.edu
More informationArtistic ideation based on computer vision methods
Journal of Theoretical and Applied Computer Science Vol. 6, No. 2, 2012, pp. 72 78 ISSN 2299-2634 http://www.jtacs.org Artistic ideation based on computer vision methods Ferran Reverter, Pilar Rosado,
More informationWindow based detectors
Window based detectors CS 554 Computer Vision Pinar Duygulu Bilkent University (Source: James Hays, Brown) Today Window-based generic object detection basic pipeline boosting classifiers face detection
More informationString distance for automatic image classification
String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,
More informationAggregating Descriptors with Local Gaussian Metrics
Aggregating Descriptors with Local Gaussian Metrics Hideki Nakayama Grad. School of Information Science and Technology The University of Tokyo Tokyo, JAPAN nakayama@ci.i.u-tokyo.ac.jp Abstract Recently,
More informationIndexing local features and instance recognition May 16 th, 2017
Indexing local features and instance recognition May 16 th, 2017 Yong Jae Lee UC Davis Announcements PS2 due next Monday 11:59 am 2 Recap: Features and filters Transforming and describing images; textures,
More informationImage classification Computer Vision Spring 2018, Lecture 18
Image classification http://www.cs.cmu.edu/~16385/ 16-385 Computer Vision Spring 2018, Lecture 18 Course announcements Homework 5 has been posted and is due on April 6 th. - Dropbox link because course
More informationFusing shape and appearance information for object category detection
1 Fusing shape and appearance information for object category detection Andreas Opelt, Axel Pinz Graz University of Technology, Austria Andrew Zisserman Dept. of Engineering Science, University of Oxford,
More informationObject recognition. Methods for classification and image representation
Object recognition Methods for classification and image representation Credits Slides by Pete Barnum Slides by FeiFei Li Paul Viola, Michael Jones, Robust Realtime Object Detection, IJCV 04 Navneet Dalal
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationLEARNING MODELS FOR MULTI-VIEWPOINT OBJECT DETECTION
LEARNING MODELS FOR MULTI-VIEWPOINT OBJECT DETECTION Akash M. Kushal, Ph.D. Department of Computer Science University of Illinois at Urbana-Champaign, 2008 Jean Ponce, Adviser This dissertation addresses
More informationBias-Variance Trade-off + Other Models and Problems
CS 1699: Intro to Computer Vision Bias-Variance Trade-off + Other Models and Problems Prof. Adriana Kovashka University of Pittsburgh November 3, 2015 Outline Support Vector Machines (review + other uses)
More informationComputer Vision. Exercise Session 10 Image Categorization
Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category
More informationMachine Learning Crash Course
Machine Learning Crash Course Photo: CMU Machine Learning Department protests G20 Computer Vision James Hays Slides: Isabelle Guyon, Erik Sudderth, Mark Johnson, Derek Hoiem The machine learning framework
More informationIndexing local features and instance recognition May 14 th, 2015
Indexing local features and instance recognition May 14 th, 2015 Yong Jae Lee UC Davis Announcements PS2 due Saturday 11:59 am 2 We can approximate the Laplacian with a difference of Gaussians; more efficient
More informationObjectDetectionusingaMax-MarginHoughTransform
ObjectDetectionusingaMax-MarginHoughTransform Subhransu Maji, Jitendra Malik Computer Science Division, EECS University of California at Berkeley {smaji,malik}@cs.berkeley.edu Abstract We present a discriminative
More informationSemi-Supervised Hierarchical Models for 3D Human Pose Reconstruction
Semi-Supervised Hierarchical Models for 3D Human Pose Reconstruction Atul Kanaujia, CBIM, Rutgers Cristian Sminchisescu, TTI-C Dimitris Metaxas,CBIM, Rutgers 3D Human Pose Inference Difficulties Towards
More informationDevelopment in Object Detection. Junyuan Lin May 4th
Development in Object Detection Junyuan Lin May 4th Line of Research [1] N. Dalal and B. Triggs. Histograms of oriented gradients for human detection, CVPR 2005. HOG Feature template [2] P. Felzenszwalb,
More informationImproving Recognition through Object Sub-categorization
Improving Recognition through Object Sub-categorization Al Mansur and Yoshinori Kuno Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama 338-8570,
More informationObject Category Detection: Sliding Windows
04/10/12 Object Category Detection: Sliding Windows Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Today s class: Object Category Detection Overview of object category detection Statistical
More informationLearning the Compositional Nature of Visual Objects
Learning the Compositional Nature of Visual Objects Björn Ommer Joachim M. Buhmann Institute of Computational Science, ETH Zurich 8092 Zurich, Switzerland bjoern.ommer@inf.ethz.ch jbuhmann@inf.ethz.ch
More informationAnalysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009
Analysis: TextonBoost and Semantic Texton Forests Daniel Munoz 16-721 Februrary 9, 2009 Papers [shotton-eccv-06] J. Shotton, J. Winn, C. Rother, A. Criminisi, TextonBoost: Joint Appearance, Shape and Context
More informationLearning Visual Semantics: Models, Massive Computation, and Innovative Applications
Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Part II: Visual Features and Representations Liangliang Cao, IBM Watson Research Center Evolvement of Visual Features
More informationLearning Realistic Human Actions from Movies
Learning Realistic Human Actions from Movies Ivan Laptev*, Marcin Marszałek**, Cordelia Schmid**, Benjamin Rozenfeld*** INRIA Rennes, France ** INRIA Grenoble, France *** Bar-Ilan University, Israel Presented
More informationPart-Based Models for Object Class Recognition Part 3
High Level Computer Vision! Part-Based Models for Object Class Recognition Part 3 Bernt Schiele - schiele@mpi-inf.mpg.de Mario Fritz - mfritz@mpi-inf.mpg.de! http://www.d2.mpi-inf.mpg.de/cv ! State-of-the-Art
More informationRecognizing object instances
Recognizing object instances UT-Austin Instance recognition Motivation visual search Visual words quantization, index, bags of words Spatial verification affine; RANSAC, Hough Other text retrieval tools
More informationCV as making bank. Intel buys Mobileye! $15 billion. Mobileye:
CV as making bank Intel buys Mobileye! $15 billion Mobileye: Spin-off from Hebrew University, Israel 450 engineers 15 million cars installed 313 car models June 2016 - Tesla left Mobileye Fatal crash car
More informationObject Classification for Video Surveillance
Object Classification for Video Surveillance Rogerio Feris IBM TJ Watson Research Center rsferis@us.ibm.com http://rogerioferis.com 1 Outline Part I: Object Classification in Far-field Video Part II: Large
More informationLocal Image Features
Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3
More informationStructured Models in. Dan Huttenlocher. June 2010
Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies
More informationFeature Matching + Indexing and Retrieval
CS 1699: Intro to Computer Vision Feature Matching + Indexing and Retrieval Prof. Adriana Kovashka University of Pittsburgh October 1, 2015 Today Review (fitting) Hough transform RANSAC Matching points
More informationIncremental Learning of Object Detectors Using a Visual Shape Alphabet
Incremental Learning of Object Detectors Using a Visual Shape Alphabet A. Opelt, A. Pinz & A. Zisserman CVPR 06 Presented by Medha Bhargava* * Several slides adapted from authors presentation, CVPR 06
More informationFitting: Voting and the Hough Transform April 23 rd, Yong Jae Lee UC Davis
Fitting: Voting and the Hough Transform April 23 rd, 2015 Yong Jae Lee UC Davis Last time: Grouping Bottom-up segmentation via clustering To find mid-level regions, tokens General choices -- features,
More informationLocal invariant features
Local invariant features Tuesday, Oct 28 Kristen Grauman UT-Austin Today Some more Pset 2 results Pset 2 returned, pick up solutions Pset 3 is posted, due 11/11 Local invariant features Detection of interest
More informationFind that! Visual Object Detection Primer
Find that! Visual Object Detection Primer SkTech/MIT Innovation Workshop August 16, 2012 Dr. Tomasz Malisiewicz tomasz@csail.mit.edu Find that! Your Goals...imagine one such system that drives information
More informationRecognition. Topics that we will try to cover:
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) Object classification (we did this one already) Neural Networks Object class detection Hough-voting techniques
More informationFuzzy based Multiple Dictionary Bag of Words for Image Classification
Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image
More information