Class 5: Attributes and Semantic Features

Similar documents
Shifting from Naming to Describing: Semantic Attribute Models. Rogerio Feris, June 2014

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications. Tutorial at CVPR 2014 June 23rd, 1:00pm-5:00pm, Columbus, OH

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016

Experiments of Image Retrieval Using Weak Attributes

End-to-End Localization and Ranking for Relative Attributes

Interactive Image Search with Attributes

Visual Recognition and Search

Describable Visual Attributes for Face Verification and Image Search

An Exploration of Computer Vision Techniques for Bird Species Classification

The Caltech-UCSD Birds Dataset

Beyond Bags of Features

Part Localization by Exploiting Deep Convolutional Networks

Object Recognition. Lecture 11, April 21 st, Lexing Xie. EE4830 Digital Image Processing

arxiv: v1 [cs.mm] 12 Jan 2016

Discriminative classifiers for image recognition

By Suren Manvelyan,

Additional Remarks on Designing Category-Level Attributes for Discriminative Visual Recognition

Mixtures of Gaussians and Advanced Feature Encoding

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Part-based and local feature models for generic object recognition

CS 231A Computer Vision (Fall 2011) Problem Set 4

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Similarity Comparisons for Interactive Fine-Grained Categorization

POOF: Part-Based One-vs-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation

Mining Discriminative Adjectives and Prepositions for Natural Scene Recognition

EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari

Supervised learning. y = f(x) function

AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH

Learning Compact Visual Attributes for Large-scale Image Classification

Designing Category-Level Attributes for Discriminative Visual Recognition

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Class 9 Action Recognition

Course Administration

Exemplar-specific Patch Features for Fine-grained Recognition

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

24 hours of Photo Sharing. installation by Erik Kessels

Recognition Tools: Support Vector Machines

Additional Remarks on Designing Category-Level Attributes for Discriminative Visual Recognition

Object recognition (part 1)

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Supervised learning. y = f(x) function

Columbia University High-Level Feature Detection: Parts-based Concept Detectors

Part-Based Models for Object Class Recognition Part 2

Part-Based Models for Object Class Recognition Part 2

Object Classification for Video Surveillance

Lecture 12 Recognition. Davide Scaramuzza

String distance for automatic image classification

Segmentation. Bottom up Segmentation Semantic Segmentation

Tag Recommendation for Photos

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

What do we mean by recognition?

EE795: Computer Vision and Intelligent Systems

Beyond Bags of features Spatial information & Shape models

CS230: Lecture 3 Various Deep Learning Topics

Query-Specific Visual Semantic Spaces for Web Image Re-ranking

Dimensionality Reduction using Relative Attributes

The attributes of objects. D.A. Forsyth, UIUC channelling Derek Hoiem, UIUC, with Ali Farhadi, Ian Endres, Gang Wang all of UIUC

Lecture 12 Recognition

Geodesic Flow Kernel for Unsupervised Domain Adaptation

Part based models for recognition. Kristen Grauman

Object Recognition and Augmented Reality

PICODES: Learning a Compact Code for Novel-Category Recognition

Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

Attributes and More Crowdsourcing

Dr. Enrique Cabello Pardos July

arxiv: v2 [cs.cv] 13 Apr 2018

Semantic image search using queries

FACE RECOGNITION USING FUZZY NEURAL NETWORK

Face Quality Assessment System in Video Sequences

Compare and Contrast: Learning Prominent Differences in Relative Attributes. Steven Ziqiu Chen

Window based detectors

Image Retrieval Using Attribute Enhanced Sparse Code Words

Metric learning approaches! for image annotation! and face recognition!

P-CNN: Pose-based CNN Features for Action Recognition. Iman Rezazadeh

Predicting ground-level scene Layout from Aerial imagery. Muhammad Hasan Maqbool

High Quality Image Representation via SR and IR: CA Based Approach

Tensor Decomposition of Dense SIFT Descriptors in Object Recognition

Selection of Scale-Invariant Parts for Object Class Recognition

ShadowDraw Real-Time User Guidance for Freehand Drawing. Harshal Priyadarshi

Storyline Reconstruction for Unordered Images

Augmented Attribute Representations

MINE THE FINE: FINE-GRAINED FRAGMENT DISCOVERY

Object detection as supervised classification

CS 231A Computer Vision (Fall 2012) Problem Set 4

arxiv: v2 [cs.cv] 23 May 2016

Pattern recognition (3)

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

Multi-Attribute Spaces: Calibration for Attribute Fusion and Similarity Search

Bias-Variance Trade-off (cont d) + Image Representations

Neural Networks. Single-layer neural network. CSE 446: Machine Learning Emily Fox University of Washington March 10, /10/2017

Object recognition (part 2)

Discovering Localized Attributes for Fine-grained Recognition

Face2Face Comparing faces with applications Patrick Pérez. Inria, Rennes 2 Oct. 2014

Every Picture Tells a Story: Generating Sentences from Images

Video annotation based on adaptive annular spatial partition scheme

Transcription:

Class 5: Attributes and Semantic Features Rogerio Feris, Feb 21, 2013 EECS 6890 Topics in Information Processing Spring 2013, Columbia University http://rogerioferis.com/visualrecognitionandsearch

Project Report Thanks for sending the project proposals! Project update presentations (10 min per group) March 14 April 11 Details will be provided in the course website

Plan for Today Introduction to Semantic Features Attribute-based Classification and Search Attributes for Fine-Grained Classification Relative Attributes Project Proposal Presentations

Semantic Features Use the scores of semantic classifiers as high-level features Input Image Off-the-shelf Classifiers Sky Classifier Sand Classifier Water Classifier Score Score Score Semantic Features Compact / powerful descriptor with semantic meaning (allows explaining the decision) Beach Classifier

Semantic Features (Frame-Level) Illustration of Early IBM work (multimedia community) describing this concept [John Smith et al, Multimedia Semantic Indexing Using Model Vectors, ICME 2003] Concatenation / Dimensionality Reduction

Semantic Features (Frame-level) System evolved to the IBM Multimedia Analysis and Retrieval System (IMARS) Discriminative semantic basis [Rong Yan et al, Model-Shared Subspace Boosting for Multi-label Classification, KDD 2007] Ensemble Learning Rapid event modeling, e.g., accident with highspeed skidding

Classemes (Frame-level) Descriptor is formed by concatenating the outputs of weakly trained classifiers called classemes (trained with noisy labels) [L. Torresani et al, Efficient Object Category Recognition Using Classemes, ECCV 2010] Images used to train the table classeme (from Google image search) Noisy Labels

Classemes (Frame-level) Compact and Efficient Descriptor, useful for large-scale classification Features are not really semantic!

Semantic Features (Object Level) Object Bank [Li-Jia Li et al, Object Bank: A High-Level Image Representation for Scene Classification and Semantic Feature Sparsification] http://vision.stanford.edu/projects/objectbank/ State-of-the-art scene classification results (~7 seconds per image)

Semantic Attributes Naming Describing Bald Beard? Red Shirt Modifiers rather than (or in addition to) nouns Semantic properties that are shared among objects Attributes are category independent and transferrable

Attribute-Based Search

People Search in Surveillance Videos Traditional Approaches: Face Recognition ( Naming ) Face recognition is very challenging under lighting changes, pose variation, and lowresolution imagery (typical conditions in surveillance scenarios) Attribute-based People Search ( Describing ) [Vaquero et al, Attribute-based People Search in Surveillance Environments, WACV 2009] Rather than relying on face recognition only, a complementary people search framework based on semantic attributes is provided Query Example: Show me all bald people at the 42 nd street station last month with dark skin, wearing sunglasses, wearing a red jacket

People Search in Surveillance Videos

People Search in Surveillance Videos

People Search in Surveillance Videos People Search based on textual descriptions - It does not require training images for the target suspect. Robustness: attribute detectors are trained using lots of training images covering different lighting conditions, pose variation, etc. Works well in low-resolution imagery (typical in video surveillance scenarios)

People Search in Surveillance Videos Modeling attribute correlations [Siddiquie, Feris and Davis, Image Ranking and Retrieval Based on Multi-Attribute Queries, CVPR 2011]

Attribute-Based Classification

Attribute-based Classification Recognition of Unseen Classes (Zero-Shot Learning) [Lampert et al, Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer, CVPR 2009] 1) Train semantic attribute classifiers 2) Obtain a classifier for an unseen object (no training samples) by just specifying which attributes it has

Attribute-based Classification Unseen categories Flat multi-class classification Unseen categories Semantic Attribute Classifiers Attribute-based classification

Attribute-based Classification Face verification [Kumar et al, ICCV 2009] Action recognition [Liu al, CVPR2011] Animal Recognition [Lampert et al, CVPR 2009] Person Re-identification [Layne et al, BMVC 2012] Bird Categorization [Farrell et al, ICCV 2011] Many more! Significant growth in the past few years

Attribute-based Classification Note: Several recent methods use the term attributes to refer to non-semantic model outputs In this case attributes are just mid-level features, like PCA, hidden layers in neural nets, (non-interpretable splits)

Attribute-based Classification http://rogerioferis.com/visualrecognitionandsearch/resources.html

Attributes for Fine-Grained Categorization

Fine-Grained Categorization

Fine-Grained Categorization

Fine-Grained Categorization

Fine-Grained Categorization Visipedia (http://http://visipedia.org/) Machines collaborating with humans to organize visual knowledge, connecting text to images, images to text, and images to images Easy annotation interface for experts (powered by computer vision) Visual Query: Fine-grained Bird Categorization Picture credit: Serge Belongie

Fine-Grained Categorization African Is it an African or Indian Elephant? Indian Example-based Fine-Grained Categorization is Hard!! Slide Credit: Christoph Lampert

Fine-Grained Categorization African Is it an African or Indian Elephant? Indian Larger Ears Smaller Ears Visual distinction of subordinate categories may be quite subtle, usually based on Parts and Attributes

Fine-Grained Categorization Standard classification methods may not be suitable because the variation between classes is small [B. Yao, CVPR 2012] Codebook and intra-class variation is still high.

Fine-Grained Categorization Humans rely on field guides! Field guides usually refer to parts and attributes of the object Slide Credit: Pietro Perona

Fine-Grained Categorization [Branson et al, Visual Recognition with Humans in the Loop, ECCV 2010]

Fine-Grained Categorization [Branson et al, Visual Recognition with Humans in the Loop, ECCV 2010] Computer vision reduces the amount of human-interaction (minimizes the number of questions)

Fine-Grained Categorization [Wah et al, Multiclass Recognition and Part Localization with Humans in the Loop, ICCV 2011] Localized part and attribute detectors. Questions include asking the user to localize parts.

Fine-Grained Categorization http://www.vision.caltech.edu/visipedia/cub-200-2011.html

Fine-Grained Categorization Video Demo: http://www.youtube.com/watch?v=_rekvqndxza

Like a normal field guide that you can search and sort and with visual recognition See N. Kumar et al, "Leafsnap: A Computer Vision System for Automatic Plant Species Identification, ECCV 2012

Nearly 1 million downloads 40k new users per month 100k active users 1.7 million images taken 100k new images/month 100k users with > 5 images Users from all over the world Botanists, educators, kids, hobbyists, photographers, Slide Credit: Neeraj Kumar

Fine-Grained Categorization Check the fine-grained visual categorization workshop: http://www.fgvc.org/

Relative Attributes

Relative Attributes [Parikh & Grauman, Relative Attributes, ICCV 2011] Smiling??? Not smiling Natural??? Not natural Slide credit: Parikh &Grauman

Learning Relative Attributes For each attribute e.g., openness Supervision consists of: Ordered pairs Similar pairs Slide credit: Parikh &Grauman

Learning Relative Attributes Learn a ranking function Image features Learned parameters that best satisfies the constraints: Slide credit: Parikh &Grauman

Learning Relative Attributes Max-margin learning to rank formulation 2 1 Based on [Joachims 2002] 6 4 5 3 Rank Margin Image Relative Attribute Score Slide credit: Parikh &Grauman

Relative Zero-Shot Learning Each image is converted into a vector of relative attribute scores indicating the strength of each attribute A Gaussian distribution for each category is built in the relative attribute space. The distribution of unseen categories is estimated based on the specified constraints and the distributions of seen categories Max-likelihood is then used for classification Blue: Seen class Green: Unseen class

Relative Image Description Slide credit: Parikh &Grauman

Whittle Search Slide credit: Kristen Grauman

http://rogerioferis.com/partsandattributes/ http://pub.ist.ac.at/~chl/pna2012/

Summary Semantic attribute classifiers can be useful for: Describing images of unknown objects [Farhadi et al, CVPR 2009] Recognizing unseen classes [Lampert et al, CVPR 2009] Reducing dataset bias (trained across classes) Effective object search in surveillance videos [Vaquero et al, WACV 2009] Compact descriptors / Efficient image retrieval [Douze et al, CVPR 2011] Fine-grained object categorization [Wah et al, ICCV 2011] Face verification [Kumar et al, 2009], Action recognition [Liu et al, CVPR 2011], Person re-identification [Layne et al, BMVC 2012] and other classification tasks. Other applications, such as sentence generation from images [Kulkarni et al, CVPR 2011], image aesthetics prediction [Dhar et al CVPR 2011],

Summary Extensive annotation may be required for attribute classifiers Class-attribute relations may be automatically extracted from textual sources [Rohrbach et al, What Helps Where And Why? Semantic Relatedness for Knowledge Transfer", CVPR 2010]; [Berg et al, Automatic Attribute Discovery and Characterization from Noisy Web Data, ECCV 2008]. Semantic Attributes may not be discriminative Various methods combine semantic attributes with discriminative attributes (non-semantic) for classification (e.g., [Farhadi et al, CVPR 2009]). Construction of nameable + discriminative attributes has also been proposed by [Parikh & Grauman, Interactively Building a Discriminative Vocabulary of Nameable Attributes, CVPR 2011]