Dimensionality Reduction using Relative Attributes

Similar documents
Pattern Recognition Letters Authorship Confirmation

CS 1674: Intro to Computer Vision. Attributes. Prof. Adriana Kovashka University of Pittsburgh November 2, 2016

Interactive Image Search with Attributes

Experiments of Image Retrieval Using Weak Attributes

Tag Recommendation for Photos

AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH

A Hierarchial Model for Visual Perception

Discriminative classifiers for image recognition

Attributes and More Crowdsourcing

Describable Visual Attributes for Face Verification and Image Search

Assessment of Dimensionality Reduction Based on Communication Channel Model; Application to Immersive Information Visualization

Can Similar Scenes help Surface Layout Estimation?

24 hours of Photo Sharing. installation by Erik Kessels

Every Picture Tells a Story: Generating Sentences from Images

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012

Facial expression recognition using shape and texture information

End-to-End Localization and Ranking for Relative Attributes

Attributes. Computer Vision. James Hays. Many slides from Derek Hoiem

Visual localization using global visual features and vanishing points

Interactively Guiding Semi-Supervised Clustering via Attribute-based Explanations

Class 5: Attributes and Semantic Features

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Semi-Supervised Clustering with Partial Background Information

Fantope Regularization in Metric Learning

DEPT: Depth Estimation by Parameter Transfer for Single Still Images

Supplementary Material for Ensemble Diffusion for Retrieval

Robotics Programming Laboratory

Discriminative Clustering for Image Co-segmentation

Facial Expression Recognition Using Non-negative Matrix Factorization

Latent Variable Models for Structured Prediction and Content-Based Retrieval

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Improved Spatial Pyramid Matching for Image Classification

Enhancement of Text Based Image Retrieval to Expedite Image Ranking

Statistics of Natural Image Categories

Sketchable Histograms of Oriented Gradients for Object Detection

Image Resizing Based on Gradient Vector Flow Analysis

Aggregating Descriptors with Local Gaussian Metrics

Recognize Complex Events from Static Images by Fusing Deep Channels Supplementary Materials

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Affine-invariant scene categorization

DEPT: Depth Estimation by Parameter Transfer for Single Still Images

Contexts and 3D Scenes

Using the Kolmogorov-Smirnov Test for Image Segmentation

A Weighted Non-negative Matrix Factorization for Local Representations Λ

Nonnegative Matrix Factorization with Orthogonality Constraints

Contexts and 3D Scenes

K-Means Clustering Using Localized Histogram Analysis

Non-negative Matrix Factorization for Multimodal Image Retrieval

Metric learning approaches! for image annotation! and face recognition!

Beyond Bags of Features

Part-based and local feature models for generic object recognition

ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj

Segmentation and Grouping April 19 th, 2018

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS

Local Features and Bag of Words Models

Video annotation based on adaptive annular spatial partition scheme

Supervised learning. y = f(x) function

Clustering and Visualisation of Data

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Spatial Hierarchy of Textons Distributions for Scene Classification

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin

Learning Compact Visual Attributes for Large-scale Image Classification

A Scene Recognition Algorithm Based on Covariance Descriptor

Feature Selection Using Modified-MCA Based Scoring Metric for Classification

Cluster Analysis (b) Lijun Zhang

A ew Algorithm for Community Identification in Linked Data

Efficient Acquisition of Human Existence Priors from Motion Trajectories

Large-Scale Scene Classification Using Gist Feature

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK

Comparing Local Feature Descriptors in plsa-based Image Models

Development in Object Detection. Junyuan Lin May 4th

A GENERIC FACE REPRESENTATION APPROACH FOR LOCAL APPEARANCE BASED FACE VERIFICATION

Bilevel Sparse Coding

Short Survey on Static Hand Gesture Recognition

Limitations of Matrix Completion via Trace Norm Minimization

String distance for automatic image classification

Convex Optimization. Lijun Zhang Modification of

Shifting from Naming to Describing: Semantic Attribute Models. Rogerio Feris, June 2014

Nearest Clustering Algorithm for Satellite Image Classification in Remote Sensing Applications

CS6716 Pattern Recognition

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Enhancing Clustering by Exploiting Complementary Data Modalities in the Medical Domain

AM 221: Advanced Optimization Spring 2016

A Miniature-Based Image Retrieval System

Image Retrieval Based on its Contents Using Features Extraction

Kernel-based online machine learning and support vector reduction

Mutual Information Based Codebooks Construction for Natural Scene Categorization

Query-Specific Visual Semantic Spaces for Web Image Re-ranking

arxiv: v2 [cs.cv] 27 Nov 2017

Linear Discriminant Analysis for 3D Face Recognition System

Discovering Localized Attributes for Fine-grained Recognition

PICODES: Learning a Compact Code for Novel-Category Recognition

Part based models for recognition. Kristen Grauman

Learning to Recognize Faces in Realistic Conditions

A modified and fast Perceptron learning rule and its use for Tag Recommendations in Social Bookmarking Systems

Rushes Video Segmentation Using Semantic Features

Object and Action Detection from a Single Example

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

Transcription:

Dimensionality Reduction using Relative Attributes Mohammadreza Babaee 1, Stefanos Tsoukalas 1, Maryam Babaee Gerhard Rigoll 1, and Mihai Datcu 1 Institute for Human-Machine Communication, Technische Universität München {reza.babaee,rigoll}@tum.de, s.tsoukalas@mytum.de Computer Engineering Dept. University of Isfahan, Iran babaee@eng.ui.ac.ir The Remote Sensing Technology Institute (IMF), German Aerospace Center mihai.datcu@dlr.de 1 Introduction Visual attributes are high-level semantic description of visual data that are close to the language of human. They have been intensively used in various applications such as image classification [1,], active learning [,], and interactive search []. However, the usage of attributes in dimensionality reduction has not been considered yet. In this work, we propose to utilize relative attributes as semantic cues in dimensionality reduction. To this end, we employ Non-negative Matrix Factorization () [] constrained by embedded relative attributes to come up with a new algorithm for dimensionality reduction, namely attribute regularized (A). Approach We assume that X R D N denotes N data points (e.g., images) represented by D dimensional low-level feature vectors. The decomposes the non-negative matrix X into two non-negative matrices U R D K and V R N K such that the multiplication of U and V approximates the original matrix X. Here, U represents the bases and V contains the coefficients, which are considered as new representation of the original data. The objective function is: F = X UV T F s.t. U = [u ik ] 0 V = [v jk ] 0. (1) Additionally, we assume that M semantic attributes have been predefined for the data and the relative attributes of each image are available. Precisely, the matrix of relative attributes, Q R M N, has been learned by some ranking function (e,.g, ranksvm). Intuitively, those images which own similar relative attributes have similar semantic contents and therefore belong to the same semantic class. This concept can be formulated as a regularizer to be added to the

Mohammadreza Babaee et al. main objective function. This information can be formulated in a regularization term as R = α Q AV T, () where V = [v 1,..., v N ] T R N K and the matrix A R M K. The matrix A linearly transforms and scales the vectors in the new representation in order to obtain the best fit for the matrix Q. The matrix A is allowed to take negative values and is computed as part of the minimization. We arrive at the following minimization problem: min F = X UV T + α Q AV T s.t. U = [u ik] 0 V = [v jk ] 0. ().1 Update rules For the derivation of the update rules we expand the objective to O =Tr ( XX T) Tr ( XV U T) + Tr ( UV T V U T ) () + αtr ( QQ T) αtr ( QV A T) + αtr ( AV T V A T ) and introduce Lagrange multipliers Φ = [φ ik ], Ψ = [ψ jk ] for the constraints [u ik ] 0, [v jk ] 0 respectively. Adding the Lagrange multipliers and ignoring the constant terms leads to the Lagrangian: L = Tr ( XV U T) + Tr ( UV T V U T ) + Tr (ΦU) + Tr (ΨV ) αtr ( QV A T) + αtr ( AV T V A T ). () The partial derivatives of L with respect to U, V and A are: U = XV + UV T V + Φ V = XT U + V U T U αq T A + αv A T A + Ψ () A = QV + AV T V For the derivation of the update rules for U and V we apply the KKTconditions φ ik u ik = 0, ψ jk v jk = 0 []. For A the update rules can be derived directly by setting its derivative of the Lagrangian to 0. Thus, we arrive at the following equations: [XV ] ik u ik u ik [UV T V ] ik () [X T U + α(v A T A) + α(q T A) + ] jk v jk v jk [V U T U + α(v A T A) + + α(q T A) ] jk () A QV (V T V ) 1 (9)

Dimensionality Reduction using Relative Attributes where for a matrix M we define M +, M as M + = ( M + M)/ and M = ( M M)/. The newly introduced terms depend only on the variables V and A. The proof of convergence for the update rules for V and A can be found here. Experiments We perform our experiments by applying the proposed method on two borrowed datasets from [], namely Outdoor Scene Recognition (OSR) and Public Figure Face Database (PubFig). The OSR dataset contains images from categories and the PubFig contains images from different individuals. The OSR images are represented by 1-dimensional gist [] features and PubFig images are represented by a concatenation of gist descriptor and a -dimensional Lab color histogram []. We also utilize the learned relative attributes for both datasets from []. First, we reduce the dimensionality of the data using the proposed method (A) and also, and. Then, we apply K-Means clustering algorithm on the new representation of the data and also on the original data. We perform the experiments with different number of classes, k, extracted from each dataset. In order to obtain representative results, we repeat the experiments times for each k, by selecting a random subset of k classes from the dataset and computing the average results. For the dimensionality reduction techniques (i.e.,, and A), we always set the new dimension equal to the number of classes. We use two metrics to evaluate the performance of the compared algorithms, namely accuracy (AC) and normalized mutual information (nmi) [9]. The K-Means runs 0 times in each experiment and the best result are selected. In A, the regularization parameter is chosen by running cross-validation on each dataset. The results of our experiments are depicted in Figure 1. The accuracy and normalized mutual information of clustering results for OSR dataset are depicted in Figure 1a and 1b, respectively. Figures 1c and 1d show the accuracy and mutual information of clustering results for PubFig dataset, respectively. As it can be seen, the proposed method that utilizes the relative attribute outperforms largely the other techniques in both datasets. For PubFig dataset we even achieve % % accuracy. Additionally, we reduce the dimensionality of both datasets to D for visualization (see Figures a and b). Here, it is clearly observable that those image which share similar attributes are located closely. For instance, in OSR dataset, the images sharing openness attribute are in left down part of layout. Additionally, the plots of convergence rate of the proposed algorithm applied on both OSR and PubFig datasets are represented in Figures a and b, respectively. The convergence plots show that the algorithm converges after 0 iterations, which means the running time is quite small. The experimental results confirm, that the proposed method learns the bases with different semantic attributes. http://www.mmk.ei.tum.de/%erez/convergenceproof.pdf

Mohammadreza Babaee et al. A 0 0 A Mutual Information [%] 0 Accuracy [%] Mutual Information [%] Accuracy [%] A 0 0 0 0 A 0 (c) (d) Fig. 1: Clustering results on new representation computed by,, A and original data evaluated by accuracy (AC) and normalized mutual information (nmi). and show the AC, nmi for the OSR dataset, respectively. (c) and (d) show the AC and nmi for the PubFig dataset, respectively. Fig. : D visualization of the datasets computed by the proposed method (A); OSR dataset; PubFig dataset.. 1. Objective Value Objective Value... 1. 1. 1.. 1.. 0 0 0 0 Iteration # 0 0 0. 0 0 0 0 Iteration # 0 0 Fig. : Convergence plot of the proposed algorithm applied on OSR dataset and PubFig dataset.

Dimensionality Reduction using Relative Attributes References 1. Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: Computer Vision, 009 IEEE 1th International Conference on, IEEE (009). Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: Computer Vision and Pattern Recognition, 009. CVPR 009. IEEE Conference on, IEEE (009) 1 1. Parikh, D., Grauman, K.: Relative attributes. In: Computer Vision (ICCV), 011 IEEE International Conference on, IEEE (011). Kovashka, A., Vijayanarasimhan, S., Grauman, K.: Actively selecting annotations among objects and attributes. In: Computer Vision (ICCV), 011 IEEE International Conference on, IEEE (011) 1. Kovashka, A., Parikh, D., Grauman, K.: Whittlesearch: Image search with relative attribute feedback. In: Computer Vision and Pattern Recognition (CVPR), 01 IEEE Conference on, IEEE (01) 9 9. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 01() (1999) 91. Boyd, S., Vandenberghe, L.: Convex optimization. Cambridge university press (009). Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. International journal of computer vision () (001) 1 1 9. Xu, W., Liu, X., Gong, Y.: Document clustering based on non-negative matrix factorization. In: Proceedings of the th annual international ACM SIGIR conference on Research and development in informaion retrieval, ACM (00)