Non-negative Matrix Factorization for Multimodal Image Retrieval

Similar documents
Non-negative Matrix Factorization for Multimodal Image Retrieval

Multimodal Information Spaces for Content-based Image Retrieval

NMF-based Multimodal Image Indexing for Querying by Visual Example

Collaborative Filtering for Netflix

An Empirical Comparison of Collaborative Filtering Approaches on Netflix Data

Cluster Analysis (b) Lijun Zhang

Recommender Systems New Approaches with Netflix Dataset

CSC 411 Lecture 18: Matrix Factorizations

Data Mining Techniques

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Performance Comparison of Algorithms for Movie Rating Estimation

CSE 158 Lecture 8. Web Mining and Recommender Systems. Extensions of latent-factor models, (and more on the Netflix prize)

General Instructions. Questions

A Weighted Non-negative Matrix Factorization for Local Representations Λ

CPSC 340: Machine Learning and Data Mining. Recommender Systems Fall 2017

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

COMP 465: Data Mining Recommender Systems

Information Retrieval: Retrieval Models

CSE 258 Lecture 8. Web Mining and Recommender Systems. Extensions of latent-factor models, (and more on the Netflix prize)

Progress Report: Collaborative Filtering Using Bregman Co-clustering

Data Mining Techniques

Document Summarization using Semantic Feature based on Cloud

DOCUMENT INDEXING USING INDEPENDENT TOPIC EXTRACTION. Yu-Hwan Kim and Byoung-Tak Zhang

Lecture Video Indexing and Retrieval Using Topic Keywords

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CSE 494: Information Retrieval, Mining and Integration on the Internet

Sparse Estimation of Movie Preferences via Constrained Optimization

Text Analytics (Text Mining)

Big-data Clustering: K-means vs K-indicators

arxiv: v1 [cs.lg] 3 Jan 2018

A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University

Property1 Property2. by Elvir Sabic. Recommender Systems Seminar Prof. Dr. Ulf Brefeld TU Darmstadt, WS 2013/14

CSE 6242 / CX October 9, Dimension Reduction. Guest Lecturer: Jaegul Choo

LRLW-LSI: An Improved Latent Semantic Indexing (LSI) Text Classifier

BINARY PRINCIPAL COMPONENT ANALYSIS IN THE NETFLIX COLLABORATIVE FILTERING TASK

Text Analytics (Text Mining)

A Brief Review of Representation Learning in Recommender 赵鑫 RUC

Minoru SASAKI and Kenji KITA. Department of Information Science & Intelligent Systems. Faculty of Engineering, Tokushima University

CSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo

Reviewer Profiling Using Sparse Matrix Regression

Large-Scale Face Manifold Learning

The Pre-Image Problem in Kernel Methods

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

Clustering K-means. Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, Carlos Guestrin

Enhancing Clustering by Exploiting Complementary Data Modalities in the Medical Domain

By Atul S. Kulkarni Graduate Student, University of Minnesota Duluth. Under The Guidance of Dr. Richard Maclin

BUAA AUDR at ImageCLEF 2012 Photo Annotation Task

Singular Value Decomposition, and Application to Recommender Systems

COSC6376 Cloud Computing Homework 1 Tutorial

Recommender Systems by means of Information Retrieval

ECS289: Scalable Machine Learning

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

Feature Selection for Image Retrieval and Object Recognition

Chapter 6: Information Retrieval and Web Search. An introduction

Computing Similarity between Cultural Heritage Items using Multimodal Features

Clustering and The Expectation-Maximization Algorithm

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

Part 11: Collaborative Filtering. Francesco Ricci

Recommendation Systems

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

Hybrid algorithms for recommending new items

A Hybrid Face Recognition Approach Using GPUMLib

Collaborative Filtering via Euclidean Embedding

CPSC 340: Machine Learning and Data Mining. Multi-Dimensional Scaling Fall 2017

Analysis and Latent Semantic Indexing

Compression of View Dependent Displacement Maps

Semi-Supervised Abstraction-Augmented String Kernel for bio-relationship Extraction

MULTIMEDIA ANALYTICS: SYNERGY BETWEEN HUMAN AND MACHINE BY VISUALIZATION

Content-based Dimensionality Reduction for Recommender Systems

Project Participants

Dimensionality Reduction using Relative Attributes

Collaborative Filtering Based on Iterative Principal Component Analysis. Dohyun Kim and Bong-Jin Yum*

Locality Preserving Projections (LPP) Abstract

CSE 547: Machine Learning for Big Data Spring Problem Set 2. Please read the homework submission policies.

Recommender Systems 6CCS3WSN-7CCSMWAL

Data Distortion for Privacy Protection in a Terrorist Analysis System

Latent Semantic Indexing

Information Retrieval. CS630 Representing and Accessing Digital Information. What is a Retrieval Model? Basic IR Processes

Concept Based Search Using LSI and Automatic Keyphrase Extraction

Music Recommendation with Implicit Feedback and Side Information

ImageCLEF 2011

vector space retrieval many slides courtesy James Amherst

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan

Bibliometrics: Citation Analysis

SCALABLE KNOWLEDGE BASED AGGREGATION OF COLLECTIVE BEHAVIOR

Pattern Spotting in Historical Document Image

Entity and Knowledge Base-oriented Information Retrieval

Machine Learning for OR & FE

A Novel Non-Negative Matrix Factorization Method for Recommender Systems

Additive Regression Applied to a Large-Scale Collaborative Filtering Problem

Unsupervised Clustering of Bitcoin Transaction Data

Unsupervised Learning

Courtesy of Prof. Shixia University

Online Learning for URL classification

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017

Community-Based Recommendations: a Solution to the Cold Start Problem

Extension Study on Item-Based P-Tree Collaborative Filtering Algorithm for Netflix Prize

Transcription:

Non-negative Matrix Factorization for Multimodal Image Retrieval Fabio A. González PhD Machine Learning 2015-II Universidad Nacional de Colombia F. González NMF for MM IR ML 2015-II 1 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 2 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 3 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 4 / 54

Content-Based Image Retrieval F. González NMF for MM IR ML 2015-II 5 / 54

Query by Visual Example F. González NMF for MM IR ML 2015-II 6 / 54

Low-level vs. High-level F. González NMF for MM IR ML 2015-II 7 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 8 / 54

Semantic Annotation using ML Source: Nuno Vasconcelos, UCSD, http://www.svcl.ucsd.edu/projects/qbse/ F. González NMF for MM IR ML 2015-II 9 / 54

An Example (1) F. González NMF for MM IR ML 2015-II 10 / 54

An Example (2) 1 Recall vs Precision Graph - Semantic models Histogram Intersection MetaFeatures Identity Kernel Sobel 0.8 0.6 Precision 0.4 0.2 0 0 0.2 0.4 0.6 0.8 1 Recall F. González NMF for MM IR ML 2015-II 11 / 54

Disadvantages Requires a training set with expert annotations, so it is a costly process F. González NMF for MM IR ML 2015-II 12 / 54

Disadvantages Requires a training set with expert annotations, so it is a costly process Does not scale for large semantic vocabularies F. González NMF for MM IR ML 2015-II 12 / 54

Disadvantages Requires a training set with expert annotations, so it is a costly process Does not scale for large semantic vocabularies The mapping from visual features to annotations may lose the visual richness F. González NMF for MM IR ML 2015-II 12 / 54

Scalability F. González NMF for MM IR ML 2015-II 13 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 14 / 54

Multimodality Text and images come naturally together in many documents Academic papers, books Newspapers, web pages Medical cases F. González NMF for MM IR ML 2015-II 15 / 54

Multimodal Retrieval Unstructured text associated to images may be used as semantic annotations Images and texts are complimentary information units Take advantage of interactions between both data modalities F. González NMF for MM IR ML 2015-II 16 / 54

Multimodal Retrieval Unstructured text associated to images may be used as semantic annotations Images and texts are complimentary information units Take advantage of interactions between both data modalities Problems: Text associated to images is not structured Unclear relationships between keywords and visual patterns Possible presence of noise in both data modalities F. González NMF for MM IR ML 2015-II 16 / 54

Multimodal Retrieval Unstructured text associated to images may be used as semantic annotations Images and texts are complimentary information units Take advantage of interactions between both data modalities Problems: Text associated to images is not structured Unclear relationships between keywords and visual patterns Possible presence of noise in both data modalities Retrieval scenarios: Cross-modal: find images based on a text query find text based on an image query (image annotation) Visual retrieval based on a visual query F. González NMF for MM IR ML 2015-II 16 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 17 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 18 / 54

The Netflix Competition Problem: prediction of user ratings for films (collaborative filtering) Y. Koren, R. Bell, and C. Volinsky, Matrix factorization techniques for recommender systems, Computer, vol. 42, no. 8, pp. 30-37, August 2009 F. González NMF for MM IR ML 2015-II 19 / 54

The Netflix Competition Problem: prediction of user ratings for films (collaborative filtering) Prize: $1 Million to the first algorithm able to improve Netflix own algorithm results by at least 10% Y. Koren, R. Bell, and C. Volinsky, Matrix factorization techniques for recommender systems, Computer, vol. 42, no. 8, pp. 30-37, August 2009 F. González NMF for MM IR ML 2015-II 19 / 54

The Netflix Competition Problem: prediction of user ratings for films (collaborative filtering) Prize: $1 Million to the first algorithm able to improve Netflix own algorithm results by at least 10% Won on Sept/21/2009 by BellKor s Pragmatic Chaos Y. Koren, R. Bell, and C. Volinsky, Matrix factorization techniques for recommender systems, Computer, vol. 42, no. 8, pp. 30-37, August 2009 F. González NMF for MM IR ML 2015-II 19 / 54

The Netflix Competition Problem: prediction of user ratings for films (collaborative filtering) Prize: $1 Million to the first algorithm able to improve Netflix own algorithm results by at least 10% Won on Sept/21/2009 by BellKor s Pragmatic Chaos Their approach used Matrix Factorization to build a latent-factor representation of users and movies Y. Koren, R. Bell, and C. Volinsky, Matrix factorization techniques for recommender systems, Computer, vol. 42, no. 8, pp. 30-37, August 2009 F. González NMF for MM IR ML 2015-II 19 / 54

The Latent-Factor Model movies factors movies r 11... r 1m q 11... q 1f p 11... p 1m users users..... =.......... r n1... r nm q n1... q nf p f 1... p fm factors R QP Q, P 0 n 5 10 5 m 1.7 10 4 {(i, j) r ij 0} 10 8 f 200 F. González NMF for MM IR ML 2015-II 20 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 21 / 54

Non-negative Matrix Factorization Problem: to find a factorization Optimization problem: X n m = W n r H r m min A,B X WH 2 s.t. W, H 0 is the Frobenius norm It is a non-convex optimization problem Solution alternatives: Gradient descendent methods Multiplicative updating rules F. González NMF for MM IR ML 2015-II 22 / 54

Multiplicative Rules Optimization problem: min W,H X WH 2 s.t. W, H 0 Incremental optimization: H aµ H aµ (W T X ) aµ (W T WH) aµ W ia W ia (XH T ) αµ (WHH T ) αµ F. González NMF for MM IR ML 2015-II 23 / 54

Divergence Optimization Optimization problem: min W,H D(X WH) = ij s.t. W, H 0 Multiplicative Rules: ( X ij log ) X ij (WH) ij X ij + (WH) ij i H aµ H W iax iµ /(WH) iµ aµ i W ia µ W ia W H aµx iµ /(WH) iµ ia µ H aµ F. González NMF for MM IR ML 2015-II 24 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 25 / 54

PCA and SVD Problem: X n m = W n r H r m Principal Component Analysis (PCA) X = UΣV W = UΣ 1 2, H = Σ 1 2 V PCA = SVD keeping the best Eigenvectors Columns of U are orthonormal There is not restriction on sign. D. D. Lee and H. S. Seung, Learning the parts of objects by non-negative matrix factorization, Nature, vol. 401, no. 6755, pp. 788-791, October 1999 F. González NMF for MM IR ML 2015-II 26 / 54

Latent Semantic Indexing (LSI) documents factors documents x 11... x 1m w 11... w 1r h 11... h 1m terms terms..... =.......... x n1... x nm w n1... w nr h r1... h rm X W H factors Documents are represented by the frequency of keywords (terms) Uses SVD to find the factorization Factors = semantic concepts Columns of W are orthonormal F. González NMF for MM IR ML 2015-II 27 / 54

NMF vs LSI W. Xu, X. Liu, and Y. Gong, Document clustering based on non-negative matrix factorization, in SIGIR 03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval. New York, NY, USA: ACM, 2003, pp. 267-273 F. González NMF for MM IR ML 2015-II 28 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 29 / 54

NMF for Multimodal Learning F. González NMF for MM IR ML 2015-II 30 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 31 / 54

Multimodal Representation F. González NMF for MM IR ML 2015-II 32 / 54

Bag-of-Features Image Representation F. González NMF for MM IR ML 2015-II 33 / 54

Semantic Space: Multimodal Latent Indexing (I) Objects are described by terms in a textual vocabulary and a visual vocabulary F. González NMF for MM IR ML 2015-II 34 / 54

Semantic Space: Multimodal Latent Indexing (I) Objects are described by terms in a textual vocabulary and a visual vocabulary Objects are mapped to a latent (semantic) space where objects are represented by a set of latent factors F. González NMF for MM IR ML 2015-II 34 / 54

Semantic Space: Multimodal Latent Indexing (I) Objects are described by terms in a textual vocabulary and a visual vocabulary Objects are mapped to a latent (semantic) space where objects are represented by a set of latent factors NMF is used to build the latent representation F. González NMF for MM IR ML 2015-II 34 / 54

Semantic Space: Multimodal Latent Indexing (I) Objects are described by terms in a textual vocabulary and a visual vocabulary Objects are mapped to a latent (semantic) space where objects are represented by a set of latent factors NMF is used to build the latent representation Three main tasks: Multimodal clustering Automatic image annotation Image retrieval F. González NMF for MM IR ML 2015-II 34 / 54

Semantic Space: Multimodal Latent Indexing (II) F. González NMF for MM IR ML 2015-II 35 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 36 / 54

Multimodal Clustering F. González NMF for MM IR ML 2015-II 37 / 54

Dual Multimodal Clustering F. González NMF for MM IR ML 2015-II 38 / 54

Semantic Space Visualization (I) F. González NMF for MM IR ML 2015-II 39 / 54

Semantic Space Visualization (II) F. González NMF for MM IR ML 2015-II 40 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 41 / 54

Image Annotation F. González NMF for MM IR ML 2015-II 42 / 54

Annotation Results F. González NMF for MM IR ML 2015-II 43 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 44 / 54

Image Retrieval Scenario: visual retrieval based on a visual query F. González NMF for MM IR ML 2015-II 45 / 54

Image Retrieval Scenario: visual retrieval based on a visual query Images are represented in a latent space using NMF F. González NMF for MM IR ML 2015-II 45 / 54

Image Retrieval Scenario: visual retrieval based on a visual query Images are represented in a latent space using NMF Some images in the database may not have text content associated F. González NMF for MM IR ML 2015-II 45 / 54

Image Retrieval Scenario: visual retrieval based on a visual query Images are represented in a latent space using NMF Some images in the database may not have text content associated The image query is projected the latent space F. González NMF for MM IR ML 2015-II 45 / 54

Image Retrieval Scenario: visual retrieval based on a visual query Images are represented in a latent space using NMF Some images in the database may not have text content associated The image query is projected the latent space Image are retrieved according to their latent space similarity F. González NMF for MM IR ML 2015-II 45 / 54

Retrieval Performance F. González NMF for MM IR ML 2015-II 46 / 54

Semantic Space Dimension F. González NMF for MM IR ML 2015-II 47 / 54

Outline 1 The Problem Content-based image retrieval Semantic image retrieval Multimodal image retrieval 2 Matrix Factorization The Netflix Prize Non-negative matrix factorization NMF vs SVD 3 NMF for Multimodal Learning Semantic space Multimodal clustering Image annotation Multimodal retrieval Application to histology images F. González NMF for MM IR ML 2015-II 48 / 54

Histology Image Retrieval F. González NMF for MM IR ML 2015-II 49 / 54

Semantic Back Projection F. González NMF for MM IR ML 2015-II 50 / 54

Performance F. González NMF for MM IR ML 2015-II 51 / 54

Performance F. González NMF for MM IR ML 2015-II 52 / 54

Retrieval Performance Example F. González NMF for MM IR ML 2015-II 53 / 54

Thanks! fagonzalezo@unal.edu.co http://dis.unal.edu.co/~fgonza F. González NMF for MM IR ML 2015-II 54 / 54