ACM MM Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang

Similar documents
Graph-based Techniques for Searching Large-Scale Noisy Multimedia Data

Tag Based Image Search by Social Re-ranking

PRISM: Concept-preserving Social Image Search Results Summarization

Visual Query Suggestion

over Multi Label Images

NUS-WIDE: A Real-World Web Image Database from National University of Singapore

Joint Inference in Image Databases via Dense Correspondence. Michael Rubinstein MIT CSAIL (while interning at Microsoft Research)

Extraction of Web Image Information: Semantic or Visual Cues?

Content is Still King: The Effect of Neighbor Voting Schemes on Tag Relevance for Social Image Retrieval

An Efficient Methodology for Image Rich Information Retrieval

Analysis: TextonBoost and Semantic Texton Forests. Daniel Munoz Februrary 9, 2009

ECS 289H: Visual Recognition Fall Yong Jae Lee Department of Computer Science

Leveraging flickr images for object detection

Topic Diversity Method for Image Re-Ranking

Input. Output. Problem Definition. Rectified stereo image pair All correspondences lie in same scan lines

A REVIEW ON SEARCH BASED FACE ANNOTATION USING WEAKLY LABELED FACIAL IMAGES

Fashion Analytics and Systems

Supervised Hashing for Image Retrieval via Image Representation Learning

Exploiting noisy web data for largescale visual recognition

Web-Scale Image Search and Their Applications

Speaker: Ming-Ming Cheng Nankai University 15-Sep-17 Towards Weakly Supervised Image Understanding

Saliency detection on sampled images for tag ranking

Multimedia Data Management M

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

Canonical Image Selection for Large-scale Flickr Photos using Hadoop

Minghai Liu, Rui Cai, Ming Zhang, and Lei Zhang. Microsoft Research, Asia School of EECS, Peking University

Recommender Systems New Approaches with Netflix Dataset

TagProp: Discriminative Metric Learning in Nearest Neighbor Models for Image Annotation

Nonparametric Label-to-Region by Search

Multi-Stage Rocchio Classification for Large-scale Multilabeled

Neurocomputing 105 (2013) Contents lists available at SciVerse ScienceDirect. Neurocomputing. journal homepage:

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets

A Bayesian Approach to Hybrid Image Retrieval

Survey on Recommendation of Personalized Travel Sequence

PERSONALIZED TAG RECOMMENDATION

Supervised Models for Multimodal Image Retrieval based on Visual, Semantic and Geographic Information

Experiments of Image Retrieval Using Weak Attributes

Supervised Reranking for Web Image Search

Image Similarity Measurements Using Hmok- Simrank

Image Annotation by k NN-Sparse Graph-based Label Propagation over Noisily-Tagged Web Images

Deep Matrix Factorization for Social Image Tag Refinement and Assignment

ANovel Approach to Collect Training Images from WWW for Image Thesaurus Building

Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention

Multimodal Information Spaces for Content-based Image Retrieval

Deep condolence to Professor Mark Everingham

Welcome to the class of Web Information Retrieval. Min ZHANG

Semantic Graph Construction for Weakly-Supervised Image Parsing

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

Search Based Face Annotation Using Weakly Labeled Facial Images

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley

Content Based Image Retrieval system with a combination of Rough Set and Support Vector Machine

AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS

MULTIMEDIA ANALYTICS: SYNERGY BETWEEN HUMAN AND MACHINE BY VISUALIZATION

You are Who You Know and How You Behave: Attribute Inference Attacks via Users Social Friends and Behaviors

arxiv: v1 [cs.mm] 12 Jan 2016

INFORMATION MANAGEMENT FOR SEMANTIC REPRESENTATION IN RANDOM FOREST

CHAPTER 6 PROPOSED HYBRID MEDICAL IMAGE RETRIEVAL SYSTEM USING SEMANTIC AND VISUAL FEATURES

Jianyong Wang Department of Computer Science and Technology Tsinghua University

Adaptive Binary Quantization for Fast Nearest Neighbor Search

TriRank: Review-aware Explainable Recommendation by Modeling Aspects

A REVIEW ON IMAGE RETRIEVAL USING HYPERGRAPH

Large-Scale Semantics for Image and Video Retrieval

Modeling and Analyzing 3D Shapes using Clues from 2D Images. Minglun Gong Dept. of CS, Memorial Univ.

Large scale object/scene recognition

Keywords TBIR, Tag completion, Matrix completion, Image annotation, Image retrieval

Image Tag Completion via Image-Specific and Tag-Specific Linear Sparse Reconstructions

Approximate Nearest Neighbor Search. Deng Cai Zhejiang University

A Deep Learning Framework for Authorship Classification of Paintings

Unsupervised Auxiliary Visual Words Discovery for Large-Scale Image Object Retrieval

Lecture 24: Image Retrieval: Part II. Visual Computing Systems CMU , Fall 2013

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22

Re-Ranking of Web Image Search Using Relevance Preserving Ranking Techniques

A Few Things to Know about Machine Learning for Web Search

Hyperspectral Image Classification by Using Pixel Spatial Correlation

Separating Objects and Clutter in Indoor Scenes

SkyFinder: Attribute-based Sky Image Search

Track: Rich Media / Session: Tagging and Clustering. Tag Ranking. Xian-Sheng Hua

SEARCHING pictures on smart phones, PCs, and the

Web Personalization & Recommender Systems

MICC-UNIFI at ImageCLEF 2013 Scalable Concept Image Annotation

CPSC 340: Machine Learning and Data Mining. Multi-Dimensional Scaling Fall 2017

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Bing Liu. Web Data Mining. Exploring Hyperlinks, Contents, and Usage Data. With 177 Figures. Springer

AT the beginning of the 21st century, the world wide

A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2

Automatic Image Annotation and Retrieval Using Hybrid Approach

ECS289: Scalable Machine Learning

Image Classification pipeline. Lecture 2-1

Harvesting collective Images for Bi-Concept exploration

Voronoi Region. K-means method for Signal Compression: Vector Quantization. Compression Formula 11/20/2013

Query-Sensitive Similarity Measure for Content-Based Image Retrieval

PASCAL VOC Classification: Local Features vs. Deep Features. Shuicheng YAN, NUS

Semantics-based Image Retrieval by Region Saliency

Visual Recognition and Search April 18, 2008 Joo Hyun Kim

Semantic Image Clustering Using Object Relation Network

Assistive Tagging: A Survey of Multimedia Tagging with Human-Computer Joint Exploration

Urban Scene Segmentation, Recognition and Remodeling. Part III. Jinglu Wang 11/24/2016 ACCV 2016 TUTORIAL

Web Personalization & Recommender Systems

A Family of Contextual Measures of Similarity between Distributions with Application to Image Retrieval

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al.

Transcription:

ACM MM 2010 Dong Liu, Shuicheng Yan, Yong Rui and Hong-Jiang Zhang Harbin Institute of Technology National University of Singapore Microsoft Corporation

Proliferation of images and videos on the Internet Sep. 2010: 5 billion 2000 images /minute Sep. 2010 : 120 million 20 hours uploaded/minute 2

Internet Image Search 3 rd Paradigm 1 st Paradigm Query by Example 2 nd Paradigm Pure Content Based Query by Surrounding Text Direct Text Based Query by Tag Semantic Based 1990 2000 2001 2002 Year 3

medici chapel, Firenze, Italy... Loggia dei lanzi, sword, honeymoon,... Statue, building, sky, Italy,... Cathedral, tower, Italy... 4

Tag Refinement Tag-to-Region Auto-Tagging Tag Ranking top 101 tour tiger sweet big cloud To discover the relationship between the tags and the underlying semantic regions in the images. dog house tree sky ground cloud alex speed leave dog 101 tree dog leave tree speed alex 101 D. Liu, X.-S. Hua and H.-J. Zhang. Content-Based Tag Processing for Internet Social Images: A Survey. Multimedia Tools and Applications. 5

How to solve various tag analysis tasks in a unified framework? Our Strategy Tag Refinem ent Tag-to- Region Auto- Tagging Content-Based Tag Analysis Perform tag analysis at the granularity of image regions Propose a new concept of multi-edge graph to model the parallel semantic relationships between the images. propagate the tags from images to regions. 6

(v, f) vertex 1 vertex 2 y 1 (v, f) y 2 Multi-Edge Graph (v, f) (v, f) (v, f) vertex 3 y 3 (v, f) vertex t y t (v, f) (v, f) vertex k y k A Core Equation one edge all edges between vertex i and j vertex n y n labeling information of vertex i with respect to tag c probability that edge e t is labeled as positive with tag c 7

Step 1: Bag-of-Regions Representation Segmentation 1 Input Image Bag-of-Regions Segmentation 2 8

Step 2: Multi-Edge Graph Construction Given two images with bag-of-regions representation: Edge construction : mutual k-nearest Neighbor Edge affinity calculation reliable edge connection reliable similarity measure 9

two images with the same tag dog, flower dog, bird at least one edge connecting the two regions corresponding to the tag 10

Notations 11

Model the cross-level tag propagation Loss function Regularization Objective Function Solving F directly is of great computational challenge, we turn to the alternative optimization strategy 12

Optimize sub-problems with cutting plane At each iteration, solving only a subset of tag confidence vector between vertex i and vertex j : The yielded a sub-optimization problem : Since Max function is non-smooth, we solve it with the cutting plane method. 13

f 1 f 2 f 3 f1 f2 f3 dog 0.1 0.2 0 cat 0.1 0.1 0.2 apple 0.4 0.2 0.4 flower 0.3 0.3 0.2 tree 0.1 0.2 0.1 apple flower apple Majority Voting: apple (2 times) > flower (1 time) apple By doing so, a series of tag analysis tasks can be performed in a coherent way. 14

The cutting plane iteration will terminate in a constant number of steps. The optimization objective is convex, resulting in a globally optimal solution. 15

In term of pixel-level accuracy. MSRC-100 and Corel-350 datasets.(benchmarks for tag-to-region assignment task) Comparison with knn-1 (k=49), knn-2 (k=99) and Bi-layer sparse coding [1]. Dataset knn-1 knn-2 Bi-layer [1] M-E Graph MSRC-350 0.45 0.37 0.63 0.73 COREL-100 0.52 0.44 0.61 0.67 [1] Liu, Cheng, Yan and Chua. Label to region by bi-layer sparsity priors. MM 2009. 16

17

In terms of Average F-Score On the NUS-WIDE-SUB datasets with 18, 325 Flickr images Comparison with Baseline (initial user provided tags ), CBAR [1] and TRVSC [2]. Method Baseline CBAR [1] TRVSC [2] M-E Graph Precision 0.47 0.50 0.52 0.54 Recall 0.49 0.52 0.53 0.57 F-Score 0.44 0.47 0.49 0.53 [1] C. Wang, L. Zhang and H.-J. Zhang. Content-based Image Annotation Refinement. CVPR 2007. [2] D. Liu, X.-S. Hua and H.-J. Zhang. Retagging Social Images based on Visual and Semantic Consistency. WWW 2010. 18

In terms of Average Per-tag Precision and Recall. MSRC, COREL and NUS-WIDE-SUB datasets. Comparison with the state-of-the-art multi-label auto-tagging methods. 19

Unified Tag Analysis with Multi-Edge Graph Perform tag analysis at the granularity of image regions Model the parallel semantic relationship between the images Realize cross-level tag propagation 20

Scalability Large-scale testing Correlative cross-level tag propagation Semantic correlation among the tags More applications User behavior analysis in social network Knowledge mining from rich information cues of multimedia document 21