Building and Using a Semantivisual Image Hierarchy

Size: px

Start display at page:

Download "Building and Using a Semantivisual Image Hierarchy"

Jeffery Parker
5 years ago
Views:

1 Building and Using a Semantivisual Image Hierarchy Li-Jia Li, Chong Wang, Yongwhan Lim, David M. Blei, Li Fei-Fei Computer Science Department, Stanford University Computer Science Department, Princeton University Presented by Lingbo Li CVPR, 2010

2 Outline Introduction Building the Semantivisual Image Hierarchy A Hierarchical Model for both Image and Text Learning the Semantivisual Image Hierarchy A Semantivisual Image Hierarchy Using the Semantivisual Image Hierarchy Hierarchical Annotation of Images Image Labeling Image Classification Conclusion

3 Introduction two types of hierarchies: 1)language-based hierarchy: no important visual information that connects images together 2)low-level visual feature based hierarchy: no quantitative evaluation Motivation Construct a semantivisual hierarchy built upon both semantic and visual information related to images Automatically organize images in a general-to-specific relationship Hierarchy is quantitatively evaluated by both human objects and end tasks (image classification and annotation)

4 A Hierarchical Model for both Image and Text Each image is decomposed into a set of over-segmented regions R = [R 1,..., R r,..., R N ] Each of the N regions is characterized by four features: color, texture, location and quantized SIFT histogram of the small patches within each region Each image is associated with its tags W = [W 1,..., W ω,..., W M ] Each image is associated with a path of the hierarchy Image regions can be assigned to different nodes of the path

5 Introduction Schematic illustration

6 A Hierarchical Model for both Image and Text Graphical model and the notations of the variables Joint likelihood

7 Learning the Semantivisual Image Hierarchy Goal of learning: the concept index Z, the coupling variable S and the path C Sampling concept index Z The conditional distribution of a concept index of a particular region depends on 1)the likelihood of the region appearance, 2) the likelihood of tags associated with this region and 3) the concept indices of the other regions in the same image-text pair p(z d,r = l ) p(z d,r = l Z r d, α) 4 j=1 p(r d,r,j R dr, C d, Z, φ j ) p({w d,ω : ω S r } W dω:ω Sr, C d, Z d,r, λ) = n dr +φ 4 C d,l,j,r j d,r,j j=1 n dr C d,l,j,. +V j φ j ω S r n dω C d,l,w d,ω +λ n dω C d,l,. +Uλ n r d,l +α n r d,. +Lα

8 Learning the Semantivisual Image Hierarchy Sampling coupling variable S The conditional distribution depends on the likelihood of tag The path assignment is fixed at this step p(s d,ω = r ) p(w d,ω S d,ω = r, S dω, W dω, Z d, C d, λ) = n dω C d,z d,r,w d,ω +λ n dω C d,z d,r,. +Uλ

9 Learning the Semantivisual Image Hierarchy Sampling path C The path assignment of a new image-text pair depends on the previous arrangement of the hierarchy and the likelihood of the image-text pair

10 A Semantivisual Image Hierarchy Implementation 4000 images and 538 tags across 40 image class from Flickr Each image is divided into small patches of pixels, and a collection of over-segmented regions Each patch is assigned to one codeword in a codebook of 500 visual words obtained by K-means Obtain 4 region codebooks for color, location, texture and normalized SIFT histogram Initial the levels in a path according to ti-idf scores obtain a hierarchy of 121 nodes, 4 levels and 53 paths

11 A Semantivisual Image Hierarchy

12 A quantitative evaluation of image hierarchies Evaluation of the hierarchy

13 Using the Semantivisual Image Hierarchy Hierarchical Annotation of Images For an unannotated image I with posterior path assignments samples S C = {C 1 I, C2 I,..., C S C I }, the probability of tag W for level l: S C p(w I, level = l) (1/ S C ) p(w ˆφ i, CI i (l)) i=1

14 Using the Semantivisual Image Hierarchy Image Labeling For a test image I and its posterior samples S C = {C 1 I, C2 I,..., C S C I } and S Z = {C 1 I, C2 I,..., C S Z I }, the probability of tag W given the image I: S C p(w I) (1/ S C ) i=1 l=1 L p(w ˆφ i, CI i (l))p(l Zi I )

15 Using the Semantivisual Image Hierarchy Image Classification

16 Conclusion Construct the semantivisual hierarchy by using both images and their tags Quantitatively evaluate the quality of the hierarchy by human subjects Perform different applications

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013

Bag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013 CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition