Evaluation of GIST descriptors for web scale image search

Size: px

Start display at page:

Download "Evaluation of GIST descriptors for web scale image search"

Ambrose Golden
5 years ago
Views:

1 Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for web-scale image search 1/18

Query image Same-object recognition: bag-of-features sparse frequency vector region extraction + SIFT descriptors quantization BOF scales up to ~10 M images

fr) Web scale = size of Flickr, Picasa, or Facebook At least 100 M images per machine New constraints: smaller descriptor, more efficient index use a global

2 Query image Same-object recognition: bag-of-features sparse frequency vector region extraction + SIFT descriptors quantization BOF scales up to ~10 M images (see Web scale = size of Flickr, Picasa, or Facebook At least 100 M images per machine New constraints: smaller descriptor, more efficient index use a global descriptor! database of sparse vectors (inverted file) Geometric verification distance computation ranked image short-list Video Google: A text retrieval approach to object matching in videos. J. Sivic and A. Zisserman, ICCV 2003 July 9, 2009 Evaluation of GIST for web-scale image search 2/18

3 Overview GIST compared with BOF Same-object recognition Copy detection GIST at web scale Indexing scheme Results July 9, 2009 Evaluation of GIST for web-scale image search 3/18

Modeling the shape of the scene: a holistic representation of the spatial envelope, A. Olivia & A. Torralba, IJCV, 2001 GIST Designed from perceptual experiments: properties like naturalness (vs.

4 Modeling the shape of the scene: a holistic representation of the spatial envelope, A. Olivia & A. Torralba, IJCV, 2001 GIST Designed from perceptual experiments: properties like naturalness (vs. manmade), roughness, openness,... Histogram of orientations R G B 32x32 patch descriptor Image pyramid Invariant to luminance transformations, blur, resize, etc. Not invariant to translation, rotation, occlusion, crop, etc. L2 distance to compare, NN-search July 9, 2009 Evaluation of GIST for web-scale image search 4/18

5 Evaluation for same-scene/object recognition Test on the INRIA Holidays dataset 500 groups of images of the same scene Hide them in up to 1 M distractor images from Flickr Queries = 1 image per group Measure = map = mean area under precision-recall curve July 9, 2009 Evaluation of GIST for web-scale image search 5/18

6 Results on Holidays 1 BOF+HE GIST map k 100k database size 1M July 9, 2009 Evaluation of GIST for web-scale image search 6/18

7 Copy detection scenario Simpler sub-problem: recognize transformed pictures Useful for copy detection Our test dataset: Copydays 156 base images merged with 1 M distractors Attacks JPEG compression Crop Queries = attacked images Performance measure is map July 9, 2009 Evaluation of GIST for web-scale image search 7/18

Results, JPEG compression 1 0.8 0.6 map 0.4 0.

8 Results, JPEG compression map BOF+HE GIST JPEG quality factor July 9, 2009 Evaluation of GIST for web-scale image search 8/18

Results, crop Remove (random) margin from image 1

2 0 BOF+HE GIST 0 10 20 30 40 50 60 70 80 %

9 Results, crop Remove (random) margin from image % surface map BOF+HE GIST % surface removed July 9, 2009 Evaluation of GIST for web-scale image search 9/18

10 Overview GIST compared with BOF Same-object recognition Copy detection GIST at web scale Indexing scheme Results July 9, 2009 Evaluation of GIST for web-scale image search 10/18

11 GIST indexing structure (GISTIS) Raw GIST for 100 M images: 384 GB : too big! quantize Nearest-neighbor quantizer, centroids (learnt on independent dataset) Index = Inverted file Returns 1% of dataset: too many images! July 9, 2009 Evaluation of GIST for web-scale image search 11/18

12 Binary signatures & Hamming Embedding Quantization index is too coarse: need information about the point's position inside the quantization cell Add a binary signature to the descriptor's quantization index Set of orthogonal hyperplanes Bit i = on which side of hyperplane i is the descriptor? 512 hyperplanes 512 bits Signatures stored in inverted file, compared with Hamming distance Threshold: filter 94% images Output ordered by Hamming distance Hamming embedding and weak geometric consistency for large scale image search, H. Jégou, M. Douze, C. Schmid, ECCV 2008 July 9, 2009 Evaluation of GIST for web-scale image search 12/18

13 Evaluation on Holidays BOF+HE GIST GISTIS GISTIS+L2 GISTIS+L2+SV map k 100k 1M 10M 100M 1G database size Thanks to A. Torralba for 80 M distractor pictures (32*32 pixels)... July 9, 2009 Evaluation of GIST for web-scale image search 13/18

14 GISTIS results on Copydays+110 M images July 9, 2009 Evaluation of GIST for web-scale image search 14/18

15 Timings & memory usage Method Dataset size RAM usage Search time BOF + HE 1 M 24 GB 1.8 s GIST 1 M 3.8 GB 1.3 s GISTIS 1 M 68 MB 38 ms GISTIS 100 M 6.8 GB 0.18 s GISTIS+L2 100 M 6.8 GB 2.1 s July 9, 2009 Evaluation of GIST for web-scale image search 15/18

16 Conclusion GIST recognizes: Image copies with small geometrical changes Scenes with small occlusions, same overall color... GISTIS (quantized GIST + Hamming Embedding) Fast and compact Results close to full GIST descriptor Current work: compact BOF representation Web-scale Handles geometrical changes Better results! (see our ICCV paper) Thank you See our demo tomorrow! July 9, 2009 Evaluation of GIST for web-scale image search 16/18

Compressed local descriptors for fast image and video search in large databases

Compressed local descriptors for fast image and video search in large databases Matthijs Douze2 joint work with Hervé Jégou1, Cordelia Schmid2 and Patrick Pérez3 1: INRIA Rennes, TEXMEX team, France 2: