Evaluation of GIST descriptors for web scale image search

Size: px
Start display at page:

Download "Evaluation of GIST descriptors for web scale image search"

Transcription

1 Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for web-scale image search 1/18

2 Query image Same-object recognition: bag-of-features sparse frequency vector region extraction + SIFT descriptors quantization BOF scales up to ~10 M images (see Web scale = size of Flickr, Picasa, or Facebook At least 100 M images per machine New constraints: smaller descriptor, more efficient index use a global descriptor! database of sparse vectors (inverted file) Geometric verification distance computation ranked image short-list Video Google: A text retrieval approach to object matching in videos. J. Sivic and A. Zisserman, ICCV 2003 July 9, 2009 Evaluation of GIST for web-scale image search 2/18

3 Overview GIST compared with BOF Same-object recognition Copy detection GIST at web scale Indexing scheme Results July 9, 2009 Evaluation of GIST for web-scale image search 3/18

4 Modeling the shape of the scene: a holistic representation of the spatial envelope, A. Olivia & A. Torralba, IJCV, 2001 GIST Designed from perceptual experiments: properties like naturalness (vs. manmade), roughness, openness,... Histogram of orientations R G B 32x32 patch descriptor Image pyramid Invariant to luminance transformations, blur, resize, etc. Not invariant to translation, rotation, occlusion, crop, etc. L2 distance to compare, NN-search July 9, 2009 Evaluation of GIST for web-scale image search 4/18

5 Evaluation for same-scene/object recognition Test on the INRIA Holidays dataset 500 groups of images of the same scene Hide them in up to 1 M distractor images from Flickr Queries = 1 image per group Measure = map = mean area under precision-recall curve July 9, 2009 Evaluation of GIST for web-scale image search 5/18

6 Results on Holidays 1 BOF+HE GIST map k 100k database size 1M July 9, 2009 Evaluation of GIST for web-scale image search 6/18

7 Copy detection scenario Simpler sub-problem: recognize transformed pictures Useful for copy detection Our test dataset: Copydays 156 base images merged with 1 M distractors Attacks JPEG compression Crop Queries = attacked images Performance measure is map July 9, 2009 Evaluation of GIST for web-scale image search 7/18

8 Results, JPEG compression map BOF+HE GIST JPEG quality factor July 9, 2009 Evaluation of GIST for web-scale image search 8/18

9 Results, crop Remove (random) margin from image % surface map BOF+HE GIST % surface removed July 9, 2009 Evaluation of GIST for web-scale image search 9/18

10 Overview GIST compared with BOF Same-object recognition Copy detection GIST at web scale Indexing scheme Results July 9, 2009 Evaluation of GIST for web-scale image search 10/18

11 GIST indexing structure (GISTIS) Raw GIST for 100 M images: 384 GB : too big! quantize Nearest-neighbor quantizer, centroids (learnt on independent dataset) Index = Inverted file Returns 1% of dataset: too many images! July 9, 2009 Evaluation of GIST for web-scale image search 11/18

12 Binary signatures & Hamming Embedding Quantization index is too coarse: need information about the point's position inside the quantization cell Add a binary signature to the descriptor's quantization index Set of orthogonal hyperplanes Bit i = on which side of hyperplane i is the descriptor? 512 hyperplanes 512 bits Signatures stored in inverted file, compared with Hamming distance Threshold: filter 94% images Output ordered by Hamming distance Hamming embedding and weak geometric consistency for large scale image search, H. Jégou, M. Douze, C. Schmid, ECCV 2008 July 9, 2009 Evaluation of GIST for web-scale image search 12/18

13 Evaluation on Holidays BOF+HE GIST GISTIS GISTIS+L2 GISTIS+L2+SV map k 100k 1M 10M 100M 1G database size Thanks to A. Torralba for 80 M distractor pictures (32*32 pixels)... July 9, 2009 Evaluation of GIST for web-scale image search 13/18

14 GISTIS results on Copydays+110 M images July 9, 2009 Evaluation of GIST for web-scale image search 14/18

15 Timings & memory usage Method Dataset size RAM usage Search time BOF + HE 1 M 24 GB 1.8 s GIST 1 M 3.8 GB 1.3 s GISTIS 1 M 68 MB 38 ms GISTIS 100 M 6.8 GB 0.18 s GISTIS+L2 100 M 6.8 GB 2.1 s July 9, 2009 Evaluation of GIST for web-scale image search 15/18

16 Conclusion GIST recognizes: Image copies with small geometrical changes Scenes with small occlusions, same overall color... GISTIS (quantized GIST + Hamming Embedding) Fast and compact Results close to full GIST descriptor Current work: compact BOF representation Web-scale Handles geometrical changes Better results! (see our ICCV paper) Thank you See our demo tomorrow! July 9, 2009 Evaluation of GIST for web-scale image search 16/18

Compressed local descriptors for fast image and video search in large databases

Compressed local descriptors for fast image and video search in large databases Compressed local descriptors for fast image and video search in large databases Matthijs Douze2 joint work with Hervé Jégou1, Cordelia Schmid2 and Patrick Pérez3 1: INRIA Rennes, TEXMEX team, France 2:

More information

Large scale object/scene recognition

Large scale object/scene recognition Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database

More information

Evaluation of GIST descriptors for web-scale image search

Evaluation of GIST descriptors for web-scale image search Evaluation of GIST descriptors for web-scale image search Matthijs Douze, Hervé Jégou, Sandhawalia Harsimrat, Laurent Amsaleg, Cordelia Schmid To cite this version: Matthijs Douze, Hervé Jégou, Sandhawalia

More information

Hamming embedding and weak geometric consistency for large scale image search

Hamming embedding and weak geometric consistency for large scale image search Hamming embedding and weak geometric consistency for large scale image search Herve Jegou, Matthijs Douze, and Cordelia Schmid INRIA Grenoble, LEAR, LJK firstname.lastname@inria.fr Abstract. This paper

More information

Searching in one billion vectors: re-rank with source coding

Searching in one billion vectors: re-rank with source coding Searching in one billion vectors: re-rank with source coding Hervé Jégou INRIA / IRISA Romain Tavenard Univ. Rennes / IRISA Laurent Amsaleg CNRS / IRISA Matthijs Douze INRIA / LJK ICASSP May 2011 LARGE

More information

Large-scale visual recognition The bag-of-words representation

Large-scale visual recognition The bag-of-words representation Large-scale visual recognition The bag-of-words representation Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline Bag-of-words Large or small vocabularies? Extensions for instance-level

More information

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,

More information

INRIA-LEAR S VIDEO COPY DETECTION SYSTEM. INRIA LEAR, LJK Grenoble, France 1 Copyright detection task 2 High-level feature extraction task

INRIA-LEAR S VIDEO COPY DETECTION SYSTEM. INRIA LEAR, LJK Grenoble, France 1 Copyright detection task 2 High-level feature extraction task INRIA-LEAR S VIDEO COPY DETECTION SYSTEM Matthijs Douze, Adrien Gaidon 2, Herve Jegou, Marcin Marszałek 2 and Cordelia Schmid,2 INRIA LEAR, LJK Grenoble, France Copyright detection task 2 High-level feature

More information

Efficient Representation of Local Geometry for Large Scale Object Retrieval

Efficient Representation of Local Geometry for Large Scale Object Retrieval Efficient Representation of Local Geometry for Large Scale Object Retrieval Michal Perďoch Ondřej Chum and Jiří Matas Center for Machine Perception Czech Technical University in Prague IEEE Computer Society

More information

Mixtures of Gaussians and Advanced Feature Encoding

Mixtures of Gaussians and Advanced Feature Encoding Mixtures of Gaussians and Advanced Feature Encoding Computer Vision Ali Borji UWM Many slides from James Hayes, Derek Hoiem, Florent Perronnin, and Hervé Why do good recognition systems go bad? E.g. Why

More information

On the burstiness of visual elements

On the burstiness of visual elements On the burstiness of visual elements Herve Je gou Matthijs Douze INRIA Grenoble, LJK Cordelia Schmid firstname.lastname@inria.fr Figure. Illustration of burstiness. Features assigned to the most bursty

More information

Evaluation and comparison of interest points/regions

Evaluation and comparison of interest points/regions Introduction Evaluation and comparison of interest points/regions Quantitative evaluation of interest point/region detectors points / regions at the same relative location and area Repeatability rate :

More information

Large-scale visual recognition Efficient matching

Large-scale visual recognition Efficient matching Large-scale visual recognition Efficient matching Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline!! Preliminary!! Locality Sensitive Hashing: the two modes!! Hashing!! Embedding!!

More information

Large Scale Image Retrieval

Large Scale Image Retrieval Large Scale Image Retrieval Ondřej Chum and Jiří Matas Center for Machine Perception Czech Technical University in Prague Features Affine invariant features Efficient descriptors Corresponding regions

More information

An image-based approach to video copy detection with spatio-temporal post-filtering

An image-based approach to video copy detection with spatio-temporal post-filtering An image-based approach to video copy detection with spatio-temporal post-filtering Matthijs Douze, Hervé Jégou and Cordelia Schmid Abstract This paper introduces a video copy detection system which efficiently

More information

Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention

Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention Anand K. Hase, Baisa L. Gunjal Abstract In the real world applications such as landmark search, copy protection, fake image

More information

Instance-level recognition II.

Instance-level recognition II. Reconnaissance d objets et vision artificielle 2010 Instance-level recognition II. Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique, Ecole Normale

More information

Learning a Fine Vocabulary

Learning a Fine Vocabulary Learning a Fine Vocabulary Andrej Mikulík, Michal Perdoch, Ondřej Chum, and Jiří Matas CMP, Dept. of Cybernetics, Faculty of EE, Czech Technical University in Prague Abstract. We present a novel similarity

More information

IMPROVING VLAD: HIERARCHICAL CODING AND A REFINED LOCAL COORDINATE SYSTEM. Christian Eggert, Stefan Romberg, Rainer Lienhart

IMPROVING VLAD: HIERARCHICAL CODING AND A REFINED LOCAL COORDINATE SYSTEM. Christian Eggert, Stefan Romberg, Rainer Lienhart IMPROVING VLAD: HIERARCHICAL CODING AND A REFINED LOCAL COORDINATE SYSTEM Christian Eggert, Stefan Romberg, Rainer Lienhart Multimedia Computing and Computer Vision Lab University of Augsburg ABSTRACT

More information

Local Image Features

Local Image Features Local Image Features Ali Borji UWM Many slides from James Hayes, Derek Hoiem and Grauman&Leibe 2008 AAAI Tutorial Overview of Keypoint Matching 1. Find a set of distinctive key- points A 1 A 2 A 3 B 3

More information

Supplementary Material for Ensemble Diffusion for Retrieval

Supplementary Material for Ensemble Diffusion for Retrieval Supplementary Material for Ensemble Diffusion for Retrieval Song Bai 1, Zhichao Zhou 1, Jingdong Wang, Xiang Bai 1, Longin Jan Latecki 3, Qi Tian 4 1 Huazhong University of Science and Technology, Microsoft

More information

Instance-level recognition part 2

Instance-level recognition part 2 Visual Recognition and Machine Learning Summer School Paris 2011 Instance-level recognition part 2 Josef Sivic http://www.di.ens.fr/~josef INRIA, WILLOW, ENS/INRIA/CNRS UMR 8548 Laboratoire d Informatique,

More information

Bag-of-colors for improved image search

Bag-of-colors for improved image search Bag-of-colors for improved image search Christian Wengert, Matthijs Douze, Hervé Jégou To cite this version: Christian Wengert, Matthijs Douze, Hervé Jégou. Bag-of-colors for improved image search. MM

More information

Efficient Representation of Local Geometry for Large Scale Object Retrieval

Efficient Representation of Local Geometry for Large Scale Object Retrieval Efficient Representation of Local Geometry for Large Scale Object Retrieval Michal Perd och, Ondřej Chum and Jiří Matas Center for Machine Perception, Department of Cybernetics Faculty of Electrical Engineering,

More information

Binary SIFT: Towards Efficient Feature Matching Verification for Image Search

Binary SIFT: Towards Efficient Feature Matching Verification for Image Search Binary SIFT: Towards Efficient Feature Matching Verification for Image Search Wengang Zhou 1, Houqiang Li 2, Meng Wang 3, Yijuan Lu 4, Qi Tian 1 Dept. of Computer Science, University of Texas at San Antonio

More information

By Suren Manvelyan,

By Suren Manvelyan, By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan,

More information

Combining attributes and Fisher vectors for efficient image retrieval

Combining attributes and Fisher vectors for efficient image retrieval Author manuscript, published in "IEEE Conference on Computer Vision & Pattern Recognition (2011)" Combining attributes and Fisher vectors for efficient image retrieval Matthijs Douze INRIA, France matthijs.douze@inria.fr

More information

Query Adaptive Similarity for Large Scale Object Retrieval

Query Adaptive Similarity for Large Scale Object Retrieval Query Adaptive Similarity for Large Scale Object Retrieval Danfeng Qin Christian Wengert Luc van Gool ETH Zürich, Switzerland {qind,wengert,vangool}@vision.ee.ethz.ch Abstract Many recent object retrieval

More information

Oriented pooling for dense and non-dense rotation-invariant features

Oriented pooling for dense and non-dense rotation-invariant features ZHAO, JÉGOU, GRAVIER: ORIENTED POOLING 1 Oriented pooling for dense and non-dense rotation-invariant features Wan-Lei Zhao 1 http://www.cs.cityu.edu.hk/~wzhao2 Hervé Jégou 1 http://people.rennes.inria.fr/herve.jegou

More information

Bundling Features for Large Scale Partial-Duplicate Web Image Search

Bundling Features for Large Scale Partial-Duplicate Web Image Search Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu, Qifa Ke, Michael Isard, and Jian Sun Microsoft Research Abstract In state-of-the-art image retrieval systems, an image is

More information

Product quantization for nearest neighbor search

Product quantization for nearest neighbor search Product quantization for nearest neighbor search Hervé Jégou, Matthijs Douze, Cordelia Schmid Abstract This paper introduces a product quantization based approach for approximate nearest neighbor search.

More information

Instance-level recognition

Instance-level recognition Instance-level recognition 1) Local invariant features 2) Matching and recognition with local features 3) Efficient visual search 4) Very large scale indexing Matching of descriptors Matching and 3D reconstruction

More information

Previous Lecture - Coded aperture photography

Previous Lecture - Coded aperture photography Previous Lecture - Coded aperture photography Depth from a single image based on the amount of blur Estimate the amount of blur using and recover a sharp image by deconvolution with a sparse gradient prior.

More information

Beyond bags of Features

Beyond bags of Features Beyond bags of Features Spatial Pyramid Matching for Recognizing Natural Scene Categories Camille Schreck, Romain Vavassori Ensimag December 14, 2012 Schreck, Vavassori (Ensimag) Beyond bags of Features

More information

Video Google faces. Josef Sivic, Mark Everingham, Andrew Zisserman. Visual Geometry Group University of Oxford

Video Google faces. Josef Sivic, Mark Everingham, Andrew Zisserman. Visual Geometry Group University of Oxford Video Google faces Josef Sivic, Mark Everingham, Andrew Zisserman Visual Geometry Group University of Oxford The objective Retrieve all shots in a video, e.g. a feature length film, containing a particular

More information

Bag-of-features. Cordelia Schmid

Bag-of-features. Cordelia Schmid Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image

More information

Instance-level recognition

Instance-level recognition Instance-level recognition 1) Local invariant features 2) Matching and recognition with local features 3) Efficient visual search 4) Very large scale indexing Matching of descriptors Matching and 3D reconstruction

More information

Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening

Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening Negative evidences and co-occurrences in image retrieval: the benefit of PCA and whitening Hervé Jégou, Ondrej Chum To cite this version: Hervé Jégou, Ondrej Chum. Negative evidences and co-occurrences

More information

Local features: detection and description. Local invariant features

Local features: detection and description. Local invariant features Local features: detection and description Local invariant features Detection of interest points Harris corner detection Scale invariant blob detection: LoG Description of local patches SIFT : Histograms

More information

Hamming Embedding and Weak Geometry Consistency for Large Scale Image Search - extended version

Hamming Embedding and Weak Geometry Consistency for Large Scale Image Search - extended version Hamming Embedding and Weak Geometry Consistency for Large Scale Image Search - extended version Hervé Jégou, Matthijs Douze, Cordelia Schmid To cite this version: Hervé Jégou, Matthijs Douze, Cordelia

More information

Video Google: A Text Retrieval Approach to Object Matching in Videos

Video Google: A Text Retrieval Approach to Object Matching in Videos Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic, Frederik Schaffalitzky, Andrew Zisserman Visual Geometry Group University of Oxford The vision Enable video, e.g. a feature

More information

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011 Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition

More information

Improving bag-of-features for large scale image search

Improving bag-of-features for large scale image search Improving bag-of-features for large scale image search Hervé Jégou, Matthijs Douze, Cordelia Schmid To cite this version: Hervé Jégou, Matthijs Douze, Cordelia Schmid. Improving bag-of-features for large

More information

Lecture 12 Recognition

Lecture 12 Recognition Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza http://rpg.ifi.uzh.ch/ 1 Lab exercise today replaced by Deep Learning Tutorial by Antonio Loquercio Room

More information

Learning Vocabularies over a Fine Quantization

Learning Vocabularies over a Fine Quantization International Journal of Computer Vision manuscript No. (will be inserted by the editor) Learning Vocabularies over a Fine Quantization Andrej Mikulik Michal Perdoch Ondřej Chum Jiří Matas Received: date

More information

Patch Descriptors. CSE 455 Linda Shapiro

Patch Descriptors. CSE 455 Linda Shapiro Patch Descriptors CSE 455 Linda Shapiro How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

Efficient Large-scale Image Search With a Vocabulary Tree. PREPRINT December 19, 2017

Efficient Large-scale Image Search With a Vocabulary Tree. PREPRINT December 19, 2017 2014/04/29 v0.4.3 IPOL article class Published in Image Processing On Line on YYYY MM DD. Submitted on YYYY MM DD, accepted on YYYY MM DD. ISSN 2105 1232 c YYYY IPOL & the authors CC BY NC SA This article

More information

Local Features and Bag of Words Models

Local Features and Bag of Words Models 10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering

More information

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Beyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets

More information

Packing and Padding: Coupled Multi-index for Accurate Image Retrieval

Packing and Padding: Coupled Multi-index for Accurate Image Retrieval Packing and Padding: Coupled Multi-index for Accurate Image Retrieval Liang Zheng 1, Shengjin Wang 1, Ziqiong Liu 1, and Qi Tian 2 1 State Key Laboratory of Intelligent Technology and Systems; 1 Tsinghua

More information

Aggregating local image descriptors into compact codes

Aggregating local image descriptors into compact codes Aggregating local image descriptors into compact codes Hervé Jégou, Florent Perronnin, Matthijs Douze, Jorge Sánchez, Patrick Pérez, Cordelia Schmid To cite this version: Hervé Jégou, Florent Perronnin,

More information

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Learning Visual Semantics: Models, Massive Computation, and Innovative Applications Part II: Visual Features and Representations Liangliang Cao, IBM Watson Research Center Evolvement of Visual Features

More information

CLASSIFICATION Experiments

CLASSIFICATION Experiments CLASSIFICATION Experiments January 27,2015 CS3710: Visual Recognition Bhavin Modi Bag of features Object Bag of words 1. Extract features 2. Learn visual vocabulary Bag of features: outline 3. Quantize

More information

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space.

Visual words. Map high-dimensional descriptors to tokens/words by quantizing the feature space. Visual words Map high-dimensional descriptors to tokens/words by quantizing the feature space. Quantize via clustering; cluster centers are the visual words Word #2 Descriptor feature space Assign word

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

Three things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012)

Three things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012) Three things everyone should know to improve object retrieval Relja Arandjelović and Andrew Zisserman (CVPR 2012) University of Oxford 2 nd April 2012 Large scale object retrieval Find all instances of

More information

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability Midterm Wed. Local features: detection and description Monday March 7 Prof. UT Austin Covers material up until 3/1 Solutions to practice eam handed out today Bring a 8.5 11 sheet of notes if you want Review

More information

Patch Descriptors. EE/CSE 576 Linda Shapiro

Patch Descriptors. EE/CSE 576 Linda Shapiro Patch Descriptors EE/CSE 576 Linda Shapiro 1 How can we find corresponding points? How can we find correspondences? How do we describe an image patch? How do we describe an image patch? Patches with similar

More information

arxiv: v1 [cs.cv] 15 Mar 2014

arxiv: v1 [cs.cv] 15 Mar 2014 Geometric VLAD for Large Scale Image Search Zixuan Wang, Wei Di, Anurag Bhardwaj, Vignesh Jagadeesh, Robinson Piramuthu Dept.of Electrical Engineering, Stanford University, CA 94305 ebay Research Labs,

More information

EVENT RETRIEVAL USING MOTION BARCODES. Gil Ben-Artzi Michael Werman Shmuel Peleg

EVENT RETRIEVAL USING MOTION BARCODES. Gil Ben-Artzi Michael Werman Shmuel Peleg EVENT RETRIEVAL USING MOTION BARCODES Gil Ben-Artzi Michael Werman Shmuel Peleg School of Computer Science and Engineering The Hebrew University of Jerusalem, Israel ABSTRACT We introduce a simple and

More information

Local features: detection and description May 12 th, 2015

Local features: detection and description May 12 th, 2015 Local features: detection and description May 12 th, 2015 Yong Jae Lee UC Davis Announcements PS1 grades up on SmartSite PS1 stats: Mean: 83.26 Standard Dev: 28.51 PS2 deadline extended to Saturday, 11:59

More information

Pattern Spotting in Historical Document Image

Pattern Spotting in Historical Document Image Pattern Spotting in historical document images Sovann EN, Caroline Petitjean, Stéphane Nicolas, Frédéric Jurie, Laurent Heutte LITIS, University of Rouen, France 1 Outline Introduction Commons Pipeline

More information

Local features and image matching. Prof. Xin Yang HUST

Local features and image matching. Prof. Xin Yang HUST Local features and image matching Prof. Xin Yang HUST Last time RANSAC for robust geometric transformation estimation Translation, Affine, Homography Image warping Given a 2D transformation T and a source

More information

Local Configuration of SIFT-like Features by a Shape Context

Local Configuration of SIFT-like Features by a Shape Context Local Configuration of SIFT-like Features by a Shape Context Martin KLINKIGT and Koichi KISE Graduate School of Engineering, Osaka Prefecture University 1-1 Gakuen-cho, Naka, Sakai, Osaka, Japan E-mail:

More information

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley

WISE: Large Scale Content Based Web Image Search. Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley WISE: Large Scale Content Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1 A picture is worth a thousand words. Query by Images What leaf?

More information

Compact video description for copy detection with precise temporal alignment

Compact video description for copy detection with precise temporal alignment Compact video description for copy detection with precise temporal alignment Matthijs Douze 1, Hervé Jégou 2, Cordelia Schmid 1, and Patrick Pérez 3 1 INRIA Grenoble, France 2 INRIA Rennes, France 3 Technicolor

More information

arxiv: v1 [cs.cv] 11 Feb 2014

arxiv: v1 [cs.cv] 11 Feb 2014 Packing and Padding: Coupled Multi-index for Accurate Image Retrieval Liang Zheng 1, Shengjin Wang 1, Ziqiong Liu 1, and Qi Tian 2 1 Tsinghua University, Beijing, China 2 University of Texas at San Antonio,

More information

Neural Codes for Image Retrieval. David Stutz July 22, /48 1/48

Neural Codes for Image Retrieval. David Stutz July 22, /48 1/48 Neural Codes for Image Retrieval David Stutz July 22, 2015 David Stutz July 22, 2015 0/48 1/48 Table of Contents 1 Introduction 2 Image Retrieval Bag of Visual Words Vector of Locally Aggregated Descriptors

More information

Combining attributes and Fisher vectors for efficient image retrieval

Combining attributes and Fisher vectors for efficient image retrieval Combining attributes and Fisher vectors for efficient image retrieval Matthijs Douze, Arnau Ramisa, Cordelia Schmid To cite this version: Matthijs Douze, Arnau Ramisa, Cordelia Schmid. Combining attributes

More information

Image matching on a mobile device

Image matching on a mobile device Image matching on a mobile device Honours project Authors: Steve Nowee Nick de Wolf Eva van Weel Supervisor: Jan van Gemert Contents 1 Introduction 3 2 Theory 4 2.1 Bag of words...........................

More information

Reconstructing an image from its local descriptors

Reconstructing an image from its local descriptors Reconstructing an image from its local descriptors Philippe Weinzaepfel ENS Cachan Bretagne Hervé Jégou INRIA Patrick Pérez Technicolor Figure 1. Top: the original picture. Bottom: the image reconstructed

More information

Indexing local features and instance recognition May 16 th, 2017

Indexing local features and instance recognition May 16 th, 2017 Indexing local features and instance recognition May 16 th, 2017 Yong Jae Lee UC Davis Announcements PS2 due next Monday 11:59 am 2 Recap: Features and filters Transforming and describing images; textures,

More information

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped

More information

Beyond Bags of features Spatial information & Shape models

Beyond Bags of features Spatial information & Shape models Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features

More information

ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj

ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING. Wei Liu, Serkan Kiranyaz and Moncef Gabbouj Proceedings of the 5th International Symposium on Communications, Control and Signal Processing, ISCCSP 2012, Rome, Italy, 2-4 May 2012 ROBUST SCENE CLASSIFICATION BY GIST WITH ANGULAR RADIAL PARTITIONING

More information

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS 8th International DAAAM Baltic Conference "INDUSTRIAL ENGINEERING - 19-21 April 2012, Tallinn, Estonia LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS Shvarts, D. & Tamre, M. Abstract: The

More information

arxiv: v1 [cs.mm] 3 May 2016

arxiv: v1 [cs.mm] 3 May 2016 Bloom Filters and Compact Hash Codes for Efficient and Distributed Image Retrieval Andrea Salvi, Simone Ercoli, Marco Bertini and Alberto Del Bimbo Media Integration and Communication Center, Università

More information

Keypoint-based Recognition and Object Search

Keypoint-based Recognition and Object Search 03/08/11 Keypoint-based Recognition and Object Search Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem Notices I m having trouble connecting to the web server, so can t post lecture

More information

Mobile visual search. Research Online. University of Wollongong. Huiguang Sun University of Wollongong. Recommended Citation

Mobile visual search. Research Online. University of Wollongong. Huiguang Sun University of Wollongong. Recommended Citation University of Wollongong Research Online University of Wollongong Thesis Collection University of Wollongong Thesis Collections 2013 Mobile visual search Huiguang Sun University of Wollongong Recommended

More information

DeepIndex for Accurate and Efficient Image Retrieval

DeepIndex for Accurate and Efficient Image Retrieval DeepIndex for Accurate and Efficient Image Retrieval Yu Liu, Yanming Guo, Song Wu, Michael S. Lew Media Lab, Leiden Institute of Advance Computer Science Outline Motivation Proposed Approach Results Conclusions

More information

Recognizing Thousands of Legal Entities through Instance-based Visual Classification

Recognizing Thousands of Legal Entities through Instance-based Visual Classification Recognizing Thousands of Legal Entities through Instance-based Visual Classification Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez To cite this version: Valentin Leveau,

More information

Category-level localization

Category-level localization Category-level localization Cordelia Schmid Recognition Classification Object present/absent in an image Often presence of a significant amount of background clutter Localization / Detection Localize object

More information

Gist vocabularies in omnidirectional images for appearance based mapping and localization

Gist vocabularies in omnidirectional images for appearance based mapping and localization Gist vocabularies in omnidirectional images for appearance based mapping and localization A. C. Murillo, P. Campos, J. Kosecka and J. J. Guerrero DIIS-I3A, University of Zaragoza, Spain Dept. of Computer

More information

Geometric VLAD for Large Scale Image Search. Zixuan Wang 1, Wei Di 2, Anurag Bhardwaj 2, Vignesh Jagadesh 2, Robinson Piramuthu 2

Geometric VLAD for Large Scale Image Search. Zixuan Wang 1, Wei Di 2, Anurag Bhardwaj 2, Vignesh Jagadesh 2, Robinson Piramuthu 2 Geometric VLAD for Large Scale Image Search Zixuan Wang 1, Wei Di 2, Anurag Bhardwaj 2, Vignesh Jagadesh 2, Robinson Piramuthu 2 1 2 Our Goal 1) Robust to various imaging conditions 2) Small memory footprint

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

Indexing local features and instance recognition May 14 th, 2015

Indexing local features and instance recognition May 14 th, 2015 Indexing local features and instance recognition May 14 th, 2015 Yong Jae Lee UC Davis Announcements PS2 due Saturday 11:59 am 2 We can approximate the Laplacian with a difference of Gaussians; more efficient

More information

Perception IV: Place Recognition, Line Extraction

Perception IV: Place Recognition, Line Extraction Perception IV: Place Recognition, Line Extraction Davide Scaramuzza University of Zurich Margarita Chli, Paul Furgale, Marco Hutter, Roland Siegwart 1 Outline of Today s lecture Place recognition using

More information

Lecture 12 Recognition

Lecture 12 Recognition Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza 1 Lab exercise today replaced by Deep Learning Tutorial Room ETH HG E 1.1 from 13:15 to 15:00 Optional lab

More information

Basic Problem Addressed. The Approach I: Training. Main Idea. The Approach II: Testing. Why a set of vocabularies?

Basic Problem Addressed. The Approach I: Training. Main Idea. The Approach II: Testing. Why a set of vocabularies? Visual Categorization With Bags of Keypoints. ECCV,. G. Csurka, C. Bray, C. Dance, and L. Fan. Shilpa Gulati //7 Basic Problem Addressed Find a method for Generic Visual Categorization Visual Categorization:

More information

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Object Recognition. Computer Vision. Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce Object Recognition Computer Vision Slides from Lana Lazebnik, Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce How many visual object categories are there? Biederman 1987 ANIMALS PLANTS OBJECTS

More information

Distances and Kernels. Motivation

Distances and Kernels. Motivation Distances and Kernels Amirshahed Mehrtash Motivation How similar? 1 Problem Definition Designing a fast system to measure the similarity il it of two images. Used to categorize images based on appearance.

More information

Bloom Filters and Compact Hash Codes for Efficient and Distributed Image Retrieval

Bloom Filters and Compact Hash Codes for Efficient and Distributed Image Retrieval 2016 IEEE International Symposium on Multimedia Bloom Filters and Compact Hash Codes for Efficient and Distributed Image Retrieval Andrea Salvi, Simone Ercoli, Marco Bertini and Alberto Del Bimbo MICC

More information

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features

Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and

More information

Instance Search with Weak Geometric Correlation Consistency

Instance Search with Weak Geometric Correlation Consistency Instance Search with Weak Geometric Correlation Consistency Zhenxing Zhang, Rami Albatal, Cathal Gurrin, and Alan F. Smeaton Insight Centre for Data Analytics School of Computing, Dublin City University

More information

Modern-era mobile phones and tablets

Modern-era mobile phones and tablets Anthony Vetro Mitsubishi Electric Research Labs Mobile Visual Search: Architectures, Technologies, and the Emerging MPEG Standard Bernd Girod and Vijay Chandrasekhar Stanford University Radek Grzeszczuk

More information

String distance for automatic image classification

String distance for automatic image classification String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,

More information

A Systems View of Large- Scale 3D Reconstruction

A Systems View of Large- Scale 3D Reconstruction Lecture 23: A Systems View of Large- Scale 3D Reconstruction Visual Computing Systems Goals and motivation Construct a detailed 3D model of the world from unstructured photographs (e.g., Flickr, Facebook)

More information

Nagoya University at TRECVID 2014: the Instance Search Task

Nagoya University at TRECVID 2014: the Instance Search Task Nagoya University at TRECVID 2014: the Instance Search Task Cai-Zhi Zhu 1 Yinqiang Zheng 2 Ichiro Ide 1 Shin ichi Satoh 2 Kazuya Takeda 1 1 Nagoya University, 1 Furo-Cho, Chikusa-ku, Nagoya, Aichi 464-8601,

More information

Fuzzy based Multiple Dictionary Bag of Words for Image Classification

Fuzzy based Multiple Dictionary Bag of Words for Image Classification Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image

More information