Approximate Nearest Neighbor Search. Deng Cai Zhejiang University
|
|
- August Byron Horton
- 5 years ago
- Views:
Transcription
1 Approximate Nearest Neighbor Search Deng Cai Zhejiang University
2 The Era of Big Data
3 How to Find Things Quickly? Web 1.0 Text Search Sparse feature Inverted Index
4 How to Find Things Quickly? Web 2.0, 3.0 Multimedia Search Dense feature GIST 特征 960 维 HoG 特征 3360 维 SIFT 特征 128 维
5 How to Find Things Quickly? Dense feature Inverted index can no longer be used Exhaustive search is inefficient Indexing the large scale dataset Tree based index (KD-Tree, R-Tree, )
6 KD-Tree Spatial partitions and recursive hyper plane decomposition 2-dimensional KD-Tree
7 KD-Tree The worst case time for range search in a d-dimensional KD-Tree containing n nodes is given by the following equation It performs worse than a linear scan when the dimensionality of the space is high! D. T. Lee, C. K. Wong. Worst case analysis for region and partial region searches in multidimensional binary search trees and balanced quad trees, ACTA Informatica, 1977.
8 Approximate Nearest Neighbor (ANN) Search ε-approximate nearest neighbor search is a special case of the nearest neighbor search problem. The solution to the ε- approximate nearest neighbor search is a point or multiple points within distance (1+ε) R from a query point, where R is the distance between the query point and its true nearest neighbor.
9 Approximate Nearest Neighbor (ANN) Search Search nearest neighbors of a query An algorithm returns points Not all the points are the true nearest neighbors of the query We need to examine how many points in this returned set are among the true nearest neighbors of the query. Naturally defines the evaluation metric
10 Hashing for ANN Search Coding Locating Linear scan
11 Hashing for Nearest Neighbor Search Hash Function
12 History LSH (VLDB 1999) Spectral Hashing (NIPS 2009) NIPS(14), ICML(10), AAAI(15), IJCAI(23), CVPR(38), ICCV(15), ACM Multimedia(32), SIGKDD(7),SIGIR(17) Total 171 papers 24 papers per year
13 Unclear questions What are the proper metric for evaluating a ANN search method? How is the performance of hashing methods compared to tree-based methods (e.g. KD-tree)? Which (group of) hashing algorithm is the best? How can we invent a better ANNS algorithm?
14 Question 1: How to evaluate a ANNS method
15 Common Evaluation Mean Average Precision
16 Common Evaluation Hash Look Up Hamming Radius 2
17 Common Evaluation Missing the most important aspect Time! ANNS method Sacrifice the accuracy to achieve the efficiency Accuracy & Recall : The returned set of points given a query : The true nearest neighbors of the query The recall of is the same as the accuracy of Time-Accuracy (Time-Recall) curve
18 Evaluation Time-Accuracy (Time-Recall) curve Index size (memory size) Critical for some applications Indexing time Less important
19 Question 2: Hashing vs. KD-tree How is the performance of hashing methods compared to tree-based methods (e.g. KD-tree)? Datasets SIFT1M: [0~255] integer GIST1M: [0~1.5] float
20 Compared Algorithms flann (KD-tree) VISAPP'09, TPAMI 14 LSH (Locality Sensitive Hashing) VLDB 99 32bit ITQ (Iterative Quantization) CVPR 11 32bit
21 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
22 GIST1M (960 dim) GIST 1NN GIST 100NN
23 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
24 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
25 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
26 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
27 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
28 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
29 GIST1M (960 dim) GIST 1NN GIST 100NN
30 GIST1M (960 dim) GIST 1NN GIST 100NN
31 GIST1M (960 dim) GIST 1NN GIST 100NN
32 GIST1M (960 dim) GIST 1NN GIST 100NN
33 Index Memory & Indexing Time
34 Question 2: Hashing vs. KD-tree Answer: A carefully designed hashing algorithm (e.g. ITQ) is better than KD-tree
35 Question 3: The best hashing algorithm
36 Compared Algorithms LSH (Locality Sensitive Hashing) VLDB 99 32bit Based on random projection ITQ (Iterative Quantization) CVPR 11 32bit AGH (Anchor Graph Hashing) ICML 11 32bit Based on graph embedding
37 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
38 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
39 GIST1M (960 dim) GIST 1NN GIST 100NN
40 GIST1M (960 dim) GIST 1NN GIST 100NN
41 AGH results in the original paper
42 AGH results in the original paper MNIST 10 classes, each class has 7,000 images NUS-WIDE 21 classes, each class has 5,000 to 30,000 images
43 Supervised hashing methods?
44 Supervised hashing methods?
45 Properties of good hashing algorithm 1. Nearby points are mapped into same (nearby) buckets 2. The points are evenly distributed among all the buckets Each bucket is associated with a binary code!!!
46 Question 4: Better ANNS method How can we invent a better ANNS algorithm? Existing algorithms seem not good enough.
47 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
48 GIST1M (960 dim) GIST 1NN GIST 100NN
49 Idea
50 Graph-base Nearest Neighbor Search Additional Indexing structure An offline build knn graph Online search Given a query Use hashing methods to generate a small number of candidates Check the neighbors of these candidates for closer ones iteratively Z. Jin, D. Zhang, Y. Hu, S. Lin, D. Cai, X. He: Fast and Accurate Hashing Via Iterative Nearest Neighbors Expansion. IEEE Trans. Cybernetics 44(11): (2014)
51 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
52 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
53 SIFT1M (128 dim) SIFT 1NN SIFT 100NN
54 GIST1M (960 dim) GIST 1NN GIST 100NN
55 Efanna Extremely Fast Approximate Nearest Neighbor Search Algorithm Based on knn Graph (arxiv) Use truncated KD-tree for initialization Extremely Fast Approximate Nearest Neighbor Graph construction Algorithm C. Fu, D. Cai: EFANNA : An Extremely Fast Approximate Nearest Neighbor Search Algorithm Based on knn Graph. CoRR abs/ (2016)
56 How to build the knn graph efficiently SIFT1M 19 hours; GIST1M 157 hours (a week) No discussions on IEH (IEEE TCB 2014) papers
57 How to build the knn graph efficiently Efanna: Extremely Fast Approximate Nearest Neighbor graph construction Algorithm SIFT1M (128): 659s, 19 hours, to construct a 10NN-graph 188s, % accuracy, 362 times speedup! GIST1M (960): s, 157 hours, to construct a 10NN-graph 2300s, 92% accuracy, 245 times speedup!
58 SIFT1M Accuracy (%) SGraph FastKNN LargeVis brute-force Efanna Kgraph NN-descent NN-expansion
59 GIST1M Accuracy (%) SGraph FastKNN LargeVis brute-force Efanna Kgraph NN-descent NN-expansion
60 Index Memory & Indexing Time
61
NEarest neighbor search plays an important role in
1 EFANNA : An Extremely Fast Approximate Nearest Neighbor Search Algorithm Based on knn Graph Cong Fu, Deng Cai arxiv:1609.07228v3 [cs.cv] 3 Dec 2016 Abstract Approximate nearest neighbor (ANN) search
More informationNEarest neighbor search plays an important role in
1 EFANNA : An Extremely Fast Approximate Nearest Neighbor Search Algorithm Based on knn Graph Cong Fu, Deng Cai arxiv:1609.07228v2 [cs.cv] 18 Nov 2016 Abstract Approximate nearest neighbor (ANN) search
More informationHashing with Graphs. Sanjiv Kumar (Google), and Shih Fu Chang (Columbia) June, 2011
Hashing with Graphs Wei Liu (Columbia Columbia), Jun Wang (IBM IBM), Sanjiv Kumar (Google), and Shih Fu Chang (Columbia) June, 2011 Overview Graph Hashing Outline Anchor Graph Hashing Experiments Conclusions
More informationLecture 24: Image Retrieval: Part II. Visual Computing Systems CMU , Fall 2013
Lecture 24: Image Retrieval: Part II Visual Computing Systems Review: K-D tree Spatial partitioning hierarchy K = dimensionality of space (below: K = 2) 3 2 1 3 3 4 2 Counts of points in leaf nodes Nearest
More informationSupervised Hashing for Image Retrieval via Image Representation Learning
Supervised Hashing for Image Retrieval via Image Representation Learning Rongkai Xia, Yan Pan, Cong Liu (Sun Yat-Sen University) Hanjiang Lai, Shuicheng Yan (National University of Singapore) Finding Similar
More informationRongrong Ji (Columbia), Yu Gang Jiang (Fudan), June, 2012
Supervised Hashing with Kernels Wei Liu (Columbia Columbia), Jun Wang (IBM IBM), Rongrong Ji (Columbia), Yu Gang Jiang (Fudan), and Shih Fu Chang (Columbia Columbia) June, 2012 Outline Motivations Problem
More informationarxiv: v5 [cs.cv] 15 May 2018
A Revisit of Hashing Algorithms for Approximate Nearest Neighbor Search The State Key Lab of CAD&CG, College of Computer Science, Zhejiang University, China Alibaba-Zhejiang University Joint Institute
More informationLarge-scale visual recognition Efficient matching
Large-scale visual recognition Efficient matching Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline!! Preliminary!! Locality Sensitive Hashing: the two modes!! Hashing!! Embedding!!
More informationAdaptive Binary Quantization for Fast Nearest Neighbor Search
IBM Research Adaptive Binary Quantization for Fast Nearest Neighbor Search Zhujin Li 1, Xianglong Liu 1*, Junjie Wu 1, and Hao Su 2 1 Beihang University, Beijing, China 2 Stanford University, Stanford,
More informationCLSH: Cluster-based Locality-Sensitive Hashing
CLSH: Cluster-based Locality-Sensitive Hashing Xiangyang Xu Tongwei Ren Gangshan Wu Multimedia Computing Group, State Key Laboratory for Novel Software Technology, Nanjing University xiangyang.xu@smail.nju.edu.cn
More informationNearest neighbors. Focus on tree-based methods. Clément Jamin, GUDHI project, Inria March 2017
Nearest neighbors Focus on tree-based methods Clément Jamin, GUDHI project, Inria March 2017 Introduction Exact and approximate nearest neighbor search Essential tool for many applications Huge bibliography
More informationGeometric data structures:
Geometric data structures: Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade Sham Kakade 2017 1 Announcements: HW3 posted Today: Review: LSH for Euclidean distance Other
More informationProgressive Generative Hashing for Image Retrieval
Progressive Generative Hashing for Image Retrieval Yuqing Ma, Yue He, Fan Ding, Sheng Hu, Jun Li, Xianglong Liu 2018.7.16 01 BACKGROUND the NNS problem in big data 02 RELATED WORK Generative adversarial
More informationSearching in one billion vectors: re-rank with source coding
Searching in one billion vectors: re-rank with source coding Hervé Jégou INRIA / IRISA Romain Tavenard Univ. Rennes / IRISA Laurent Amsaleg CNRS / IRISA Matthijs Douze INRIA / LJK ICASSP May 2011 LARGE
More informationLocality- Sensitive Hashing Random Projections for NN Search
Case Study 2: Document Retrieval Locality- Sensitive Hashing Random Projections for NN Search Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade April 18, 2017 Sham Kakade
More informationComplementary Projection Hashing
23 IEEE International Conference on Computer Vision Complementary Projection Hashing Zhongg Jin, Yao Hu, Yue Lin, Debing Zhang, Shiding Lin 2, Deng Cai, Xuelong Li 3 State Key Lab of CAD&CG, College of
More informationon learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015
on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015 Vector visual representation Fixed-size image representation High-dim (100 100,000) Generic, unsupervised: BoW,
More informationFast Indexing and Search. Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation
Fast Indexing and Search Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation Motivation Object categorization? http://www.cs.utexas.edu/~grauman/slides/jain_et_al_cvpr2008.ppt Motivation
More informationLarge scale object/scene recognition
Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database
More informationMetric Learning Applied for Automatic Large Image Classification
September, 2014 UPC Metric Learning Applied for Automatic Large Image Classification Supervisors SAHILU WENDESON / IT4BI TOON CALDERS (PhD)/ULB SALIM JOUILI (PhD)/EuraNova Image Database Classification
More informationarxiv: v1 [cs.cv] 19 Oct 2017 Abstract
Improved Search in Hamming Space using Deep Multi-Index Hashing Hanjiang Lai and Yan Pan School of Data and Computer Science, Sun Yan-Sen University, China arxiv:70.06993v [cs.cv] 9 Oct 207 Abstract Similarity-preserving
More informationEvaluation of Hashing Methods Performance on Binary Feature Descriptors
Evaluation of Hashing Methods Performance on Binary Feature Descriptors Jacek Komorowski and Tomasz Trzcinski 2 Warsaw University of Technology, Warsaw, Poland jacek.komorowski@gmail.com 2 Warsaw University
More informationEvaluation of GIST descriptors for web scale image search
Evaluation of GIST descriptors for web scale image search Matthijs Douze Hervé Jégou, Harsimrat Sandhawalia, Laurent Amsaleg and Cordelia Schmid INRIA Grenoble, France July 9, 2009 Evaluation of GIST for
More informationover Multi Label Images
IBM Research Compact Hashing for Mixed Image Keyword Query over Multi Label Images Xianglong Liu 1, Yadong Mu 2, Bo Lang 1 and Shih Fu Chang 2 1 Beihang University, Beijing, China 2 Columbia University,
More informationLarge Scale Nearest Neighbor Search Theories, Algorithms, and Applications. Junfeng He
Large Scale Nearest Neighbor Search Theories, Algorithms, and Applications Junfeng He Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School
More informationThe Normalized Distance Preserving Binary Codes and Distance Table *
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 32, XXXX-XXXX (2016) The Normalized Distance Preserving Binary Codes and Distance Table * HONGWEI ZHAO 1,2, ZHEN WANG 1, PINGPING LIU 1,2 AND BIN WU 1 1.
More informationA General and Efficient Querying Method for Learning to Hash
A General and Efficient Querying Method for Jinfeng Li, Xiao Yan, Jian Zhang, An Xu, James Cheng, Jie Liu, Kelvin K. W. Ng, Ti-chung Cheng Department of Computer Science and Engineering The Chinese University
More informationImage Analysis & Retrieval. CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W Lec 18.
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 18 Image Hashing Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph:
More informationIntroduction to Machine Learning Lecture 4. Mehryar Mohri Courant Institute and Google Research
Introduction to Machine Learning Lecture 4 Mehryar Mohri Courant Institute and Google Research mohri@cims.nyu.edu Nearest-Neighbor Algorithms Nearest Neighbor Algorithms Definition: fix k 1, given a labeled
More informationMachine Learning. Nonparametric methods for Classification. Eric Xing , Fall Lecture 2, September 12, 2016
Machine Learning 10-701, Fall 2016 Nonparametric methods for Classification Eric Xing Lecture 2, September 12, 2016 Reading: 1 Classification Representing data: Hypothesis (classifier) 2 Clustering 3 Supervised
More information6.819 / 6.869: Advances in Computer Vision
6.819 / 6.869: Advances in Computer Vision Image Retrieval: Retrieval: Information, images, objects, large-scale Website: http://6.869.csail.mit.edu/fa15/ Instructor: Yusuf Aytar Lecture TR 9:30AM 11:00AM
More informationLearning independent, diverse binary hash functions: pruning and locality
Learning independent, diverse binary hash functions: pruning and locality Ramin Raziperchikolaei and Miguel Á. Carreira-Perpiñán Electrical Engineering and Computer Science University of California, Merced
More informationLarge Scale Mobile Visual Search
Large Scale Mobile Visual Search Ricoh, HotPaper (by Mac Funamizu) Shih-Fu Chang June 2012 The Explosive Growth of Visual Data broadcast Social portals video blogs 70,000 TB/year, 100 million hours 60
More informationTask Description: Finding Similar Documents. Document Retrieval. Case Study 2: Document Retrieval
Case Study 2: Document Retrieval Task Description: Finding Similar Documents Machine Learning for Big Data CSE547/STAT548, University of Washington Sham Kakade April 11, 2017 Sham Kakade 2017 1 Document
More informationlearning stage (Stage 1), CNNH learns approximate hash codes for training images by optimizing the following loss function:
1 Query-adaptive Image Retrieval by Deep Weighted Hashing Jian Zhang and Yuxin Peng arxiv:1612.2541v2 [cs.cv] 9 May 217 Abstract Hashing methods have attracted much attention for large scale image retrieval.
More informationdoc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague
Praha & EU: Investujeme do vaší budoucnosti Evropský sociální fond course: Searching the Web and Multimedia Databases (BI-VWM) Tomáš Skopal, 2011 SS2010/11 doc. RNDr. Tomáš Skopal, Ph.D. Department of
More informationSpatial Data Management
Spatial Data Management [R&G] Chapter 28 CS432 1 Types of Spatial Data Point Data Points in a multidimensional space E.g., Raster data such as satellite imagery, where each pixel stores a measured value
More informationSpatial Data Management
Spatial Data Management Chapter 28 Database management Systems, 3ed, R. Ramakrishnan and J. Gehrke 1 Types of Spatial Data Point Data Points in a multidimensional space E.g., Raster data such as satellite
More informationHashing with Binary Autoencoders
Hashing with Binary Autoencoders Ramin Raziperchikolaei Electrical Engineering and Computer Science University of California, Merced http://eecs.ucmerced.edu Joint work with Miguel Á. Carreira-Perpiñán
More informationScalable Nearest Neighbor Algorithms for High Dimensional Data Marius Muja (UBC), David G. Lowe (Google) IEEE 2014
Scalable Nearest Neighbor Algorithms for High Dimensional Data Marius Muja (UBC), David G. Lowe (Google) IEEE 2014 Presenter: Derrick Blakely Department of Computer Science, University of Virginia https://qdata.github.io/deep2read/
More informationarxiv: v2 [cs.cv] 27 Nov 2017
Deep Supervised Discrete Hashing arxiv:1705.10999v2 [cs.cv] 27 Nov 2017 Qi Li Zhenan Sun Ran He Tieniu Tan Center for Research on Intelligent Perception and Computing National Laboratory of Pattern Recognition
More informationCompressed local descriptors for fast image and video search in large databases
Compressed local descriptors for fast image and video search in large databases Matthijs Douze2 joint work with Hervé Jégou1, Cordelia Schmid2 and Patrick Pérez3 1: INRIA Rennes, TEXMEX team, France 2:
More informationarxiv: v6 [cs.lg] 3 Jun 2018
Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph Cong Fu, Chao Xiang, Changxu Wang, Deng Cai The State Key Lab of CAD&CG, College of Computer Science, Zhejiang University,
More informationLearning to Hash on Structured Data
Learning to Hash on Structured Data Qifan Wang, Luo Si and Bin Shen Computer Science Department, Purdue University West Lafayette, IN 47907, US wang868@purdue.edu, lsi@purdue.edu, bshen@purdue.edu Abstract
More informationSimilarity-Preserving Binary Hashing for Image Retrieval in large databases
Universidad Politécnica de Valencia Master s Final Project Similarity-Preserving Binary Hashing for Image Retrieval in large databases Author: Guillermo García Franco Supervisor: Dr. Roberto Paredes Palacios
More informationProblem 1: Complexity of Update Rules for Logistic Regression
Case Study 1: Estimating Click Probabilities Tackling an Unknown Number of Features with Sketching Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox January 16 th, 2014 1
More informationSupervised Hashing for Image Retrieval via Image Representation Learning
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Supervised Hashing for Image Retrieval via Image Representation Learning Rongkai Xia 1, Yan Pan 1, Hanjiang Lai 1,2, Cong Liu
More informationNearest Neighbor with KD Trees
Case Study 2: Document Retrieval Finding Similar Documents Using Nearest Neighbors Machine Learning/Statistics for Big Data CSE599C1/STAT592, University of Washington Emily Fox January 22 nd, 2013 1 Nearest
More informationA Systems View of Large- Scale 3D Reconstruction
Lecture 23: A Systems View of Large- Scale 3D Reconstruction Visual Computing Systems Goals and motivation Construct a detailed 3D model of the world from unstructured photographs (e.g., Flickr, Facebook)
More informationComplementary Hashing for Approximate Nearest Neighbor Search
Complementary Hashing for Approximate Nearest Neighbor Search Hao Xu Jingdong Wang Zhu Li Gang Zeng Shipeng Li Nenghai Yu MOE-MS KeyLab of MCC, University of Science and Technology of China, P. R. China
More informationImproving the LSD h -tree for fast approximate nearest neighbor search
Improving the LSD h -tree for fast approximate nearest neighbor search Floris Kleyn Leiden University, The Netherlands Technical Report kleyn@liacs.nl 1. Abstract Finding most similar items in large datasets
More informationarxiv: v1 [cs.cv] 11 Dec 2013
Fast Neighborhood Graph Search using Cartesian Concatenation Jingdong Wang Jing Wang Gang Zeng Rui Gan Shipeng Li Baining Guo arxiv:32.3062v [cs.cv] Dec 203 Abstract In this paper, we propose a new data
More informationHASHING has been recognized as an effective technique
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 1 Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization Fumin Shen, Yan Xu, Li Liu, Yang Yang, Zi Huang, Heng Tao Shen
More informationA GPGPU Algorithm for c-approximate r-nearest Neighbor Search in High Dimensions
A GPGPU Algorithm for c-approximate r-nearest Neighbor Search in High Dimensions Lee A Carraher, Philip A Wilsey, and Fred S Annexstein School of Electronic and Computing Systems University of Cincinnati
More informationProduct Quantized Translation for Fast Nearest Neighbor Search
The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-8) Product Quantized Translation for Fast Nearest Neighbor Search Yoonho Hwang, Mooyeol Baek, Saehoon Kim, Bohyung Han, Hee-Kap Ahn Dept.
More informationSupervised Learning: Nearest Neighbors
CS 2750: Machine Learning Supervised Learning: Nearest Neighbors Prof. Adriana Kovashka University of Pittsburgh February 1, 2016 Today: Supervised Learning Part I Basic formulation of the simplest classifier:
More informationRECENT years have witnessed the rapid growth of image. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval
SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval Jian Zhang, Yuxin Peng, and Junchao Zhang arxiv:607.08477v [cs.cv] 28 Jul 206 Abstract The hashing methods have been widely used for efficient
More informationIsometric Mapping Hashing
Isometric Mapping Hashing Yanzhen Liu, Xiao Bai, Haichuan Yang, Zhou Jun, and Zhihong Zhang Springer-Verlag, Computer Science Editorial, Tiergartenstr. 7, 692 Heidelberg, Germany {alfred.hofmann,ursula.barth,ingrid.haas,frank.holzwarth,
More informationLearning Affine Robust Binary Codes Based on Locality Preserving Hash
Learning Affine Robust Binary Codes Based on Locality Preserving Hash Wei Zhang 1,2, Ke Gao 1, Dongming Zhang 1, and Jintao Li 1 1 Advanced Computing Research Laboratory, Beijing Key Laboratory of Mobile
More informationEfficient Large-scale Approximate Nearest Neighbor Search on the GPU
Efficient Large-scale Approximate Nearest Neighbor Search on the GPU Patrick Wieschollek 1,4 Oliver Wang 2 Alexander Sorkine-Hornung 3 Hendrik PA Lensch 1 1 University of Tübingen 2 Adobe Systems Inc 3
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu [Kumar et al. 99] 2/13/2013 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu
More informationBinary Embedding with Additive Homogeneous Kernels
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-7) Binary Embedding with Additive Homogeneous Kernels Saehoon Kim, Seungjin Choi Department of Computer Science and Engineering
More informationBeing Prepared In A Sparse World: The Case of KNN Graph Construction. Antoine Boutet DRIM LIRIS, Lyon
Being Prepared In A Sparse World: The Case of KNN Graph Construction Antoine Boutet DRIM LIRIS, Lyon Co-authors Joint work with François Taiani Nupur Mittal Anne-Marie Kermarrec Published at ICDE 2016
More informationSupervised Hashing with Latent Factor Models
Supervised Hashing with Latent Factor Models Peichao Zhang Shanghai Key Laboratory of Scalable Computing and Systems Department of Computer Science and Engineering Shanghai Jiao Tong University, China
More informationIndexing. Week 14, Spring Edited by M. Naci Akkøk, , Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel
Indexing Week 14, Spring 2005 Edited by M. Naci Akkøk, 5.3.2004, 3.3.2005 Contains slides from 8-9. April 2002 by Hector Garcia-Molina, Vera Goebel Overview Conventional indexes B-trees Hashing schemes
More informationGoing nonparametric: Nearest neighbor methods for regression and classification
Going nonparametric: Nearest neighbor methods for regression and classification STAT/CSE 46: Machine Learning Emily Fox University of Washington May 3, 208 Locality sensitive hashing for approximate NN
More informationFast Nearest Neighbor Search in the Hamming Space
Fast Nearest Neighbor Search in the Hamming Space Zhansheng Jiang 1(B), Lingxi Xie 2, Xiaotie Deng 1,WeiweiXu 3, and Jingdong Wang 4 1 Shanghai Jiao Tong University, Shanghai, People s Republic of China
More informationDatenbanksysteme II: Multidimensional Index Structures 2. Ulf Leser
Datenbanksysteme II: Multidimensional Index Structures 2 Ulf Leser Content of this Lecture Introduction Partitioned Hashing Grid Files kdb Trees kd Tree kdb Tree R Trees Example: Nearest neighbor image
More informationFast Neighborhood Graph Search using Cartesian Concatenation
Fast Neighborhood Graph Search using Cartesian Concatenation Jing Wang Jingdong Wang 2 Gang Zeng Rui Gan Shipeng Li 2 Baining Guo 2 Peking University 2 Microsoft Research Abstract In this paper, we propose
More informationNearest Neighbor with KD Trees
Case Study 2: Document Retrieval Finding Similar Documents Using Nearest Neighbors Machine Learning/Statistics for Big Data CSE599C1/STAT592, University of Washington Emily Fox January 22 nd, 2013 1 Nearest
More informationArchitectural Support for Large-Scale Visual Search. Carlo C. del Mundo Vincent Lee Armin Alaghi Luis Ceze Mark Oskin
Architectural Support for Large-Scale Visual Search Carlo C. del Mundo Vincent Lee Armin Alaghi Luis Ceze Mark Oskin Motivation: Visual Data & Their Applications Rebooting the IT Revolution, SIA, September
More informationGoing nonparametric: Nearest neighbor methods for regression and classification
Going nonparametric: Nearest neighbor methods for regression and classification STAT/CSE 46: Machine Learning Emily Fox University of Washington May 8, 28 Locality sensitive hashing for approximate NN
More informationBoosting Complementary Hash Tables for Fast Nearest Neighbor Search
Boosting Complementary Hash Tables for Fast Nearest Neighbor Search Xianglong Liu Cheng Deng Yadong Mu Zhujin Li State Key Lab of Software Development Environment, Beihang University, Being 1191, China
More informationRandom Grids: Fast Approximate Nearest Neighbors and Range Searching for Image Search
Random Grids: Fast Approximate Nearest Neighbors and Range Searching for Image Search Dror Aiger, Efi Kokiopoulou, Ehud Rivlin Google Inc. aigerd@google.com, kokiopou@google.com, ehud@google.com Abstract
More informationJoint Inference in Image Databases via Dense Correspondence. Michael Rubinstein MIT CSAIL (while interning at Microsoft Research)
Joint Inference in Image Databases via Dense Correspondence Michael Rubinstein MIT CSAIL (while interning at Microsoft Research) My work Throughout the year (and my PhD thesis): Temporal Video Analysis
More informationLearning to Hash with Binary Reconstructive Embeddings
Learning to Hash with Binary Reconstructive Embeddings Brian Kulis Trevor Darrell Electrical Engineering and Computer Sciences University of California at Berkeley Technical Report No. UCB/EECS-2009-0
More informationLearning to Hash with Binary Reconstructive Embeddings
Learning to Hash with Binary Reconstructive Embeddings Brian Kulis and Trevor Darrell UC Berkeley EECS and ICSI Berkeley, CA {kulis,trevor}@eecs.berkeley.edu Abstract Fast retrieval methods are increasingly
More informationPrinciples of Data Management. Lecture #14 (Spatial Data Management)
Principles of Data Management Lecture #14 (Spatial Data Management) Instructor: Mike Carey mjcarey@ics.uci.edu Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Today s Notable News v Project
More informationCoding for Random Projects
Coding for Random Projects CS 584: Big Data Analytics Material adapted from Li s talk at ICML 2014 (http://techtalks.tv/talks/coding-for-random-projections/61085/) Random Projections for High-Dimensional
More informationFast Similarity Search for High-Dimensional Dataset
Fast Similarity Search for High-Dimensional Dataset Quan Wang and Suya You Computer Science Department University of Southern California {quanwang,suyay}@graphics.usc.edu Abstract This paper addresses
More informationLarge-Scale Face Manifold Learning
Large-Scale Face Manifold Learning Sanjiv Kumar Google Research New York, NY * Joint work with A. Talwalkar, H. Rowley and M. Mohri 1 Face Manifold Learning 50 x 50 pixel faces R 2500 50 x 50 pixel random
More informationMultidimensional Indexes [14]
CMSC 661, Principles of Database Systems Multidimensional Indexes [14] Dr. Kalpakis http://www.csee.umbc.edu/~kalpakis/courses/661 Motivation Examined indexes when search keys are in 1-D space Many interesting
More informationA Unified Approach to Learning Task-Specific Bit Vector Representations for Fast Nearest Neighbor Search
A Unified Approach to Learning Task-Specific Bit Vector Representations for Fast Nearest Neighbor Search Vinod Nair Yahoo! Labs Bangalore vnair@yahoo-inc.com Dhruv Mahajan Yahoo! Labs Bangalore dkm@yahoo-inc.com
More informationDeepIndex for Accurate and Efficient Image Retrieval
DeepIndex for Accurate and Efficient Image Retrieval Yu Liu, Yanming Guo, Song Wu, Michael S. Lew Media Lab, Leiden Institute of Advance Computer Science Outline Motivation Proposed Approach Results Conclusions
More informationSupplementary Material for The Generalized PatchMatch Correspondence Algorithm
Supplementary Material for The Generalized PatchMatch Correspondence Algorithm Connelly Barnes 1, Eli Shechtman 2, Dan B Goldman 2, Adam Finkelstein 1 1 Princeton University, 2 Adobe Systems 1 Overview
More informationCompact Hash Code Learning with Binary Deep Neural Network
Compact Hash Code Learning with Binary Deep Neural Network Thanh-Toan Do, Dang-Khoa Le Tan, Tuan Hoang, Ngai-Man Cheung arxiv:7.0956v [cs.cv] 6 Feb 08 Abstract In this work, we firstly propose deep network
More informationWarped parallel nearest neighbor searches using kd-trees
Warped parallel nearest neighbor searches using kd-trees Roman Sokolov, Andrei Tchouprakov D4D Technologies Kd-trees Binary space partitioning tree Used for nearest-neighbor search, range search Application:
More informationMetric Learning for Large-Scale Image Classification:
Metric Learning for Large-Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Florent Perronnin 1 work published at ECCV 2012 with: Thomas Mensink 1,2 Jakob Verbeek 2 Gabriela Csurka
More informationAdaptive Hash Retrieval with Kernel Based Similarity
Adaptive Hash Retrieval with Kernel Based Similarity Xiao Bai a, Cheng Yan a, Haichuan Yang a, Lu Bai b, Jun Zhou c, Edwin Robert Hancock d a School of Computer Science and Engineering, Beihang University,
More informationThree things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012)
Three things everyone should know to improve object retrieval Relja Arandjelović and Andrew Zisserman (CVPR 2012) University of Oxford 2 nd April 2012 Large scale object retrieval Find all instances of
More informationAPPROXIMATE K-Nearest Neighbor (ANN) search has
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Distance Encoded Product Quantization for Approximate K-Nearest Neighbor Search in High-Dimensional Space Jae-Pil Heo, Zhe Lin, and Sung-Eui
More informationSupplementary Material for Ensemble Diffusion for Retrieval
Supplementary Material for Ensemble Diffusion for Retrieval Song Bai 1, Zhichao Zhou 1, Jingdong Wang, Xiang Bai 1, Longin Jan Latecki 3, Qi Tian 4 1 Huazhong University of Science and Technology, Microsoft
More informationOptimizing Out-of-Core Nearest Neighbor Problems on Multi-GPU Systems Using NVLink
Optimizing Out-of-Core Nearest Neighbor Problems on Multi-GPU Systems Using NVLink Rajesh Bordawekar IBM T. J. Watson Research Center bordaw@us.ibm.com Pidad D Souza IBM Systems pidsouza@in.ibm.com 1 Outline
More informationWeb- Scale Mul,media: Op,mizing LSH. Malcolm Slaney Yury Li<shits Junfeng He Y! Research
Web- Scale Mul,media: Op,mizing LSH Malcolm Slaney Yury Li
More informationOnline Document Clustering Using the GPU
Online Document Clustering Using the GPU Benjamin E. Teitler, Jagan Sankaranarayanan, Hanan Samet Center for Automation Research Institute for Advanced Computer Studies Department of Computer Science University
More informationMetric Learning for Large Scale Image Classification:
Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Thomas Mensink 1,2 Jakob Verbeek 2 Florent Perronnin 1 Gabriela Csurka 1 1 TVPA - Xerox Research Centre
More informationChallenge... Ex. Maps are 2D Ex. Images are (with * height) D (assuming that each pixel is a feature)
Challenge... l Traditional data is one dimensional. l Multimedia data is multi dimensional. Ex. Maps are 2D Ex. Images are (with * height) D (assuming that each pixel is a feature) In general, if a given
More informationDeep Supervised Hashing with Triplet Labels
Deep Supervised Hashing with Triplet Labels Xiaofang Wang, Yi Shi, Kris M. Kitani Carnegie Mellon University, Pittsburgh, PA 15213 USA Abstract. Hashing is one of the most popular and powerful approximate
More informationarxiv: v1 [cs.mm] 3 May 2016
Bloom Filters and Compact Hash Codes for Efficient and Distributed Image Retrieval Andrea Salvi, Simone Ercoli, Marco Bertini and Alberto Del Bimbo Media Integration and Communication Center, Università
More informationPattern Spotting in Historical Document Image
Pattern Spotting in historical document images Sovann EN, Caroline Petitjean, Stéphane Nicolas, Frédéric Jurie, Laurent Heutte LITIS, University of Rouen, France 1 Outline Introduction Commons Pipeline
More information