Efficient Similarity Search in Scientific Databases with Feature Signatures
|
|
- Adrian McCormick
- 5 years ago
- Views:
Transcription
1 DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl DATA MANAGEMENT AND DATA EXPLORATION GROUP Prof. Dr. rer. nat. Thomas Seidl Efficient Similarity Search in Scientific Databases with Feature Signatures Merih Seran Uysal Christian Beecks Jochen Schmücking Thomas Seidl RWTH Aachen University, Germany 27th International Conference on Scientific and Statistical Database Management July 1, 2015 San Diego, California
2 Similarity Search How to determine the similarity between two data objects in scientific databases? similar? Requirements Feature representation Similarity measure Efficient query processing Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 2 / 18
3 Similarity Search How to determine the similarity between two data objects in scientific databases? similar? Requirements Feature representation use feature signatures Similarity measure use Earth Mover s Distance Efficient query processing new lower-bounding techniques Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 2 / 18
4 Overview 1 Preliminaries 2 Reduced Signatures Filter Approximations 4 Experimental Evaluation 5 Conclusion and Outlook Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures / 18
5 Feature Representation similar? data object feature signature Feature Signature Represent the data object by features in a feature space F Aggregate features to obtain a compact feature representation Finite number of features with non-zero weights (representatives) Formally; X : F R subject to R X < with R X = {f F X(f ) 0} F Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 4 / 18
6 Intuition: Earth Mover s Distance Earth Mover s Distance (EMD) [1] Emd:,2 tested with java transforms each feature signature to another one denotes a transportation problem (linear optimization problem) chooses the minimum-cost flow among all flows exhibits high computational time complexity [1] Y. Rubner, C. Tomasi, and L. J. Guibas. The earth mover s distance as a metric for image retrieval. Int. Journal of Computer Vision, 40(2):99-121, Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 5 / 18
7 Earth Mover s Distance Given feature signatures X, Y S + over a feature space (F, δ) with a distance function δ : F F R, EMD : S + S + R between X and Y is defined as a minimum-cost flow of all possible flows F = {f f : F F R} = R F F by: subject to constraints: EMD(X, Y ) = min f F Non-negativity: x, y F : f (x, y) 0 Source: x F : f (x, y) X(x) Target: y F : y F f (x, y) Y (y) x F Total flow: m = x F y F 1 δ(x, y) f (x, y) m x F y F f (x, y) = min{ X(x), Y (y)} x F y F Emd:,2 tested with java Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 6 / 18
8 Efficient Query Processing High computational time complexity of the EMD Bottleneck! How to perform query processing with the EMD on signatures efficiently? Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 7 / 18
9 Efficient Query Processing High computational time complexity of the EMD Bottleneck! How to perform query processing with the EMD on signatures efficiently? Approach: Filter: Utilize reduced signatures via some heuristics Lower-bounding filter distance functions on reduced signatures Filter-and-refine architecture Completeness (no false dismissal) lower bound property Selectivity a small candidate set Efficiency database query object filter candidates EMD result database Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 7 / 18
10 Efficient Query Processing High computational time complexity of the EMD Bottleneck! How to perform query processing with the EMD on signatures efficiently? Approach: Filter: Utilize reduced signatures via some heuristics Lower-bounding filter distance functions on reduced signatures Filter-and-refine architecture Completeness (no false dismissal) lower bound property Selectivity a small candidate set Efficiency database query object filter candidates EMD result database query object filter candidates EMD result Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 7 / 18
11 Reduced Signature Definition (Reduced Signature) Let X, X r S 0 be two feature signatures. X r is a reduced feature signature with respect to X if it holds: x F : X(x) X r (x). Dimensionality reduction is a special case of signature reduction (a) (b) (a) EMD flow between Beforetwo dimred: signatures Emd:,2 EMD(X,Y Emd:,71 )=.2 tested tested with java with java (b) Removing a representative and EMD flow EMD(X r,y )=.71 Discarding representatives in a signature leads to a higher EMD value Completeness not preserved! Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 8 / 18
12 λ-im-sig Lower Bound Definition (λ-independent Minimization for Signatures) Let (F, δ) be a feature space with a distance function δ, X, Y S 0 be feature signatures, and λ R +. λ-im-sig : S 0 S 0 R 0 between X and Y is defined as: subject to λ-im-sig(x, Y ) = min f F 1 f (x, y) δ(x, y) λ x F y F non-negativity constraint: x, y F : f (x, y) 0, source constraint: x F : f (x, y) X(x), and λ-im-sig y F target constraint: x, y F : f (x, y) Y (y), and total flow constraint: f (x, y) = min{ X(x), Y (y)} x F y F x F y F Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 9 / 18
13 λ-im-sig Lower Bound Constraint relaxation with respect to the target constraint Greater solution space than that for the EMD Adaptable independent normalization factor λ The optimal factor λ = min{ X(x), Y (y)} (shown in the paper) x F y F Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 10 / 18
14 λ-im-sig Lower Bound λ-im-sig lower-bounds EMD on reduced signatures Given feature signatures X, Y S 0 with total weights m X = X(x) Y (y) = m Y, a reduced feature signature X r S 0 with x F y F respect to X, and λ R + with λ = m X, it holds: λ-im-sig(x r, Y ) EMD(X, Y ). (Proved in the paper) λ-emd can be defined in a similar way As experiments will show later, λ-im-sig lower bound is more efficient than λ-emd Different signature reduction heuristics possible, such as earth-based (ER) or centroid-based (CR) dimensionality reduction heuristics Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 11 / 18
15 Efficiency vs. Dimensionality: λ-im-sig Real world data: ImageNet [1] ,000 data objects ; 100 nn 100,000 data objects ; 100 nn query time [sec] nr. of EMD computations λ IM Sig CR 50% λ IM Sig CR 70% λ IM Sig CR 90% λ IM Sig ER 50% λ IM Sig ER 70% λ IM Sig ER 90% signature size (a) signature size (b) [1] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09, p , Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 12 / 18
16 Efficiency vs. Dimensionality (Imagenet) Real world data: ImageNet [1] Imagenet all :: Save two images as pdfs for subfigure OPT1 VALID 100K Existing lower bounds on feature signatures: Rubner [2] and IM-Sig [] query time [sec] Imagenet ; 100,000 data objects ; 100 nn signature size λ EMD ER 90% λ IM Sig CR 90% λ IM Sig CR 90% and IM Sig λ IM Sig and λ EMD Sig CR 90% Rubner and λ EMD ER 90% Rubner and λ IM Sig CR 90% IM Sig Rubner [1] J. Deng et al. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09, p , [2] Y. Rubner et al. The earth mover s distance as a metric for image retrieval. Int. Journal of Computer Vision, 40(2):99-121, number of EMD computations Imagenet ; 100, λ EMD ER 90% λ IM Sig CR 90% and IM Rubner and λ EMD ER 9 IM Sig [] M.S. Uysal et al. Efficient Filter Approximation Using the EMD in Very Large Multimedia Databases with Feature Signatures. In CIKM, p , Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 1 / 18
17 VALID 100K Preliminaries Reduced Signatures Filter Approximations Experimental Evaluation Conclusion Efficiency vs. Dimensionality (Imagenet) query time [sec] Imagenet ; 100,000 data objects ; 100 nn signature size λ EMD ER 90% λ IM Sig CR 90% λ IM Sig CR 90% and IM Sig λ IM Sig and λ EMD Sig CR 90% Rubner and λ EMD ER 90% Rubner and λ IM Sig CR 90% IM Sig Rubner number of EMD computations Imagenet ; 100, λ EMD ER 90% λ IM Sig CR 90% and IM Rubner and λ EMD ER 9 IM Sig Less query time than for the competitive methods The combination of Rubner and λ-im-sig outperforms other combinations or existing methods regarding efficiency and signature size Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 14 / 18
18 Selectivity vs. Dimensionality (Imagenet) ta objects ; 100 nn ure size λ IM Sig CR 90% λ IM Sig and λ EMD Sig CR 90% Rubner and λ IM Sig CR 90% Rubner number of EMD computations Imagenet ; 100,000 data objects ; 100 nn signature size λ EMD ER 90% λ IM Sig CR 90% λ IM Sig CR 90% and IM Sig λ IM Sig and λ EMD Sig CR 90% Rubner and λ EMD ER 90% Rubner and λ IM Sig CR 90% IM Sig Rubner Smaller candidate set than for the existing methods The combination of Rubner and λ-im-sig shows better selectivity than for existing methods w.r.t signature size Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 15 / 18
19 Ukbench all :: Save two images as pdfs for subfigure OPT1 Efficiency vs. Dimensionality VALID ER and CR are(ukbench) changed Real world data: UKBench [4] query time [sec] Ukbench ; 10,200 data objects ; 100 nn signature size λ EMD ER 90% λ IM Sig CR 90% λ IM Sig CR 90% and IM Sig λ IM Sig and λ EMD ER 90% Rubner and λ EMD ER 90% Rubner and λ IM Sig CR 90% IM Sig Rubner number of EMD computations Ukbench ; 10, λ EMD ER 90% λ IM Sig CR 90% and Rubner and λ EMD ER IM Sig Less query time than for the competitive methods The combination of Rubner and λ-im-sig outperforms other combinations or existing methods regarding efficiency and signature size [4] D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In CVPR, pages , Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 16 / 18
20 wo images as pdfs for subfigure OPT1 changed Selectivity vs. Dimensionality (UKBench) objects ; 100 nn Real world data: UKBench [4] e size λ IM Sig CR 90% λ IM Sig and λ EMD ER 90% Rubner and λ IM Sig CR 90% Rubner number of EMD computations Ukbench ; 10,200 data objects ; 100 nn signature size λ EMD ER 90% λ IM Sig CR 90% λ IM Sig CR 90% and IM Sig λ IM Sig and λ EMD ER 90% Rubner and λ EMD ER 90% Rubner and λ IM Sig CR 90% IM Sig Rubner Smaller candidate set than for the existing methods The combination of Rubner and λ-im-sig shows better selectivity than for existing methods w.r.t signature size [4] D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In CVPR, pages , Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 17 / 18
21 Conclusion and Outlook Feature signature as feature representation model for scientific data High computational time complexity of the Earth Mover s Distance Novel lower-bounding filter approximations λ-im-sig and λ-emd High efficiency and selectivity providing time cost reduction How to extend this work for other domains? e.g. probabilistic data, uncertain data Further minimization of the candidate set Investigation of the constraint relaxation Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 18 / 18
22 Conclusion and Outlook Feature signature as feature representation model for scientific data High computational time complexity of the Earth Mover s Distance Novel lower-bounding filter approximations λ-im-sig and λ-emd High efficiency and selectivity providing time cost reduction How to extend this work for other domains? e.g. probabilistic data, uncertain data Further minimization of the candidate set Investigation of the constraint relaxation Thank you for your attention! Any questions? Merih Seran Uysal et al. Efficient Similarity Search in Scientific Databases with Feature Signatures 18 / 18
Efficient Processing of Multiple DTW Queries in Time Series Databases
Efficient Processing of Multiple DTW Queries in Time Series Databases Hardy Kremer 1 Stephan Günnemann 1 Anca-Maria Ivanescu 1 Ira Assent 2 Thomas Seidl 1 1 RWTH Aachen University, Germany 2 Aarhus University,
More informationAutomatic Ranking of Images on the Web
Automatic Ranking of Images on the Web HangHang Zhang Electrical Engineering Department Stanford University hhzhang@stanford.edu Zixuan Wang Electrical Engineering Department Stanford University zxwang@stanford.edu
More informationMetric Learning for Large Scale Image Classification:
Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Thomas Mensink 1,2 Jakob Verbeek 2 Florent Perronnin 1 Gabriela Csurka 1 1 TVPA - Xerox Research Centre
More informationEarth Mover s Distance and The Applications
Earth Mover s Distance and The Applications Hu Ding Computer Science and Engineering, Michigan State University The Motivations It is easy to compare two single objects: the pairwise distance. For example:
More informationLarge-scale visual recognition Efficient matching
Large-scale visual recognition Efficient matching Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline!! Preliminary!! Locality Sensitive Hashing: the two modes!! Hashing!! Embedding!!
More informationVisual Material Traits Recognizing Per-Pixel Material Context
Visual Material Traits Recognizing Per-Pixel Material Context Gabriel Schwartz gbs25@drexel.edu Ko Nishino kon@drexel.edu 1 / 22 2 / 22 Materials as Visual Context What tells us this road is unsafe? 3
More informationFast and Robust Earth Mover s Distances
Fast and Robust Earth Mover s Distances Ofir Pele and Michael Werman School of Computer Science and Engineering The Hebrew University of Jerusalem {ofirpele,werman}@cs.huji.ac.il Abstract We present a
More informationMetric Learning for Large-Scale Image Classification:
Metric Learning for Large-Scale Image Classification: Generalizing to New Classes at Near-Zero Cost Florent Perronnin 1 work published at ECCV 2012 with: Thomas Mensink 1,2 Jakob Verbeek 2 Gabriela Csurka
More informationFACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE. Project Plan
FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE Project Plan Structured Object Recognition for Content Based Image Retrieval Supervisors: Dr. Antonio Robles Kelly Dr. Jun
More informationSupplementary Material for Ensemble Diffusion for Retrieval
Supplementary Material for Ensemble Diffusion for Retrieval Song Bai 1, Zhichao Zhou 1, Jingdong Wang, Xiang Bai 1, Longin Jan Latecki 3, Qi Tian 4 1 Huazhong University of Science and Technology, Microsoft
More informationDISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL. Dmitry Kinoshenko, Vladimir Mashtalir, Elena Yegorova
International Book Series "Information Science and Computing" 29 DISTANCE MATRIX APPROACH TO CONTENT IMAGE RETRIEVAL Dmitry Kinoshenko, Vladimir Mashtalir, Elena Yegorova Abstract: As the volume of image
More informationTRACKING OF MULTIPLE SOCCER PLAYERS USING A 3D PARTICLE FILTER BASED ON DETECTOR CONFIDENCE
Advances in Computer Science and Engineering Volume 6, Number 1, 2011, Pages 93-104 Published Online: February 22, 2011 This paper is available online at http://pphmj.com/journals/acse.htm 2011 Pushpa
More informationTree Based Index (TBI) System. Getting Started with TBI
Tree Based Index (TBI) System Getting Started with TBI Jia Xu 1 Zhenjie Zhang 2 Anthony K. H. Tung 2 Ge Yu 1 1 {xujia,yuge}@ise.neu.edu.cn 2 {zhenjie,atung}@comp.nus.edu.sg May 5, 2010 1 System Introduction
More informationA Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods
A Study on Reverse Top-K Queries Using Monochromatic and Bichromatic Methods S.Anusuya 1, M.Balaganesh 2 P.G. Student, Department of Computer Science and Engineering, Sembodai Rukmani Varatharajan Engineering
More informationRobot localization method based on visual features and their geometric relationship
, pp.46-50 http://dx.doi.org/10.14257/astl.2015.85.11 Robot localization method based on visual features and their geometric relationship Sangyun Lee 1, Changkyung Eem 2, and Hyunki Hong 3 1 Department
More informationPart Localization by Exploiting Deep Convolutional Networks
Part Localization by Exploiting Deep Convolutional Networks Marcel Simon, Erik Rodner, and Joachim Denzler Computer Vision Group, Friedrich Schiller University of Jena, Germany www.inf-cv.uni-jena.de Abstract.
More informationObject and Action Detection from a Single Example
Object and Action Detection from a Single Example Peyman Milanfar* EE Department University of California, Santa Cruz *Joint work with Hae Jong Seo AFOSR Program Review, June 4-5, 29 Take a look at this:
More informationEfficient Re-ranking in Vocabulary Tree based Image Retrieval
Efficient Re-ranking in Vocabulary Tree based Image Retrieval Xiaoyu Wang 2, Ming Yang 1, Kai Yu 1 1 NEC Laboratories America, Inc. 2 Dept. of ECE, Univ. of Missouri Cupertino, CA 95014 Columbia, MO 65211
More informationProUD: Probabilistic Ranking in Uncertain Databases
Proc. 20th Int. Conf. on Scientific and Statistical Database Management (SSDBM'08), Hong Kong, China, 2008. ProUD: Probabilistic Ranking in Uncertain Databases Thomas Bernecker, Hans-Peter Kriegel, Matthias
More informationA Fast Distance Between Histograms
Fast Distance Between Histograms Francesc Serratosa 1 and lberto Sanfeliu 2 1 Universitat Rovira I Virgili, Dept. d Enginyeria Informàtica i Matemàtiques, Spain francesc.serratosa@.urv.net 2 Universitat
More informationColor-sketch simulator: a guide for color-based visual known-item search
Color-sketch simulator: a guide for color-based visual known-item search Jakub Lokoč 1, Anh Nguyen Phuong 3, Marta Vomlelová 2, Chong-Wah Ngo 3 1 SIRET research group, Department of Software Engineering
More informationFADA: An Efficient Dimension Reduction Scheme for Image Classification
Best Paper Candidate in Retrieval rack, Pacific-rim Conference on Multimedia, December 11-14, 7, Hong Kong. FADA: An Efficient Dimension Reduction Scheme for Image Classification Yijuan Lu 1, Jingsheng
More informationTrusted Routing in IoT
Trusted Routing in IoT Dr Ivana Tomić In collaborat ion w it h: Re s e arc h As s oc iate Prof. Julie A. McC ann and Im perial C ollege London AESE group Em ail: i.tom ic@ im perial.ac.uk Outline q Sensors
More informationLarge scale object/scene recognition
Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database
More informationSimilarity Estimation Techniques from Rounding Algorithms. Moses Charikar Princeton University
Similarity Estimation Techniques from Rounding Algorithms Moses Charikar Princeton University 1 Compact sketches for estimating similarity Collection of objects, e.g. mathematical representation of documents,
More informationPattern Recognition Lecture Sequential Clustering
Pattern Recognition Lecture Prof. Dr. Marcin Grzegorzek Research Group for Pattern Recognition Institute for Vision and Graphics University of Siegen, Germany Pattern Recognition Chain patterns sensor
More informationAlgorithms for Nearest Neighbors
Algorithms for Nearest Neighbors Classic Ideas, New Ideas Yury Lifshits Steklov Institute of Mathematics at St.Petersburg http://logic.pdmi.ras.ru/~yura University of Toronto, July 2007 1 / 39 Outline
More informationA Scalable QoS Device for Broadband Access to Multimedia Services
University of Duisburg-Essen, Institute for Experimental Mathematics A Scalable QoS Device for Broadband Access to Multimedia Services Dr. University of Duisburg-Essen, Germany dreibh@iem.uni-due.de http://www.iem.uni-due.de/~dreibh
More informationDynamic Skyline Queries in Metric Spaces
Dynamic Skyline Queries in Metric Spaces Lei Chen and Xiang Lian Department of Computer Science and Engineering Hong Kong University of Science and Technology Clear Water Bay, Kowloon Hong Kong, China
More informationOutsourcing Privacy-Preserving Social Networks to a Cloud
IEEE INFOCOM 2013, April 14-19, Turin, Italy Outsourcing Privacy-Preserving Social Networks to a Cloud Guojun Wang a, Qin Liu a, Feng Li c, Shuhui Yang d, and Jie Wu b a Central South University, China
More informationVideo-based Face Recognition Using Earth Mover s Distance
Video-based Face Recognition Using Earth Mover s Distance Jiangwei Li 1, Yunhong Wang 1,2, and Tieniu Tan 1 1 National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences,
More informationLecture 12 Recognition
Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza http://rpg.ifi.uzh.ch/ 1 Lab exercise today replaced by Deep Learning Tutorial by Antonio Loquercio Room
More informationRapid k-d tree construction for sparse volume data
20th EG/VGTC Conference on Visualization Rapid k-d tree construction for sparse volume data Stefan Zellmann*, Jürgen Schulze**, Ulrich Lang* * University of Cologne ** University of California San Diego
More informationRandom projection for non-gaussian mixture models
Random projection for non-gaussian mixture models Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92037 gyozo@cs.ucsd.edu Abstract Recently,
More informationFaster Cover Trees. Mike Izbicki and Christian R. Shelton UC Riverside. Izbicki and Shelton (UC Riverside) Faster Cover Trees July 7, / 21
Faster Cover Trees Mike Izbicki and Christian R. Shelton UC Riverside Izbicki and Shelton (UC Riverside) Faster Cover Trees July 7, 2015 1 / 21 Outline Why care about faster cover trees? Making cover trees
More informationNOVEL APPROACH FOR COMPARING SIMILARITY VECTORS IN IMAGE RETRIEVAL
NOVEL APPROACH FOR COMPARING SIMILARITY VECTORS IN IMAGE RETRIEVAL Abstract In this paper we will present new results in image database retrieval, which is a developing field with growing interest. In
More informationFaster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun Presented by Tushar Bansal Objective 1. Get bounding box for all objects
More informationMixtures of Gaussians and Advanced Feature Encoding
Mixtures of Gaussians and Advanced Feature Encoding Computer Vision Ali Borji UWM Many slides from James Hayes, Derek Hoiem, Florent Perronnin, and Hervé Why do good recognition systems go bad? E.g. Why
More informationProgramming Languages
CSE 230: Winter 2008 Principles of Programming Languages Ocaml/HW #3 Q-A Session Push deadline = Mar 10 Session Mon 3pm? Lecture 15: Type Systems Ranjit Jhala UC San Diego Why Typed Languages? Development
More informationA novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems
A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics
More informationEnsemble of Bayesian Filters for Loop Closure Detection
Ensemble of Bayesian Filters for Loop Closure Detection Mohammad Omar Salameh, Azizi Abdullah, Shahnorbanun Sahran Pattern Recognition Research Group Center for Artificial Intelligence Faculty of Information
More informationsflow: Towards Resource-Efficient and Agile Service Federation in Service Overlay Networks
sflow: Towards Resource-Efficient and Agile Service Federation in Service Overlay Networks Mea Wang, Baochun Li, Zongpeng Li Department of Electrical and Computer Engineering University of Toronto {mea,
More informationSHOT-BASED OBJECT RETRIEVAL FROM VIDEO WITH COMPRESSED FISHER VECTORS. Luca Bertinetto, Attilio Fiandrotti, Enrico Magli
SHOT-BASED OBJECT RETRIEVAL FROM VIDEO WITH COMPRESSED FISHER VECTORS Luca Bertinetto, Attilio Fiandrotti, Enrico Magli Dipartimento di Elettronica e Telecomunicazioni, Politecnico di Torino (Italy) ABSTRACT
More informationGeneralizing the Optimality of Multi-Step k-nearest Neighbor Query Processing
Generalizing the Optimality of Multi-Step k-nearest Neighbor Query Processing SSTD 2007 Boston, U.S.A. Hans-Peter Kriegel, Peer Kröger, Peter Kunath, Matthias Renz Institute for Computer Science University
More informationPivoting M-tree: A Metric Access Method for Efficient Similarity Search
Pivoting M-tree: A Metric Access Method for Efficient Similarity Search Tomáš Skopal Department of Computer Science, VŠB Technical University of Ostrava, tř. 17. listopadu 15, Ostrava, Czech Republic tomas.skopal@vsb.cz
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationFast Indexing and Search. Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation
Fast Indexing and Search Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation Motivation Object categorization? http://www.cs.utexas.edu/~grauman/slides/jain_et_al_cvpr2008.ppt Motivation
More informationarxiv: v2 [cs.cv] 25 Aug 2014
ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v2 [cs.cv] 25 Aug 2014 Joachim Denzler Computer Vision Group Friedrich
More informationCompact Data Representations and their Applications. Moses Charikar Princeton University
Compact Data Representations and their Applications Moses Charikar Princeton University Lots and lots of data AT&T Information about who calls whom What information can be got from this data? Network router
More informationContent Based Video Retrieval
Content Based Video Retrieval Jyoti Kashyap 1, Gaurav Punjabi 2, Kshitij Chhatwani 3 1Guide, Department of Electronics and Telecommunication, Thadomal Shahani Engineering College, Bandra, Mumbai, INDIA.
More informationHOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery
HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery Ninh D. Pham, Quang Loc Le, Tran Khanh Dang Faculty of Computer Science and Engineering, HCM University of Technology,
More informationCompressed local descriptors for fast image and video search in large databases
Compressed local descriptors for fast image and video search in large databases Matthijs Douze2 joint work with Hervé Jégou1, Cordelia Schmid2 and Patrick Pérez3 1: INRIA Rennes, TEXMEX team, France 2:
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationarxiv: v1 [cs.ir] 20 Nov 2017
Linear-Complexity Relaxed Word Mover s Distance with GPU Acceleration Kubilay Atasu, Thomas Parnell, Celestine Dünner, Manolis Sifalakis, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos, Cesar
More informationThe Caltech-UCSD Birds Dataset
The Caltech-UCSD Birds-200-2011 Dataset Catherine Wah 1, Steve Branson 1, Peter Welinder 2, Pietro Perona 2, Serge Belongie 1 1 University of California, San Diego 2 California Institute of Technology
More informationSIR C R REDDY COLLEGE OF ENGINEERING
SIR C R REDDY COLLEGE OF ENGINEERING DEPARTMENT OF INFORMATION TECHNOLOGY Course Outcomes II YEAR 1 st SEMESTER Subject: Data Structures (CSE 2.1.1) 1. Describe how arrays, records, linked structures,
More informationVocabulary tree. Vocabulary tree supports very efficient retrieval. It only cares about the distance between a query feature and each node.
Vocabulary tree Vocabulary tree Recogni1on can scale to very large databases using the Vocabulary Tree indexing approach [Nistér and Stewénius, CVPR 2006]. Vocabulary Tree performs instance object recogni1on.
More informationTHIS paper considers the task of accurate image retrieval
JOURNAL OF L A TEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 1 Coarse2Fine: Two-layer Fusion for Image Retrieval le Dong, Gaipeng Kong, Wenpu Dong, Liang Zheng and Qi Tian arxiv:1607.00719v1 [cs.mm] 4 Jul
More informationImage Analysis & Retrieval. CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W Lec 18.
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 18 Image Hashing Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph:
More informationCombining Selective Search Segmentation and Random Forest for Image Classification
Combining Selective Search Segmentation and Random Forest for Image Classification Gediminas Bertasius November 24, 2013 1 Problem Statement Random Forest algorithm have been successfully used in many
More informationAdaptive Gesture Recognition System Integrating Multiple Inputs
Adaptive Gesture Recognition System Integrating Multiple Inputs Master Thesis - Colloquium Tobias Staron University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Technical Aspects
More informationEffective Tour Searching for Large TSP Instances. Gerold Jäger
Effective Tour Searching for Large TSP Instances Gerold Jäger Martin-Luther-University Halle-Wittenberg (Germany) joint work with Changxing Dong, Paul Molitor, Dirk Richter German Research Foundation Grant
More informationHierarchical GEMINI - Hierarchical linear subspace indexing method
Hierarchical GEMINI - Hierarchical linear subspace indexing method GEneric Multimedia INdexIng DB in feature space Range Query Linear subspace sequence method DB in subspace Generic constraints Computing
More informationLecture 12 Recognition
Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza 1 Lab exercise today replaced by Deep Learning Tutorial Room ETH HG E 1.1 from 13:15 to 15:00 Optional lab
More informationarxiv: v1 [cs.cv] 10 Jul 2014
ARTOS Adaptive Real-Time Object Detection System Björn Barz Erik Rodner bjoern.barz@uni-jena.de erik.rodner@uni-jena.de arxiv:1407.2721v1 [cs.cv] 10 Jul 2014 Joachim Denzler Computer Vision Group Friedrich
More informationComputer Vision 2 Lecture 8
Computer Vision 2 Lecture 8 Multi-Object Tracking (30.05.2016) leibe@vision.rwth-aachen.de, stueckler@vision.rwth-aachen.de RWTH Aachen University, Computer Vision Group http://www.vision.rwth-aachen.de
More informationSyllabus. 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures
Syllabus 1. Visual classification Intro 2. SVM 3. Datasets and evaluation 4. Shallow / Deep architectures Image classification How to define a category? Bicycle Paintings with women Portraits Concepts,
More informationCLUSTERING. CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16
CLUSTERING CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16 1. K-medoids: REFERENCES https://www.coursera.org/learn/cluster-analysis/lecture/nj0sb/3-4-the-k-medoids-clustering-method https://anuradhasrinivas.files.wordpress.com/2013/04/lesson8-clustering.pdf
More informationSearching non-text information objects
Non-text digital objects Searching non-text information objects Music Speech Images 3D models Video? 1 2 Ways to query for something Query by describing content 1. Query by category/ theme easiest - work
More informationLandmark-Based Sparse Color Representations for Color Transfer
Landmark-Based Sparse Color Representations for Color Transfer Tzu-Wei Huang and Hwann-Tzong Chen Department of Computer Science, National Tsing Hua University 30013 Hsinchu, Taiwan Abstract We present
More informationDelay-minimal Transmission for Energy Constrained Wireless Communications
Delay-minimal Transmission for Energy Constrained Wireless Communications Jing Yang Sennur Ulukus Department of Electrical and Computer Engineering University of Maryland, College Park, M0742 yangjing@umd.edu
More informationMELODY-JOIN: Efficient Earth Mover s Distance Similarity Joins Using MapReduce
-JOIN: Efficient Earth Mover s Distance Similarity Joins Using MapReduce Jin Huang, Rui Zhang, Rajkumar Buyya 3, Jian Chen 4 Department of Computing and Information Systems, University of Melbourne, Victoria,
More informationDiffusion Distance for Histogram Comparison
Diffusion Distance for Histogram Comparison Haibin Ling Center for Automation Research, Computer Science Department, University of Maryland College Park, MD, 20770, USA hbling@umiacs.umd.edu Kazunori Okada
More informationAlgorithms for Nearest Neighbors
Algorithms for Nearest Neighbors State-of-the-Art Yury Lifshits Steklov Institute of Mathematics at St.Petersburg Yandex Tech Seminar, April 2007 1 / 28 Outline 1 Problem Statement Applications Data Models
More informationEfficient Aggregation for Graph Summarization
Efficient Aggregation for Graph Summarization Yuanyuan Tian (University of Michigan) Richard A. Hankins (Nokia Research Center) Jignesh M. Patel (University of Michigan) Motivation Graphs are everywhere
More informationTour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers
Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers James P. Biagioni Piotr M. Szczurek Peter C. Nelson, Ph.D. Abolfazl Mohammadian, Ph.D. Agenda Background
More informationMulti-scale Techniques for Document Page Segmentation
Multi-scale Techniques for Document Page Segmentation Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition (CEDAR), State University of New York at Buffalo, Amherst
More informationVideo shot segmentation using late fusion technique
Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,
More informationA FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS. Kuan-Chuan Peng and Tsuhan Chen
A FRAMEWORK OF EXTRACTING MULTI-SCALE FEATURES USING MULTIPLE CONVOLUTIONAL NEURAL NETWORKS Kuan-Chuan Peng and Tsuhan Chen School of Electrical and Computer Engineering, Cornell University, Ithaca, NY
More informationSpatial color image retrieval without segmentation using thumbnails and the Earth Mover s Distance
1 Spatial color image retrieval without segmentation using thumbnails and the Earth Mover s Distance Thomas Hurtut, Haroldo Dalazoana, Yann Gousseau, Francis Schmitt CGIV 2006 Leeds 20 June 2006 Presentation
More informationSYMBOLIC FEATURES IN NEURAL NETWORKS
SYMBOLIC FEATURES IN NEURAL NETWORKS Włodzisław Duch, Karol Grudziński and Grzegorz Stawski 1 Department of Computer Methods, Nicolaus Copernicus University ul. Grudziadzka 5, 87-100 Toruń, Poland Abstract:
More informationQuery Processing and Alternative Search Structures. Indexing common words
Query Processing and Alternative Search Structures CS 510 Winter 2007 1 Indexing common words What is the indexing overhead for a common term? I.e., does leaving out stopwords help? Consider a word such
More informationIntroduction to Similarity Search in Multimedia Databases
Introduction to Similarity Search in Multimedia Databases Tomáš Skopal Charles University in Prague Faculty of Mathematics and Phycics SIRET research group http://siret.ms.mff.cuni.cz March 23 rd 2011,
More informationAC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery
: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Hong Cheng Philip S. Yu Jiawei Han University of Illinois at Urbana-Champaign IBM T. J. Watson Research Center {hcheng3, hanj}@cs.uiuc.edu,
More informationColorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.
Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Object Recognition in Large Databases Some material for these slides comes from www.cs.utexas.edu/~grauman/courses/spring2011/slides/lecture18_index.pptx
More informationExCuSe A Method for the Model-Based Safety Assessment of Simulink and Stateflow Models
ExCuSe A Method for the Model-Based Safety Assessment of Simulink and Stateflow Models MATLAB Expo 2018 2018-06-26 München Julian Rhein 1 Outline Introduction Property Proving Application to Safety Assessment
More informationdoc. RNDr. Tomáš Skopal, Ph.D. Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague
Praha & EU: Investujeme do vaší budoucnosti Evropský sociální fond course: Searching the Web and Multimedia Databases (BI-VWM) Tomáš Skopal, 2011 SS2010/11 doc. RNDr. Tomáš Skopal, Ph.D. Department of
More informationK-Means Based Matching Algorithm for Multi-Resolution Feature Descriptors
K-Means Based Matching Algorithm for Multi-Resolution Feature Descriptors Shao-Tzu Huang, Chen-Chien Hsu, Wei-Yen Wang International Science Index, Electrical and Computer Engineering waset.org/publication/0007607
More informationCPSC 695. Geometric Algorithms in Biometrics. Dr. Marina L. Gavrilova
CPSC 695 Geometric Algorithms in Biometrics Dr. Marina L. Gavrilova Biometric goals Verify users Identify users Synthesis - recently Biometric identifiers Courtesy of Bromba GmbH Classification of identifiers
More informationHierarchical Local Maps for Robust Approximate Nearest Neighbor Computation
Hierarchical Local Maps for Robust Approximate Nearest Neighbor Computation Pratyush Bhatt and Anoop Namboodiri Center for Visual Information Technology, IIIT, Hyderabad, 500032, India {bhatt@research.,
More informationModel-based Database Systems
Model-based Database Systems Kasun S Perera Research Progress Report Supervisors TU Dresden Prof. Wolfgang Lehner Aalborg University Prof. Torben Bach Pedersen Introduction & Motivation DEFINITION 1: MODEL
More informationIMAGE MATCHING - ALOK TALEKAR - SAIRAM SUNDARESAN 11/23/2010 1
IMAGE MATCHING - ALOK TALEKAR - SAIRAM SUNDARESAN 11/23/2010 1 : Presentation structure : 1. Brief overview of talk 2. What does Object Recognition involve? 3. The Recognition Problem 4. Mathematical background:
More informationMultimedia Data Management M
ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Multimedia Data Management M Second cycle degree programme (LM) in Computer Engineering University of Bologna Complex Retrieval Models and MM DBMS Home page:
More informationA Pivot-based Index Structure for Combination of Feature Vectors
A Pivot-based Index Structure for Combination of Feature Vectors Benjamin Bustos Daniel Keim Tobias Schreck Department of Computer and Information Science, University of Konstanz Universitätstr. 10 Box
More informationMeta-Clustering. Parasaran Raman PhD Candidate School of Computing
Meta-Clustering Parasaran Raman PhD Candidate School of Computing What is Clustering? Goal: Group similar items together Unsupervised No labeling effort Popular choice for large-scale exploratory data
More informationShow, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks
Show, Discriminate, and Tell: A Discriminatory Image Captioning Model with Deep Neural Networks Zelun Luo Department of Computer Science Stanford University zelunluo@stanford.edu Te-Lin Wu Department of
More informationAlgorithms design under a geometric lens Spring 2014, CSE, OSU Lecture 1: Introduction
5339 - Algorithms design under a geometric lens Spring 2014, CSE, OSU Lecture 1: Introduction Instructor: Anastasios Sidiropoulos January 8, 2014 Geometry & algorithms Geometry in algorithm design Computational
More informationDynamic Collision Detection
Distance Computation Between Non-Convex Polyhedra June 17, 2002 Applications Dynamic Collision Detection Applications Dynamic Collision Detection Evaluating Safety Tolerances Applications Dynamic Collision
More informationOutline A Hierarchical P2P Architecture and an Efficient Flooding Algorithm
University of British Columbia Cpsc 527 Advanced Computer Communications Lecture 9b Hierarchical P2P Architecture and Efficient Multicasting (Juan Li s MSc Thesis) Instructor: Dr. Son Vuong The World Connected
More informationTexture. COS 429 Princeton University
Texture COS 429 Princeton University Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture What is a texture? Antonio Torralba Texture Texture is stochastic and
More information