Comparisson of Fuzzy C Means, K-Means and K-Medoids for Clustering in the Bag Of Visual Words Algorithm
|
|
- Joshua Payne
- 5 years ago
- Views:
Transcription
1 Comparisson of Fuzzy C Means, K-Means and K-Medoids for Clustering in the Bag Of Visual Words Algorithm Daniel Fernando Tello Gamarra Control and Automation Engineering Universidade Federal de Santa Maria (UFSM) Santa Maria, Rio Grande do Sul, Brazil, CEP daniel.gamarra@ufsm.br Marco Antonio De Souza Leite Cuadros Instituto Federal do Espirito Santo (IFES) Serra, Espirito Santo, Brazil, CEP marcoantonio@ifes.edu.br Abstract This paper makes a comparison of the performance of three different clustering techniques in the framework of the bag of visual words algorithm. The bag of visual words is a state of the art technique for image classification, one of the key features of the algorithm is the construction of a Codebook with clustering techniques. Different simulations will be run in an object database using the algorithms fuzzy c means, k-medois and k-means in order to evaluate their influence in the Bag of visual words algorithm when it is used for object classification. I. INTRODUCTION Human beings have an outstanding capacity of recognizing objects, and establish differences among objects that belong to the same class. If machines could develop or mimic in some way human capacity for object recognition with a good level of accuracy, this fact could pave the way for many applications in which machines could enhance their performance to interact with its environment in a more successful way. One of the algorithms that is state of the art in object recognition is the algorithm of Bag of Visual Words (BOV) initially proposed in [1] and inspired in the application of the algorithm of Bag of Words (BoW) created for text search applications. An essential step of the algorithm is the creation of a codebook with clustering techniques, being the k-means the most frequently used technique for this purpose. Application of other clustering algorithms for the BOV have been also explored in previous works as in the work of Dell Agnello in [2] that studied the application of the Fuzzy C Means (FCM) algorithm and the Kernel Fuzzy C Means (KFCM) algorithm for the construction of the visual codebooks, the results shown in the paper pointed out that the KFCM algorithm has a better performance than the FCM, but the computational cost is higher. Another application of the use of the FCM algorithm is proposed by Sujatha in [3] that implements the BOW technique using a multiple dictionary bag of words, the multiple dictionary information is concatenated in the process of image classification, the dictionaries are created usign the Fuzzy C Means algorithm. Zhou in [4] proposed a structured SOM (Self Organizing Maps) and a Spatial Constrained Fuzzy C Means with hierarchical technique to improve the bag-of-visual words model profiting the spatial information of the data for the clustering. Also Sujatha in [5] explores the application of the K-means and K-medoids clustering techniques for the bag of visual words. This paper is focused on the evaluation of three clustering algorithms for the codebook construction of an image dataset for the BOV technique. The remainder of the paper is as follows. In the second section are described the algorithms used in this work; the third section shows the experimental setup architecture using the Bag of Visual words algorithm; in the fourth section the experiments executed in the paper are described; finally, conclusions are drawn in the last section. II. A. Bag of Visual Words THEORETICAL BACKGROUND The Bag of Visual Words algorithm initially is derived from the algorithm Bag of Words (BoW) algorithm is a successful technique applied in text retrieval. The technique could be sumarized in five steps according to [5]: (i) feature extraction of the objects, (ii) feature pool, (iii) codebook generation from the extracted features using clustering techniques, (iv) training, and (v) classification using techniques as Support Vector Machines (SVM) or K-Nearest Neighbor (KNN) for object recognition. In the first step, as stated in [6] traditionally, the BOV approach models the distribution of low-level local features using the scale-invariant features transformation (SIFT) [7], which computes the orientation and gradient of the keypoints in gray-level information. The algorithm of k-means is used to cluster the feature detectors set in k clusters, the center of every cluster is named as a visual word and the set of these visual words generates the codebook. Based on the visual words, as stated in [6], an histogram is generated by counting their occurrence numbers. Here, such histogram is defined as a BOV representation. B. K-Means The description of the K-means algorithm is based on Bishop s book in [8], that states that, the k-means clustering is applied to identify groups, or clusters, of data points in a multidimensional space. If it is defined a data set {x 1,..., x N }, the goal is to partition the data into some number of k clusters.
2 If it is used the variable v k as representing the center of the clusters. Every data point x n has a corresponding set of binary indicator variables r nk {0, 1}, so every data point is assigned to a cluster k then r nk = 1, and r nj = 0 for j k. It is then defined an objective function called a distortion measure, given by N K J = r nk x n v k 2 (1) n=1 k=1 The algorithm tries to minimize this cost function. First are chosen randomly from the dataset some initial values for the v k, it is possible to optimize for each n separately by choosing r nk to be 1 for whichever value of k gives the minimum value of x n v k 2. In other words, the nth data point is simply assign to the closest cluster center, formally this is expressed by the equation: r nk = { 1 if k = arg min j x n v j 2 0 otherwise. The clusters means are recomputed with the equation: n v k = r nkx n n r (3) nk The two phases of the algorithm of re-assigning data points to clusters and re-computing the cluster means are repeated until there is no further change in the assignments or a number of iterations is exceeded. C. K-Medoids As stated in [9], in contrast to the k-means algorithm, k- medoids selects datapoints as centers (medoids or exemplars), so a medoid is the most centrally located point in the set. As Shah stated the most commom realization of k-medoid clustering is the Partitioning around Medoids (PAM ) algorithm and is described through the following steps: 1) Initialize: randomly select k of the n data points as the medoids 2) Assignment step: Associate each data point to the closest medoid. 3) Update step: For each medoid m and each data point o associated to m, swap m and compute the total cost of the configuration. Select the medoid o with the lowest cost of the configuration. 4) Repeat Alternating steps 2 and 3 D. Fuzzy C Means (FCM) As mentioned in [10], the popularity and usefulness of fuzzy C means results from three facts. The algorithm is very simple; the algorithm is unsupervised and is always convergent. The fuzzy C means algorithm uses concepts from the field of fuzzy logic and fuzzy set theory. The algorithm can be detailed following the description written in [11] and resumed in [12]. Fuzzy C Means is a very popular algorithm and has different applications in image processing such as segmentation[13].given the data set X, chose the number of clusters c, between 1 < c < N, where N is the number of (2) data objects, the weighting exponent m > 1 the tuning of this parameter can influence the performance of the algorithm, the termination tolerance ε > 0 and the norm inducing matrix A. The shape of the clusters is determined by the choice of matrix A, a common choice is A = I, which gives the standard Euclidean norm as stated in [14]. The membership of the data samples in the clusters is described by the fuzzy partition matrix U that will be initialized randomly. The weight µ ik represents the degree of membership of an object in a cluster, and is contained in the kth column of U. Every object belongs to every cluster in some degree, and each cluster is characterized by its centroid v i, the algorithm tries to minimize a cost function: J F CM = N n=1 k=1 c u m ik x i v k 2 (4) Assuming that the number of determined iterations is l, the first step is to compute the cluster centroids: v l i = N k=1 (µl 1 ik )x k N n=1 (µl 1 ik ), 1 i c (5) The second step consists in computing the distances, it means minimizing the sum square error: d 2 ika = (x k v l i) T A(x k v l i) (6) 1 i c, 1 k N The third step is to update the partition matrix elements: if d ika > 0 for 1 i c, 1 k N µ l ik = 1 c j=1 (d ika/d jka ) ( 2/(m 1)) Otherwise µ l ik = 0 if d ika = 0 and µ l ik [0, 1] with Σ c i=1 µl ik = 1. This procedure will be repeated a determined number of iterations or until the following condition will be reached U l U l 1 < ε meaning the absolute change of the variation of the fuzzy partition matrix is below a given threshold. III. EXPERIMENTAL SETUP The different experiments executed in this paper used the code hands on Advanced Bag-of-Words Models for Visual Recognition that can be downloaded in [15] developed by Ballan and Seidenari, all the code is written in Matlab using commands from the image processing toolbox of matlab and the libvsm library wrapped in C from matlab. The clustering algorithms, Fuzzy C Means and K-Medoids that were used are derived from the matlab toolboxes of clustering and Fuzzy logic, the K-Means algorithm used is the one taken from Ballan algorithm code. The classification step was developed using the algorithm of Support Vector Machines (SVM) as linear classifiers using the libvsm library. (7)
3 The algorithms were tested on a set of figures taken from the Caltech data-set. The Caltech dataset was divided in two subsets, one of 4 images and another subset of 15 images. The first subset is shown in Figure 1 and contains four types of images that are images of faces, motorbikes, cars and airplanes, every type of image has 450 images, i.e. the face type has 450 different faces. The second subset is shown in Figure 2, and has the following types of images: bonsai, butterfly, crab, elephant, euphonium, faces, grandpiano, joshuatree, leopards, lotus, motorbikes, schooner, stopsign, sunflower and watch. These types have different images number that varies between 60 to 240 images approximately. Dense features descriptors DSIFT were used instead of SIFT features detector in order to increase the number of features extracted in the objects of the two sets of images.the number of visual words constructed was 500 visual words, the number of features was features. The number of images selected for training was 30, and the number of images selecting for testing was 50. IV. RESULTS Different experiments were run with the two subsets of the Caltech dataset images, one with the 4-objects subset and another with the 15-objects subset. So, in this section, the algorithm used and the results obtained will be described. Both images subsets used the same clustering algorithms with the same parameters. It will be described briefly the clustering algorithms and the parameters used. The following parameters were used for the clustering algorithms, for the K-Means algorithm algorithm a maximum number of iterations of 50 were defined. The K-Medoids algorithm used an Euclidean distance. For the Fuzzy C Means algorithm, the number of iterations for the algorithm was 20. The weighting exponent (m) was 1.2, the termination tolerance (ε) also named as minimum amount of improvement was set to 1e 5; we have tried different values for m but the performance of the clustering decreased. The number of clusters or visual words is 500, and the number of features is A. subset 4-object The clustering algorithm will create 500 visual words, Figure 3 shows one of these 500 visual words obtained after applying the clustering algorithm. Fig. 1. images taken from the Caltech subset of 4-object. Fig. 3. Visual Word Obtained from the clustering step in the BOV technique. Fig. 2. images taken from the Caltech subset of 15-object. Figure 4 shows the classification accuracy, also known as confusion matrix, of the Bag of Visual Words Algorithm clustered applying the K-Means algorithm in one of the experiments. The confusion matrix is a contingency table where each cell has the number of objects shared by one a priori class and one of the predicted class. From the confusion matrix it can be computed the metrics accuracy. The diagonal of the matrix represents the accuracy of the classifier, in this case a Support Vector Machine (SVM). Figure 5 shows the classification accuracy of the Bag of Visual Words Algorithm clustered applying the K-Medoids algorithm in one of the experiments. Figure 6 shows the
4 Fig. 4. Confusion matrix obtained on the 4 objects Caltech dataset using linear SVM and K-Means algorithm for codebook construction. Fig. 6. Confusion matrix obtained on the 4 objects Caltech dataset using linear SVM FCM algorithm for codebook construction. classification accuracy of the Bag of Visual Words Algorithm clustered applying the FCM algorithm in one of the experiments. Figure 7 shows the performance of the three algorithms and its average accuracy. The referred accuracy is related to the performance of the SVM for every object classification and its average in a normal experiment run of the algorithm with a specific clustering algorithm. The categories 1,2 and 3 in the figure represented by bars are the k-means, FCM and K-Medoids algorithms accuracy respectively. From these categories is possible to conclude that the K-Means shows the best accuracy, followed by the K-Medoids and FCM algorithms.. Fig. 7. Accuracy of the clustering algorithms using SVM in the Caltech 4-objects subset Fig. 5. Confusion matrix obtained on the 4 objects Caltech dataset using linear SVM and K-Medoids algorithm for codebook construction. B. subset 15-object The same experiments were executed in the second subset of 15 images of the Caltech dataset, using this bigger subset was obtained the figure 8, that shows the accuracy of the clustering algorithms. In this figure it is possible to observe that the best performance is also achieved with the K-Means and K-Medoids algorithms that are hard clustering algorithms compared with the FCM algorithm that is considered a soft clustering algorithms method. Also the performance of the accuracy of the algorithm decays compared to a dataset with a minor number of images. Fig. 8. Accuracy of the clustering algorithms using SVM in the Caltech 15-objects subset V. CONCLUSION It has been explored the application of three clustering algorithms in the bag of visual words algorithm for object recognition; the tests realized in an image database divided in two subsets of different images proved that the performance of the K-means algorithm is better compared to the K-Medoids and FCM algorithm. Also, we can notice that the accuracy of the algorithm decreased if the number of objects is augmented
5 in our experiments, but we need to evaluate more datasets and experiments in order to find the limits of the relation of the accuracy and the quantity of objects in our framework. As future works, we are planning to evaluate the performance of the clustering algorithm with different feature extractors methods for the BOW algorithm. REFERENCES [1] Sivic, J. and Zisserman, A. Video Google: A text retrieval approach to object matching in videos, Internantional Conference on Computer Vision (ICCV) IEEE Computer Society., pp , [2] Dell Agnello, D, Carneiro, G., Chin, T., Castellano, G. and Fanelli, A.M. (2013) Fuzzy clustering based encoding for Visual Object Classification, in IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS). (Edmonton-Canada). [3] Sujatha, K.S., Keerthana, P., Suga Priya, S., Kaavya, E. and Vinod, B. (2012) Fuzzy based Multiple Dictionary Bag of Words for Image Classificationg, Procedia Engineering., Vol. 38, pp [4] Zhou, G., Wang, Z., Wang, J. and Feng, D. (2010) Spatial Context for Visual Vocabulary Construction, in International Conference in Image Analysis and Signal Processing (IASP).(Zhejiang-China). [5] Sujatha, K.S., Karthiga, G.M. and Vinod, B. (2012) Evaluation of Bag of Visual Words for Category Level Object Recognition, International Joutnal of Electronics Signals and Systems., Vol. 1, No. 4,pp [6] Xu, S., Fang, T., Li, D. and Wang, S. Object Classification of Aerial Images With Bag-of-Visual Words, IEEE Geoscience and Remote Sensing Letters., Vol. 7, No. 2, pp , [7] Lowe, D.G. (1999) Object Recognition from Local Scale Invarian Features, in Proceedings of the International Conference on Computer Vision, pp (Kerkyra-Greece). [8] Bishop, C.M. Pattern Recognition and Machine Learning, Springer, [9] Shah, S. and Singh,M. (2012) Comparison of a Time Efficient Modified K-mean Algorithm with K-Mean and K-Medoid Algorithm, in International Conference on Communication Systems and Network TEchnologies (CSNT). (Rajkot-India). [10] Zhou, Z. (2010) Comparisson of Four Kinds of Fuzzy C-Means Clustering Methods, in Symposium on Informatics Processing (ISIP), pp (Qingdao-China). [11] Setnes, M. and Babusca, R. (1999) A Fuzzy Relational Classifier Trained by Fuzzy Clustering, IEEE Transactions and Systems, Man, and Cybernetics., Vol. 29, No. 5, pp [12] Gamarra, D.F.T. (2013) Forward Models with Cluster Validity Criteria Applied in Ballistic Reaching for Visual Servoing, International Review on Modelling and Simulations., Vol. 6, pp [13] Gamarra, D.F.T. (2015) Fuzzy Image Segmentation Using Validity Indexes Correlation, International Journal of Computer Science and Information Technology., Vol. 7, pp [14] Jain, N. and Shukla, S. (2012) Fuzzy Databases Using Extended Fuzzy C Means Clustering, Journal of Engineering Research and Applications., Vol. 2, No. 3, pp [15] Ballan, L. and Seidenari, L. (2013) available:
Fuzzy based Multiple Dictionary Bag of Words for Image Classification
Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2196 2206 International Conference on Modeling Optimisation and Computing Fuzzy based Multiple Dictionary Bag of Words for Image
More informationHands on Advanced Bag- of- Words Models for Visual Recogni8on
Hands on Advanced Bag- of- Words Models for Visual Recogni8on Lamberto Ballan and Lorenzo Seidenari MICC - University of Florence - The tutorial will start at 14:30 - In the meanwhile please download the
More informationLecture 12 Recognition
Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza 1 Lab exercise today replaced by Deep Learning Tutorial Room ETH HG E 1.1 from 13:15 to 15:00 Optional lab
More informationCPPP/UFMS at ImageCLEF 2014: Robot Vision Task
CPPP/UFMS at ImageCLEF 2014: Robot Vision Task Rodrigo de Carvalho Gomes, Lucas Correia Ribas, Amaury Antônio de Castro Junior, Wesley Nunes Gonçalves Federal University of Mato Grosso do Sul - Ponta Porã
More informationFuzzy Segmentation. Chapter Introduction. 4.2 Unsupervised Clustering.
Chapter 4 Fuzzy Segmentation 4. Introduction. The segmentation of objects whose color-composition is not common represents a difficult task, due to the illumination and the appropriate threshold selection
More informationContent-Based Image Classification: A Non-Parametric Approach
1 Content-Based Image Classification: A Non-Parametric Approach Paulo M. Ferreira, Mário A.T. Figueiredo, Pedro M. Q. Aguiar Abstract The rise of the amount imagery on the Internet, as well as in multimedia
More informationComputer Vision. Exercise Session 10 Image Categorization
Computer Vision Exercise Session 10 Image Categorization Object Categorization Task Description Given a small number of training images of a category, recognize a-priori unknown instances of that category
More informationLecture 12 Recognition. Davide Scaramuzza
Lecture 12 Recognition Davide Scaramuzza Oral exam dates UZH January 19-20 ETH 30.01 to 9.02 2017 (schedule handled by ETH) Exam location Davide Scaramuzza s office: Andreasstrasse 15, 2.10, 8050 Zurich
More informationVK Multimedia Information Systems
VK Multimedia Information Systems Mathias Lux, mlux@itec.uni-klu.ac.at Dienstags, 16.oo Uhr This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Agenda Evaluations
More informationA Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data
Journal of Computational Information Systems 11: 6 (2015) 2139 2146 Available at http://www.jofcis.com A Fuzzy C-means Clustering Algorithm Based on Pseudo-nearest-neighbor Intervals for Incomplete Data
More informationINF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering
INF4820 Algorithms for AI and NLP Evaluating Classifiers Clustering Murhaf Fares & Stephan Oepen Language Technology Group (LTG) September 27, 2017 Today 2 Recap Evaluation of classifiers Unsupervised
More informationTensor Decomposition of Dense SIFT Descriptors in Object Recognition
Tensor Decomposition of Dense SIFT Descriptors in Object Recognition Tan Vo 1 and Dat Tran 1 and Wanli Ma 1 1- Faculty of Education, Science, Technology and Mathematics University of Canberra, Australia
More informationRobotics Programming Laboratory
Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car
More informationPreviously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011
Previously Part-based and local feature models for generic object recognition Wed, April 20 UT-Austin Discriminative classifiers Boosting Nearest neighbors Support vector machines Useful for object recognition
More informationCS 231A CA Session: Problem Set 4 Review. Kevin Chen May 13, 2016
CS 231A CA Session: Problem Set 4 Review Kevin Chen May 13, 2016 PS4 Outline Problem 1: Viewpoint estimation Problem 2: Segmentation Meanshift segmentation Normalized cut Problem 1: Viewpoint Estimation
More informationLecture 12 Recognition
Institute of Informatics Institute of Neuroinformatics Lecture 12 Recognition Davide Scaramuzza http://rpg.ifi.uzh.ch/ 1 Lab exercise today replaced by Deep Learning Tutorial by Antonio Loquercio Room
More informationString distance for automatic image classification
String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,
More informationBeyond bags of features: Adding spatial information. Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba Adding spatial information Forming vocabularies from pairs of nearby features doublets
More informationNew Approach for K-mean and K-medoids Algorithm
New Approach for K-mean and K-medoids Algorithm Abhishek Patel Department of Information & Technology, Parul Institute of Engineering & Technology, Vadodara, Gujarat, India Purnima Singh Department of
More informationBag of Words Models. CS4670 / 5670: Computer Vision Noah Snavely. Bag-of-words models 11/26/2013
CS4670 / 5670: Computer Vision Noah Snavely Bag-of-words models Object Bag of words Bag of Words Models Adapted from slides by Rob Fergus and Svetlana Lazebnik 1 Object Bag of words Origin 1: Texture Recognition
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationEfficient Kernels for Identifying Unbounded-Order Spatial Features
Efficient Kernels for Identifying Unbounded-Order Spatial Features Yimeng Zhang Carnegie Mellon University yimengz@andrew.cmu.edu Tsuhan Chen Cornell University tsuhan@ece.cornell.edu Abstract Higher order
More informationCOLOR image segmentation is a method of assigning
Proceedings of the Federated Conference on Computer Science and Information Systems pp. 1049 1054 DOI: 10.15439/2015F222 ACSIS, Vol. 5 Applying fuzzy clustering method to color image segmentation Omer
More informationIMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES
IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES Pin-Syuan Huang, Jing-Yi Tsai, Yu-Fang Wang, and Chun-Yi Tsai Department of Computer Science and Information Engineering, National Taitung University,
More informationCLUSTERING. CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16
CLUSTERING CSE 634 Data Mining Prof. Anita Wasilewska TEAM 16 1. K-medoids: REFERENCES https://www.coursera.org/learn/cluster-analysis/lecture/nj0sb/3-4-the-k-medoids-clustering-method https://anuradhasrinivas.files.wordpress.com/2013/04/lesson8-clustering.pdf
More informationA Comparative study of Clustering Algorithms using MapReduce in Hadoop
A Comparative study of Clustering Algorithms using MapReduce in Hadoop Dweepna Garg 1, Khushboo Trivedi 2, B.B.Panchal 3 1 Department of Computer Science and Engineering, Parul Institute of Engineering
More informationCS229: Action Recognition in Tennis
CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active
More informationAggregated Color Descriptors for Land Use Classification
Aggregated Color Descriptors for Land Use Classification Vedran Jovanović and Vladimir Risojević Abstract In this paper we propose and evaluate aggregated color descriptors for land use classification
More informationCodebook Graph Coding of Descriptors
Int'l Conf. Par. and Dist. Proc. Tech. and Appl. PDPTA'5 3 Codebook Graph Coding of Descriptors Tetsuya Yoshida and Yuu Yamada Graduate School of Humanities and Science, Nara Women s University, Nara,
More informationGeneralized Fuzzy Clustering Model with Fuzzy C-Means
Generalized Fuzzy Clustering Model with Fuzzy C-Means Hong Jiang 1 1 Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, US jiangh@cse.sc.edu http://www.cse.sc.edu/~jiangh/
More informationINF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering
INF4820, Algorithms for AI and NLP: Evaluating Classifiers Clustering Erik Velldal University of Oslo Sept. 18, 2012 Topics for today 2 Classification Recap Evaluating classifiers Accuracy, precision,
More informationFeature-Guided K-Means Algorithm for Optimal Image Vector Quantizer Design
Journal of Information Hiding and Multimedia Signal Processing c 2017 ISSN 2073-4212 Ubiquitous International Volume 8, Number 6, November 2017 Feature-Guided K-Means Algorithm for Optimal Image Vector
More informationLarge-scale visual recognition The bag-of-words representation
Large-scale visual recognition The bag-of-words representation Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline Bag-of-words Large or small vocabularies? Extensions for instance-level
More informationPart-based and local feature models for generic object recognition
Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza
More informationScene Recognition using Bag-of-Words
Scene Recognition using Bag-of-Words Sarthak Ahuja B.Tech Computer Science Indraprastha Institute of Information Technology Okhla, Delhi 110020 Email: sarthak12088@iiitd.ac.in Anchita Goel B.Tech Computer
More informationOBJECT CATEGORIZATION
OBJECT CATEGORIZATION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it Slides: Ing. Lamberto Ballan November 18th, 2009 What is an Object? Merriam-Webster Definition: Something material that may be
More informationCS6670: Computer Vision
CS6670: Computer Vision Noah Snavely Lecture 16: Bag-of-words models Object Bag of words Announcements Project 3: Eigenfaces due Wednesday, November 11 at 11:59pm solo project Final project presentations:
More informationINF4820. Clustering. Erik Velldal. Nov. 17, University of Oslo. Erik Velldal INF / 22
INF4820 Clustering Erik Velldal University of Oslo Nov. 17, 2009 Erik Velldal INF4820 1 / 22 Topics for Today More on unsupervised machine learning for data-driven categorization: clustering. The task
More informationCluster Analysis. Ying Shen, SSE, Tongji University
Cluster Analysis Ying Shen, SSE, Tongji University Cluster analysis Cluster analysis groups data objects based only on the attributes in the data. The main objective is that The objects within a group
More informationDeveloping Open Source code for Pyramidal Histogram Feature Sets
Developing Open Source code for Pyramidal Histogram Feature Sets BTech Project Report by Subodh Misra subodhm@iitk.ac.in Y648 Guide: Prof. Amitabha Mukerjee Dept of Computer Science and Engineering IIT
More informationFACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE. Project Plan
FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE Project Plan Structured Object Recognition for Content Based Image Retrieval Supervisors: Dr. Antonio Robles Kelly Dr. Jun
More informationINF4820 Algorithms for AI and NLP. Evaluating Classifiers Clustering
INF4820 Algorithms for AI and NLP Evaluating Classifiers Clustering Erik Velldal & Stephan Oepen Language Technology Group (LTG) September 23, 2015 Agenda Last week Supervised vs unsupervised learning.
More informationTexture Image Segmentation using FCM
Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M
More informationObject Classification Problem
HIERARCHICAL OBJECT CATEGORIZATION" Gregory Griffin and Pietro Perona. Learning and Using Taxonomies For Fast Visual Categorization. CVPR 2008 Marcin Marszalek and Cordelia Schmid. Constructing Category
More informationClustering. Chapter 10 in Introduction to statistical learning
Clustering Chapter 10 in Introduction to statistical learning 16 14 12 10 8 6 4 2 0 2 4 6 8 10 12 14 1 Clustering ² Clustering is the art of finding groups in data (Kaufman and Rousseeuw, 1990). ² What
More informationClustering & Classification (chapter 15)
Clustering & Classification (chapter 5) Kai Goebel Bill Cheetham RPI/GE Global Research goebel@cs.rpi.edu cheetham@cs.rpi.edu Outline k-means Fuzzy c-means Mountain Clustering knn Fuzzy knn Hierarchical
More informationNovel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data
Novel Intuitionistic Fuzzy C-Means Clustering for Linearly and Nonlinearly Separable Data PRABHJOT KAUR DR. A. K. SONI DR. ANJANA GOSAIN Department of IT, MSIT Department of Computers University School
More informationVideo annotation based on adaptive annular spatial partition scheme
Video annotation based on adaptive annular spatial partition scheme Guiguang Ding a), Lu Zhang, and Xiaoxu Li Key Laboratory for Information System Security, Ministry of Education, Tsinghua National Laboratory
More informationBossaNova at ImageCLEF 2012 Flickr Photo Annotation Task
BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task S. Avila 1,2, N. Thome 1, M. Cord 1, E. Valle 3, and A. de A. Araújo 2 1 Pierre and Marie Curie University, UPMC-Sorbonne Universities, LIP6, France
More informationEVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION. Ing. Lorenzo Seidenari
EVENT DETECTION AND HUMAN BEHAVIOR RECOGNITION Ing. Lorenzo Seidenari e-mail: seidenari@dsi.unifi.it What is an Event? Dictionary.com definition: something that occurs in a certain place during a particular
More informationEE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm
EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant
More informationPart based models for recognition. Kristen Grauman
Part based models for recognition Kristen Grauman UT Austin Limitations of window-based models Not all objects are box-shaped Assuming specific 2d view of object Local components themselves do not necessarily
More informationImage classification Computer Vision Spring 2018, Lecture 18
Image classification http://www.cs.cmu.edu/~16385/ 16-385 Computer Vision Spring 2018, Lecture 18 Course announcements Homework 5 has been posted and is due on April 6 th. - Dropbox link because course
More informationBy Suren Manvelyan,
By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan, http://www.surenmanvelyan.com/gallery/7116 By Suren Manvelyan,
More informationClustering CS 550: Machine Learning
Clustering CS 550: Machine Learning This slide set mainly uses the slides given in the following links: http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf http://www-users.cs.umn.edu/~kumar/dmbook/dmslides/chap8_basic_cluster_analysis.pdf
More informationAn Improvement of Centroid-Based Classification Algorithm for Text Classification
An Improvement of Centroid-Based Classification Algorithm for Text Classification Zehra Cataltepe, Eser Aygun Istanbul Technical Un. Computer Engineering Dept. Ayazaga, Sariyer, Istanbul, Turkey cataltepe@itu.edu.tr,
More informationWorking with Unlabeled Data Clustering Analysis. Hsiao-Lung Chan Dept Electrical Engineering Chang Gung University, Taiwan
Working with Unlabeled Data Clustering Analysis Hsiao-Lung Chan Dept Electrical Engineering Chang Gung University, Taiwan chanhl@mail.cgu.edu.tw Unsupervised learning Finding centers of similarity using
More informationCHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS
CHAPTER 4 FUZZY LOGIC, K-MEANS, FUZZY C-MEANS AND BAYESIAN METHODS 4.1. INTRODUCTION This chapter includes implementation and testing of the student s academic performance evaluation to achieve the objective(s)
More informationMultimodal Medical Image Retrieval based on Latent Topic Modeling
Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath
More informationPreliminary Local Feature Selection by Support Vector Machine for Bag of Features
Preliminary Local Feature Selection by Support Vector Machine for Bag of Features Tetsu Matsukawa Koji Suzuki Takio Kurita :University of Tsukuba :National Institute of Advanced Industrial Science and
More informationarxiv: v3 [cs.cv] 3 Oct 2012
Combined Descriptors in Spatial Pyramid Domain for Image Classification Junlin Hu and Ping Guo arxiv:1210.0386v3 [cs.cv] 3 Oct 2012 Image Processing and Pattern Recognition Laboratory Beijing Normal University,
More informationRegion-based Segmentation
Region-based Segmentation Image Segmentation Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation. Applications: Finding tumors, veins, etc.
More informationUnsupervised Data Mining: Clustering. Izabela Moise, Evangelos Pournaras, Dirk Helbing
Unsupervised Data Mining: Clustering Izabela Moise, Evangelos Pournaras, Dirk Helbing Izabela Moise, Evangelos Pournaras, Dirk Helbing 1 1. Supervised Data Mining Classification Regression Outlier detection
More informationCENTERED FEATURES FROM INTEGRAL IMAGES
CENTERED FEATURES FROM INTEGRAL IMAGES ZOHAIB KHAN Bachelor of Applied Information Technology Thesis Report No. 2009:070 ISSN: 1651-4769 University of Gothenburg Department of Applied Information Technology
More informationContents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation
Contents Machine Learning concepts 4 Learning Algorithm 4 Predictive Model (Model) 4 Model, Classification 4 Model, Regression 4 Representation Learning 4 Supervised Learning 4 Unsupervised Learning 4
More informationI. INTRODUCTION II. RELATED WORK.
ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: A New Hybridized K-Means Clustering Based Outlier Detection Technique
More information1 Case study of SVM (Rob)
DRAFT a final version will be posted shortly COS 424: Interacting with Data Lecturer: Rob Schapire and David Blei Lecture # 8 Scribe: Indraneel Mukherjee March 1, 2007 In the previous lecture we saw how
More informationFigure (5) Kohonen Self-Organized Map
2- KOHONEN SELF-ORGANIZING MAPS (SOM) - The self-organizing neural networks assume a topological structure among the cluster units. - There are m cluster units, arranged in a one- or two-dimensional array;
More informationCHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION
CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant
More informationIntroduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others
Introduction to object recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Overview Basic recognition tasks A statistical learning approach Traditional or shallow recognition
More informationBlood Microscopic Image Analysis for Acute Leukemia Detection
I J C T A, 9(9), 2016, pp. 3731-3735 International Science Press Blood Microscopic Image Analysis for Acute Leukemia Detection V. Renuga, J. Sivaraman, S. Vinuraj Kumar, S. Sathish, P. Padmapriya and R.
More informationLocal Features and Bag of Words Models
10/14/11 Local Features and Bag of Words Models Computer Vision CS 143, Brown James Hays Slides from Svetlana Lazebnik, Derek Hoiem, Antonio Torralba, David Lowe, Fei Fei Li and others Computer Engineering
More informationColour Image Segmentation Using K-Means, Fuzzy C-Means and Density Based Clustering
Colour Image Segmentation Using K-Means, Fuzzy C-Means and Density Based Clustering Preeti1, Assistant Professor Kompal Ahuja2 1,2 DCRUST, Murthal, Haryana (INDIA) DITM, Gannaur, Haryana (INDIA) Abstract:
More informationClassification of objects from Video Data (Group 30)
Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time
More informationLatent Variable Models and Expectation Maximization
Latent Variable Models and Expectation Maximization Oliver Schulte - CMPT 726 Bishop PRML Ch. 9 2 4 6 8 1 12 14 16 18 2 4 6 8 1 12 14 16 18 5 1 15 2 25 5 1 15 2 25 2 4 6 8 1 12 14 2 4 6 8 1 12 14 5 1 15
More informationIMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION. Maral Mesmakhosroshahi, Joohee Kim
IMPROVING SPATIO-TEMPORAL FEATURE EXTRACTION TECHNIQUES AND THEIR APPLICATIONS IN ACTION CLASSIFICATION Maral Mesmakhosroshahi, Joohee Kim Department of Electrical and Computer Engineering Illinois Institute
More informationLarge-scale visual recognition Efficient matching
Large-scale visual recognition Efficient matching Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline!! Preliminary!! Locality Sensitive Hashing: the two modes!! Hashing!! Embedding!!
More informationEfficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points
Efficiency of k-means and K-Medoids Algorithms for Clustering Arbitrary Data Points Dr. T. VELMURUGAN Associate professor, PG and Research Department of Computer Science, D.G.Vaishnav College, Chennai-600106,
More informationFeature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking
Feature descriptors Alain Pagani Prof. Didier Stricker Computer Vision: Object and People Tracking 1 Overview Previous lectures: Feature extraction Today: Gradiant/edge Points (Kanade-Tomasi + Harris)
More informationImproving Recognition through Object Sub-categorization
Improving Recognition through Object Sub-categorization Al Mansur and Yoshinori Kuno Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Saitama 338-8570,
More informationFUZZY C-MEANS ALGORITHM BASED ON PRETREATMENT OF SIMILARITY RELATIONTP
Dynamics of Continuous, Discrete and Impulsive Systems Series B: Applications & Algorithms 14 (2007) 103-111 Copyright c 2007 Watam Press FUZZY C-MEANS ALGORITHM BASED ON PRETREATMENT OF SIMILARITY RELATIONTP
More informationComputer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction
Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction Preprocessing The goal of pre-processing is to try to reduce unwanted variation in image due to lighting,
More informationA Rapid Automatic Image Registration Method Based on Improved SIFT
Available online at www.sciencedirect.com Procedia Environmental Sciences 11 (2011) 85 91 A Rapid Automatic Image Registration Method Based on Improved SIFT Zhu Hongbo, Xu Xuejun, Wang Jing, Chen Xuesong,
More informationGeneric object recognition using graph embedding into a vector space
American Journal of Software Engineering and Applications 2013 ; 2(1) : 13-18 Published online February 20, 2013 (http://www.sciencepublishinggroup.com/j/ajsea) doi: 10.11648/j. ajsea.20130201.13 Generic
More informationArtistic ideation based on computer vision methods
Journal of Theoretical and Applied Computer Science Vol. 6, No. 2, 2012, pp. 72 78 ISSN 2299-2634 http://www.jtacs.org Artistic ideation based on computer vision methods Ferran Reverter, Pilar Rosado,
More informationIntroduction to Machine Learning. Xiaojin Zhu
Introduction to Machine Learning Xiaojin Zhu jerryzhu@cs.wisc.edu Read Chapter 1 of this book: Xiaojin Zhu and Andrew B. Goldberg. Introduction to Semi- Supervised Learning. http://www.morganclaypool.com/doi/abs/10.2200/s00196ed1v01y200906aim006
More informationUnsupervised Learning Partitioning Methods
Unsupervised Learning Partitioning Methods Road Map 1. Basic Concepts 2. K-Means 3. K-Medoids 4. CLARA & CLARANS Cluster Analysis Unsupervised learning (i.e., Class label is unknown) Group data to form
More informationUnsupervised Feature Selection for Sparse Data
Unsupervised Feature Selection for Sparse Data Artur Ferreira 1,3 Mário Figueiredo 2,3 1- Instituto Superior de Engenharia de Lisboa, Lisboa, PORTUGAL 2- Instituto Superior Técnico, Lisboa, PORTUGAL 3-
More informationA Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images
A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract
More informationOlmo S. Zavala Romero. Clustering Hierarchical Distance Group Dist. K-means. Center of Atmospheric Sciences, UNAM.
Center of Atmospheric Sciences, UNAM November 16, 2016 Cluster Analisis Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster)
More informationEvaluation of Local Space-time Descriptors based on Cuboid Detector in Human Action Recognition
International Journal of Innovation and Applied Studies ISSN 2028-9324 Vol. 9 No. 4 Dec. 2014, pp. 1708-1717 2014 Innovative Space of Scientific Research Journals http://www.ijias.issr-journals.org/ Evaluation
More informationFacial Expression Classification with Random Filters Feature Extraction
Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle
More informationHard clustering. Each object is assigned to one and only one cluster. Hierarchical clustering is usually hard. Soft (fuzzy) clustering
An unsupervised machine learning problem Grouping a set of objects in such a way that objects in the same group (a cluster) are more similar (in some sense or another) to each other than to those in other
More informationSparse coding for image classification
Sparse coding for image classification Columbia University Electrical Engineering: Kun Rong(kr2496@columbia.edu) Yongzhou Xiang(yx2211@columbia.edu) Yin Cui(yc2776@columbia.edu) Outline Background Introduction
More informationBag-of-features. Cordelia Schmid
Bag-of-features for category classification Cordelia Schmid Visual search Particular objects and scenes, large databases Category recognition Image classification: assigning a class label to the image
More informationCLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS
CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of
More informationUnsupervised Learning : Clustering
Unsupervised Learning : Clustering Things to be Addressed Traditional Learning Models. Cluster Analysis K-means Clustering Algorithm Drawbacks of traditional clustering algorithms. Clustering as a complex
More informationColor-Based Classification of Natural Rock Images Using Classifier Combinations
Color-Based Classification of Natural Rock Images Using Classifier Combinations Leena Lepistö, Iivari Kunttu, and Ari Visa Tampere University of Technology, Institute of Signal Processing, P.O. Box 553,
More informationImproving the detection of excessive activation of ciliaris muscle by clustering thermal images
11 th International Conference on Quantitative InfraRed Thermography Improving the detection of excessive activation of ciliaris muscle by clustering thermal images *University of Debrecen, Faculty of
More informationA Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images
A Sparse and Locally Shift Invariant Feature Extractor Applied to Document Images Marc Aurelio Ranzato Yann LeCun Courant Institute of Mathematical Sciences New York University - New York, NY 10003 Abstract
More information