New Version of Davies-Bouldin Index for Clustering Validation Based on Cylindrical Distance
|
|
- Stephanie Kelley
- 5 years ago
- Views:
Transcription
1 New Version of Davies-Boulin Inex for lustering Valiation Base on ylinrical Distance Juan arlos Roas Thomas Faculta e Informática Universia omplutense e Mari Mari, España correoroas@gmail.com Abstract This paper presents a new version of Davies- Boulin inex for clustering valiation through the use of a new istance base on ensity. This new istance is use as a similarity measurement between the means of the clusters, with the purpose of overcoming the limitations of the Eucliean istance. The new istance propose allows consiering the istribution of the ata set an to approximate in a more accurate way the separation between clusters through the estimation of the ensities along the line segments that connect the centrois. Keywors lustering; Davies-Boulin Inex; Density; ylinrical Distance. Introuction The process of clustering consists on classifying in an unsupervise way a set of patterns (observations or ata) into groups (clusters) []. In general, the clustering methos shoul search for clusters whose members are close to each other (in other wors have a high egree of similarity) an well separate [2]. One of the most important issues in cluster analysis is the evaluation of clustering results to fin the partition that best fits the unerlying ata. This is the main subect of clustering valiation [2]. In general, there are three approaches to investigate clustering valiation: external criteria, internal criteria an relative criteria [3]. lustering valiation approaches, which are base on relative criteria, aim at fining the best clustering scheme that a clustering algorithm can efine uner certain assumptions an parameter. Here the basic iea is the evaluation of a clustering structure by comparing it to other clustering schemes, resulting by the same algorithm but with ifferent parameter values [4]. The Davies-Boulin inex falls into the latter category. Such inexes are use when the partitions generate by the applie clustering algorithm are no overlapping, meaning by this that each ata belongs strictly to an only one class [4]. The Davies-Boulin inex is base on the approximately estimation of the istances between clusters an their ispersions to obtain a final value that represents the quality of the partition. One of the statistics use to estimate the istances between the clusters is the Eucliean istance between the means. The problem with this statistic is that it oesn't consier the geometry of the clusters; instea it reuces the estimation to the Eucliean istance between representative points (the means). As a result, two clusters with means very close each other will be consiere very close even if their ata are not. In orer to overcome these limitations, this paper proposes a special type of istance, the cylinrical istance, which is use to calculate the istance between the means. This istance tries to capture the ata ensity along the straight lines that connect the means, an through this, to estimate how close are the clusters as a whole. This technique is valiate by comparing its results with the original version of the inex over real atasets. This paper presents the following structure: first a escription of the original inex an its limitations an relate works. Then, the cylinrical istance an the new version of the inex propose are explaine. Afterwor is presente the comparative performance of the original inex an the new version propose, an finally the conclusions an future works. I. DAVIES-BOULDIN INDEX A. Definition This inex (DB) is base on the iea that for a goo partition inter cluster separation as well as intra cluster homogeneity an compactness shoul be high [5]. Then, to efine the DB inex, we nee to efine the ispersion measure an the cluster similarity measure [6]. In [] the ispersion S i of i cluster () an the separation D between ith an th clusters (2) are efine as: i x p p D x, c i, p 0 i () Where i is the number of ata points in cluster i an c i is the center of cluster i, an: D v l il v l t t, t (2)
2 Where v i an v are the centrois of clusters i an, respectively. Then, the DB inex is efine as: V k DB k i Where k is the number of clusters an is efine as: max R i Where R is the similarity measure between clusters i an, an is efine as: R S D nce the goal is to achieve minimum within-cluster ispersion an maximum between-cluster separation, the number of clusters c that minimizes V DB is taken as the optimal value of c [5]. B. Limitations of the Original Proposal Essentially, the main limitation of the original proposal is that none of its terms consiers the geometry of the spatial istribution of the clusters. This limitation is irectly relate to the use of the means to measure the istances between the clusters. Specifically, the use of means as a measure of istance between clusters fails when they have ifferences variances, or the variances are not equal in all axes. Using the Eucliean istance between the means to calculate the istances between the clusters can lea to situations where two clusters whose geometries are close but its means are far from each other will be consiere more istant than two clusters whose geometries are more istant but its means are closer. This situation is illustrate in Fig.. Another limitation of the use of the Eucliean istance between the means is its inability to istinguish when it is measuring istances between centrois that represent genuine clusters, from situations where it is in front of centrois that represent a partition that it is really iviing a genuine one. The Fig. 2 gives an example of two ifferent configurations where the Eucliean istance between the means are equal, but actually in only one of them the centrois represents genuine well separate clusters. (3) (4) (5) a) b) Fig. 2. The images show two configurations where the istance between the centrois are equals, but in one of them (a) the centrois belong to the same cluster, an in the other (b) the centrois belong to a well separate clusters. A final example of the limitations of the inex are ilustrate in Fig. 3. These images show two iferent configurations of clusters. In the first configuration (a) the clusters have a maximum variance vector along the horizontal axe. The secon one (b) is the result of rotating the maximum variance vectors of the first configuration to a vertical position. As a result, both configurations have the same means an the same ispersions, an as a consecuence the value of the Davies-Boulin inex will be the same for both, even if in the secon configuration (b) the clusters are more well separate than in the first one(a). a) b) Fig. 3. The images show two configurations of clusters with the same means an ispersion, but with iferent orientations of the maximum variance vectors. Both have the same Davies-Boulin inex value. Fig.. The image shows two clusters with closer geometries but more istan means (blue an green clusters) an the opositte situation (blue an brown clusters). II. RELATED WORKS In [5] the Davies-Boulin inex is generalize through the use of graphs. Specifically, they use Minimal Spanning Tree (MST), Relative Neighborhoo Graph (RNG) an Gabriel Graph (GG). However, these graphs are use only to obtain the
3 ispersion of each cluster, while the measurement of istance between clusters is still the istance between the means. For this reason, those approaches are not consiere equivalent to this proect, because they tackle another problem of the Davies- Boulin inex. Instea, they are consiere complementary, an will not be use in the evaluation of the new version. Respect to istances base on ensity, there is a group of clustering algorithms bases on ensity, where clusters are efine as ense regions separate by low-ensity regions [6]. Examples of these algorithms are DBSAN [9] an BRIDGE [0]. In these algorithm the concept of ensity is use for etermine a relation of connectivity of ata points that belong to the same cluster rather than to efine a new measure of istance for replacing the Eucliean one. III. THE PROPOSAL The proposal presente in this paper consists in a new istance, calle cylinrical istance, which is use to measure the istance between the means, instea of the Eucliean one. The main iea behin this istance is to capture the ata ensity in a limite region of the space aroun the straight line that connects the means, as it is illustrate in Fig. 4. This region is etermine by a parameter, the raius. All ata that are locate in a istance, from the straight line that connects the means, equal or less that the raius, an whose proections to this segment is perpenicular, are consiere to belong to this region, an use in the final calculation of the cylinrical istance. This process is illustrate in Fig 5. A. Definition of ylinrical Distance Let D be the ataset of n imensions. Let be any ata point that belongs to the ataset D. Let p, p 2 be the ata points of which the istance is wante to be etermine. Let L(p, p 2 ) be the length of the line segment that has p an p 2 as its extremes. Let E(, p, p 2 ) be the Eucliean istance between the ata point an the line segment etermine by the ata points p an p 2. Let R(r, p, p 2 ) be the region that is etermine by the parameters p, p 2 an r, where p an p 2 correspons to the beginning an the ening of a line segment, an r the raius that etermines the limits of the region R. Let P(,p,p 2 ) be the angle between the line segment etermine by the ata points p an p 2, an the shortest line that connects the ata point with that line segment. Let be the subset of ata points that belong to the region R, efines as:, p, p / 2 E, p p D P 2, 2 r (6) Let ρ(r) the relative ensity of the region R(r, p, p 2 ), efine as: R L( p, p 2 ) (7) Where correspons to the carinality of the subset. Then the cylinrical istance θ(r, p, p 2 ) is efine as: r, p, R (8) Fig. 4. The image shows an example of a region R generate to measure the cilynrical istance between two centrois (re an black circles). a) b) p Meaning that enser the region s, closer will be the ata points consiere. As a result of the efinition of the relative ensity of R, ρ(r), if this region oes not contain any ata point, then the value of the cylinrical istance is equal to the value of the Eucliean istance between p an p 2. c) r Fig. 5. The images show graphically the process of creation of the cylinrical region R, first (a) the two ata points of which the istance is to be calculate are selecte (p an ). Then (b) the line segment that connects both ata points is built, an finally (c) using the parameter r (raius) the region is built an the ata points that fall insie are use for the istance estimation (). ) B. Determination of the set of Data Points in R To obtain the ata points that belong to the region s necessary to etermine first, which ata points have a perpenicular proection over the line segment etermine by p an p 2, the ata points of which the istance is to be calculate. To perform this, it is use the law of the cosines. The use of this law has the avantage that allows avoiing the complexities of working with geometries of high imensions, an at the same time, it generalizes the algorithm to ifferent features spaces. To apply this law a triangle whose vertices are p, p 2 an the ata point, whose membership to the region s to be etermine, is built, as illustrate in Fig. 6. Then, the lengths of the sies of the triangle are calculate, an finally, the law of the cosines is applie with the purpose of obtaining the
4 measures of the inner angles of the triangle. If the two angles between the line segment p p an the two sies of the triangle 2 p an are not greater than π/2, then the ata point has a perpenicular proection over the central line segment of the region R. If it is the case, the secon conition to belong to this region it is the length of that proection (the line segment conforme by the ata point an the nearest point of the central line segment) is not greater than the parameter r, the raius of the region R. p α β α p β Fig. 6. The images show the two ifferent configurations of the triangles mae by the ata points of which the istance is to be calculate (p an ) an a ata point belonging to the ata set (). When the ata point has a perpenicular proection over the base of the triangle, the inner angles of the base of the triangle are always less or equal than π/2 (left image). When it is not the case, one of these angles will be always greater than π/2 (right image).. ylinrical Distance Algorithm Input: ata points p,, raius r, ataset D Begin n (Number of ata points belonging to R) = 0 alculate the length of p For each Data Point in D Do alculate the length of p an alculate the angles α an β If α<=π/2 an β<=π/2 then alculate E(, p, p 2 ), the Eucliean istance between an p If E(, p, p 2 )<=r Then n=n+ En If En If En For θ=l(p,p 2 )/(n+) En D. Definition of the New Inex The new inex, calle the cylinrical version of Davies- Boulin inex, DB, is efine as: k DB Ri (9) k i Where k is the number of clusters an R i is efine as: max R (0) i Where R is the similarity measure between clusters i an, an is efine as: S R () r, v, v i Where θ(r, v i, v ) correspons to the cylinrical istance between the v i, the centroi of the cluster i, an v, the centroi of the cluster, an r to the raius of the cylinrical region. The ispersion S i of i cluster is efine as usual in the traitional Davies-Boulin inex. IV. RESULTS A series of experiments on ifferent sets of test were mae to compare the performance of the propose version (DB ) with the original version of the Davies-Boulin inex. In the process of evaluation the Ran inex was use, which allows to measure the level of similitue between two partitions, with values ranging from zero (minimal similitue) to one (maximal similitue) [6]. Another coefficient that was use is the Pearson correlation coefficient. This can take values from - to +. A value of + show that the variables are perfectly linear relate by an increasing relationship, a value of - shows that the variables are perfectly linear relate by an ecreasing relationship, an a value of 0 shows that the variables are not linear relate by each other. It is consiere a strong correlation if the correlation coefficient is greater than 0.8 an a weak correlation if the correlation coefficient is less than 0.5 [7]. The methoology use was the following: first, the ranges of values of the features of each ataset were scale to a range of 0 to 00. Then, applying the clustering algorithm k-means with variable parameters were obtaine 20 ifferent partitions on each ataset use. Then, for each partition obtaine, were calculate the original version of the Davies-Boulin inex (DB), the propose version of the inex (DB ) using 5 ifferent values for the raius (3, 5, 8, 0 an 5) an the Ran inex, with the obective to see the similitue between the partition generate an the original classes of the ata set. Finally was obtaine the Pearson correlation between each version of the DB inex an the Ran inex consiering the 20 experiments. This process was applie over 8 ifferent atasets. They are the IRIS Dataset (3 classes, 4 features an 50 instances) [8], Ecoli Data Set (8 classes, 7 features an 336 instances) [8], Wine Data Set (3 classes, 3 features an 78 instances) [8], Vertebral olumn Data Set (3 classes, 6 features an 30 instances) [8], limate Moel mulation rashes Data Set (2 classes, 8 features, 540 instances) [8], Glass Ientification Data Set (6 classes, 9 features, 24 instances) [8], Page Block lassification Data Set (5 classes, 0 features, 5473 instances) [8] an an Artificial Dataset of Gaussian istributions (3 classes, 2 features, 3500 instances). The Table shows the values of Pearson coefficient obtaine by each of the inex evaluate. Because the obective of the Davies-Boulin inex an its erivatives is to be minimize, a high negative value in the Pearson coefficient inicates a goo performance of the inex. Those values which are highlighte inicate when the DB inex ha the best performance. V. ONLUSIONS AND FUTURE WORKS In four of the atasets the propose inex ha better results than the original version (Ecoli, Iris, Page an Artificial), showing a very goo performance. In other two atasets where the original Davies-Boulin inex showe a goo performance too (Wine an limate) the results were equivalent, even with
5 the ifferent raius use in the new version. An in the remaining two atasets where the original version of the inex showe a ba performance (olumn an Glass) the new version was not able to improve it. As a conclusion, the results are very promising, but, at the same time, they suggest that a more eep analysis is necessary for unerstaning the behavior of the inex, the new version an the original one, their relations with the structural characteristics of the atasets an the influence of the clustering algorithm use for generating the partitions. An explanation of the occasions where the new inex showe the same goo performance than the original one coul be that sometimes the structural characteristics of the atasets (an the partitions generate) make the consierations of the ensities irrelevant, an in the occasions where the new inex couln't improve the ba performance of the original coul be ue to the structural characteristics of the atasets make impossible for the Davies-Boulin inex to obtain goo results, an it woul be a limitation to any new version base on it. As a future works, it is possible to obtain better results aing moification to the cylinrical istance, with the obective of taking a more accurate measurement of the ensities involve, for example, moifying the region R or optimizing the raius value. TABLE I. DB DB Raius Wine Ecoli Iris olumn limate Glass Page Artificial REFERENES [] A.K. Jain, M.N. Murty, O.J. Flynn Data lustering: a review, AM omputing Surveys, Vol. 3, No. 3, September 999 [2] M. Halkii, Y. Batistakis, M. Vazirgiannis, "On lustering Valiation Techniques", Intelligent Information Systems Journal, Kluwer Pulishers, 7(2-3): 07-45, 200 [3] M. Halkii, Y. Batistakis, M. Vazirgiannis. "luster Valiity Methos: Part I", SIGMOD Recor, June 2002 [4] M. Halkii, Y. Batistakis, M. Vazirgiannis. "luster Valiity Methos: Part II", SIGMOD Recor, September [5] Nikhil R. Pal, J. Biswas: luster valiation using graph theoretic concepts. Pattern Recognition 30(6): (997) [6] Guoun Gan, haoqun Ma, Jianhong Wu, Data lustering Theory, algorithms an applications SIAM, Society for Inustrial an Applie Mathematics (May 30, 2007) [7] Sorana-Daniela BOLBOAĂ, Lorentz JÄNTSHI, Pearson versus Spearman, Kenall's Tau orrelation Analysis onstructure-activity Relationships of Biologic Active ompouns, Leonaro Journal of Sciences, Issue 9, July-December 2006, p [8] Frank, A. & Asuncion, A. (200). UI Machine Learning Repository [ Irvine, A: University of alifornia, School of Information an omputer Science. [9] Ester, M., Kriegel, H.-P., Saner, J., an Xu, X A ensity-base algorithm for iscovering clusters in large spatial atabases with noise. Proc. 2n Int. onf. on Knowlege Discovery an Data Mining. Portlan, OR, pp [0] Dash, M., Liu, H. an Xu, X. (200), "+>2: Merging Distance an Density Base lustering", In Proceeing of the 7th International onference on Database Systems for Avance Applications (DASFAA '0), pages Hong Kong. IEEE omputer Society Publisher.
Online Appendix to: Generalizing Database Forensics
Online Appenix to: Generalizing Database Forensics KYRIACOS E. PAVLOU an RICHARD T. SNODGRASS, University of Arizona This appenix presents a step-by-step iscussion of the forensic analysis protocol that
More informationNEW METHOD FOR FINDING A REFERENCE POINT IN FINGERPRINT IMAGES WITH THE USE OF THE IPAN99 ALGORITHM 1. INTRODUCTION 2.
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 13/009, ISSN 164-6037 Krzysztof WRÓBEL, Rafał DOROZ * fingerprint, reference point, IPAN99 NEW METHOD FOR FINDING A REFERENCE POINT IN FINGERPRINT IMAGES
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
TR-IIS-05-021 Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu, Pangfeng Liu, Da-Wei Wang, Jan-Jan Wu December 2005 Technical Report No. TR-IIS-05-021 http://www.iis.sinica.eu.tw/lib/techreport/tr2005/tr05.html
More informationClassifying Facial Expression with Radial Basis Function Networks, using Gradient Descent and K-means
Classifying Facial Expression with Raial Basis Function Networks, using Graient Descent an K-means Neil Allrin Department of Computer Science University of California, San Diego La Jolla, CA 9237 nallrin@cs.ucs.eu
More informationSkyline Community Search in Multi-valued Networks
Syline Community Search in Multi-value Networs Rong-Hua Li Beijing Institute of Technology Beijing, China lironghuascut@gmail.com Jeffrey Xu Yu Chinese University of Hong Kong Hong Kong, China yu@se.cuh.eu.h
More informationImage Segmentation using K-means clustering and Thresholding
Image Segmentation using Kmeans clustering an Thresholing Preeti Panwar 1, Girhar Gopal 2, Rakesh Kumar 3 1M.Tech Stuent, Department of Computer Science & Applications, Kurukshetra University, Kurukshetra,
More informationGeneralized Edge Coloring for Channel Assignment in Wireless Networks
Generalize Ege Coloring for Channel Assignment in Wireless Networks Chun-Chen Hsu Institute of Information Science Acaemia Sinica Taipei, Taiwan Da-wei Wang Jan-Jan Wu Institute of Information Science
More informationTHE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE
БСУ Международна конференция - 2 THE BAYESIAN RECEIVER OPERATING CHARACTERISTIC CURVE AN EFFECTIVE APPROACH TO EVALUATE THE IDS PERFORMANCE Evgeniya Nikolova, Veselina Jecheva Burgas Free University Abstract:
More informationMulti-camera tracking algorithm study based on information fusion
International Conference on Avance Electronic Science an Technolog (AEST 016) Multi-camera tracking algorithm stu base on information fusion a Guoqiang Wang, Shangfu Li an Xue Wen School of Electronic
More informationA Novel Density Based Clustering Algorithm by Incorporating Mahalanobis Distance
Receive: November 20, 2017 121 A Novel Density Base Clustering Algorithm by Incorporating Mahalanobis Distance Margaret Sangeetha 1 * Velumani Paikkaramu 2 Rajakumar Thankappan Chellan 3 1 Department of
More informationMultilevel Linear Dimensionality Reduction using Hypergraphs for Data Analysis
Multilevel Linear Dimensionality Reuction using Hypergraphs for Data Analysis Haw-ren Fang Department of Computer Science an Engineering University of Minnesota; Minneapolis, MN 55455 hrfang@csumneu ABSTRACT
More informationA PSO Optimized Layered Approach for Parametric Clustering on Weather Dataset
Vol.3, Issue.1, Jan-Feb. 013 pp-504-508 ISSN: 49-6645 A PSO Optimize Layere Approach for Parametric Clustering on Weather Dataset Shikha Verma, 1 Kiran Jyoti 1 Stuent, Guru Nanak Dev Engineering College
More informationCONSTRUCTION AND ANALYSIS OF INVERSIONS IN S 2 AND H 2. Arunima Ray. Final Paper, MATH 399. Spring 2008 ABSTRACT
CONSTUCTION AN ANALYSIS OF INVESIONS IN S AN H Arunima ay Final Paper, MATH 399 Spring 008 ASTACT The construction use to otain inversions in two-imensional Eucliean space was moifie an applie to otain
More informationParticle Swarm Optimization Based on Smoothing Approach for Solving a Class of Bi-Level Multiobjective Programming Problem
BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 17, No 3 Sofia 017 Print ISSN: 1311-970; Online ISSN: 1314-4081 DOI: 10.1515/cait-017-0030 Particle Swarm Optimization Base
More informationAlmost Disjunct Codes in Large Scale Multihop Wireless Network Media Access Control
Almost Disjunct Coes in Large Scale Multihop Wireless Network Meia Access Control D. Charles Engelhart Anan Sivasubramaniam Penn. State University University Park PA 682 engelhar,anan @cse.psu.eu Abstract
More informationA Cost Model For Nearest Neighbor Search. High-Dimensional Data Space
A Cost Moel For Nearest Neighbor Search in High-Dimensional Data Space Stefan Berchtol University of Munich Germany berchtol@informatikuni-muenchene Daniel A Keim University of Munich Germany keim@informatikuni-muenchene
More information10. WAVE OPTICS ONE MARK QUESTIONS
1 10. WAVE OPTICS ONE MARK QUESTIONS 1. Define wavefront.. What is the shape of wavefront obtaine from a point source at a (i) small istance (ii) large istance? 3. Uner what conitions a cylinrical wavefront
More informationModifying ROC Curves to Incorporate Predicted Probabilities
Moifying ROC Curves to Incorporate Preicte Probabilities Cèsar Ferri DSIC, Universitat Politècnica e València Peter Flach Department of Computer Science, University of Bristol José Hernánez-Orallo DSIC,
More informationAPPLYING GENETIC ALGORITHM IN QUERY IMPROVEMENT PROBLEM. Abdelmgeid A. Aly
International Journal "Information Technologies an Knowlege" Vol. / 2007 309 [Project MINERVAEUROPE] Project MINERVAEUROPE: Ministerial Network for Valorising Activities in igitalisation -
More informationWLAN Indoor Positioning Based on Euclidean Distances and Fuzzy Logic
WLAN Inoor Positioning Base on Eucliean Distances an Fuzzy Logic Anreas TEUBER, Bern EISSFELLER Institute of Geoesy an Navigation, University FAF, Munich, Germany, e-mail: (anreas.teuber, bern.eissfeller)@unibw.e
More informationUsing Vector and Raster-Based Techniques in Categorical Map Generalization
Thir ICA Workshop on Progress in Automate Map Generalization, Ottawa, 12-14 August 1999 1 Using Vector an Raster-Base Techniques in Categorical Map Generalization Beat Peter an Robert Weibel Department
More informationShift-map Image Registration
Shift-map Image Registration Svärm, Linus; Stranmark, Petter Unpublishe: 2010-01-01 Link to publication Citation for publishe version (APA): Svärm, L., & Stranmark, P. (2010). Shift-map Image Registration.
More informationCoupling the User Interfaces of a Multiuser Program
Coupling the User Interfaces of a Multiuser Program PRASUN DEWAN University of North Carolina at Chapel Hill RAJIV CHOUDHARY Intel Corporation We have evelope a new moel for coupling the user-interfaces
More informationData Mining: Concepts and Techniques. Chapter 7. Cluster Analysis. Examples of Clustering Applications. What is Cluster Analysis?
Data Mining: Concepts an Techniques Chapter Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign www.cs.uiuc.eu/~hanj Jiawei Han an Micheline Kamber, All rights reserve
More informationfiltering LETTER An Improved Neighbor Selection Algorithm in Collaborative Taek-Hun KIM a), Student Member and Sung-Bong YANG b), Nonmember
107 IEICE TRANS INF & SYST, VOLE88 D, NO5 MAY 005 LETTER An Improve Neighbor Selection Algorithm in Collaborative Filtering Taek-Hun KIM a), Stuent Member an Sung-Bong YANG b), Nonmember SUMMARY Nowaays,
More informationCluster Center Initialization Method for K-means Algorithm Over Data Sets with Two Clusters
Available online at www.scienceirect.com Proceia Engineering 4 (011 ) 34 38 011 International Conference on Avances in Engineering Cluster Center Initialization Metho for K-means Algorithm Over Data Sets
More informationNon-homogeneous Generalization in Privacy Preserving Data Publishing
Non-homogeneous Generalization in Privacy Preserving Data Publishing W. K. Wong, Nios Mamoulis an Davi W. Cheung Department of Computer Science, The University of Hong Kong Pofulam Roa, Hong Kong {wwong2,nios,cheung}@cs.hu.h
More informationA shortest path algorithm in multimodal networks: a case study with time varying costs
A shortest path algorithm in multimoal networks: a case stuy with time varying costs Daniela Ambrosino*, Anna Sciomachen* * Department of Economics an Quantitative Methos (DIEM), University of Genoa Via
More informationFINDING OPTICAL DISPERSION OF A PRISM WITH APPLICATION OF MINIMUM DEVIATION ANGLE MEASUREMENT METHOD
Warsaw University of Technology Faculty of Physics Physics Laboratory I P Joanna Konwerska-Hrabowska 6 FINDING OPTICAL DISPERSION OF A PRISM WITH APPLICATION OF MINIMUM DEVIATION ANGLE MEASUREMENT METHOD.
More information1 Surprises in high dimensions
1 Surprises in high imensions Our intuition about space is base on two an three imensions an can often be misleaing in high imensions. It is instructive to analyze the shape an properties of some basic
More information6 Gradient Descent. 6.1 Functions
6 Graient Descent In this topic we will iscuss optimizing over general functions f. Typically the function is efine f : R! R; that is its omain is multi-imensional (in this case -imensional) an output
More informationRandom Clustering for Multiple Sampling Units to Speed Up Run-time Sample Generation
DEIM Forum 2018 I4-4 Abstract Ranom Clustering for Multiple Sampling Units to Spee Up Run-time Sample Generation uzuru OKAJIMA an Koichi MARUAMA NEC Solution Innovators, Lt. 1-18-7 Shinkiba, Koto-ku, Tokyo,
More informationA Cost Model for Query Processing in High-Dimensional Data Spaces
A Cost Moel for Query Processing in High-Dimensional Data Spaces Christian Böhm Luwig Maximilians Universität München This is a preliminary release of an article accepte by ACM Transactions on Database
More informationStudy of Network Optimization Method Based on ACL
Available online at www.scienceirect.com Proceia Engineering 5 (20) 3959 3963 Avance in Control Engineering an Information Science Stuy of Network Optimization Metho Base on ACL Liu Zhian * Department
More informationClassical Mechanics Examples (Lagrange Multipliers)
Classical Mechanics Examples (Lagrange Multipliers) Dipan Kumar Ghosh Physics Department, Inian Institute of Technology Bombay Powai, Mumbai 400076 September 3, 015 1 Introuction We have seen that the
More informationA FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES
A FUZZY FRAMEWORK FOR SEGMENTATION, FEATURE MATCHING AND RETRIEVAL OF BRAIN MR IMAGES Archana.S 1 an Srihar.S 2 1 Department of Information Science an Technology, College of Engineering, Guiny archana.santhira@gmail.com
More informationBlind Data Classification using Hyper-Dimensional Convex Polytopes
Blin Data Classification using Hyper-Dimensional Convex Polytopes Brent T. McBrie an Gilbert L. Peterson Department of Electrical an Computer Engineering Air Force Institute of Technology 9 Hobson Way
More informationThe Reconstruction of Graphs. Dhananjay P. Mehendale Sir Parashurambhau College, Tilak Road, Pune , India. Abstract
The Reconstruction of Graphs Dhananay P. Mehenale Sir Parashurambhau College, Tila Roa, Pune-4030, Inia. Abstract In this paper we iscuss reconstruction problems for graphs. We evelop some new ieas lie
More informationKinematic Analysis of a Family of 3R Manipulators
Kinematic Analysis of a Family of R Manipulators Maher Baili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S. 6597 1, rue e la Noë, BP 92101,
More informationChapter 35 Homework (due 12/03/13)!!
Chapter 35 Homework (ue 12/03/13) 35.6 35.20 35.21 35.29 35.51 35.55 35.66 35.68 page 1 Problem 35.6 An unerwater scuba iver sees the sun an in the parent angle of 45 above the horizontal. What is the
More informationAdjacency Matrix Based Full-Text Indexing Models
1000-9825/2002/13(10)1933-10 2002 Journal of Software Vol.13, No.10 Ajacency Matrix Base Full-Text Inexing Moels ZHOU Shui-geng 1, HU Yun-fa 2, GUAN Ji-hong 3 1 (Department of Computer Science an Engineering,
More informationBends, Jogs, And Wiggles for Railroad Tracks and Vehicle Guide Ways
Ben, Jogs, An Wiggles for Railroa Tracks an Vehicle Guie Ways Louis T. Klauer Jr., PhD, PE. Work Soft 833 Galer Dr. Newtown Square, PA 19073 lklauer@wsof.com Preprint, June 4, 00 Copyright 00 by Louis
More informationOverlap Interval Partition Join
Overlap Interval Partition Join Anton Dignös Department of Computer Science University of Zürich, Switzerlan aignoes@ifi.uzh.ch Michael H. Böhlen Department of Computer Science University of Zürich, Switzerlan
More informationImage compression predicated on recurrent iterated function systems
2n International Conference on Mathematics & Statistics 16-19 June, 2008, Athens, Greece Image compression preicate on recurrent iterate function systems Chol-Hui Yun *, Metzler W. a an Barski M. a * Faculty
More informationd 3 d 4 d d d d d d d d d d d 1 d d d d d d
Proceeings of the IASTED International Conference Software Engineering an Applications (SEA') October 6-, 1, Scottsale, Arizona, USA AN OBJECT-ORIENTED APPROACH FOR MANAGING A NETWORK OF DATABASES Shu-Ching
More informationFeature Extraction and Rule Classification Algorithm of Digital Mammography based on Rough Set Theory
Feature Extraction an Rule Classification Algorithm of Digital Mammography base on Rough Set Theory Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative
More informationSURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH
SURVIVABLE IP OVER WDM: GUARANTEEEING MINIMUM NETWORK BANDWIDTH Galen H Sasaki Dept Elec Engg, U Hawaii 2540 Dole Street Honolul HI 96822 USA Ching-Fong Su Fuitsu Laboratories of America 595 Lawrence Expressway
More informationFast Window Based Stereo Matching for 3D Scene Reconstruction
The International Arab Journal of Information Technology, Vol. 0, No. 3, May 203 209 Fast Winow Base Stereo Matching for 3D Scene Reconstruction Mohamma Mozammel Chowhury an Mohamma AL-Amin Bhuiyan Department
More informationTHE APPLICATION OF ARTICLE k-th SHORTEST TIME PATH ALGORITHM
International Journal of Physics an Mathematical Sciences ISSN: 2277-2111 (Online) 2016 Vol. 6 (1) January-March, pp. 24-6/Mao an Shi. THE APPLICATION OF ARTICLE k-th SHORTEST TIME PATH ALGORITHM Hua Mao
More informationHere are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.
Preface Here are my online notes for my Calculus I course that I teach here at Lamar University. Despite the fact that these are my class notes, they shoul be accessible to anyone wanting to learn Calculus
More informationQueueing Model and Optimization of Packet Dropping in Real-Time Wireless Sensor Networks
Queueing Moel an Optimization of Packet Dropping in Real-Time Wireless Sensor Networks Marc Aoun, Antonios Argyriou, Philips Research, Einhoven, 66AE, The Netherlans Department of Computer an Communication
More informationClustering using Particle Swarm Optimization. Nuria Gómez Blas, Octavio López Tolic
24 International Journal Information Theories an Applications, Vol. 23, Number 1, (c) 2016 Clustering using Particle Swarm Optimization Nuria Gómez Blas, Octavio López Tolic Abstract: Data clustering has
More informationShort-term prediction of photovoltaic power based on GWPA - BP neural network model
Short-term preiction of photovoltaic power base on GWPA - BP neural networ moel Jian Di an Shanshan Meng School of orth China Electric Power University, Baoing. China Abstract In recent years, ue to China's
More informationTexture Recognition with combined GLCM, Wavelet and Rotated Wavelet Features
International Journal of Computer an Electrical Engineering, Vol.3, No., February, 793-863 Texture Recognition with combine GLCM, Wavelet an Rotate Wavelet Features Dipankar Hazra Abstract Aim of this
More informationShift-map Image Registration
Shift-map Image Registration Linus Svärm Petter Stranmark Centre for Mathematical Sciences, Lun University {linus,petter}@maths.lth.se Abstract Shift-map image processing is a new framework base on energy
More informationA Plane Tracker for AEC-automation Applications
A Plane Tracker for AEC-automation Applications Chen Feng *, an Vineet R. Kamat Department of Civil an Environmental Engineering, University of Michigan, Ann Arbor, USA * Corresponing author (cforrest@umich.eu)
More informationClustering Expression Data. Clustering Expression Data
www.cs.washington.eu/ Subscribe if you Din t get msg last night Clustering Exression Data Why cluster gene exression ata? Tissue classification Fin biologically relate genes First ste in inferring regulatory
More informationRough Set Approach for Classification of Breast Cancer Mammogram Images
Rough Set Approach for Classification of Breast Cancer Mammogram Images Aboul Ella Hassanien Jafar M. H. Ali. Kuwait University, Faculty of Aministrative Science, Quantitative Methos an Information Systems
More information5th International Conference on Advanced Design and Manufacturing Engineering (ICADME 2015)
5th International Conference on Avance Design an Manufacturing Engineering (ICADME 25) Research on motion characteristics an application of multi egree of freeom mechanism base on R-W metho Xiao-guang
More informationA Classification of 3R Orthogonal Manipulators by the Topology of their Workspace
A Classification of R Orthogonal Manipulators by the Topology of their Workspace Maher aili, Philippe Wenger an Damien Chablat Institut e Recherche en Communications et Cybernétique e Nantes, UMR C.N.R.S.
More informationDual Arm Robot Research Report
Dual Arm Robot Research Report Analytical Inverse Kinematics Solution for Moularize Dual-Arm Robot With offset at shouler an wrist Motivation an Abstract Generally, an inustrial manipulator such as PUMA
More informationFigure 1: 2D arm. Figure 2: 2D arm with labelled angles
2D Kinematics Consier a robotic arm. We can sen it commans like, move that joint so it bens at an angle θ. Once we ve set each joint, that s all well an goo. More interesting, though, is the question of
More informationImplicit and Explicit Functions
60_005.q //0 :5 PM Page SECTION.5 Implicit Differentiation Section.5 EXPLORATION Graphing an Implicit Equation How coul ou use a graphing utilit to sketch the graph of the equation? Here are two possible
More informationA new fuzzy visual servoing with application to robot manipulator
2005 American Control Conference June 8-10, 2005. Portlan, OR, USA FrA09.4 A new fuzzy visual servoing with application to robot manipulator Marco A. Moreno-Armenariz, Wen Yu Abstract Many stereo vision
More informationFast Fractal Image Compression using PSO Based Optimization Techniques
Fast Fractal Compression using PSO Base Optimization Techniques A.Krishnamoorthy Visiting faculty Department Of ECE University College of Engineering panruti rishpci89@gmail.com S.Buvaneswari Visiting
More informationObject Recognition Using Colour, Shape and Affine Invariant Ratios
Object Recognition Using Colour, Shape an Affine Invariant Ratios Paul A. Walcott Centre for Information Engineering City University, Lonon EC1V 0HB, Englan P.A.Walcott@city.ac.uk Abstract This paper escribes
More informationEstimating Velocity Fields on a Freeway from Low Resolution Video
Estimating Velocity Fiels on a Freeway from Low Resolution Vieo Young Cho Department of Statistics University of California, Berkeley Berkeley, CA 94720-3860 Email: young@stat.berkeley.eu John Rice Department
More informationClassification and clustering methods for documents. by probabilistic latent semantic indexing model
A Short Course at amang University aipei, aiwan, R.O.C., March 7-9, 2006 Classification an clustering methos for ocuments by probabilistic latent semantic inexing moel Shigeichi Hirasawa Department of
More informationAdvanced method of NC programming for 5-axis machining
Available online at www.scienceirect.com Proceia CIRP (0 ) 0 07 5 th CIRP Conference on High Performance Cutting 0 Avance metho of NC programming for 5-axis machining Sergej N. Grigoriev a, A.A. Kutin
More informationSolution Representation for Job Shop Scheduling Problems in Ant Colony Optimisation
Solution Representation for Job Shop Scheuling Problems in Ant Colony Optimisation James Montgomery, Carole Faya 2, an Sana Petrovic 2 Faculty of Information & Communication Technologies, Swinburne University
More informationCharacterizing Decoding Robustness under Parametric Channel Uncertainty
Characterizing Decoing Robustness uner Parametric Channel Uncertainty Jay D. Wierer, Wahee U. Bajwa, Nigel Boston, an Robert D. Nowak Abstract This paper characterizes the robustness of ecoing uner parametric
More informationImproving Performance of Sparse Matrix-Vector Multiplication
Improving Performance of Sparse Matrix-Vector Multiplication Ali Pınar Michael T. Heath Department of Computer Science an Center of Simulation of Avance Rockets University of Illinois at Urbana-Champaign
More informationHandling missing values in kernel methods with application to microbiology data
an Machine Learning. Bruges (Belgium), 24-26 April 2013, i6oc.com publ., ISBN 978-2-87419-081-0. Available from http://www.i6oc.com/en/livre/?gcoi=28001100131010. Hanling missing values in kernel methos
More informationA New Search Algorithm for Solving Symmetric Traveling Salesman Problem Based on Gravity
Worl Applie Sciences Journal 16 (10): 1387-1392, 2012 ISSN 1818-4952 IDOSI Publications, 2012 A New Search Algorithm for Solving Symmetric Traveling Salesman Problem Base on Gravity Aliasghar Rahmani Hosseinabai,
More informationOffloading Cellular Traffic through Opportunistic Communications: Analysis and Optimization
1 Offloaing Cellular Traffic through Opportunistic Communications: Analysis an Optimization Vincenzo Sciancalepore, Domenico Giustiniano, Albert Banchs, Anreea Picu arxiv:1405.3548v1 [cs.ni] 14 May 24
More informationLoop Scheduling and Partitions for Hiding Memory Latencies
Loop Scheuling an Partitions for Hiing Memory Latencies Fei Chen Ewin Hsing-Mean Sha Dept. of Computer Science an Engineering University of Notre Dame Notre Dame, IN 46556 Email: fchen,esha @cse.n.eu Tel:
More informationEvolutionary Optimisation Methods for Template Based Image Registration
Evolutionary Optimisation Methos for Template Base Image Registration Lukasz A Machowski, Tshilizi Marwala School of Electrical an Information Engineering University of Witwatersran, Johannesburg, South
More informationComparison of Methods for Increasing the Performance of a DUA Computation
Comparison of Methos for Increasing the Performance of a DUA Computation Michael Behrisch, Daniel Krajzewicz, Peter Wagner an Yun-Pang Wang Institute of Transportation Systems, German Aerospace Center,
More informationAdditional Divide and Conquer Algorithms. Skipping from chapter 4: Quicksort Binary Search Binary Tree Traversal Matrix Multiplication
Aitional Divie an Conquer Algorithms Skipping from chapter 4: Quicksort Binary Search Binary Tree Traversal Matrix Multiplication Divie an Conquer Closest Pair Let s revisit the closest pair problem. Last
More informationDivide-and-Conquer Algorithms
Supplment to A Practical Guie to Data Structures an Algorithms Using Java Divie-an-Conquer Algorithms Sally A Golman an Kenneth J Golman Hanout Divie-an-conquer algorithms use the following three phases:
More informationDense Disparity Estimation in Ego-motion Reduced Search Space
Dense Disparity Estimation in Ego-motion Reuce Search Space Luka Fućek, Ivan Marković, Igor Cvišić, Ivan Petrović University of Zagreb, Faculty of Electrical Engineering an Computing, Croatia (e-mail:
More informationA Framework for Dialogue Detection in Movies
A Framework for Dialogue Detection in Movies Margarita Kotti, Constantine Kotropoulos, Bartosz Ziólko, Ioannis Pitas, an Vassiliki Moschou Department of Informatics, Aristotle University of Thessaloniki
More informationLearning to Rank: A New Technology for Text Processing
TFANT 07 Tokyo Univ. March 2, 2007 Learning to Rank: A New Technology for Text Processing Hang Li Microsoft Research Asia Talk Outline What is Learning to Rank? Ranking SVM Definition Search Ranking SVM
More informationOn the Placement of Internet Taps in Wireless Neighborhood Networks
1 On the Placement of Internet Taps in Wireless Neighborhoo Networks Lili Qiu, Ranveer Chanra, Kamal Jain, Mohamma Mahian Abstract Recently there has emerge a novel application of wireless technology that
More informationLecture 1 September 4, 2013
CS 84r: Incentives an Information in Networks Fall 013 Prof. Yaron Singer Lecture 1 September 4, 013 Scribe: Bo Waggoner 1 Overview In this course we will try to evelop a mathematical unerstaning for the
More informationArchitecture Design of Mobile Access Coordinated Wireless Sensor Networks
Architecture Design of Mobile Access Coorinate Wireless Sensor Networks Mai Abelhakim 1 Leonar E. Lightfoot Jian Ren 1 Tongtong Li 1 1 Department of Electrical & Computer Engineering, Michigan State University,
More informationRobust PIM-SM Multicasting using Anycast RP in Wireless Ad Hoc Networks
Robust PIM-SM Multicasting using Anycast RP in Wireless A Hoc Networks Jaewon Kang, John Sucec, Vikram Kaul, Sunil Samtani an Mariusz A. Fecko Applie Research, Telcoria Technologies One Telcoria Drive,
More informationWilliam S. Law. Erik K. Antonsson. Engineering Design Research Laboratory. California Institute of Technology. Abstract
Optimization Methos for Calculating Design Imprecision y William S. Law Eri K. Antonsson Engineering Design Research Laboratory Division of Engineering an Applie Science California Institute of Technology
More informationA Versatile Model-Based Visibility Measure for Geometric Primitives
A Versatile Moel-Base Visibility Measure for Geometric Primitives Marc M. Ellenrieer 1,LarsKrüger 1, Dirk Stößel 2, an Marc Hanheie 2 1 DaimlerChrysler AG, Research & Technology, 89013 Ulm, Germany 2 Faculty
More informationA Multi-class SVM Classifier Utilizing Binary Decision Tree
Informatica 33 (009) 33-41 33 A Multi-class Classifier Utilizing Binary Decision Tree Gjorgji Mazarov, Dejan Gjorgjevikj an Ivan Chorbev Department of Computer Science an Engineering Faculty of Electrical
More informationCS 106 Winter 2016 Craig S. Kaplan. Module 01 Processing Recap. Topics
CS 106 Winter 2016 Craig S. Kaplan Moule 01 Processing Recap Topics The basic parts of speech in a Processing program Scope Review of syntax for classes an objects Reaings Your CS 105 notes Learning Processing,
More informationText Particles Multi-band Fusion for Robust Text Detection
Text Particles Multi-ban Fusion for Robust Text Detection Pengfei Xu, Rongrong Ji, Hongxun Yao, Xiaoshuai Sun, Tianqiang Liu, an Xianming Liu School of Computer Science an Engineering Harbin Institute
More informationA Highly Scalable Parallel Boundary Element Method for Capacitance Extraction
A Highly Scalable Parallel Bounary Element Metho for Capacitance Extraction The MIT Faculty has mae this article openly available. Please share how this access benefits you. Your story matters. Citation
More informationCellular Ants: Combining Ant-Based Clustering with Cellular Automata
Cellular Ants: Combining Ant-Base Clustering with Cellular Automata Anrew Vane Moere & Justin James Clayen Key Centre of Design Computing & Cognition, University of Syney, Australia {anrew,justin}@arch.usy.eu.au
More informationLearning Polynomial Functions. by Feature Construction
I Proceeings of the Eighth International Workshop on Machine Learning Chicago, Illinois, June 27-29 1991 Learning Polynomial Functions by Feature Construction Richar S. Sutton GTE Laboratories Incorporate
More informationA Neural Network Model Based on Graph Matching and Annealing :Application to Hand-Written Digits Recognition
ITERATIOAL JOURAL OF MATHEMATICS AD COMPUTERS I SIMULATIO A eural etwork Moel Base on Graph Matching an Annealing :Application to Han-Written Digits Recognition Kyunghee Lee Abstract We present a neural
More informationk-nn Graph Construction: a Generic Online Approach
k-nn Graph Construction: a Generic Online Approach Wan-Lei Zhao arxiv:80.00v [cs.ir] Sep 08 Abstract Nearest neighbor search an k-nearest neighbor graph construction are two funamental issues arise from
More informationKeywords Data compression, image processing, NCD, Kolmogorov complexity, JPEG, ZIP
Volume 3, Issue 7, July 203 ISSN: 2277 28X International Journal of Avance Research in Computer Science an Software Engineering Research Paper Available online at: wwwijarcssecom Compression Techniques
More informationarxiv: v1 [math.co] 15 Dec 2017
Rectilinear Crossings in Complete Balance -Partite -Uniform Hypergraphs Rahul Gangopahyay Saswata Shannigrahi arxiv:171.05539v1 [math.co] 15 Dec 017 December 18, 017 Abstract In this paper, we stuy the
More informationA half-scan error reduction based algorithm for cone-beam CT
Journal of X-Ray Science an Technology 12 (2004) 73 82 73 IOS Press A half-scan error reuction base algorithm for cone-beam CT Kai Zeng a, Zhiqiang Chen a, Li Zhang a an Ge Wang a,b a Department of Engineering
More information