A hybrid unsupervised and supervised clustering applied to microarray data

Size: px
Start display at page:

Download "A hybrid unsupervised and supervised clustering applied to microarray data"

Transcription

1 A hybrid usupervised ad supervised clusterig applied to microarray data Raul Măluţa, Pedro Gómez Vilda, Moica Borda Abstract Clusterig methods have bee ofte applied to large data with the mai purpose of reducig the dimesio, time computatio ad idetifyig clusters with similar behavior. This work presets a state-of-the-art i usupervised clusterig ad cluster validatio. It proposes a method for hybrid bi-clusterig of microarray data combied with a supervised validatio for determiig the optimal amout of clusters of gees. Keywords microarray, clusterig, iteral validatio, exteral validatio, gee shavig. I. INTRODUCTION The microarray data processig challeges owadays cosists of how to make it more reliable, easy to use ad efficiet. I this task other fields of kowledge as Sigal ad Image Processig, Patter Recogitio, or Statistical Data Aalysis [1] may help i yieldig their eormous potetial i solvig problems as microarray image ehacemet, segmetatio, correctio, griddig, data aalysis, reliable expressio estimatio i relatio with hybridizatio dyamics, etc. Others have to see with data iterpretatio, dimesioality reductio, cluster aalysis, etc. Clusterig techiques play a importat role i aalyzig high dimesioal data such as microarray data. A aalysis of microarray data is a search for gees that have similar, correlated patters of expressio. This idicates that some of the data might cotai redudat iformatio. For example, if a group of experimets were more closely related tha it was expected, some of the redudat experimets ca be igored, or use some average of the iformatio without loss of iformatio. Data aalysis methods [2] ca be grouped i two categories: supervised ad usupervised. I the Mauscript received Ortober 28, 2012, revised March 04, This work was supported by the project "Developmet ad support of multidiscipliary postdoctoral programmes i major techical areas of atioal strategy of Research - Developmet - Iovatio" 4D-POSTDOC, cotract o. POSDRU/89/1.5/S/52603, project co-fuded by the Europea Social Fud through Sectoral Operatioal Programme Huma Resources Developmet R. Maluta is with the Commuicatios Departmet, Techical Uiversity of Cluj-Napoca, George Baritiu St., Cluj-Napoca, Romaia, (phoe: ; fax: ; raul.maluta@com.utcluj.ro). P. Gómez Vilda is with Departameto de Arquitectura y Tecología de Sistemas Iformáticos (DATSI), Uiversidad Politécica de Madrid, Campus de Motegacedo, s/, 28660, Boadilla del Mote, Madrid, Spai ( pedro@pio.datsi.fi.upm.es). M. Borda is with the Commuicatios Departmet, Techical Uiversity of Cluj-Napoca, George Baritiu St., Cluj-Napoca, Romaia ( Moica.Borda@com.utcluj.ro). doi: /ijates.v2i3.21 usupervised approach, also kow as clusterig, data is orgaized without a priori iformatio. This paper describes several usupervised algorithms used mostly for microarray data, like k-meas, Partitioig Aroud Medoids (PAM) ad Expectatio-Maximizatio (EM). These algorithms have prove to be a useful whe the umber of clusters is kow or ca be estimated. Two of them, k-meas ad PAM, are based o miimizig the mea squared error, while the third, EM, o mixture modelig. The algorithms were ru o several data sets [3], observig that the quality of the obtaied clusters is depedet o the umber of clusters specified. To assess the effectiveess of these algorithms, but also to estimate the actual umber of clusters, several clusterig validatio method were implemeted. These methods cosist i calculatig iteral ad exteral idices used to estimate the optimal umber of clusters for each algorithm. The microarray data has a particularity compared with other type of data. A two dimesio data is i fact a gee expressio matrix which usually has the rows correspodig to gees from a experimet ad the colums correspodig to differet experimets. If oe fids that two rows are similar, it ca be assumed that the gees correspodig to the rows are co-regulated ad fuctioally related, ad by comparig two colums it ca foud which gees are differetially expressed i each experimet. To perform a compariso a similarity measure betwee the objects, gees or experimets uder compariso, has to be used. Mostly the same algorithm is used for aalyzig both the gees ad the experimets, i.e. bi-clusterig of microarray data. Cosiderig the dimesioality of the data, large amout of gees ad small umber of experimets, we propose i this paper a hybrid bi-clusterig, where we combie usupervised methods with the purpose of obtaiig the optimal combiatio of clusters. Besides, as a supplemetary validatio, a supervised method was used o the same data. Supervised classificatio represets the issue of idetifyig the subset to which ew observatios belog, where the idetity of the subset is ukow, o the basis of a traiig set of data cotaiig observatios whose subset is kow. Therefore the classificatio will display a variable behavior which ca be aalyzed by statistics. It is required for ew sample items to be placed ito the respective groups based o quatitative iformatio o oe or more measuremets, attributes or features ad based o the traiig set i which previously decided groupigs are already established. 99

2 II. CLUSTER ANALYSIS A. Usupervised algorithms The usupervised clusterig methods are divided i two categories: hierarchical ad o-hierarchical. Hierarchical methods group the objects i a iterative way, geeratig a hierarchical tree structure, also kow as dedogram. O the other had, the o-hierarchical clusterig, or partitioig methods, does the partitioig of a data set ito a predefied umber of differet clusters, without a hierarchical structure. Beside, these algorithms produce a iteger umber of partitios, ad also they optimize a certai criterio fuctio. The partitioig methods classify the data i k clusters which must fulfill the followig coditios: each group should cotai at least oe elemet; each object must belog to oe group oly. For the microarray data the most suitable clusterig methods are usupervised oes, because we caot observe the (real) umber of clusters i the data [4]. From the usupervised algorithms used i microarray data aalysis the k-meas, PAM (Partitioig Aroud Medoids) ad EM (Expectatio Maximizatio) are the most frequetly used. The first two from the list are usig the miimizatio of the root mea square error ad the third oe is usig the mixig models methods. The PAM algorithm [2] is based o the search for k represetative objects or medoids amog the objects of the dataset. These objects should represet the structure of the data. After fidig a set of k medoids, k clusters are costructed by assigig each object to the earest medoid. The goal is to fid k represetative objects which miimize the sum of the dissimilarities of the objects to their closest represetative object. The algorithm first looks for a good iitial set of medoids. The it fids a local miimum for the objective fuctio, that is, a solutio such that there is o sigle switch of a object with a medoid that will decrease the objective. The k-meas algorithm [5], a usupervised learig algorithm, has bee used to form clusters of gees i gee expressio data aalysis. The algorithm takes the umber of clusters (k) to be calculated as a iput. The umber of clusters is usually chose by the user. The procedure for k- meas clusterig is as follows: 1. First, the user tries to estimate the umber of clusters. 2. Radomly choose N poits ito k clusters. 3. Calculate the cetroid for each cluster. 4. For each poit, move it to the closest cluster. 5. Repeat steps 3 ad 4 util o further poits are moved to differet clusters. The Expectatio-Maximizatio (EM) algorithm [6] is a method for fidig maximum likelihood estimates of parameters i statistical models, where the model depeds o uobserved latet variables. This is a geeral method for optimizig likelihood fuctios ad is useful i situatios where data might be missig or simpler optimizatio methods fail. B. Usupervised clusterig validatio Clusterig validatio is a techique to fid a set of clusters that best fits atural partitios, i.e. umber of clusters, without ay class iformatio. There are two types of clusterig techiques [7], [8]: exteral validatio, based o previous kowledge about data ad iteral validatio, based o the iformatio itrisic to the data aloe. There are three exteral idexes which were used i our previous study [3], Rad idex, Jaccard coefficiet, ad Fowlkes ad Mallows idex, ad they are goig to be used also i this paper. I cotrast to exteral validatio, iteral validatio evaluates the clusterig without ay a priori iformatio. The obtaied values are kow as iteral idexes as they are computed from the data used for clusterig. I this paper we evaluate for the microarray data the followig iteral idexes: silhouette, Caliski-Harabasz idex, Krzaowski- Lai idex, Hartiga idex ad Davies-Bouldi idex. Silhouette idex calculates the silhouette width for each sample, average silhouette width for each cluster ad overall average silhouette width for a total data set: S i i ai a i,bi b, (1) max where a(i) is the average dissimilarity of i-object to all other objects i the same cluster; b(i) is the miimum of average dissimilarity of i-object to all objects i other cluster. The largest overall average silhouette idicates the best clusterig. Caliski-Harabasz idex is defied by: CH k B W k / k / k 1, (2) k where k deotes the umber of clusters, ad B(k) ad W(k) deote the betwee ad withi cluster sums of squares of the partitio, respectively. A optimal umber of clusters is the defied as a value of k that maximizes CH(k). Krzaowski-Lai idex is give by the equatio: k k 1 DIFF KL k, (3) DIFF 2 / p 2 / p where DIFF k ( k 1) Wk 1 ( k ) Wk ad p deotes the umber of features i the data set. A value of k is optimal if it maximizes KL(k). Davies-Bouldi idex is a fuctio of the ratio of the sum of withi-cluster scatter to betwee-cluster separatio: 1 DB S max i1 i j Qi SQ j SQ, Q i j, (4) where is the umber of clusters, S is the average distace of all objects from the cluster to their cluster cetre, S(Q i,q j ) is the distace betwee clusters cetres. Cosequetly, Davies-Bouldi idex will have a small value for a good clusterig. For the review, the mauscript is submitted to the IJATES 2 Editorial Board oly electroically i PDF format. Do ot chage ay formattig; otherwise you udergo the risk to be directly rejected without review process. C. Gee Shavig The method of Gee Shavig is desiged to extract coheret ad typically small clusters of gees that vary as much as 100

3 possible across the samples. Accordig to [9], the algorithm cosists of the followig steps: 1. Start with the etire expressio matrix X, each row cetered to have zero mea 2. Compute the leadig pricipal compoet of the rows of X 3. Shave off the proportio (typically 10 %) of the gees havig smallest absolute ier-product with the leadig pricipal compoet 4. Repeat steps 2 ad 3 util oly oe gee remais 5. This produces a ested sequece of gee clusters, where S k deotes a cluster of k gees. Estimate the optimal cluster size usig the gap statistic [10]. 6. Orthogoalize each row of X with respect to, the average gee i 7. Repeat steps 1-5 above with the orthogoalized data, to fid the secod optimal cluster. This process is cotiued util a maximum of M clusters are foud, where M is chose a priori. Whe implemetig this method we made some chagigs compared with the steps form [9]. But I have made some chages i the algorithm. First of all, we shaved off ot α % gees, but 1 gee each time. This is because we will lose the precisio of the algorithm if usig α%. For example, supposig we remai with S k clusters with k beig 135, 122,..., 53, 47, etc. gees, ad accordig to the gap statistic step, the algorithm decides to have a cluster with 52 or 47 gees, which is ot correct. This is the reaso why we have decided to have clusters with..., 53, 52, 51, 50, 49, 48,... gees, i this way hopig to obtai a maximum Gap(k) closer to ideal 50. Also whe computig the gap statistics we have made some chages. If we cosider all possible permutatios ad after that fidig each D(k), the, i case of 150 gees with 4 characteristics of each gee, it is required to have all possible permutatios of the matrix by permutig the elemets withi each row. This meas that each row has from 4 parameters a umber of 24 possible permutatios, this implies that we have aother sets of matrices, which is too much (we caot take for example 3 rd permutatio from gee 1 with 3 rd permutatio for gee 2, 3 rd permutatio for gee 3, ad so o, because we obtai the same D(k)). We have tried to cosider radom matrices, a umber of 5000, but the problem i this case is that the result is varyig at every other aalysis ad they are also differet at each simulatio. Fially we have decided to use just the first iput matrix ad get its D(k) ad Gap(k), without ay permutatios. A. Usupervised clusterig III. DATA CLUSTERING Three usupervised algorithms were used to cluster microarray data. We used for our study two differet datasets from public Affymetrix databases. The first set was the Chowdary database [11] the authors compared pairs of sap-froze ad RNA later preservative-suspeded tissue from 62 lymph ode-egative breast tumors ad 42 colo tumors, with purpose of separatig them. The secod set [12] cotais 24 acute lymphoblastic leukemia (ALL), 28 acute myelogeous leukemia (AML) ad 20 mixed-lieage leukemia (MLL) samples. For the microarray data the clusterig was doe by a twoway clusterig or bi-clusterig [13] i which both the samples ad the gees are clustered i the same time usig the portioig method. Regardig the coclusios from [3] a useful classificatio was obtaied for microarray data whe EM clustered the gees ad k-meas the samples. I this study we will use the k-meas algorithm to cluster the samples ad the PAM ad respectively the EM algorithm to cluster the gees. Before combiig the algorithms we applied the clusterig validatio methods, both exteral ad iteral idexes. Also, based o these results we combied the usupervised method with a supervised oe. So we clustered the samples by k-meas algorithm ad we classify the gees usig the gee shavig method. I Table I, the umbers of clusters obtaied after usig the iteral ad exteral idexes are idicated. For the k- meas algorithm the umber refers to sample partitioig, while for the PAM ad EM algorithms the umbers refers to gee clusterig. I the EM clusterig validatio oly the exteral idexes were computed. TABLE I THE NUMBER OF CLUSTERS OBTAINED WITH THE CLUSTERING VALIDATION METHODS Chowdary database Leukemia database Idex k- PAM EM k- PAM EM meas meas Rad Jaccard Fowlkes- Mallows Silhouette Caliski- Harabasz Krzaowski -Lai Davies- Bouldi After the validatio was doe we applied the combied clusterig for the datasets. Thereby, for the Chowdary dataset the gees were clustered ito 2 groups, with the PAM method, Fig 2.a, ad the with the EM algorithm, Fig.2.b. The samples were clustered with the k-meas algorithm. The obtaied values were compared with the give values from the microarray databases ad a similarity betwee these values was observed. Fig. 1.a ad 1.b shows all the computed idexes for the Chowdary database, iteral ad exteral, i the case of PAM algorithm. For the exteral idexes the highest values obtaied gave the optimal umber of clusters. I the case of iteral idexes the optimal value was marked by a square i Fig. 1.b. Accordig with the optimal umber of clusters idicated by the validatio idexes, i the case of the leukemia dataset the clusterig was doe i 3 clusters. Thus the samples were group ito three sets by the k-meas algorithm, while the gees formed also three groups oce with the PAM method 101

4 Fig. 1. Values of the exteral idexes, iteral idexes determied for PAM clusterig of gees for Chowdary database Fig. 2. Microarray data clusterig by the usupervised algorithms: PAM clusterig, EM clusterig of the gees for the Chowdary dataset. Fig. 3.a Microarray data clusterig by the usupervised algorithms: PAM clusterig, EM clusterig of the gees for the leukemia dataset. 102

5 classify the Chowdary dataset i 2 clusters. The umber of gee i the first cluster was determied by usig the gap statistic method. The value of 89 gees was give by the maximum of the graph from Fig. 4. I a similar maer we computed the gap statistics for the leukemia dataset ad we were able to classify the data ito 3 clusters: the first oe with 37 gees, the secod oe with 21 gees ad the third oe with the remaiig 14 gees from a total of 72 gees. The values of the Gap fuctio for this dataset are show i Fig 5 ad 6. Fig. 4 The values of the Gap(k) fuctio for the Chowdary dataset. The maximum was obtaied for a umber of 89 gees. IV. CONCLUSION I this paper some usupervised ad supervised clusterig methods were applied to microarray dataset i order to obtai a bi-clusterig of the data with differet algorithms. The selectio of the PAM algorithm for clusterig the gees, despite of the k-meas method was doe because of the robustess of the first oe. The EM algorithm showed some disadvatages whe workig with large datasets that is way the data was previously filtered. Gee shavig, a supervised method was also combied with k-meas i order to classify the gees accordig with the umber of clusters give by the validatio methods. For cluster validatio we used both iteral ad exteral methods by computig some idexes which gave us the optimal umber of clusters for each clusterig method. As future work we take ito cosideratio the combiatio of these methods with other data miig algorithms like Idepedet Compoet Aalysis, which ca be used for large datasets. Fig. 5 The values of the Gap(k) fuctio for the leukemia dataset. For the first cluster the maximum was obtaied for a umber of 37 gees Fig. 6 The values of the Gap(k) fuctio for the leukemia dataset. For the secod cluster the maximum was obtaied for a umber of 21 gees ad the with the EM algorithm, as it ca be see i Fig. 3.a ad Fig. 3.b. For the gee shavig algorithm we used the same umber of clusters as the oe give by the validatio methods. I the case of Chowdary dataset the gees were classified i 2 sets, while i the case of leukemia dataset we applied gee shavig with 3 clusters. Oce applyig the gee shavig method we were able to REFERENCES [1] S. Gozález, L. Guerra, V. Robles, JM. Peña, F. Famili, CliDaPa: A ew approach to combiig cliical data with DNA microarrays Itelliget Data Aalysis Joural, vol 14(2), pp , 2010 [2] J. Ha, M. Kamber, Data miig: Cocepts ad techiques. Morga Kaufma, 2000 [3] R. Maluta, B. Belea, P.G. Vilda, M. Borda, Two way clusterig of microarray data usig a hybrid approach i Proc. of 34th It. Cof. o Telecommuicatios ad Sigal Processig, Budapest, 2011, pp [4] G. McLachla, K-A. Do, C. Ambroise, Aalyzig Microarray Gee Expressio Data. Wiley-Itersciece, 2004 [5] M.B. Zoubi, A. Hudaib, A. Hueiti, B. Hammo, New efficiet strategy to accelerate k-meas clusterig algorithm America Joural of Applied Scieces, vol 5(9), pp , 2008 [6] G. McLachla, T. Krisha, The EM Algorithm ad Extesios. Joh Willey & Sos 2008 [7] N. Bolshakova, F. Azuaje, Cluster validatio techiques for geome expressio data Sigal Processig vol 83, pp , 2002 [8] K. Wag, B. Wag, L. Peg, CVAP: Validatio for Cluster Aalyses Data Sciece Joural vol 8, pp , 2009 [9] T. Hastie et. al., Gee shavig as a method for idetifyig distict sets of gees with similar expressio patters, Geome Biology, vol. I(2), research 0003, pp. 1-21, 2000 [10] W. L. Martiez, A. R. Martiez, Exploratory Data Aalysis with MATLAB, CRC Press LLC, 2005 [11] D. Chowdary et al, Progostic gee expressio sigatures ca be measured i tissues collected i RNAlater preservative J Mol Diag vol 8(1), pp , 2006 [12] S. Armstrog et al, MLL traslocatios specify a distict gee expressio profile that distiguishes a uique leukemia Nature Geetics vol 30, pp , 2001 [13] S. C. Madeira, A. L. Oliveira, Biclusterig algorithms for biological data aalysis: A survey IEEE/ACM Trasactios o Computatioal Biology ad Bioiformatics vol 1 (1), pp ,

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today Admiistrative Fial project No office hours today UNSUPERVISED LEARNING David Kauchak CS 451 Fall 2013 Supervised learig Usupervised learig label label 1 label 3 model/ predictor label 4 label 5 Supervised

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le Fudametals of Media Processig Shi'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dih Le Today's topics Noparametric Methods Parze Widow k-nearest Neighbor Estimatio Clusterig Techiques k-meas Agglomerative Hierarchical

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

New Fuzzy Color Clustering Algorithm Based on hsl Similarity

New Fuzzy Color Clustering Algorithm Based on hsl Similarity IFSA-EUSFLAT 009 New Fuzzy Color Clusterig Algorithm Based o hsl Similarity Vasile Ptracu Departmet of Iformatics Techology Tarom Compay Bucharest Romaia Email: patrascu.v@gmail.com Abstract I this paper

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045 Oe Brookigs Drive St. Louis, Missouri 63130-4899, USA jaegerg@cse.wustl.edu

More information

New HSL Distance Based Colour Clustering Algorithm

New HSL Distance Based Colour Clustering Algorithm The 4th Midwest Artificial Itelligece ad Cogitive Scieces Coferece (MAICS 03 pp 85-9 New Albay Idiaa USA April 3-4 03 New HSL Distace Based Colour Clusterig Algorithm Vasile Patrascu Departemet of Iformatics

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations Applied Mathematical Scieces, Vol. 1, 2007, o. 25, 1203-1215 A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045, Oe

More information

Analysis of Documents Clustering Using Sampled Agglomerative Technique

Analysis of Documents Clustering Using Sampled Agglomerative Technique Aalysis of Documets Clusterig Usig Sampled Agglomerative Techique Omar H. Karam, Ahmed M. Hamad, ad Sheri M. Moussa Abstract I this paper a clusterig algorithm for documets is proposed that adapts a samplig-based

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Y.K. Patil* Iteratioal Joural of Advaced Research i ISSN: 2278-6244 IT ad Egieerig Impact Factor: 4.54 HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Prof. V.S. Nadedkar** Abstract: Documet clusterig is

More information

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University CSCI 5090/7090- Machie Learig Sprig 018 Mehdi Allahyari Georgia Souther Uiversity Clusterig (slides borrowed from Tom Mitchell, Maria Floria Balca, Ali Borji, Ke Che) 1 Clusterig, Iformal Goals Goal: Automatically

More information

Performance Comparisons of PSO based Clustering

Performance Comparisons of PSO based Clustering Performace Comparisos of PSO based Clusterig Suresh Chadra Satapathy, 2 Guaidhi Pradha, 3 Sabyasachi Pattai, 4 JVR Murthy, 5 PVGD Prasad Reddy Ail Neeruoda Istitute of Techology ad Scieces, Sagivalas,Vishaapatam

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem A Improved Shuffled Frog-Leapig Algorithm for Kapsack Problem Zhoufag Li, Ya Zhou, ad Peg Cheg School of Iformatio Sciece ad Egieerig Hea Uiversity of Techology ZhegZhou, Chia lzhf1978@126.com Abstract.

More information

ISSN (Print) Research Article. *Corresponding author Nengfa Hu

ISSN (Print) Research Article. *Corresponding author Nengfa Hu Scholars Joural of Egieerig ad Techology (SJET) Sch. J. Eg. Tech., 2016; 4(5):249-253 Scholars Academic ad Scietific Publisher (A Iteratioal Publisher for Academic ad Scietific Resources) www.saspublisher.com

More information

Eigenimages. Digital Image Processing: Bernd Girod, Stanford University -- Eigenimages 1

Eigenimages. Digital Image Processing: Bernd Girod, Stanford University -- Eigenimages 1 Eigeimages Uitary trasforms Karhue-Loève trasform ad eigeimages Sirovich ad Kirby method Eigefaces for geder recogitio Fisher liear discrimat aalysis Fisherimages ad varyig illumiatio Fisherfaces vs. eigefaces

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana The Closest Lie to a Data Set i the Plae David Gurey Southeaster Louisiaa Uiversity Hammod, Louisiaa ABSTRACT This paper looks at three differet measures of distace betwee a lie ad a data set i the plae:

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 DATA MINING II - 1DL460 Sprig 2017 A secod course i data miig http://www.it.uu.se/edu/course/homepage/ifoutv2/vt17/ Kjell Orsbor Uppsala Database Laboratory Departmet of Iformatio Techology, Uppsala Uiversity,

More information

Redundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis

Redundancy Allocation for Series Parallel Systems with Multiple Constraints and Sensitivity Analysis IOSR Joural of Egieerig Redudacy Allocatio for Series Parallel Systems with Multiple Costraits ad Sesitivity Aalysis S. V. Suresh Babu, D.Maheswar 2, G. Ragaath 3 Y.Viaya Kumar d G.Sakaraiah e (Mechaical

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

Evaluation scheme for Tracking in AMI

Evaluation scheme for Tracking in AMI A M I C o m m u i c a t i o A U G M E N T E D M U L T I - P A R T Y I N T E R A C T I O N http://www.amiproject.org/ Evaluatio scheme for Trackig i AMI S. Schreiber a D. Gatica-Perez b AMI WP4 Trackig:

More information

Journal of Chemical and Pharmaceutical Research, 2013, 5(12): Research Article

Journal of Chemical and Pharmaceutical Research, 2013, 5(12): Research Article Available olie www.jocpr.com Joural of Chemical ad Pharmaceutical Research, 2013, 5(12):745-749 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 K-meas algorithm i the optimal iitial cetroids based

More information

A Semi- Non-Negative Matrix Factorization and Principal Component Analysis Unified Framework for Data Clustering

A Semi- Non-Negative Matrix Factorization and Principal Component Analysis Unified Framework for Data Clustering A Semi- No-Negative Matrix Factorizatio ad Pricipal Compoet Aalysis Uified Framework for Data Clusterig V.Yuvaraj, N.SivaKumar Assistat Professor, Departmet of Computer Sciece, K.S.G college of Arts ad

More information

LU Decomposition Method

LU Decomposition Method SOLUTION OF SIMULTANEOUS LINEAR EQUATIONS LU Decompositio Method Jamie Traha, Autar Kaw, Kevi Marti Uiversity of South Florida Uited States of America kaw@eg.usf.edu http://umericalmethods.eg.usf.edu Itroductio

More information

Eigenimages. Digital Image Processing: Bernd Girod, 2013 Stanford University -- Eigenimages 1

Eigenimages. Digital Image Processing: Bernd Girod, 2013 Stanford University -- Eigenimages 1 Eigeimages Uitary trasforms Karhue-Loève trasform ad eigeimages Sirovich ad Kirby method Eigefaces for geder recogitio Fisher liear discrimat aalysis Fisherimages ad varyig illumiatio Fisherfaces vs. eigefaces

More information

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting)

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting) MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fittig) I this chapter, we will eamie some methods of aalysis ad data processig; data obtaied as a result of a give

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

Accuracy Improvement in Camera Calibration

Accuracy Improvement in Camera Calibration Accuracy Improvemet i Camera Calibratio FaJie L Qi Zag ad Reihard Klette CITR, Computer Sciece Departmet The Uiversity of Aucklad Tamaki Campus, Aucklad, New Zealad fli006, qza001@ec.aucklad.ac.z r.klette@aucklad.ac.z

More information

Cluster Analysis. Andrew Kusiak Intelligent Systems Laboratory

Cluster Analysis. Andrew Kusiak Intelligent Systems Laboratory Cluster Aalysis Adrew Kusiak Itelliget Systems Laboratory 2139 Seamas Ceter The Uiversity of Iowa Iowa City, Iowa 52242-1527 adrew-kusiak@uiowa.edu http://www.icae.uiowa.edu/~akusiak Two geeric modes of

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network America Joural of Itelliget Systems 206, 6(2): 42-47 DOI: 0.5923/j.ajis.2060602.02 Probabilistic Fuzzy Time Series Method Based o Artificial Neural Network Erol Egrioglu,*, Ere Bas, Cagdas Haka Aladag

More information

condition w i B i S maximum u i

condition w i B i S maximum u i ecture 10 Dyamic Programmig 10.1 Kapsack Problem November 1, 2004 ecturer: Kamal Jai Notes: Tobias Holgers We are give a set of items U = {a 1, a 2,..., a }. Each item has a weight w i Z + ad a utility

More information

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM MATEC Web of Cofereces 79, 01014 (016) DOI: 10.1051/ mateccof/0167901014 T 016 BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM Staislav Shidlovskiy 1, 1 Natioal Research

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

Analysis of Different Similarity Measure Functions and their Impacts on Shared Nearest Neighbor Clustering Approach

Analysis of Different Similarity Measure Functions and their Impacts on Shared Nearest Neighbor Clustering Approach Aalysis of Differet Similarity Measure Fuctios ad their Impacts o Shared Nearest Neighbor Clusterig Approach Ail Kumar Patidar School of IT, Rajiv Gadhi Techical Uiversity, Bhopal (M.P.), Idia Jitedra

More information

A New Bit Wise Technique for 3-Partitioning Algorithm

A New Bit Wise Technique for 3-Partitioning Algorithm Special Issue of Iteratioal Joural of Computer Applicatios (0975 8887) o Optimizatio ad O-chip Commuicatio, No.1. Feb.2012, ww.ijcaolie.org A New Bit Wise Techique for 3-Partitioig Algorithm Rajumar Jai

More information

ANN WHICH COVERS MLP AND RBF

ANN WHICH COVERS MLP AND RBF ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi

More information

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c Advaces i Egieerig Research (AER), volume 131 3rd Aual Iteratioal Coferece o Electroics, Electrical Egieerig ad Iformatio Sciece (EEEIS 2017) Pruig ad Summarizig the Discovered Time Series Associatio Rules

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 19 Query Optimizatio Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Query optimizatio Coducted by a query optimizer i a DBMS Goal:

More information

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem Exact Miimum Lower Boud Algorithm for Travelig Salesma Problem Mohamed Eleiche GeoTiba Systems mohamed.eleiche@gmail.com Abstract The miimum-travel-cost algorithm is a dyamic programmig algorithm to compute

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve Advaces i Computer, Sigals ad Systems (2018) 2: 19-25 Clausius Scietific Press, Caada Aalysis of Server Resource Cosumptio of Meteorological Satellite Applicatio System Based o Cotour Curve Xiagag Zhao

More information

arxiv: v2 [cs.ds] 24 Mar 2018

arxiv: v2 [cs.ds] 24 Mar 2018 Similar Elemets ad Metric Labelig o Complete Graphs arxiv:1803.08037v [cs.ds] 4 Mar 018 Pedro F. Felzeszwalb Brow Uiversity Providece, RI, USA pff@brow.edu March 8, 018 We cosider a problem that ivolves

More information

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms Fuzzy Rule Selectio by Data Miig Criteria ad Geetic Algorithms Hisao Ishibuchi Dept. of Idustrial Egieerig Osaka Prefecture Uiversity 1-1 Gakue-cho, Sakai, Osaka 599-8531, JAPAN E-mail: hisaoi@ie.osakafu-u.ac.jp

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

Dynamic Programming and Curve Fitting Based Road Boundary Detection

Dynamic Programming and Curve Fitting Based Road Boundary Detection Dyamic Programmig ad Curve Fittig Based Road Boudary Detectio SHYAM PRASAD ADHIKARI, HYONGSUK KIM, Divisio of Electroics ad Iformatio Egieerig Chobuk Natioal Uiversity 664-4 Ga Deokji-Dog Jeoju-City Jeobuk

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19 CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.

More information

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a 4. [10] Usig a combiatorial argumet, prove that for 1: = 0 = Let A ad B be disjoit sets of cardiality each ad C = A B. How may subsets of C are there of cardiality. We are selectig elemets for such a subset

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13 CIS Data Structures ad Algorithms with Java Sprig 08 Stacks ad Queues Moday, February / Tuesday, February Learig Goals Durig this lab, you will: Review stacks ad queues. Lear amortized ruig time aalysis

More information

New Results on Energy of Graphs of Small Order

New Results on Energy of Graphs of Small Order Global Joural of Pure ad Applied Mathematics. ISSN 0973-1768 Volume 13, Number 7 (2017), pp. 2837-2848 Research Idia Publicatios http://www.ripublicatio.com New Results o Eergy of Graphs of Small Order

More information

Matrix Partitions of Split Graphs

Matrix Partitions of Split Graphs Matrix Partitios of Split Graphs Tomás Feder, Pavol Hell, Ore Shklarsky Abstract arxiv:1306.1967v2 [cs.dm] 20 Ju 2013 Matrix partitio problems geeralize a umber of atural graph partitio problems, ad have

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 26 Ehaced Data Models: Itroductio to Active, Temporal, Spatial, Multimedia, ad Deductive Databases Copyright 2016 Ramez Elmasri ad Shamkat B.

More information

A Note on Least-norm Solution of Global WireWarping

A Note on Least-norm Solution of Global WireWarping A Note o Least-orm Solutio of Global WireWarpig Charlie C. L. Wag Departmet of Mechaical ad Automatio Egieerig The Chiese Uiversity of Hog Kog Shati, N.T., Hog Kog E-mail: cwag@mae.cuhk.edu.hk Abstract

More information

Data Structures and Algorithms. Analysis of Algorithms

Data Structures and Algorithms. Analysis of Algorithms Data Structures ad Algorithms Aalysis of Algorithms Outlie Ruig time Pseudo-code Big-oh otatio Big-theta otatio Big-omega otatio Asymptotic algorithm aalysis Aalysis of Algorithms Iput Algorithm Output

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c Iteratioal Coferece o Computatioal Sciece ad Egieerig (ICCSE 015) Harris Corer Detectio Algorithm at Sub-pixel Level ad Its Applicatio Yuafeg Ha a, Peijiag Che b * ad Tia Meg c School of Automobile, Liyi

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

Fast Fourier Transform (FFT) Algorithms

Fast Fourier Transform (FFT) Algorithms Fast Fourier Trasform FFT Algorithms Relatio to the z-trasform elsewhere, ozero, z x z X x [ ] 2 ~ elsewhere,, ~ e j x X x x π j e z z X X π 2 ~ The DFS X represets evely spaced samples of the z- trasform

More information

Dimensionality Reduction PCA

Dimensionality Reduction PCA Dimesioality Reductio PCA Machie Learig CSE446 David Wadde (slides provided by Carlos Guestri) Uiversity of Washigto Feb 22, 2017 Carlos Guestri 2005-2017 1 Dimesioality reductio Iput data may have thousads

More information

Criterion in selecting the clustering algorithm in Radial Basis Functional Link Nets

Criterion in selecting the clustering algorithm in Radial Basis Functional Link Nets WSEAS TRANSACTIONS o SYSTEMS Ag Sau Loog, Og Hog Choo, Low Heg Chi Criterio i selectig the clusterig algorithm i Radial Basis Fuctioal Lik Nets ANG SAU LOONG 1, ONG HONG CHOON 2 & LOW HENG CHIN 3 Departmet

More information

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System A Novel Feature Extractio Algorithm for Haar Local Biary Patter Texture Based o Huma Visio System Liu Tao 1,* 1 Departmet of Electroic Egieerig Shaaxi Eergy Istitute Xiayag, Shaaxi, Chia Abstract The locality

More information

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs What are we goig to lear? CSC316-003 Data Structures Aalysis of Algorithms Computer Sciece North Carolia State Uiversity Need to say that some algorithms are better tha others Criteria for evaluatio Structure

More information

Parabolic Path to a Best Best-Fit Line:

Parabolic Path to a Best Best-Fit Line: Studet Activity : Fidig the Least Squares Regressio Lie By Explorig the Relatioship betwee Slope ad Residuals Objective: How does oe determie a best best-fit lie for a set of data? Eyeballig it may be

More information

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini Proceedig of Iteratioal Coferece O Research, Implemetatio Ad Educatio Of Mathematics Ad Scieces 014, Yogyakarta State Uiversity, 18-0 May 014 BAYESIAN WIH FULL CONDIIONAL POSERIOR DISRIBUION APPROACH FOR

More information

FORMATION OF PART FAMILY IN RECONFIGURABLE MANUFACTURING SYSTEM USING PRINCIPLE COMPONENT ANALYSIS AND K-MEANS ALGORITHM

FORMATION OF PART FAMILY IN RECONFIGURABLE MANUFACTURING SYSTEM USING PRINCIPLE COMPONENT ANALYSIS AND K-MEANS ALGORITHM als of DM for 01 & Proceedigs of the 3rd Iteratioal DM Symposium, Volume 3, No.1, ISSN 304-138 ISBN 978-3-901509-91-9, CDROM versio, Ed. B. Kataliic, Published by DM Iteratioal, Viea, ustria, EU, 01 Make

More information

Appendix A. Use of Operators in ARPS

Appendix A. Use of Operators in ARPS A Appedix A. Use of Operators i ARPS The methodology for solvig the equatios of hydrodyamics i either differetial or itegral form usig grid-poit techiques (fiite differece, fiite volume, fiite elemet)

More information

South Slave Divisional Education Council. Math 10C

South Slave Divisional Education Council. Math 10C South Slave Divisioal Educatio Coucil Math 10C Curriculum Package February 2012 12 Strad: Measuremet Geeral Outcome: Develop spatial sese ad proportioal reasoig It is expected that studets will: 1. Solve

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

Speeding-up dynamic programming in sequence alignment

Speeding-up dynamic programming in sequence alignment Departmet of Computer Sciece Aarhus Uiversity Demark Speedig-up dyamic programmig i sequece aligmet Master s Thesis Dug My Hoa - 443 December, Supervisor: Christia Nørgaard Storm Pederse Implemetatio code

More information

Optimized Aperiodic Concentric Ring Arrays

Optimized Aperiodic Concentric Ring Arrays 24th Aual Review of Progress i Applied Computatioal Electromagetics March 30 - April 4, 2008 - iagara Falls, Caada 2008 ACES Optimized Aperiodic Cocetric Rig Arrays Rady L Haupt The Pesylvaia State Uiversity

More information

Data diverse software fault tolerance techniques

Data diverse software fault tolerance techniques Data diverse software fault tolerace techiques Complemets desig diversity by compesatig for desig diversity s s limitatios Ivolves obtaiig a related set of poits i the program data space, executig the

More information

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov Sortig i Liear Time Data Structures ad Algorithms Adrei Bulatov Algorithms Sortig i Liear Time 7-2 Compariso Sorts The oly test that all the algorithms we have cosidered so far is compariso The oly iformatio

More information

A Study on the Performance of Cholesky-Factorization using MPI

A Study on the Performance of Cholesky-Factorization using MPI A Study o the Performace of Cholesky-Factorizatio usig MPI Ha S. Kim Scott B. Bade Departmet of Computer Sciece ad Egieerig Uiversity of Califoria Sa Diego {hskim, bade}@cs.ucsd.edu Abstract Cholesky-factorizatio

More information

Web Text Feature Extraction with Particle Swarm Optimization

Web Text Feature Extraction with Particle Swarm Optimization 32 IJCSNS Iteratioal Joural of Computer Sciece ad Network Security, VOL.7 No.6, Jue 2007 Web Text Feature Extractio with Particle Swarm Optimizatio Sog Liagtu,, Zhag Xiaomig Istitute of Itelliget Machies,

More information

Neural Networks A Model of Boolean Functions

Neural Networks A Model of Boolean Functions Neural Networks A Model of Boolea Fuctios Berd Steibach, Roma Kohut Freiberg Uiversity of Miig ad Techology Istitute of Computer Sciece D-09596 Freiberg, Germay e-mails: steib@iformatik.tu-freiberg.de

More information

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process Vol.133 (Iformatio Techology ad Computer Sciece 016), pp.85-89 http://dx.doi.org/10.1457/astl.016. Euclidea Distace Based Feature Selectio for Fault Detectio Predictio Model i Semicoductor Maufacturig

More information

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5 Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:

More information

Bayesian approach to reliability modelling for a probability of failure on demand parameter

Bayesian approach to reliability modelling for a probability of failure on demand parameter Bayesia approach to reliability modellig for a probability of failure o demad parameter BÖRCSÖK J., SCHAEFER S. Departmet of Computer Architecture ad System Programmig Uiversity Kassel, Wilhelmshöher Allee

More information

Neuro Fuzzy Model for Human Face Expression Recognition

Neuro Fuzzy Model for Human Face Expression Recognition IOSR Joural of Computer Egieerig (IOSRJCE) ISSN : 2278-0661 Volume 1, Issue 2 (May-Jue 2012), PP 01-06 Neuro Fuzzy Model for Huma Face Expressio Recogitio Mr. Mayur S. Burage 1, Prof. S. V. Dhopte 2 1

More information

Lecture 18. Optimization in n dimensions

Lecture 18. Optimization in n dimensions Lecture 8 Optimizatio i dimesios Itroductio We ow cosider the problem of miimizig a sigle scalar fuctio of variables, f x, where x=[ x, x,, x ]T. The D case ca be visualized as fidig the lowest poit of

More information

Rapid Frequent Pattern Growth and Possibilistic Fuzzy C-means Algorithms for Improving the User Profiling Personalized Web Page Recommendation System

Rapid Frequent Pattern Growth and Possibilistic Fuzzy C-means Algorithms for Improving the User Profiling Personalized Web Page Recommendation System Received: November 21, 2017 237 Rapid Frequet Patter Growth ad Possibilistic Fuzzy C-meas Algorithms for Improvig the User Profilig Persoalized Web Page Recommedatio System Sipra Sahoo 1 * Bikram Kesari

More information

Optimal Mapped Mesh on the Circle

Optimal Mapped Mesh on the Circle Koferece ANSYS 009 Optimal Mapped Mesh o the Circle doc. Ig. Jaroslav Štigler, Ph.D. Bro Uiversity of Techology, aculty of Mechaical gieerig, ergy Istitut, Abstract: This paper brigs out some ideas ad

More information

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects. The

More information

BASED ON ITERATIVE ERROR-CORRECTION

BASED ON ITERATIVE ERROR-CORRECTION A COHPARISO OF CRYPTAALYTIC PRICIPLES BASED O ITERATIVE ERROR-CORRECTIO Miodrag J. MihaljeviC ad Jova Dj. GoliC Istitute of Applied Mathematics ad Electroics. Belgrade School of Electrical Egieerig. Uiversity

More information

Optimization of Multiple Input Single Output Fuzzy Membership Functions Using Clonal Selection Algorithm

Optimization of Multiple Input Single Output Fuzzy Membership Functions Using Clonal Selection Algorithm Optimizatio of Multiple Iput Sigle Output Fuzzy Membership Fuctios Usig Cloal Selectio Algorithm AYŞE MERVE ACILAR, AHMET ARSLAN Computer Egieerig Departmet Selcuk Uiversity Selcuk Uiversity, Eg.-Arch.

More information

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8)

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8) CIS 11 Data Structures ad Algorithms with Java Fall 017 Big-Oh Notatio Tuesday, September 5 (Make-up Friday, September 8) Learig Goals Review Big-Oh ad lear big/small omega/theta otatios Practice solvig

More information

Improving Information Retrieval System Security via an Optimal Maximal Coding Scheme

Improving Information Retrieval System Security via an Optimal Maximal Coding Scheme Improvig Iformatio Retrieval System Security via a Optimal Maximal Codig Scheme Dogyag Log Departmet of Computer Sciece, City Uiversity of Hog Kog, 8 Tat Chee Aveue Kowloo, Hog Kog SAR, PRC dylog@cs.cityu.edu.hk

More information

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS)

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS) CSC165H1, Witer 018 Learig Objectives By the ed of this worksheet, you will: Aalyse the ruig time of fuctios cotaiig ested loops. 1. Nested loop variatios. Each of the followig fuctios takes as iput a

More information

l-1 text string ( l characters : 2lbytes) pointer table the i-th word table of coincidence number of prex characters. pointer table the i-th word

l-1 text string ( l characters : 2lbytes) pointer table the i-th word table of coincidence number of prex characters. pointer table the i-th word A New Method of N-gram Statistics for Large Number of ad Automatic Extractio of Words ad Phrases from Large Text Data of Japaese Makoto Nagao, Shisuke Mori Departmet of Electrical Egieerig Kyoto Uiversity

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time ( 3.1) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step- by- step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Ruig Time Most algorithms trasform iput objects ito output objects. The

More information

Chapter 24. Sorting. Objectives. 1. To study and analyze time efficiency of various sorting algorithms

Chapter 24. Sorting. Objectives. 1. To study and analyze time efficiency of various sorting algorithms Chapter 4 Sortig 1 Objectives 1. o study ad aalyze time efficiecy of various sortig algorithms 4. 4.7.. o desig, implemet, ad aalyze bubble sort 4.. 3. o desig, implemet, ad aalyze merge sort 4.3. 4. o

More information

AN OPTIMIZATION NETWORK FOR MATRIX INVERSION

AN OPTIMIZATION NETWORK FOR MATRIX INVERSION 397 AN OPTIMIZATION NETWORK FOR MATRIX INVERSION Ju-Seog Jag, S~ Youg Lee, ad Sag-Yug Shi Korea Advaced Istitute of Sciece ad Techology, P.O. Box 150, Cheogryag, Seoul, Korea ABSTRACT Iverse matrix calculatio

More information

Package popkorn. R topics documented: February 20, Type Package

Package popkorn. R topics documented: February 20, Type Package Type Pacage Pacage popkor February 20, 2015 Title For iterval estimatio of mea of selected populatios Versio 0.3-0 Date 2014-07-04 Author Vi Gopal, Claudio Fuetes Maitaier Vi Gopal Depeds

More information