Research on K-Means Algorithm Based on Parallel Improving and Applying

Size: px
Start display at page:

Download "Research on K-Means Algorithm Based on Parallel Improving and Applying"

Transcription

1 Sed Orders for Reprits to 288 The Ope Cyberetics & Systemics Joural, 2015, 9, Ope Access Research o K-Meas Algorithm Based o Parallel Improvig ad Applyig Deg Zherog 1,2,*, Deg Xig 1, Zhag Chua 1, Xu Liag 1 ad Huag Wemig 1,2 1 School of Computer Sciece ad Egieerig, Guili Uiversity of Electroic Techology, Guili, , P.R. Chia; 2 Guagxi Key Laboratory of Trusted Software, Guili, , P.R. Chia Abstract: The capacity of sigle server or CPU is uable to fiish the task of the miig of mass data. I cosideratio of this bottleeck problem, a combied algorithm which is used by geetic ad MR-based parallel clusterig algorithm is proposed. To make up for the defects of clusterig aalysis i screeig the clusterig ceter, the clusters are used by geetic algorithm, relyig o M-R parallel computig model to accelerate the covergece of the clusterig aalysis. To verify reasoableess of algorithm, this algorithm which is applied to aalysis of the actual log is based o buildig of Hadoop platform. Experimetal results show that relyig o performace of distributed cluster computig ad geetic clusterig aalysis to process log files ca get better clusterig results, ad the efficiecy of miig of massive log ca be greatly improved. Keywords: Cloud computig, Clusterig aalysis, Geetic algorithm, Map-reduce, Mass data processig. 1. INTRODUCTION Nowadays, the arrival of the era of big data is ot i doubt. The data showed the tred of rapid growth i scietific research (Astroomy, biology, high-eergy physics), the Iteret, e-commerce, ad computer simulatio applicatio. Especially, the ew amout of data geerated i the aual scietific research is about 15PB; the era of big data has two major treds: the data expasio ad the depth aalysis of data. Eighty percet data is ustructured data i some large eterprises. This grows expoetially every year. Large data challege ot oly ivolves the costructio of eterprise storage architecture, data ceter ifrastructure, but ca also produce chai reactios for cloud computig, data miig, busiess itelligece, data warehouse, etc. Eterprises ivest more eergy i busiess aalysis ad for miig the TB level data i the future, for example, study of miig i massive log ad log depth aalysis will be a essetial part of [1]. The log is a very broad cocept term i the computig world; ay program may have a log file, such as the applicatio of computer kerel system, computer server, all kids of busiesses ad social etworkig sites. The log file cotais the iformatio of iterest, as the busiess eterprise develops the potetial value of commercial iformatio by aalysis of user lik ad statistics such as the umber of access i the logs. Therefore, aalysis ad miig of a log have become a hot topic i computer research field. Various characteristics of large data such as outstadig?, have made traditioal log processig, log miig methods ad algorithms o loger applicable. Facig the itesive, complex hybrid log file eeds to have a more efficiet method of computig ad date miig algorithm. *Address correspodece to this author at the School of Computer Sciece ad Egieerig, Guili Uiversity of Electroic Techology, Guili, , P.R. Chia; Tel: ; zhrdeg@guet.edu.c X/15 This paper is based o parallel computig model of MapReduce that has caused widespread attetio [2, 3]. The combied algorithm [4-6], which is used by geetic ad MRbased parallel clusterig algorithm is proposed i view of the advatages of geetic algorithm. The algorithm follows the thought of sample global optimizatio, task of divide ad rule, ad results of summary ad aalysis. O oe had, it solves the defect of cluster ceter iitializatio istability for k-meas algorithm, ad o the other had, accelerates the covergece of K-Meas algorithm. The algorithm is deployed o the Hadoop experimetal platform, verifyig the effectiveess of the algorithm. 2. ALGORITHM ANALYSIS 2.1. Geetic Algorithm Geetic algorithm is a geeral learig method based o evolutio. Geetic algorithm is o loger as other algorithms searchig hypothesis ragig from geeral to specific ad from simple to complex, but selects the parets sample through a certai probability from the iitial sample groups. The geetic algorithm ca cotrol the global search process, fially obtaiig the search optimal solutio set or ear optimal solutio set. Compared with the traditioal heuristic search, geetic algorithm advatage is the coloy search strategy. The mai work is the choice of geetic operatio parameters which ca be completed from frequet humacomputer iteractio process i the search process K-Meas Clusterig aalysis is that the data set is divided ito family, the family data are similar, ad differet data as far as possible. It ca mie valuable iformatio distributio patters i the data ad is a importat meas of mass iformatio processig. It also plays a vital role i machie learig, data miig, text aalysis ad other fields. Classi Betham Ope

2 Research o K-Meas Algorithm Based o Parallel Improvig ad Applyig The Ope Cyberetics & Systemics Joural, 2015, Volume cal clusterig algorithm is compared to the K-Meas clusterig, fuzzy clusterig ad spectral clusterig. the log of servers ad custom trasactio data, similar customer groups, web pages clusterig, ad clusterig algorithm of frequet access path are aalyzed [7] based o couplig matrix processig. Distributed affiity propagatio clusterig algorithm based o MapReduce is further discussed [8]; this algorithm overcomes the limitatio of the dese dataot Sparse. [9, 10] Three clusterig methods are clustered: clusterig method based o group, clusterig method based o graularity, ad clusterig method based o fuzzy. These algorithms caot reflect the advatage i big data aalysis. K-Meas is a clusterig algorithm based o iterative distace. Its realizatio is simple, the efficiecy is higher for large data processig, especially, it gets better whe miig high dimesio ad agglomerate data. The K-Meas algorithm observes istace classificatio to K clusterig where it is less tha the other clusterig ceter distace. The K- meas algorithm cosists of three steps: (1) Fid the cluster ceter iitializatio, K is defied as the umber of clusters; (2) Calculate the distace for each observatio istaces to the cluster ceter, while puttig istaces to the earest cluster. Distace uses Euclidea distace criterio. The calculatio formula is as follows: D( X i,y j ) = "( X ij! Y ij ) 2,i = 1,2,..., ; j = 1,2,..., k (1) j=1 (3) Calculate the average distace of each cluster for all observed istaces, ad the average value as a ew clusterig ceter. The calculatio formula is as follows: ceter i = 1 " x s i j (2) x j!s i Repeat the third step util the clusterig ceter o loger chages. Termiate the clusterig process whe achievig the optimal objective fuctio or a maximum umber of iteratios. Whe usig the Euclidea distace as the metric, the calculatio formula is as follows: K mi!! dist(c i, x) 2 (3) i=1 x"c i 2.3. MapReduce Map-Reduce is a excellet model of distributed computig. It is widely applied to log aalysis, mass data sortig, ad also i massive data search. Busiess logic is abstracted ito two fuctios by Map-Reduce i the large scale cluster system: Map ad Reduce. Usually, the iput data is divided ito several idepedet data blocks by Map-Reduce. Processig the data block is parallel to Map fuctio, ad calculatio model of frame is to sort the results of the Map task. The the results are outputted to the Reduce fuctio. Its core idea is to work (through the mappig) ad Reduce (simplificatio). Each phase of the Map-Reduce process is as follows: (1) Iput: The iput data is divided ito several idepedet data blocks i this phase. (2) Map: The user iput data is regarded as several groups such as <key, value> key by Map-Reduce. I this phase, the Map fuctio calls a user-defied model to hadle the <key, value> key, ad the geerates the middle <key, value> key. (3) Shuffle: the middle <key, value> key is obtaied by Map i shuffle phase, accordig to the value of key to sort the iput data. (4) Combie: Whe the map operatio outputs its pairs, they are already available i memory. For efficiecy reasos, sometimes it makes sese to take advatage of this fact by supplyig a combier class to perform a reduce-type fuctio. (5) Reduce: The framework calls the applicatio's Reduce fuctio oce for each uique key i the sorted order. The Reduce ca iterate through the values that are associated with that key ad produce zero or more outputs. (6) Output: The Output Writer writes the output of the Reduce to the stable storage, usually a distributed file system. As show i Fig. (1), Map-Reduce is efficiet i makig cocise model, good scalability, fault tolerace ad parallelism HDFS HDFS is a storage structure of a distributed computig. It provides iput, output data for Map-Reduce parallel computig stage. A HDFS cluster is composed of a umber of NameNode ad DataNode; these two types of odes are Master ad Worker, respectively. The NameNode is resposible for maiteace tasks of amespace directory ad idex file ad participates i the cluster eviromet schedulig i a cluster system. DataNode is maily resposible for the odes for data storage ad task executio, ad at the same time, uiterrupted implemetatio ad trasmissio data report is trasmitted through the heart (HeartBeat) mode to NameNode Clusterig Geetic Parallel Algorithm based o M-R Model Clusterig ceters obtaied are optimized by geetic algorithm to avoid the ureasoable clusterig aalysis i screeig the iitial cluster ceter of clusterig results ad covergece rate of the problem. The mai work of algorithm is to calculate the distace of sample to the cluster ceter ad redistributio of the cluster, ad the calculatio of differet clusters is idepedet. Accordig to the idepedet characteristics of each group, this paper presets a parallel K-Meas algorithm by the computig model of the MapReduce. The desig of algorithm maily icludes the clusterig ceter optimizatio fuctio, Map, Combie, Reduce fuctio. The algorithm process is show i Fig. (2). 3. THE IMPLEMENTATION OF IMPROVED K- MEANS ALGORITHM 3.1. Screeig the Optimal Clusterig Ceter K-Meas clusterig algorithm has the characteristics of local search, but its clusterig covergece speed ad its

3 290 The Ope Cyberetics & Systemics Joural, 2015, Volume 9 Zherog et al. Fig. (1). The computig model of map-reduce. effective clusterig results are affected by the selectio of iitial clusterig ceter. Oce the iitial value selectio is ot good, it is difficult to get ideal results. Geetic algorithm is a adaptive global optimizatio algorithm. This paper itroduces geetic algorithm i K-Meas. The ifluece of the clusterig result is o loger affected by the iitial clusterig ceter through pretreatmet of the clusterig ceter selectio. The global optimal cluster ceters are foud by chromosome codig, fitess fuctio selectio ad geetic operator of geetic algorithm. The specific steps are as follows. Step 1: Chromosome code selectio. Chromosome code selectio directly iflueces the efficiecy of the geetic algorithm ad the fial result. The biary coded mode is regarded as the solutio set of the problem which is metioed i the literature [11], but the accuracy is ot high eough i the process of large-scale umerical optimizatio. Accordig to the characteristics of large amout of data ad complex iformatio, this paper adopts real codig. The code will eed the chromosome legth as the umber of cluster ceters, assumig the chromosome legth is Li, the codig populatio for X(k), where k is the first geeratio of populatio, the the i-geeratio populatio is X(k)i={X(k)1,X (k)2,,x(k)i}, Ki is the umber of cluster ceters i the chromosome. Step 2: Populatio iitializatio. Sample K for evolutio is radomly selected from the date sample, i order to avoid the premature ed of geetic operatio; the desired effect of optimal solutio caot be achieved as K caot choose too small. Step 3: The appropriate fuctio selectio. Sort criteria of cadidate hypotheses are defied by the fitess fuctio, ad the ext geeratio populatio criterio is chose with a certai probability. I this paper, the fitess fuctio is costructed by referecig the K-Meas criterio fuctio. The K- Meas criterio fuctio ad fitess fuctio are as follows: k E = # # ( X j! u i ) 2 (4) i=1 f = 1 1+ E x j "s i E represets the square error of all objects; The Ui represets the average of the Si value, amely, the clusterig ceter; Xj represets the J class sample space; ad K is the umber of cluster ceters. Accordig to clusterig criterio F, the more less the E value is, the more excellet is the clusterig quality. Step 4: Geetic operator. Selectio: The idividual probability is obtaied by idividual to idividual ad group fitess ratio. The choice of idividual method is called fitess proportioal selectio or rotary table selectio. Calculatio formula of idividual is selected as follows: (5)

4 Research o K-Meas Algorithm Based o Parallel Improvig ad Applyig The Ope Cyberetics & Systemics Joural, 2015, Volume Fig. (2). Flow chart of the algorithm. p(h i ) = Fitess(h i )! j=1 Fitess(h j ) This method ca guaratee the radomess, higher fitess is selected ad the smaller is elimiated. Itersectio: Accordig to the real umber codig of the log miig, this paper uses uiform crossover method. The method is to radomly select two idividuals, takig two itersectios i a radom way. Whe the radom umber is 0, the frot part of the idividual is cross; whe the radom umber is 1, the middle part of the idividual is cross; whe the radom umber is 2, the tail part of the idividual is cross. Variatio: Accordig to the characteristics of mass data, each gee selected sample chromosome represets a cluster ceter. I order to meet the geetic algorithm based o assurace of global search which has also local search performace ad the diversity of the group, a mutatio operator is itroduced. Accordig to the variatio of the probability, (6) algorithm selects variat of the idividual i the group, the the samples of gee segmet have radom variatio (Here is a selectio of uiform probability as the mutatio probability). Note that variatio rage should be i the rage of gee segmets. Step 5: Output the optimal iitial clusterig ceter ad the clusterig descriptio iformatio. Iput: The iitializatio of populatio N, the probability of crossover ad mutatio. Output: The best cluster ceter, the clusterig ceter descriptio. // Calculatio of each idividual fitess, selectio probability, the expected probability for each i rage do CalAllGes(i); while maxges do // Iitial populatio

5 292 The Ope Cyberetics & Systemics Joural, 2015, Volume 9 Zherog et al. Iitial(i); // Selectio of high fitess to eter ext geeratio selectges(i); // Itersectio Crossover(); // Variatio Mutate(); maxges--; // Output the optimal clusterig ceter set pritbestresult(); 3.2. Desig of the Map ad Reduce fuctios The parallel computig task i each MapReduce task is iitially a Job. Each Job lik ca be split ito two parts: Map ad Reduce stage. The clusterig ceter durig the implemetatio of K-Meas is temporarily stored i HDFS; data exchage format uses the TextIputFomat fuctio to regulate. Key represets the clusterig ceter; value represets Euclidea distace of cluster sample to the cluster ceter. Iput: The clusterig ceter poit optimizatio ad clustered sample. Output: Output clusterig is set whe the coditios of covergece are satisfied. Iformatio is maily described o the map stage ad record belogs to the ew cluster. The calculatio of the shortest distace is carried out betwee the iput sample ad the clusterig ceter. 1. Class Mapper 2. method Map(LogWritable K,Text V,Cotext C) // radom sample of iput segmetatio 3. for each logfiles f V do // Traversig the clusterig ceter, gets the short iformatio of the cluster ceter 4. for each ceter i ceters do 5. if (mi more-tha distace) // record miimum 6. Save distace,pos 7. ed for // Record category ad attribute vector set of clusterig 8. emitresult(text(pos), Text(distace) I the Combie stage, the ecoded data files have duplicate data, i order to reduce the etwork flow, while the itermediate results are local mergers. With the same key value of itermediate results, amely clusterig ceter subscripts the same <key, value> key to merge ito a group. 9. Class Combie 10. Method Combie(LogWritable K,Text V,Cotext C) 11. EmitLocalResult(Text(pos), Text(distace) At the etrace Reduce phase, data is derived from the Map phase of <key, value> character key queue, where key is the idex of cluster ceter ad value is the Euclidea distace of the cluster family. The key of the same sample value is accumulated ad calculates the average value, ad the the mea value is regarded as the clusterig ceter at the ext iteratio, ad stored i the HDFS. 12. Class Reducer 13. Method Reduce(LogWritable K,Iterable V,Cotext C) 14. sum 0, ave for each V i Iterable<rs> 16. sum+=v(value) 17. Ave = sum/cout // Calculate the average value ad save it to the iitial clusterig ceter set i the ext roud 18. EmitNextResult(K,ave) 19. Class MRkmeas // Implemetatio of the algorithm ad output results 20. Job(cofigratio,alg) The algorithm termiates coditios: The last roud of the results is compared with the results of this clusterig. If the clusterig results do ot chage or are less tha the threshold value, the coditios for covergece are satisfied, clusterig algorithm is termiated, whereas the results would serve as the ext roud of the K-Meas iput. 4. EXPERIMENT AND ANALYSIS OF RESULTS 4.1. Set up the Experimetal Eviromet By buildig the Hadoop cloud computig platform to validate the effectiveess of the algorithm, the architecture is show i the figure. The hardware icludes five PC machies, oe machie as a master ode, loadig schedulig ad real-time moitorig of task, the remaiig four machies as a slave ode, loadig distributed processig of the task. Each ode cofiguratio of The Hadoop platform is show i Table 1. Nameode is regarded as the mai cotrol (JobTracker), which is task schedulig ad maagemet for distributed clusters; Dataode is regarded as TaskTracker, resposible for performig the task Aalysis of Results Accordig to the differet allocatios of hardware ad software for the two class statuses, this paper maily studied several experimets. Data source collectio is from log files of compay server. Experimet 1: Pseudo distributed cluster mode Firstly, build a good pseudo distributed platform, ru the traditioal clusterig algorithm (K-Meas), a parallel clusterig algorithm (CPA) ad geetic clusterig algorithm (M-R CPGA), to process the size scale of log file as 2MB, 4MB, 6MB, 8MB, 10MB, the compare the executio time, as show i Fig. (3).

6 Research o K-Meas Algorithm Based o Parallel Improvig ad Applyig The Ope Cyberetics & Systemics Joural, 2015, Volume Table 1. Cofiguratio of Hadoop platform. Name IP Fuctio Operatio System hadoop-master ameode dataode Ubutu Server hadoop-slave dataode Ubutu hadoop-slave dataode Ubutu hadoop-slave dataode Ubutu hadoop-slave dataode Ubutu I Experimet 2, the parallel clusterig algorithm (CPA) ad geetic clusterig algorithm (M-R CGPA) were compared. The experimetal results show that i the case of M-R cluster distributed CGPA, algorithm shows good speedup. As the data size icreases, computig clusters distributed advatages become more promiet, ad the scalability of Hadoop platform esures high availability program algorithm as show i Fig. (4). Fig. (3). Executio time of pseudo distributed cluster algorithm. I Experimet 1, three algorithms are used i differet sizes of the log file. The experimets of umerical comparisos show that i a sigle pseudo distributed case, whe a log file is of a very small scale, three kids of algorithm executio time have a obvious drop; however, data acquisitio is very large, because the performace of sigle cofiguratio caot reach the fully distributed parallel geetic clusterig algorithm. This reflects its disadvatage. Experimet 2: Distributed cluster mode I order to validate the algorithm i dealig with more data, thus complete cluster distributed eviromet eeds to be set up i which four machies work as the task processig ode while a machie as the task schedulig ode. Due to the hardware task, call ode cofiguratio is higher but also works as a task ode, as show i Fig. (3). Fig. (4). Executio time of distributed cluster algorithm. CONCLUSION This paper combied the global optimizatio of geetic algorithm ad K-Meas algorithm of local searchig characteristic ad proposed a geetic clusterig based o the calculatio model of M-R parallel algorithm. Through the deploymet of Hadoop cluster eviromet, the algorithm ra i HDFS ad the massive log files were processed. The experimetal results show that with the cluster odes ad the amout of data icrease, the efficiecy of the algorithm is higher to achieve effect of miig o massive log, layig the foudatio for future study of combiatio i the massive data off-lie calculatio ad multidimesioal data miig. CONFLICT OF INTEREST The authors cofirm that this article cotet has o coflict of iterest. ACKNOWLEDGEMENTS This work is supported by Guagxi key Laboratory of Trusted Software (No: kx201317), by the Postgraduate s Iovatio Project of Guili Uiversity of Electroic Techology uder (No: GDYCSZ201470), by the 2014 Guagxi Uiversity of Sciece ad Techology Research Projects (NO: LX ), by the Nature Sciece Foudatio of Guagxi (No: 2013GXNSFAA019350). REFERENCES [1] H. Yu, ad D. Wag, Mass log data processig ad miig based o Hadoop ad cloud Computig, I: Computer Sciece & Educatio (ICCSE), [2] Y. Liu, N. Cao, W. Pa, ad G. Qiao, System aomaly detectio i distributed systems through MapReduce-Based log aalysis, Advaced Computer Theory ad Egieerig (ICACTE), vol. 6, pp , [3] S. Badyopadhyay, Geetic algorithms for clusterig ad fuzzy clusterig, Wiley Iterdiscipliary Reviews: Data Miig ad Kowledge Discovery, vol. 3, p. 285, [4] X. Qi, ad H. Wag, Big Data Aalysis:Competitio ad symbiosis of RDBMS ad MapReduce, Joural of Software, vol. 23, pp , 2012.

7 294 The Ope Cyberetics & Systemics Joural, 2015, Volume 9 Zherog et al. [5] X. Meg, ad X. Ci, Big data maagemet: coceptio, techology ad challege, Research ad Developmet of Computer, vol. 50, pp , [6] S. Wag, H. Wag, ad X. Qi, Big data structure: challege, preset situatio ad prospect, Joural of Computer, vol. 10, pp , [7] Q. Sog, ad J. She, Efficiet miig algorithm of web log, Research ad Developmet of Computer, vol. 3, pp , [8] C. Zhag, ad H. Yig, Uiform block Crossover Geetic Algorithm, Techology ad Applicatio of Automatio, vol. 24, pp , [9] Y. Wu, Discussio o custerig aalysis method, Sciece of Computer, vol. 39, pp , [10] Y. Ma, ad W. Yu, Research Progress o geetic algorithm, The Research ad Applicatio of Computer, vol. 29, pp , [11] R. C Taylor, A overview of the Hadoop/MapReduce/HBase framework ad its curret applicatios i bioiformatics, I: 11 th Aual Bioiformatics Ope Source Coferece (BOSC), Bosto, MA, USA, Received: September 16, 2014 Revised: December 23, 2014 Accepted: December 31, 2014 Zherog et al.; Licesee Betham Ope. This is a ope access article licesed uder the terms of the Creative Commos Attributio No-Commercial Licese ( liceses/by-c/4.0/) which permits urestricted, o-commercial use, distributio ad reproductio i ay medium, provided the work is properly cited.

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem A Improved Shuffled Frog-Leapig Algorithm for Kapsack Problem Zhoufag Li, Ya Zhou, ad Peg Cheg School of Iformatio Sciece ad Egieerig Hea Uiversity of Techology ZhegZhou, Chia lzhf1978@126.com Abstract.

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

Journal of Chemical and Pharmaceutical Research, 2013, 5(12): Research Article

Journal of Chemical and Pharmaceutical Research, 2013, 5(12): Research Article Available olie www.jocpr.com Joural of Chemical ad Pharmaceutical Research, 2013, 5(12):745-749 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 K-meas algorithm i the optimal iitial cetroids based

More information

Optimization for framework design of new product introduction management system Ma Ying, Wu Hongcui

Optimization for framework design of new product introduction management system Ma Ying, Wu Hongcui 2d Iteratioal Coferece o Electrical, Computer Egieerig ad Electroics (ICECEE 2015) Optimizatio for framework desig of ew product itroductio maagemet system Ma Yig, Wu Hogcui Tiaji Electroic Iformatio Vocatioal

More information

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING

HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Y.K. Patil* Iteratioal Joural of Advaced Research i ISSN: 2278-6244 IT ad Egieerig Impact Factor: 4.54 HADOOP: A NEW APPROACH FOR DOCUMENT CLUSTERING Prof. V.S. Nadedkar** Abstract: Documet clusterig is

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

MapReduce and Hadoop. Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata. November 10, 2014

MapReduce and Hadoop. Debapriyo Majumdar Data Mining Fall 2014 Indian Statistical Institute Kolkata. November 10, 2014 MapReduce ad Hadoop Debapriyo Majumdar Data Miig Fall 2014 Idia Statistical Istitute Kolkata November 10, 2014 Let s keep the itro short Moder data miig: process immese amout of data quickly Exploit parallelism

More information

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today Admiistrative Fial project No office hours today UNSUPERVISED LEARNING David Kauchak CS 451 Fall 2013 Supervised learig Usupervised learig label label 1 label 3 model/ predictor label 4 label 5 Supervised

More information

n Explore virtualization concepts n Become familiar with cloud concepts

n Explore virtualization concepts n Become familiar with cloud concepts Chapter Objectives Explore virtualizatio cocepts Become familiar with cloud cocepts Chapter #15: Architecture ad Desig 2 Hypervisor Virtualizatio ad cloud services are becomig commo eterprise tools to

More information

New HSL Distance Based Colour Clustering Algorithm

New HSL Distance Based Colour Clustering Algorithm The 4th Midwest Artificial Itelligece ad Cogitive Scieces Coferece (MAICS 03 pp 85-9 New Albay Idiaa USA April 3-4 03 New HSL Distace Based Colour Clusterig Algorithm Vasile Patrascu Departemet of Iformatics

More information

ISSN (Print) Research Article. *Corresponding author Nengfa Hu

ISSN (Print) Research Article. *Corresponding author Nengfa Hu Scholars Joural of Egieerig ad Techology (SJET) Sch. J. Eg. Tech., 2016; 4(5):249-253 Scholars Academic ad Scietific Publisher (A Iteratioal Publisher for Academic ad Scietific Resources) www.saspublisher.com

More information

An Anomaly Detection Method Based On Deep Learning

An Anomaly Detection Method Based On Deep Learning Sed Orders for Reprits to reprits@bethamsciece.ae 734 The Ope Automatio ad Cotrol Systems Joural, 05, 7, 734-739 A Aomaly Detectio Method Based O Deep Learig Ope Access Hog-li Deg *, Tao yag ad Jiag-i

More information

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University CSCI 5090/7090- Machie Learig Sprig 018 Mehdi Allahyari Georgia Souther Uiversity Clusterig (slides borrowed from Tom Mitchell, Maria Floria Balca, Ali Borji, Ke Che) 1 Clusterig, Iformal Goals Goal: Automatically

More information

Stone Images Retrieval Based on Color Histogram

Stone Images Retrieval Based on Color Histogram Stoe Images Retrieval Based o Color Histogram Qiag Zhao, Jie Yag, Jigyi Yag, Hogxig Liu School of Iformatio Egieerig, Wuha Uiversity of Techology Wuha, Chia Abstract Stoe images color features are chose

More information

Web Text Feature Extraction with Particle Swarm Optimization

Web Text Feature Extraction with Particle Swarm Optimization 32 IJCSNS Iteratioal Joural of Computer Sciece ad Network Security, VOL.7 No.6, Jue 2007 Web Text Feature Extractio with Particle Swarm Optimizatio Sog Liagtu,, Zhag Xiaomig Istitute of Itelliget Machies,

More information

Adaptive Resource Allocation for Electric Environmental Pollution through the Control Network

Adaptive Resource Allocation for Electric Environmental Pollution through the Control Network Available olie at www.sciecedirect.com Eergy Procedia 6 (202) 60 64 202 Iteratioal Coferece o Future Eergy, Eviromet, ad Materials Adaptive Resource Allocatio for Electric Evirometal Pollutio through the

More information

Analysis of Documents Clustering Using Sampled Agglomerative Technique

Analysis of Documents Clustering Using Sampled Agglomerative Technique Aalysis of Documets Clusterig Usig Sampled Agglomerative Techique Omar H. Karam, Ahmed M. Hamad, ad Sheri M. Moussa Abstract I this paper a clusterig algorithm for documets is proposed that adapts a samplig-based

More information

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs Dyamic Aalysis ad Desig Patter Detectio i Java Programs Outlie Lei Hu Kamra Sartipi {hul4, sartipi}@mcmasterca Departmet of Computig ad Software McMaster Uiversity Caada Motivatio Research Problem Defiitio

More information

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve Advaces i Computer, Sigals ad Systems (2018) 2: 19-25 Clausius Scietific Press, Caada Aalysis of Server Resource Cosumptio of Meteorological Satellite Applicatio System Based o Cotour Curve Xiagag Zhao

More information

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP Nature-Ispired Computig Hadlig Costraits Dr. Şima Uyar September 2006 Itroductio may practical problems are costraied ot all combiatios of variable values represet valid solutios feasible solutios ifeasible

More information

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms Fuzzy Rule Selectio by Data Miig Criteria ad Geetic Algorithms Hisao Ishibuchi Dept. of Idustrial Egieerig Osaka Prefecture Uiversity 1-1 Gakue-cho, Sakai, Osaka 599-8531, JAPAN E-mail: hisaoi@ie.osakafu-u.ac.jp

More information

SCHEDULING OPTIMIZATION IN CONSTRUCTION PROJECT BASED ON ANT COLONY GENETIC ALGORITHM

SCHEDULING OPTIMIZATION IN CONSTRUCTION PROJECT BASED ON ANT COLONY GENETIC ALGORITHM Joural of Theoretical ad Applied Iformatio Techology 8 th February 013. Vol. 48 No.3 005-013 JATIT & LLS. All rights reserved. ISSN: 199-8645 www.jatit.org E-ISSN: 1817-3195 SCHEDULING OPTIMIZATION IN

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

Sectio 4, a prototype project of settig field weight with AHP method is developed ad the experimetal results are aalyzed. Fially, we coclude our work

Sectio 4, a prototype project of settig field weight with AHP method is developed ad the experimetal results are aalyzed. Fially, we coclude our work 200 2d Iteratioal Coferece o Iformatio ad Multimedia Techology (ICIMT 200) IPCSIT vol. 42 (202) (202) IACSIT Press, Sigapore DOI: 0.7763/IPCSIT.202.V42.0 Idex Weight Decisio Based o AHP for Iformatio Retrieval

More information

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c Iteratioal Coferece o Computatioal Sciece ad Egieerig (ICCSE 015) Harris Corer Detectio Algorithm at Sub-pixel Level ad Its Applicatio Yuafeg Ha a, Peijiag Che b * ad Tia Meg c School of Automobile, Liyi

More information

Study on effective detection method for specific data of large database LI Jin-feng

Study on effective detection method for specific data of large database LI Jin-feng Iteratioal Coferece o Automatio, Mechaical Cotrol ad Computatioal Egieerig (AMCCE 205) Study o effective detectio method for specific data of large database LI Ji-feg (Vocatioal College of DogYig, Shadog

More information

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network America Joural of Itelliget Systems 206, 6(2): 42-47 DOI: 0.5923/j.ajis.2060602.02 Probabilistic Fuzzy Time Series Method Based o Artificial Neural Network Erol Egrioglu,*, Ere Bas, Cagdas Haka Aladag

More information

Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System

Evaluation of Different Fitness Functions for the Evolutionary Testing of an Autonomous Parking System Evaluatio of Differet Fitess Fuctios for the Evolutioary Testig of a Autoomous Parkig System Joachim Wegeer 1 ad Oliver Bühler 2 1 DaimlerChrysler AG, Research ad Techology, Alt-Moabit 96 a, D-10559 Berli,

More information

Heuristic Approaches for Solving the Multidimensional Knapsack Problem (MKP)

Heuristic Approaches for Solving the Multidimensional Knapsack Problem (MKP) Heuristic Approaches for Solvig the Multidimesioal Kapsack Problem (MKP) R. PARRA-HERNANDEZ N. DIMOPOULOS Departmet of Electrical ad Computer Eg. Uiversity of Victoria Victoria, B.C. CANADA Abstract: -

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

An Algorithm of Mobile Robot Node Location Based on Wireless Sensor Network

An Algorithm of Mobile Robot Node Location Based on Wireless Sensor Network A Algorithm of Mobile Robot Node Locatio Based o Wireless Sesor Network https://doi.org/0.399/ijoe.v3i05.7044 Peg A Nigbo Uiversity of Techology, Zhejiag, Chia eirxvrp2269@26.com Abstract I the wireless

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

Service Oriented Enterprise Architecture and Service Oriented Enterprise

Service Oriented Enterprise Architecture and Service Oriented Enterprise Approved for Public Release Distributio Ulimited Case Number: 09-2786 The 23 rd Ope Group Eterprise Practitioers Coferece Service Orieted Eterprise ad Service Orieted Eterprise Ya Zhao, PhD Pricipal, MITRE

More information

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c Advaces i Egieerig Research (AER), volume 131 3rd Aual Iteratioal Coferece o Electroics, Electrical Egieerig ad Iformatio Sciece (EEEIS 2017) Pruig ad Summarizig the Discovered Time Series Associatio Rules

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 19 Query Optimizatio Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Query optimizatio Coducted by a query optimizer i a DBMS Goal:

More information

Hui Xiao School of Environmental Science, Nanjing Xiaozhuang University, Nanjing , China

Hui Xiao School of Environmental Science, Nanjing Xiaozhuang University, Nanjing , China doi:0.3/00.39.. Cotiuous knn Queries i Dyamic Road Networks Hui Xiao School of Evirometal Sciece, Najig Xiaozhuag Uiversity, Najig 7, Chia Abstract Cotiuous knn queries have bee widely studied i recet

More information

Research on Identification Model of Financial Fraud of Listed Company Based on Data Mining Technology

Research on Identification Model of Financial Fraud of Listed Company Based on Data Mining Technology 208 2d Iteratioal Coferece o Systems, Computig, ad Applicatios (SYSTCA 208) Research o Idetificatio Model of Fiacial Fraud of Listed Compay Based o Data Miig Techology Jiaqi Hu, Xiao Che School of Busiess,

More information

World Scientific Research Journal (WSRJ) ISSN: Research on Fresnel Lens Optical Receiving Antenna in Indoor Visible

World Scientific Research Journal (WSRJ) ISSN: Research on Fresnel Lens Optical Receiving Antenna in Indoor Visible World Scietific Research Joural (WSRJ) ISSN: 2472-3703 www.wsr-j.org Research o Fresel Les Optical Receivig Atea i Idoor Visible Light Commuicatio Zhihua Du College of Electroics Egieerig, Chogqig Uiversity

More information

Fuzzy Linear Regression Analysis

Fuzzy Linear Regression Analysis 12th IFAC Coferece o Programmable Devices ad Embedded Systems The Iteratioal Federatio of Automatic Cotrol September 25-27, 2013. Fuzzy Liear Regressio Aalysis Jaa Nowaková Miroslav Pokorý VŠB-Techical

More information

A Hybrid Clustering Method Using Genetic Algorithm with New Variation Operators

A Hybrid Clustering Method Using Genetic Algorithm with New Variation Operators Iteratioal Joural of Idustrial Egieerig & Productio Maagemet (2012) Jue 2012, Volume 23, Number 1 pp. 121-128 http:ijiepm.iust.ac.ir A Hybrid Clusterig Method Usig Geetic Algorithm with New Variatio Operators

More information

Big Data Capacity Planning: Achieving Right Sized Hadoop Clusters and Optimized Operations

Big Data Capacity Planning: Achieving Right Sized Hadoop Clusters and Optimized Operations Big Data Capacity Plaig: Achievig Right Sized Hadoop Clusters ad Optimized Operatios Abstract Busiesses are cosiderig more opportuities to leverage data for differet purposes, impactig resources ad resultig

More information

Chapter 4 Threads. Operating Systems: Internals and Design Principles. Ninth Edition By William Stallings

Chapter 4 Threads. Operating Systems: Internals and Design Principles. Ninth Edition By William Stallings Operatig Systems: Iterals ad Desig Priciples Chapter 4 Threads Nith Editio By William Stalligs Processes ad Threads Resource Owership Process icludes a virtual address space to hold the process image The

More information

Digital Microfluidic Biochips Online Fault Detection Route Optimization Scheme Chuan-pei XU and Tong-zhou ZHAO *

Digital Microfluidic Biochips Online Fault Detection Route Optimization Scheme Chuan-pei XU and Tong-zhou ZHAO * 2018 Iteratioal Coferece o Commuicatio, Network ad Artificial Itelligece (CNAI 2018) ISBN: 978-1-60595-065-5 Digital Microfluidic Biochips Olie Fault Detectio Route Optimizatio Scheme Chua-pei XU ad Tog-zhou

More information

CS 683: Advanced Design and Analysis of Algorithms

CS 683: Advanced Design and Analysis of Algorithms CS 683: Advaced Desig ad Aalysis of Algorithms Lecture 6, February 1, 2008 Lecturer: Joh Hopcroft Scribes: Shaomei Wu, Etha Feldma February 7, 2008 1 Threshold for k CNF Satisfiability I the previous lecture,

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

Małgorzata Sterna. Mateusz Cicheński, Mateusz Jarus, Michał Miszkiewicz, Jarosław Szymczak

Małgorzata Sterna. Mateusz Cicheński, Mateusz Jarus, Michał Miszkiewicz, Jarosław Szymczak Małgorzata Stera Mateusz Cicheński, Mateusz Jarus, Michał Miszkiewicz, Jarosław Szymczak Istitute of Computig Sciece Pozań Uiversity of Techology Pozań - Polad Scope of the Talk Problem defiitio MP Formulatio

More information

A METHOD OF GENERATING RULES FOR A KERNEL FUZZY CLASSIFIER

A METHOD OF GENERATING RULES FOR A KERNEL FUZZY CLASSIFIER Proceedigs of the Sixth Iteratioal Coferece o Machie Learig ad Cyberetics, Hog Kog, 9- August 7 A METHOD OF GENERATING RULES FOR A KERNEL FUZZY CLASSIFIER AI-MIN YANG,, XIN-GUANG LI, LING-MIN JIANG,YONG-MEI

More information

New Fuzzy Color Clustering Algorithm Based on hsl Similarity

New Fuzzy Color Clustering Algorithm Based on hsl Similarity IFSA-EUSFLAT 009 New Fuzzy Color Clusterig Algorithm Based o hsl Similarity Vasile Ptracu Departmet of Iformatics Techology Tarom Compay Bucharest Romaia Email: patrascu.v@gmail.com Abstract I this paper

More information

1 Enterprise Modeler

1 Enterprise Modeler 1 Eterprise Modeler Itroductio I BaaERP, a Busiess Cotrol Model ad a Eterprise Structure Model for multi-site cofiguratios are itroduced. Eterprise Structure Model Busiess Cotrol Models Busiess Fuctio

More information

x x 2 x Iput layer = quatity of classificatio mode X T = traspositio matrix The core of such coditioal probability estimatig method is calculatig the

x x 2 x Iput layer = quatity of classificatio mode X T = traspositio matrix The core of such coditioal probability estimatig method is calculatig the COMPARATIVE RESEARCHES ON PROBABILISTIC NEURAL NETWORKS AND MULTI-LAYER PERCEPTRON NETWORKS FOR REMOTE SENSING IMAGE SEGMENTATION Liu Gag a, b, * a School of Electroic Iformatio, Wuha Uiversity, 430079,

More information

A Key Distribution method for Reducing Storage and Supporting High Level Security in the Large-scale WSN

A Key Distribution method for Reducing Storage and Supporting High Level Security in the Large-scale WSN Iteratioal Joural of Digital Cotet Techology ad its Applicatios Vol. 2 No 1, March 2008 A Key Distributio method for Reducig Storage ad Supportig High Level Security i the Large-scale WSN Yoo-Su Jeog *1,

More information

Bayesian approach to reliability modelling for a probability of failure on demand parameter

Bayesian approach to reliability modelling for a probability of failure on demand parameter Bayesia approach to reliability modellig for a probability of failure o demad parameter BÖRCSÖK J., SCHAEFER S. Departmet of Computer Architecture ad System Programmig Uiversity Kassel, Wilhelmshöher Allee

More information

A Novel Hybrid Algorithm for Software Cost Estimation Based on Cuckoo Optimization and K-Nearest Neighbors Algorithms

A Novel Hybrid Algorithm for Software Cost Estimation Based on Cuckoo Optimization and K-Nearest Neighbors Algorithms Egieerig, Techology & Applied Sciece Research Vol. 6, No. 3, 26, 8-22 8 A Novel Hybrid Algorithm for Software Cost Estimatio Based o Optimizatio ad K-Nearest Neighbors Algorithms Elaz Eskadaria Miadoab

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

Investigating methods for improving Bagged k-nn classifiers

Investigating methods for improving Bagged k-nn classifiers Ivestigatig methods for improvig Bagged k-nn classifiers Fuad M. Alkoot Telecommuicatio & Navigatio Istitute, P.A.A.E.T. P.O.Box 4575, Alsalmia, 22046 Kuwait Abstract- We experimet with baggig knn classifiers

More information

The measurement of overhead conductor s sag with DLT method

The measurement of overhead conductor s sag with DLT method Advaces i Egieerig Research (AER), volume 7 2d Aual Iteratioal Coferece o Electroics, Electrical Egieerig ad Iformatio Sciece (EEEIS 206) he measuremet of overhead coductor s sag with DL method Fag Ye,

More information

Abstract. Chapter 4 Computation. Overview 8/13/18. Bjarne Stroustrup Note:

Abstract. Chapter 4 Computation. Overview 8/13/18. Bjarne Stroustrup   Note: Chapter 4 Computatio Bjare Stroustrup www.stroustrup.com/programmig Abstract Today, I ll preset the basics of computatio. I particular, we ll discuss expressios, how to iterate over a series of values

More information

BASED ON ITERATIVE ERROR-CORRECTION

BASED ON ITERATIVE ERROR-CORRECTION A COHPARISO OF CRYPTAALYTIC PRICIPLES BASED O ITERATIVE ERROR-CORRECTIO Miodrag J. MihaljeviC ad Jova Dj. GoliC Istitute of Applied Mathematics ad Electroics. Belgrade School of Electrical Egieerig. Uiversity

More information

Evolutionary Hybrid Genetic-Firefly Algorithm for Global Optimization

Evolutionary Hybrid Genetic-Firefly Algorithm for Global Optimization Iteratioal Joural of Computatioal Egieerig & Maagemet, Vol. 6 Issue, May ISSN (Olie): -789 www..org Evolutioary Hybrid Geetic-Firefly Algorithm for Global Optimizatio 7 Shaik Farook, P. Sagameswara Raju

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

ECE4050 Data Structures and Algorithms. Lecture 6: Searching

ECE4050 Data Structures and Algorithms. Lecture 6: Searching ECE4050 Data Structures ad Algorithms Lecture 6: Searchig 1 Search Give: Distict keys k 1, k 2,, k ad collectio L of records of the form (k 1, I 1 ), (k 2, I 2 ),, (k, I ) where I j is the iformatio associated

More information

Evaluation of Support Vector Machine Kernels for Detecting Network Anomalies

Evaluation of Support Vector Machine Kernels for Detecting Network Anomalies Evaluatio of Support Vector Machie Kerels for Detectig Network Aomalies Prera Batta, Maider Sigh, Zhida Li, Qigye Dig, ad Ljiljaa Trajković Commuicatio Networks Laboratory http://www.esc.sfu.ca/~ljilja/cl/

More information

An Estimation of Distribution Algorithm for solving the Knapsack problem

An Estimation of Distribution Algorithm for solving the Knapsack problem Vol.4,No.5, 214 Published olie: May 25, 214 DOI: 1.7321/jscse.v4.5.1 A Estimatio of Distributio Algorithm for solvig the Kapsack problem 1 Ricardo Pérez, 2 S. Jös, 3 Arturo Herádez, 4 Carlos A. Ochoa *1,

More information

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation 6-0-0 Kowledge Trasformatio from Task Scearios to View-based Desig Diagrams Nima Dezhkam Kamra Sartipi {dezhka, sartipi}@mcmaster.ca Departmet of Computig ad Software McMaster Uiversity CANADA SEKE 08

More information

A Method of Malicious Application Detection

A Method of Malicious Application Detection 5th Iteratioal Coferece o Educatio, Maagemet, Iformatio ad Medicie (EMIM 2015) A Method of Malicious Applicatio Detectio Xiao Cheg 1,a, Ya Hui Guo 2,b, Qi Li 3,c 1 Xiao Cheg, Beijig Uiv Posts & Telecommu,

More information

Traditional queuing behaviour in routers. Scheduling and queue management. Questions. Scheduling mechanisms. Scheduling [1] Scheduling [2]

Traditional queuing behaviour in routers. Scheduling and queue management. Questions. Scheduling mechanisms. Scheduling [1] Scheduling [2] Traditioal queuig behaviour i routers Schedulig ad queue maagemet Data trasfer: datagrams: idividual packets o recogitio of flows coectioless: o sigallig Forwardig: based o per-datagram, forwardig table

More information

performance to the performance they can experience when they use the services from a xed location.

performance to the performance they can experience when they use the services from a xed location. I the Proceedigs of The First Aual Iteratioal Coferece o Mobile Computig ad Networkig (MobiCom 9) November -, 99, Berkeley, Califoria USA Performace Compariso of Mobile Support Strategies Rieko Kadobayashi

More information

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le Fudametals of Media Processig Shi'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dih Le Today's topics Noparametric Methods Parze Widow k-nearest Neighbor Estimatio Clusterig Techiques k-meas Agglomerative Hierarchical

More information

n Learn how resiliency strategies reduce risk n Discover automation strategies to reduce risk

n Learn how resiliency strategies reduce risk n Discover automation strategies to reduce risk Chapter Objectives Lear how resiliecy strategies reduce risk Discover automatio strategies to reduce risk Chapter #16: Architecture ad Desig Resiliecy ad Automatio Strategies 2 Automatio/Scriptig Resiliet

More information

Python Programming: An Introduction to Computer Science

Python Programming: An Introduction to Computer Science Pytho Programmig: A Itroductio to Computer Sciece Chapter 1 Computers ad Programs 1 Objectives To uderstad the respective roles of hardware ad software i a computig system. To lear what computer scietists

More information

Variance as a Stopping Criterion for Genetic Algorithms with Elitist Model

Variance as a Stopping Criterion for Genetic Algorithms with Elitist Model Fudameta Iformaticae 120 (2012) 145 164 145 DOI 10.3233/FI-2012-754 IOS Press Variace as a Stoppig Criterio for Geetic Algorithms with Elitist Model Diabadhu Bhadari, C. A. Murthy, Sakar K. Pal Ceter for

More information

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015 Presetatio for use with the textbook Algorithm Desig ad Applicatios, by M. T. Goodrich ad R. Tamassia, Wiley, 201 Heaps 201 Goodrich ad Tamassia xkcd. http://xkcd.com/83/. Tree. Used with permissio uder

More information

An Image Retrieval Method Based on Hu Invariant Moment and Improved Annular Histogram

An Image Retrieval Method Based on Hu Invariant Moment and Improved Annular Histogram http://dx.doi.org/10.5755/j01.eee.0.4.6888 ELEKTROIKA IR ELEKTROTECHIKA ISS 139 115 VOL. 0 O. 4 014 A Image Retrieval Method Based o Hu Ivariat Momet ad Improved Aular Histogram F. Xiag 1 H. Yog 1 S. Dada

More information

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System A Novel Feature Extractio Algorithm for Haar Local Biary Patter Texture Based o Huma Visio System Liu Tao 1,* 1 Departmet of Electroic Egieerig Shaaxi Eergy Istitute Xiayag, Shaaxi, Chia Abstract The locality

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process Vol.133 (Iformatio Techology ad Computer Sciece 016), pp.85-89 http://dx.doi.org/10.1457/astl.016. Euclidea Distace Based Feature Selectio for Fault Detectio Predictio Model i Semicoductor Maufacturig

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties WSEAS TRANSACTIONS o COMMUNICATIONS Wag Xiyag The Couterchaged Crossed Cube Itercoectio Network ad Its Topology Properties WANG XINYANG School of Computer Sciece ad Egieerig South Chia Uiversity of Techology

More information

A Flexible Hierarchical Classification Algorithm for Content Based Image Retrieval

A Flexible Hierarchical Classification Algorithm for Content Based Image Retrieval A Flexible Hierarchical Classificatio Algorithm for Cotet Based Image Retrieval Qiao Liu, Jiagfeg Che, ad Hui Zhag Abstract he goal of paper is to describe a flexible hierarchical classificatio algorithm

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

Goals of the Lecture UML Implementation Diagrams

Goals of the Lecture UML Implementation Diagrams Goals of the Lecture UML Implemetatio Diagrams Object-Orieted Aalysis ad Desig - Fall 1998 Preset UML Diagrams useful for implemetatio Provide examples Next Lecture Ð A variety of topics o mappig from

More information

Panel for Adobe Premiere Pro CC Partner Solution

Panel for Adobe Premiere Pro CC Partner Solution Pael for Adobe Premiere Pro CC Itegratio for more efficiecy The makes video editig simple, fast ad coveiet. The itegrated pael gives users immediate access to all medialoopster features iside Adobe Premiere

More information

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini Proceedig of Iteratioal Coferece O Research, Implemetatio Ad Educatio Of Mathematics Ad Scieces 014, Yogyakarta State Uiversity, 18-0 May 014 BAYESIAN WIH FULL CONDIIONAL POSERIOR DISRIBUION APPROACH FOR

More information

SOFTWARE usually does not work alone. It must have

SOFTWARE usually does not work alone. It must have Proceedigs of the 203 Federated Coferece o Computer Sciece ad Iformatio Systems pp. 343 348 A method for selectig eviromets for software compatibility testig Łukasz Pobereżik AGH Uiversity of Sciece ad

More information

1&1 Next Level Hosting

1&1 Next Level Hosting 1&1 Next Level Hostig Performace Level: Performace that grows with your requiremets Copyright 1&1 Iteret SE 2017 1ad1.com 2 1&1 NEXT LEVEL HOSTING 3 Fast page loadig ad short respose times play importat

More information

Optimization of Priority based CPU Scheduling Algorithms to Minimize Starvation of Processes using an Efficiency Factor

Optimization of Priority based CPU Scheduling Algorithms to Minimize Starvation of Processes using an Efficiency Factor Iteratioal Joural of Computer Applicatios (97 8887) Volume 132 No.11, December21 Optimizatio of based CPU Schedulig Algorithms to Miimize Starvatio of Processes usig a Efficiecy Factor Muhammad A. Mustapha

More information

arxiv: v2 [cs.ds] 24 Mar 2018

arxiv: v2 [cs.ds] 24 Mar 2018 Similar Elemets ad Metric Labelig o Complete Graphs arxiv:1803.08037v [cs.ds] 4 Mar 018 Pedro F. Felzeszwalb Brow Uiversity Providece, RI, USA pff@brow.edu March 8, 018 We cosider a problem that ivolves

More information

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 12: Virtual Memory Prof. Yajig Li Uiversity of Chicago A System with Physical Memory Oly Examples: most Cray machies early PCs Memory early all embedded systems

More information

EMPIRICAL ANALYSIS OF FAULT PREDICATION TECHNIQUES FOR IMPROVING SOFTWARE PROCESS CONTROL

EMPIRICAL ANALYSIS OF FAULT PREDICATION TECHNIQUES FOR IMPROVING SOFTWARE PROCESS CONTROL Iteratioal Joural of Iformatio Techology ad Kowledge Maagemet July-December 2012, Volume 5, No. 2, pp. 371-375 EMPIRICAL ANALYSIS OF FAULT PREDICATION TECHNIQUES FOR IMPROVING SOFTWARE PROCESS CONTROL

More information

Performance Optimization of Big Data Processing using Clustering Technique in Map Reduces Programming Model

Performance Optimization of Big Data Processing using Clustering Technique in Map Reduces Programming Model Performace Optimizatio of Big Data Processig usig Clusterig Techique i Map Reduces Programmig Model Ravidra Sigh Raghuwashi Samrat Ashok Techological Istitute VIDISHA,M.P Idia Deepak Sai Samrat Ashok Techological

More information

Performance Comparisons of PSO based Clustering

Performance Comparisons of PSO based Clustering Performace Comparisos of PSO based Clusterig Suresh Chadra Satapathy, 2 Guaidhi Pradha, 3 Sabyasachi Pattai, 4 JVR Murthy, 5 PVGD Prasad Reddy Ail Neeruoda Istitute of Techology ad Scieces, Sagivalas,Vishaapatam

More information

Transforming Irregular Algorithms for Heterogeneous Computing - Case Studies in Bioinformatics

Transforming Irregular Algorithms for Heterogeneous Computing - Case Studies in Bioinformatics Trasformig Irregular lgorithms for Heterogeeous omputig - ase Studies i ioiformatics Jig Zhag dvisor: Dr. Wu Feg ollaborator: Hao Wag syergy.cs.vt.edu Irregular lgorithms haracterized by Operate o irregular

More information

ANN WHICH COVERS MLP AND RBF

ANN WHICH COVERS MLP AND RBF ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi

More information

Private Key Cryptography. TELE3119: Week2

Private Key Cryptography. TELE3119: Week2 Private Key Cryptography TELE3119: Week2 Private Key Ecryptio Also referred to as: covetioal ecryptio symmetric key ecryptio secret-key or sigle-key ecryptio Oly alterative before public-key ecryptio i

More information

Hashing Functions Performance in Packet Classification

Hashing Functions Performance in Packet Classification Hashig Fuctios Performace i Packet Classificatio Mahmood Ahmadi ad Stepha Wog Computer Egieerig Laboratory Faculty of Electrical Egieerig, Mathematics ad Computer Sciece Delft Uiversity of Techology {mahmadi,

More information

Bayesian Network Structure Learning from Attribute Uncertain Data

Bayesian Network Structure Learning from Attribute Uncertain Data Bayesia Network Structure Learig from Attribute Ucertai Data Wetig Sog 1,2, Jeffrey Xu Yu 3, Hog Cheg 3, Hogya Liu 4, Ju He 1,2,*, ad Xiaoyog Du 1,2 1 Key Labs of Data Egieerig ad Kowledge Egieerig, Miistry

More information