BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering

Size: px
Start display at page:

Download "BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering"

Transcription

1 015 IEEE 17th Internatonal Conference on Hgh Performance Computng and Communcatons (HPCC), 015 IEEE 7th Internatonal Symposum on Cyberspace Safety and Securty (CSS), and 015 IEEE 1th Internatonal Conf on Embedded Software and Systems (ICESS) : A Balance RDF Data Parttonng Algorthm based on Hybrd Herarchcal Clusterng Yongln Leng, Zhu Chen, Fangmng Zhong, Hua Zhong School of Software Technology, Dalan Unversty of Technology, 11660, Dalan, Chna Abstract - Data parttonng s a fundamental step to acheve effectve storage and query of RDF bg data. Ths paper presents a balance RDF data parttonng algorthm based on hybrd herarchcal clusterng (), whch combnes AP and K- means clusterng. s functonalty ncludes three aspects: () a pre-processng step combnng nodes compresson and nodes remove to reduce the scale of raw data ponts, () AP clusterng algorthm s used to coarsen the RDF graph step by step and produce data blocs, and () K-means algorthm s used for data parttonng fnally. Experments on benchmar datasets demonstrate the effectveness of the proposed scheme. Index Terms - RDF; data parttonng; hybrd herarchcal clusterng I. INTRODUCTION The Resource Descrpton Framewor (RDF) s the W3C s recommendaton as the basement of the semantc web [1], whch can represent all the dentfed nformaton n the web and provde the nteroperablty between applcatons []. RDF data can be descrbed as a collecton of trples denoted as SPO (subect, predcate, obect), n whch the subect ndcates an entty, and the obect s the name of the entty or lteral n predcate. It can also be represented by a drected graph, wth the enttes (.e. subects and obects) as nodes and the predcate as drected edges from subect to obect [3]. Fg.1 (a) shows an example RDF trples, and Fg. (b) s the drected graph of RDF. (a) Example of RDF trple (b) Example of RDF drected graph Fg. 1 RDF examples Wth the wde applcaton of RDF data, RDF data sze ncreases sharply. The RDF data storage n sngle server Ths wor s supported by Proect U of NSFC and Proect of Laonng Provncal Natural Scence Foundaton of Chna. cannot meet the requrement of massve data storage and query. To tacle ths problem, the dstrbuted storage was proposed to store the massve RDF data [3, 4]. The data parttonng s a ey concern n RDF data dstrbuted storage. In other words, a bad parttonng scheme wll result n low effcency of storage and query. Typcal data parttonng algorthms adopt horzontal or vertcal parttonng based on trples [5]. Ths nd of algorthms gnores the correlatons between trples, leadng to a large number of on operatons among the compute nodes durng the RDF query. To address ths problem, graph-based approach has been proposed to partton the RDF data, whch stores the closely related nodes n the same compute node [6]. In ths scheme, most of data queres can be performed n parallel to mprove the query effcency. Moreover, ths scheme can reduce the on operaton among the compute nodes. The representatve parttonng algorthm based on graph ncludng geometrc parttonng [7], spectral parttonng [8] and multlevel parttonng algorthm [9, 10, 11]. Compared wth other graph parttonng methods, the multlevel parttonng algorthm s superor to others n tme complexty and the scale of data. However, ths nd of algorthms manly focuses on the parttonng effcency, gnorng the balance of data parttonng, resultng n the dffculty n parallel to perform. In ths paper, we present a balance RDF data parttonng algorthm based on hybrd herarchcal clusterng (), whch wors n three steps. Frstly, a pre-processng step s executed to reduce the number of raw data ponts by combnng the specal attrbutes and removng the hgh degree nodes. Secondly, the affnty propagaton clusterng algorthm (AP) s used to pre-cluster the RDF graph for producng data blocs. In each AP clusterng, we merge or splt the data blocs to balance the scale of the data blocs through nteractve edge. Fnally, an nteractve edge weghted K-means clusterng method s presented to partton data blocs and to realze the partton. The rest of the paper s organzed as follows. In Secton, we present the pre-processng method for RDF graph by two steps. Secton 3 depcts the smlarty measurement based on adacency and nteractve edges. In Secton 4, we descrbe the algorthm for RDF data parttonng. Performance evaluaton s llustrated n Secton 5. Secton 6 concludes the paper. II. PRE-PROCESSING FOR RDF GRAPH In ths secton, we present the proposed pre-processng steps for RDF graph. Combnaton of the nodes wth unque /15 $ IEEE DOI /HPCC-CSS-ICESS

2 attrbute value s frst descrbed, followed by the removng of the nodes wth hgh degree. A. Combnaton of the Nodes wth Unque Attrbute Values Gven an RDF graph G ( V, E), V Ve Vl represents the node set n whch denotes the entty node set and Vl denotes the attrbute node set, whle E {( e v, v) v, v V} Er Ea denotes the drected edge set n whch Er denotes the relatonshp edge set and Ea denotes the attrbute edge set. If the value of v s only belongng to the entty node v under the condton ( v, v) Ea,then v and v must be assgned to the same storage node n the process of parttonng. For example, the node whch s the value of the attrbute edge telephone s only belongng to the entty student50. To mprove the effcency of parttonng, the paper combnes such nodes and ther correspondng entty to one node. B. Remove of Hgh Degree Nodes In the real applcatons, some nodes have hgh degrees whle others have low degrees. For example, over 90% nodes have only less than 5 neghbors whle some nodes wth more than 100, 000 neghbors n DBpeda [1]. Generally speang, when one node has more neghbors, t s quered more frequently, leadng to hgher communcaton cost [3]. To reduce the on operatons n the process of query, the paper removes the nodes wth hgh degrees before parttonng. After parttonng, the removed nodes wll be stored nto such the storage nodes wth the correspondng nodes. Ve III. SIMILARITY MEASURE Smlarty measurement s the ey to the graph clusterng technque. Ths secton descrbes two metrcs to calculate the smlarty between node pars. A. Adacency Measure In ths paper, we propose a metrc based on adacency. The smlarty measurement based on adacency s wdely used n many graph parttonng algorthms to group nodes, whch are hghly connected [13]. The dea of adacency measure can be expressed as f the neghbors of one node u are connected wth another node v, then the nodes u and v have tghter connecton. In contrast, the nodes u and v wll become looser. Furthermore, the shortest dstance between two nodes also affect the smlarty. We use l to denote the shortest dstance between two nodes. The weght between two nodes s defned as follows: wuv 1/ l (1) The smlarty between two nodes s defned as follows: smlarty(u,v) nt err(u,v) Nr(u) w w,where s the neghbor set wthn the radus r of the node and nt err ( u, v) N r ( u) N r ( v) s the ntersecton of the neghbor sets of the node u and v. For nstance, the smlarty between u and v s as shown n Fg., s(u, v) =0.667 and s(u, m) =0.5. Whereas, s(a, b) = and s(a, d)=0.5714, whch have the same number of neghbors, but they have dfferent weghts, resultng n the dfferent smlartes. Here, the computaton of smlarty neglects the drecton of edges. u Fg. An example of adacency measure B. Interactve Edge Measure Interactve edge s the edge generated by the nodes n two dfferent partton. One prncple of RDF graph parttonng s to mnmze the nteractve edge. In hybrd herarchcal clusterng, from the second layer, the data blocs n each layer are clusterng collectons of the prevous layer. In order to mnmze the nteractve edges among the compute nodes, we assume that the more the number of nteractve edges between two data blocs, the larger the smlarty s, and the data blocs are easly clustered nto a collecton. Gven two data blocs C and C, cut( C, C ) represents the number of nteractve edges. As shown n Fg.3, n the frst layer, the nodes {a, b, c, d, e, f} are clustered nto a data bloc C1 and the nodes {g, h,, } are clustered nto another data blocc,socut( C1, C) 5. Fg. 3 Example of nteractve edge We use a lnear functon converson to normalze the smlarty between data blocs, where cut ( ) mn C and cut ( ) max C denote the mnmum and maxmum number of nteractve edges n data blocs. v () 1756

3 cut( C, C )! cutmn ( C) smlarty( C, C ) (3) cut ( C )! cut ( C ) max IV. ALGORITHM In ths secton, we frst present the framewor of algorthm, whch smultaneously consders both nteractve edge and balance. Then, the detaled steps are descrbed. The framewor of algorthm s shown n Fg.4. mn Fg. 4 Framewor of algorthm A. Herarchcal AP Clusterng AP clusterng algorthm was proposed by Brendan J. Frey and Delbert Duec, whch s a new clusterng algorthm. Compared wth the tradtonal clusterng algorthms, AP clusterng does not requre a pre-specfed number of clusters, and t teratvely transmts real-valued messages among the data ponts to gradually fnd the potental exemplars for formng a collecton of hgh qualty exemplars. AP clusterng algorthm taes a smlarty matrx between data ponts as an nput, where the smlarty s (, ) ndcates how well the data pont wth ndex s suted to be the exemplar for data pont [13]. AP teratvely updatng a responsblty matrx R, sent from data pont to canddate exemplar pont, whch reflects the accumulated evdence for how well-suted pont s to serve as the exemplar for pont, consderng other potental exemplars for pont, and an avalablty matrx A, sent from canddate exemplar pont to pont, whch reflects the accumulated evdence for how approprate t would be for pont to choose pont as ts exemplar, tang nto account the support from other ponts that pont should be an exemplar. To begn wth, the avalabltes matrx A are ntalzed to zero. Then, the responsbltes are computed usng the rule. r (, ) s (, )! max{ a (, )( s (, )} (4) The followng avalablty update gathers evdence from data ponts as to whether each canddate exemplar would mae a good exemplar: a r r (, ) mn{0, (, ) ( max{0, (, )}} # {, } a (, ) max{0, r (, )} (5) In each teraton, there are generally n data pars whose responsblty and avalablty values need to be calculated, thus the computaton complexty s On ( ). When the number of teratons s T, the computaton complexty s OTn ( ). Ths greatly affects the performance especally when the data are large. It has been ponted out n [14] that the sparsty of the constructed graph wll lead to faster calculaton snce the nformaton propagaton only needs to be performed on the exstng edges. So for a sparse smlarty matrx, the complexty wll up to OTn ( )[15]. The goal of RDF graph parttonng s to cluster the closely connected nodes n a compute node. So the smaller smlarty between two nodes, the less possblty two nodes cluster together. In order to mprove the effcency of clusterng, we set the smlarty equal to!) f the smlarty values less than. As a result, the sparsty of the matrx mproved and tme complexty n AP clusterng s reduced. Algorthm 1: Herarchcal AP Clusterng Algorthm(HC) Input: RDF graph G ( V, E) cluster threshold T Output: Cluster set C C1, C,, Cm%,where m& T Method: 1. Construct a sparse smlarty S based on (). Execute AP clusterng on matrx S and generate m clusters 3. If m T, compute the smlarty among the m clusters usng (3) to produce a new smlarty matrx S 4. Tae S as a new nput contnue to run, untl m& T B. Parttonng-balance Adustment The balance of the subgraph scales plays an mportant role n dstrbuted storage and query when a huge graph s parttoned nto several subgraphs. Specally, a bad balance wll lead to a low effcency of query. Therefore, to preserve the balance, the parttonng-balance wll be adusted n each clusterng. Gven a graph G ( V, E), we dvde the graph nto parttons P { P1, P,, P }. The parttonng-balance of a -way parttons should satsfy 1! e1 & PB & 1( e,where PB V / m and m V /. The parttonng-balance adustment approach s outlned n Algorthm. Algorthm : Parttonng-balance Adustment Input: Partton P { P1, P,, P } Output: New parttonng-balance P { P1, P,, P t } Method: 1. Compute parttonngbalance PB { PB1, PB,, PB }.. For p =1 to 1757

4 f PB 1! e1 merge( P, P ),where Cut( P, P ) s max and PB 1( e. else f PB 1( e Partton P use KL algorthm untl1! e & P & 1( e. 1 C. Fnal Parttonng Durng the Herarchcal AP clusterng, we use the AP clusterng to gradually reduce the scale of the RDF graph. But the number of clusters n AP clusterng s uncertan because of the nfluence of preferences. When the compute node s specfed, AP clusterng cannot acheve the fnal partton. In ths phase, we use K-means clusterng method to perform the fnal clusterng. Algorthm 3: K-means Clusterng Input: nteractve edge matrx S, the number of clusters Output: cluster set C C1, C,, C% Method: 1. Select ntal cluster centers C c, c,, c %. 1. Assgn each data obect to ts closest center accordng to smlarty between data obects and cluster centers. 3. Update the cluster centers. 1) Compute the average vector Sv ( ) of clusterc. 1 Sv ( ) Sv (, v), + v V C v C ) Fnd the new cluster center c. c arg mn S( v )! S( v ) v C 4. Repeat step and 3, untl the clusterng obectve E functon converges. 1 v C E S( v )! S( v ) D. Complexty Analyss There are three man operatons n the desgned algorthm. The frst one s the smlarty calculaton wth the tme complexty of On ( ). The second one s the Herarchcal AP clusterng wth the tme complexty of OTn ( ),wheretsthe number of teratons. The last one s the -means clusterng wth the tme complexty of OKTm ( ), where K s the number of clusters and m s the number of clusterng collectons. Therefore, the total tme complexty of s approxmatelyon ( ). A. Datasets and Envronment In ths secton, we present a set of experments on the Lehgh Unversty Benchmar (LUBM) [16] and the DBLP Bblography datasets [17]. The number of edges and nodes are shown n Table I. All experments were performed on an GHz PC runnng Wndows XP wth 4GB man memory. TABLE ISTATISTICS OF DATASETS USED IN EXPERIMENTS Degree V E Frst merge mn max LUBM DBLP B. Interactve Edge Rato and the Tme Performance Gven a graph G ( V, E), we dvde the graph nto clusters P 1 ( V 1, E 1 ), P ( V, E ),..., P ( V, E), then the nteractve edge rato can be descrbed as follows: cut( P, G \ P) 1 IER (6) E,where cut( P, G \ P) s the number of edges between P and the rest of the graph. The adacency measurement radus r s set to n the experment. Table II shows the number of removed nodes and edges under r= of dfferent datasets. Fg. 5 shows the nteractve edge rato on dfferent dataset and Fg. 6 presents runtme of the partton. TABLE II STATISTICS OF REMOVE NODES AND EDGES LUBM DBLP Remove condton Remove nodes Remove edges > > > > > > > > Interactve edge rato LUBM DBLP Fg. 5 Interactve edge rato on dfferent remove condton V. EXPERIMENTS 1758

5 Runtme(sec) LUBM DBLP Runtme(sec) 15 x Fg. 6 Partton effcency on dfferent remove condton From the result, we can see that the nteractve edge rato decreases and the tme performance mproves wth the remove of hgh degree nodes. s a multlevel graph parttonng method, whch uses a Fast Samplng algorthm to coarsen the nput sparse graph and chooses a small number of fnal representatve exemplars. For these exemplars, uses a densty-weghted spectral clusterng to partton the exemplars. Fnally, all data ponts can be assgned through the correspondng representatve exemplars [10]. In ths paper we compared algorthm and algorthm n nteractve edge rato and runtme. The results are shown n Fg.7 and Fg.8. Fg.7 (a) and (b) present the nteractve edge rato n LUBM and DBLP dataset respectvely. It can be seen that the nteractve edge rato of algorthm s hgher than. Interactve edge rato Interactve edge rato (a) LUBM (b) DBLP Fg. 7 Edge-cut on dfferent datasets Runtme(sec) (a) LUBM 10 x (b) DBLP Fg. 8 Partton effcency on dfferent datasets C. Parttonng-Balance Table III and Table IV shown the comparson of parttonng-balance between and n LUBM and DBLP datasets. We calculate PBmax and PBmn usng (10) and (11). PBmax max( V ) / m (10) PBmn mn( V ) / m (11) From Table III and Table IV, the balance obtaned by the proposed algorthm s better than that obtaned by. Furthermore, the removng of the nodes wth hgh degree results n a good balance, whch demonstrates the effectveness of the proposed algorthm. TABLE III PARTITIONING- BALANCE OF LUBM Remove condton PB max PB mn PB max PB mn > > > > TABLE IV PARTITIONING-BALANCE OF DBLP Remove condton PBmax PBmn PBmax PBmn > > > > VI. CONCLUSION In ths paper, we proposed a balance RDF data parttonng algorthm based on hybrd herarchcal clusterng, ncludng AP clusterng and K-means algorthm. Frstly, for 1759

6 the sae of parallelzaton, balance adustment algorthm s used to ensure the balance n each level AP clusterng. Further, to mprove the effcency of RDF graph parttonng, hgh degree nodes are removed before clusterng. Fnally, the expermental results demonstrate that our proposed algorthm outperforms by a better balance. REFERENCES [1] RDF model and syntax specfcaton / [] F. Du, Y. Chen, X. Du, et al, Survey of RDF query processng technques, Journal of Software, vol. 4, no. 6, pp: 1-141, 013. [3] K. Zeng, J. Yang, H. Wang, et al, A dstrbuted graph engne for web scale RDF data, Proceedngs of the VLDB Endowment, vol. 6, no. 4, pp: 65-76, 013. [4] Q. Zhang, Z. Chen, A weghted ernel possblstc c-means algorthm based on cloud computng for clusterng bg data, Internatonal Journal of Communcaton Systems, vol. 7, no. 9, pp: , 014. [5] K. Lee, L. Lu, Y. Tang, et al, Effcent and customzable data parttonng framewor for dstrbuted bg RDF data processng n the cloud, 013 IEEE Sxth Internatonal Conference on Cloud Computng (CLOUD), pp: , 013. [6] J.Huang,D.Abad,K.Ren, ScalableSPARQLqueryngoflargeRDF graph, Proc. of the VLDB endowment, vol. 4, no. 11, pp: , 011. [7] M. Heath, P. Raghavan, A Cartesan parallel nested dssecton algorthm, SIAM Journal on Matrx Analyss and Applcatons, vol. 16, no. 1, pp: 35-53, [8] A. Pothen, H. Smon, K. Lou, Parttonng sparse matrces wth egenvectors of graphs, SIAM Journal on Matrx Analyss and Applcatons, vol. 11, no. 3, pp: , [9] METIS, [10] F. Shang, L. Jao, J. Sh, et al, Fast affnty propagaton clusterng: A multlevel approach, Pattern recognton, vol. 45, no. 1, pp: , 01. [11] R. Ma, X. Wang, J. Dng, Multlevel core-sets based aggregaton clusterng algorthm, Journal of Software, vol. 4, no. 3, pp: , 013. [1] S. Auer, C. Bzer, G. Koblarov, et al, DBpeda: A nucleus for a web of open data, Sprnger Berln Hedelberg, 007. [13] S. Schaeffer, Graph clusterng, Computer Scence Revew, vol. 1, no. 1, pp: 7-64, 007. [14] B. Frey, D. Duec. Clusterng by passng messages between data ponts, Scence, vol. 315, no. 5814, pp: , 007. [15] Y. Ja, J. Wang, C. Zhang, et al, Fndng mage exemplars usng fast sparse affnty propagaton, Proceedngs of the 16th ACM nternatonal conference on Multmeda, ACM, pp: , 008. [16] Y. Guo, Z. Pan, J. Hefln, LUBM: A benchmar for OWL nowledge base systems, Web Semantcs: Scence, Servces and Agents on the World Wde Web, vol. 3, no., pp: , 005. [17] J. Tang, J.Sun,C.Wang, et al, Socal nfluence analyss n large-scale networs, Proceedngs of the 15th ACM SIGKDD nternatonal conference on Knowledge dscovery and data mnng ACM, pp: ,

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

A Shadow Detection Method for Remote Sensing Images Using Affinity Propagation Algorithm

A Shadow Detection Method for Remote Sensing Images Using Affinity Propagation Algorithm Proceedngs of the 009 IEEE Internatonal Conference on Systems, Man, and Cybernetcs San Antono, TX, USA - October 009 A Shadow Detecton Method for Remote Sensng Images Usng Affnty Propagaton Algorthm Huayng

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

A Clustering Algorithm for Chinese Adjectives and Nouns 1

A Clustering Algorithm for Chinese Adjectives and Nouns 1 Clusterng lgorthm for Chnese dectves and ouns Yang Wen, Chunfa Yuan, Changnng Huang 2 State Key aboratory of Intellgent Technology and System Deptartment of Computer Scence & Technology, Tsnghua Unversty,

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Application of Clustering Algorithm in Big Data Sample Set Optimization

Application of Clustering Algorithm in Big Data Sample Set Optimization Applcaton of Clusterng Algorthm n Bg Data Sample Set Optmzaton Yutang Lu 1, Qn Zhang 2 1 Department of Basc Subjects, Henan Insttute of Technology, Xnxang 453002, Chna 2 School of Mathematcs and Informaton

More information

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem An Effcent Genetc Algorthm wth Fuzzy c-means Clusterng for Travelng Salesman Problem Jong-Won Yoon and Sung-Bae Cho Dept. of Computer Scence Yonse Unversty Seoul, Korea jwyoon@sclab.yonse.ac.r, sbcho@cs.yonse.ac.r

More information

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China for Database Clusterng Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal: 6085@qq.com Me Zhang Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal:64605455@qq.com Database clusterng

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Adaptive Energy and Location Aware Routing in Wireless Sensor Network

Adaptive Energy and Location Aware Routing in Wireless Sensor Network Adaptve Energy and Locaton Aware Routng n Wreless Sensor Network Hong Fu 1,1, Xaomng Wang 1, Yngshu L 1 Department of Computer Scence, Shaanx Normal Unversty, X an, Chna, 71006 fuhong433@gmal.com {wangxmsnnu@hotmal.cn}

More information

Research Article A High-Order CFS Algorithm for Clustering Big Data

Research Article A High-Order CFS Algorithm for Clustering Big Data Moble Informaton Systems Volume 26, Artcle ID 435627, 8 pages http://dx.do.org/.55/26/435627 Research Artcle A Hgh-Order Algorthm for Clusterng Bg Data Fanyu Bu,,2 Zhku Chen, Peng L, Tong Tang, 3 andyngzhang

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Efficient Distributed File System (EDFS)

Efficient Distributed File System (EDFS) Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Graph-based Clustering

Graph-based Clustering Graphbased Clusterng Transform the data nto a graph representaton ertces are the data ponts to be clustered Edges are eghted based on smlarty beteen data ponts Graph parttonng Þ Each connected component

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

Routability Driven Modification Method of Monotonic Via Assignment for 2-layer Ball Grid Array Packages

Routability Driven Modification Method of Monotonic Via Assignment for 2-layer Ball Grid Array Packages Routablty Drven Modfcaton Method of Monotonc Va Assgnment for 2-layer Ball Grd Array Pacages Yoch Tomoa Atsush Taahash Department of Communcatons and Integrated Systems, Toyo Insttute of Technology 2 12

More information

Study of Data Stream Clustering Based on Bio-inspired Model

Study of Data Stream Clustering Based on Bio-inspired Model , pp.412-418 http://dx.do.org/10.14257/astl.2014.53.86 Study of Data Stream lusterng Based on Bo-nspred Model Yngme L, Mn L, Jngbo Shao, Gaoyang Wang ollege of omputer Scence and Informaton Engneerng,

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems Real-tme Fault-tolerant Schedulng Algorthm for Dstrbuted Computng Systems Yun Lng, Y Ouyang College of Computer Scence and Informaton Engneerng Zheang Gongshang Unversty Postal code: 310018 P.R.CHINA {ylng,

More information

A Similarity Measure Method for Symbolization Time Series

A Similarity Measure Method for Symbolization Time Series Research Journal of Appled Scences, Engneerng and Technology 5(5): 1726-1730, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scentfc Organzaton, 2013 Submtted: July 27, 2012 Accepted: September 03, 2012

More information

Clustering is a discovery process in data mining.

Clustering is a discovery process in data mining. Cover Feature Chameleon: Herarchcal Clusterng Usng Dynamc Modelng Many advanced algorthms have dffculty dealng wth hghly varable clusters that do not follow a preconceved model. By basng ts selectons on

More information

Face Recognition Method Based on Within-class Clustering SVM

Face Recognition Method Based on Within-class Clustering SVM Face Recognton Method Based on Wthn-class Clusterng SVM Yan Wu, Xao Yao and Yng Xa Department of Computer Scence and Engneerng Tong Unversty Shangha, Chna Abstract - A face recognton method based on Wthn-class

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments Comparson of Heurstcs for Schedulng Independent Tasks on Heterogeneous Dstrbuted Envronments Hesam Izakan¹, Ath Abraham², Senor Member, IEEE, Václav Snášel³ ¹ Islamc Azad Unversty, Ramsar Branch, Ramsar,

More information

A Saturation Binary Neural Network for Crossbar Switching Problem

A Saturation Binary Neural Network for Crossbar Switching Problem A Saturaton Bnary Neural Network for Crossbar Swtchng Problem Cu Zhang 1, L-Qng Zhao 2, and Rong-Long Wang 2 1 Department of Autocontrol, Laonng Insttute of Scence and Technology, Benx, Chna bxlkyzhangcu@163.com

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Maintaining temporal validity of real-time data on non-continuously executing resources

Maintaining temporal validity of real-time data on non-continuously executing resources Mantanng temporal valdty of real-tme data on non-contnuously executng resources Tan Ba, Hong Lu and Juan Yang Hunan Insttute of Scence and Technology, College of Computer Scence, 44, Yueyang, Chna Wuhan

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Alignment Results of SOBOM for OAEI 2010

Alignment Results of SOBOM for OAEI 2010 Algnment Results of SOBOM for OAEI 2010 Pegang Xu, Yadong Wang, Lang Cheng, Tany Zang School of Computer Scence and Technology Harbn Insttute of Technology, Harbn, Chna pegang.xu@gmal.com, ydwang@ht.edu.cn,

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

An Improved Spectral Clustering Algorithm Based on Local Neighbors in Kernel Space 1

An Improved Spectral Clustering Algorithm Based on Local Neighbors in Kernel Space 1 DOI: 10.98/CSIS110415064L An Improved Spectral Clusterng Algorthm Based on Local Neghbors n Kernel Space 1 Xnyue Lu 1,, Xng Yong and Hongfe Ln 1 1 School of Computer Scence and Technology, Dalan Unversty

More information

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Positive Semi-definite Programming Localization in Wireless Sensor Networks Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

Research of Dynamic Access to Cloud Database Based on Improved Pheromone Algorithm

Research of Dynamic Access to Cloud Database Based on Improved Pheromone Algorithm , pp.197-202 http://dx.do.org/10.14257/dta.2016.9.5.20 Research of Dynamc Access to Cloud Database Based on Improved Pheromone Algorthm Yongqang L 1 and Jn Pan 2 1 (Software Technology Vocatonal College,

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals nkselector: A Web Mnng Approach to Hyperlnk Selecton for Web Portals Xao Fang and Olva R. u Sheng Department of Management Informaton Systems Unversty of Arzona, AZ 8572 {xfang,sheng}@bpa.arzona.edu Submtted

More information

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment Journal of Physcs: Conference Seres PAPER OPEN ACCESS Resource and Vrtual Functon Status Montorng n Network Functon Vrtualzaton Envronment To cte ths artcle: MS Ha et al 2018 J. Phys.: Conf. Ser. 1087

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions Sortng Revew Introducton to Algorthms Qucksort CSE 680 Prof. Roger Crawfs Inserton Sort T(n) = Θ(n 2 ) In-place Merge Sort T(n) = Θ(n lg(n)) Not n-place Selecton Sort (from homework) T(n) = Θ(n 2 ) In-place

More information

A high precision collaborative vision measurement of gear chamfering profile

A high precision collaborative vision measurement of gear chamfering profile Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) A hgh precson collaboratve vson measurement of gear chamferng profle Conglng Zhou, a, Zengpu Xu, b, Chunmng

More information

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton

More information

From Comparing Clusterings to Combining Clusterings

From Comparing Clusterings to Combining Clusterings Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (008 From Comparng Clusterngs to Combnng Clusterngs Zhwu Lu and Yuxn Peng and Janguo Xao Insttute of Computer Scence and Technology,

More information

A Multilevel Analytical Placement for 3D ICs

A Multilevel Analytical Placement for 3D ICs A Multlevel Analytcal Placement for 3D ICs Jason Cong, and Guoje Luo Computer Scence Department Unversty of Calforna, Los Angeles Calforna NanoSystems Insttute Los Angeles, CA 90095, USA Tel : (30) 06-775

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Two-Stage Data Distribution for Distributed Surveillance Video Processing with Hybrid Storage Architecture

Two-Stage Data Distribution for Distributed Surveillance Video Processing with Hybrid Storage Architecture Two-Stage Data Dstrbuton for Dstrbuted Survellance Vdeo Processng wth Hybrd Storage Archtecture Yangyang Gao, Hatao Zhang, Bngchang Tang, Yanpe Zhu, Huadong Ma Bejng Key Lab of Intellgent Telecomm. Software

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

An efficient iterative source routing algorithm

An efficient iterative source routing algorithm An effcent teratve source routng algorthm Gang Cheng Ye Tan Nrwan Ansar Advanced Networng Lab Department of Electrcal Computer Engneerng New Jersey Insttute of Technology Newar NJ 7 {gc yt Ansar}@ntedu

More information

Efficient Content Distribution in Wireless P2P Networks

Efficient Content Distribution in Wireless P2P Networks Effcent Content Dstrbuton n Wreless P2P Networs Qong Sun, Vctor O. K. L, and Ka-Cheong Leung Department of Electrcal and Electronc Engneerng The Unversty of Hong Kong Pofulam Road, Hong Kong, Chna {oansun,

More information

Cluster Feature-Based Incremental Clustering Approach (CFICA) For Numerical Data

Cluster Feature-Based Incremental Clustering Approach (CFICA) For Numerical Data JCSNS nternatonal Journal of Computer Scence and Network Securty, VOL.0 No.9, September 00 73 Cluster Feature-ased ncremental Clusterng pproach (CFC For Numercal ata.m.sowanya and M.Shash, epartment of

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Hybrid Non-Blind Color Image Watermarking

Hybrid Non-Blind Color Image Watermarking Hybrd Non-Blnd Color Image Watermarkng Ms C.N.Sujatha 1, Dr. P. Satyanarayana 2 1 Assocate Professor, Dept. of ECE, SNIST, Yamnampet, Ghatkesar Hyderabad-501301, Telangana 2 Professor, Dept. of ECE, AITS,

More information

Estimating Costs of Path Expression Evaluation in Distributed Object Databases

Estimating Costs of Path Expression Evaluation in Distributed Object Databases Estmatng Costs of Path Expresson Evaluaton n Dstrbuted Obect Databases Gabrela Ruberg, Fernanda Baão, and Marta Mattoso Department of Computer Scence COPPE/UFRJ P.O.Box 685, Ro de Janero, RJ, 2945-970

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Spatial Data Dynamic Balancing Distribution Method Based on the Minimum Spatial Proximity for Parallel Spatial Database

Spatial Data Dynamic Balancing Distribution Method Based on the Minimum Spatial Proximity for Parallel Spatial Database JOURNAL OF SOFTWARE, VOL. 6, NO. 7, JULY 211 1337 Spatal Data Dynamc Balancng Dstrbuton Method Based on the Mnmum Spatal Proxmty for Parallel Spatal Database Yan Zhou College of Automaton Unversty of Electrc

More information

1. Introduction. Abstract

1. Introduction. Abstract Image Retreval Usng a Herarchy of Clusters Danela Stan & Ishwar K. Seth Intellgent Informaton Engneerng Laboratory, Department of Computer Scence & Engneerng, Oaland Unversty, Rochester, Mchgan 48309-4478

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Querying by sketch geographical databases. Yu Han 1, a *

Querying by sketch geographical databases. Yu Han 1, a * 4th Internatonal Conference on Sensors, Measurement and Intellgent Materals (ICSMIM 2015) Queryng by sketch geographcal databases Yu Han 1, a * 1 Department of Basc Courses, Shenyang Insttute of Artllery,

More information

A new segmentation algorithm for medical volume image based on K-means clustering

A new segmentation algorithm for medical volume image based on K-means clustering Avalable onlne www.jocpr.com Journal of Chemcal and harmaceutcal Research, 2013, 5(12):113-117 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCRC5 A new segmentaton algorthm for medcal volume mage based

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY SSDH: Sem-supervsed Deep Hashng for Large Scale Image Retreval Jan Zhang, and Yuxn Peng arxv:607.08477v2 [cs.cv] 8 Jun 207 Abstract Hashng

More information

HCMX: AN EFFICIENT HYBRID CLUSTERING APPROACH FOR MULTI-VERSION XML DOCUMENTS

HCMX: AN EFFICIENT HYBRID CLUSTERING APPROACH FOR MULTI-VERSION XML DOCUMENTS HCMX: AN EFFICIENT HYBRID CLUSTERING APPROACH FOR MULTI-VERSION XML DOCUMENTS VIJAY SONAWANE 1, D.RAJESWARA.RAO 2 1 Research Scholar, Department of CSE, K.L.Unversty, Green Felds, Guntur, Andhra Pradesh

More information

Analyzing Popular Clustering Algorithms from Different Viewpoints

Analyzing Popular Clustering Algorithms from Different Viewpoints 1000-9825/2002/13(08)1382-13 2002 Journal of Software Vol.13, No.8 Analyzng Popular Clusterng Algorthms from Dfferent Vewponts QIAN We-nng, ZHOU Ao-yng (Department of Computer Scence, Fudan Unversty, Shangha

More information

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) Clusterng Algorthm Combnng CPSO wth K-Means Chunqn Gu, a, Qan Tao, b Department of Informaton Scence, Zhongka

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information