ApproxMGMSP: A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed System

Size: px
Start display at page:

Download "ApproxMGMSP: A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed System"

Transcription

1 ApproxMGMSP: A Scalable Method of Mnng Approxmate Multdmensonal Sequental Patterns on Dstrbuted System Changha Zhang, Kongfa Hu, Zhux Chen, Lng Chen Department of Computer Scence and Engneerng, Yangzhou Unversty, Yangzhou ,Chna Ysheng Dong Department of Computer Scence and Engneerng, Southeast Unversty, Nanjng ,Chna Abstract We present a scalable and effectve algorthm called ApproxMGMSP (Approxmate Mnng of Global Multdmensonal Sequental Patterns) to solve the problem of mnng the multdmensonal sequental patterns for large databases n the dstrbuted envronment. Our method dffers from prevous related works of mnng multdmensonal patterns on dstrbuted system. The man dfference s that an approxmate mnng method s used n large multdmensonal sequence database frstly. In ths paper, to convert the mnng on the multdmensonal sequental patterns to sequental patterns, the multdmensonal nformaton s embedded nto the correspondng sequences. Then the sequences are clustered, summarzed, and analyzed on the dstrbuted stes, and the local patterns could be obtaned by the effectve approxmate sequental pattern mnng method. Fnally, the global multdmensonal sequental patterns could be quckly mned by hgh vote sequental pattern model after collectng all the local patterns on one ste. Both the theores and the experments ndcate that ths method could smplfy the problem of mnng the multdmensonal sequental patterns and avod mnng the redundant nformaton. The global sequental patterns could be obtaned effectvely by the scalable method after reducng the cost of communcaton. 1. Introducton Sequental pattern mnng has become an essental data mnng task, wth broad applcatons, ncludng web log analyss, market and customer analyss, pattern dscovery n proten sequences, and mnng XML query access patterns for cachng. However, mnng multdmensonal sequental patterns could extract more useful nformaton than mnng sequental patterns. At present, databases and data warehouses wth huge amount of data make data mnng on PC not very effectve, especally can not make the need of the ablty of data process on functon and performance. In actual applcatons, most large nformaton systems are dstrbuted, such as the data access of large nterregonal shoppng markets. So, dstrbuted multdmensonal patterns mnng s proposed n order to deal wth ths problem frstly. At present, many multdmensonal sequental pattern mnng-related researches have been advanced. such as the well-known algorthms UnSeq, PSFP and HYBRID[1]. However, the overall performance of these algorthms s not hgh n mnng global multdmensonal patterns for the large amount of data scattered n dstrbuted envronment. So the ssue only can be solved by the dstrbuted or parallel data mnng technology. In 2003, S.C. Zhang proposed the technque of dstrbuted mnng of mult-database[2] to resolve the problem, and then the methods of global assocaton rule mnng[3] and exceptonal sequental patterns mnng[4] n dfferent data sources were also proposed. Recently H.C. Kum has also proposed the method of mnng global sequental patterns[5] n mult-database. Tradtonal methods of mnng sequental patterns are to fnd all the patterns that satsfy the user-specfed mnmum support threshold, such as the well-known algorthms GSP[6], Prefxspan[7], SPADE[8] and so on. However, these sequental patterns mnng algorthms based on support have some nherent lmtatons. So, we propose a novel method of mnng approxmate multdmensonal sequental patterns on dstrbuted system. Our experments ndcate that the method smplfy the process of mnng multdmensonal sequental patterns and solve the problem of hgh dmenson effectvely. The global multdmensonal sequental patterns could be obtaned effectvely by reducng the redundant nformaton.

2 2. Problem formulaton Assume that there are n stes S 1,S 2,,S n n the dstrbuted envronment and the multdmensonal sequence database MSDB s parttoned over the n stes nto {MSDB 1,MSDB 2,,MSDB n }, respectvely. Let the ndependent computer on each ste can communcate each other. Gven schema MSDB (TID, A 1,,A m, S) s a multdmensonal sequence database, where TID s a prmary key, A 1,,A m s multdmensonal nformaton and S are sequences. Let * be any value belong to any doman of A 1,,A m. A multdmensonal sequence takes the form of (a 1,,a m,s), where a ( A {*} ) for(1 m) and s s a sequence. Defnton 1. Gven a local sequence database DB x, let dst (seq,seq j ) be the dstance measure for seq and seq j (0<dst(seq,seq j ) <1), and DB x can be parttoned nto smlarty clusters G x1 G xn such thatσ j dst (seq a, seq jb ) s maxmzed and Σ j dst (seq a, seq jb ) s mnmzed where seq a G x,seq jb G xj. Defnton 2. Let G x1,, G xn be smlarty clusters for a local database DB x, an approxmate sequental pattern for group G x, denoted as lpat x, s a sequence that mnmzes dst (lpat x,seq a ) for all seq a n smlarty group G x. Defnton 3. Let the set M be approxmate sequental patterns on all stes, ts subset HS s a homogeneous set of range γ when the smlarty between any two patterns p and p j n HS s not less than γ, p HS Λ p j HS Λ sm(p,p j ) γ, where sm(p,p j )1-dst(p,p j ). Defnton 4. The vote of a homogeneous set HS s defned as the sze of the homogenous set. Vote(HS, γ) HS(γ). Defnton 5. Let γ and Ө be desred smlarty level and threshold correspondngly, a hgh vote homogenous set s a homogeneous set HS such that Vote(HS, γ) Ө. Gven a hgh vote homogenous set, the hgh vote sequental pattern s the longest common subsequence of all local patterns n the set. Defnton 6. Gven a schema WS<X 1 : v 1,,X l :v l >:n, WS s a weghted sequence when carryng the followng nformaton: the current algnment has n sequences, v sequences have a non-empty temset X algned n the th temset, where(1< <l), and an temset n the algnment s n the form of X (x j1 :w j1,...,x jm :w jm ), whch means, n the current algnment, there are w jk sequences that have tem x jk n the th poston of the algnment, where (1< <l) and (1<k<m). Gven the,ג that s specfed by users, f w jk /n ג mnmum degree then x jk can be collected for obtanng approxmate sequental patterns. 3. Multdmensonal sequental patterns mnng on dstrbuted system 3.1 Embeddng multdmensonal nformaton nto sequences Inspred by UnSeq, for a tuple n the multdmensonal sequence database, the multdmensonal nformaton could be embedded nto the correspondng sequence through ntroducng a specal element. So, the problem of mnng s predgested by convertng the mnng n both nformaton of the dmenson and sequence to the mnng only n the sequence. For example, gven a tuple q(10,busness, Chcago, Mddle, <(bd)cb(ac)>), the multdmensonal nformaton (Busness, Chcago, Mddle) could be embedded nto the correspondng sequence<(bd)cb(ac)>as the frst element. That s to say, the sequence x <(bd)cb(ac)> n q could be extended to y <(Busness Chcago Mddle)(bd)cb(ac)>. Ths method could convert the mnng of the sequence n multdmensonal sequence database to the mnng of the extended sequence n the extended sequence database. In the same way, the multdmensonal nformaton could be embedded nto the correspondng sequence as the last element. Now, let us verfy approxmate multdmensonal sequental pattern mnng usng the extended database. Theorem 1. Gven a multdmensonal sequence database MSDB and extended database ESDB. A multdmensonal sequence t(a 1,,a n,s)s an approxmate sequental pattern n MSDB f and only f sequence t 1 <(a 1,,a n ),s> s an approxmate sequental pattern n ESDB. Proof. If a multdmensonal sequence t(a 1,,a n,s)s an approxmate sequental pattern n MSDB, then the levenshten dstance dst(t,seq) s mnmum for all seq n smlarty group G. So, the dst(t 1,seq) s also mnmum by calculatng the levenshten dstance(algorthm 1), that s to say, the sequence t 1 <(a 1,,a n ),s> s an approxmate sequental pattern n ESDB. In the same way, we can educe that the multdmensonal sequence t(a 1,,a n,s) s an approxmate sequental pattern n MSDB. 3.2 Multdmensonal sequence mnng The goal of the multdmensonal sequental pattern mnng n the dstrbuted envronment s to reduce cost of the communcaton n the network. Though we can get hgh performance by the tradtonal method of mnng patterns wth low dmenson, the effcency s very low when the dmenson s hgh for the need of

3 mnng the long sequental patterns. So we adopt the approxmate sequence mnng method for extended database n every staton. Frst the levenshten dstance s ntroduced whch s commonly used as a dstance measure for sequences. It s used to computng the mnmum cost of nsertng, deletng, and replacng when one sequence S s converted to another sequence T. Gven S<s 1,,s n > and T<t 1,,t m >, the levenshten dstance could be obtaned by the dynamc programmng and the followng crcle operatons. Algorthm 1. Calculatng levenshten dstance Input: Tow sequences S<s 1,,s n >, T<t 1,,t m >. Output: Levenshten dstance between S and T, dst(s,t) 1) If n 0, return m and ext. If m 0, return n and ext. Construct a matrx contanng m rows and n columns 2) Intalze the frst row to 0 n. Intalze the frst column to 0 m. 3) Examne each character of S ( from 1 to n). Examne each character of T (j from 1 to m). 4) If S[] equals T[j], the cost s 0. If S[] doesn't equal T[j], the cost s 1. 5) Set cell dst[,j] of the matrx equal to the mnmum of: a. The cell mmedately above plus 1: dst[-1,j] + 1. b. The cell mmedately to the left plus 1: dst[,j-1] + 1. c. The cell dagonally above and to the left plus the cost: dst[-1,j-1] + cost. 6) After the teraton steps (3, 4, 5, 6) are complete, the dstance s found n cell dst[n,m]. The normalzed levenshten dstance as Formula 1. Formula 1. dst( S, T) D ( S, T) max{ S, T } The normalzed set dfference s used to ft sequence of sets properly for measurng the dstance, as Formula 2. Formula 2. ( s t) ( t s) 2 s t Re pl( s, t) 1 s + t s t + t s + 2 s t We adopt a densty-based clusterng algorthm to cluster sequences. For each sequence s n the database S, let d 1,,d k be the k smallest non-zero values of D(s, s j ), where s j S, s s j, then Den(s ) n/d, dmax{ d 1,,d k },n { s j S D(s, s j ) d}. Algorthm 2. Unform kernel k-nn clusterng Input: A set of sequences {s }, the number of neghbor sequences k. Output: A set of clusters {C j }. 1) Generate ntal cluster. Set every sequence as a cluster, and Den(Cs )Den(s ). 2) Expand ntal cluster based on the densty of sequences. Set s 1,,s n be the nearest neghbor for s, for each s j {s 1,,s n },merge cluster Cs contanng s wth a cluster Cs j contanng s j, f Den(s ) < Den(s j ) and there exsts no s p havng D(s, s p ) < D(s, s j ) and Den(s ) < Den(s p ), set Den(new cluster) max{den(cs ),Den(Cs j )}. 3) Merge based on the densty of new clusters. Fnd sequences s such that Den(s )Den(s j ), merge the two clusters Cs and Cs j contanng each sequence f Den (Cs ) >Den(Cs j ). Sequences n every database are parttoned nto several groups by clusterng. All sequences are sorted wthn a group n densty descendng order, then the frst two sequences are compressed nto the weghted sequence ws 1 ; then a weghted replace cost s adopted to ensure that the dstance between the sequence assgned and the weghted sequence ws 1 s mnmum, as Formula 3, let ws(x 1 :w 1,,x m :w m ):v be an temset n a weghted sequence, and t(y 1,,y l ) s an temset n a sequence n the database. Let n be the global weght of the weghted sequence, the weghted sequence ws n-1 s obtaned by compressng sequences nto the correspondng weghted sequence, and then we could collect approxmate sequental patterns accordng to WS n-1. Formula 3. R v + n v REPL( ws, t) n R m 1 w + t v 2 m 1 w + t v x t The global multdmensonal sequences are obtaned by hgh vote sequental pattern model. Algorthm 3. Global multdmensonal sequence mnng Input: All local patterns L 1,,L n for stes 1,,n. Output: Global patterns G. 1) Collect all local patterns L 1,,L n to a ste, and generate homogeneous sets. 2) Collect hgh vote homogenous sets M from results n step one, and then generate global patterns G, that s the longest common subsequences n M. 3) Broadcast G to each ste. 4 Expermental evaluatons 4.1 Effectveness analyss of ApproxMGMSP w

4 For effectveness analyss of ApproxMGMSP, we adopt a general evaluaton method that can evaluate the accuracy of the approxmaton n terms of how well t fnds the real underlyng patterns n the data and whether or not t generates any spurous patterns. The datasets were generated by the well-known IBM data generator[9]. Base patterns were generated randomly accordng to the user s specfcaton. Then, these base patterns were corrupted and merged to generate the sequences n the database. Dmensonal nformaton was generated and merged randomly so that values were dstrbuted evenly n every dmenson. For evaluaton crtera, recoverablty R,Precson P, N redun : the number of redundant patterns, N spur : the number of base patterns, N max : the number of spurous patterns, L: the average length of sequence. Followng, Table1 and Table2 demonstrate how 7 of the most frequent 10 base patterns were uncovered from 1000 sequences usng ApproxMGMSP Table1. Base patterns 10 base patterns L B 0 <(B, X, D, Y)(20)(63 24)(2)(5)(2 74)(95)(96)> 13 B 1 <(F, A, Z, F)( )(16)( )(58 66) > 13 B 2 <(W, A, D, F)(6)( )( )(58)(22)> 14 B 3 <(W, L,D,Y)(62)(66)(76 31)(2 74)(58 99)(15)(16 66)> 15 <(G, H, C, Y)(63 99)(16)(22 58)(51)(66)(96)(50) B 4 19 (45 36) (94)(96 29)(18)> B 5 <(B, L, I, Y)(40 62)(15)(40)(29 40)(24 63)( )> 15 B 6 <(G, H, I, J)(23 96)(50)(2 22)(16)(58)(10 74)(51 63)> 15 B 7 <(W, X, D, O)(22)(58)(96)(88)(58 78)> 10 B 8 <(B, A, I, O)(22 41)(2 74)(31 76)(2 74)(22)(58 66)> 15 B 9 <(W, H, C, F)(2 22)(24)( )(50)(16)> 12 Table2. Local patterns Local patterns: approxmate sequental patterns L A 0 <(B, X, D, Y)(20)(63 24)(2)(5)(2 74)(95) > 12 A 1 <(F, A, Z, F)( )(16)( ) > 11 A 2 <(W, A, D, F)(6)( )( )(58) > 13 A 3 <(W, L, D, Y)(62)(66)(76 31)(2 74)(58 99)(15) > 13 <(G, H, C, Y)(63 99)(16)(22 58)(51)(66)(96)(50) A 4 (45 36) (94)(96 29)> 18 <(G, H,C,Y)(63 99)(16)(22 58)(51)(66)(96)(50)(45 A )> A 6 <(G, H, I, J)(23 96)(50)(2 22)(16)(58)(10 74)(51 63)> 13 A 7 <(W, X, C, O)(22)(58 66)(96)(88)(58 78)> 11 Clearly, 8 local patterns are generated and recover major parts of the base patterns wth hgh expected frequency n the database from 1000 sequences, each of the 8 approxmate patterns match a base pattern well. The recoverablty s excellent at 90.66%. The precson s qute good at P1-2/8797.7%. In all approxmate patterns, only 2 tems ((W, X, C, O) (58 66)) do not appear on the correspondng poston n the base pattern. There were no spurous patterns and only one redundant pattern A 5. Ths s because B 4 s too long, as a result of the long B 4, the sequences generated from a long base pattern B 4 can be parttoned nto multple clusters by ApproxMGMSP. To sum up, ApproxMGMSP s an effectve method of mnng multdmensonal sequental patterns. 4.2 Scalablty analyss of ApproxMGMSP The followng experments have been carred out to text the scalablty of ApproxMGMSP. Group 1, the recoverablty changes as dfferent sequence numbers on the average length of sequence L 20, the average tem length I 2.5, the number of tem 10000, the number of base pattern N seq 1000, the average length of base pattern L seq 14, the average tem length of base pattern Iseq 2, the number of neghbor sequences k4, the mnmum degree, 50% ג the results n Fgure 1. Group 2, the recoverablty changes as dfferent average lengths of sequence on N , I 2.5, 10000, N seq 1000, I seq 2, k4,, 50% ג the results n Fgure 2. Group 3, the executve tme of ApproxMGMSP changes as dfferent dmensons on N , L 20, I 2.5, 10000, N seq 1000, I seq 2, k4,, 50% ג the results n Fgure 3. Fgure1. Recoverablty vs. N Fgure2. Recoverablty vs. L

5 Fgure3. Runnng Tme vs. Dmenson Obvously, we observe that ApproxMGMSP s scalable wth respect to database sze from Fgure 1. The more the sequences n the database, the better the recoverablty. For a base pattern wth the same Probablty n sequences, the large the database sze, the more the approxmate sequental patterns, so there are more sequences smlar to base patterns, and the recoverablty s more ncreased. From Fgure 2, we could fnd that ApproxMGMSP s scalable wth respect to the average length of sequence. That s because the larger the average length of sequence, the more the repeated tems, so the recoverablty s more ncreased. We can see from Fgure 3, the executve tme decrease wth the ncreasng dmensons. As the ncreasng dmensons, the man task of the entre mnng process s to mne dmensonal nformaton ncreasngly, and dmensonal nformaton mnng does not need to fnd the mnmum dstance between the sequences by sequence comparson. So, wth the dmenson ncreasng, the runnng tme has decreased gradually. 5. Concluson and future work A scalable method s proposed n ths paper to mne multdmensonal sequental patterns effectvely. The multdmensonal nformaton s embedded nto the correspondng sequences to convert complex mnng on multdmensonal sequences to mnng on sequences n ths method. If the dmenson s low, we could adopt the mnng method based on support n every ste, and obtan global multdmensonal sequental patterns by collectng local patterns. But the tradtonal approach would have a lot of redundancy and short patterns, and dffcult to resolve long patterns when the dmenson s hgh. So, the method of mnng approxmate sequences s adopted to mne local patterns, and fnally collect global patterns by hgh vote sequental patterns. The experments show that ths scalable method not only smplfy the problem of mnng multdmensonal patterns, but also resolve the ssue of hgh dmenson. Although ths approach s very effcent for mnng multdmensonal sequental patterns n large databases n the dstrbuted envronment, t brngs a hgh degree of complexty. So, reducng complexty of ApproxMGMSP and the evaluaton of global sequental pattern mnng are our future researches. Acknowledgements: The research n the paper s supported by the Natonal Natural Scence Foundaton of Chna under Grant No ; the Natonal Facltes and Informaton Infrastructure for Scence and Technology of Chna under Grant No. 2004DKA20310; the Natural Scence Foundaton of Jangsu Provnce under Grant No. BK ; the Qng Lan Project Foundaton of Jangsu Provnce of Chna. References [1] H. Pnto, J. Han and J. Pe, Mult- dmensonal Sequental Pattern Mnng, In Proc. of the 10 th Int. Conf. on Informaton and Knowledge Management (CIKM), ACM, Atlanta, Georga, pp , November [2] S. Zhang, X. Wu, and C. Zhang, Mult-Database Mnng, IEEE Computatonal Intellgence Bulletn, Vol.2, No.1, pp. 5-13, June [3] X. Wu and S. Zhang, Syntheszng Hgh-Frequency Rules from Dfferent Data Sources, IEEE Transactons Knowledge Data Engneerng, Vol.15, No.1, pp , January [4] C. Zhang, M. Lu, W. Ne, and S. Zhang, Identfyng Global Exceptonal Patterns n Mult-database Mnng, IEEE Computatonal Intellgence Bulletn, Vol.3, No.1, pp , Feb [5] H.C. Kum, J.H. Chang, W. Wang, Sequental Pattern Mnng n Mult-Databases va Multple Algnment, Data Mnng & Knowledge Dscovery, Vol.12, No.1, pp , January [6] R. Srkant and R. Agrawal, Mnng Sequental Patterns: Generalzatons And Performance Improvements, In Proc. of the 5 th Int. Conf. on Extendng Database Technology (EDBT), Sprnger, Avgnon, France, pp. 3-17, March [7] J. Pe, J. Han, H. Pnto, Q. Chen and U. Dayal, PrefxSpan: Mnng Sequental Patterns Effcently by Prefx-Projected Pattern Growth, IEEE Transactons on Knowledge & Data Engneerng, Vol.16, No.1, pp , January [8] M. Zak, SPADE: An Effcent Algorthm for Mnng Frequent Sequences, Machne Learnng, Vol.42, No. 1/2, pp , January [9] R. Agrawal and R. Srkant, Mnng Sequental Patterns, In Proc. of the 11 th Int. Conf. on Data Engneerng (ICDE), IEEE Computer Socety, Tape, Tawan, pp. 3-14, March 1995.

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China

A Simple Methodology for Database Clustering. Hao Tang 12 Guangdong University of Technology, Guangdong, , China for Database Clusterng Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal: 6085@qq.com Me Zhang Guangdong Unversty of Technology, Guangdong, 0503, Chna E-mal:64605455@qq.com Database clusterng

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Positive Semi-definite Programming Localization in Wireless Sensor Networks Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems Determnng Fuzzy Sets for Quanttatve Attrbutes n Data Mnng Problems ATTILA GYENESEI Turku Centre for Computer Scence (TUCS) Unversty of Turku, Department of Computer Scence Lemmnkäsenkatu 4A, FIN-5 Turku

More information

Outline. CHARM: An Efficient Algorithm for Closed Itemset Mining. Introductions. Introductions

Outline. CHARM: An Efficient Algorithm for Closed Itemset Mining. Introductions. Introductions CHARM: An Effcent Algorthm for Closed Itemset Mnng Authors: Mohammed J. Zak and Chng-Ju Hsao Presenter: Junfeng Wu Outlne Introductons Itemset-Tdset tree CHARM algorthm Performance study Concluson Comments

More information

Association Rule Mining with Parallel Frequent Pattern Growth Algorithm on Hadoop

Association Rule Mining with Parallel Frequent Pattern Growth Algorithm on Hadoop Assocaton Rule Mnng wth Parallel Frequent Pattern Growth Algorthm on Hadoop Zhgang Wang 1,2, Guqong Luo 3,*,Yong Hu 1,2, ZhenZhen Wang 1 1 School of Software Engneerng Jnlng Insttute of Technology Nanng,

More information

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds

TF 2 P-growth: An Efficient Algorithm for Mining Frequent Patterns without any Thresholds TF 2 P-growth: An Effcent Algorthm for Mnng Frequent Patterns wthout any Thresholds Yu HIRATE, Ego IWAHASHI, and Hayato YAMANA Graduate School of Scence and Engneerng, Waseda Unversty {hrate, ego, yamana}@yama.nfo.waseda.ac.jp

More information

A Similarity Measure Method for Symbolization Time Series

A Similarity Measure Method for Symbolization Time Series Research Journal of Appled Scences, Engneerng and Technology 5(5): 1726-1730, 2013 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scentfc Organzaton, 2013 Submtted: July 27, 2012 Accepted: September 03, 2012

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment

A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment A Webpage Smlarty Measure for Web Sessons Clusterng Usng Sequence Algnment Mozhgan Azmpour-Kv School of Engneerng and Scence Sharf Unversty of Technology, Internatonal Campus Ksh Island, Iran mogan_az@ksh.sharf.edu

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

A Clustering Algorithm Solution to the Collaborative Filtering

A Clustering Algorithm Solution to the Collaborative Filtering Internatonal Journal of Scence Vol.4 No.8 017 ISSN: 1813-4890 A Clusterng Algorthm Soluton to the Collaboratve Flterng Yongl Yang 1, a, Fe Xue, b, Yongquan Ca 1, c Zhenhu Nng 1, d,* Hafeng Lu 3, e 1 Faculty

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio,

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio, Parallel and Dstrbuted Assocaton Rule Mnng - Dr. Guseppe D Fatta fatta@nf.un-konstanz.de San Vglo, 18-09-2004 1 Overvew Assocaton Rule Mnng (ARM) Apror algorthm Hgh Performance Parallel and Dstrbuted Computng

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

From Comparing Clusterings to Combining Clusterings

From Comparing Clusterings to Combining Clusterings Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (008 From Comparng Clusterngs to Combnng Clusterngs Zhwu Lu and Yuxn Peng and Janguo Xao Insttute of Computer Scence and Technology,

More information

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

An Improved Image Segmentation Algorithm Based on the Otsu Method

An Improved Image Segmentation Algorithm Based on the Otsu Method 3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2860-2866 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A selectve ensemble classfcaton method on mcroarray

More information

Wireless Sensor Network Localization Research

Wireless Sensor Network Localization Research Sensors & Transducers 014 by IFSA Publshng, S L http://wwwsensorsportalcom Wreless Sensor Network Localzaton Research Lang Xn School of Informaton Scence and Engneerng, Hunan Internatonal Economcs Unversty,

More information

Research Article A High-Order CFS Algorithm for Clustering Big Data

Research Article A High-Order CFS Algorithm for Clustering Big Data Moble Informaton Systems Volume 26, Artcle ID 435627, 8 pages http://dx.do.org/.55/26/435627 Research Artcle A Hgh-Order Algorthm for Clusterng Bg Data Fanyu Bu,,2 Zhku Chen, Peng L, Tong Tang, 3 andyngzhang

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Vectorization in the Polyhedral Model

Vectorization in the Polyhedral Model Vectorzaton n the Polyhedral Model Lous-Noël Pouchet pouchet@cse.oho-state.edu Dept. of Computer Scence and Engneerng, the Oho State Unversty October 200 888. Introducton: Overvew Vectorzaton: Detecton

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup

More information

Innovation Typology. Collaborative Authoritativeness. Focused Web Mining. Text and Data Mining In Innovation. Generational Models

Innovation Typology. Collaborative Authoritativeness. Focused Web Mining. Text and Data Mining In Innovation. Generational Models Text and Data Mnng In Innovaton Joseph Engler Innovaton Typology Generatonal Models 1. Lnear or Push (Baroque) 2. Pull (Romantc) 3. Cyclc (Classcal) 4. Strategc (New Age) 5. Collaboratve (Polyphonc) Collaboratve

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Professional competences training path for an e-commerce major, based on the ISM method

Professional competences training path for an e-commerce major, based on the ISM method World Transactons on Engneerng and Technology Educaton Vol.14, No.4, 2016 2016 WIETE Professonal competences tranng path for an e-commerce maor, based on the ISM method Ru Wang, Pn Peng, L-gang Lu & Lng

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Mining User Similarity Using Spatial-temporal Intersection

Mining User Similarity Using Spatial-temporal Intersection www.ijcsi.org 215 Mnng User Smlarty Usng Spatal-temporal Intersecton Ymn Wang 1, Rumn Hu 1, Wenhua Huang 1 and Jun Chen 1 1 Natonal Engneerng Research Center for Multmeda Software, School of Computer,

More information

A Resources Virtualization Approach Supporting Uniform Access to Heterogeneous Grid Resources 1

A Resources Virtualization Approach Supporting Uniform Access to Heterogeneous Grid Resources 1 A Resources Vrtualzaton Approach Supportng Unform Access to Heterogeneous Grd Resources 1 Cunhao Fang 1, Yaoxue Zhang 2, Song Cao 3 1 Tsnghua Natonal Labatory of Inforamaton Scence and Technology 2 Department

More information

Suppression for Luminance Difference of Stereo Image-Pair Based on Improved Histogram Equalization

Suppression for Luminance Difference of Stereo Image-Pair Based on Improved Histogram Equalization Suppresson for Lumnance Dfference of Stereo Image-Par Based on Improved Hstogram Equalzaton Zhao Llng,, Zheng Yuhu 3, Sun Quansen, Xa Deshen School of Computer Scence and Technology, NJUST, Nanjng, Chna.School

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Clustering is a discovery process in data mining.

Clustering is a discovery process in data mining. Cover Feature Chameleon: Herarchcal Clusterng Usng Dynamc Modelng Many advanced algorthms have dffculty dealng wth hghly varable clusters that do not follow a preconceved model. By basng ts selectons on

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Hierarchical Image Retrieval by Multi-Feature Fusion

Hierarchical Image Retrieval by Multi-Feature Fusion Preprnts (www.preprnts.org) NOT PEER-REVIEWED Posted: 26 Aprl 207 do:0.20944/preprnts20704.074.v Artcle Herarchcal Image Retreval by Mult- Fuson Xaojun Lu, Jaojuan Wang,Yngq Hou, Me Yang, Q Wang* and Xangde

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) Clusterng Algorthm Combnng CPSO wth K-Means Chunqn Gu, a, Qan Tao, b Department of Informaton Scence, Zhongka

More information

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem An Effcent Genetc Algorthm wth Fuzzy c-means Clusterng for Travelng Salesman Problem Jong-Won Yoon and Sung-Bae Cho Dept. of Computer Scence Yonse Unversty Seoul, Korea jwyoon@sclab.yonse.ac.r, sbcho@cs.yonse.ac.r

More information

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering 015 IEEE 17th Internatonal Conference on Hgh Performance Computng and Communcatons (HPCC), 015 IEEE 7th Internatonal Symposum on Cyberspace Safety and Securty (CSS), and 015 IEEE 1th Internatonal Conf

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 2 Sofa 2016 Prnt ISSN: 1311-9702; Onlne ISSN: 1314-4081 DOI: 10.1515/cat-2016-0017 Hybrdzaton of Expectaton-Maxmzaton

More information

A Compressing Method for Genome Sequence Cluster using Sequence Alignment

A Compressing Method for Genome Sequence Cluster using Sequence Alignment A Compressng Method for Genome Sequence Cluster usng Sequence Algnment Kwang Su Jung 1, Nam Hee Yu 1, Seung Jung Shn 2, Keun Ho Ryu 1 1 Database/Bonformatcs Laboratory, Chungbuk Natonal Unversty, Korea

More information

Chinese Word Segmentation based on the Improved Particle Swarm Optimization Neural Networks

Chinese Word Segmentation based on the Improved Particle Swarm Optimization Neural Networks Chnese Word Segmentaton based on the Improved Partcle Swarm Optmzaton Neural Networks Ja He Computatonal Intellgence Laboratory School of Computer Scence and Engneerng, UESTC Chengdu, Chna Department of

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Remote Sensing Image Retrieval Algorithm based on MapReduce and Characteristic Information

Remote Sensing Image Retrieval Algorithm based on MapReduce and Characteristic Information Remote Sensng Image Retreval Algorthm based on MapReduce and Characterstc Informaton Zhang Meng 1, 1 Computer School, Wuhan Unversty Hube, Wuhan430097 Informaton Center, Wuhan Unversty Hube, Wuhan430097

More information

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks 2017 2nd Internatonal Semnar on Appled Physcs, Optoelectroncs and Photoncs (APOP 2017) ISBN: 978-1-60595-522-3 FAHP and Modfed GRA Based Network Selecton n Heterogeneous Wreless Networks Xaohan DU, Zhqng

More information

Conditional Speculative Decimal Addition*

Conditional Speculative Decimal Addition* Condtonal Speculatve Decmal Addton Alvaro Vazquez and Elsardo Antelo Dep. of Electronc and Computer Engneerng Unv. of Santago de Compostela, Span Ths work was supported n part by Xunta de Galca under grant

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Research on Categorization of Animation Effect Based on Data Mining

Research on Categorization of Animation Effect Based on Data Mining MATEC Web of Conferences 22, 0102 0 ( 2015) DOI: 10.1051/ matecconf/ 2015220102 0 C Owned by the authors, publshed by EDP Scences, 2015 Research on Categorzaton of Anmaton Effect Based on Data Mnng Na

More information