Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article

Size: px
Start display at page:

Download "Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article"

Transcription

1 Avalable onlne Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6): Research Artcle ISSN : CODEN(USA) : JCPRC5 Communty detecton model based on ncremental EM clusterng method Qu L-qng, Lang Yong-quan and Chen Zhuo-yan College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology, Qngdao, Shandong, Chna ABSTRACT Networks are wdely used n a varety of dfferent felds and attract more and more researchers. Communty detecton, one of the research hotspots, can dentfy salent structure and relatons among ndvduals from the networks. Many dfferent solutons have been put forward to detect communtes. EM as a model on statstcal nference methods has receved more attenton because of ts smple and effcent structure. Unlke many other statstcal nference methods, no extra nformaton s assumed except for the network tself and the number of groups for the EM approach. However, practcal usefulness of the EM method s often lmted by computatonal neffcency. The EM method makes a pass through all of the avalable data n every teraton. Thus, f the sze of the networks s large, every teraton can be computatonally ntensve. Therefore we put forward an ncremental EM method-iem for communty detecton. IEM uses the machnery of probablstc mxture models and the ncremental EM algorthm to generalze a feasble model ft the observed network wthout pror knowledge except the networks and the number of groups. Usng only a subset rather than the entre networks allows for sgnfcant computatonal mprovements snce many fewer data ponts need to be evaluated n every teraton. We also argue that one can select the subsets ntellgently by appealng to EM s hghly-apprecated lkelhood judgment condton and ncrement factor. We perform some expermental studes, on several datasets, to demonstrate that our IEM can detect communtes correctly and prove to be effcent. Key words: Communty Detecton, Expectaton Maxmzaton, Incremental Expectaton Maxmzaton INTRODUCTION As a new emergng dscplne, research on networks attracts researchers from a varety of dfferent felds. In fact, studes that can qualtatvely and quanttatvely characterze networks wll help to unvel the general laws regulatng dfferent real systems modeled by networks, and therefore wll be relevant n a number of dscplnes (bology, socal scences, et al).communty structure s one of the crucal structural characterstcs of networks; therefore, accurately analyzng ther communty structure represents a very relevant topc [1-6]. Communtes are groups of nodes wth a hgh level of group nter-connecton [1]. They can be seen as relatve solated subgroups wth few contacts wth the rest of the network. Communty detecton can dentfy salent structure and relatons among ndvduals from the network. Researchers put forward many dfferent methods, whch are manly used to detect the groups wth dense connectons wthn groups but sparser connectons between them. To detect more latent structures n realty networks, varous models on statstcal nference have been proposed recently, whch are on sound theoretcal prncples and have better performances dentfyng structures, and have become the state-of-the-art models [7-10]. The models am s to defne a generatve process to ft the observed network, and transfer the communty detectng problem to Bayesan nference or Maxmum Lkelhood methods [11-14]. The drawback, shared wth many other methods, s that structure detecton usually mples computatonal expensve exploraton of the solutons maxmzng the posteror probablty of the lkelhood. More recently, a maxmum lkelhood method that consders model clusterng as mssng nformaton and deals wth t usng an teratve 2512

2 Expectaton Maxmzaton (EM) method has been ntroduced by Newman and Leche [2]. The EM method s a smple algorthm that s capable of detectng a broad range of structural sgnatures n networks, ncludng conventonal communty structure, bpartte of dsassortatve structure, fuzzy or overlappng classfcatons, and many mxed or hybrd structural forms that have not been consdered explctly n the past. Due to the smple structure of the EM method, there has been a growng body of work on the analyss of the EM algorthm [3-5]. Many mprovements have been put forward to better the EM method snce then. However, a common weakness n these studes, as we wll dscuss n detal n related work, s that the EM method wll be low effcent when the networks are large-scaled. In fact, the EM method may make sense when the networks are small-scaled or medum-scaled. On the contrary, more often than not, real-world networks are large-scaled. Under such scenaros, f an algorthm lke teratve EM method evaluates all samples at each step, t may results n hgh complcty and low effcency. Therefore, we argue that a more approprate approach s to mprove the EM method n order to reduce samples at each step. Consequently, we propose an ncremental EM algorthm on the sample subset that s converge to optmal solutons usng the proposed formulatons. We prove the correctness and convergence of our algorthm and show that ths algorthm has low tme complexty when the data of the networks s large-scaled. The rest of the paper s organzed as follows: n Secton 2 we dscuss related work and the EM method s formally ntroduced n Secton 3. Next, we descrbe n Secton 4 our generalzaton of ncremental EM method of communty detecton. In Secton 5, we provde expermental studes. Fnally n Secton 6, we gve the concluson and future drectons. RELATED WORKS Communty structure has been extensvely studed n varous research areas such as socal network analyss, Web communty analyss, computer vson, et al. In network analyss, an mportant research topc s to dentfy cohesve subgroups of ndvduals wthn a network where cohesve subgroups are defned as communty detecton. Recently there exsts a growng body of lterature on communty detecton. Many approaches, such as clque-based, degree-based, and matrx-perturbaton-based, have been proposed to extract cohesve subgroups from network. The approach of communty detecton can be characterzed as heurstc measure methods and statstcal nference methods accordng to the bass of object functon. Heurstc measure methods such as modularty maxmzaton [6] and extreme optmzaton [7] use a heurstc metrc to measure communty structure and lack of rgorous theoretcal bass. Statstcal nference methods such as planted partton model [8] and the EM method [2] can dentfy the structure of structural equvalence and regular equvalence, and classfy the vertces of the networks usng the observed networks ft by a generatve process. Statstcal nference methods have perfect theoretcal bass whch s dfferent from heurstc measure methods, and have become the state-of-the-art methods. Statstcal nference methods have the advantage that, dependng on the statstcal model used, they can be very general detectng both structural equvalent and regular equvalent set of ndvduals. Consequently, more lteratures have been proposed on statstcal nference methods. The EM approach as a model on statstcal nference methods has receved more attenton because of ts smple and effcent structure. Unlke many other statstcal nference methods, no extra nformaton s assumed except for the network tself and the number of groups for the EM approach. Contrast to tradtonal communty detecton methods, the EM approach s capable of detectng dsassortatve structure as well as overlappng classfcatons. There are some recent studes on the EM method for communty detecton. The EM approach to communty detecton s frst ntroduced by Newman et al [2]. We wll denote t by the acronym NL-EM from now on. They use the machnery of probablstc mxture models and the EM algorthm to generalze a feasble model ft the observed network wthout pror knowledge except the networks and the number of groups. They also gve a number of examples demonstratng how the method can be used to shed lght on the propertes of real-world networks. In ther model, ther parameter defnton mples that the classfcaton must be such that each class has at least one member wth non-zero out-degree. The constrant forces NL-EM to classfy a smple b-partte graph. Based on the dea, Ramasco et al. [3] generalze an extenson of NL-EM, n whch they extend the parameter θ.the examples show both numercally and analytcally that the new generalzed EM method s able to recover the process that led to the generaton of content-based networks. Muggan et al. [4] use NL-EM to yeld a stablty analyss the groupng that quantfes the extent to whch each node nfluences ts neghbor group membershp. All these studes, however, have a common weak pont-the EM method s usually low effcent and hgh complexty when the networks are large-scaled. That s, when the EM method s used to detect communtes n the networks, t evaluates all samples n every teraton whch may result n low convergence rate and bad clusterng effect. In contrast, n our proposed method, we study an ncremental EM method on the sample subset nstead of whole samples whch prove hgher effcent. 2513

3 NL- EM METHOD NL-EM s capable of detectng networks structure relyng on followng basc assumptons: (1) The actual connectvty of the networks s related to a pror unknown groupng of the ndvduals; (2) The presence or absence of a lnk s ndependent from the other lnks of the networks. We begn wth a quck summary of NL-EM as appled to graphs. Gven a graph G of N nodes and an A j adjacency matrx, NL-EM method searches for a partton of the nodes nto K groups such that a certan log-lkelhood functon for the graph s maxmzed. Henceforth we wll refer to the groups nto whch NL-EM dvdes the nodes, as classes. There are three varables as follows n NL-EM: r,the probablty that a randomly selected node s n group r θ ; rj,the probablty that an edge leavng group r connects to a certan node j q ; r,the probablty that node s assgned to group r θ.the parameters r and rj satsfy the normalzaton condtons: K = 1, θ = 1 N r r= 1 = 1 r (1) Assumng that the parameters and θ are gven, the probablty Pr( A, g, θ ) under a node classfcaton g, such that g of realzng the gven graph s the group that node has been assgned to, can be wrtten as: A, j Pr( A, g, θ ) = g,, θ g j j (2) Pr( A, g, θ ) nstead: s the lkelhood to be maxmzed, but t turns out to be more convenent to consder ts logarthm L(, θ ) = ln g + A ln,. j θg j j (3) Treatng a pror unknown group assgnment averaged log-lkelhood constructed as: L(, θ ) = qr ln r + Aj ln θrj. r j The fnal results are 1 r = N q r A q g of the nodes as statstcal mssng nformaton, one consders the, (5) j r θ rj =, k qr (6) qr k s the total degree of node. The stll unknown probabltes Where notng that: Pr( A, g = r, θ ) qr = Pr( g = r A,, θ ) =, Pr( A, θ ) (4) are then determned a posteror by (7) 2514

4 From whch one obtans: q r = Aj r θ j rj. Aj s s θ j rj (8) Equaton (5), (6), and (8) form a set of self consstent equatons for expected log-lkelhood must satsfy. r, θ rj and qr that any extreme of the Thus, gven a graph G, the EM algorthm conssts of pckng a number of classes K nto whch the nodes are to be classfed and searchng for solutons of Equaton (5), (6), and (8). These equatons are derved by Newman et al. θrj q [2]. They also show that when appled to dverse type of networks the resultng, and r yeld useful nformaton about the nternal structure of the network. Note that only a mnmal amount of a pror nformaton s suppled: the number of classes K and the networks. INCREMENTAL EM METHOD Despte the EM method s wde-spread popularty, practcal usefulness of the EM method s often lmted by computatonal neffcency. The EM method makes a pass through all of the avalable data n every teraton. Thus, f the sze of the networks s large, every teraton can be computatonally ntensve. We ntroduce an ncremental EM algorthm for fast computaton based on random sub-samplng whch s denoted by the acronym IEM from now on. Usng only a subset rather than the entre networks allows for sgnfcant computatonal mprovements snce many fewer data ponts need to be evaluated n every teraton. We also argue that one can select the subsets ntellgently by appealng to EM s hghly-apprecated lkelhood judgment condton and ncrement factor. Gven a graph G of N nodes, we frst select M ( M N ) nodes as the ntal sample subset, and then we wll tran the ntal subset by usng NL-EM method. After the tranng, we wll add d ( d N M ) nodes from the remanng samples to the ntal subsets, and then we wll tran the new formed subsets. The smlar teratve operaton s repeated untl the subset s dentcal to the entre samples. The quanttes n our theory thus fall nto three classes: (1) How to defne parameter M? In other words, how many nodes should be frst chosen as the ntal subsets? (2) How to defne parameter d? That s to say, how many nodes should be complemented after last tranng? (3)When wll d nodes be added to the subset? Namely what condtons should be satsfed when the subset changes? We wll gve some reasonable solutons as followed The defnton of parameter M Parameter M means the number of nodes n the ntal subset. The ntal subset selecton s an mportant part of IEM whch has a great nfluence on the results. Our goal s to select some nodes as the ntal subset whch s most representatve of the entre data, and therefore the selected subset can well descrbe the global features. There s a popular vew n network analyss that the mportant nodes are most representatve of the entre networks. Consequently we wll select the mportant nodes of the whole networks. Centralty analyss provdes answers wth measures that defne the mportance of nodes. There are many classcal and commonly methods used ones [9]: degree centralty, closeness centralty, and betweenness centralty. These centralty measures capture the mportance of nodes n dfferent perspectves. Wth large-scale networks, the computaton of centralty measures can be expensve except for degree centralty. We defne n to be the number of the nodes and m to be the number of edges between nodes. Then we can get tme complexty and space complexty about the centralty measures. Closeness centralty, for nstance, nvolves the computaton of all the parwse shortest paths, wth tme complexty of O( n ) and space complexty of O( n ) wth the Floyd-Warshall algorthm [10] or O( n log n + nm) tme O( nm) complexty wth Johnson s algorthm [11]. The betweenness centralty requres computatonal tme followng [12]. For large-scale networks, effcent computaton of centralty measures s crtcal and requres further research. We propose a new method of measurng the centralty whch s a compromse between complexty and effcency. Now we study degree centralty whch s the smplest measures. For degree centralty, the mportance of a node s determned by the number of nodes adjacent to t. The larger the degree of one node, the more mportant the node s. The degree centralty of node v s defned as: 2515

5 C ( v) = d / ( n 1) D (9) d where network. s defned as the number of nodes adjacent to v, and n s defned as the number of nodes n the However, the measure s not comprehensve enough,.e., some mportant nodes (.e., brdge contacts connect wth merely two edges) don t have hgh degree centralty. Based on the dea, we argue that the mportance of one node s determned by ts connecton model as well as ts role n the networks. Accordngly we consder two factors, namely the connecton model of the node and ts role n the network. The connecton model of one node can be descrbed by ts degree centralty, and the role of one node can be descrbed by ts coheson centralty. Defnton1: The connectvty of node v s defned as the number of the edges between v and the nodes drectly connected wth v. The connectvty of a node measures how close t s to the nodes whch are drectly connected wth t, and reflects C ( )( ( ) 1) / 2 the local connecton property of the node. Obvously, the span of connectvty s between 0 and D v CD v. Defnton 2: The coheson centralty of node v s defned as follows: CD ( v)( CD ( v) 1) Cc ( v) = 2c (10) Where C ( ) D v s the degree centralty of node v, and c s the connectvty of node v. Accordng to the relatons between the nodes and the edges n the network, the value of condtons: Cc ( v) 1 Cc ( v) satsfes the (11) We fnd that the larger the connectvty of one node, the less mportant the node s. Ths s because the deleton of the node wth larger connectvty wll make less affecton on the network. Thus accordng to equaton (10), the more mportance one node s, the larger the coheson centralty of the node s. Therefore, the coheson centralty s the postve evaluaton ndex of the node. To ntegrate the two factors (.e., connecton model of one node and ts role n networks), an mportance functon s ntroduced to measure the mportance of the node, where the mportance conssts of two parts-a degree centralty and a coheson centralty: I( v) = α C ( v) + (1 α) C ( v) D c (12) where α satsfes 0 α 1. C ( ) D v In ths mportance functon, the degree centralty C ( ) coheson centralty c v of emphass on each part of the total mportance functon. measures the connecton model of node v, and the measures the role of node v. The parameter α s set by the user to control the level Thus accordng to equaton (12) we can select mportant nodes wth hgh value as the ntal subset. The defnton of parameter d Parameter d means the number of nodes whch wll be complemented to the subset n every teraton. The defnton of parameter d s another crucal queston n IEM. The parameter should make the subset ft the real model as much as possble. Here we propose the concept of ncremental factor to descrbe parameter d based on 2516

6 nformaton entropy. Accordng to nformaton theory, the entropy measures the uncertanty of the system. The larger the entropy s, the more uncertan the system s. If the densty functon values of every node n the subset are approxmately equal, the uncertanty of the dstrbuton for the entre data s largest (.e., the subset has maxmum entropy). Conversely, f the densty functon values of every node are very asymmetrc, the subset has mnmal uncertanty. Therefore, we ntroduce the concept of densty entropy to measure ncremental factor. D = { x Defnton 3: Gven nodes set 1, x2, L, x N } whch has N nodes, the densty functon value of every node s f ( x ), 1,2,, = L N, and δ s the sample varance, then the densty entropy s defned as follows: N f ( x ) f ( x ) DenEn( δ ) = ln Sum Sum = 1 (13) where Sum s the normalzed factor defned as follows: Sum N = = 1 f ( x ) (14) The densty entropy has two propertes: Property 1: 0 DemEn( δ ) ln( N) ; Property 2: DemEn( δ ) = ln( N) f ( x when and only when 1) = f ( x2 ) = L = f ( x N ), therefore lm DenEn( δ ) = ln( N ) = max( DenEn( δ )) δ 0 δ. From Property 2, when DenEn( δ ) = ln( N), the nodes n the subset are consstent wth the real dstrbuton whch s deal case. Wth the ncrease of δ, the value of DenEn( δ ) decreases whch wll reach a mnmum subsequently, DenEn( δ ) and then the value of wll become larger whch wll reach a maxmum ln( N) when δ 0 and f ( x1 ) = f ( x2) = L = f ( x N ). The change of the sample varance δ s smlar to the densty entropy, and we take nto account the mddle value DenEn( δ ) of. Consequently, we propose ncremental factor β as: β = DenEn( δ ) / 2 = ln( N) / 2 (15) Accordng to equaton (15), the parameter d can be descrbed as follows: d = N / β Once parameter d s determned, the teratve process of IEM can be carred out as follows: when the samples n the subset ft the real networks, d nodes are added to the subset, and then new fttng process goes on. The ncremental process wll end untl the subset s equal to the entre data. In the process, the subset gradually approaches the entre data. It s worth mentonng that the complement nodes n every teraton are selected from the entre nodes. Hence the number of nodes n subset turns out to be: (16) M = M + d = M + N / β The condtons when the subsets changes From Secton 3, we can see that the EM method s an teratve procedure for Maxmzng (17) L(, θ ) whch we 2517

7 descrbe n Equaton (4). Assume that after the and that: th n teraton the current estmate for and θ s gven by θ n. Snce the objectve s maxmze L(, θ ), we wsh to computer an updated estmate n and θ such L(, θ ) > L(, θ ) n n (18) Equaton (18) means the teraton condton of the EM method, namely f the update lkelhood s not more than the current lkelhood then the teraton wll end. Inspred by Equaton (18), we propose the teraton condton of IEM. We defne th lkelhood after t L( t+ teraton and 1)(, θ ) as the maxmum lkelhood after ( condton can be defned as: t + Lt (, θ ) as the maxmum 1)th teraton. The teraton L( t+ 1)(, θ ) > Lt (, θ ) (19) Equvalently we want to maxmze the dfference: D = L( t+ 1)(, θ ) Lt (, θ ) Assume γ s a postve number whch s small enough, then f D γ γ (20) >, we argue that the current estmate s th t teraton s ft to the model of the real undesrable and the teraton should go on. If D, then the subset n data, and the new samples should be complemented to carry out the next teraton. When the subset s equal to the entre data, the termnate condton s consstent to the EM method. The determnng of α How to determne the α n Equaton (12) s a challengng ssue. When the ground truth s avalable, standard valdaton procedures can be used to select an optmalα. However, n many cases there s no ground truth and the communty detecton performance depends on the user s subjectve performance. In the respect, through the parameter α, our IEM provde the user a mechansm to push the communty detecton results toward hs or her preferred outcomes. The problems of whether a correct α exsts and how to automatcally fnd the best α when there s no ground truth are beyond the scope of ths paper. To smply the experments, we wll set α as 0.5 n the followng example applcatons. EXAMPLE APPLICATIONS In ths secton, we use several synthetc dataset to study the performance of our IEM from dfferent aspects. In secton 5.1 we wll frst verfy the correctness of IEM, and then we wll compare our IEM wth baselne algorthm-em n secton 5.2. Frst Example Applcaton We start wth the frst synthetc dataset, whch s a statc network, to llustrate some good propertes of our IEM. Ths dataset s frst studed by Whte and Smyth[13] and s shown n Fgure 1(a). The network contans 15 nodes whch roughly form 3 communtes-c1, C2, and C3-where edges tend to occur between nodes n the same communty. We frst apply our algorthm to the network wth varous communty numbers m and the resultng Q values are plotted n Fgure1 (b). Q values can be nterpreted as modularty values whch s a measure of the devaton between the observed edge-cluster probabltes and what one would predct under an ndependence model. Newman etc. [14] show that larger Q values are correlated wth better graph clusterng. In Fgure 1(b) we also show the ' Q that are reported by Whte and Smyth. As can be seen from Fgure1 (b), both Q and modularty values show dstnct peaks when m = 3, whch corresponds to the correct communty number. Our IEM algorthm gets hgher modularty values whch ndcate that IEM can classfy the network better. ' Q 2518

8 Next, after our IEM algorthm correctly parttons the nodes nto three communtes, we llustrate the soft communty membershp by studyng two communtes-c2 and C3. In Fgure 1(b) we use trangle shape to represent the nodes n C2, and crcle shape to represent the nodes n C3. But we use dfferent gray levels to ndcate ther communty membershp-we use whte color to llustrate the level that a node belongs to C2 and dark color to show the level that a node belongs to C3. As can be seen, whle the nodes whch are whte or black have very clear communty membershps, the nodes whch are on the boundary between C2 and C3, have rather fuzzy membershp. The shallower the nodes are, the more lkely the nodes belong to C2, Centrally, the deeper the nodes are, the more lkely the nodes belong to C3.In other words, our IEM algorthm s capable of assgnng meanngful soft membershp to a node to ndcate to whch level the node belongs to a certan communty. (a) Q Fgure1. Frst example applcaton: (a) applcatons of IEM method, (b) Modularty value and Modularty value dfferent communty numbers (b) ' Q under Second Example Applcaton We secondly apply our IEM algorthm to a small network- Karate club network [14]. The network contans 34 nodes whch roughly form 2 communtes-c1 and C2. The network s of partcular nterest because the club splt n two durng the course of Zachary s observatons as a result of an nternal dspute and Zachary recorded the membershp of the two factons after the splt. Fgure 2 shows the result of our IEM algorthm whch the number of clusters s set to 2. We use dfferent gray levels to ndcate ther communty membershp as frst example applcaton. In Fgure 2 we use crcle shape and rectangle shape to represent the nodes n C1 and C2 respectvely. But we use dfferent gray levels to ndcate ther communty membershp-we use whte color to llustrate the level that a node belongs to C1 and dark color to show the level that a node belongs to C2. As can be seen, node 9, 3, 14, 20 are on the boundary between C1 and C2, whch have rather fuzzy membershp. Fgure2. Second example applcaton: applcatons of IEM method Next, after our IEM algorthm correctly parttons the nodes nto two communtes, we compare our IEM wth baselne algorthm NL-EM. The compared result s shown as Table 1. As we can see from Table 1, under the same computng envronment the tme of IEM need only second, whch s much less than t of NL-EM; the teratons of IEM s only 35, whch s greatly less than t of NL-EM. From the table we have the followng observatons. On the dataset, among the two algorthms (NL-EM and IEM), IEM outperforms NL-EM. In other words, our IEM can reach the neghborhood faster than NL-EM, and s hgh effcent because of fast convergence rate. Table 1.The comparson between NL-EM and IEM Dataset Parameter NL-EM IEM Karate club Tme(s) Iteratons Lkelhood Estmate

9 CONCLUSION Communty detecton s a challengng research problem wth broad applcatons. In ths paper we have descrbed an ncremental EM method-iem for communty detecton. IEM uses the machnery of probablstc mxture models and the ncremental EM algorthm to generalze a feasble model ft the observed network wthout pror knowledge except the networks and the number of groups. The method s more effcent than prevous NL-EM, makng use of a new ncremental approach whch s more close to the optmal solutons. We use only a subset rather than the entre networks allows for sgnfcant computatonal mprovements snce many fewer data ponts need to be evaluated n every teraton. We also argue that one can select the subsets ntellgently by appealng to EM s hghly-apprecated lkelhood-judgment condton and ncrement factor. We have demonstrated the method wth applcatons to some smple examples, ncludng computer-generated and real-world networks. The method s strength s ts effcency whch leads to hgh convergence rate and good clusterng effect. As part of future work, we plan to extend our framework n two drectons. Frst, our current model only appled on statc networks where no temporal analyss s used for evoluton study. We are usng our model n dynamc networks to detect communtes. Second, so far we only consdered the lnk nformaton. In many applcatons, the content nformaton s also very mportant. We are nvestgatng how to ncorporate content nformaton nto our model. REFERENCES [1]Grvan M and Newman MEJ. Proceedngs of the Natonal Academy of Scences, 2002, 99(12), [2]Newman MEJ and Lecht EA. Proceedngs of the Natonal Academy of Scences, 2007, 104(23), [3] Ramasco JJ and Mungan M. Physcs and Socety E, 2008, 77(3), [4] Mungan M and Ramasco JJ. Journal of Statstcal Mechancs: Theory and Experment, 2010, 4, [5]Vazquez A. Populatons and Evoluton, 2008, 77(6), [6] Krkpatrck S, Gelatt CD, Vecch MP. Optmzaton by smulated annealng, Scence, 1983, 220(4598), [7] Duch J and Arenas A. Physcal Revew E, 2005, 72(2), [8] Condon A and Karp RM. Random structures and algorthms, 2001, 18(2), [9] Zhang B.; Zhang S.; Lu G.. Journal of Chemcal and Pharmaceutcal Research, 2013, 5(9), [10] Zhang B.; Internatonal Journal of Appled Mathematcs and Statstcs, 2013, 44(14), [11] Zhang B.; Yue H.. Internatonal Journal of Appled Mathematcs and Statstcs, 2013, 40(10), [12] Zhang B.; Feng Y.. Internatonal Journal of Appled Mathematcs and Statstcs, 2013, 40(10), [13] Bng Zhang. Journal of Chemcal and Pharmaceutcal Research, 2014, 5(2), [14] Bng Zhang; Zhang S.; Lu G.. Journal of Chemcal and Pharmaceutcal Research, 2013, 5(9),

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

An Improved Image Segmentation Algorithm Based on the Otsu Method

An Improved Image Segmentation Algorithm Based on the Otsu Method 3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,

More information

Three supervised learning methods on pen digits character recognition dataset

Three supervised learning methods on pen digits character recognition dataset Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

CHAPTER 2 DECOMPOSITION OF GRAPHS

CHAPTER 2 DECOMPOSITION OF GRAPHS CHAPTER DECOMPOSITION OF GRAPHS. INTRODUCTION A graph H s called a Supersubdvson of a graph G f H s obtaned from G by replacng every edge uv of G by a bpartte graph,m (m may vary for each edge by dentfyng

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments Comparson of Heurstcs for Schedulng Independent Tasks on Heterogeneous Dstrbuted Envronments Hesam Izakan¹, Ath Abraham², Senor Member, IEEE, Václav Snášel³ ¹ Islamc Azad Unversty, Ramsar Branch, Ramsar,

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT Bran J. Wolf, Joseph L. Hammond, and Harlan B. Russell Dept. of Electrcal and Computer Engneerng, Clemson Unversty,

More information

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance

Hybridization of Expectation-Maximization and K-Means Algorithms for Better Clustering Performance BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 2 Sofa 2016 Prnt ISSN: 1311-9702; Onlne ISSN: 1314-4081 DOI: 10.1515/cat-2016-0017 Hybrdzaton of Expectaton-Maxmzaton

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT 3. - 5. 5., Brno, Czech Republc, EU APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT Abstract Josef TOŠENOVSKÝ ) Lenka MONSPORTOVÁ ) Flp TOŠENOVSKÝ

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

A new segmentation algorithm for medical volume image based on K-means clustering

A new segmentation algorithm for medical volume image based on K-means clustering Avalable onlne www.jocpr.com Journal of Chemcal and harmaceutcal Research, 2013, 5(12):113-117 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCRC5 A new segmentaton algorthm for medcal volume mage based

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks 2017 2nd Internatonal Semnar on Appled Physcs, Optoelectroncs and Photoncs (APOP 2017) ISBN: 978-1-60595-522-3 FAHP and Modfed GRA Based Network Selecton n Heterogeneous Wreless Networks Xaohan DU, Zhqng

More information

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research Schedulng Remote Access to Scentfc Instruments n Cybernfrastructure for Educaton and Research Je Yn 1, Junwe Cao 2,3,*, Yuexuan Wang 4, Lanchen Lu 1,3 and Cheng Wu 1,3 1 Natonal CIMS Engneerng and Research

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

EXTENDED BIC CRITERION FOR MODEL SELECTION

EXTENDED BIC CRITERION FOR MODEL SELECTION IDIAP RESEARCH REPORT EXTEDED BIC CRITERIO FOR ODEL SELECTIO Itshak Lapdot Andrew orrs IDIAP-RR-0-4 Dalle olle Insttute for Perceptual Artfcal Intellgence P.O.Box 59 artgny Valas Swtzerland phone +4 7

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Parameter estimation for incomplete bivariate longitudinal data in clinical trials

Parameter estimation for incomplete bivariate longitudinal data in clinical trials Parameter estmaton for ncomplete bvarate longtudnal data n clncal trals Naum M. Khutoryansky Novo Nordsk Pharmaceutcals, Inc., Prnceton, NJ ABSTRACT Bvarate models are useful when analyzng longtudnal data

More information

Structure Formation of Social Network

Structure Formation of Social Network Structure Formaton of Socal Network DU Nan 1, FENG Hu 2, HUANG Zgang 3, Sally MAKI 4, WANG Ru(Ruby) 5, and ZHAO Hongxa (Melssa) 6 1 Bejng Unversty of Posts and Telecommuncatons, Chna 2 Fudan Unversty,

More information

A Topology-aware Random Walk

A Topology-aware Random Walk A Topology-aware Random Walk Inkwan Yu, Rchard Newman Dept. of CISE, Unversty of Florda, Ganesvlle, Florda, USA Abstract When a graph can be decomposed nto clusters of well connected subgraphs, t s possble

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,

More information

5 The Primal-Dual Method

5 The Primal-Dual Method 5 The Prmal-Dual Method Orgnally desgned as a method for solvng lnear programs, where t reduces weghted optmzaton problems to smpler combnatoral ones, the prmal-dual method (PDM) has receved much attenton

More information

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms 3. Fndng Determnstc Soluton from Underdetermned Equaton: Large-Scale Performance Modelng by Least Angle Regresson Xn L ECE Department, Carnege Mellon Unversty Forbs Avenue, Pttsburgh, PA 3 xnl@ece.cmu.edu

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay, Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment

Resource and Virtual Function Status Monitoring in Network Function Virtualization Environment Journal of Physcs: Conference Seres PAPER OPEN ACCESS Resource and Vrtual Functon Status Montorng n Network Functon Vrtualzaton Envronment To cte ths artcle: MS Ha et al 2018 J. Phys.: Conf. Ser. 1087

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe CSCI 104 Sortng Algorthms Mark Redekopp Davd Kempe Algorthm Effcency SORTING 2 Sortng If we have an unordered lst, sequental search becomes our only choce If we wll perform a lot of searches t may be benefcal

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

A Statistical Model Selection Strategy Applied to Neural Networks

A Statistical Model Selection Strategy Applied to Neural Networks A Statstcal Model Selecton Strategy Appled to Neural Networks Joaquín Pzarro Elsa Guerrero Pedro L. Galndo joaqun.pzarro@uca.es elsa.guerrero@uca.es pedro.galndo@uca.es Dpto Lenguajes y Sstemas Informátcos

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Intra-Parametric Analysis of a Fuzzy MOLP

Intra-Parametric Analysis of a Fuzzy MOLP Intra-Parametrc Analyss of a Fuzzy MOLP a MIAO-LING WANG a Department of Industral Engneerng and Management a Mnghsn Insttute of Technology and Hsnchu Tawan, ROC b HSIAO-FAN WANG b Insttute of Industral

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES

REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES REFRACTIVE INDEX SELECTION FOR POWDER MIXTURES Laser dffracton s one of the most wdely used methods for partcle sze analyss of mcron and submcron sze powders and dspersons. It s quck and easy and provdes

More information

F Geometric Mean Graphs

F Geometric Mean Graphs Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 2 (December 2015), pp. 937-952 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) F Geometrc Mean Graphs A.

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information