Predicting Transcription Factor Binding Sites with an Ensemble of Hidden Markov Models

Size: px
Start display at page:

Download "Predicting Transcription Factor Binding Sites with an Ensemble of Hidden Markov Models"

Transcription

1 Vol. 3, No. 1, Fall, 2016, pp ISSN X (prnt), (onlne), All Rghts Reserved Predctng Transcrpton Factor Bndng Stes wth an Ensemble of Hdden Markov Models Yngle Song 1 and Albert Y. Ch 2 1 School of Computer Scence and Engneerng Jangsu Unversty of Scence and Technology Zhenjang, Jangsu , Chna Emal: ynglesong@gmal.com 2 Department of Mathematcs and Computer Scence Unversty of Maryland Eastern Shore Prncess Anne, MD 21853, USA Emal: albertchsquare@gmal.com Abstract Transcrpton Factor Bndng Stes (TFBS) are mportant for a number of bologcal processes such as gene expresson and regulaton. One fundamental problem n bonformatcs s to develop software tools that can dentfy TFBSs accurately and rapdly. In practce, exhaustve search of all possble combnatons of subsequences s tme consumng and thus cannot be appled. A large number of heurstc or approxmaton algorthms and machne learnng based approaches have been developed for ths problem. However, none of them have acheved satsfactory predcton accuracy. In ths paper, we develop a novel approach that can effcently explore the space of all possble locatons of TFBSs n a set of sequences wth hgh accuracy. The exploraton s carred out wth an ensemble of a few Hdden Markov Models (HMM). The ensemble s ntally constructed through local algnments of two sequences n the set, each HMM n the ensemble s then progressvely algned to other sequences n the set. The parameters of the HMMs n the ensemble are updated based on the algnment results. Our expermental results showed that ths approach can acheve hgher accuracy wth satsfyng effcency than exstng state-of-art approaches. Keywords: Hdden Markov Model (HMM); Motf fndng; Transcrpton factor bndng ste; ensemble approach 1. INTRODUCTION Transcrpton Factor Bndng Stes (TFBS) are subsequences found n the upstream regon of genes n DNA genomes. A transcrpton factor, whch s a specalzed proten molecule, may bnd to the nucleotdes n the subsequences and thus may affect some relevant bologcal processes. Research n molecular bology has revealed that transcrpton factor bndng stes are mportant for many bologcal processes, ncludng gene expresson and regulaton. Thus, an accurate dentfcaton of TFBSs s mportant for understandng the bologcal mechansm of gene expresson and regulaton. Expermental methods have been avalable for the task [6, 7]. However, most of them are tme consumng and expensve. Moreover, as the amount of newly sequenced data grows explosvely, the low throughput of expermental methods have become an mportant bottleneck for rapd processng of these data. Computatonal methods thus have become an mportant alternatve approach to rapd dentfcaton of TFBSs. Snce TFBSs for the same transcrpton factor have smlar sequence content n homologous sequences, the most often used computatonal approaches make the predcton by analyzng a set

2 2 of homologous sequences and dentfyng subsequences that are smlar n content. The locatons of a TFBS may vary n dfferent homologous sequences. To determne the locaton of a TFBS n each sequence, we need to evaluate all possble startng locatons among all sequences to fnd the optmal soluton. The total number of combnatons of subsequences that need to be examned s exponental and exhaustvely enumeratng all of them s obvously mpractcal when the number or the lengths of the sequences are large. To avod exhaustve search, a large number of heurstcs have been developed to reduce the sze of the search space, such as Gbbs samplng based approaches AlgnACE [15], BoProspector [12], Gbbs Motf sampler [11], expectaton maxmzaton based models [1, 2], greedy approaches such as Consensus [8], and genetc algorthm based approaches such as FMGA [10] and MDGA [4]. Of all these approaches and software tools, Gbbs samplng s a stochastc approach. It randomly selects a canddate motf of a fxed length from each sequence. It then pcks a sequence and uses each substrng of the same length n the sequence to replace the correspondng pre-selected motf for the sequence and computes the probablty. The approach randomly selects a substrng based on the dstrbuton of the probabltes to replace the pre-selected subsequence and obtans a new set of subsequences through the random samplng. The procedure s repeated untl the maxmum number of teratons has been reached or a satsfyng set of local optmal subsequences has been found [11, 12, 15]. Consensus uses a greedy algorthm to algn functonally related sequences and apples the algorthm to dentfy the bndng stes for the E. col CRP proten [8]. MEME+ [2] uses Expectaton Maxmzaton technque to ft a two component mxture model and the model s then used to fnd TFBSs. MEME+ acheves hgher accuracy than ts earler verson MEME [1]. However, the predcton accuracy s stll not satsfactory. Genetc algorthms (GAs) smulate the Darwn evolutonary process to fnd a local optmal soluton for an optmzaton problem. Approaches based on GAs start wth an ntal populaton of a certan sze. They then go through a seres of selecton, crossover and mutaton processes to converge to the global optmum. The selecton, crossover and mutaton operatons are appled to the ndvduals n the populaton to generate the next generaton. These operatons are based on certan methods and probabltes. Ths evolutonary procedure contnues untl the maxmum allowed number of generatons has been generated or the dfference between the values of objectve functons assocated wth two consecutve generatons s less than a pre-set small threshold. Genetc algorthms have been successfully used to solve the TFBS predctng problem, such as FMGA [10] and MDGA[4]. FMGA was declared to have better performance than Gbbs Motf Sampler [11] n terms of both predcton accuracy and computaton effcency. MDGA [4] s another program that uses genetc algorthms to predct TFBSs n homologous sequences. Durng the evolutonary process, MDGA uses nformaton content to evaluate each ndvdual n the populaton. MDGA s able to acheve hgher predcton accuracy than Gbbs samplng algorthm based approaches whle usng a less amount of computaton tme. So far, most of the exstng approaches use heurstcs methods to reduce the sze of the search space. However, heurstcs employed by these approaches may also adversely affect the predcton accuracy. For example, GA based predcton tools cannot guarantee the predcton results are the same for dfferent runs of the program. A well defned strategy that can be used to effcently explore the search space and can generate determnstc and hghly accurate predcton results s thus necessary to further mprove the performance of predcton tools.

3 3 Recent work has shown that an ensemble of HMMs can be effectvely used to mprove the accuracy of the algnment of multple proten sequences [17]. In ths paper, we develop a new approach that can predct the locatons of TFBSs wth an ensemble of Hdden Markov Models (HMMs). The approach uses an ensemble of profle HMMs to generate a lst of postons that are lkely to be the startng postons of the TFBSs. As the frst step, we construct the ensemble from the local algnment of two sequences. The ensemble conssts of HMMs that represent the local algnments wth most sgnfcant algnment scores. We then algn each profle HMM n the ensemble to each sequence n the dataset, the parameters of the HMMs are also changed to ncorporate the new nformaton we have obtaned by algnng the new sequence to the HMMs. Ths procedure s repeated untl all sequences n the dataset have been processed. The number of HMMs n the ensemble can be used as a parameter and can also be adjusted based on the needs of users. We have mplemented ths approach nto a software tool EHMM and our expermental results show that the predcton accuracy of EHMM s hgher than or comparable wth that of the exstng tools. II. ALGORITHMS AND METHODS The method selects the two sequences that have the lowest smlarty to ntalze the ensemble. The smlarty between each par of sequences n the set s computed by globally algnng the two sequences. A local algnment of the selected sequences s then computed. The algnment results are then used to construct an ensemble that conssts of k HMMs, where k s a postve nteger. The algorthm selects the local algnments wth the k largest algnment scores and each of such local algnments can be used to construct an HMM. An ensemble of k HMMs can thus be constructed based on the local algnments wth k most sgnfcant algnment scores. We then progressvely use the HMMs to scan through each remanng sequence n the set. Each sequence segment n a sequence s algned to each HMM n the ensemble and the algnments wth k most sgnfcant scores are selected to update the parameters of the HMM. Ths process wll create up to k 2 HMMs, but only the algnments that have the k most sgnfcant algnment scores are selected to create a new ensemble of k HMMs. We repeat ths procedure untl all sequences n the set have been processed and the HMMs remaned n the ensemble provde the canddate TFBS motfs. Fgure 1 (a) and (b) provde an llustraton of the process. Fgure 2 shows the fnal stage of the approach, where the bndng stes can be determned from the HMMs n the ensemble. The followng sectons provde a detaled descrpton of the steps of the algorthm..

4 4 (a) (b). Fgure 1. (a) An ensemble s constructed from local algnments (b) The ensemble s updated progressvely. Fgure 2. Fnally the bndng stes can be nferred from the HMMs n the ensemble. A. Ensemble Intalzaton The algorthm selects two sequences that are of the lowest smlarty value from the set and uses Smth-Waterman local algnment algorthm [16] to obtan local algnments wth sgnfcant scores. The algnment can be performed n quadratc computaton tme. To construct an ensemble of k HMMs, a dynamc programmng table needs to be mantaned to store the algnment scores. Gven two sequences s and t and a score matrx M that evaluates the ftness value to match two nucleotdes together n an algnment. The recurson relaton for the dynamc programmng s as follows. S, max{0, S[ 1][ M[ s, ], S[ ][ j 1] M[ t ] S[ 1][ j 1] M[ s ][ t ]} (1) [ j j

5 5 where S s the two dmensonal dynamc programmng table; s and t j are the th and j th nucleotdes n s and t, After the dynamc programmng table s completely determned, the algorthm selects the algnments wth the k largest algnment scores n table S. A trace-back table can be mantaned durng the dynamc programmng process. Based on the trace-back table, a trace-back procedure can be employed to dentfy the subsequences n the algnments that correspond to the k largest algnment scores. An ensemble of k profle HMMs can then be constructed from the k algnments. An algnment can be consdered as a set of columns, and each column contans a set of nucleotdes and gaps that are algned together n the algnment. A profle HMM contans two states, namely D and M, for column n the correspondng algnment. The deleton state D does not emt any nucleotde and s used to represent the gaps n column ; the matchng state M emts a nucleotde and s used to descrbe the probabltes for each nucleotde to appear n column. The probabltes of emsson and transton for each state can be computed from each algnment as well. Fgure 3 llustrates the process that converts a multple algnment of subsequences nto the correspondng profle HMM. The parameters of a profle HMM can be computed as follows. Fgure 3. A multple algnment of subsequences can be converted nto a profle HMM. et et Ca ep ( M, a) (2) C bn b P(, b, 1, c) bn, cn ( M, M 1) (3) P(, b, 1, c) bn, cn bn ( 1 1 P(, b, 1, ) et M, D ) 1 et( M, M ) (4) P(, 1, b) bn ( D, M 1 ) (5) P(, 1, b) P(, 1, ) bn ( D, D 1) 1 et( D, M 1 et ) (6)

6 6 where N s the set of all types of nucleotdes, C represents the number of tmes that nucleotde a a appears n column, ep( M, a) s the emsson probablty for state M to emt nucleotde a. et( M, M 1) s the probablty for the transton from M to M 1 to occur; P(, b, 1, c) s the number of tmes that nucleotde b appears n column and nucleotde c appears n poston 1; P(, b, 1, ) s the number of tmes that nucleotde b appears n column and a gap appears n column 1. et ( D, M 1) s the probablty for the transton from D to M 1 to occur; P(, 1, b) s the number of tmes that a gap appears n column and nucleotde b appears n column 1; P(, 1, ) s the number of tmes that gaps appear n both columns and 1. More detals of the algorthm can be found n [5]. B. Updatng Ensemble The remanng sequences n the set are processed based on the profle HMMs n the ensemble. A sequence that has not been processed n the set s scanned through by each profle HMM and subsequences that have the k most sgnfcant algnment scores are selected. The algorthm uses a wndow of certan sze to slde through the sequence. The sze of the wndow s set to be 1.5 tmes of the average lengths of all subsequences n the algnments used to construct the ensemble. The wndow moves by 1bp each tme and each subsequence n the wndow s algned to each HMM n the ensemble. The algnment can be computed wth a dynamc programmng algorthm. The recurson relaton for the dynamc programmng s as follows. S[ s s s1 s1 s s1 s1 j [ M s,, max{ et( M s, Ds ) ep( M s, t ) S[ Ds, 1,, et( M s, M s 1) S[ M s1, 1, j D,, max{ et( D, D ) S[ D,,, et( D, M ) S[ M,, ]} (7) S ]} (8) where 0 j W are ntegers that ndcate the locaton of subsequence t ncluded n the wndow; S[ Ds,, and S[ M s,, are the dynamc programmng table cells that store the maxmum probablty for states D and M to generate the subsequence t [... nucleotde at poston n t. More detals of the algorthm can be found n [5]. ; t s the The algorthm then selects k subsequences wth the largest algnment scores. We thus obtan n 2 total k canddates for updatng the HMMs n the ensemble. We pck k subsequences that 2 correspond to the largest k algnment scores from these k canddates. The parameters of each profle HMM are then updated based on these addtonal k subsequences. Specfcally, the addtonal subsequence changes the counts that appear n equatons (2), (3), (4), (5), and (6), the parameters of each HMM n the ensemble thus needs to be reevaluated. The process s appled progressvely to other remanng sequences n the set untl every sequence n the set has been processed. The locatons of the sequence segments that are used to construct each HMM n the ensemble are then determned by searchng n the sequences n the data set and the algorthm outputs the locatons as those of the bndng stes. C. Computaton Tme We assume the set contans m sequences, each sequence contans n nucleotdes, and the bndng ste contans l nucleotdes. The constructon of the ntal ensemble needs O( m n kn ) tme

7 7 snce the dynamc programmng of Smth-Waterman local algnment needs O ( n 2 ) tme. The computaton tme needed to scan through a sequence wth a sngle HMM s O ( l 2 n). The total amount of computaton needed by the approach s thus O( t( kml n m n kn )), where t s the number of teratons the algorthm needs to execute. Snce the memory space needed by the algnments can be reused, the space complexty of the algorthm s O ( n 2 ). III. EXPERIMENTAL RESULTS We mplemented ths approach and developed a software tool EHMM. We tested ts accuracy on a bologcal dataset cyclc-amp receptor proten (CRP). Ths dataset conssts of 18 sequences, each of whch conssts of 105 bps [13]. Twenty three bndng stes have been determned by usng the DNA footprntng method, wth a motf wdth of 22 [12]. Table 1 compares the predcton accuracy of EHMM wth three other computatonal methods: Gbbs Sampler [8], BoProspector [9], and MDGA [3]. The value of the parameter s set to be k 10 n all the tests. It can be seen from the table that EHMM can acheve comparable accuracy wth other tools n homologous sequences that contan a sngle bndng ste motf. However, ts predcton accuracy on those that contan multple bndng ste motfs s sgnfcantly hgher. For most of such sequences, EHMM can accurately dentfy the locatons of both motfs. Ths s beyond the capablty of all three other methods. In partcular, EHMM obtans excellent predcton results on sequence 17, where all three other methods fal to dentfy ether of the two motfs. It s not surprsng that our method s capable of dentfyng the locatons of multple bndng stes snce t uses an ensemble of HMMs to explore the algnment space of all subsequences, whch sgnfcantly mproves the samplng ablty and the probablty to accurately dentfy the locatons of TFBSs. Seq FP GS E BP E GA E EHMM E 1 17, ,60-1, , ,54 1, , ,59-1, , ,81-1, , ,80 1,-4

8 Table 1. The predcton accuracy of EHMM,GS,BP, and GA. A sngle sequence may contan multple bndng ste motfs. Seq. denotes sequences; FP column lsts the startng postons of the bndng stes measured wth fngerprnt experments. GS, BP, GA columns lst the startng postons predcted by Gbbs Sampler, BoProspector, and MDGA, respectvely. E columns show the devaton of the predcted startng postons from those obtaned wth fngerprnt experments. In addton to the data set CRP, we also use EHMM ( k 10 ) and other tools to predct the bndng stes for a few transcrpton factors ncludng BATF [13], EGR1[9], FOXO1[3], and HSF1[14]. The predcton accuracy of a software tool s evaluated by computng ts predcton accuracy on each sngle sequence n the set and takng the average of the predcton accuracy on all sequences n the set. The predcton accuracy on a sngle sequence s defned to be the percentage of correctly predcted part n the bndng ste. In other words, f we use B to denote the bndng ste and P to denote the predcted bndng ste, the accuracy of the predcton can be computed wth P B A (9) B where we use P B to denote the ntersecton of P and B. For a set D of homologous sequences, the predcton accuracy of an approach on D s computed wth As sd AD (10) D where s s a sequence n D and As s the predcton accuracy of the approach on s. Fgure 4 shows and compares the predcton accuracy of EHMM, Gbbs Sampler, BoProspector, and MDGA on the four data sets. It s not dffcult to see from the Fgure that EHMM acheves sgnfcantly hgher predcton accuracy on data sets BATF, FOXO1, and HSF1 and acheves accuracy that s comparable wth other tools on data set FOXO1. 120% 100% 80% 60% 40% EHMM GS BP GA 20% 0% BATF EGR1 FOXO1 HSF1

9 9 Fgure 4. Predcton accuracy of the EHMM, GS(Gbbs Sampler), BP(BoProspector), GA(MDGA) on data sets BATF, EGR1, FOXO1,and HSF % % 80.00% 60.00% 40.00% k=6 k=8 k=10 k= % 0.00% BATF EGR1 FOXO1 HSF1 Fgure 5. Predcton accuracy of the EHMM when k s 6,8,10,12 respectvely. The sze of the ensemble s a parameter that can be changed by the user to balance the predcton accuracy and the computaton tme needed for predcton. Fgure 5 shows the predcton accuracy on data sets BATF, EGR1, FOXO1, and HSF1 when the value of the parameter k s 6,8,10, and 12. It can be seen from the Fgure that the predcton accuracy mproves when the sze of the ensemble ncreases and the predcton accuracy becomes steady when the value of the parameter s 10. The testng results also show that a parameter value of 10 s thus suffcent to acheve satsfactory predcton accuracy n practce.. IV. CONCLUSIONS In ths paper, we developed a new approach that can accurately and effcently dentfy the bndng ste motfs on a set of homologous DNA sequences. Our approach starts wth a par of sequences n the set and uses the local algnment results of the two sequences to construct an ntal ensemble. It then progressvely processes the remanng sequences n the set and updates the parameters of the HMMs n the ensemble untl every sequence n the set has been processed. Expermental results show that, on the data we have performed our tests, ths approach can acheve hgher or comparable accuracy on sequences wth a sngle bndng ste whle ts accuracy on sequences wth multple bndng stes s sgnfcantly hgher than that of other tools. ACKNOWLEDGMENT Y.Song s work s under the support of the Startup Fundng for New Faculty at Jangsu Unversty of Scence and Technology. REFERENCES [1] T.L. Baley and C. Elkan, Unsupervsed learnng of multple motfs n bopolymers usng expectaton maxmzaton, Techncal Report CS93-302, Department of Computer Scence, Unversty of Calforna, San Dego, August 1993.

10 10 [2] T.L. Baley and C. Elkan, Fttng a mxture model by expectaton maxmzaton to dscover motfs n bopolymers, Proceedngs of the Second Internatonal Conference on Intellgent Systems for Molecular Bology, pp , [3] M. M. Brent, R. Anand, and R. Marmorsten, Structural Bass for DNA Recognton by FoxO1 and ts regulaton by posttranslatonal modfcaton, Structure, 16: , [4] D. Che, Y. Song, and K. Rasheed, MDGA: Motf Dscovery Usng A Genetc Algorthm, Proceedngs of the Genetc and Evolutonary Computaton Conference 2005, pp [5] R. Durbn, S.R. Eddy, A. Krogh, and G. Mtchson, Bologcal Sequence Analyss: Probablstc Models of Protens and Nuclec Acds, Cambrdge Unversty Press, [6] D.J. Galas and A. Schmtz, A DNA footprntng: a smple method for the detecton of proten-dna bndng specfcty, Nuclec Acds Research, 5, 9, pp , [7] M.M. Garner and A. Revzn, A gel electrophoress method for quantfyng he bndng of protens to specfc DNA regons: applcaton to components of the Eschercha col lactose operon regulatory systems, Nuclec Acds Research, 9, 13, pp , [8] G. Z. Hertz and G. D. Stormo, Identfyng DNA and proten patterns wth statstcally sgnfcant algnments of multple sequences, Bonformatcs, 15,7, pp , [9] T.C. Hu, et al., Snal assocates wth EGR-1 and SP-1 to upregulate transcrptonal actvaton of p15ink4b., the FEBS Journal, 277: , [10] F.F.M. Lu, J.J.P. Tsa, R.M. Chen, S.N. Chen, and S.H. Shh, FMGA: fndng motfs by genetc algorthm, IEEE Fourth Symposum on Bonformatcs and Boengneerng, pp , [11] J.S. Lu, A.F. Neuwald, and C.E. Lawrence, Bayesan models fo multple local sequence algnment and Gbbs samplng strateges, J. Am. Stat. Assoc., 90, 432, pp , [12] X. Lu, D.L. Brutlag, and J.S. Lu, BoProspector: dscoverng conserved DNA motfs n upstream regulatory regons of co-expressed genes, Pacfc Symposum of Bocomputng, 6, pp , [13] M. Qugley et al., Transcrptonal analyss of HIV-specfc CD8+ T cells shows that PD-1 nhbts T cell functon by upregulatng BATF, Nature Medcne, 16, , [14] K. T. Rgbolt, et al., System-wde temporal characterzaton of the proteome and phosphoproteome of human embryonc stem cell dfferentaton., Scence Sgnalng, 4: RS3-RS3, [15] F.R. Roth, J.D. Hughes, P.E. Estep, and G.M. Church, Fndng DNA regulatory motfs wthn unalgned noncodng sequences clustered by whole-genome mrna quanttaton, Nature Botechnology, 16,10, pp , [16] T.F. Smth and M.S. Waterman, Identfcaton of Common Molecular Subsequences, Journal of Molecular Bology, 147: [17] J. Song, C. Lu, Y. Song, J. Qu, and G. Hura, Algnment of multple protens wth an ensemble of Hdden Markov Models, Internatonal Journal of Bonformatcs and Data Mnng, 4(1): 60-71, [18] G.D. Stormo, Computer methods for analyzng sequence recognton of nuclec acds, Annu. Rev. BoChem, 17, pp , [19] G.D. Stormo and G.W. Hartzell, Identfyng proten-bndng stes from unalgned DNA fragments, Proc. of Nat. Acad. Sc., 86, 4, pp , 1989.

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

A Compressing Method for Genome Sequence Cluster using Sequence Alignment

A Compressing Method for Genome Sequence Cluster using Sequence Alignment A Compressng Method for Genome Sequence Cluster usng Sequence Algnment Kwang Su Jung 1, Nam Hee Yu 1, Seung Jung Shn 2, Keun Ho Ryu 1 1 Database/Bonformatcs Laboratory, Chungbuk Natonal Unversty, Korea

More information

EVALUATION OF THE PERFORMANCES OF ARTIFICIAL BEE COLONY AND INVASIVE WEED OPTIMIZATION ALGORITHMS ON THE MODIFIED BENCHMARK FUNCTIONS

EVALUATION OF THE PERFORMANCES OF ARTIFICIAL BEE COLONY AND INVASIVE WEED OPTIMIZATION ALGORITHMS ON THE MODIFIED BENCHMARK FUNCTIONS Academc Research Internatonal ISS-L: 3-9553, ISS: 3-9944 Vol., o. 3, May 0 EVALUATIO OF THE PERFORMACES OF ARTIFICIAL BEE COLOY AD IVASIVE WEED OPTIMIZATIO ALGORITHMS O THE MODIFIED BECHMARK FUCTIOS Dlay

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 5) Maxmum Varance Combned wth Adaptve Genetc Algorthm for Infrared Image Segmentaton Huxuan Fu College of Automaton Harbn

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2860-2866 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A selectve ensemble classfcaton method on mcroarray

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Classifier Swarms for Human Detection in Infrared Imagery

Classifier Swarms for Human Detection in Infrared Imagery Classfer Swarms for Human Detecton n Infrared Imagery Yur Owechko, Swarup Medasan, and Narayan Srnvasa HRL Laboratores, LLC 3011 Malbu Canyon Road, Malbu, CA 90265 {owechko, smedasan, nsrnvasa}@hrl.com

More information

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem An Effcent Genetc Algorthm wth Fuzzy c-means Clusterng for Travelng Salesman Problem Jong-Won Yoon and Sung-Bae Cho Dept. of Computer Scence Yonse Unversty Seoul, Korea jwyoon@sclab.yonse.ac.r, sbcho@cs.yonse.ac.r

More information

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe CSCI 104 Sortng Algorthms Mark Redekopp Davd Kempe Algorthm Effcency SORTING 2 Sortng If we have an unordered lst, sequental search becomes our only choce If we wll perform a lot of searches t may be benefcal

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) Clusterng Algorthm Combnng CPSO wth K-Means Chunqn Gu, a, Qan Tao, b Department of Informaton Scence, Zhongka

More information

Using Neural Networks and Support Vector Machines in Data Mining

Using Neural Networks and Support Vector Machines in Data Mining Usng eural etworks and Support Vector Machnes n Data Mnng RICHARD A. WASIOWSKI Computer Scence Department Calforna State Unversty Domnguez Hlls Carson, CA 90747 USA Abstract: - Multvarate data analyss

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks

Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks Usng Bayesan Network Inference Algorthms to Recover Molecular Genetc Regulatory Networks Jng Yu 1,2, V. Anne Smth 1, Paul P. Wang 2, Alexander J. Hartemnk 3, Erch D. Jarvs 1 1 Duke Unversty Medcal Center,

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Complexity Analysis of Problem-Dimension Using PSO

Complexity Analysis of Problem-Dimension Using PSO Proceedngs of the 7th WSEAS Internatonal Conference on Evolutonary Computng, Cavtat, Croata, June -4, 6 (pp45-5) Complexty Analyss of Problem-Dmenson Usng PSO BUTHAINAH S. AL-KAZEMI AND SAMI J. HABIB,

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

On Supporting Identification in a Hand-Based Biometric Framework

On Supporting Identification in a Hand-Based Biometric Framework On Supportng Identfcaton n a Hand-Based Bometrc Framework Pe-Fang Guo 1, Prabr Bhattacharya 2, and Nawwaf Kharma 1 1 Electrcal & Computer Engneerng, Concorda Unversty, 1455 de Masonneuve Blvd., Montreal,

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Image Emotional Semantic Retrieval Based on ELM

Image Emotional Semantic Retrieval Based on ELM Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 2014) Image Emotonal Semantc Retreval Based on ELM Pele Zhang, Mn Yao, Shenzhang La College of computer scence & Technology

More information

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue

More information

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks Seventh Internatonal Conference on Intellgent Systems Desgn and Applcatons GA-Based Learnng Algorthms to Identfy Fuzzy Rules for Fuzzy Neural Networks K Almejall, K Dahal, Member IEEE, and A Hossan, Member

More information

Vectorization of Image Outlines Using Rational Spline and Genetic Algorithm

Vectorization of Image Outlines Using Rational Spline and Genetic Algorithm 01 Internatonal Conference on Image, Vson and Computng (ICIVC 01) IPCSIT vol. 50 (01) (01) IACSIT Press, Sngapore DOI: 10.776/IPCSIT.01.V50.4 Vectorzaton of Image Outlnes Usng Ratonal Splne and Genetc

More information

A Genetic Programming-PCA Hybrid Face Recognition Algorithm

A Genetic Programming-PCA Hybrid Face Recognition Algorithm Journal of Sgnal and Informaton Processng, 20, 2, 70-74 do:0.4236/jsp.20.23022 Publshed Onlne August 20 (http://www.scrp.org/journal/jsp) A Genetc Programmng-PCA Hybrd Face Recognton Algorthm Behzad Bozorgtabar,

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Application of Maximum Entropy Markov Models on the Protein Secondary Structure Predictions

Application of Maximum Entropy Markov Models on the Protein Secondary Structure Predictions Applcaton of Maxmum Entropy Markov Models on the Proten Secondary Structure Predctons Yohan Km Department of Chemstry and Bochemstry Unversty of Calforna, San Dego La Jolla, CA 92093 ykm@ucsd.edu Abstract

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

A Hidden Markov Model Variant for Sequence Classification

A Hidden Markov Model Variant for Sequence Classification Proceedngs of the Twenty-Second Internatonal Jont Conference on Artfcal Intellgence A Hdden Markov Model Varant for Sequence Classfcaton Sam Blasak and Huzefa Rangwala Computer Scence, George Mason Unversty

More information

Clustering System and Clustering Support Vector Machine for Local Protein Structure Prediction

Clustering System and Clustering Support Vector Machine for Local Protein Structure Prediction Georga State Unversty ScholarWorks @ Georga State Unversty Computer Scence Dssertatons Department of Computer Scence 8-2-2006 Clusterng System and Clusterng Support Vector Machne for Local Proten Structure

More information

MINIMUM DESCRIPTION LENGTH BASED PROTEIN SECONDARY STRUCTURE PREDICTION

MINIMUM DESCRIPTION LENGTH BASED PROTEIN SECONDARY STRUCTURE PREDICTION MINIMUM DESCRIPTION LENGTH BASED PROTEIN SECONDARY STRUCTURE PREDICTION Andrea Hategan and Ioan Tabus Insttute of Sgnal Processng, Tampere Unversty of Technology P.O. Box 553, FIN-33101 Tampere, Fnland

More information

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Positive Semi-definite Programming Localization in Wireless Sensor Networks Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network S. Sudha and N. Ammasagounden Natonal Insttute of Technology, Truchrappall,

More information

Cracking of the Merkle Hellman Cryptosystem Using Genetic Algorithm

Cracking of the Merkle Hellman Cryptosystem Using Genetic Algorithm Crackng of the Merkle Hellman Cryptosystem Usng Genetc Algorthm Zurab Kochladze 1 * & Lal Besela 2 1 Ivane Javakhshvl Tbls State Unversty, 1, I.Chavchavadze av 1, 0128, Tbls, Georga 2 Sokhum State Unversty,

More information

Degree-Constrained Minimum Spanning Tree Problem Using Genetic Algorithm

Degree-Constrained Minimum Spanning Tree Problem Using Genetic Algorithm Degree-Constraned Mnmum Spannng Tree Problem Usng Genetc Algorthm Keke Lu, Zhenxang Chen, Ath Abraham *, Wene Cao and Shan Jng Shandong Provncal Key Laboratory of Network Based Intellgent Computng Unversty

More information

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB V. Hotař, A. Hotař Techncal Unversty of Lberec, Department of Glass Producng Machnes and Robotcs, Department of Materal

More information

Biological Sequence Mining Using Plausible Neural Network and its Application to Exon/intron Boundaries Prediction

Biological Sequence Mining Using Plausible Neural Network and its Application to Exon/intron Boundaries Prediction Bologcal Sequence Mnng Usng Plausble Neural Networ and ts Applcaton to Exon/ntron Boundares Predcton Kuochen L, Dar-en Chang, and Erc Roucha CECS, Unversty of Lousvlle, Lousvlle, KY 40292, USA Yuan Yan

More information

Backpropagation: In Search of Performance Parameters

Backpropagation: In Search of Performance Parameters Bacpropagaton: In Search of Performance Parameters ANIL KUMAR ENUMULAPALLY, LINGGUO BU, and KHOSROW KAIKHAH, Ph.D. Computer Scence Department Texas State Unversty-San Marcos San Marcos, TX-78666 USA ae049@txstate.edu,

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

An Efficient Background Updating Scheme for Real-time Traffic Monitoring

An Efficient Background Updating Scheme for Real-time Traffic Monitoring 2004 IEEE Intellgent Transportaton Systems Conference Washngton, D.C., USA, October 3-6, 2004 WeA1.3 An Effcent Background Updatng Scheme for Real-tme Traffc Montorng Suchendra M. Bhandarkar and Xngzh

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 21 no. 11 2005, pages 2636 2643 do:10.1093/bonformatcs/bt402 Sequence analyss A boostng approach for motf modelng usng ChIP-chp data Pengyu Hong 1, X. Shrley Lu 2, Qng

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Feature Selection for Target Detection in SAR Images

Feature Selection for Target Detection in SAR Images Feature Selecton for Detecton n SAR Images Br Bhanu, Yngqang Ln and Shqn Wang Center for Research n Intellgent Systems Unversty of Calforna, Rversde, CA 95, USA Abstract A genetc algorthm (GA) approach

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

On Some Entertaining Applications of the Concept of Set in Computer Science Course

On Some Entertaining Applications of the Concept of Set in Computer Science Course On Some Entertanng Applcatons of the Concept of Set n Computer Scence Course Krasmr Yordzhev *, Hrstna Kostadnova ** * Assocate Professor Krasmr Yordzhev, Ph.D., Faculty of Mathematcs and Natural Scences,

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

Solving Planted Motif Problem on GPU

Solving Planted Motif Problem on GPU Solvng Planted Motf Problem on GPU Naga Shalaja Dasar Old Domnon Unversty Norfolk, VA, USA ndasar@cs.odu.edu Ranjan Desh Old Domnon Unversty Norfolk, VA, USA dranjan@cs.odu.edu Zubar M Old Domnon Unversty

More information

A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING

A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING M. Nkravan and M. H. Kashan Department of Electrcal Computer Islamc Azad Unversty, Shahrar Shahreqods

More information

Intelligent Information Acquisition for Improved Clustering

Intelligent Information Acquisition for Improved Clustering Intellgent Informaton Acquston for Improved Clusterng Duy Vu Unversty of Texas at Austn duyvu@cs.utexas.edu Mkhal Blenko Mcrosoft Research mblenko@mcrosoft.com Prem Melvlle IBM T.J. Watson Research Center

More information

Disulfide Bonding Pattern Prediction Using Support Vector Machine with Parameters Tuned by Multiple Trajectory Search

Disulfide Bonding Pattern Prediction Using Support Vector Machine with Parameters Tuned by Multiple Trajectory Search Proceedngs of the 9th WSEAS Internatonal Conference on APPLIED IFORMAICS AD COMMUICAIOS (AIC '9) Dsulfde Bondng Pattern Predcton Usng Support Vector Machne wth Parameters uned by Multple rajectory Search

More information

A METHOD FOR ANALYSING GENE EXPRESSION DATA TEMPORAL SEQUENCE USING PROBABALISTIC BOOLEAN NETWORKS

A METHOD FOR ANALYSING GENE EXPRESSION DATA TEMPORAL SEQUENCE USING PROBABALISTIC BOOLEAN NETWORKS 4th European Sgnal Processng Conference (EUSIPCO 2006), Florence, Italy, September 4-8, 2006, copyrght by EURASIP A METHOD FOR AALYSIG GEE EXPRESSIO DATA TEMPORAL SEQUECE USIG PROBABALISTIC BOOLEA ETWORKS

More information

GENETIC ALGORITHMS APPLIED FOR PATTERN GENERATION FOR DOWNHOLE DYNAMOMETER CARDS

GENETIC ALGORITHMS APPLIED FOR PATTERN GENERATION FOR DOWNHOLE DYNAMOMETER CARDS GENETIC ALGORITHMS APPLIED FOR PATTERN GENERATION FOR DOWNHOLE DYNAMOMETER CARDS L. Schntman 1 ; B.C.Brandao 1 ; H.Lepkson 1 ; J.A.M. Felppe de Souza 2 ; J.F.S.Correa 3 1 Unversdade Federal da Baha- Brazl

More information

CHAPTER 4 OPTIMIZATION TECHNIQUES

CHAPTER 4 OPTIMIZATION TECHNIQUES 48 CHAPTER 4 OPTIMIZATION TECHNIQUES 4.1 INTRODUCTION Unfortunately no sngle optmzaton algorthm exsts that can be appled effcently to all types of problems. The method chosen for any partcular case wll

More information

From Comparing Clusterings to Combining Clusterings

From Comparing Clusterings to Combining Clusterings Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (008 From Comparng Clusterngs to Combnng Clusterngs Zhwu Lu and Yuxn Peng and Janguo Xao Insttute of Computer Scence and Technology,

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research Schedulng Remote Access to Scentfc Instruments n Cybernfrastructure for Educaton and Research Je Yn 1, Junwe Cao 2,3,*, Yuexuan Wang 4, Lanchen Lu 1,3 and Cheng Wu 1,3 1 Natonal CIMS Engneerng and Research

More information

PARETO BAYESIAN OPTIMIZATION ALGORITHM FOR THE MULTIOBJECTIVE 0/1 KNAPSACK PROBLEM

PARETO BAYESIAN OPTIMIZATION ALGORITHM FOR THE MULTIOBJECTIVE 0/1 KNAPSACK PROBLEM PARETO BAYESIAN OPTIMIZATION ALGORITHM FOR THE MULTIOBJECTIVE 0/ KNAPSACK PROBLEM Josef Schwarz Jří Očenáše Brno Unversty of Technology Faculty of Engneerng and Computer Scence Department of Computer Scence

More information

Parallel Smith-Waterman Algorithm for DNA sequences Comparison on different cluster architectures

Parallel Smith-Waterman Algorithm for DNA sequences Comparison on different cluster architectures Parallel Smth-Waterman Algorthm for DNA sequences Comparson on dfferent cluster archtectures Rucc Enzo, De Gust Armando E., Chchzola Franco. Insttuto de Investgacón en Informátca LIDI (III-LIDI) Facultad

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information