Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks

Size: px
Start display at page:

Download "Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks"

Transcription

1 Usng Bayesan Network Inference Algorthms to Recover Molecular Genetc Regulatory Networks Jng Yu 1,2, V. Anne Smth 1, Paul P. Wang 2, Alexander J. Hartemnk 3, Erch D. Jarvs 1 1 Duke Unversty Medcal Center, Department of Neurobology, Box 3209, Durham, NC Duke Unversty, Department of Electrcal Engneerng, Box 90291,Durham, NC Duke Unversty, Department of Computer Scence, Box 90129, Durham, NC Recent advances n hgh-throughput molecular bology has motvated n the feld of bonformatcs the use of network nference algorthms to predct causal models of molecular networks from correlatonal data. However, t s extremely dffcult to evaluate the effectveness of these algorthms because we possess nether the knowledge of the correct bologcal networks nor the ablty to expermentally valdate the hundreds of predcted gene nteractons wthn a reasonable amount of tme. Here, we apply a new approach developed by Smth, et al. (2002) that tests the ablty of network nference algorthms to accurately and effcently recover network structures based on gene expresson data taken from a smulated bologcal pathway n whch the structure s known a pror. We smulated a genetc regulatory network and used the resultant sampled data to test varatons n the desgn of a Bayesan Network nference algorthm, as well as varatons n total quantty of avalable data, length of samplng nterval, method of data dscretzaton, and presence of nterpolated data between observed data ponts. We also advanced the nference algorthm by developng a heurstc nfluence score that nfers the strength and sgn of regulaton (up or down) between genes. In these experments, we found that our nference algorthm worked best when presented wth data dscretzed nto three categores, when usng a greedy search algorthm wth random restarts, and when evaluatng networks usng the BDe scorng metrc. Under these condtons, the algorthm was both accurate and effcent n recoverng the smulated molecular network when the sampled data sets were large. Under more bologcally reasonable small amounts of sampled data, the algorthm worked best only when nterpolated data was ncluded, but had dffculty recoverng relatonshps descrbng genes wth more than one regulatory nfluence. These results suggest that network nference algorthms and samplng methods must be carefully desgned and tested before they can be used to recover bologcal genetc pathways, especally n the context of hghly lmted quanttes of data. INTRODUCTION The advent of novel technologes for collectng hgh-throughput data n molecular bology has led to the concurrent development of bonformatcs tools for analyzng ths data. Computer scentsts and bonformatcans soon realzed that common nference algorthms used n other felds can be appled to these large amounts bologcal data, such as those from mcroarrays, to statstcally predct causal molecular pathways. However, these potentally powerful algorthms are lmted by our nablty to evaluate ther accuracy, as we do not know the true bologcal network n whch to compare them wth and expermenters can not physcally perform n reasonable tme the multple gene knockouts or other types of nterventons requred to systematcally test the predcted networks. As part of an ongong project dedcated to ntegratng the songbrd bran (Jarvs et al. 2002), Smth, et al. (2002) developed a novel approach for evaluatng the accuracy and effcency of network nference algorthms n a reasonable amount of tme. Ths approach requres the creaton of a bologcally reasonable smulaton on a computer n whch the expermenter makes and knows all the rules. As the smulaton runs, data s sampled from t as one would sample data from a real bologcal system. The sampled data s then passed to an nference algorthm to evaluate the algorthm s ablty to recover the smulated system. The nference algorthm can then be modfed and made more robust to recover a network that closely matches the smulated system. After confdent recovery of the system from lmted smulated data s acheved, the algorthm can be appled to real data. The recovered system can then be used to gude further bologcal expermentaton for verfyng the predcted regulatory relatonshps. In our frst use of ths approach (Jarvs et al. 2002; Smth et al. 2002), we ncorporated multple levels of bologcal organzaton, from the molecular to the behavoral. Here, we attempt to look more closely at a sngle level of bologcal organzaton, the molecular level. We developed a smulator, whch we named GeneSmulator, that models genetc regulatory networks and generates correlatonal data smlar to that collected from hgh-densty gene mcroarrays. We then evaluated varous Bayesan network (BN) nference algorthm desgns for ther ablty to recover the underlyng genetc regulatory network. We chose to use a BN algorthm, because compared wth other common algorthms (Somogy and Snegosk 1996; D haeseleer et al. 1999; Weaver et al. 1999), BN have the ablty to smultaneously model non-lnear combnatoral relatonshps, robustly handle nosy data sets, and guard aganst over-fttng. BN can not handle networks wth cyclc structures, such as regulatory feedback loops; however, dynamc Bayesan networks (DBN) can handle cyclc structures (Fredman et al. 1999; Murphy and Man 1999). We used DBN, and when so confgured, they are also capable of copng wth hdden varables that are not observed n the data, such as proten levels or proten nteractons that affect the measured gene expresson data. In the DBN nference algorthm we developed here, we tested dfferent scorng metrcs and heurstc search methods, as well as dfferent aspects of data collecton and dscretzaton, n order to determne the best confguraton for recoverng the smulated system. Our analyss provdes nsght on how to more effcently use BN nference algorthms for dscoverng genetc networks from correlatonal data.

2 METHODS AND LOGIC GeneSmulator GeneSmulator s programmed n Matlab (MathWorks, Inc.). It models genetc regulatory pathways of arbtrary structure (topology) and produces values of gene expresson levels at dscrete tme ponts. Updates to values at each tme step are governed by a smple stochastc process: Yt + 1 = f ( Yt ) = A( Yt T ) + ε where Y t s a vector representng the expresson levels of all genes at tme t, wth expresson levels rangng from 0 to 100 (arbtrary unts). A s a matrx that represents the relatonshps of gene nteractons n the underlyng regulatory pathway. For every entry of A, the magntude of the entry represents the strength of the regulaton that a regulator gene exerts upon a target gene; the sgn ndcates the type of regulaton, wth postve values ndcatng upregulaton and negatve values ndcatng down-regulaton. T s a vector of threshold regulatng values for each gene: a regulatory gene exerts an nfluence on ts target gene only to the extent that t dffers from ts threshold value. In ths study all gene thresholds have been set to half of the maxmum value;.e., every entry of T s exactly 50. If the regulator gene s present at a level above ts threshold value, then ts regulatory effect on ts target genes occurs as specfed n A. Contrarly, f the regulator gene s present at a level below ts threshold value, then ts effect s n the opposte drecton of that specfed n A to return the gene to ts basal level. The ε term s whte nose, drawn unformly at random from the nterval 10 to 10. Ths term s added for stochastcty and s meant to capture all sources of nose, especally nherent bologcal nose. If a gene has no regulator (the correspondng entres n A are all zero), then t wll move n a random walk, wth steps taken accordng to the values of ε. As the smulaton runs, the data s sampled n pre-specfed ntervals as one would do n an actual bologcal experment, and the samples are exported to a text fle. For example, f we collect data every fve tme ponts, then we defne the samplng nterval to be 5, and the sampled output s the seres of expresson level vectors (Y 0,Y 5,Y 10,...), analogous to data gathered n a mcroarray tme course experment. Data Processng and Collecton Methods Dscretzaton: Before beng passed to our DBN nference algorthms, the data we collect needs to be dscretzed. Dscrete data allows us to model complex non-lnear nteractons between genes wthout resortng to computatonally prohbtve calculatons over contnuous dstrbutons. In ths study, we dscretzed the sampled expresson levels generated by GeneSmulator from contnuous values nto varous numbers of categores to determne f fner or coarser dscretzaton mproves recovery accuracy. We also evaluated two general types of dscretzaton strateges: hard and soft. Hard dscretzaton employs frm boundares between categores, requrng a gven expresson level to belong to only a sngle category. Soft dscretzaton employs fuzzy boundares between categores, allowng a gven expresson level to belong to two or more categores wth dfferent percentages each. Other data processng and collecton methods are descrbed n the results. Bayesan Network Inference Algorthms Our DBN nference algorthms are wrtten n C++ and are desgned to search for hgh-scorng graphcal models (networks) that descrbe probablstc relatonshps between varables. The score that s computed for a graph generated from the data collected and dscretzed s a measure of how successfully the graph explans the relatonshps n the data and also how smply t does so. Graphs are penalzed for over-complexty or overgeneralty so there s a resultant bas towards smpler graphs. Ths guards aganst over-fttng the model to the data. Every node n the BN graph represents a sngle varable, here one gene. Every drected edge, or lnes wth arrows, between two nodes represents a condtonal statstcal dependence of the chld node on the parent node. In the context of a DBN fro recoverng a genetc regulatory network, each edge ndcates a regulatory relatonshp n whch the parent gene regulates the chld gene at a later tme. Basc Theory of BN: A statc BN (Fredman et al. 2000) s an acyclc drected graph that encodes a jont probablty dstrbuton over χ, where χ= {X 1,...,X n } s a set of dscrete random varables X. The BN for χ s specfed as a par < G,Θ >. The varable G represents a drected graph whose vertces correspond to the random varables X 1,..., X n. In ths graphcal representaton, each varable X s ndependent of ts non-descendants gven ts parents n G. The varable Θ represents a set of parameters that collectvely quantfy the probablty dstrbutons assocated wth the varables n the graph. Each parameter of Θ s specfed by θ x pa( X ) = P( x pa( X )), for each possble value x of X, and each possble value pa ( X ) of Pa ( X ). Pa ( X ) denotes the set of parents of X n G and pa ( X ) denotes a partcular nstantaton of the parents. Thus, a BN specfes a unque jont probablty dstrbuton over χ gven by: n P ( X 1,..., X n ) = = 1 P( X Pa( X )) These notons extend qute naturally to DBN, whch we explot here n the context of tme seres data (for more detals, see Murphy and Mlan 1999). The problem of dscoverng a BN from a collecton of observed data can be stated as follows. Gven a data set D ={Y 1,Y 2,Y 3,...Y n }of observed nstances of χ, fnd the most probable graph G for explanng the data contaned n D. One common approach to ths problem s to ntroduce a scorng metrc that evaluates how probable each graph G explans the data n D. In the presence of such a scorng metrc, the problem of dscoverng a BN then reduces to the problem of searchng for a graph that yelds a hgh score, gven the observed data n D. To search the hghest scorng graph, a partcular search method needs to be used. Bayesan Scorng Metrcs: The Bayesan scorng metrc can be generally descrbed (Heckerman 1996) as: Score( G : D) = log P( G D) = log P( D G) + log P( G) log P( D) Whch states that the score of the graph G gven data n D s equvalent to the log of the probablty of G gven D. In ths study, we nvestgated two types of scorng metrcs that employ dfferent assumptons: the BDe (Bayesan Drchlet equvalent) and the BIC (Bayesan Informaton Crteron) scorng metrcs. Both scorng metrcs have an nherent penalty for overcomplexty to guard aganst over-fttng of data. The BDe score captures the full Bayesan posteror probablty P(G D). In ths

3 settng, the pror over graphs needs to be specfed (we use a unform pror) and the pror over parameters s Drchlet,.e. a multnomal dstrbuton descrbng the condtonal probablty of each varable n the network. The BIC score nstead of capturng, s an asymptotc approxmaton to the full posteror probablty P(G D). Ths approxmaton s based on a penalzed maxmum lkelhood estmate. Wth large amounts of data, the BIC s a good approxmaton to the full posteror (BDe) score and s faster to compute; however, t s known to over-penalze wth small amounts of data. Score metrcs also nvolve the generaton of a condtonal probablty table (CPT) for each node. The tables store the occurrences for all combnatons of parent-chld values extracted from the dscretzed data (Heckerman 1996). The occurrence values n the tables are called suffcent statstcs. For a gven graph, a CPT table s made for each node. The occurrence values n the tables are then used to calculate the score for each node. Scores for all nodes are then summed to generate the score for the entre graph. Search Methods To dentfy BN structures wth hgh scores, search methods are employed that search for the hghest scorng graph among a set of graphs usng dfferent heurstc methods. The reason for heurstc search methods s that dentfyng the hghest scorng network usng scorng metrcs s NP complete. As such, heurstc searches are teratve and thus can be run ndefntely and stopped at any tme to reveal the hghest scorng graph vsted thus far. The longer the search, the lkely of fndng a hgher-scorng graph. A sutable cutoff for runnng tme s found emprcally, where longer runnng tmes do not result n sgnfcant changes to the hghest scorng graph found. In our study, we tested three heurstc search methods: 1) greedy search wth random restarts (Heckerman 1996), 2) smulated annealng (Heckerman 1996), and 3) a genetc algorthm (developed n ths study). As n Heckeman (1996), for each type of search we used E to denote the set of elgble changes to a graph and (e) to denote the change n score of a graph resultng from the modfcaton e E, where stands for every elgble change. In addton, we created a hash table (a look up table) to store the calculated scores for each node wth a certan parent set. We found that ths mproved the performance of the search sgnfcantly, savng computatonal tme by avodng the recalculaton of suffcent statstcs n the CPTs when revstng a prevous set of parents for a gven node. Greedy search wth random restarts ntalzes tself by choosng a random graph, then evaluates the change n score (e) assocated wth every possble change e E, and fnally selects the change for whch (e) s maxmzed, provded the maxmal (e) s postve. It proceeds n ths fashon untl all (e) are negatve and no score mprovement can be made. To escape ths local maxmum, the algorthm then restarts from another random graph, and the entre process s repeated untl the total number of teratons s reached. Smulated annealng also ntalzes tself by choosng a random graph, but s gven an ntal temperature T 0, a search parameter. An elgble change e E s selected at random and the probablty expresson p = exp( ( e) / T0 ) s evaluated. If p >1 (whch occurs whenever (e) s postve), then the change e s made; otherwse, the change e s only made wth probablty p. The procedure begns at a very hgh temperature so that almost every elgble change n the graph can be made. As the search progresses, the temperature gradually decreases untl a very low temperature s reached where very lttle change s made n the graph. The search then performs smlarly to the local searches of the random greedy method. A genetc algorthm (GA) (Goldberg 1989) s a search method usng three operators to explore a space of solutons or, n our case, a set of graphs. The three operators are: reproducton, whch promotes the best graphs to the next generaton, mutaton, whch explores new graphs by ntroducng varaton n the populaton to avod local optma, and crossover, whch selects a swappng pont n the parents and exchanges nformaton between them to generate two new graphs, thereby ncreasng the average qualty of a populaton. We are not aware of a GA beng appled to BN searches, and thus explan the operatons we mplemented n more detal. In our genetc algorthm, we generated a mutaton operaton that makes a local change n any possble edge n the graph structure. We generated a crossover operaton that swaps parts of two graphs. Graph structures are specfed as the set of parents for every node, where graph s donated as { Pa ( X ), Pa ( X ),..., Pa ( X n )} and graph j as 1 2 { Pa j ( X ), Pa j ( X ),..., Pa j ( X n )}. To crossover, a randomly 1 2 j chosen varable X becomes the swap pont leadng to two new k structures, graph {Pa (X 1 ),...,Pa (X k ),Pa j (X k +1 )...,Pa j (X n )} ' and graph j {Pa j (X 1 ),...,Pa j (X k ),Pa (X k +1 )...,Pa (X n )} j '. For each GA teraton, ether a mutaton or a crossover operaton s chosen at random and the newly created graphs are reproduced n the next generaton f they have hgher scores than the current graphs n the stored populaton. As t s possble for crossovers to create b-drectonal edges, we check for and elmnate such graphs. Influence Score: Many BN nference algorthms appled to molecular bology are useful for predctng whch genes regulate whch others, but often do not predct the magntude or even the sgn of regulaton (an excepton s Hartemnk et al 2001). Here usng a dfferent approach, we took advantage of the CPT table logc. The occurrence numbers n CPT tables suggest relatonshps between nodes. Here, we generated CPT-lke tables, whch we call nfluence tables, for each par of connected nodes n the hghest scorng graph. The values n the nfluence tables are occurrences from the dscretzed data accordng to the hghest graph. An example of an nfluence table, when there s only one parent node per chld node, s shown n Table 1. Here, three category dscretzaton was used: low (L), medum (M), and hgh Parent L M H L LL ML HL Chld M LM MM HM H LH MH HH Table 1: An example nfluence score table. The table s a modfed verson of a CPT wthout pror knowledge nformaton. Both the parent and chld states are dscretzed as L, M, or H. The values n the cells represent the occurrence of each parent-chld combnaton where the frst descrptor (L, M, or H) s the parent and the second (L, M, or H) s the chld. For clarty, ths table contans the smple case where there s only one parent per chld. In practce, the table has more dmensons to account for multple parents.

4 (H). The occurrence values n the table are then descrbed as combnatons, for example, where HL s the number of tmes n the data that the parent s hgh and the chld s low. When for a gven node the occurrence of LL and HH s greater than the occurrence of LH and HL, then parent gene can be sad to be an actvator of the chld gene. And vce-versa, for the parent beng a repressor. Ths s even the case wthout consderng MX values. Thus, to calculate an nfluence score from these occurrence values, we used the followng formulas: LL HL LL _ HL = and HH LH HH _ LH = LL + HL HH + LH LL LH LL _ LH = and HH HL HH _ HL = LL + LH HH + HL The sgns (+ or -) of the numerators determnes the sgn of the gene regulaton. When LL-HL and HH-LH s greater than 0, and LL-LH and HH-HL s greater than 0, then the parent s consdered an actvator and gets a + sgn. The reverse s the case when both of these values are less than 0, then the parent receves a sgn. Dfferent combnatons of these values, such that one s greater and the other s less than 0, the score changes to less postve or less negatve, or to just 0 (no sgn can be determned). The numerator scales the denomnator to generate values between -1 and 1. Wth four such values, the fnal sgn and magntude of the nfluence s calculated wth the next set of rules and formulas: Frst set the nfluence score to 0. If both of LL_HL and HH_LH are greater than 0 (LL and HH domnate), then add the magntude (LL_HL*HH_LH)/2 to the nfluence score 0. If both of LL_HL and HH_LH are less than 0 (HL and LH domnate), then subtract (LL_HL*HH_LH)/2 from the nfluence score 0. Otherwse, keep the nfluence score 0. Perform smlar operatons for LL_LH or HH_HL. After, add both values from both sets of operatons and sum them to obtan the fnal nfluence score. These two symmetrc steps are used to average the effects from dfferent drectons. If several parents are present, the same calculaton s done wth the other parents fxed n each possble confguraton; the average of these nfluence scores s then used. Regardless of the number of levels of dscretzaton, only the lowest and hghest categores are ncluded nto the calculaton. In the above manner, the nfluence score s mapped to a range between 1 and 1. Stated n terms of gene expresson, postve numbers reflect actvatng relatonshps of a parent on a chld gene, whle negatve numbers reflect repressng relatonshps. The magntude of the nfluence score s suggestve of the gene Fgure 1: Regulatory network and gene expresson tme sequence plot. regulaton strength, whch means the more postve the score s, the stronger the up-regulaton s; the more negatve the score s, the stronger the down-regulaton s. When the nfluence score s close to 0, t s dffcult to nfer the type of regulaton (up or down). RESULTS We frst present results generated by the smulator, then examne dfferent confguratons for the BN nference algorthm usng the same smulated data set. We then examne dfferent samplng and data processng methods usng dfferent data sets. GeneSmulator Usng GeneSmulator, we smulated a genetc regulatory system defned n a matrx A of relatonshps wth the structure shown n the left half of Fgure 1. Twelve of twenty genes (genes 0 to 11) were connected n a regulatory pathway. In addton a feedback connecton, from gene 7 to gene 0, was ncluded. The other eght genes (genes 12 to 19) were not connected to other genes;.e., they were ndependent of all other genes. The absolute regulaton strength for each connecton was set to be the same: 0.1. Consequently, n the matrx A, for every upregulaton of gene y from gene x, A ( y, x) = 0. 1; f the relatonshp s one of down-regulaton, then A(y,x) = 0.1. If there s no connecton between genes x and y, the correspondng entry A(y,x) = 0. To show how GeneSmulator works, we ran t for 500 tme ponts and plotted expresson levels for four of the regulated genes n the rght half of Fgure 1. The results are consstent wth the relatonshps specfed by the orgnal structure: when gene 4 ncrease gene 5 decreases, consstent wth gene 4 down-regulatng gene 5; gene 6 engages n a random walk, consstent wth t havng no regulator; when gene 5 s hgh and 6 s low, gene 9 s hgh, and ths s consstent wth gene 9 beng both up-regulated by 5 and down-regulated by 6. Moreover, the changes n expresson occur over a seres of tme steps. Although our tme steps are untless, f each s consdered to be one mnute, they match well the tme-scale noted for gene expresson n actual bologcal systems (Jarvs and Nottebohm 1997). Ths underlyng regulatory structure was used for all of experments, except for the complexty of network secton. As such, throughout the paper Fgure 1 (left) s to be compared wth all other fgures. Bayesan Network Inference Algorthms Intally we sampled data from GeneSmulator output on a small scale 20 or fewer sampled tme ponts matchng what would be done expermentally usng mcroarrays or other approaches. However, we found that such small sample szes, wthout further data processng, as descrbed n the Data Collecton Methods secton below, were not suffcent for recoverng the smulated structure. In ths secton, to more effectvely evaluate the BN and the dfferent confguratons we made, we ran GeneSmulator wth the system of Fgure 1 (left) for 10,000 tme ponts and sampled at an nterval of every 5 tme ponts, yeldng a total of 2000 sampled tme ponts. Bayesan Scorng Metrcs: To compare the BDe and

5 chld s obscured by the nfluences from the other parent, and the resultng dfference s an artfact of how the nfluence score s calculated. Gven these weaknesses, the nfluence scores stll reflected relatvely well the gene relatonshps. For genes wth one versus two parents, the nfluence scores of the recovered edges are smlar wthn each case (~0.6 for sngle parents; ~0.3 for two parents, Fgure 3) as n the smulated network. We conclude that the nfluence score s capable of recoverng the sgn of the nteracton, but ts relatve magntude depends upon the number of parents. Fgure 2: Comparson of graphs found usng BDe (left) and BIC (rght) scores methods. Shown are the top graphs found. Red correct edges found by one method and not the other. The arrows n these graphs only specfy drecton, not postve or negatve nteractons. BIC scorng metrcs, we chose to use a 3-category hard dscretzaton of the data and a greedy search method wth 1000 random restarts. Compared wth the orgnal smulated structure n Fgure 1 (left), the BN nference algorthm under these condtons had remarkably good recovery, usng ether scorng metrc (Fgure 2). All genes (nodes) n the connected networks and most nteractons (edges) were recovered. For the BDe scorng metrc, the top graph (hghest scorng graph) had exactly the same regulatory structure as the smulated system. For the BIC scorng metrc, the top graph had two edges mssng. We conclude that the BDe score works better than the BIC score n recoverng the underlyng smulated genetc regulatory pathway gven ths quantty of sampled data. The mssng edges under the BIC score are consstent wth ts known over-penalzaton of model complexty. Influence Score: We found that our heurstc nfluence score functon worked, and t gave reasonable results when compared wth the type of regulaton (up or down) n the smulated system (Fgure 3). The sgns of the nteractons (+ and ) were all correctly dentfed (compare Fgure 1 wth Fgure 3). The absolute magntudes of the nfluences scores n the recovered networks ( ) and the regulatory strengths specfed n the smulated system (0.1) although n the same range, were not drectly comparable. Furthermore, n the recovered network when a node had more than one parent, the edges from those parents had lower nfluence score magntudes than f the node had just one parent. Ths s the case because the nfluence of one parent to ts Search Methods: To compare heurstc search methods, we used BDe score wth 3-category hard dscretzaton. Because t s dffcult to set a far stoppng crteron for dfferent search methods, we let each of them run long enough that they dd not make any mprovement for many teratons (determned emprcally). We compared three search methods: greedy search wth random restarts, smulated annealng, and a genetc algorthm. All three yelded the same top graphs wth exact matches to the smulated system, as n the left of Fgure 2. However, we found that wth ths data set greedy search took the least tme (mnutes) to fnd the correct graph wth the hghest score; smulated annealng a longer tme (10s of mnutes); and genetc algorthm took the greatest amount of tme (~hrs). We conclude that the three search methods worked equally well n terms of the top graph found, but the greedy search s best as t can fnd the top graph n the least amount of tme. Data Processng and Collecton Methods For ths secton, we used the BDe scorng metrc and the greedy search method wth 1000 random restarts. Dscretzaton: Usng the same data set as above, we compared the performance of the BN nference algorthm wth dscretzaton nto dfferent numbers of categores: hard dscretzaton nto 2, 3, and 4 categores. The top graph found for each s shown n Fgure 4. All of these dscretzaton approaches allowed recovery of relatvely smlar graphs to the smulated system. Only the 3-category dscretzaton found exactly the same regulatory graph as the smulated system. In the case of the 2-category dscretzaton, extraneous edges were found. These lkely emerged because too much of the nformaton contaned n the data was lost n the overly-coarse dscretzaton. On the other hand, the 4-category dscretzaton led to some dffculty recoverng edges whose chldren had Fgure 3: Graph wth nfluence scores. Numbers besdes the edges are nfluence scores from the top BDe generated graph of Fgure 2. Fgure 4: Comparson of number of dscretzaton categores, wth hard boundares. Black correct edges n common between all three graphs. Red correct edges found n only one or two of the graphs. Blue ncorrect edges. Numbers besde edges are nfluence scores.

6 Fgure 5: Comparson for hard (left) and soft dscretzaton (rght). Red correct edges found n the top graph by one method and not the other. Numbers besde edges are nfluence scores. multple parents, and nstead t found ncorrect edges from ther grandparents. We beleve ths occurs because ncreasngly fne levels of dscretzaton spreads out the data entered n the CPTs, such that ndvdual occurrence values are weakened. To strengthen these values, ntutvely, would requre more data. We next compared hard wth soft dscretzaton usng 3- categores (Fgure 5). Compared wth hard dscretzaton, soft dscretzaton mssed edges to genes wth 2 parents; only one of the parents could be found. We conclude that wth our smulated data the 3-category hard dscretzaton works best n allowng recovery of the smulated genetc pathway. All subsequent analyss below uses 3-category hard dscretzaton. Samplng Amount, Intervals, and Coverage: Snce samplng amount, ntervals, and coverage are dependent on each other, t s not possble to test ther effects on BN recovery ndependently. however, testng ther effects, can be done by multcomparsons. Frst, we sampled dfferent data amounts at the same nterval, 5, whch effectvely changes tme coverage across the smulated output (Fgure 6A). Second, we sampled dfferent data amounts wth the same overall coverage, 10,000 tme ponts, whch effectvely ths changes the samplng nterval (Fgure 6B). Thrd, we sampled at dfferent ntervals wth the same data amount, 500 data ponts, whch effectvely changes the overall coverage (Fgure 7). In each case, two out of three varables change at the same tme. Thus, when the recovery result s dfferent between graphs of the dfferent comparsons (Fgures 6A, 6B, and 7), the responsble varable s one or both of the changng varables. To decded wth a degree of confdence whch one t s, the common dfferences between graphs of two dfferent changng stuatons (for example the common edges n frst graph of Fgure 6A and frst of 6B compared wth ther dfferences n the second graph of 6A and 6B) are due to the effect of the common varable that changes n both. As expected, the more data collected, the better the recovery of the smulated system (compare wthn both Fgures 6A and 6B). Increased coverage does not appear to sgnfcantly mprove recovery (compare wthn both Fgures 6A and 7). Ths counterntutve result, mples that at the coverage across the smulaton s already well represented n our samplng ranges. Interestngly, when changng the samplng nterval, there appears to be an optmum (nterval of ~5) that led to the best recovery (compare wthn both Fgures 7 and 6B). At nterval of 1 (takng every tme pont, but wth a small coverage tme of 500) fewer correct edges were found. Increasng the samplng nterval to 5, more correct edges were found. But as the samplng nterval ncreased more (to 10 and 20) ncorrect edges appeared (Fgure 7 and 6B), even though n one there s more coverage (Fgure 7). The explanaton for these dfferences n Fgure 7 s that the small samplng nterval yelds small coverage, not gvng much nformaton between any two ponts spread dstantly n tme, whle the large samplng nterval, although t yelds large coverage, also loses a large amount of nformaton between any two ponts. In all types of comparsons, t was dffcult recover both edges of genes wth multple parents (genes 0 and 1 to 2; genes 5 and 6 to 9; and genes 8 and 9 to 10). The man varable that had the strongest effect on the recovery of edges from both parents was ncreasng data amount sampled (compare Fgures Fgure 6: Comparson for changes n data amounts and coverage wth same ntervals (A) or changes n data amount and ntervals wth same coverage tme (B). From left to rght, the number of data ponts ncreases. Top graphs are shown. Numbers besde edges are nfluence scores. Red correct edges found n only one or two graphs. Blue ncorrect edges.

7 grandparents (Fgure 6B and 7). There s also an nterval effect on our nfluence score, such that the score ncreased as samplng ntervals ncreased, untl the scores was reduced by the appearance of ncorrect multple parents at very hgh samplng ntervals (Fgures 6A and 7). The ntal ncreased nterval effect on the nfluence score makes sense because the magntude of the dfference n expresson of a gene between two tme ponts ncreases as the nterval ncreases. We conclude that wth our data set, samplng amount and nterval has the largest effect on BN recovery. There s an optmum nterval, whch wll depend upon the underlyng tmng of the gene regulatory pathway under study. Fgure 7: Comparson for samplng ntervals. Labelng s the same as s descrbed n Fgure 7 legend. Numbers besde edges are nfluence scores. 6A, 6B, 7). The type of errors found wth large samplng ntervals were not completely erroneous, but were all ncorrect edges from Bologcal Samplng and Interpolaton of Data Ponts: In an actual bologcal gene expresson experment, the amount of data collected s much smaller than that used n the above tests. To mmc the maxmum amount of data that could be reasonably collected n a mcroarray experment, we lmted the total number of data ponts to be from 100 cdna mcroarray sldes. Wth ths pre-determned total number, we nvestgated the best possble way to allocate the anmal samples. For example, should we sample 100 tme ponts wth one anmal each tme pont or 25 tme ponts wth four anmals each tme pont. For ths amount of data, we found that the recovery results changed wth dfferent data sets from repeated experments, unlke the bgger data sets above whch yelded qute stable results from repeated experments. To better understand ths varaton, we collected 10 dfferent data sets for each of these tests, passed each one separately through the BN algorthms and obtaned the average results (Fgure 8A). The edges that appear frequently across the 10 data sets are of Fgure 8: More bologcally reasonable samplng wth dfferent strateges (A) and performance of nterpolatng data (B). Shown are the average top graph results from 10 datasets each. Black dashed edge only found once n 10 recovery results. Black sold edge fnd more than once but less than 5 tmes n 10 recovery results. Red sold edge fnd more than or equal to 5 tmes n 10 recovery results. Numbers besde edges occurrence (left of slash) and average of nfluence score (rght of slash); these only appear besde the edges found more than once.

8 Fgure 9: Data nterpolaton for smaller amounts of data: 50 (mddle) and 25 (rght) data ponts compared wth 50 unnterpolated data ponts (left). Labelng s the same as s descrbed n Fgure 9 legend. greater confdence. Here we used a samplng nterval of 5 because ths worked best n our prevous tests. Wth ths amount of data, only a partal understandng of the regulatory system was recovered, and many orgnally ndependent genes were ncorrectly ncluded the resultant graph (Fgure 8A). As the number of anmals ncreased, there was no mportant change untl 10 anmals per tme pont n 10 tme ponts. The number of edges found were dramatcally decreased, but for both correct and ncorrect edges. The number genes ncluded n the graph also decreased, but n ths case mostly for those that were not part of the orgnal regulatory system. Another nterestng result s that most of the ncorrect edges revealed (both sold and dashed lnes) had nfluence scores of 0 (Fgure 8A, scores not shown for dashed lnes), and ths can be used to elmnate such edges. We tested whether the recovery performance would be mproved by nterpolatng data ponts. We kept the samplng method the same, except for the samplng nterval. We ncreased the samplng nterval, to 20, to have samplng tme between ponts for nterpolaton. Ths resulted n ncreased coverage, but kept the number of data samplng amount the same. We lnearly nterpolated fve data ponts between every two sampled data ponts, resultng n a sx-fold ncrease n the amount of avalable data, though only 20% of t corresponds to actual sampled data. The correspondng results are shown n Fgure 8B. Regardless of the samplng approach, we found that recovery results much mproved compared wth the results from the raw data ponts wthout nterpolaton. The genes n the connected network were correctly dentfed and no ndependent genes were mplcated. Most of the correct edges were found wth hgh occurrence and all wth the correct sgns of the nfluence score. The prncpal mssng connectons stll nvolved nodes wth multple parents. Some of the ncorrect edges were actually found from a grandparent, such as from gene 7 to 2, gene 4 to 9, and gene 3 to 10. Though techncally ncorrect, they can supply useful nformaton. We conclude that when usng bologcally reasonable expresson data samplng, BN nference algorthms appear to be able to partally recover the nodes of the underlyng genetc pathway and ther sngle parent nteractons, especally when the data s nterpolated. Our BN nference algorthm appears to be less good at recoverng multple parents per node wthn the constrants of bologcally reasonable amounts of data. To nvestgate ths more closely, we tested the data nterpolaton performance wth an even smaller data amount (Fgure 9). Usng 50 data ponts wthout nterpolaton, the recovered genetc system was a mess (Fgure 9, left). However, when usng 50 data ponts wth nterpolaton (Fgure 9, mddle) the recovery was much better, even better than that from 100 data ponts wthout nterpolaton (Fgure 8A). From 25 data ponts wthout nterpolaton, the recovery result was too messy to be shown, but wth data nterpolaton (Fgure 9, rght), the recovery s comparable wth that from 100 data ponts wthout nterpolaton. All the results presented n ths secton are computed usng the BDe score functon; when we tred usng the BIC score functon, no edges were found, agan confrmng that the BIC over-penalzes for over-complexty and that ths excessve penalty s most apparent n the lmt of small amounts of data. Non-unformly Sampled Data: Many bologcal experments sample gene expresson data n a non-unform manner. We tested the effect of non-unform samplng on BN recovery. Here the ntervals between successve samples s a number chosen at random between 15 and 25. Wth a small amount of data, we found that non-unform nterval datasets can recover part of the smulated system only when data nterpolaton was used (Fgure 10). The recovery results are comparable wth that from unformly sampled datasets. Fgure 10: Recovery results from non-unformly sampled dataset. Labelng s the same as s descrbed n Fgure 8 legend. The ntervals for successve samples were chosen at random between 15 and 25.

9 Complexty of the Network The pathway that we smulated has perhaps less complexty than s lkely to be found n bologcal systems. To test some addtonal complexty, we used GeneSmulator to generate data for a pathway that ncludes one node wth three parents nstead of two (gene 2), as well as two ndependent genetc pathways (Fgure 11, left). Fgure 12: A more complex network. The underlyng regulatory network (left) and the recovered one augmented wth nfluence scores (rght). Usng our standard condtons of small-data samplng, we were able to recover most of both ndependent pathways (not shown). However, to recover the three parents to one chld, we found that we needed to collect at least 5000 data ponts (Fgure 12, rght), and the recovered edges had very low nfluence scores (red). We conclude that BN nference algorthms are able to recover ndependent genetc pathways nherent n one data set, but that as the number of parents per node grows, the amount of expresson data needed to recover the entre network wll dramatcally grow, perhaps to an extent that the collecton of ths data becomes mpractcal. DISCUSSION In ths study, we tested and modfed the ablty of BN nference algorthms to recover genetc pathways from smulated gene expresson data. We found that the structure of the underlyng network and the methods used for data processng and samplng had a crtcal mpact on the ablty to recover accurate pathways. Ths concluson s mportant to consder when usng our algorthm on actual gene expresson data, such as that collected usng hghdensty mcroarrays. We found that the best BN nference algorthm for recoverng the smulated genetc pathways was a greedy search method wth random restarts, employng the BDe scorng metrc and beng gven data dscretzed usng a 3-catergory hard dscretzaton. In addton, our nfluence scores were very useful n determnng up or down regulaton and the relatve magntude of regulaton. We also found that there was an optmal samplng nterval and effcent amount of data that allowed our BN nference algorthm to recover the most accurate pathway. We found that the BIC score does not work as well as the BDe score because the BIC score yelds fewer edges or no edge wth bologcally reasonable samplng. Ths happens because BIC apples a strong penalty for more complex structure. One of the key mprovements we made to our nference algorthm for use wth gene expresson data s the generaton of nfluence scores. These scores can determne postve or negatve nteractons and select aganst low-scorng ncorrect nteractons. Ths mproves the accuracy of the network and ncreases the amount of nformaton recovered. Three-category dscretzaton was optmal for the smulated dataset we examned. Ths may seem counterntutve, as dscretzaton leads to nformaton loss from the data: the more coarse the dscretzaton, the more the nformaton loss. Whle ths s true, t s not however the case that fner dscretzaton results n better BN recovery. Ths s because wth more dscretzaton levels, the more CPT values are spread and also the more BN parameters must be estmated, and as a result, more data ponts are needed to confdently predct edges between nodes. In addton, dscretzaton wth hard boundares outperformed dscretzaton wth soft boundares. It s possble that because soft dscretzaton also spreads out the dstrbuton of the data, some of the BN relatonshps are not sgnfcant enough under soft dscretzaton to warrant ncluson of an edge. Based on the results of ths study, we beleve the major lmtaton n dscoverng regulatory networks from gene expresson experments s collectng a suffcent quantty of data to effectvely recover all the nteractons n the genetc network, especally those assocated wth genes wth multple parents. Data collecton n the context of complex organsms s lmted by the physcal constrants of the expermenter and the number of anmals that can be sacrfced for any one gven experment. Although we found here that our BN nference algorthm wll dscover genetc regulatory pathways from bologcally reasonable amounts of data, the generated pathways can be msleadng. Fortunately, nterpolaton of data from these smaller samples szes wth good coverage of the smulated gene expresson pathway can at least yeld a sgnfcant number of correct genes and ther nteractons. Moreover, many of the ncorrect nteractons found are from grandparents, whch at least places genes only one gene removed from ther true regulator. However, our BN nference algorthm had dffculty dentfyng more than one parent for genes wth multple parents. Ths s a serous problem for genetc pathway recovery, as combnatoral regulatory control s a basc property of genetc pathways. However, the soluton to ths problem les n not attemptng to recover complex pathways from lmted amounts of expresson data alone. Other types of data can be brought to bear and, when used n conjuncton wth expresson data, can sgnfcantly enhance the ablty to accurately recover regulatory network structures (Hartemnk et al 2002). Fnally, we suspect that the performance of our BN nference algorthm depends n part on how the networks are smulated. Though we ncluded nose n generatng our smulated data, the data are clearly not a perfect smulaton of true expermental data; some meanngful bologcal nformaton s always lost n mathematcal modelng. Future efforts need to be focused on usng more real data to mprove how well the output of the smulator matches real world data. Effort also needs to be on smulatng nteracton of genes products wthn the same cell versus across cells. As such, the recovery results produced by our BN nference algorthm are not ntended to serve as a substtute for gene nterventon experments, but rather as a gude for optmally desgnng easer to perform correlatonal experments for use wth nference algorthms and to approxmate the accurate genetc pathway leadng to easer to test nterventon experements. ACKNOWLEDGMENTS

10 We thank Kurt Grands and Derek Scott of Duke Unversty for assstance n the begnnng stage of ths project. Ths research s supported by a Whtehall Foundaton grant to EDJ. REFERENCES D haeseleer, P., Wen, X., Furham, S. and Somogy, R., Lnear modelng of mrna expresson levels durng CNS development and njury. (1999) Proceedng of the Pacfc Symposum on Bocomputng. 4: Fredman, N., Murphy, K. and Russell, S., Learnng the Structure form Massve Networks. (1998) Proceedngs of the Fourteenth Conference on Uncertanty n Artfcal Intellgence Fredman, N., Nachman, I. and Pe er, D., Learnng Bayesan Network Structure of Dynamc Probablstc Network. (1999) Proceedng of the Ffteenth Conference on Uncertanty n Artfcal Intellgence Fredman, N., Lnal, M., Nachman, I. and Pe er, D., Usng Bayesan Networks to analyze expresson data (2000) Journal Computatonal Bology. 7: Goldberg, D. E., Genetc Algorthms n Search, Optmzaton, and Machne Learnng. (1989) Wesley, MA. Hartemnk, A. J., Gfford, D., Jaakkola, T. and Young, R., Usng Graphcal Models and Genomc Expresson Data to Statstcally Valdate Modeals of Genetc Regulatory Networks. (2001) Pacfc Symposum on Bocomputng. Hartemnk, A. J., Gfford, D., Jakkoola, T. and Young, R., Combnng locaton and expresson data for prncpled dscovery of genetc regulatory network models. (2002) Pacfc Symposum on Bocomputng. Heckerman, D., A Tutoral on Learnng wth Bayesan Networks. (1996) Techncal Report MSR-TR-95-06, Mcrosoft Research, March, 1995 (revsed November, 1996). Jarvs, E. D. and Nottebohm, F., Motor-drven gene expresson. (1997) Proceedngs of Natonal Academy of Scences of the Unted States of Amerca. Jarvs, E. D. et al, Integrate Songbrd Bran.(2002) Journal of Comparatve Physology, A (n press). Murphy, K. and Man, S., Modelng Gene Expresson Data Usng Dynamc Bayesan Networks. (1999) Techncal Report, Unversty of Calforna, Berkeley. Smth, V. A., Jarvs, E. D. and Hartemnk, A. J., Evaluatng Functonal Network Inference Usng Smulatons of Complex Bologcal Systems. (2002) Accepted by The 10 th nternatonal conference on Intellgent Systems for Molecular Bology. Smth, V. A., Jarvs, E. D. and Hartemnk, A. J., Influence Of Network Topology and Data Collecton On Functonal Network Influence. (2002) Pacfc Symposum on Bocomputng (n press). Somogy, R. and Snegosk, C. A., Modelng the complexty of genetc networks: understandng multgenc and pleotropc regulaton. (1996) Complexty. 1: Weaver, D. C., Workman, C. T. and Stormo, G. D., Modelng regulatory networks wth weght matrcs. (1999) Proceedngs of the Pacfc Symposum on Bocomputng. 4:

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

Three supervised learning methods on pen digits character recognition dataset

Three supervised learning methods on pen digits character recognition dataset Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

USING GRAPHING SKILLS

USING GRAPHING SKILLS Name: BOLOGY: Date: _ Class: USNG GRAPHNG SKLLS NTRODUCTON: Recorded data can be plotted on a graph. A graph s a pctoral representaton of nformaton recorded n a data table. t s used to show a relatonshp

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

A User Selection Method in Advertising System

A User Selection Method in Advertising System Int. J. Communcatons, etwork and System Scences, 2010, 3, 54-58 do:10.4236/jcns.2010.31007 Publshed Onlne January 2010 (http://www.scrp.org/journal/jcns/). A User Selecton Method n Advertsng System Shy

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Context-Specific Bayesian Clustering for Gene Expression Data

Context-Specific Bayesian Clustering for Gene Expression Data Context-Specfc Bayesan Clusterng for Gene Expresson Data Yoseph Barash School of Computer Scence & Engneerng Hebrew Unversty, Jerusalem, 91904, Israel hoan@cs.huj.ac.l Nr Fredman School of Computer Scence

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

MATHEMATICS FORM ONE SCHEME OF WORK 2004

MATHEMATICS FORM ONE SCHEME OF WORK 2004 MATHEMATICS FORM ONE SCHEME OF WORK 2004 WEEK TOPICS/SUBTOPICS LEARNING OBJECTIVES LEARNING OUTCOMES VALUES CREATIVE & CRITICAL THINKING 1 WHOLE NUMBER Students wll be able to: GENERICS 1 1.1 Concept of

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

EXTENDED BIC CRITERION FOR MODEL SELECTION

EXTENDED BIC CRITERION FOR MODEL SELECTION IDIAP RESEARCH REPORT EXTEDED BIC CRITERIO FOR ODEL SELECTIO Itshak Lapdot Andrew orrs IDIAP-RR-0-4 Dalle olle Insttute for Perceptual Artfcal Intellgence P.O.Box 59 artgny Valas Swtzerland phone +4 7

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Gene Regulatory Networks Reconstruction Using the Flooding-Pruning Hill-Climbing Algorithm

Gene Regulatory Networks Reconstruction Using the Flooding-Pruning Hill-Climbing Algorithm Artcle Gene Regulatory Networks Reconstructon Usng the Floodng-Prunng Hll-Clmbng Algorthm Lnln Xng 1, Maozu Guo 1,2,3, *, Xaoyan Lu 1, Chunyu Wang 1 and Le Zhang 2 1 School of Computer Scence and Technology,

More information

K-means and Hierarchical Clustering

K-means and Hierarchical Clustering Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Intelligent Information Acquisition for Improved Clustering

Intelligent Information Acquisition for Improved Clustering Intellgent Informaton Acquston for Improved Clusterng Duy Vu Unversty of Texas at Austn duyvu@cs.utexas.edu Mkhal Blenko Mcrosoft Research mblenko@mcrosoft.com Prem Melvlle IBM T.J. Watson Research Center

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Efficient Distributed File System (EDFS)

Efficient Distributed File System (EDFS) Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate

More information

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT Bran J. Wolf, Joseph L. Hammond, and Harlan B. Russell Dept. of Electrcal and Computer Engneerng, Clemson Unversty,

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Application of Maximum Entropy Markov Models on the Protein Secondary Structure Predictions

Application of Maximum Entropy Markov Models on the Protein Secondary Structure Predictions Applcaton of Maxmum Entropy Markov Models on the Proten Secondary Structure Predctons Yohan Km Department of Chemstry and Bochemstry Unversty of Calforna, San Dego La Jolla, CA 92093 ykm@ucsd.edu Abstract

More information

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation College of Engneerng and Coputer Scence Mechancal Engneerng Departent Mechancal Engneerng 309 Nuercal Analyss of Engneerng Systes Sprng 04 Nuber: 537 Instructor: Larry Caretto Solutons to Prograng Assgnent

More information

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments Comparson of Heurstcs for Schedulng Independent Tasks on Heterogeneous Dstrbuted Envronments Hesam Izakan¹, Ath Abraham², Senor Member, IEEE, Václav Snášel³ ¹ Islamc Azad Unversty, Ramsar Branch, Ramsar,

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Lecture #15 Lecture Notes

Lecture #15 Lecture Notes Lecture #15 Lecture Notes The ocean water column s very much a 3-D spatal entt and we need to represent that structure n an economcal way to deal wth t n calculatons. We wll dscuss one way to do so, emprcal

More information

AP PHYSICS B 2008 SCORING GUIDELINES

AP PHYSICS B 2008 SCORING GUIDELINES AP PHYSICS B 2008 SCORING GUIDELINES General Notes About 2008 AP Physcs Scorng Gudelnes 1. The solutons contan the most common method of solvng the free-response questons and the allocaton of ponts for

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION 24 CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION The present chapter proposes an IPSO approach for multprocessor task schedulng problem wth two classfcatons, namely, statc ndependent tasks and

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Learning Bayesian Networks in Presence of Missing Data Narges Bani Asadi

Learning Bayesian Networks in Presence of Missing Data Narges Bani Asadi Learnng Bayesan Networks n Presence of Mssng Data Narges Ban Asad Motvaton: Sgnal Transducton Networks: The study of Sgnal Transducton Networks s one of the major subjects of nterest n Systems Bology.

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Analysis of Malaysian Wind Direction Data Using ORIANA

Analysis of Malaysian Wind Direction Data Using ORIANA Modern Appled Scence March, 29 Analyss of Malaysan Wnd Drecton Data Usng ORIANA St Fatmah Hassan (Correspondng author) Centre for Foundaton Studes n Scence Unversty of Malaya, 63 Kuala Lumpur, Malaysa

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Lecture 4: Principal components

Lecture 4: Principal components /3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness

More information

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

Bayesian Approach for Fatigue Life Prediction from Field Inspection

Bayesian Approach for Fatigue Life Prediction from Field Inspection Bayesan Approach for Fatgue Lfe Predcton from Feld Inspecton Dawn An, and Jooho Cho School of Aerospace & Mechancal Engneerng, Korea Aerospace Unversty skal@nate.com, jhcho@kau.ac.kr Nam H. Km, and Srram

More information