Using The Central Limit Theorem for Belief Network Learning

Size: px
Start display at page:

Download "Using The Central Limit Theorem for Belief Network Learning"

Transcription

1 Usig The Cetral Limit Theorem for Belief Network Learig Ia Davidso, Mioo Amiia Computer Sciece Dept, SUNY Albay Albay, NY, USA,. Abstract. Learig the parameters (coditioal ad margial probabilities) from a data set is a commo method of buildig a belief etwork. Cosider the situatio where we have may complete (o missig values), same-sized data sets radomly selected from the populatio. For each data set we lear the etwork parameters usig oly that data set. I such a situatio there will be o ucertaity (posterior distributio) over the parameters i each graph, however, how will the parameters leart differ from data set to data set? I this paper we show the parameter estimates across the data sets coverge to a Gaussia distributio with a mea equal to the populatio (true) parameters. This result is obtaied by a straightforward applicatio of the cetral limit theorem to belief etworks. We empirically verify the cetral tedecy of the leart parameters ad show that the parameters variace ca be accurately estimated by Efro s bootstrap samplig approach. Learig multiple etworks from bootstrap samples allows the calculatio of each parameter s expected value (as per stadard belief etworks) ad also its secod momet, the variace. Havig the expectatio ad variace of each parameter has may potetial applicatios. I this paper we discuss iitial attempts to explore their use to obtai cofidece itervals over belief etwork ifereces ad the use of Chebychev s boud to determie if it is worth gatherig more data for learig. Itroductio Bayesia belief etworks are powerful iferece tools that have foud may practical applicatios []. A belief etwork is a model of a real world situatio with each ode (N N m ) i the etwork/graph (G) represetig a evet. A edge/coectio (e e l ) betwee odes is a directioal lik showig that a evet iflueces or causes the value of aother. As each ode value ca be modeled as a radom variable (X X m ), the graph is effectively a compact represetatio of the joit probability distributio over all combiatios of ode values. Figure shows a belief etwork modelig the situatio that relates smokig, cacer ad other relevat iformatio. Buildig belief etworks is a challegig task that ca be accomplished by kowledge egieerig or automatically producig the model from data, a process commoly kow as learig []. Learig graphical models from a traiig data set is a popular approach where domai expertise is lackig. The traiig data cotais the occurrece (or otherwise) of the evets for may situatios with each situatio termed a traiig example, istace or observatio. For example, the parameters for the Cacer etwork show i Figure could be leart from may patiet records that cotaied the occurrece or ot of the evets. The four commoly occurrig situatios of learig belief etworks [] are differet combiatios of a) learig with complete (or icomplete) data ad b) learig with kow (or ukow) graph structure. Learig with complete data idicates that the traiig data cotais o missig values, while icomplete data idicates for some traiig examples, pieces of iformatio of the world situatio were ukow. With kow graph structure the learig process will attempt to estimate the parameters of the radom variables while with ukow structure the graph structure is also leart. There exists a variety of learig algorithms for each of these situatios [][]. I this paper we shall focus o the situatio where the data is complete ad graph structure is specified. I this simplest situatio there is o posterior distributio over the model space as there is o ucertaity due to latet variables or missig values. This allows us to focus o the ucertaity due to variatios i the data. I future work we hope to geeralize our fidigs for other learig situatios. Without loss of geerality ad for clarity we shall focus o Boolea etworks. Learig the parameters of the graph from the traiig data D of s records ivolves coutig the occurreces of each ode value (for paretless odes) across all records to calculate the margial probabilities or combiatio of values (for odes with parets) ad ormalizig to derive the coditioal probabilities. Now cosider havig may radomly draw data sets (D D t ) all of size s from which we lear model θ θ t. Each model places a probability distributio over the evet space, which is all combiatios of ode values. How will these parameters vary from model to model? We shall aswer this questio by cosiderig a probability distributio over the parameters for each ode. We shall call these radom variables {Q Q m }. Note that Q i may be a block of radom variables as the umber of parameters per ode ca vary depedig o

2 its umber of parets. I a Boolea etwork for a paretless ode, Q i is a sigle radom variable whose value represets ode parameter, P(X i = TRUE). For odes that have parets i a Boolea etwork, Q i cosists of (#Parets) radom variables represetig the parameters P(X i = TRUE Coditio ) P(X i = TRUE Coditio ^(#Parets) ) with the differet coditios beig the various combiatios of TRUE ad FALSE values of the parets. What ca we say of these cotiuous radom variables that represet the parameter estimates leart from differet radom samples? I this paper we show that these cotiuous radom variables form a Gaussia distributio that ceter o the populatio values (the parameters that would be leart from the etire populatio of data) ad whose variace is a fuctio of the sample size. We show that Q i ~ Gaussia(p i *, p i * (- p i * ) /(ks)) where k is a costat, p i * is the relevat parameters of the geeratig mechaism that produced the data ad s is the sample size. The stadard deviatio of this radom variable approaches zero ad the sample mea coverges asymptotically to the populatio value as the value of s icreases. We ca ow make two estimates for each parameter i the etwork: its expected value ad its variace. I most learig situatios the actual probability distributio over the leart parameters due to ucertaity/variability i the observed data set is ukow. Sometimes, geeralities are kow such as that decisio trees are ustable learers or that iterative algorithms that miimize the distortio (vector quatizatio error) are sesitive to iitial parameters. I this paper we make use of the probability distributio over the leart parameters for two applicatios. These should be treated as iitial attempts to make use of the mea ad variace of each parameter ad future work will refie these applicatios. Firstly, we use the variace to determie if gatherig more data will ecessarily chage the mea value of the parameters by at least some user specified value. This has applicatios whe collectig data is time cosumig or expesive. We the illustrate how to create cofidece itervals associated with each iferece made from the etwork. We begi this paper by describig the learig of belief etworks, readers familiar with this topic may skip this sectio. The applicability of the cetral limit theorem to belief etwork learig with complete data is described ext. We the empirically verify that the cetral tedecy of the distributio of the leart parameters by samplig the data to lear without replacemet from a large collectio of data (geerated by a Gibbs sampler). Havig such a large pool of data is a uusual luxury ad we ext illustrate approximatig the variace associated with the leart parameters usig Efro s bootstrap samplig (samplig with replacemet) []. We ext discuss how to use the estimated variace for our two proposed applicatios. Fially we coclude our work ad describe future directios. Learig Belief Networks The geeral situatio of learig a belief etwork ca be cosidered as cosistig of estimatig all the parameters (margial ad coditioal probabilities), θ, for the odes ad discoverig the structure of the directed acyclic graph, G. Let θ i ={q i q im } be the parameters of the m ode graph leart from data set i. Note that the correspodig upper-case otatio idicates the radom variable for these parameters. As the etwork is Boolea, we oly eed the probability of the ode beig oe value give its coditios (if ay). I later sectios we shall refer to graph ad etwork together as the model of the data, H={θ, G}. This paper focuses o learig belief etworks i the presece of complete data ad a give graph structure. From a data set D i = {d i,, d is }, d il = {a i a ilm } of s records, we fid either the margial or coditioal probabilities for each ode i the etwork after applyig a Laplace correctio. The term a ilj refers to the j th ode s value of the l th istace withi the i th data set. For complete data, kow graph structure ad Boolea etwork the calculatio for ode i is simply: If N j has o parets, q = ( + a = T ) /( s + ) ( ) ij l ilj If N i has parets Parets(N i ) = N k ( ) qijt = + ailj = T ad ailk = T / + ailk = T l l qijf = + ailj = T ad ailk = F / + ailk = F l l Note the subscript T or F refers to the ode s paret s value, the parameter q always refers to probability of the ode takig o the value T. For example q ijf refers to the probability leart from the i th data set that the j th

3 A Cetral Limit Theorem for Belief Networks ode will be T give its paret is F. If a ode has more tha oe paret the the expressio for the parameter calculatios ivolves similar calculatios over all combiatios of the paret ode values. Age Geder Visit Smokig Exposure Cacer Smokig TB Cacer Brochitis TB or Cacer S. Calcium Tumor Figure. The Cacer Belief Network. XRay Result Dyspea Figure. The Asia Belief Network The Asymptotic Distributio of Leart Parameters for Belief Networks Cosider as before a umber of radom samples of traiig data (D D t ) each of size s. We divide our graph of odes ito two subsets, those with o acestors (Paretless) ad those that have acestors ad are hece childre (Childre). We iitially cosider biary belief etworks, whose odes ca oly take o two values T (TRUE) ad F (FALSE) ad childre that have oly oe paret, but our results geeralize to multi-paret odes. For each traiig data (D k ) set we lear a model θ k as described earlier. Uder quite geeral coditio (fiite variace) the parameters of the odes that belog to Paretless will be Gaussia distributed due to the cetral limit theorem [4]. We ca formally show this by makig use of a commo result from the cetral limit theorem, the Gaussia approximatio to the biomial distributio [4]. A similar lie of argumet that follows could have bee established to show the cetral tedecy aroud each ad every joit probability value (P(N N m )) but sice the belief etwork is a compact represetatio of the joit distributio, we show the cetral tedecy aroud the parameters of the etwork. Situatio #: N j Paretless Let p j be the proportio of times i the etire populatio (all possible data) that the j th ode (a paretless ode) is TRUE. If our samplig process (creatig the traiig set) is radom the whe we add/sample a record, the chace the j th ode value is TRUE ca be modeled as the result of a Berouli trial (a Berouli trial ca oly have oe of two outcomes). As the radom samplig process geerates idepedet ad idetically data (IID) the the probability that of the s records i the sample that records will have TRUE for the j th ode is give by a Biomial distributio: s s P s p j =! (,, ) ( p j ) ( p j )!( s )! ( ) Applyig the Taylor series expasio to equatio ( ) with the logarithm fuctio for its dampeig (smoothess) properties ad a expasio poit p j.s yields the famous Gaussia distributio expressio: ( s. p j ) /(σ ) ( 4 ) P( s,, p j ) = e, σ = sp j ( p j ) πσ If the sample size is s the the umber of occurreces of X j =T i ay radomly selected data set will be give by a Gaussia whose mea is s.p j ad variace s.p j (- p j ) as show i equatio ( 4 ). Therefore, Q j ~ Gaussia(p j, p j (- p j )/s)) i the limit whe t ad p j > ε, where p j is the proportio of times X j =T i the populatio. Situatio #. N i Childre ad Paret(N i ) = N j N k A similar argumet as above holds except that there are ow as may Berouli experimets as there are combiatios of paret values. I total there will be k Berouli experimets. Therefore, Q i ~ Gaussia(p i,j k, (p i,j k )(-p i,j k )) / (sc k )). Note that sc k is just the proportio of times that the coditio of iterest (the paret value combiatio) occurred.

4 4 A valid questio to ask is how big should p j (or p i,j k ) ad s (or sc k ) be to obtai reliable estimates. The geeral cosesus i empirical statistics is that if t >, s > ad p j (or p i,j k ) >. the estimates made from the cetral limit theorem are reliable [4]. This is equivalet to the situatio of learig from oe hudred samples, where each paretal coditio occurs at least twety times ad o margial or coditioal probability estimate less tha.. While these coditios may seem prohibitive, we foud that i practice for stadard belief etworks tried (Asia, Cacer ad Alarm) that the distributio of parameters was typically Gaussia. Table shows for the Cacer data set that from 49 traiig data sets each of size traiig istaces that the average of the leart parameters passed a hypothesis test at the 9% cofidece level whe compared to the populatio (true) parameters. Similar results were obtaied for the Cacer ad Alarm data sets. Mea Geder Hypothesised. Mea Cacer Hypothesised.9 Differece betwee meas -.6 9% CI -. to -.99 Mea Age 49. Hypothesised. Differece betwee meas -. 9% CI -.7 to. Mea S.C. 49. Hypothesised.7 Differece betwee meas -. 9% CI -.7 to. Mea Ex.to.Tox Hypothesised.49 Differece betwee meas.4 9% CI -. to.6 Mea Lug Tmr Hypothesised.9 Differece betwee meas.449 9% CI.4 to.6 Mea Smokig Hypothesised.4 Differece betwee meas.64 9% CI -. to. Differece betwee meas -. 9% CI -. to -.9 Table. For the Cacer data set, hypothesis testig of the mea of the leart parameter ad the populatio (true value) of the parameter. I all cases the differece betwee the true value ad the mea of the leart parameters lied i the 9% cofidece iterval. Oly a subset of the hypothesis tests are preseted for space reasos. Usig Bootstrappig to Estimate the Variace Associated With Parameters Havig the ability to geerate radom idepedet samples from the populatio is a luxury ot always available. What is typically available is a sigle set of data D. The bootstrap samplig approach [] ca be used to sample from D to approximate the situatio of samplig from the populatio. Bootstrap samplig ivolves drawig r samples (B B r ) of size equal to the origial data set by samplig with replacemet from D. Efro showed that i the limit the variace amogst the bootstrap sample meas will approach the variace amogst the idepedetly draw sample meas. We use a visual represetatio of the differece betwee the probability distributio over the leart parameter values (49 bootstrap samples each of size istaces) ad a Gaussia distributio with a mea ad stadard deviatio calculated from the leart parameter values as show i Figure. As expected the most complicated ode (smokig) had the greatest deviatio from the ormal distributio.

5 4 Normal Quatile - - Normal Quatile Geder Cacer 4 Normal Quatile - - Normal Quatile Ex.to.Tox S.C Normal Quatile Age Normal Quatile Lug Tmr Normal Quatile Sm okig Figure. For the Cacer data set the differece betwee the distributio over the leart parameter values ad a Gaussia distributio with the mea ad the stadard deviatio of the leart parameter values. The straight lie idicates the Gaussia distributio. The x-axis is the parameter value while the distace betwee the lies is the Kullback-Leibler distace betwee the distributios at that poit.

6 6 Learig ad Usig the Variace of Leart Parameters We ow describe a approach to lear the belief etwork parameters mea ad variace. Firstly, the parameters of a belief etwork are leart from the available data usig the traditioal learig approach determied earlier. Secodly, bootstrap samplig is performed to estimate the variace associated with each leart parameter. Estimatig the variace ivolves takig a umber of bootstrap samples of the available data, learig the parameters of the belief etwork from each ad measurig the variace across these leart parameters. Therefore, for each parameter i the belief etwork, we ow have its expected value ad variace. Estimates of the variace of the parameters could have bee obtaied from the expressios reported i situatio# ad situatio# earlier. However, i practice we foud that these were ot as accurate as calculatig variaces from a large umber (more tha ) bootstrap samples. This was because from our limited traiig data the leart parameter was ofte differet from the true value. I this paper we propose two uses of these additioal parameters which we describe ext: determiig if more data is eeded for learig ad comig up with cofidece itervals over ifereces. Determiig the Need for More Data A cetral problem with learig is determiig how much data is eed (how big s should be). The computatioal learig theory literature [6] addresses this problem, ufortuately predomiatly for learig situatios where o oise exists, that is the Bayes error of the best classifier would be. Though there has bee much iterestig work studyig the behavior of iterative algorithms [7][] this work is typically for a fixed data set. The field of active learig attempts to determie what type of data to add to the iitial traiig data set [9] (the third approach i the paper) [] ad []. However, to our kowledge this work does ot address the questio of whe to stop addig data. We ow discuss the use of Chebychev s boud to determie if more data is eeded for learig. Others [] have used similar bouds (Cheroff s boud) to produce lower boud results for other patter recogitio problems. Sice both the expectatio ad variace of the parameters of iterest are kow, we ca make use of the Chebychev iequality. The Chebychev iequality allows the defiitio of the sample size, s, required to obtai a parameter estimate ( pˆ ) withi a error ( ε ) of the true value from a distributio with a mea of p ad stadard deviatio of σ as show below. σ ( ) P [ pˆ p > ε ] < sε This expressio ca be iterpreted as a upper boud o the chace that the error is larger tha ε. I-tur we ca upper boud the right-had side of this expressio by δ which ca be cosidered the maximum chace/risk we are willig to take that our estimate ad true value differ by more tha ε. The rearragig these terms to solve for the sample size yields: s σ ε δ Typical applicatios of the Chebychev iequality use σ =p(-p) ad produce a boud that requires kowig the parameter that is tryig to be estimated! []. However, we ca use our empirically (through bootstrappig) obtaied variace ad circumvet this problem. The questio of how much data is eeded, is ow aswered with respect to how much error ( ε ) i our estimates we are willig to tolerate ad the chace (δ) that this error will be exceeded. We ca use equatio ( 6 ) to determie if more data is eeded if we are to tolerate learig parameters that with a chace δ differ from the true parameters by more tha ε. The costats δ ad ε are set by the user ad effectively defie the users defiitio of how close is close eough. For each ode i the belief etwork we compute the umber of istaces of each type required. We the compare this agaist the umber of actual istaces i the data set of this type. If for ay ode the required umber is less tha the actual umber the more data eeds to be added. By istace type, we mea istaces with a specific combiatio of paret ode values. There as may istace types as there are eeded margial or coditioal probability table etries. For example, for the Cacer etwork (Figure ) the Exposure ode geerates two types, oe each for whe its paret s (Age) value is T or F, the value of the remaiig odes are ot specified. ( 6 )

7 To test this stoppig criterio we eed to empirically determie whe addig ew istaces to the traiig set adds o iformatio. This ivolves measurig the expectatio (over the joit distributio) of the code word legths for each combiatio of ode values. We wish to oly add istaces to the traiig set if the istaces reduce the mea amout of iformatio eeded to specify a combiatio of ode values. As the traiig data size icreases the parameters leart stabilize ad better approximate the chace occurrece i the populatio. Formally, the expected iformatio is: m ( 7 ) Average# bits = - H *( E i ).log( θ ( E i )) i= where E i is a particular combiatio of ode values of which there are m for a m biary ode etwork. H*(.) is the probability distributio of the geeratig mechaism ad θ (.) is the leart distributio. Figure 4 shows the expected iformatio for differet sized traiig sets for the Asia ad Cacer etworks averaged over experimets. I each of these experimets the traiig data is icremetally added by samplig without replacemet from the populatio (, istaces geerated from a Gibbs sampler). Calculatig this expected iformatio requires a sigificat amout of time ad caot be used for determiig whe to stop addig istaces to the traiig set Mea Iformatio Versus Size of Traiig Set Cacer Graph: Compariso of *Optimal* Active ad Radom Selectio Mea Iformatio Level Active Selectio Radom Selectio Iteratio Figure 4. The mea iformatio for the joit distributio for the Asia (top curve) ad Cacer etworks agaist traiig set size (x). At approximately istaces ad istaces the decrease i iformatio due to addig istaces are egligible for the Asia ad Cacer data sets respectively. For oe hudred experimets we used the test described earlier i italics with ε =. ad δ=.. Whether these results geeralize to more complicated etworks remais as future work. Stoppig Poit (From Figure 4) Mea Over All Trials Stadard Deviatio Over All Trials Asia 9 Cacer 67 9 Table. For trials, the mea umber of istaces i the traiig set before the test based o equatio ( 6 ) specifies addig o more data poits. The correct umber of istaces is visually determied from Figure 4. Cofidece Itervals Over Ifereces Typically with belief etworks a iferece is of the form that ivolves calculatig a poit estimate of P(N i =T {E}) where E is some set of ode values that ca be cosidered as evidece. For small etworks exact iferece is possible but sice exact iferece is NP-Hard for larger etworks approximate iferece techiques such as Gibbs samplig is used []. However, if the variace of each parameter is kow we ca ow create a cofidece iterval associated with P(N i =T {E}). We ow describe a simple approach to achieve this. For a belief etwork parameter of value p ad stadard deviatio σ we kow that with 9% cofidece that its true value is i the rage p±.96σ. We ca ow create three versios of the belief etwork, oe where all etwork parameters are set to p (the expected case) aother where the etwork parameters are set to p-.96σ

8 (lower boud) ad aother where they are set to p+.96σ (upper boud). Poit estimatio is performed i each of the three etworks usig whatever iferece techique is applicable ad the three values are the bought together to retur a expected, lower ad upper boud o the iferece to provide a 9% cofidece iterval. Though it may be thought that the chages amogst these etworks will be trivial ad ot retur greatly varyig ifereces, recet work [4] has show that eve small chages i etwork parameters ca result i large chages i the iferece/query probabilities. Future Work ad Coclusio Our work posed the questio: Give may idepedetly draw radom samples from the populatio, if we lear the belief etwork parameter estimates from each, how will the leart parameters differ from sample to sample. We have show that for complete data ad kow graph structure that the leart parameters will asymptotically distributed aroud the geeratig mechaism (true) parameters. This result follows from the cetral limit theorem ad has may potetial applicatios i belief etwork learig. This meas each ode i the belief etwork is described by the parameter s mea ad variace. The variace parameter measures the ucertaity i the parameter estimates due to variability i the data. However, havig may radom samples from which to estimate the variace is a great luxury. By usig bootstrap samplig we ca create a estimate of the variace by samplig from a sigle data set with replacemet. Havig a probability distributio over the leart parameters due to observed data ucertaity has may potetial applicatios. We describe two. We show how to create upper ad lower bouds o the probabilities of ifereces by creatig three etworks that represet the expected, lower boud ad upper boud. Usig the mea ad variace associated with each parameter allows determiig if more data is required so as to guaratee with a certai probability that the differece betwee the expected ad actual parameter meas are withi a predetermie error. We have limited our results i this paper to the simplest belief etwork learig situatio to esure the ucertaity i the parameters is oly due to data variability. We ited to ext exted our results to the situatio where EM ad SEM is used to lear from icomplete data ad ukow graph structure. Refereces [] F. Jese, A Itroductio to Bayesia Networks, Spriger Verlag, 996 [] J. Pearl, Probabilistic Reasoig i Itelliget Systems, Morga Kaufma [] Friedma, N. (99), The Bayesia structural EM algorithm, i G. F. Cooper & S. Moral, eds., Proc. Fourteeth Coferece o Ucertaity i Artificial Itelligece (UAI '9)', [4] Devore, Peck,., Statistics: The Exploratio ad Aalysis of Data, rd Ed, Wadsworth Publishig [] Efro, B Bootstrap methods: aother look at the jackkife. Aals of Statistics 7: -6. [6] Mitchell, T., Machie Learig, McGraw Hill, 997. [7] Bottou, L., ad Begio, Y., Covergece properties of the k-meas algorithm. I G. Tesauro ad D. Touretzky, editors, Adv. i Neural Ifo. Proc. Systems, volume 7, pages --9. MIT Press, Cambridge MA, 99. [] Rusla Salakhutdiov & Sam T. Roweis & Zoubi Ghahramai, Optimizatio with EM ad Expectatio-Cojugate- Gradiet, Iteratioal Coferece o Machie Learig (ICML), pp [9] D. MacKay. Iformatio-based objective fuctios for active data selectio. Neural Computatio, 4(4):9--64, [] Simo Tog, Daphe Koller, Active Learig for Parameter Estimatio i Bayesia Networks. NIPS : [] Coh, D.; Atlas, L.; ad Lader, R. 994, Improvig Geeralizatio with Active Learig. Machie Learig ():-. [] H. Maila, H. Toivoe, ad I. Verkamo. Efficiet algorithms for discoverig associatio rules. I AAAI Wkshp. Kowledge Discovery i Databases, July 994 [] M. Pradha ad P. Dagum. Optimal mote carlo estimatio of belief etwork iferece. I Proceedigs of the th Coferece o Ucertaity i Artificial Itelligece, pages , Portlad, Orego, 996. [4] Hei Cha ad Ada Darwiche. Whe do Numbers Really Matter? I JAIR 7 () 6-7.

UNIT 4 Section 8 Estimating Population Parameters using Confidence Intervals

UNIT 4 Section 8 Estimating Population Parameters using Confidence Intervals UNIT 4 Sectio 8 Estimatig Populatio Parameters usig Cofidece Itervals To make ifereces about a populatio that caot be surveyed etirely, sample statistics ca be take from a SRS of the populatio ad used

More information

1 Graph Sparsfication

1 Graph Sparsfication CME 305: Discrete Mathematics ad Algorithms 1 Graph Sparsficatio I this sectio we discuss the approximatio of a graph G(V, E) by a sparse graph H(V, F ) o the same vertex set. I particular, we cosider

More information

Normal Distributions

Normal Distributions Normal Distributios Stacey Hacock Look at these three differet data sets Each histogram is overlaid with a curve : A B C A) Weights (g) of ewly bor lab rat pups B) Mea aual temperatures ( F ) i A Arbor,

More information

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb Chapter 3 Descriptive Measures Measures of Ceter (Cetral Tedecy) These measures will tell us where is the ceter of our data or where most typical value of a data set lies Mode the value that occurs most

More information

Investigating methods for improving Bagged k-nn classifiers

Investigating methods for improving Bagged k-nn classifiers Ivestigatig methods for improvig Bagged k-nn classifiers Fuad M. Alkoot Telecommuicatio & Navigatio Istitute, P.A.A.E.T. P.O.Box 4575, Alsalmia, 22046 Kuwait Abstract- We experimet with baggig knn classifiers

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Bayesian approach to reliability modelling for a probability of failure on demand parameter

Bayesian approach to reliability modelling for a probability of failure on demand parameter Bayesia approach to reliability modellig for a probability of failure o demad parameter BÖRCSÖK J., SCHAEFER S. Departmet of Computer Architecture ad System Programmig Uiversity Kassel, Wilhelmshöher Allee

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le Fudametals of Media Processig Shi'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dih Le Today's topics Noparametric Methods Parze Widow k-nearest Neighbor Estimatio Clusterig Techiques k-meas Agglomerative Hierarchical

More information

Big-O Analysis. Asymptotics

Big-O Analysis. Asymptotics Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses

More information

Random Graphs and Complex Networks T

Random Graphs and Complex Networks T Radom Graphs ad Complex Networks T-79.7003 Charalampos E. Tsourakakis Aalto Uiversity Lecture 3 7 September 013 Aoucemet Homework 1 is out, due i two weeks from ow. Exercises: Probabilistic iequalities

More information

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini

BAYESIAN WITH FULL CONDITIONAL POSTERIOR DISTRIBUTION APPROACH FOR SOLUTION OF COMPLEX MODELS. Pudji Ismartini Proceedig of Iteratioal Coferece O Research, Implemetatio Ad Educatio Of Mathematics Ad Scieces 014, Yogyakarta State Uiversity, 18-0 May 014 BAYESIAN WIH FULL CONDIIONAL POSERIOR DISRIBUION APPROACH FOR

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

SD vs. SD + One of the most important uses of sample statistics is to estimate the corresponding population parameters.

SD vs. SD + One of the most important uses of sample statistics is to estimate the corresponding population parameters. SD vs. SD + Oe of the most importat uses of sample statistics is to estimate the correspodig populatio parameters. The mea of a represetative sample is a good estimate of the mea of the populatio that

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 DATA MINING II - 1DL460 Sprig 2017 A secod course i data miig http://www.it.uu.se/edu/course/homepage/ifoutv2/vt17/ Kjell Orsbor Uppsala Database Laboratory Departmet of Iformatio Techology, Uppsala Uiversity,

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana

The Closest Line to a Data Set in the Plane. David Gurney Southeastern Louisiana University Hammond, Louisiana The Closest Lie to a Data Set i the Plae David Gurey Southeaster Louisiaa Uiversity Hammod, Louisiaa ABSTRACT This paper looks at three differet measures of distace betwee a lie ad a data set i the plae:

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA Creatig Exact Bezier Represetatios of CST Shapes David D. Marshall Califoria Polytechic State Uiversity, Sa Luis Obispo, CA 93407-035, USA The paper presets a method of expressig CST shapes pioeered by

More information

Designing a learning system

Designing a learning system CS 75 Machie Learig Lecture Desigig a learig system Milos Hauskrecht milos@cs.pitt.edu 539 Seott Square, x-5 people.cs.pitt.edu/~milos/courses/cs75/ Admiistrivia No homework assigmet this week Please try

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today

Administrative UNSUPERVISED LEARNING. Unsupervised learning. Supervised learning 11/25/13. Final project. No office hours today Admiistrative Fial project No office hours today UNSUPERVISED LEARNING David Kauchak CS 451 Fall 2013 Supervised learig Usupervised learig label label 1 label 3 model/ predictor label 4 label 5 Supervised

More information

Lower Bounds for Sorting

Lower Bounds for Sorting Liear Sortig Topics Covered: Lower Bouds for Sortig Coutig Sort Radix Sort Bucket Sort Lower Bouds for Sortig Compariso vs. o-compariso sortig Decisio tree model Worst case lower boud Compariso Sortig

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem A Improved Shuffled Frog-Leapig Algorithm for Kapsack Problem Zhoufag Li, Ya Zhou, ad Peg Cheg School of Iformatio Sciece ad Egieerig Hea Uiversity of Techology ZhegZhou, Chia lzhf1978@126.com Abstract.

More information

CS 683: Advanced Design and Analysis of Algorithms

CS 683: Advanced Design and Analysis of Algorithms CS 683: Advaced Desig ad Aalysis of Algorithms Lecture 6, February 1, 2008 Lecturer: Joh Hopcroft Scribes: Shaomei Wu, Etha Feldma February 7, 2008 1 Threshold for k CNF Satisfiability I the previous lecture,

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov Sortig i Liear Time Data Structures ad Algorithms Adrei Bulatov Algorithms Sortig i Liear Time 7-2 Compariso Sorts The oly test that all the algorithms we have cosidered so far is compariso The oly iformatio

More information

New HSL Distance Based Colour Clustering Algorithm

New HSL Distance Based Colour Clustering Algorithm The 4th Midwest Artificial Itelligece ad Cogitive Scieces Coferece (MAICS 03 pp 85-9 New Albay Idiaa USA April 3-4 03 New HSL Distace Based Colour Clusterig Algorithm Vasile Patrascu Departemet of Iformatics

More information

Mathematical Stat I: solutions of homework 1

Mathematical Stat I: solutions of homework 1 Mathematical Stat I: solutios of homework Name: Studet Id N:. Suppose we tur over cards simultaeously from two well shuffled decks of ordiary playig cards. We say we obtai a exact match o a particular

More information

Analysis of Documents Clustering Using Sampled Agglomerative Technique

Analysis of Documents Clustering Using Sampled Agglomerative Technique Aalysis of Documets Clusterig Usig Sampled Agglomerative Techique Omar H. Karam, Ahmed M. Hamad, ad Sheri M. Moussa Abstract I this paper a clusterig algorithm for documets is proposed that adapts a samplig-based

More information

SAMPLE VERSUS POPULATION. Population - consists of all possible measurements that can be made on a particular item or procedure.

SAMPLE VERSUS POPULATION. Population - consists of all possible measurements that can be made on a particular item or procedure. SAMPLE VERSUS POPULATION Populatio - cosists of all possible measuremets that ca be made o a particular item or procedure. Ofte a populatio has a ifiite umber of data elemets Geerally expese to determie

More information

New Results on Energy of Graphs of Small Order

New Results on Energy of Graphs of Small Order Global Joural of Pure ad Applied Mathematics. ISSN 0973-1768 Volume 13, Number 7 (2017), pp. 2837-2848 Research Idia Publicatios http://www.ripublicatio.com New Results o Eergy of Graphs of Small Order

More information

Lecture 13: Validation

Lecture 13: Validation Lecture 3: Validatio Resampli methods Holdout Cross Validatio Radom Subsampli -Fold Cross-Validatio Leave-oe-out The Bootstrap Bias ad variace estimatio Three-way data partitioi Itroductio to Patter Recoitio

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein 068.670 Subliear Time Algorithms November, 0 Lecture 6 Lecturer: Roitt Rubifeld Scribes: Che Ziv, Eliav Buchik, Ophir Arie, Joatha Gradstei Lesso overview. Usig the oracle reductio framework for approximatig

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

Arithmetic Sequences

Arithmetic Sequences . Arithmetic Sequeces COMMON CORE Learig Stadards HSF-IF.A. HSF-BF.A.1a HSF-BF.A. HSF-LE.A. Essetial Questio How ca you use a arithmetic sequece to describe a patter? A arithmetic sequece is a ordered

More information

Dynamic Programming and Curve Fitting Based Road Boundary Detection

Dynamic Programming and Curve Fitting Based Road Boundary Detection Dyamic Programmig ad Curve Fittig Based Road Boudary Detectio SHYAM PRASAD ADHIKARI, HYONGSUK KIM, Divisio of Electroics ad Iformatio Egieerig Chobuk Natioal Uiversity 664-4 Ga Deokji-Dog Jeoju-City Jeobuk

More information

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8)

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8) CIS 11 Data Structures ad Algorithms with Java Fall 017 Big-Oh Notatio Tuesday, September 5 (Make-up Friday, September 8) Learig Goals Review Big-Oh ad lear big/small omega/theta otatios Practice solvig

More information

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University

CSCI 5090/7090- Machine Learning. Spring Mehdi Allahyari Georgia Southern University CSCI 5090/7090- Machie Learig Sprig 018 Mehdi Allahyari Georgia Souther Uiversity Clusterig (slides borrowed from Tom Mitchell, Maria Floria Balca, Ali Borji, Ke Che) 1 Clusterig, Iformal Goals Goal: Automatically

More information

Our second algorithm. Comp 135 Machine Learning Computer Science Tufts University. Decision Trees. Decision Trees. Decision Trees.

Our second algorithm. Comp 135 Machine Learning Computer Science Tufts University. Decision Trees. Decision Trees. Decision Trees. Comp 135 Machie Learig Computer Sciece Tufts Uiversity Fall 2017 Roi Khardo Some of these slides were adapted from previous slides by Carla Brodley Our secod algorithm Let s look at a simple dataset for

More information

. Written in factored form it is easy to see that the roots are 2, 2, i,

. Written in factored form it is easy to see that the roots are 2, 2, i, CMPS A Itroductio to Programmig Programmig Assigmet 4 I this assigmet you will write a java program that determies the real roots of a polyomial that lie withi a specified rage. Recall that the roots (or

More information

Protected points in ordered trees

Protected points in ordered trees Applied Mathematics Letters 008 56 50 www.elsevier.com/locate/aml Protected poits i ordered trees Gi-Sag Cheo a, Louis W. Shapiro b, a Departmet of Mathematics, Sugkyukwa Uiversity, Suwo 440-746, Republic

More information

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve Advaces i Computer, Sigals ad Systems (2018) 2: 19-25 Clausius Scietific Press, Caada Aalysis of Server Resource Cosumptio of Meteorological Satellite Applicatio System Based o Cotour Curve Xiagag Zhao

More information

15-859E: Advanced Algorithms CMU, Spring 2015 Lecture #2: Randomized MST and MST Verification January 14, 2015

15-859E: Advanced Algorithms CMU, Spring 2015 Lecture #2: Randomized MST and MST Verification January 14, 2015 15-859E: Advaced Algorithms CMU, Sprig 2015 Lecture #2: Radomized MST ad MST Verificatio Jauary 14, 2015 Lecturer: Aupam Gupta Scribe: Yu Zhao 1 Prelimiaries I this lecture we are talkig about two cotets:

More information

BASED ON ITERATIVE ERROR-CORRECTION

BASED ON ITERATIVE ERROR-CORRECTION A COHPARISO OF CRYPTAALYTIC PRICIPLES BASED O ITERATIVE ERROR-CORRECTIO Miodrag J. MihaljeviC ad Jova Dj. GoliC Istitute of Applied Mathematics ad Electroics. Belgrade School of Electrical Egieerig. Uiversity

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

Convergence results for conditional expectations

Convergence results for conditional expectations Beroulli 11(4), 2005, 737 745 Covergece results for coditioal expectatios IRENE CRIMALDI 1 ad LUCA PRATELLI 2 1 Departmet of Mathematics, Uiversity of Bologa, Piazza di Porta Sa Doato 5, 40126 Bologa,

More information

Package popkorn. R topics documented: February 20, Type Package

Package popkorn. R topics documented: February 20, Type Package Type Pacage Pacage popkor February 20, 2015 Title For iterval estimatio of mea of selected populatios Versio 0.3-0 Date 2014-07-04 Author Vi Gopal, Claudio Fuetes Maitaier Vi Gopal Depeds

More information

Evaluation scheme for Tracking in AMI

Evaluation scheme for Tracking in AMI A M I C o m m u i c a t i o A U G M E N T E D M U L T I - P A R T Y I N T E R A C T I O N http://www.amiproject.org/ Evaluatio scheme for Trackig i AMI S. Schreiber a D. Gatica-Perez b AMI WP4 Trackig:

More information

Performance Comparisons of PSO based Clustering

Performance Comparisons of PSO based Clustering Performace Comparisos of PSO based Clusterig Suresh Chadra Satapathy, 2 Guaidhi Pradha, 3 Sabyasachi Pattai, 4 JVR Murthy, 5 PVGD Prasad Reddy Ail Neeruoda Istitute of Techology ad Scieces, Sagivalas,Vishaapatam

More information

Big-O Analysis. Asymptotics

Big-O Analysis. Asymptotics Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses

More information

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Descriptive Statistics

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Descriptive Statistics ENGI 44 Probability ad Statistics Faculty of Egieerig ad Applied Sciece Problem Set Descriptive Statistics. If, i the set of values {,, 3, 4, 5, 6, 7 } a error causes the value 5 to be replaced by 50,

More information

Criterion in selecting the clustering algorithm in Radial Basis Functional Link Nets

Criterion in selecting the clustering algorithm in Radial Basis Functional Link Nets WSEAS TRANSACTIONS o SYSTEMS Ag Sau Loog, Og Hog Choo, Low Heg Chi Criterio i selectig the clusterig algorithm i Radial Basis Fuctioal Lik Nets ANG SAU LOONG 1, ONG HONG CHOON 2 & LOW HENG CHIN 3 Departmet

More information

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS)

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS) CSC165H1, Witer 018 Learig Objectives By the ed of this worksheet, you will: Aalyse the ruig time of fuctios cotaiig ested loops. 1. Nested loop variatios. Each of the followig fuctios takes as iput a

More information

Fast Fourier Transform (FFT) Algorithms

Fast Fourier Transform (FFT) Algorithms Fast Fourier Trasform FFT Algorithms Relatio to the z-trasform elsewhere, ozero, z x z X x [ ] 2 ~ elsewhere,, ~ e j x X x x π j e z z X X π 2 ~ The DFS X represets evely spaced samples of the z- trasform

More information

Learning to Shoot a Goal Lecture 8: Learning Models and Skills

Learning to Shoot a Goal Lecture 8: Learning Models and Skills Learig to Shoot a Goal Lecture 8: Learig Models ad Skills How do we acquire skill at shootig goals? CS 344R/393R: Robotics Bejami Kuipers Learig to Shoot a Goal The robot eeds to shoot the ball i the goal.

More information

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c

Pruning and Summarizing the Discovered Time Series Association Rules from Mechanical Sensor Data Qing YANG1,a,*, Shao-Yu WANG1,b, Ting-Ting ZHANG2,c Advaces i Egieerig Research (AER), volume 131 3rd Aual Iteratioal Coferece o Electroics, Electrical Egieerig ad Iformatio Sciece (EEEIS 2017) Pruig ad Summarizig the Discovered Time Series Associatio Rules

More information

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved.

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved. Chapter 11 Frieds, Overloaded Operators, ad Arrays i Classes Copyright 2014 Pearso Addiso-Wesley. All rights reserved. Overview 11.1 Fried Fuctios 11.2 Overloadig Operators 11.3 Arrays ad Classes 11.4

More information

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network

Probabilistic Fuzzy Time Series Method Based on Artificial Neural Network America Joural of Itelliget Systems 206, 6(2): 42-47 DOI: 0.5923/j.ajis.2060602.02 Probabilistic Fuzzy Time Series Method Based o Artificial Neural Network Erol Egrioglu,*, Ere Bas, Cagdas Haka Aladag

More information

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms

Fuzzy Rule Selection by Data Mining Criteria and Genetic Algorithms Fuzzy Rule Selectio by Data Miig Criteria ad Geetic Algorithms Hisao Ishibuchi Dept. of Idustrial Egieerig Osaka Prefecture Uiversity 1-1 Gakue-cho, Sakai, Osaka 599-8531, JAPAN E-mail: hisaoi@ie.osakafu-u.ac.jp

More information

On (K t e)-saturated Graphs

On (K t e)-saturated Graphs Noame mauscript No. (will be iserted by the editor O (K t e-saturated Graphs Jessica Fuller Roald J. Gould the date of receipt ad acceptace should be iserted later Abstract Give a graph H, we say a graph

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

Image Analysis. Segmentation by Fitting a Model

Image Analysis. Segmentation by Fitting a Model Image Aalysis Segmetatio by Fittig a Model Christophoros Nikou cikou@cs.uoi.gr Images take from: D. Forsyth ad J. Poce. Computer Visio: A Moder Approach, Pretice Hall, 2003. Computer Visio course by Svetlaa

More information

On-line cursive letter recognition using sequences of local minima/maxima. Robert Powalka

On-line cursive letter recognition using sequences of local minima/maxima. Robert Powalka O-lie cursive letter recogitio usig sequeces of local miima/maxima Summary Robert Powalka 19 th August 1993 This report presets the desig ad implemetatio of a o-lie cursive letter recogizer usig sequeces

More information

Chapter 5. Functions for All Subtasks. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 5. Functions for All Subtasks. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 5 Fuctios for All Subtasks Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 5.1 void Fuctios 5.2 Call-By-Referece Parameters 5.3 Usig Procedural Abstractio 5.4 Testig ad Debuggig

More information

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process Vol.133 (Iformatio Techology ad Computer Sciece 016), pp.85-89 http://dx.doi.org/10.1457/astl.016. Euclidea Distace Based Feature Selectio for Fault Detectio Predictio Model i Semicoductor Maufacturig

More information

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System

A Novel Feature Extraction Algorithm for Haar Local Binary Pattern Texture Based on Human Vision System A Novel Feature Extractio Algorithm for Haar Local Biary Patter Texture Based o Huma Visio System Liu Tao 1,* 1 Departmet of Electroic Egieerig Shaaxi Eergy Istitute Xiayag, Shaaxi, Chia Abstract The locality

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

SOFTWARE usually does not work alone. It must have

SOFTWARE usually does not work alone. It must have Proceedigs of the 203 Federated Coferece o Computer Sciece ad Iformatio Systems pp. 343 348 A method for selectig eviromets for software compatibility testig Łukasz Pobereżik AGH Uiversity of Sciece ad

More information

Descriptive Statistics Summary Lists

Descriptive Statistics Summary Lists Chapter 209 Descriptive Statistics Summary Lists Itroductio This procedure is used to summarize cotiuous data. Large volumes of such data may be easily summarized i statistical lists of meas, couts, stadard

More information

Improved Random Graph Isomorphism

Improved Random Graph Isomorphism Improved Radom Graph Isomorphism Tomek Czajka Gopal Paduraga Abstract Caoical labelig of a graph cosists of assigig a uique label to each vertex such that the labels are ivariat uder isomorphism. Such

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

Lecture 18. Optimization in n dimensions

Lecture 18. Optimization in n dimensions Lecture 8 Optimizatio i dimesios Itroductio We ow cosider the problem of miimizig a sigle scalar fuctio of variables, f x, where x=[ x, x,, x ]T. The D case ca be visualized as fidig the lowest poit of

More information

Bayesian Network Structure Learning from Attribute Uncertain Data

Bayesian Network Structure Learning from Attribute Uncertain Data Bayesia Network Structure Learig from Attribute Ucertai Data Wetig Sog 1,2, Jeffrey Xu Yu 3, Hog Cheg 3, Hogya Liu 4, Ju He 1,2,*, ad Xiaoyog Du 1,2 1 Key Labs of Data Egieerig ad Kowledge Egieerig, Miistry

More information

Fire Recognition in Video. Walter Phillips III Mubarak Shah Niels da Vitoria Lobo.

Fire Recognition in Video. Walter Phillips III Mubarak Shah Niels da Vitoria Lobo. Fire Recogitio i Video Walter Phillips III Mubarak Shah Niels da Vitoria Lobo {wrp65547,shah,iels}@cs.ucf.edu Computer Visio Laboratory Departmet of Computer Sciece Uiversity of Cetral Florida Orlado,

More information

Data Analysis. Concepts and Techniques. Chapter 2. Chapter 2: Getting to Know Your Data. Data Objects and Attribute Types

Data Analysis. Concepts and Techniques. Chapter 2. Chapter 2: Getting to Know Your Data. Data Objects and Attribute Types Data Aalysis Cocepts ad Techiques Chapter 2 1 Chapter 2: Gettig to Kow Your Data Data Objects ad Attribute Types Basic Statistical Descriptios of Data Data Visualizatio Measurig Data Similarity ad Dissimilarity

More information

Descriptive Data Mining Modeling in Telecom Systems

Descriptive Data Mining Modeling in Telecom Systems Descriptive Data Miig Modelig i Telecom Systems Ivo Pejaović, Zora Sočir, Damir Medved 2 Faculty of Electrical Egieerig ad Computig, Uiversity of Zagreb Usa 3, HR-0000 Zagreb, Croatia Tel: +385 629 763;

More information

OCR Statistics 1. Working with data. Section 3: Measures of spread

OCR Statistics 1. Working with data. Section 3: Measures of spread Notes ad Eamples OCR Statistics 1 Workig with data Sectio 3: Measures of spread Just as there are several differet measures of cetral tedec (averages), there are a variet of statistical measures of spread.

More information

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP Nature-Ispired Computig Hadlig Costraits Dr. Şima Uyar September 2006 Itroductio may practical problems are costraied ot all combiatios of variable values represet valid solutios feasible solutios ifeasible

More information

Recursive Procedures. How can you model the relationship between consecutive terms of a sequence?

Recursive Procedures. How can you model the relationship between consecutive terms of a sequence? 6. Recursive Procedures I Sectio 6.1, you used fuctio otatio to write a explicit formula to determie the value of ay term i a Sometimes it is easier to calculate oe term i a sequece usig the previous terms.

More information

Designing a learning system

Designing a learning system CS 75 Itro to Machie Learig Lecture Desigig a learig system Milos Hauskrecht milos@pitt.edu 539 Seott Square, -5 people.cs.pitt.edu/~milos/courses/cs75/ Admiistrivia No homework assigmet this week Please

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19 CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

Accuracy Improvement in Camera Calibration

Accuracy Improvement in Camera Calibration Accuracy Improvemet i Camera Calibratio FaJie L Qi Zag ad Reihard Klette CITR, Computer Sciece Departmet The Uiversity of Aucklad Tamaki Campus, Aucklad, New Zealad fli006, qza001@ec.aucklad.ac.z r.klette@aucklad.ac.z

More information

It just came to me that I 8.2 GRAPHS AND CONVERGENCE

It just came to me that I 8.2 GRAPHS AND CONVERGENCE 44 Chapter 8 Discrete Mathematics: Fuctios o the Set of Natural Numbers (a) Take several odd, positive itegers for a ad write out eough terms of the 3N sequece to reach a repeatig loop (b) Show that ot

More information

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence _9.qxd // : AM Page Chapter 9 Sequeces, Series, ad Probability 9. Sequeces ad Series What you should lear Use sequece otatio to write the terms of sequeces. Use factorial otatio. Use summatio otatio to

More information

Lecture 2: Spectra of Graphs

Lecture 2: Spectra of Graphs Spectral Graph Theory ad Applicatios WS 20/202 Lecture 2: Spectra of Graphs Lecturer: Thomas Sauerwald & He Su Our goal is to use the properties of the adjacecy/laplacia matrix of graphs to first uderstad

More information

15 UNSUPERVISED LEARNING

15 UNSUPERVISED LEARNING 15 UNSUPERVISED LEARNING [My father] advised me to sit every few moths i my readig chair for a etire eveig, close my eyes ad try to thik of ew problems to solve. I took his advice very seriously ad have

More information

Characterizing graphs of maximum principal ratio

Characterizing graphs of maximum principal ratio Characterizig graphs of maximum pricipal ratio Michael Tait ad Josh Tobi November 9, 05 Abstract The pricipal ratio of a coected graph, deoted γg, is the ratio of the maximum ad miimum etries of its first

More information

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS Prosejit Bose Evagelos Kraakis Pat Mori Yihui Tag School of Computer Sciece, Carleto Uiversity {jit,kraakis,mori,y

More information

Chapter 4. Procedural Abstraction and Functions That Return a Value. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 4. Procedural Abstraction and Functions That Return a Value. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 4 Procedural Abstractio ad Fuctios That Retur a Value Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 4.1 Top-Dow Desig 4.2 Predefied Fuctios 4.3 Programmer-Defied Fuctios 4.4

More information

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5 Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:

More information

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000.

Basic allocator mechanisms The course that gives CMU its Zip! Memory Management II: Dynamic Storage Allocation Mar 6, 2000. 5-23 The course that gives CM its Zip Memory Maagemet II: Dyamic Storage Allocatio Mar 6, 2000 Topics Segregated lists Buddy system Garbage collectio Mark ad Sweep Copyig eferece coutig Basic allocator

More information

Our Learning Problem, Again

Our Learning Problem, Again Noparametric Desity Estimatio Matthew Stoe CS 520, Sprig 2000 Lecture 6 Our Learig Problem, Agai Use traiig data to estimate ukow probabilities ad probability desity fuctios So far, we have depeded o describig

More information

ANN WHICH COVERS MLP AND RBF

ANN WHICH COVERS MLP AND RBF ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi

More information