A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES

Size: px
Start display at page:

Download "A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES"

Transcription

1 A SYSOLIC APPROACH O LOOP PARIIONING AND MAPPING INO FIXED SIZE DISRIBUED MEMORY ARCHIECURES Ioanns Drosts, Nektaros Kozrs, George Papakonstantnou and Panayots sanakas Natonal echncal Unversty of Athens Department of Electrcal and Computer Engneerng Dvson of Computer Scence Zografou Campus, Zografou, Athens, Greece. {jdros, nkozrs}@cslab.ece.ntua.gr SUBJEC hs paper presents a new method for the problem of mappng of nested FOR-loops wth unform dependences, nto mesh-connected parallel archtectures. hs method s based on loop mappng for systolc arrays. he vrtual array of cells s derved from the ndex space, by applyng a lnear transformaton. hs array s dvded (cut) nto a fxed number of clusters, equal to the number of avalable real processors. he basc dea of our method s that the cuttng s performed along properly selected boundary drectons, so as to mnmze nter-cluster communcaton and equlbrate the number of vrtual cells for every cluster. Each cluster s then assgned to a dfferent processor, whch performs n a roughly ndependent manner, as the communcaton requrements are now mnmzed. hs mappng cuts down overall communcaton delays, whle usng a fxed number of processors from a (n-)-dmensonal meshconnected dstrbuted archtecture.. INRODUCION he prmary task n parallelzng a FOR-loop, s fndng a schedule of computatons nto tme, whle preservng the data dependences of the ntal lexcographc loop order. Most methods are based on fndng a lnear tme transformaton, snce lnear tme schedulng dffers only at a constant from the optmal schedule for fat domans, as Darte proved n [3]. Lnear schedulng was ntroduced by Lamport n [8] wth the hyperplane method, whch organzes ndced computatons nto well-defned dstnct groups called hyperplanes. Many methods were proposed n lterature to fnd the optmal hyperplane; some of them are based on the soluton of dophantne equatons [8, 0] and others on the use of nteger programmng [3, 3] or even lnear programmng n subspaces [3]. When ndex spaces wth unform dependence vectors are concerned, a polynomal complexty schedulng algorthm s presented n [7]. Once optmal parallel executon s found, an effcent method of mappng the concurrent groups of computatons (hyperplanes) nto the parallel archtecture should be appled. A systematc methodology for mappng nto fxed sze systolc arrays was presented n [8]. Snce the target archtecture s synchronously operatng, there s no need for communcaton eff- For further studyng on ths method, see [] and [3].

2 cent mappng and the man crteron for optmalty s now the total number of processors. Other methods dealt wth the same problem of mappng, whle reducng not only the sze but also the resultng dmenson of the systolc array (see [4, 6, ]). Researchers are tryng to mnmze the nter-processor communcaton, by dvdng the ndex space nto as much as possble ndependent groups of computatons. Shang and Fortes [4], and Per [], have presented methods for dvdng the ndex space nto ndependent sets of computatons, whch are assgned to dfferent processors, thus zerong the communcaton cost. Unfortunately, the ndependent parttonng of the ndex space s not always feasble. Sheu and a have presented a systematc method of parttonng the ndex space nto groups and assgn them nto dfferent processng elements [5]. her method frst projects the n- dmensonal ndex space onto the zero hyperplane. he resultng projected space s dvded nto groups of computatons, whch preserve the ntal optmal tme schedule. However, ther method does not reduce overall communcatons, snce t gnores the maxmzaton of ntragroup references. A smlar technque was presented by Kng n [5]. hs method produces better results, snce t groups together chans of related computatons, but requres a greedy (exhaustve) search to defne the group, where each computaton belongs. me schedulng s not explctly defned. hey only dscuss computatonal structures on two dmensonal ndex spaces. However, they ntroduced the dea of groupng related computatons together. In addton to ths, ther target archtecture has an unlmted number of processors, whch s not realstc for most of actual large ndex spaces. In ths paper we propose a new method, based on mappng nto an unbounded array of systolc cells. he ntal ndex space J n s lnearly transformed nto another Jï n, whch s used for ntal systolc mappng. ransformaton matrx s dvded nto the hyperplane transforma- WLRQDQGWKHVSDFHWUDQVIRUPDWLRQ6ZKLFKDFWXDOO\PDSVWKHLQLWLDOLQGH[VSDFHJ n to a (n- )-dmensonal projected vrtual space. hs space represents a vrtual array of cells, whch should be further dvded to ft the sze of the avalable hardware. Our method analytcally derves a parttonng, whch mnmzes communcaton requrements and delays. hs (n-)- dmensonal array s parttoned nto blocks of neghborng vrtual nodes, where each block s assgned to a dfferent physcal processor. We establsh the noton of a sngle cut of the projected ndex space, along a boundary drecton. All possble sngle cuts along these drectons are evaluated, by a formula calculatng the approxmate total communcaton requrements between the separate parts of each cut. Once the optmal cut (the one wth the mnmum communcaton requrements) has been selected, we perform the same procedure along the rest n- dmensons. he resultng set of cuts, called a mappng, dvdes the vrtual space nto clusters; all ponts of each cluster are assgned to the same physcal processor. he nter-processor communcaton s reduced, as neghborng vrtual processors are merged together nto a physcal one. In addton, by cuttng the vrtual space nto equal sze tles (except boundary effects), overall computatonal load s balanced. hus, the resultng space mappng s effcent, n terms of processor utlzaton and communcaton delays. Basc termnology and notaton used throughout ths paper s gven n Secton. Sectons 3 presents the propertes and algorthms used for parttonng; the vrtual array of systolc cells and the proposed mappng s elaborated n Secton 4. An example of our method s presented n Secton 5 and the obtaned results are dscussed n Secton 6.. PRELIMINARY CONCEPS AND DEFINIIONS We focus on algorthms, whch have the form of a nested FOR-loop structure, wth unform data dependences []. he algorthmc model s formally descrbed as follows:

3 for 0 = l 0 to u 0 do for n- = l n- to u n- do S S k end for end for where, 0 n-, are the loop varables, l and u are nteger-valued constants that represent the lower and upper lmts for loop varables, and S j, j k, are k assgnment statements. For the nested loop gven above, the ndex vector s defned as the vector of dmenson n: = [ 0,,, n- ]. Furthermore, each statement S j s an assgnment of the form: v 0 = E(v, v,, v p ), where v 0 s an output varable, v j, j p, are nput varables and E s an arbtrary expresson of the nput varables. It should be noted that all varables n the loop statements, may be ndexed by the ndex vector. Let J n be the set of ndces: n Τ J = {[ 0,,..., n ] : Z l u, where 0 n }, where Z s the set of nteger numbers. J n s an n-dmensonal nteger space. Each pont n ths n-dmensonal nteger space, s a dstnct nstantaton of the loop body. he nstance of statement S j s represented by S j (). Notce that the ponts of J n are ordered lexcographcally; the usual symbol `<` s used to denote ths (lnear) orderng. Dependences may exst between varables appearng n nstances of assgnment statements. A (drected) dependence between two nstances S j ( ) and S j ( ), s characterzed by the dependence vector d = -, as n [0]. he dependence matrx D contans as columns all exstng dependence vectors. A dependence vector s denoted by d, m. Such an algorthm wll be defned by ts ndex space J n and ts dependence matrx D and wll be represented by A(J n, D). 3. PARIIONING HE INDEX SPACE In ths secton, we descrbe our method of parttonng and mappng an n-dmensonal ndex space onto a mesh-connected MIMD archtecture. hs method s based on systolc loop mappng. When bounded number of cells or processors s gven, a parttonng methodology should also be appled to ft the vrtual array (resultng from mappng phase) nto the real one. We follow GPLS method (Globally Parallel Locally Sequental) 3. hs means, that snce the number of avalable cells (physcal processors) s less than the number of vrtual nodes, the vrtual array s parttoned nto blocks, whose number equals the sze of the real (physcal) array of processors. All vrtual nodes nsde the same block are sequentally executed -by the same processor. he cut of the vrtual array nto the fxed number of blocks s done along drectons, whch reduces the overall communcaton requrements and dvdes the array nto equal sze parttons. Snce everythng n each block s executed by the same physcal processor there are no communcaton requrements. he only communcaton overhead s mposed by nter-block message passng, whch s unavodable, snce all blocks are executed n parallel (Globally Parallel). Of course, there are now more local memory requrements, but the sze of memory n dstrbuted archtectures s large enough, and accesses to local memory for the same block are fast. Hereunder, the detals of our approach are further analyzed. In the followng lnes we wll use the notaton of [0]. 3 For the nverse approach LPGS see [4].

4 3. ransformaton of the Index Space J n We assume that a lnear transformaton matrx has already been selected. s gven as two VXEPDWULFHVDQG6DVLQ>@ = S. he frst row of the above matrx s the hyperplane vector whch determnes the tme executon orderng for the transformed ndex space -ï n : -ï n =.J n. he submatrx S, s the space transformaton, whch maps onto a (n-)-dmensonal array of systolc cells S.J n, as n [0]. For the needs of our method, we consder that was optmally selected. hs means that: gves the optmal hyperplane wth respect to the least makespan and S produces the optmal space mappng, n terms of the total number of cells and communcaton lnks, as shown n [7]. By applyng the transformaton matrx to the orgnal ndex space J, the latter has been transformed to a new ndex space Jï. By gnorng the frst dmenson of the transformed ndex space Jï, we obtan a projected (n-)-dmensonal ndex space, whch wll be parttoned and eventually mapped to a (n-)-dmensonal mesh of processng cells, as n [0]. 3. Cuts and ermnology In the rest of the paper, the prefx `h-` wll be used extensvely. It s read as hyper and stands for n-dmensonal. he termnology after the prefx s taken from the 3-dmensonal space. hat s: - h-space: the n-dmensonal space - h-plane: a plane n the n-dmensonal space, that s a lnear subspace of dmenson n- - h-lne: a lne n the n-dmensonal space, that s a lnear subspace of dmenson n- - h-mesh: a mesh n the (n-)-dmensonal space, that s an array of cells connected n a mesh topology. Wth all ths n mnd, the orgnal and the transformed ndex spaces J n and -ï n, are h-spaces. he projected transformed ndex space S.J n (the vrtual array of systolc cells) s an h-plane, and ts bound forms an h-polygon. he h-sdes of the h-polygon defne a number of h-lnes, whch we call bndng h-lnes: - bndng h-lne: an h-lne satsfyng the followng two propertes: at least n- transformed ndex ponts le on t (lne defnton); all remanng transformed ndex ponts are located n the same h-sem-plane. Some more termnology follows: - cut: an h-lne that s parallel to a par of bndng h-lnes; a cut separates the projected transformed ndex space nto two h-sem-planes - multple cut: a non-empty set of cuts, whose h-lnes are parallel and equdstant; ths mples the balanced parttonng of the computatonal load - mappng: a set of multple cuts, n whch there are no two dfferent multple cuts wth parallel h-lnes. 4. HE MAPPING MEHOD In the course of our method, one must keep n mnd the followng ssues: 4 matrx has been prevously selected so as, to best meet the needs of our problem, the transformed ndex space s an h-polygon wth edges that are parallel n pars, the real processor space, onto whch our transformed space s to be mapped, s an 4 As well as the assumptons found n [3].

5 h-mesh and that the optmal cut wll be gven n terms of contnuous space measures (not dscrete). 4.. Communcaton Cost Functon for Alternatve Cuts he cost value of each mappng s computed as the sum of the cost values of ts ndvdual cuts. he cost of a cut s defned to be the number of transformed dependence vectors that traverse the cut's h-lne. he formula that computes the cost value of a sngle cut s thus the followng (cost functon ): m p d c = l!, () p d = where: m s the number of dstnct dependence vectors, p s the vector that s perpendcular to the cut, d s a sngle transformed dependence vector, u s the Eucldean norm of vector u, l s the h-length of the segment of the h-lne that corresponds to the cut and s wthn the boundsriwkhwudqviruphgkvsdfhdqg!lvwkhghqvlw\riyhfwruv d along an h-lne perpendcular to d vectors. he fracton, wthn the sum of equaton, s equal to the cosne of the angle between the two vectors p and d and expresses the mplct noton that cuts that are as parallel as possble to the transformed dependence vectors, are traversed by fewer such vectors, and thus gve better (lower) cost values. he coeffcent p d s there to denote the depth of the calculaton for each dependence vector n the drecton of p. heorem Parameter! s equal to d RU! d. Proof: (A smple geometrc proof for the case that n = 3, s gven. A smlar proof can be easly derved, for greater dmenson.) Consder the dependence vector [ a,b] = d shown n Fgure, as the length segment AC. 0 dï b φ / dï φ a Fgure. Computng parameter! for the -dmensonal case (n = 3). Assume also that 0 s the lne perpendcular to vector d 7KHGHQVLW\!RIGHSHQGHQFHYHFWRUV d along the lne 0, equals to the number w of d vectors that traverse the CE segment, dvded by the length / of CE,.e.:! =. w /

6 In order to compute the number w, of the dependence vectors that traverse CE, one should frst notce that these vectors must have ther startng pont wthn the parallelogram ACEF. he area of ACEF s equal to the area of ABCD and thus s a.b. Snce the startng pont of each dependence vector has nteger coordnates and there are no holes n the projected transformed ndex space, we conclude that the number of vectors wth ther startng pont wthn ACEF, s equal to the area of ACEF. hus w = a.b. Also the length of the segment CE s computed by the formula: AD a ab w ab / = b cos(φ ) = b = b = thus, densty! equals to:! = = = d. (3) AC d d ab / d Cost functon s therefore smplfed to (cost functon ): c = l p d m = p. (5) he cost functon that s defned above, gves a heurstc measure, of how good a mappng s. Moreover, when the projected transformed ndex space s an h-parallelogram, applyng the cost functon to all dfferent mappngs, leads to the optmal soluton wth optmal processor utlzaton. 4.. Determnng Possble Cut Drectons Algorthm. Calculatng the bndng h-lnes Step.: Defne matrx V of dmenson nx n, contanng as columns all the permutatons ( n ) of the coordnates of the ndex space boundary ponts: l0 l0 u0 u0 l l u u V =. ln ln un un l n un ln un Step.&DOFXODWHWKHWUDQVIRUPDWLRQRI99ï 7.V. Step.3: Ignore the frst row of V, whch represents the tme coordnate, and construct matrx W of dmenson (n-)x n, contanng the other rows. W represents the coordnates of boundary ponts n the projected transformed ndex space. Step.4: Calculate the convex hull of all ponts contaned n W. he result s an h-polygon. he h-sdes of the convex hull are the bndng h-lnes. Step.5: Snce the h-sdes of the convex hull come n pars of parallel h-lnes 5, they can be represented n pars by nequaltes of the form: β α α α x + x + + n x n + α n γ. Let the number of such pars be b Mappng Algorthm In order to fnd all possble mappngs, we compute all dfferent ways n whch the projected transformed h-space can be allocated to the processor space. We assume that the processor 5 hs can be easly proved, snce the orgnal ndex space s an h-cube and the convex hull s the projecton of ts lnear transformaton.

7 space s denoted by a vector Œ, of n- elements. Each element shows how many processng cells are avalable on each drecton of the h-mesh, where the transformed h-space s to be mapped 6. hs s equal to the number of cuts that should be made along ths drecton plus one. Our mappng algorthm s dvded n two phases: pre-calculaton, where constant coeffcents are calculated once and for all, and calculaton, where the cost of all possble mappngs s calculated. In the followng, wll denote the number of dfferent non-trval values n vector Œ and % j wll denote these values, for j. Algorthm. Pre-calculaton of cost for multple cuts Step.: For all pars of bndng h-lnes: β α x + α x + + α x + α, b.,,, n n, n γ Step..: Let p be the perpendcular vector: p = [ α,, α,,, α, n ]. Step..: Calculate the constant coeffcent for cuts n the drecton of p : m depcost = p d j. p j= Step..3: For all dfferent numbers % j n the vector of processors, j : Step..3.: Calculate the constant coeffcent for multple cuts n the drecton of everyp, cuttng the projected transformed ndex space n % j segments: mccost γ β p, ψ j ψ j =, j depcost cutarea, p,..., p b, k = where cutarea() s a functon that returns the h-length of the cut s h-lne, accordng to Algorthm 4. Algorthm 3. Calculaton of the mnmal mappng Step 3.: For all vald mappngs: m = [ m, m,, mb ] : Step 3..: cost := 0. Step 3..: For all pars j of bndng h-lnes, j b: Step 3...: If m > then cost := cost + mccost,j, j, such that % j = m. Step 3..3: Keep track of the lowest cost. Algorthm 4. Calculaton of the h-length for a cut Parameters: - All vectors p perpendcular to the bndng h-lnes and p = [ p, p,, pn ] the one that s perpendcular to the cut. - he constant coordnate for the cut: p n. herefore, the equaton of the cut s h-lne s: p x p x p x = p. n n Step 4.: Let P the set of ponts that defne the cut's h-lne segment, ntally empty: P =. Step 4.: For all combnatons of (n-) bndng h-lnes not perpendcular to p (for all p p): Step 4..: Add the gven h-lne. Step 4..: Solve the lnear system of dmenson (n-)x(n-) to compute the pont of ntersecton of the n- h-lnes. Step 4..3: If the soluton satsfes all the remanng bndng h-lnes, then add t to P, else dscard t. Step 4.3: Calculate the h-length of the h-lne segment defned by the ponts n P. n 6 If the actual h-mesh has smaller dmenson than n-, some of ts elements wll have to be taken equal to.

8 Snce all ponts n P le on the gven h-lne, they defne an h-lne segment 7. H-lne segments are really fnte subspaces of dmenson n-, whose h-length must be calculated. he calculaton of the h-length of an h-lne segment that s defned by a set of k ponts, where k n-, can be reduced to the same calculaton, but for a set of n- ponts. he problem does not apply for the case of n = 3, but t s easy to see that n the case of n = 4 the h-lne segment s a polygon and we have to trangulate t, n order to calculate ts area. hus, t suffces to defne the h-length of an h-lne segment that s defned by n- ponts, and ths can be done nductvely: Algorthm 5. Inductve defnton of h-length Step 5.: Base case, for n = 3, use Eucldean dstance. Step 5.: Inductve case, for n > 3 do the followng: Step 5.3: Exclude one pont arbtrarly: u. Step 5.4: Use the same algorthm to calculate the h-length lï of the h-lne segment that s defned by the remanng n- ponts, n an h-space of dmenson n-3. Step 5.5: Fnd the projecton u of u on the h-plane defned by the remanng n- ponts Step 5.6: Calculate the Eucldean dstance d between u and u. Step 5.7: he result s the product of l and d. 5. EXAMPLE Consder the followng FOR-loop: for 0 = to 6 do for = to 4 do for = to 3 do α,, ) = α(,, ) + α(,, ) + α(,, ) ( In ths loop, every ndex pont α( 0,, ) s computed relatvely to the values of three other ponts of the ndex space. he dependence matrx s thus, as long as the transformaton assumed: 0 0 D = 0 0, = he transformaton matrx has been selected, so as to best demonstrate the proposed method. At the end of the example, an optmal transformaton wll be presented. he transformed ndex vector and data dependence matrx Dï are: = = = 0 + and + D = D = 0. 0 he frst lne of Dï denotes the tme dependence, of the transformed algorthm and s gnored, whle the remanng matrx has as columns the space dependences n the transformed ndex space. In our example, the transformed dependences are: d = [,], d = [,0], d = [0,. 0 ] In Fgure we can see the projected transformed ndex space, together wth the transformed dependences. In order to calculate the bounds of the projected transformed ndex space, we apply Algorthm. he matrces that are calculated n the frst three steps of the algorthm are, respectvely: 7 In the case that n = 3, we are certan that only two ponts wll be n P, snce all other ponts wll have been dscarded. However, for n > 3, the number of ponts n P can be larger.

9 V = , ï V = W = he convex hull of the ponts, whose coordnates are gven n matrx W, s a polygon whose vertces are the ponts: (, ), (7, ), (0, 5), (0, 7), (5, 7) and (, 4). d ï d 0 ï d ï j ï jï Fgure. he projected transformed ndex space. he nequaltes that descrbe the nteror of ths polygon, n the form of pars of parallel lnes, are the followng: Par : x 0, Par : y 7, Par 3: - z 5. P 3, P,3 P 3, P, P,3 j ï P, P, j ï j ï j ï Fgure 3. he mappng that was suggested for our example. Fgure 4. he real communcaton cost for the suggested mappng. For our example, we select Œ to be equal to [3, 3], whch means that the fnal -dmensonal mesh wll have 3 processors n the frst drecton and 3 processors n the other, gvng a total processor space of 9 cells. Next, we apply Algorthm, n order to pre-calculate the constant coeffcents for the mappng costs. he results of the algorthm are the followng: depcost = ( ) =, depcost = ( ) =, depcost 3 = ( ) =.44, mccost, = ( ) = ,

10 mccost, = ( ) = , mccost 3, =.44 ( ) = 0, where the length of the cuts have been calculated by usng Algorthms 4 and 5. It s not dffcult to see that the best mappng ndcated by Algorthm 3 conssts of two multple cuts, one n the drecton of par and one n the drecton of par 3. hs mappng s shown n Fgure 3. Its total cost s: cost = mccost + mccost , 3, = hs does not sgnfcantly devate from the real cost, as can be seen n Fgure 4. We can see that the real cost of the multple-cut n the drecton of par,.e. the number of dependence vector traversng the vertcal lnes, s equal to 0, whereas the estmated cost was Smlarly, the real cost of the multple cut n the drecton of par 3, s equal to, nstead of the estmated 0. We should note here, that f the projected transformed ndex space s not an h-parallelogram, there s nevtably a loss of processng cells. hs s the case n our -dmensonal example here, where the underused processors can be seen n Fgure 3. If we apply the method proposed n [9], for fndng an optmal transformaton when mappng to systolc cells, the followng results wll be produced, among many others, for our example: = 0, = 0, 3 = 0, 4 = hese matrces result n systolc arrays of 4, 4, and cells respectvely. hey all preserve the optmal hyper-plane [,, ]. he optmal transformaton for the needs of our problem s 4. hs matrx maps nto the least possble number of cells, as shown n [], and s better than 3, because t produces an array requrng only two external communcaton lnks (two external transformed dependence vectors). In the begnnng of the example, we have selected for presentaton, because the resultng structure s bg enough to demonstrate the merts of our method. 6. CONCLUSION In ths paper we have presented a new method for the parttonng and mappng of nested loops onto fxed sze, dstrbuted, mesh-connected archtectures. hs method s based on transformng the ntal n-dmensonal ndex space J n nto an equvalent -ï n, usng a transformaton (matrx), and then dvde the projected vrtual (n-)-dmensonal space, through S, nto blocks whch are assgned to dfferent processors. Interprocessor communcaton s consderably reduced, by choosng the optmal cut along each dmenson. As t was shown, the proposed method s formally presented and easly programmable. Future work ncludes affne by statement parttonng of the ndex space, to further reduce the redundant nterprocessor communcaton lnks. 7. REFERENCES []. Andronkos, N. Kozrs, G. Papakonstantnou, P. sanakas, "Optmal Schedulng for UE/UE-UC Generalzed N-Dmensonal Grd ask Graphs," to appear n Journal of Parallel and Dstrbuted Computng, he devaton s due to the fact that contnuous space propertes are used. Devatons are qute large n our example, because of ts small sze. hey become nsgnfcant when the sze of the ndex space s larger.

11 [] A. Darte and Y. Robert, Constructve Methods for Schedulng Unform Loop Nests, IEEE ransactons on Parallel and Dstrbuted Systems, vol. 5, pp. 84-8, August 994. [3] A. Darte, L. Khachyan, and Y. Robert, Lnear Schedulng s Nearly Optmal, Parallel Processng Letters, vol.., pp. 73-8, 99. [4] A. Darte and Y. Robert, Mappng Unform Loop Nests onto Dstrbuted Memory Archtectures, Parallel Computng, vol. 0, pp , 994. [5] C.-. Kng, W.-H. Chou, and L. N, Ppelned Data-Parallel Algorthms: Part II Desgn, IEEE ransactons on Parallel and Dstrbuted Systems, vol., pp , October 990. [6] N. Kozrs, G. Papakonstantnou and P. sanakas, Mappng Nested Loops onto Dstrbuted Memory Multprocessors n Proceedngs of the 997 IEEE Internatonal Conference on Parallel and Dstrbuted Systems (ICPADS 97), IEEE Press, Seoul, Korea. [7] N. Kozrs, G. Papakonstantnou, P. sanakas, Automatc Mappng and Parttonng nto Systolc Archtectures, Proceedngs of the 5th Panhelenc Conference on Informatcs, pp , Dec. 995, Athens. [8] L. Lamport, he Parallel Executon of DO Loops, Communcatons of the ACM, vol. 7, pp , February 974. [9] P.-Z. Lee and Z. M. Kedem, Mappng Nested Loop Algorthms nto Multdmensonal Systolc Arrays, IEEE ransactons on Parallel and Dstrbuted Systems, vol., pp , January 990. [0] D. I. Moldovan and J. A. B. Fortes, Parttonng and Mappng Algorthms Into Fxed Sze Systolc Arrays, IEEE ransactons on Computers, vol. C-35, pp. -, January 986. [] D. I. Moldovan, Parallel Processng: From Applcatons to Systems, Morgan Kaufmann Publshers, 993. [] J. K. Per and R. Cytron, Mnmum Dstance: A Method for Parttonng Recurrences for Multprocessors, IEEE ransactons on Computers, vol. 38, pp. 03-, August 989. [3] W. Shang and J. A. B. Fortes, me Optmal Lnear Schedules for Algorthms wth Unform Dependences, IEEE ransactons on Computers, vol. 40, pp , June 99. [4] W. Shang and J. A. B. Fortes, Independent Parttonng of Algorthms wth Unform Dependences, IEEE ransactons on Computers, vol. 4, pp , February 99. [5] J.-P. Sheu and.-h. a, Parttonng and Mappng Nested Loops on Multprocessor Systems, IEEE ransactons on Parallel and Dstrbuted Systems, vol., pp , October 99.

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Accounting for the Use of Different Length Scale Factors in x, y and z Directions 1 Accountng for the Use of Dfferent Length Scale Factors n x, y and z Drectons Taha Soch (taha.soch@kcl.ac.uk) Imagng Scences & Bomedcal Engneerng, Kng s College London, The Rayne Insttute, St Thomas Hosptal,

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Polyhedral Compilation Foundations

Polyhedral Compilation Foundations Polyhedral Complaton Foundatons Lous-Noël Pouchet pouchet@cse.oho-state.edu Dept. of Computer Scence and Engneerng, the Oho State Unversty Feb 8, 200 888., Class # Introducton: Polyhedral Complaton Foundatons

More information

UNIT 2 : INEQUALITIES AND CONVEX SETS

UNIT 2 : INEQUALITIES AND CONVEX SETS UNT 2 : NEQUALTES AND CONVEX SETS ' Structure 2. ntroducton Objectves, nequaltes and ther Graphs Convex Sets and ther Geometry Noton of Convex Sets Extreme Ponts of Convex Set Hyper Planes and Half Spaces

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Communication-Minimal Partitioning and Data Alignment for Af"ne Nested Loops

Communication-Minimal Partitioning and Data Alignment for Afne Nested Loops Communcaton-Mnmal Parttonng and Data Algnment for Af"ne Nested Loops HYUK-JAE LEE 1 AND JOSÉ A. B. FORTES 2 1 Department of Computer Scence, Lousana Tech Unversty, Ruston, LA 71272, USA 2 School of Electrcal

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Multiblock method for database generation in finite element programs

Multiblock method for database generation in finite element programs Proc. of the 9th WSEAS Int. Conf. on Mathematcal Methods and Computatonal Technques n Electrcal Engneerng, Arcachon, October 13-15, 2007 53 Multblock method for database generaton n fnte element programs

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

CHAPTER 2 DECOMPOSITION OF GRAPHS

CHAPTER 2 DECOMPOSITION OF GRAPHS CHAPTER DECOMPOSITION OF GRAPHS. INTRODUCTION A graph H s called a Supersubdvson of a graph G f H s obtaned from G by replacng every edge uv of G by a bpartte graph,m (m may vary for each edge by dentfyng

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Loop Transformations, Dependences, and Parallelization

Loop Transformations, Dependences, and Parallelization Loop Transformatons, Dependences, and Parallelzaton Announcements Mdterm s Frday from 3-4:15 n ths room Today Semester long project Data dependence recap Parallelsm and storage tradeoff Scalar expanson

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Chapter 1. Introduction

Chapter 1. Introduction Chapter 1 Introducton 1.1 Parallel Processng There s a contnual demand for greater computatonal speed from a computer system than s currently possble (.e. sequental systems). Areas need great computatonal

More information

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits Repeater Inserton for Two-Termnal Nets n Three-Dmensonal Integrated Crcuts Hu Xu, Vasls F. Pavlds, and Govann De Mchel LSI - EPFL, CH-5, Swtzerland, {hu.xu,vasleos.pavlds,govann.demchel}@epfl.ch Abstract.

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

F Geometric Mean Graphs

F Geometric Mean Graphs Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 2 (December 2015), pp. 937-952 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) F Geometrc Mean Graphs A.

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach Data Representaton n Dgtal Desgn, a Sngle Converson Equaton and a Formal Languages Approach Hassan Farhat Unversty of Nebraska at Omaha Abstract- In the study of data representaton n dgtal desgn and computer

More information

Constructing Minimum Connected Dominating Set: Algorithmic approach

Constructing Minimum Connected Dominating Set: Algorithmic approach Constructng Mnmum Connected Domnatng Set: Algorthmc approach G.N. Puroht and Usha Sharma Centre for Mathematcal Scences, Banasthal Unversty, Rajasthan 304022 usha.sharma94@yahoo.com Abstract: Connected

More information

Graph-based Clustering

Graph-based Clustering Graphbased Clusterng Transform the data nto a graph representaton ertces are the data ponts to be clustered Edges are eghted based on smlarty beteen data ponts Graph parttonng Þ Each connected component

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

Fast Computation of Shortest Path for Visiting Segments in the Plane

Fast Computation of Shortest Path for Visiting Segments in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 4 The Open Cybernetcs & Systemcs Journal, 04, 8, 4-9 Open Access Fast Computaton of Shortest Path for Vstng Segments n the Plane Ljuan Wang,, Bo Jang

More information

Array transposition in CUDA shared memory

Array transposition in CUDA shared memory Array transposton n CUDA shared memory Mke Gles February 19, 2014 Abstract Ths short note s nspred by some code wrtten by Jeremy Appleyard for the transposton of data through shared memory. I had some

More information

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain AMath 483/583 Lecture 21 May 13, 2011 Today: OpenMP and MPI versons of Jacob teraton Gauss-Sedel and SOR teratve methods Next week: More MPI Debuggng and totalvew GPU computng Read: Class notes and references

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Vectorization in the Polyhedral Model

Vectorization in the Polyhedral Model Vectorzaton n the Polyhedral Model Lous-Noël Pouchet pouchet@cse.oho-state.edu Dept. of Computer Scence and Engneerng, the Oho State Unversty October 200 888. Introducton: Overvew Vectorzaton: Detecton

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

K-means and Hierarchical Clustering

K-means and Hierarchical Clustering Note to other teachers and users of these sldes. Andrew would be delghted f you found ths source materal useful n gvng your own lectures. Feel free to use these sldes verbatm, or to modfy them to ft your

More information

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay, Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Optimization Methods: Integer Programming Integer Linear Programming 1. Module 7 Lecture Notes 1. Integer Linear Programming

Optimization Methods: Integer Programming Integer Linear Programming 1. Module 7 Lecture Notes 1. Integer Linear Programming Optzaton Methods: Integer Prograng Integer Lnear Prograng Module Lecture Notes Integer Lnear Prograng Introducton In all the prevous lectures n lnear prograng dscussed so far, the desgn varables consdered

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Radial Basis Functions

Radial Basis Functions Radal Bass Functons Mesh Reconstructon Input: pont cloud Output: water-tght manfold mesh Explct Connectvty estmaton Implct Sgned dstance functon estmaton Image from: Reconstructon and Representaton of

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method Internatonal Journal of Computatonal and Appled Mathematcs. ISSN 89-4966 Volume, Number (07), pp. 33-4 Research Inda Publcatons http://www.rpublcaton.com An Accurate Evaluaton of Integrals n Convex and

More information

A Facet Generation Procedure. for solving 0/1 integer programs

A Facet Generation Procedure. for solving 0/1 integer programs A Facet Generaton Procedure for solvng 0/ nteger programs by Gyana R. Parja IBM Corporaton, Poughkeepse, NY 260 Radu Gaddov Emery Worldwde Arlnes, Vandala, Oho 45377 and Wlbert E. Wlhelm Teas A&M Unversty,

More information

CSE 326: Data Structures Quicksort Comparison Sorting Bound

CSE 326: Data Structures Quicksort Comparison Sorting Bound CSE 326: Data Structures Qucksort Comparson Sortng Bound Steve Setz Wnter 2009 Qucksort Qucksort uses a dvde and conquer strategy, but does not requre the O(N) extra space that MergeSort does. Here s the

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vidyanagar

CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vidyanagar CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vdyanagar Faculty Name: Am D. Trved Class: SYBCA Subject: US03CBCA03 (Advanced Data & Fle Structure) *UNIT 1 (ARRAYS AND TREES) **INTRODUCTION TO ARRAYS If we want

More information

CHAPTER 10: ALGORITHM DESIGN TECHNIQUES

CHAPTER 10: ALGORITHM DESIGN TECHNIQUES CHAPTER 10: ALGORITHM DESIGN TECHNIQUES So far, we have been concerned wth the effcent mplementaton of algorthms. We have seen that when an algorthm s gven, the actual data structures need not be specfed.

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions Sortng Revew Introducton to Algorthms Qucksort CSE 680 Prof. Roger Crawfs Inserton Sort T(n) = Θ(n 2 ) In-place Merge Sort T(n) = Θ(n lg(n)) Not n-place Selecton Sort (from homework) T(n) = Θ(n 2 ) In-place

More information

Brave New World Pseudocode Reference

Brave New World Pseudocode Reference Brave New World Pseudocode Reference Pseudocode s a way to descrbe how to accomplsh tasks usng basc steps lke those a computer mght perform. In ths week s lab, you'll see how a form of pseudocode can be

More information

Solving Route Planning Using Euler Path Transform

Solving Route Planning Using Euler Path Transform Solvng Route Plannng Usng Euler Path ransform Y-Chong Zeng Insttute of Informaton Scence Academa Snca awan ychongzeng@s.snca.edu.tw Abstract hs paper presents a method to solve route plannng problem n

More information

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005 Exercses (Part 4) Introducton to R UCLA/CCPR John Fox, February 2005 1. A challengng problem: Iterated weghted least squares (IWLS) s a standard method of fttng generalzed lnear models to data. As descrbed

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Optimal Workload-based Weighted Wavelet Synopses

Optimal Workload-based Weighted Wavelet Synopses Optmal Workload-based Weghted Wavelet Synopses Yoss Matas School of Computer Scence Tel Avv Unversty Tel Avv 69978, Israel matas@tau.ac.l Danel Urel School of Computer Scence Tel Avv Unversty Tel Avv 69978,

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

On Some Entertaining Applications of the Concept of Set in Computer Science Course

On Some Entertaining Applications of the Concept of Set in Computer Science Course On Some Entertanng Applcatons of the Concept of Set n Computer Scence Course Krasmr Yordzhev *, Hrstna Kostadnova ** * Assocate Professor Krasmr Yordzhev, Ph.D., Faculty of Mathematcs and Natural Scences,

More information

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks Decson Strateges for Ratng Objects n Knowledge-Shared Research etwors ALEXADRA GRACHAROVA *, HAS-JOACHM ER **, HASSA OUR ELD ** OM SUUROE ***, HARR ARAKSE *** * nsttute of Control and System Research,

More information

An Approach in Coloring Semi-Regular Tilings on the Hyperbolic Plane

An Approach in Coloring Semi-Regular Tilings on the Hyperbolic Plane An Approach n Colorng Sem-Regular Tlngs on the Hyperbolc Plane Ma Louse Antonette N De Las Peñas, mlp@mathscmathadmueduph Glenn R Lago, glago@yahoocom Math Department, Ateneo de Manla Unversty, Loyola

More information

Computing the Minimum Directed Distances Between Convex Polyhedra

Computing the Minimum Directed Distances Between Convex Polyhedra JOURNAL OF INFORMAION COMUING SCIENCE DISANCES AND ENGINEERING BEWEEN CONVEX 15, 353-373 OLYHEDRA (1999) 353 Computng the Mnmum Drected Dstances Between Convex olyhedra Department of Electrcal Engneerng

More information

Lecture 4: Principal components

Lecture 4: Principal components /3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness

More information

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Інформаційні технології в освіті ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Some aspects of programmng educaton

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

AADL : about scheduling analysis

AADL : about scheduling analysis AADL : about schedulng analyss Schedulng analyss, what s t? Embedded real-tme crtcal systems have temporal constrants to meet (e.g. deadlne). Many systems are bult wth operatng systems provdng multtaskng

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis Internatonal Mathematcal Forum, Vol. 6,, no. 7, 8 Soltary and Travelng Wave Solutons to a Model of Long Range ffuson Involvng Flux wth Stablty Analyss Manar A. Al-Qudah Math epartment, Rabgh Faculty of

More information

Interpolation of the Irregular Curve Network of Ship Hull Form Using Subdivision Surfaces

Interpolation of the Irregular Curve Network of Ship Hull Form Using Subdivision Surfaces 7 Interpolaton of the Irregular Curve Network of Shp Hull Form Usng Subdvson Surfaces Kyu-Yeul Lee, Doo-Yeoun Cho and Tae-Wan Km Seoul Natonal Unversty, kylee@snu.ac.kr,whendus@snu.ac.kr,taewan}@snu.ac.kr

More information

LEAST SQUARES. RANSAC. HOUGH TRANSFORM.

LEAST SQUARES. RANSAC. HOUGH TRANSFORM. LEAS SQUARES. RANSAC. HOUGH RANSFORM. he sldes are from several sources through James Has (Brown); Srnvasa Narasmhan (CMU); Slvo Savarese (U. of Mchgan); Bll Freeman and Antono orralba (MI), ncludng ther

More information

GSLM Operations Research II Fall 13/14

GSLM Operations Research II Fall 13/14 GSLM 58 Operatons Research II Fall /4 6. Separable Programmng Consder a general NLP mn f(x) s.t. g j (x) b j j =. m. Defnton 6.. The NLP s a separable program f ts objectve functon and all constrants are

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

Routing in Degree-constrained FSO Mesh Networks

Routing in Degree-constrained FSO Mesh Networks Internatonal Journal of Hybrd Informaton Technology Vol., No., Aprl, 009 Routng n Degree-constraned FSO Mesh Networks Zpng Hu, Pramode Verma, and James Sluss Jr. School of Electrcal & Computer Engneerng

More information