Ant Colony Optimal Algorithm: Fast Ants on the Optical Pipelined R-Mesh

Size: px
Start display at page:

Download "Ant Colony Optimal Algorithm: Fast Ants on the Optical Pipelined R-Mesh"

Transcription

1 At Coloy Optimal Algorithm: Fast Ats o the Optical Pipelied R-Mesh Ke D. Nguye ad Au G. Bourgeois Departmet of Computer Sciece Georgia State Uiversity P.O. Box 3994 Atlata, Georgia {kguye, abourgeois}@cs.gsu.edu Abstract I this paper, we demostrate how to implemet ad improve two At Coloy Optimiatio (ACO) algorithms o the Optical Pipelied Recofigurable Mesh (PR-Mesh): the geeric ACO ad the Fast At Coloy Optimiatio (FACO) algorithm. The ru-time complexity of our improved geeric ACO algorithm, with x geeratios each geeratio havig m ats, o a x PR-Mesh is O((x m + )log), which outperforms the curretly best kow electrical model implemeted i [3] with ru-time complexity of O(x (m + )log). Our FACO algorithm o PR-Mesh yields O((( log 2 )+ log ) loglog) ru-time complexity for 2 jobs while the existig FACO algorithm o the electrical model yields a ru-time complexity of O(( + )log ) but ca oly hadle log 2 jobs, where is the total umber of ats from all geeratios. I additio, we propose a theoretical FACO algorithm o a three dimesioal PR-Mesh solvig 2 jobs i O(x (m + ) log) time. I. Itroductio The At Coloy Optimiatio (ACO) algorithm is a greedy approach. It has bee used to solve may combiatorial NP problems such as the Travelig Salesma Problems (TSP) [1], the Flow-Shop problems [2], the Sigle Machie Total Tardiess problem (SMTTP) [3], the Resource Costraied Project Schedulig problem [4], the Quadratic Assigmet problem (QAP) [5] [6], graph colorig [7], ad routig i commuicatio etworks [8] [9], etc. The ACO algorithm was itroduced by Marco Dorigo [10] i his dissertatio i 1992 ad is described by the behavior of ats fidig food. I geeral, ats from a coloy will dispatch searchig for food i ay radom path. Each at will leave some pheromoe o its path to, ad from, the food source. The amout of pheromoe ca be differet depedig o the goodess of the food foud. A portio of pheromoe o the path will be evaporated after a period of time. All followig ats leavig the coloy will follow the path(s) that has the strogest pheromoe. A geeratio of ats is defied by the order i which they are leavig the coloy. All ats i a geeratio are workig idepedetly. Sice the ats keep followig the strogest pheromoe path(s), oly the best foud path(s) will remai. Because of the ature of the ACO algorithm, it is best to implemet the algorithm o a parallel computer model, where each at ca be simulated by a processig elemet (PE) ad the pheromoe values o the path(s) are represeted by a two dimesioal matrix. There are may approaches to implemet the ACO. For example, Delisle P. et al. [17] implemeted the ACO o OpeMP machies with a ru-time complexity of O( 2 ), where is the umber of jobs, Merkle ad Middedorf [3] implemeted the ACO algorithm o the electrical Recofigurable Mesh (R- Mesh) with a ru-time complexity of O(x (m+) log ), where x is the umber of geeratios ad each geeratio has m ats. Merkle ad Middedorf s implemetatio is curretly the fastest amog all implemetatios due to the characteristics of the recofigurable mesh. However, all ACO algorithm implemetatios o electrical models are costraied by the commuicatio delays betwee processig elemets. I this paper, we preset several techiques to implemet the ACO algorithm ad its fast versio FACO o the Optical Pipelied Recofigurable Mesh (PR-Mesh) model efficietly ad effectively. With our ew approach o the PR-Mesh model, the ru time complexity of FACO algorithm is improved sigificatly. The FACO algorithm

2 ca be scaled-up, scaled-dow or extedig to a three dimesioed PR-Mesh depedig o the umber of jobs ad available resources. The R-Mesh model Merkle ad Middedorf used ad the PR-Mesh model utilied i this work belog to the same architecture class called Recofigurable Architectures. Traditioally, a recofigurable architecture is oe that ca dyamically alter the structure coectig its compoets at every step of a computatio leadig to a chage i its fuctioalities ad capabilities. This paper is orgaied as follows: Sectio II describes the Recofigurable Mesh models ad their operatios, Sectio III describes the existig ACO ad FACO algorithms ad their implemetatios o a R-Mesh, Sectios IV ad V describe our implemetatios of the ACO ad FACO algorithms o the PR-Mesh model, Sectios VI ad VII describe ew techiques to improve the performace ad exted the capability of the FACO algorithm, ad the last sectio summaries our work. II. Recofigurable Mesh Models The commo characteristic of all recofigurable models is the capability to recofigure themselves at every step of computatio. All compoets coected to a bus ca receive data i costat time. The feature that leads to differet recofigurable models is the types of the buses coectig the compoets i each model. The model usig electrical buses ca cofigure cycles while the oe usig optical buses caot. O the other had, the model usig optical buses ca trasmit multiple messages to multiple destiatios o a sigle bus simultaeously, while the other caot. The followig sectios describe more about these two models. A. Optical Pipelied Recofigurable Mesh A optical PR-Mesh [12] is a two dimesioal grid of processors, i which each processor has four ports coected by optical buses. Eight buses coect the four ports of a processor to its eighbors as i Figure 1. Each processor cotrols a set of local switches that allows the processor to segmet or fuse the buses at each itercoectio. The local fusig cofigures a liear bus coectig eighborig processors. Each such liear bus exactly resembles a Liear Array with a Recofigurable Bus Systems (LARPBS) [14]. A LARPBS is a array of N processors p 0,p 1,...,p 1 coected by a optical bus. The optical bus is costructed from three idetical waveguides: the message, select ad referece waveguides. The waveguides are divided ito two segmets: receivig segmet ad trasmittig segmet. The message Directioal couplers West T R F C C T F R Left T Trasmittig segmet R Receivig segmet North Top Bottom South Right R C Coditioal delay loop F Fixed delay loop F T C C F R T East Fusig coectio Fig. 1. PR-Mesh processor coectios waveguide is used for sedig data, the select ad referece waveguides are used for sedig addressig iformatio. Each processor p i is coected to the optical bus with two directioal couplers. Oe coupler is used for writig data o the trasmittig segmet (upper segmet) of the bus ad the other oe is used for readig the data from the receivig segmet (lower segmet) of the bus. Thus, the LARPBS uses six couplers for each processor p i to coect to the three waveguides. Two cosecutive processors are coected by a optical fiber segmet of oe uit pulse-legth,, as i Figure 2,(for simplicity, the figure omits the data waveguide, which resembles the referece waveguide). The optical sigals are propagated uidirectioal from left to right o the trasmittig segmet ad from right to left o the receivig segmet. The LARPBS has a set of coditioal-delay switches o the trasmittig segmet of the select bus ad a set of fixed-delay switches o the receivig segmet of the referece bus. The LARPBS uses the coicidet pulse techique [15] to route messages by maipulatig the select ad referece sigals, through the switches, o separate bus so that they will coicide at the desired destiatio processor(s). Each processor has a select frame ad a referece frame of N-bit each, which ca be used to sed a message by ijectig, up to N slots, its pulse sigals ito the frames. Whe processor p i detect a coicidece of the select ad referece pulses o the receivig segmet, it reads the data frame. Ay subsequece coicideces at processor p i i the same bus cycle will be igored. I additio, the LARPBS cotais a set of segmet switches o the trasmittig ad receivig segmet of the bus. Whe all the segmet switches are set to straight, the bus system operates exactly like a regular pipelied bus system. Settig the switches at p i to cross, the bus is segmeted ito two separate bus systems, oe cosists of p 0,p 1...,p i ad the other cosists of p i+1,p i+2,...,p 1.

3 Trasmittig segmet P0 P1 P2 P3 P4 Receivig segmet Fig. 2. A five-processor LARPBS model {NS,E,W} {EW, N, S} {NW, S, E} {NE, S, W} {SW, N, E} {SE, N, W} {NW, SE} {NE, SW} {NS, EW} {N,S, E,W} it goes through the R-Mesh. The iformatio about which items have ot bee selected is kept i a selectable set S i the at s memory. The ext item is always chose from S accordig to the Pseudo-Radom-Proportioal Actio Choice Rule [3], usig probability q 0, where 0 q 0 1 is a parameter for the algorithm. A at chooses item j that has ot bee selected so far which maximies τ α ij η β ij (1) where α d β are costats that determie the relative ifluece of the pheromoe value o the at s decisio i selectig the ext item. The probability of item to be selected is 1 q 0. This probability is computed as q ij {NSE, W} {NSW, E} {NEW, S} {SEW, N} {NSEW} Fig. 3. local possible coectios of ports withi a PE(right), R-Mesh (left) B. Electrical Recofigurable Mesh Descriptio A electrical Recofigurable Mesh (R-Mesh) [11] is a two dimesioal grid of PEs coected by electrical buses. Each processor ca locally cofigure ay or all of its ports to the coectig buses, as i Figure 3, withi oe time step. Sigals propagatig o a bus are odirectioal. Thus, the R-Mesh ca form cycles. Due to the ature of the bus, to avoid collisio, the COMMON write rule is used for cocurret write operatios. III. ACO ad FACO o R-Mesh Geerally, permutatio problems require oe to fid the optimal permutatio from a set of give items. These items represet the distaces betwee cities as i the Travellig Salesma problems (TSP) [1] or the jobs as i Sigle Machie Total (Weighted) Tardiess problems (SMTWTP) [3]. A geeric at coloy optimiatio algorithm uses a x pheromoe matrix to fid the permutatios of the items. I this sectio we describe Middedorf s implemetatio of a geeric at algorithm (ACO) ad its fast versio (FACO) o the R-Mesh model. A. ACO algorithm o R-Mesh I [3], Middedorf embedded the x pheromoe matrix M ito a x R-Mesh. Each processor p ij cotais a pheromoe value τ ij ad a heuristic value η ij, (i, j [1,]). The ats are pipelied through the R-Mesh from the first row. Each at selects a uselected item from row i as τij α q ij = ηβ ij h S τ ij α ηβ ij (Most ofte, heuristic values η ij are ot used). A at stops its jourey whe it reaches row, at which poit its solutio is foud. O a system of m ats, the last at reaches row i after +(m 1) steps. Whe all ats have foud their solutios, their solutios will be compared to existig solutios i order to determie which ats ca update the solutios. The selected ats update their solutios ad the pheromoe o their paths. This completes a at geeratio. The algorithm will ru for x geeratios, x 0. I geeral, a at selects a item i row i as follows: whe the at is i row i 1, p i 1,j kows whether item j has bee selected i oe of the rows 1...(i 1) or ot, p i 1,j seds this iformatio to p i,j as the at moves to row i. This step ca be doe i time O(1) by cofigurig p i 1,j {NS, E, W}. Next, the at selects a item i row i by computig the prefix-sums of τ ij η ij, where τ ij η ij = τ ij1 η ij1 + τ ij2 η ij2 + + τ ijk η ijk,k [1, (i 1)], which is the prefix-sums of τ ij η ij of all elemets i S, S = {j 1,j 2,...,j (i 1) },j 1 <j 2 < <j (i 1), across row i. The first PE i row i the radomly chooses a value, [0, j S τ i,j α ηβ i,j ), ad broadcasts to all PEs i the row. p i,j is selected whe [ l<j,l S τ ij α ηβ il, l j,l S τ ij α ηβ il, ). The prefixsums ca be doe i time O(log) usig biary tree reductio method [11](page 5), ad the selected PE ca be determied by comparig the prefix-sums of p i 1,j ad p i,j i time O(1). Whe all the ats i a geeratio have foud their solutios, which at is allowed to update the pheromoe is determied. The selected ats store their solutios ad the pheromoe value is evaporated from every PE before the ext geeratio of ats starts. The amout of pheromoe evaporated is defied follow: (2) τ ij =(1 ρ) τ ij (3)

4 where ρ is ay predefied factioal parameter for pheromoe evaporatig fuctio. To force ats from followig geeratios to follow the ear best foud solutios, a elitist at is desigated to update the pheromoe accordig to the best foud solutio i each geeratio. Sice each step of the algorithm ca be carried out i costat time, except the prefix-sums computatio, the algorithm ru time is O(x (m + ) log), where x is the umber of geeratios. B. FACO algorithm o the R-Mesh The fast versio of ACO algorithm is the result of abadoig the geeratioal pheromoe update ad simplifyig the probability distributio. First, istead of updatig the pheromoe at the ed of each geeratio, a at is allowed to update the pheromoe whe some criteria is met [18]. I particular, a at waits util the followig (m 1)/2 ats have foud their solutios. The at is allowed to update the pheromoe oly if its solutio is better tha the m 1,m m, best solutios that have bee foud by the precedig (m 1)/2 ats ad the followig (m 1)/2 ats. If multiple ats with equal solutios, oly the (m/m )th at is allowed to update. Evaporatio is doe whe a at updates the pheromoe. Therefore, the ruig time of the updated ACO algorithm is O((x m + ) log) istead of O(x (m + ) log). Secodly, each item per row is assiged two bit-values h ad l, where h>l>0, for low ad high probabilities of beig selected ext. These values are determied by a threshold fuctio t. For each item j S, defie { g(τij α η β h if τ ij )= α ij η β ij >t (4) l otherwise I p ij, for each j S, a bit l ij is set to 1 if g(τij α ηβ ij )=l, otherwise, a bit h ij is set to 1. Next, the umber of PEs with bit l set to oe, l, ad the umber of PEs with bit h set to oe, h, i each row are determied. A radom umber, [0, l l + h h), is selected as i the geeric ACO algorithm. If [(p 1) l, p l), the p th PE i the row with a l bit set to 1 is selected. If [ l l +(p 1) h, l l + p.h) the the p th PE i the row with a h bit set to 1 is selected. The t, h, ad l values could be chaged durig the ru of the algorithm. Similarly, the Pseudo-Radom-Proportioal Actio Choice Rule is modified as follows: let t τ > 0 be a pheromoe threshold ad h τ > l τ > 0. For each item j S defie { hτ if τij g(τ ij )= α >t τ (5) l τ otherwise a at will select the item that maximies the value of g(τ ij ) η β ij. Fig. 4. Embeddig a row of the pheromoe matrix i a submesh of the R-mesh C. Implemetatio of FACO algorithm o the R-Mesh ad its Ru Time Middedorf proposed [3] to use a log 2 x (/log 2 ) R-mesh. The a row of pheromoe matrix forms a log 2 x(/log 2 ) submesh as i Figure 4. The sums l ad h ca be computed i O(log ) as stated i [16] ad [3]. After selectig a radom umber, the p th PE with bit l set to oe is selected. By [3] ad [16], the sum of all l bits i the first i log 2 colums, i [1, (/log 4 )], of all sub-meshes ca be foud i time O(log ). The p th PE with l-bit is set is determied usig the same techique. Therefore, the total time to select a item is O(log ). At every step, there will be at most + m/2 3 2 solutios eed to be stored. I order for a at to update its solutio, it must kow the rak of its solutio. Therefore, the stored solutios must be raked by their goodess. The rak of the ew solutio ca be computed i O(log ) by comparig the solutio to the stored solutios i parallel, ad a bit is set whe it is worse. Additio of the resultig bits gives the rak of the ew solutio. The pheromoe ca be updated i O(1) as the at updatig the solutio usig the Formula 3. Therefore, this FACO algorithm ru-time complexity is O(( + ) log ), where is the umber of ats o a log 2 x /log 2 R-mesh. This ru time ca oly be achieved if the umber of differet weights is at most log 2 [3]. I the ext sectio, we describe techiques to implemet the ACO o PR-Mesh model. IV. ACO algorithm o PR-Mesh (ACO-PR) As stated i Sectio II the optical model caot cofigure ay cycles. To implemet the ACO algorithm o a PR-Mesh, we have to esure that oe of the steps i the algorithm cofigures a cycle i the PR-Mesh. Whe a at moves from row i 1 to row i, p i 1,j passes this iformatio to p i,j usig oly colum j ad forms o cycles. p i,j broadcasts its ew iformatio to all PEs i row i, which does ot form ay cycle either. Next, the prefixsums of row i is computed usig biary tree reductio o

5 row i, which also uses o cycle. The first PE i row i chooses the value ad broadcasts it to all PEs i row i, ad p i,j determies whether it is selected or ot usig oly row i for commuicatio. The last step, whe all ats have foud their solutios, the ew foud solutios must be routed to the PEs that holdig the previously foud solutios to determie whether ay at has foud a better solutio. Let us assume the m best solutios foud so far are positioed i the last row of the PR-Mesh. If we keep each at i a colum, i.e. at k, 1 k m moves from row1torow usig oly colum k, the at k will store its solutio to p,k whe it reaches row. Obviously, the ACO algorithm ca be implemeted o a x PR-Mesh with ru time O(x (m + ) log). I the ext sectio, we show how to determie the goodess of a solutio ad update the pheromoe i costat time. Usig that techique, each at ca update its solutio ad the pheromoe immediately if its solutio is better tha previously foud solutios. The ru time will be reduced to O((x m + ) log). V. FACO algorithm o PR-Mesh (FACO-PR) The FACO algorithm ca be simulated i the PR-Mesh as follows: istead of fidig the sums of of l bits l ad h bits h l of each row, we compute the prefix-sums of these bits. The prefix-sums ca be computed i time O(1) [11](page 374). After computig the prefix-sums, the p th PE with l bit or h bit set to 1 is the oe has the prefixsums equals to p. This iformatio is obviously ca be determied i time O(1). Updatig the solutios i PR-Mesh is as follows: first, we use the last two rows of the PR-Mesh to store 3 2 possible solutios. These two rows are coected i a sake-like array, where the last PE of the mesh, p,,is the head of the bus ad the last PE of row (-1), p ( 1),, is the tail of the bus, actually, we oly use the lower half, p 1,0,...,p 1, /2, of row (-1). We call this bus the solutio bus. Wheever a at fids a solutio, this solutio is broadcasted to all the PEs i the solutio bus. Each PE i the first 3 2 of the bus, startig from the bus head, holdig a solutio will set a bit to 1 if its solutio is better tha or equal to the ew solutio, otherwise set its bit to 0. The sum of these bits is the rak of the ew solutio. Each PE that set a bit to 0 (holdig worse solutio) i the previous step icreases its rak by 1. The all the solutios are routed to their locatios based o the rak. The best solutio will have the lowest rak. The sum of these bit ca be added i O(1) time [11](page 374). The routig ca be doe i O(1) time [11](page 369). Evaporatig the pheromoe is doe similar to the origial algorithm. The updatig at seds a sigal to all PEs i its path to evaporate the pheromoe. Sice the goodess of each solutio ca be determied i costat time, we ca allow the solutio ad the pheromoe to be updated as soo as the solutio emerges to make it available for m 1 ats i the pipelie. I additio, we oly eed store at most m solutios, which ca be stored i the last row of the PR-Mesh. Sice each step of the FACO algorithm o PR-Mesh rus i O(1) time, the algorithm yields ( 1+) steps or O( + ), for is the total umber of ats leavig the coloy, which is better tha the oe ruig o the R-Mesh by log factor. O( + ) is the lower boud for ACO algorithm o the pipelied recofigurable mesh, ad this is ot hard to verify. By the ature of the ACO algorithm, the decisio of choosig the ext move of a at depeds o the locatios it has visited. It ca ot visit a locatio twice or it ca visit locatio i before visitig locatio i 1. Therefore, the at takes exactly steps to fid its solutio. I additio, a states pipelie with jobs take at least +( 1) time slots to fiish. Thus, the ru time of the FACO algorithm o the PR-Mesh is optimal. VI. FACO Algorithm o sub-pr-meshes (FACO-PR-Sub) Sice the ACO algorithm practices the greedy approach, where each at chooses the best possible path adjacet to its curret locatio, the over all solutio may ot be optimal. I this sectio we preset a divide ad coquer approach, i which the solutio will be the optimal of subproblems ad each sub-problem is solved by applyig the FACO-PR algorithm. A. Selectio o PR sub-mesh I the ACO algorithm, a at chooses the ext item by selectig a item i the ext row that maximies the pheromoe ad heuristic values τij α ηβ ij (similarly for the ats i the FACO algorithm). If all the items are equally likely to be selected, the probability of this selectio behavior ca be represeted as: P = 1 1 j=0 j = 1!.To speedup the algorithm, we propose the followig: First, we divide the x PR-Mesh ito blocks (sub-mesh) of sie log x log, givig log2 such blocks. The solutio for each block is carried out the same as described i the FACO-PR algorithm. Next, the solutio for all the blocks is determied usig oe of the followig two methods. (i) sum the sub-solutios of all blocks i the same blockrow, the fid the maximum of all the block-rows (Max Row method). (ii) fid the maximum sub-solutios of each block-colum, the add all the maximum values to obtai a solutio (Max Colum method). The two methods share almost the same behavior. The selectio probability for all

6 Selectio Methods b 1,1 b 2,1 b 1,2 b 1, b 2, Solutio Matrix sie ACO Max Row Max Colum Fig. 5. Simulatio of the origial ACO selectio, Max Row selectio ad Max Colum selectio Methods - The Max Colum selectio method resembles the origial ACO selectio method the disjoit blocks is P b = log 1 j=0 1 log j =(log )!. I additio, the probability of pickig the best sub-solutio 1 block out of a block-colum, or block-row, is log, ad P c = ( 1 log 1 log... 1 log ) = ( 1 log ) 1 log = 1 (log) log for log block-colums. Therefore, the ew selectio log! probability is P = P b P c = (log) log!. Sice the solutio of the problem closely depedig o the distributio of the items, it is ot clear which selectio method, the origial oe or the proposig oes, yields better solutio. Based o our simulatio (see Figure 5) of the three selectio methods, where the pheromoe values are radomly geerated ad distributed, the Max Colum method resembles the origial selectio method. Next, we will describe the Max Colum method i details, (the Max Row Method is similar). B. Applyig Max-Colum Selectio Method o PR-Mesh b,1 b,2 b, Fig. 6. A x PR-Mesh partitioed ito logxlog blocks block-colums is determied. I order to fid the best subsolutio of a colum, all sub-solutios s 1,s 2,...,s log of block-colums are routed to the last row of block b log,j of each block-colum j, where p,i is the destiatio of s i. Next, i each block b log,j the solutios are broadcasted to all other PEs through the colums of the block. p i,i of the block broadcasts its solutio alog row i, aype holdig a worse solutio sets its bit to 1. The bitwise-or over every colum bit of the blocks is computed. A result ero of a colum sigifies the solutio i that colum is the best solutio. Each oe of these steps take O(1) time. Next, the solutio to the problem is determied by addig all the best sub-solutios. This solutio is the routed to the solutio bus for rakig ad storig as described i the previous sectio. The ru time of every step ivolved i this method is either costat or similar to the FACO- PR, except the summig of partial solutios. The sum of these log solutios ca be computed i O(loglog) time usig biary-tree reductio. It takes O( log ) time to fill the pipelie. Therefore, the ru time of the FACO-PR-Sub is O((( log 2 )+ log ) loglog). VII. FACO Algorithm o 3D PR-Mesh Ats used i the ACO models are cotrol logics ad ofte represeted as iteratios of ruig a algorithm; therefore, requirig more ats may ot impose ay physical restrictio o the algorithm. Iitially, we partitio a x PR-Mesh ito log 2 sub-meshes, or blocks, of sie. The x PR-Mesh M ca be viewed as a log x log log x log sub- logxlog matrix, where each elemet is a mesh as i Figure 6. Assumig we have at least ats, log 2. We desigate the last row of the x PR-Mesh as the solutio bus as i Sectio V. First we distribute /log 2 ats to each block. The ats are pipelied through the sub-mesh i the same way as described i Sectio V. After a at fids its solutio i its block b i,j, where i, j [1,log], the best of all the solutios foud by these Combiatorial problems that have their weights chagig dyamically durig the executio of a algorithm ad require iteratig over the solutios may times for the solutios to coverge to a optimal solutio ted to use a large umber of ats to solve. I this sectio, we describe a techique to exted the FACO-PR-Sub to multidimesioal PR-Mesh to solve such problem effectively. A. Jobs FACO Algorithm o 3D PR- Mesh (FACO-PR 3D) I geeral, if the umber of ats used i the ACO algorithms is relatively large, the FACO ca be exteded to ru o a k-dimesioal PR-Mesh. I particular, we will

7 slice j across all the slices Slice k Slice 1 j i row 1 row j s s s s s Fig. 7. A xx PR-Mesh Soluio mesh Head of the bus Fig. 8. A x PR-Mesh cofigured it a sake-like liear bus describe a techique to exted the FACO to ru o a xx PR-Mesh i logarithmic time. A 3D PR-Mesh ca be viewed as of x PR-Meshes ( slices) coected by the optical buses i the third dimesio k, where all the last rows of these slices form a x solutio mesh (see Figure 7). Let row t h of each slice holds the best solutios foud so far - we call these rows as solutio buses. The FACO algorithm ca be implemeted as follows: First, let the ats pipeliig through the slices as i the 2D PR-Mesh. After steps, there will be ats arrivig to the last row with their solutios, oe i every slice. For the first arrivig solutios, we must sort them by their raks. This ca be achieved by usig Leighto s colum sort algorithm o a x mesh, as described i [11] (page 377), i O(1) time. After sortig, these solutios are broadcasted to all other slices. O every subsequece step, the raks of ew solutios are determied ad routed to their positios i the solutio buses. To extract the best solutios we perform the followig stages. First, the solutios i p,j,1 are swapped with the solutios i p,j, if j is eve. Secod, p,j,1,p,j,2,...,p,j, are coected to form a bus goig across all slices. p,j,k is the head of the bus if j is odd, otherwise, p,j, is the head of the bus. All PEs holdig ew solutios ad head PEs are set to active. All active PEs have their solutios raked ad routed to their positios. At this momet, all the ew solutios are packed to the head of the buses. Next, the solutio mesh is cofigured ito a sake-like bus,(see Figure 8), where p,1,1 is the head of the bus. The best solutios are routed to the head of the bus by computig the prefix-sum of the iactive PEs ad lettig all active PEs sed their solutios to the PEs of distaces equal to their prefix-sums toward the head of the bus. The fial step is to propagate the best solutios to each solutio bus i the solutio mesh. This is doe i two steps:p,1,k seds its solutio to p,k,k ad p,k,k broadcasts its value across slices p,k,1,p,k,2,...,p,k,. All of these steps ru i O(1) time. The pheromoe is updated i each slice similar to the earlier described algorithm. p,k,k updates slice k if the its solutio is ew. Next, 2 buses coectig PEs i the first slice to the last slice are formed to broadcast the ew pheromoe values to all the slices. As a result, the ru time for this algorithm is O(( )+ ), where is the total umber of ats. Usig the same method described i Sectio VI, the ru time of this algorithm ca be lowered to O((( log 2 )+ log ) loglog). B. x Jobs At Coloy Algorithm o 3D PR-Mesh (FACO-PR 3Dx) To solve a problem with 2 jobs usig the PR-Mesh as described i Sectio V we will eed a 2 x 2 PR- Mesh to store the pheromoe values. I this sectio, we will use the result of the previous sectio to solve this problem efficietly. Let the 2 jobs be distributed alog the j ad k directios of the 3D-PR-Mesh such that each slice cotaiig jobs. The solutios are computed i both directios i ad j as i Figure 7. This algorithm requires at least +1 ats. Each at steps dow from the first row to the last row of its slice ad selects a item similar to the the geeral ACO described i Sectio III-A simultaeously. We call the at from the first slice the leader at. Whe the leader at is o row i ad selectig item j, the weight of that item w j is set to the crossed slice j (cotaiig p 1,j,1,p 2,j,1,...,p,j, ). This is possible sice each PE has may separate buses, (see Figure 1). For clarity, we ca visualie the first slice as a additioal x mesh attachig to the xx cube. The solutio of slice j is the sum of its solutio ad the weight w j of the leader at. After steps, the leader at tailors its solutio by summig the solutios of all crossed slices i O(log) time. The pheromoe is updated o each crossed slice as described i previous sectios. At the ed of each geeratio, the maximum values τ ij ad η ij, (i, j [1,]) must be routed to the first slice for the leader at to work properly. This step is doe as follows: First, each p i,j o slice 1 becomes the head of a bus coectig all other p i,j from all other slices. Next, the max values are computed ad routed to the heads of

8 TABLE I. Algorithms Ru times Algorithm Ru Time ACO O(x (m + ) log) ACO-PR O(( + ) log) FACO O(( + ) log ) FACO-PR O( + ) FACO-PR-Sub O(( log 2 + log ) loglog) FACO-PR 3D O(( log 2 + log ) loglog) FACO-PR 3Dx O(x (m + ) log) TABLE II. Algorithms Characteristics Algorithms Number of ats Jobs Max. Wts. Mesh Sie ACO x m 2 x ACO-PR x m 2 x FACO x m log 2 log 2 x log 2 FACO-PR x m 2 x FACO-PR-Sub log2 x 2 x FACO-PR 3D log2 x 2 xx FACO-PR 3Dx xx the buses. Each bus works idepedetly. These steps ca be doe i O(1) time [11] (page 376). The total ru time to compute the first solutio is O(+ log). The ru time of the 3D PR-Mesh is O(x (m + ) log), where x is the umber of geeratios each havig m ats. This ru time ca be further reduced by applyig the partitioig techique i Sectio VI to the algorithm. VIII. Coclusio I this paper, we show that the At Coloy Optimiatio (ACO) ad its simplified versio Fast At Coloy Optimiatio (FACO) algorithms with ats ca be simulated o the Optical Pipelied Recofigurable Mesh (PR-Mesh) with better time complexity tha the electrical recofigurable mesh. Ulike the electrical model, which restricts the umber of differet weights to be at most log 2, the optical model imposes o restrictio o the weights. I particular, we preset techiques to implemet the FACO algorithm o the PR-Mesh with ru time O(( log 2 + log log 2 + log ) loglog) ad O(( ) loglog) o 2D PR-Mesh ad 3D-PR-Mesh (FACO-PR-Sub ad FACO-PR 3D), ad a versio of the FACO algorithm with 2 jobs o a 3D-PR- Mesh (FACO-PR 3Dx) i O(x (m + ) log). The rutime complexity compariso of the two models is give i Table I, ad the characteristics of the algorithms o these models are give i Table II, (ote: x is the umber of geeratios, m is the umber of ats per geeratio, = x m, ad italicied algorithms are o the electrical recofigurable mesh). Refereces [1] M. Dorigo ad L. M. Gambardella, At Coloy System: A Cooperative Learig Approach to the Travelig Salesma Problem, IEEE tras. o Evolutioary Comp. Vol. 1: 53-66, [2] T. Stütle, A at Approach for the Flow Shop Problem, Proc. 6th Europea Cogress o Itelleget Techiques ad Soft Computig, Verlag Mai: Aache, Vol. 3: , [3] D. Merkle ad M. Middedorf, Fast At Coloy Optimiatio o Rutime Recofigurable Processor Arrays,. Joural of Geetic Programmig ad Evolvable Machies, Vol. 3: , [4] D. Merkle, M. Middedorf, ad H. Schmeck, At Coloy Optimiatio for Resource-Costraied Project Schedulig, IEEE tras. o Evolutioary Comp. Vol. 6: , [5] L. M. Gambardella, E. Taillard, ad M. Dorigo, At Coloies for the Quadratic Assigmet Problem, Joural of the Operatioal Research Society, Vol. 50: , [6] V. Maieo, A. Colori, ad M. Dorigo, The At System Applied to the Quaratic Assigmet Problem, IEEE Tras. Kowledge ad Data Egieerig, Vol. 11: , [7] D. Costa ad A. Hert, Ats ca colour graphs, Joural of the Operatioal Research Society, Vol. 48: , [8] G. Di Caro ad M. Dorigo, At coloies for adaptive routig i packet-switchied commuicatios etworks, Proceedig of PPSN- V, Fifth Iteratioal Coferece o Parallel Problem Solvig from Nature, Vol. 1498: , [9] R. Schooderwoerd, O. Hollad, J. Brute, ad L. Rothkrat, At-based load balacig i telecommuicatio etworks, Adaptive behavior Vol. 5: , [10] M. Dorigo, Optimiatio, learig ad atural algorithm, Ph.D Thesis, DEI, Politecico di Milaio, Italy, [11] R. Vaidyaatha ad J. Traha, Dyamic Recofiguratio - Architectures ad Algorithms, Kluwer Academic/Pleum Publishers, ISBN , [12] J L. Traha, A. G. Bourgeois, ad R. Vaidyaatha, Tighter ad Broader Complexity Results for Recofigurable Models, Parallel Processig Letters, Vol. 8: , [13] M. Middedorf, Bit-summatio o the Recofigurable Mesh, Parallel ad Distributed Computig, Proc. of the 11 IPPS/SPDP 99 Workshops, 6th Recofigurable Architectures Workshop RAW-99, J. Rolim et al. (eds.), Spriger: Berli, LNCS, Vol. 1586: , [14] Y. Pa ad K. Li, Liear Array with a Recofigurable Pipelied Bus Systems: Cocepts ad Applicatios, Iformatio Sciece 106: , (1998). [15] C. Qiao ad R. Melhem, Time-Divisio Optical Commuicatio i Multiprocessor Arrays, IEEE Tras. Comput. 42: , [16] K. Nakao. Prefix-sums Algorithms o Recofigurable Mesh, Par. Process. Letter, 5:23-25,1995. [17] Delisle P., Krajecki M., Gravel M., Gag C., Parallel implemetatio of a at coloy optimiatio metaheuristic with OpeMP, Iteratioal Coferece o Parallel Architectures ad Compilatio Techiques, Proceedigs of the 3rd Europea Workshop o OpeMP (EWOMP01), September 2001, Barceloe, Espage. [18] V. Maieo, Exact ad approximate oditermiistic tree-search procedures for the quaratic assigmet problem, INFORMS Joural o Computig, Vol. 11: , [19] T. Stütle, H.H. Hoos, The MAX-MIN at system ad local search for the travelig salesma problem, Proceedigs of the IEEE Iteratioal Coferece o Evolutioary Computatio(ICEC 97): , 1997.

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem

An Improved Shuffled Frog-Leaping Algorithm for Knapsack Problem A Improved Shuffled Frog-Leapig Algorithm for Kapsack Problem Zhoufag Li, Ya Zhou, ad Peg Cheg School of Iformatio Sciece ad Egieerig Hea Uiversity of Techology ZhegZhou, Chia lzhf1978@126.com Abstract.

More information

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions U.C. Berkeley CS170 : Algorithms Midterm 1 Solutios Lecturers: Sajam Garg ad Prasad Raghavedra Feb 1, 017 Midterm 1 Solutios 1. (4 poits) For the directed graph below, fid all the strogly coected compoets

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem Exact Miimum Lower Boud Algorithm for Travelig Salesma Problem Mohamed Eleiche GeoTiba Systems mohamed.eleiche@gmail.com Abstract The miimum-travel-cost algorithm is a dyamic programmig algorithm to compute

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Heuristic Approaches for Solving the Multidimensional Knapsack Problem (MKP)

Heuristic Approaches for Solving the Multidimensional Knapsack Problem (MKP) Heuristic Approaches for Solvig the Multidimesioal Kapsack Problem (MKP) R. PARRA-HERNANDEZ N. DIMOPOULOS Departmet of Electrical ad Computer Eg. Uiversity of Victoria Victoria, B.C. CANADA Abstract: -

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter The Processor Part A path Desig Itroductio CPU performace factors Istructio cout Determied by ISA ad compiler. CPI ad

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers *

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers * Load balaced Parallel Prime umber Geerator with Sieve of Eratosthees o luster omputers * Soowook Hwag*, Kyusik hug**, ad Dogseug Kim* *Departmet of Electrical Egieerig Korea Uiversity Seoul, -, Rep. of

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

. Written in factored form it is easy to see that the roots are 2, 2, i,

. Written in factored form it is easy to see that the roots are 2, 2, i, CMPS A Itroductio to Programmig Programmig Assigmet 4 I this assigmet you will write a java program that determies the real roots of a polyomial that lie withi a specified rage. Recall that the roots (or

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

Lecture 1: Introduction and Strassen s Algorithm

Lecture 1: Introduction and Strassen s Algorithm 5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access

More information

condition w i B i S maximum u i

condition w i B i S maximum u i ecture 10 Dyamic Programmig 10.1 Kapsack Problem November 1, 2004 ecturer: Kamal Jai Notes: Tobias Holgers We are give a set of items U = {a 1, a 2,..., a }. Each item has a weight w i Z + ad a utility

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 19 Query Optimizatio Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Query optimizatio Coducted by a query optimizer i a DBMS Goal:

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045 Oe Brookigs Drive St. Louis, Missouri 63130-4899, USA jaegerg@cse.wustl.edu

More information

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015 Presetatio for use with the textbook Algorithm Desig ad Applicatios, by M. T. Goodrich ad R. Tamassia, Wiley, 201 Heaps 201 Goodrich ad Tamassia xkcd. http://xkcd.com/83/. Tree. Used with permissio uder

More information

COMP Parallel Computing. PRAM (1): The PRAM model and complexity measures

COMP Parallel Computing. PRAM (1): The PRAM model and complexity measures COMP 633 - Parallel Computig Lecture 2 August 24, 2017 : The PRAM model ad complexity measures 1 First class summary This course is about parallel computig to achieve high-er performace o idividual problems

More information

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO Sagwo Seo, Trevor Mudge Advaced Computer Architecture Laboratory Uiversity of Michiga at A Arbor {swseo, tm}@umich.edu Yumig Zhu, Chaitali

More information

Parallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware

Parallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware Parallel Polygo Approximatio Algorithm Targeted at Recofigurable Multi-Rig Hardware M. Arif Wai* ad Hamid R. Arabia** *Califoria State Uiversity Bakersfield, Califoria, USA **Uiversity of Georgia, Georgia,

More information

CS 683: Advanced Design and Analysis of Algorithms

CS 683: Advanced Design and Analysis of Algorithms CS 683: Advaced Desig ad Aalysis of Algorithms Lecture 6, February 1, 2008 Lecturer: Joh Hopcroft Scribes: Shaomei Wu, Etha Feldma February 7, 2008 1 Threshold for k CNF Satisfiability I the previous lecture,

More information

Data Structures and Algorithms. Analysis of Algorithms

Data Structures and Algorithms. Analysis of Algorithms Data Structures ad Algorithms Aalysis of Algorithms Outlie Ruig time Pseudo-code Big-oh otatio Big-theta otatio Big-omega otatio Asymptotic algorithm aalysis Aalysis of Algorithms Iput Algorithm Output

More information

CSE 417: Algorithms and Computational Complexity

CSE 417: Algorithms and Computational Complexity Time CSE 47: Algorithms ad Computatioal Readig assigmet Read Chapter of The ALGORITHM Desig Maual Aalysis & Sortig Autum 00 Paul Beame aalysis Problem size Worst-case complexity: max # steps algorithm

More information

WEBSITE STRUCTURE IMPROVEMENT USING ANT COLONY TECHNIQUE

WEBSITE STRUCTURE IMPROVEMENT USING ANT COLONY TECHNIQUE WEBSITE STRUCTURE IMPROVEMENT USING ANT COLONY TECHNIQUE Wiwik Aggraei 1, Agyl Ardi Rahmadi 1, Radityo Prasetyo Wibowo 1 1 Iformatio System Departmet, Faculty of Iformatio Techology, Istitut Tekologi Sepuluh

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

A Study on the Performance of Cholesky-Factorization using MPI

A Study on the Performance of Cholesky-Factorization using MPI A Study o the Performace of Cholesky-Factorizatio usig MPI Ha S. Kim Scott B. Bade Departmet of Computer Sciece ad Egieerig Uiversity of Califoria Sa Diego {hskim, bade}@cs.ucsd.edu Abstract Cholesky-factorizatio

More information

Data Structures Week #9. Sorting

Data Structures Week #9. Sorting Data Structures Week #9 Sortig Outlie Motivatio Types of Sortig Elemetary (O( 2 )) Sortig Techiques Other (O(*log())) Sortig Techiques 21.Aralık.2010 Boraha Tümer, Ph.D. 2 Sortig 21.Aralık.2010 Boraha

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

Switching Hardware. Spring 2018 CS 438 Staff, University of Illinois 1

Switching Hardware. Spring 2018 CS 438 Staff, University of Illinois 1 Switchig Hardware Sprig 208 CS 438 Staff, Uiversity of Illiois Where are we? Uderstad Differet ways to move through a etwork (forwardig) Read sigs at each switch (datagram) Follow a kow path (virtual circuit)

More information

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a

n n B. How many subsets of C are there of cardinality n. We are selecting elements for such a 4. [10] Usig a combiatorial argumet, prove that for 1: = 0 = Let A ad B be disjoit sets of cardiality each ad C = A B. How may subsets of C are there of cardiality. We are selectig elemets for such a subset

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13 CIS Data Structures ad Algorithms with Java Sprig 08 Stacks ad Queues Moday, February / Tuesday, February Learig Goals Durig this lab, you will: Review stacks ad queues. Lear amortized ruig time aalysis

More information

ISSN (Print) Research Article. *Corresponding author Nengfa Hu

ISSN (Print) Research Article. *Corresponding author Nengfa Hu Scholars Joural of Egieerig ad Techology (SJET) Sch. J. Eg. Tech., 2016; 4(5):249-253 Scholars Academic ad Scietific Publisher (A Iteratioal Publisher for Academic ad Scietific Resources) www.saspublisher.com

More information

Bezier curves. Figure 2 shows cubic Bezier curves for various control points. In a Bezier curve, only

Bezier curves. Figure 2 shows cubic Bezier curves for various control points. In a Bezier curve, only Edited: Yeh-Liag Hsu (998--; recommeded: Yeh-Liag Hsu (--9; last updated: Yeh-Liag Hsu (9--7. Note: This is the course material for ME55 Geometric modelig ad computer graphics, Yua Ze Uiversity. art of

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations Applied Mathematical Scieces, Vol. 1, 2007, o. 25, 1203-1215 A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045, Oe

More information

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP Nature-Ispired Computig Hadlig Costraits Dr. Şima Uyar September 2006 Itroductio may practical problems are costraied ot all combiatios of variable values represet valid solutios feasible solutios ifeasible

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

A Parallel DFA Minimization Algorithm

A Parallel DFA Minimization Algorithm A Parallel DFA Miimizatio Algorithm Ambuj Tewari, Utkarsh Srivastava, ad P. Gupta Departmet of Computer Sciece & Egieerig Idia Istitute of Techology Kapur Kapur 208 016,INDIA pg@iitk.ac.i Abstract. I this

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

Chapter 3 Classification of FFT Processor Algorithms

Chapter 3 Classification of FFT Processor Algorithms Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le

Fundamentals of Media Processing. Shin'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dinh Le Fudametals of Media Processig Shi'ichi Satoh Kazuya Kodama Hiroshi Mo Duy-Dih Le Today's topics Noparametric Methods Parze Widow k-nearest Neighbor Estimatio Clusterig Techiques k-meas Agglomerative Hierarchical

More information

A Note on Least-norm Solution of Global WireWarping

A Note on Least-norm Solution of Global WireWarping A Note o Least-orm Solutio of Global WireWarpig Charlie C. L. Wag Departmet of Mechaical ad Automatio Egieerig The Chiese Uiversity of Hog Kog Shati, N.T., Hog Kog E-mail: cwag@mae.cuhk.edu.hk Abstract

More information

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS Prosejit Bose Evagelos Kraakis Pat Mori Yihui Tag School of Computer Sciece, Carleto Uiversity {jit,kraakis,mori,y

More information

A New Bit Wise Technique for 3-Partitioning Algorithm

A New Bit Wise Technique for 3-Partitioning Algorithm Special Issue of Iteratioal Joural of Computer Applicatios (0975 8887) o Optimizatio ad O-chip Commuicatio, No.1. Feb.2012, ww.ijcaolie.org A New Bit Wise Techique for 3-Partitioig Algorithm Rajumar Jai

More information

1.2 Binomial Coefficients and Subsets

1.2 Binomial Coefficients and Subsets 1.2. BINOMIAL COEFFICIENTS AND SUBSETS 13 1.2 Biomial Coefficiets ad Subsets 1.2-1 The loop below is part of a program to determie the umber of triagles formed by poits i the plae. for i =1 to for j =

More information

Data diverse software fault tolerance techniques

Data diverse software fault tolerance techniques Data diverse software fault tolerace techiques Complemets desig diversity by compesatig for desig diversity s s limitatios Ivolves obtaiig a related set of poits i the program data space, executig the

More information

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs What are we goig to lear? CSC316-003 Data Structures Aalysis of Algorithms Computer Sciece North Carolia State Uiversity Need to say that some algorithms are better tha others Criteria for evaluatio Structure

More information

Computers and Scientific Thinking

Computers and Scientific Thinking Computers ad Scietific Thikig David Reed, Creighto Uiversity Chapter 15 JavaScript Strigs 1 Strigs as Objects so far, your iteractive Web pages have maipulated strigs i simple ways use text box to iput

More information

Counting the Number of Minimum Roman Dominating Functions of a Graph

Counting the Number of Minimum Roman Dominating Functions of a Graph Coutig the Number of Miimum Roma Domiatig Fuctios of a Graph SHI ZHENG ad KOH KHEE MENG, Natioal Uiversity of Sigapore We provide two algorithms coutig the umber of miimum Roma domiatig fuctios of a graph

More information

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein

Lecture 6. Lecturer: Ronitt Rubinfeld Scribes: Chen Ziv, Eliav Buchnik, Ophir Arie, Jonathan Gradstein 068.670 Subliear Time Algorithms November, 0 Lecture 6 Lecturer: Roitt Rubifeld Scribes: Che Ziv, Eliav Buchik, Ophir Arie, Joatha Gradstei Lesso overview. Usig the oracle reductio framework for approximatig

More information

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects. The

More information

Hash Tables. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015.

Hash Tables. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015. Presetatio for use with the textbook Algorithm Desig ad Applicatios, by M. T. Goodrich ad R. Tamassia, Wiley, 2015 Hash Tables xkcd. http://xkcd.com/221/. Radom Number. Used with permissio uder Creative

More information

Efficient Parallel Algorithms for Euclidean Distance Transform

Efficient Parallel Algorithms for Euclidean Distance Transform The Computer Joural, Vol. 47 No. 6, The British Computer Society; all rights reserved Efficiet Parallel Algorithms for Euclidea Distace Trasform Lig Che 1,,YiPa 3, Yixi Che 4 ad Xiao-hua Xu 1 1 Departmet

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

Security of Bluetooth: An overview of Bluetooth Security

Security of Bluetooth: An overview of Bluetooth Security Versio 2 Security of Bluetooth: A overview of Bluetooth Security Marjaaa Träskbäck Departmet of Electrical ad Commuicatios Egieerig mtraskba@cc.hut.fi 52655H ABSTRACT The purpose of this paper is to give

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time ( 3.1) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step- by- step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Ruig Time Most algorithms trasform iput objects ito output objects. The

More information

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5 Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:

More information

Lecture 28: Data Link Layer

Lecture 28: Data Link Layer Automatic Repeat Request (ARQ) 2. Go ack N ARQ Although the Stop ad Wait ARQ is very simple, you ca easily show that it has very the low efficiecy. The low efficiecy comes from the fact that the trasmittig

More information

Greedy Algorithms. Interval Scheduling. Greedy Algorithms. Interval scheduling. Greedy Algorithms. Interval Scheduling

Greedy Algorithms. Interval Scheduling. Greedy Algorithms. Interval scheduling. Greedy Algorithms. Interval Scheduling Greedy Algorithms Greedy Algorithms Witer Paul Beame Hard to defie exactly but ca give geeral properties Solutio is built i small steps Decisios o how to build the solutio are made to maximize some criterio

More information

BASED ON ITERATIVE ERROR-CORRECTION

BASED ON ITERATIVE ERROR-CORRECTION A COHPARISO OF CRYPTAALYTIC PRICIPLES BASED O ITERATIVE ERROR-CORRECTIO Miodrag J. MihaljeviC ad Jova Dj. GoliC Istitute of Applied Mathematics ad Electroics. Belgrade School of Electrical Egieerig. Uiversity

More information

Multiprocessors. HPC Prof. Robert van Engelen

Multiprocessors. HPC Prof. Robert van Engelen Multiprocessors Prof. Robert va Egele Overview The PMS model Shared memory multiprocessors Basic shared memory systems SMP, Multicore, ad COMA Distributed memory multicomputers MPP systems Network topologies

More information

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits Egieerig Letters, :, EL Reversible Realizatio of Quaterary Decoder, Multiplexer, ad Demultiplexer Circuits Mozammel H.. Kha, Member, ENG bstract quaterary reversible circuit is more compact tha the correspodig

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

Dynamic Programming and Curve Fitting Based Road Boundary Detection

Dynamic Programming and Curve Fitting Based Road Boundary Detection Dyamic Programmig ad Curve Fittig Based Road Boudary Detectio SHYAM PRASAD ADHIKARI, HYONGSUK KIM, Divisio of Electroics ad Iformatio Egieerig Chobuk Natioal Uiversity 664-4 Ga Deokji-Dog Jeoju-City Jeobuk

More information

Continuous Ant Colony System and Tabu Search Algorithms Hybridized for Global Minimization of Continuous Multi-minima Functions

Continuous Ant Colony System and Tabu Search Algorithms Hybridized for Global Minimization of Continuous Multi-minima Functions Cotiuous At Coloy System ad Tabu Search Algorithms Hybridized for Global Miimizatio of Cotiuous Multi-miima Fuctios Akbar Karimi Departmet of Aerospace Egieerig, Sharif Uiversity of Techology, P.O. Box:

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

Improving Information Retrieval System Security via an Optimal Maximal Coding Scheme

Improving Information Retrieval System Security via an Optimal Maximal Coding Scheme Improvig Iformatio Retrieval System Security via a Optimal Maximal Codig Scheme Dogyag Log Departmet of Computer Sciece, City Uiversity of Hog Kog, 8 Tat Chee Aveue Kowloo, Hog Kog SAR, PRC dylog@cs.cityu.edu.hk

More information

Computational Geometry

Computational Geometry Computatioal Geometry Chapter 4 Liear programmig Duality Smallest eclosig disk O the Ageda Liear Programmig Slides courtesy of Craig Gotsma 4. 4. Liear Programmig - Example Defie: (amout amout cosumed

More information

LU Decomposition Method

LU Decomposition Method SOLUTION OF SIMULTANEOUS LINEAR EQUATIONS LU Decompositio Method Jamie Traha, Autar Kaw, Kevi Marti Uiversity of South Florida Uited States of America kaw@eg.usf.edu http://umericalmethods.eg.usf.edu Itroductio

More information

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties WSEAS TRANSACTIONS o COMMUNICATIONS Wag Xiyag The Couterchaged Crossed Cube Itercoectio Network ad Its Topology Properties WANG XINYANG School of Computer Sciece ad Egieerig South Chia Uiversity of Techology

More information

Some non-existence results on Leech trees

Some non-existence results on Leech trees Some o-existece results o Leech trees László A.Székely Hua Wag Yog Zhag Uiversity of South Carolia This paper is dedicated to the memory of Domiique de Cae, who itroduced LAS to Leech trees.. Abstract

More information

BGP Attributes and Path Selection. ISP Training Workshops

BGP Attributes and Path Selection. ISP Training Workshops BGP Attributes ad Path Selectio ISP Traiig Workshops 1 BGP Attributes The tools available for the job 2 What Is a Attribute?... Next Hop AS Path MED...... p Part of a BGP Update p Describes the characteristics

More information

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved.

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved. Chapter 11 Frieds, Overloaded Operators, ad Arrays i Classes Copyright 2014 Pearso Addiso-Wesley. All rights reserved. Overview 11.1 Fried Fuctios 11.2 Overloadig Operators 11.3 Arrays ad Classes 11.4

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Ruig Time of a algorithm Ruig Time Upper Bouds Lower Bouds Examples Mathematical facts Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite

More information

A Novel Approach to Solve Multiple Traveling Salesmen Problem by Genetic Algorithm

A Novel Approach to Solve Multiple Traveling Salesmen Problem by Genetic Algorithm A Novel Approach to Solve Multiple Travelig Salesme Problem by Geetic Algorithm Adrás Király, Jáos Aboyi Uiversity of Paoia, Departmet of Process Egieerig, P.O. Box 58. Veszprém H-8200, HUNGARY, e-mail:

More information

1 Graph Sparsfication

1 Graph Sparsfication CME 305: Discrete Mathematics ad Algorithms 1 Graph Sparsficatio I this sectio we discuss the approximatio of a graph G(V, E) by a sparse graph H(V, F ) o the same vertex set. I particular, we cosider

More information

Evaluation scheme for Tracking in AMI

Evaluation scheme for Tracking in AMI A M I C o m m u i c a t i o A U G M E N T E D M U L T I - P A R T Y I N T E R A C T I O N http://www.amiproject.org/ Evaluatio scheme for Trackig i AMI S. Schreiber a D. Gatica-Perez b AMI WP4 Trackig:

More information

Minimum Spanning Trees

Minimum Spanning Trees Miimum Spaig Trees Miimum Spaig Trees Spaig subgraph Subgraph of a graph G cotaiig all the vertices of G Spaig tree Spaig subgraph that is itself a (free) tree Miimum spaig tree (MST) Spaig tree of a weighted

More information

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria.

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria. Computer Sciece Foudatio Exam August, 005 Computer Sciece Sectio A No Calculators! Name: SSN: KEY Solutios ad Gradig Criteria Score: 50 I this sectio of the exam, there are four (4) problems. You must

More information

Lower Bounds for Sorting

Lower Bounds for Sorting Liear Sortig Topics Covered: Lower Bouds for Sortig Coutig Sort Radix Sort Bucket Sort Lower Bouds for Sortig Compariso vs. o-compariso sortig Decisio tree model Worst case lower boud Compariso Sortig

More information

Lecture 18. Optimization in n dimensions

Lecture 18. Optimization in n dimensions Lecture 8 Optimizatio i dimesios Itroductio We ow cosider the problem of miimizig a sigle scalar fuctio of variables, f x, where x=[ x, x,, x ]T. The D case ca be visualized as fidig the lowest poit of

More information

Speeding-up dynamic programming in sequence alignment

Speeding-up dynamic programming in sequence alignment Departmet of Computer Sciece Aarhus Uiversity Demark Speedig-up dyamic programmig i sequece aligmet Master s Thesis Dug My Hoa - 443 December, Supervisor: Christia Nørgaard Storm Pederse Implemetatio code

More information

Optimal Mapped Mesh on the Circle

Optimal Mapped Mesh on the Circle Koferece ANSYS 009 Optimal Mapped Mesh o the Circle doc. Ig. Jaroslav Štigler, Ph.D. Bro Uiversity of Techology, aculty of Mechaical gieerig, ergy Istitut, Abstract: This paper brigs out some ideas ad

More information

Cubic Polynomial Curves with a Shape Parameter

Cubic Polynomial Curves with a Shape Parameter roceedigs of the th WSEAS Iteratioal Coferece o Robotics Cotrol ad Maufacturig Techology Hagzhou Chia April -8 00 (pp5-70) Cubic olyomial Curves with a Shape arameter MO GUOLIANG ZHAO YANAN Iformatio ad

More information

Civil Engineering Computation

Civil Engineering Computation Civil Egieerig Computatio Fidig Roots of No-Liear Equatios March 14, 1945 World War II The R.A.F. first operatioal use of the Grad Slam bomb, Bielefeld, Germay. Cotets 2 Root basics Excel solver Newto-Raphso

More information

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control EE 459/500 HDL Based Digital Desig with Programmable Logic Lecture 13 Cotrol ad Sequecig: Hardwired ad Microprogrammed Cotrol Refereces: Chapter s 4,5 from textbook Chapter 7 of M.M. Mao ad C.R. Kime,

More information

Throughput-Delay Scaling in Wireless Networks with Constant-Size Packets

Throughput-Delay Scaling in Wireless Networks with Constant-Size Packets Throughput-Delay Scalig i Wireless Networks with Costat-Size Packets Abbas El Gamal, James Mamme, Balaji Prabhakar, Devavrat Shah Departmets of EE ad CS Staford Uiversity, CA 94305 Email: {abbas, jmamme,

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19 CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.

More information

Transitioning to BGP

Transitioning to BGP Trasitioig to BGP ISP Workshops These materials are licesed uder the Creative Commos Attributio-NoCommercial 4.0 Iteratioal licese (http://creativecommos.org/liceses/by-c/4.0/) Last updated 24 th April

More information

Solution printed. Do not start the test until instructed to do so! CS 2604 Data Structures Midterm Spring, Instructions:

Solution printed. Do not start the test until instructed to do so! CS 2604 Data Structures Midterm Spring, Instructions: CS 604 Data Structures Midterm Sprig, 00 VIRG INIA POLYTECHNIC INSTITUTE AND STATE U T PROSI M UNI VERSI TY Istructios: Prit your ame i the space provided below. This examiatio is closed book ad closed

More information

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8)

CIS 121 Data Structures and Algorithms with Java Fall Big-Oh Notation Tuesday, September 5 (Make-up Friday, September 8) CIS 11 Data Structures ad Algorithms with Java Fall 017 Big-Oh Notatio Tuesday, September 5 (Make-up Friday, September 8) Learig Goals Review Big-Oh ad lear big/small omega/theta otatios Practice solvig

More information