Efficient Hough transform on the FPGA using DSP slices and block RAMs

Size: px
Start display at page:

Download "Efficient Hough transform on the FPGA using DSP slices and block RAMs"

Transcription

1 Efficiet Hough trasform o the FPGA usig DSP slices ad block RAMs Xi Zhou, Norihiro Tomagou, Yasuaki Ito, ad Koji Nakao Departmet of Iformatio Egieerig Hiroshima Uiversity Kagamiyama 1-4-1, Higashi Hiroshima, Japa Abstract The mai cotributio of this paper is to preset a ew FPGA architecture for the Hough trasform that idetifies straight lies i a biary image. Recet FPGAs have hudreds of embedded DSP slices ad block RAMs. For example, Xilix Virtex-6 Family FPGAs have a DSP48E1 slice, which is a cofigurable logic block equipped with fast multipliers, adders, pipelie registers, ad so o. They also have a dualport memory with 18Kbits as a block RAM. Oe of the most importat key techiques for acceleratig computatio usig FPGAs is a efficiet usage of DSP slices ad block RAMs. Our ew architecture for the Hough trasform uses 178 DSP48E1 slices ad 180 block RAMs with 18Kbits that work i parallel. As far as we kow, there is o previously published work that fully utilizes DSP slices ad block RAMs for the Hough trasform. Roughly speakig, a covetioal sequetial implemetatio performs 180m votig operatios for m edge poits. Our architecture performs votig operatios i parallel, ad outputs idetified straight lies i m+97 clock cycles. Sice 180m votig operatios are performed usig 178 DSP48E1 slices, the lower boud of the computig time is m clock cycles. Hece our implemetatio is close to optimal. The implemetatio results show that the Hough trasform for a image with 333 edge poits ca be doe i oly µs. Keywords-Image processig, Lie detectio, Hough trasform, FPGA, Embedded DSP slices, Embedded block RAMs parameter space appropriately. I the followigs, we call this coutig to the accumulators votig. More specifically, for each edge poit (x, y) i a -dimetioal image, the votig is performed alog a curve = x cos θ+y si θ (0 θ < 180). Possible lies ca be detected by searchig poits that are voted itesively. Figure 1 shows a example of straight lie detectio usig the Hough trasform. For a iput image (Figure 1(a)), the biary edge image (Figure 1(b)) is obtaied by the edge detector such as Sobel filter. The result of votig to the parameter space is show i Figure. I this figure, darker poits show poits that are voted itesively, that is, represet probable lies. Accordig to the result of votig, the pricipal lies are detected (Figure 1(c)). I. INTRODUCTION Hough trasform is a techique to fid shapes i images [1]. I particular, it has bee utilized to extract lies, circles, ellipses ad arbitrary shapes. The Hough trasform defies a mappig from a image ito a parameter space represeted by a accumulate array. The parameter space is defied by parameterizig detected shapes. Based o each edge poit of the image, the mappig adds a vote to correspodig elemets i the accumulate array. The elemets that are icreased represet associated parameters based o detected shapes. Therefore, the elemets that are voted itesively correspod to the parameters of shapes i the image space. The Hough trasform ca be used to extract straight lies i a biary image []. The idea of this method is to exploit the duality betwee poits of a lie ad parameters of that lie. A poit i the image is represeted by a curve i the parameter space ad lies of colliear poits itersect i the parameter space at oe poit. These itersectios are couted i a array of accumulators that quatizes the Figure. Hough parameter space A Field Programmable Array (FPGA) is a programmable logic device desiged to be cofigured by the customer or desiger by hardware descriptio laguage after maufacturig. The most commo FPGA architecture cosists of a array of logic blocks, I/O pads, block RAMs ad routig chaels. Furthermore, recet FPGAs have embedded DSP slices that make a higher performace ad a broader applicatio. The Xilix Virtex-6 series FPGAs have DSP48E1 slices that are equipped with a multiplier, adders, logic operators, etc [3]. More specifically, the DSP48E1 slice has a twoiput multiplier followed by multiplexers ad a three iput adder/subtractor/accumulator. The DSP48E1 multiplier ca

2 (a) Iput image (b) Biary edge image by Sobel filter (c) Lie detectio usig the Hough trasform Figure 1. Example of straight lie detectio usig the Hough trasform perform multiplicatio of a 18bit ad a 5bit two s complemet umbers ad produces oe 48bit two s complemet productio. Programmable pipeliig of iput operads, itermediate products, ad accumulator outputs ehaces throughput ad improves frequecy. The DSP48E1 also has pipelie registers betwee operators to reduce the delay. The block RAM i the Virtex-6 FPGA is a embedded memory supportig sychroized read ad write operatios. I the Virtex-6 FPGA, it ca cofigured as 36Kbit dual port block RAMs, FIFOs, or two 18Kbit dual port RAMs. I our architecture, it is used as a 1K 18bit dual port RAM. Sice FPGA chips maitai relatively low price ad its programmable features, it is widely used i those fields which eed to update architecture or fuctios frequetly such as commuicatio ad educatio areas. They are widely used i cosumer ad idustrial products for acceleratig processor itesive algorithms [4], [5], [6], [7], [8], [9], [10], [11], [1]. The mai cotributio of this paper is to preset a ew FPGA architecture for the Hough trasform fully utilizes embedded DSP slices ad block RAMs. Our ew idea icludes: Votig Space Partitioig: Polar coordiate votig space (θ, ) is partitioed ad arraged ito block RAMs. This eables us to perform votig operatios i parallel. Also, the fuctio of dual-port of block RAMs are fully used to accumulate the votig value istatly. Efficiet Usage of DSP slices: DSP slices are used to compute x cos θ ad y si θ i parallel for each edge pixel (x, y). We compute x cos θ ad y si θ for θ such that 0 θ < 90 istead of computig them for θ such that 0 θ < 180. Also, we avoid the computatio of the values of cos θ ad si θ by pre-loadig them i the DSP slices. Fully Pipelied Architecture: We take ito accout a layout of DSP slices ad block RAMs i Virtex-6 FPGA architecture, ad desig our Hough trasform architecture as a fully pipelied oe. For example, i the Virtex- 6 FPGA XC6VLX40T has 768 DSP48E1 slices arraged i 8 colums of 96 adjacet DSP48E1 slices. Neighborig DSP48E1 slices are coected directly through pipelie registers. Our Hough trasform architecture uses colums to compute x cos θ ad y si θ each, ad uses a pipelie techique to maximize the clock frequecy. Usig these ideas, our ew architecture for the Hough trasform uses 178 DSP48E1 slices ad 180 block RAMs with 18Kbits that work i parallel. Oe of the most importat key techiques for acceleratig computatio usig FPGAs is a efficiet usage of DSP slices ad block RAMs. Nevertheless, as far as we kow, there is o previously published work that fully utilizes DSP slices ad block RAMs for the Hough trasform. Roughly speakig, a covetioal sequetial implemetatio performs 180m votig operatios for m edge poits. Our architecture performs votig operatios i parallel, ad outputs idetified straight lies i m + 97 clock cycles. Sice 180m votig operatios are performed usig 178 DSP48E1 slices, the lower boud of the computig time is m clock cycles. Hece our implemetatio is close to optimal. We have implemeted our ew architecture o a Virtex-6 family FPGA XC6VLX40T-1. The circuit rus i MHz ad outputs idetified straight lies i m + 97 cycles. For example, Figure 1 icludes 333 edge poits. Therefore, the circuit ca perform the Hough trasform i µs. May hardware algorithms for FPGA implemetatio of the Hough trasform for lies have bee proposed i past. As far as we kow, however, there is o published hardware algorithm usig embedded DSP slices or multipliers i the

3 FPGA. I the existig researches, istead of circuits of multiplicatio with DSP slices or multipliers, they itroduced icremetal Hough trasform [13], [14], [15], CORDIC [16], [17], ad hybrid-log arithmetic [18] to the computatio of the Hough trasform. Sice most of recet FPGAs produced by pricipal vedors equip embedded DSP blocks [19], [0], [1], oe of the most importat key techiques for acceleratig computatio usig FPGAs is a efficiet usage of DSP slices ad block RAMs. This paper is orgaized as follows. Sectio II itroduces the Hough trasform algorithms for lies. We show the FPGA architecture for the Hough trasform i Sectio III. Sectio IV shows the experimetal results. Fially, Sectio V cocludes the paper. II. HOUGH TRANSFORM The mai purpose of this sectio is to review the Hough trasform algorithms for straight lies. Suppose that we have a image of size. We assume that pixels are arraged i two dimesioal xy-space such that the origi is i the ceter of the image as illustrated i Figure 3. Hece, both coordiates x ad y take itegers i the rage [ + 1, ]. A pixel (x, y) ( + 1 x, y ) i the xy-space is coverted to a curve i the θ-space by the followig formula: = x cos θ + y si θ (0 θ < 180) (1) Clearly, the double iequality < is satisfied. The values of θ ad ca also be obtaied geometrically. Suppose that we draw a lie goig through the origi with agle θ as illustrated i Figure 3. For such a lie, we ca draw the orthogoal lie goig through a pixel (x, y). The value of correspods to the distace to the lie. I other words, a poit (θ, ) of θ-space correspods to a lie of xy-space. The key idea of the Hough trasform is to vote i θ-space for every pixel i the xy-space. Let (x 0, y 0 ), (x 1, y 1 ),..., (x k 1, y k 1 ) be the k pixels i xyspace. The Hough trasform is spelled out as follows: [Straight Forward Hough Trasform] for i 0 to k 1 for θ 0 to 179 begi x k cos θ + y k si θ v[θ][] v[θ][] + 1 ed for θ 0 to 179 do i parallel for to do i parallel output (θ, ) if v[θ][] threshold For simplicity, we assume that the value of is automatically rouded to a iteger. I the Straight Forward Hough Trasform, for each poit (x k, y k ), the values of x k cos θ ad y k si θ are computed for θ = 0, 1,..., 179. If v[θ][] is storig a large value, may poits i the k iput pixels lie i the lie i xy-space correspods to a poit (θ, ) i θ-space. We will show that, it is sufficiet to compute these values for θ = 0, 1,..., 90. From the additio theorem of trigoometric fuctios, we have = x k cos(180 θ) + y k si(180 θ) = x k cos(θ) + y k si(θ). () Usig Formula (), the Hough trasform ca also be doe by partitioig the rage [0, 179] of θ ito two rages [0, 89] ad [90, 179]. Also, we avoid goig through array v for fidig elemets larger tha a threshold. Thus, our ew Hough trasform, called the Circuit-orieted Hough Trasform is be spelled out as follows: [Circuit-orieted Hough Trasform] for i 0 to k 1 do begi for θ 0 to 89 do begi x k cos θ + y k si θ v[θ][] v[θ][] + 1 output (θ, ) if v[θ][] = threshold ed for θ 1 to 90 do begi x cos(θ) + y si(θ) v[180 θ][] v[180 θ][] + 1 output (θ, ) if v[θ][] = threshold ed ed I the followig sectio, we show a efficiet implemetatio of the Circuit-orieted Hough Trasform. III. OUR FPGA ARCHITECTURE FOR THE HOUGH TRANSFORM This sectio describes our FPGA architecture for the Hough trasform usig DSP slices ad block RAMs i Xilix Virtex-6 FPGA. We use Xilix Virtex-6 Family FPGA XC6VLX40T-1 as the target device []. A. Structure of our architecture for the Hough trasform Figure 4 illustrates our architecture for the Hough trasform. We use 178 DSP slices X 1, X,... X 89 ad Y 1, Y,..., Y 89. For each θ (0 θ 90) X θ ad Y θ compute x k cos θ ad y k cos θ for give x k ad y k, respectively. Sice x k cos 0 = x k, x k cos 90 = 0, y k si 0 = 0, ad y k cos 90 = y k, DSP slices X 0, X 90, Y 0, ad Y 90 are ot ecessary. Usig a adder ad a subtractor for each pair X θ ad Y θ, θ = x k cos θ + y k cos θ ad 180 θ = x k cos θ + y k cos θ are computed. We also use 180 block RAMs V 0, V 1,... V 179 to store the votig value. Address

4 y (x, y) (θ, ) + 1 θ x θ + 1 Figure 3. Two dimesioal Spaces xy ad θ used i the Hough trasform of each V θ (0 θ 179) is used to store the value of v[θ][]. To miimize the delay betwee registers, DSP slices X 1,..., X 90 are coected i a pipelie fashio as illustrated i Figure 4. Each X θ has a register to store the value of x k. I every clock cycle, the value is trasferred from X θ to X θ+1. Similarly, DSP slices Y 0, Y 1,..., Y 90 are coected i a pipelie fashio. Figure 5 illustrates two DSP slices X θ ad Y θ with a adder ad subtractor to compute. I X θ, the value of x k is loaded i a iteral register. Also, the value of cos θ is pre-computed. Note that the value of cos θ used i X θ is a fixed value. The product of x k ad cos θ is computed i a multiplier of the DSP slice X θ. Similarly, the value of si θ used i Y θ is a fixed value ad the product of y k ad si θ is computed i a multiplier of the DSP slice Y θ. I the Virtex-6 FPGA XC6VLX40T, that is our target device, DSP48E1 slices are arraged i 8 colums of 96 adjacet DSP48E1 slices. Neighborig DSP48E1 slices are coected directly through pipelie registers. Our Hough trasform architecture uses colums to compute x k cos θ ad y k si θ each, ad uses a pipelie techique to maximize the clock frequecy (Figure 6). Figure 7 illustrates the architecture of V θ usig a block RAM. A block RAM i the FPGA is dual port architecture. Xilix Virtex-6 Family has 18Kbit dual-port block RAMs, which have two sets of ports operated idepedetly. Two sets of ports are: Port Set A ADDRA (ADDRess A), DOA (Data Output A), DIA (Data Iput A), ad Port Set B ADDRB (ADDRess B), DOB (Data Output B), DIB (Data Iput B). Let M[i] deote a data of address i of the block RAM. I read operatio of Port Set A, M[ADDRA] is output from x k y k x k cos θ y k si θ x k cos θ y k cos θ Figure 5. Two DSP slices X θ ad Y θ with a adder ad subtractor to compute DOA after the risig clock edge. I write operatio of Port Set A, the data give to DIA is writte i M[ADDRA] at the risig clock edge. Read/write operatios of Port Set B are the same as Port Set A. Port Set A ad Port Set B work idepedetly. I the block RAMs i the target device of this work, read/write operatios ca be cofigured as either RF (Read First) mode or WF (Write First) mode. I the RF mode, if readig ad writig operatios are performed to the same address, readig operatio is performed before the

5 x k X 1 X X 89 y k Y 1 Y Y 89 V 90 V 0 V 179 V 1 V 178 V V 91 V 89 (θ, ) Figure 4. The outlie of our FPGA architecture for the Hough trasform x k y k x k (=x k cos0) y k (=y k si90) cos1 si1 x k cos1 y k si1 cos si 89 DSP blocks x k cos 89 DSP blocks y k si cos89 si89 x k cos89 y k si89 Figure 6. Pipelie architecture to compute x k cos θ ad y k si θ with DSP slices readig operatio. Hece the readig data is the data before writig data. O the other had, i the WF mode, sice the writig performed before the readig, the readig data is the updated data. However, whe a dual port is used, there is a restrictio that if read ad write operatios to the same address are performed for each port, the settig of block RAMs must be RF [3]. We use the block RAM to store the values of v[θ][] ( < ). Let v θ [i] deote the data of address i i block RAM V θ. Sice is give to it ADDRA, v θ [] is output from DOA after the risig clock edge as illustrated i Figure 7. After that, v θ [] + 1 is computed ad it is give to DOB. Sice is give to ADDB, v θ [] + 1 is writte i v θ []. I other words, v θ [] v θ [] + 1 is performed. At that time, accordig to the restrictio stated i the above, sice the same value of may be iput cotiuously, the settig of block RAMs must be RF. Namely, whe the same value of is iput cotiuously, the former voted value is ot read from the block RAM. To avoid this situatio, we use a additioal register to store the latest voted value ad if the same value of is iput cotiuously, the stored value is used istead of the value read from the block RAM. I the same time, a comparator is used to determie if v θ [] + 1 = threshold. If so, the value of is writte i a register. After that, a pair (θ, ) is writte ito a ext register. The pair (θ, ) represets a probable lie. It moves toward the output of the circuit usig series of shift registers oe by oe show i Figure 4. I order to reduce the umber of clock cycles ecessary to move data to the output, we use two series of shift registers. Oe is used for output data of V 0,..., V 89. The other is used for output data of V 90,..., V 179. Therefore, the umber of clock cycles ecessary to move data to the output is reduced to at most 90 clock cycles. B. Data represetatio The choice of data precisio is guided by the implemetatio cost i terms of area, simplicity of desig, speed ad power cosumptio. Higher precisio will lead to less quatizatio error i the fial implemetatio. O the other had, lower precisio will produce more compactio ad faster desigs with less power cosumptio. A trade-off choice eeds to be made depedig o the give applicatio ad available FPGA resources. I our work, i order to miimize chip space ad computatio time, short fixed poit represetatio of umbers

6 = v θ [] + 1 threshold = θ block RAM ADDRA DOA ADDRB DIB +1 v θ [] usig Xilix ISE I the implemetatio, to reduce the delay of the circuit, some pipelie registers are iserted ito betwee circuit elemets. It takes 3 clock cycles to compute the values of for give x k ad y k. Also, 4 clock cycles are ecessary to output a pair (θ, ) that represets a probable lie. Moreover, the umber of clock cycles ecessary to move data to the output is reduced to at most 90 clock cycles. Therefore, this circuit ca output idetified straight lies represeted by (θ, ) i m + 97 m+97 cycles, i.e., µs. For example, Figure 1(b) icludes 333 edge poits. Therefore, the circuit ca perform the Hough trasform i µs. Also, if all the poits of a image of size 51 51(= 6144) are edge poits, it takes µs to complete to output the results. Of course, it is ot possible that all poits are edge poits, however, this fact guaratees that our Hough trasform implemetatio for ay image termiates i less tha µs. (θ, ) Table I PERFORMANCE EVALUATION OF THE PROPOSED ARCHITECTURE FOR THE HOUGH TRANSFORM Figure 7. A block RAM V θ to store v[θ][] DSP48E1 slices (out of 768) 178 (3.1%) 18Kbit block RAMs (out of 83) 180 (1.6%) Slices (out of ) (4.81%) Clock frequecy [MHz] are used. Cosiderig the structure of DSP slices ad block RAMs, we choose the data presetatio i our implemetatio, as follows. The data format of iputs that are pairs of coordiates x k ad y k are 10bit two s complemet iteger each. Also, the data format of cos θ ad si θ is 16bit fixed poit umber, which cosists of 1bit sig, 1bit iteger ad 14bit fractio based o two s complemet. O the other had, the data format of is 10bit two s complemet iteger. The data format of the voted value is 18bit iteger. Namely, the umber of the vote is at most Sice the rage of the value of θ is 0 to 180, the data format of θ is 8bit iteger. IV. EXPERIMENTAL RESULTS We have implemeted ad evaluated our proposed methods of the Hough trasform o the FPGA. For the purpose of estimatig the speed up of our implemetatios, we have also implemeted a covetioal software approach of the Hough trasform usig GNU C. We have used Itel Xeo X7460 ruig i.66ghz ad 18GB memory to ru the sequetial algorithm for the Hough trasform. For the image show i Figure 1(b) that icludes 333 edge poits, the software implemetatio ca perform the Hough trasform i 37.10ms. If the iput image is worst case i terms of the computig time, that is, if all the poits of a image of size 51 51(= 6144) are edge poits, it takes 359.7ms to complete to output the results. I the evaluatio, we have used the Xilix Virtex-6 FPGA XC6VLX40T-1. Table I shows the experimetal results Table II shows the computig time of the Hough trasform for Figures 1(b), 8(a), ad 9(a). Accordig to the table, we ca fid that the computig time for both the CPU implemetatio ad the FPGA implemetatio almost depeds o the umber of the edge poits, ot the size of the image. There are a umber of literatures reported to implemet the Hough trasform for lies usig the FPGA show i Sectio I. Performaces such as device, logic blocks, DSP slices, frequecy ad throughput are compared i Table III. It is difficult to directly compare to other works because utilized FPGAs ad supported size of images differ. Cosiderig the throughput, however, it is clear that the performace of our FPGA implemetatio is better tha that of other works. Table III COMPARISON WITH RELATED WORKS FOR THE HOUGH TRANSFORM USING FPGAS Karaberou [16] Deg [17] Device XC4010EPC84 XC4010XL Logic blocks 05 CLBs 333 CLBs DSP slices Frequecy 3.166MHz 40MHz Throughput Mpixel/s 0.63Mpixel/s Lee [18] This work Device Virtex 4 XC6VLX40T-1 Logic blocks 314 CLBs Slices DSP slices 178 DSP48E1s Frequecy 13MHz MHz Throughput 3.768Mpixel/s 45.48Mpixel/s

7 (a) Iput biary edge image (b) Lie detectio usig the Hough trasform Figure 8. Example of straight lie detectio usig the Hough trasform ( , 393 edge poits) (a) Iput biary edge image (b) Lie detectio usig the Hough trasform Figure 9. Example of straight lie detectio usig the Hough trasform ( , 8009 edge poits) Table II COMPUTING TIME OF THE HOUGH TRANSFORM Image Size # edge poits Time (FPGA) Time (CPU) Speed-up Figure 1(b) µs 37.10ms 73.3 Figure 8(a) µs 7.47ms 88.3 Figure 9(a) µs 11.64ms 37.4 V. CONCLUSIONS We have preseted a ew FPGA implemetatio for the Hough trasform that idetifies straight lies i a biary image. Our basic idea is to partitio the votig space ad the votig operatio is performed i parallel. I our implemetatio, we utilize DSP slices ad block RAMs o the Virtex-6 Family FPGA. Partitioig the parameter space to vote, the 180 votig operatios are performed i parallel with 178 DSP48E1 slices ad Kbit block RAMs. We have implemeted our architecture o the Virtex-6 Family FPGA XC6VLX40T-1. The experimetal results show that this implemetatio rus i MHz ad give m coordiates of edge poits, it ca output idetified straight m+97 lies i m + 97 cycles, i.e., µs. The implemetatio results show that the Hough trasform for a image with 333 edge poits ca be doe i µs o the FPGA. REFERENCES [1] P. V. C. Hough, Method ad meas for recogizig complex patters, U.S. Patet 3,069,654, 196. [] R. O. Duda ad P. E. Hart, Use of the Hough trasformatio to detect lies ad curves i pictures, Commuicatios of the ACM, vol. 15, o. 1, pp , 197. [3] Xilix Ic., Virtex-6 FPGA DSP48E1 Slice User Guide (v1.3), 011. [4] J. L. Bordim, Y. Ito, ad K. Nakao, Acceleratig the CKY parsig usig FPGAs, IEICE Trasactios o Iformatio ad Systems, vol. E86-D, o. 5, pp , May 003. [5], Istace-specific solutios to accelerate the CKY parsig for large cotext-free grammars, Iteratioal Joural o Foudatios of Computer Sciece, pp , 004.

8 [6] Y. Ito ad K. Nakao, Low-latecy coected compoet labelig usig a FPGA, Iteratioal Joural o Foudatios of Computer Sciece, pp , 010. [7] Y. Ago, Y. Ito, ad K. Nakao, A FPGA implemetatio for eural etworks with the FDFM processor core approach, Iteratioal Joural of Parallel, Emerget ad Distributed Systems, pp. 1 13, 01. [] Xilix Ic., Virtex-6 Family Overview(v.4), 01. [3], Virtex-6 FPGA Memory Resources User Guide (v1.6), 011. [8] Y. Ito ad K. Nakao, A ew FM screeig method to geerate cluster-dot biary images usig the local exhaustive search with FPGA acceleratio, Iteratioal Joural o Foudatios of Computer Sciece, pp , 008. [9], Efficiet exhaustive verificatio of the Collatz cojecture usig DSP blocks of Xilix FPGAs, Iteratioal Joural of Networkig ad Computig, vol. 1, o. 1, pp. 49 6, 011. [10] Y. Ito, K. Nakao, ad S. Bo, The parallel FDFM processor core approach for CRT-based RSA decryptio, Iteratioal Joural of Networkig ad Computig, vol., o. 1, pp , 01. [11] K. Nakao ad E. Takamichi, A image retrieval system usig FPGAs, IEICE Trasactios o Iformatio ad Systems, vol. E86-D, o. 5, pp , May 003. [1] K. Nakao ad Y. Yamagishi, Hardware choose k couters with applicatios to the partial exhaustive search, IEICE Tras. o Iformatio & Systems, 005. [13] S. Tagzout, K. Achour, ad O. Djekoue, Hough trasform algorithm for FPGA implemetatio, Sigal Processig, vol. 81, o. 6, pp , 001. [14] H. Bessalah, S. Seddiki, F. Alim, ad M. Becherif, O lie mode icremetal Hough trasform implemetatio o Xilix fpga s, i Proc. of the 8th coferece o Sigal, Speech ad image processig, 008, pp [15] O. Djekoue ad K. Achour, Icremetal Hough trasform: a improved algorithm for digital device implemetatio, Real-Time Imagig, vol. 10, o. 6, pp , 004. [16] S. M. Karaberou ad F. Terrati, Real-time FPGA implemetatio of Hough trasform usig gradiet ad CORDIC algorithm, Image ad Visio Computig, vol. 3, o. 11, pp , 005. [17] D. D. S. Deg ad H. ElGidy, High-speed parameterisable Hough trasform usig recofigurable hardware, i Proc. of the Pa-Sydey area workshop o Visual iformatio processig, vol. 11, 001, pp [18] P. Lee ad A. Evagelos, A implemetatio of a multiplierless Hough trasform o a FPGA platform usig hybrid-log arithmetic, i Proc. of Real-Time Image Processig 008, vol. 6811, 008, pp G 1. [19] Xilix Ic., Virtex-4 FPGA User Guide(v.6), 008. [0], Virtex-5 FPGA User Guide(v5.), 009. [1] Altera Corp., Stratix V Device Hadbook, 01.

An Efficient Implementation of the Gradient-based Hough Transform using DSP slices and block RAMs on the FPGA

An Efficient Implementation of the Gradient-based Hough Transform using DSP slices and block RAMs on the FPGA A Efficiet Implemetatio of the Gradiet-based Hough Trasform usig DSP slices ad block RAMs o the FPGA Xi Zhou, Yasuaki Ito, ad Koji Nakao Departmet of Iformatio Egieerig Hiroshima Uiversity Kagamiyama 1-4-1,

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time ( 3.1) Aalysis of Algorithms Iput Algorithm Output A algorithm is a step- by- step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects.

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Ruig Time Most algorithms trasform iput objects ito output objects. The

More information

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs

What are we going to learn? CSC Data Structures Analysis of Algorithms. Overview. Algorithm, and Inputs What are we goig to lear? CSC316-003 Data Structures Aalysis of Algorithms Computer Sciece North Carolia State Uiversity Need to say that some algorithms are better tha others Criteria for evaluatio Structure

More information

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments Ruig Time Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Most algorithms trasform iput objects ito output objects. The

More information

Data Structures and Algorithms. Analysis of Algorithms

Data Structures and Algorithms. Analysis of Algorithms Data Structures ad Algorithms Aalysis of Algorithms Outlie Ruig time Pseudo-code Big-oh otatio Big-theta otatio Big-omega otatio Asymptotic algorithm aalysis Aalysis of Algorithms Iput Algorithm Output

More information

Outline and Reading. Analysis of Algorithms. Running Time. Experimental Studies. Limitations of Experiments. Theoretical Analysis

Outline and Reading. Analysis of Algorithms. Running Time. Experimental Studies. Limitations of Experiments. Theoretical Analysis Outlie ad Readig Aalysis of Algorithms Iput Algorithm Output Ruig time ( 3.) Pseudo-code ( 3.2) Coutig primitive operatios ( 3.3-3.) Asymptotic otatio ( 3.6) Asymptotic aalysis ( 3.7) Case study Aalysis

More information

Accuracy Improvement in Camera Calibration

Accuracy Improvement in Camera Calibration Accuracy Improvemet i Camera Calibratio FaJie L Qi Zag ad Reihard Klette CITR, Computer Sciece Departmet The Uiversity of Aucklad Tamaki Campus, Aucklad, New Zealad fli006, qza001@ec.aucklad.ac.z r.klette@aucklad.ac.z

More information

Chapter 3 Classification of FFT Processor Algorithms

Chapter 3 Classification of FFT Processor Algorithms Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As

More information

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs

Outline. Research Definition. Motivation. Foundation of Reverse Engineering. Dynamic Analysis and Design Pattern Detection in Java Programs Dyamic Aalysis ad Desig Patter Detectio i Java Programs Outlie Lei Hu Kamra Sartipi {hul4, sartipi}@mcmasterca Departmet of Computig ad Software McMaster Uiversity Caada Motivatio Research Problem Defiitio

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

Simple and Fast Parallel Algorithms for the Voronoi Maps and the Euclidean Distance Map, with GPU implementations

Simple and Fast Parallel Algorithms for the Voronoi Maps and the Euclidean Distance Map, with GPU implementations Simple ad Fast Parallel Algorithms for the Vorooi Maps ad the Euclidea Distace Map, with GPU implemetatios Takumi Hoda, Shiosuke Yamamoto, Hiroaki Hoda, Koji Nakao, Yasuaki Ito Departmet of Iformatio Egieerig

More information

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS APPLICATION NOTE PACE175AE BUILT-IN UNCTIONS About This Note This applicatio brief is iteded to explai ad demostrate the use of the special fuctios that are built ito the PACE175AE processor. These powerful

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

Alpha Individual Solutions MAΘ National Convention 2013

Alpha Individual Solutions MAΘ National Convention 2013 Alpha Idividual Solutios MAΘ Natioal Covetio 0 Aswers:. D. A. C 4. D 5. C 6. B 7. A 8. C 9. D 0. B. B. A. D 4. C 5. A 6. C 7. B 8. A 9. A 0. C. E. B. D 4. C 5. A 6. D 7. B 8. C 9. D 0. B TB. 570 TB. 5

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

Fast Fourier Transform (FFT) Algorithms

Fast Fourier Transform (FFT) Algorithms Fast Fourier Trasform FFT Algorithms Relatio to the z-trasform elsewhere, ozero, z x z X x [ ] 2 ~ elsewhere,, ~ e j x X x x π j e z z X X π 2 ~ The DFS X represets evely spaced samples of the z- trasform

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Neural Networks A Model of Boolean Functions

Neural Networks A Model of Boolean Functions Neural Networks A Model of Boolea Fuctios Berd Steibach, Roma Kohut Freiberg Uiversity of Miig ad Techology Istitute of Computer Sciece D-09596 Freiberg, Germay e-mails: steib@iformatik.tu-freiberg.de

More information

GPUMP: a Multiple-Precision Integer Library for GPUs

GPUMP: a Multiple-Precision Integer Library for GPUs GPUMP: a Multiple-Precisio Iteger Library for GPUs Kaiyog Zhao ad Xiaowe Chu Departmet of Computer Sciece, Hog Kog Baptist Uiversity Hog Kog, P. R. Chia Email: {kyzhao, chxw}@comp.hkbu.edu.hk Abstract

More information

A Very Simple Approach for 3-D to 2-D Mapping

A Very Simple Approach for 3-D to 2-D Mapping A Very Simple Approach for -D to -D appig Sadipa Dey (1 Ajith Abraham ( Sugata Sayal ( Sadipa Dey (1 Ashi Software Private Limited INFINITY Tower II 10 th Floor Plot No. - 4. Block GP Salt Lake Electroics

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 )

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 ) EE26: Digital Desig, Sprig 28 3/6/8 EE 26: Itroductio to Digital Desig Combiatioal Datapath Yao Zheg Departmet of Electrical Egieerig Uiversity of Hawaiʻi at Māoa Combiatioal Logic Blocks Multiplexer Ecoders/Decoders

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers *

Load balanced Parallel Prime Number Generator with Sieve of Eratosthenes on Cluster Computers * Load balaced Parallel Prime umber Geerator with Sieve of Eratosthees o luster omputers * Soowook Hwag*, Kyusik hug**, ad Dogseug Kim* *Departmet of Electrical Egieerig Korea Uiversity Seoul, -, Rep. of

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter The Processor Part A path Desig Itroductio CPU performace factors Istructio cout Determied by ISA ad compiler. CPI ad

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

UNIVERSITY OF MORATUWA

UNIVERSITY OF MORATUWA UNIVERSITY OF MORATUWA FACULTY OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING B.Sc. Egieerig 2014 Itake Semester 2 Examiatio CS2052 COMPUTER ARCHITECTURE Time allowed: 2 Hours Jauary 2016

More information

BOOLEAN MATHEMATICS: GENERAL THEORY

BOOLEAN MATHEMATICS: GENERAL THEORY CHAPTER 3 BOOLEAN MATHEMATICS: GENERAL THEORY 3.1 ISOMORPHIC PROPERTIES The ame Boolea Arithmetic was chose because it was discovered that literal Boolea Algebra could have a isomorphic umerical aspect.

More information

COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR

COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR 7th Europea Sigal Processig Coferece (EUSIPCO 9 Glasgow, Scotlad, August 4-8, 9 COARSE ANGLE ROTATION MODE CORDIC BASED SINGLE PROCESSING ELEMENT QR-RLS PROCESSOR Qiag Gao, Louise Crockett ad Robert Stewart

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS

FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS FREQUENCY ESTIMATION OF INTERNET PACKET STREAMS WITH LIMITED SPACE: UPPER AND LOWER BOUNDS Prosejit Bose Evagelos Kraakis Pat Mori Yihui Tag School of Computer Sciece, Carleto Uiversity {jit,kraakis,mori,y

More information

Computer Graphics Hardware An Overview

Computer Graphics Hardware An Overview Computer Graphics Hardware A Overview Graphics System Moitor Iput devices CPU/Memory GPU Raster Graphics System Raster: A array of picture elemets Based o raster-sca TV techology The scree (ad a picture)

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19 CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

A Study on the Performance of Cholesky-Factorization using MPI

A Study on the Performance of Cholesky-Factorization using MPI A Study o the Performace of Cholesky-Factorizatio usig MPI Ha S. Kim Scott B. Bade Departmet of Computer Sciece ad Egieerig Uiversity of Califoria Sa Diego {hskim, bade}@cs.ucsd.edu Abstract Cholesky-factorizatio

More information

End Semester Examination CSE, III Yr. (I Sem), 30002: Computer Organization

End Semester Examination CSE, III Yr. (I Sem), 30002: Computer Organization Ed Semester Examiatio 2013-14 CSE, III Yr. (I Sem), 30002: Computer Orgaizatio Istructios: GROUP -A 1. Write the questio paper group (A, B, C, D), o frot page top of aswer book, as per what is metioed

More information

Isn t It Time You Got Faster, Quicker?

Isn t It Time You Got Faster, Quicker? Is t It Time You Got Faster, Quicker? AltiVec Techology At-a-Glace OVERVIEW Motorola s advaced AltiVec techology is desiged to eable host processors compatible with the PowerPC istructio-set architecture

More information

Analysis of Algorithms

Analysis of Algorithms Presetatio for use with the textbook, Algorithm Desig ad Applicatios, by M. T. Goodrich ad R. Tamassia, Wiley, 2015 Aalysis of Algorithms Iput 2015 Goodrich ad Tamassia Algorithm Aalysis of Algorithms

More information

Analysis of Algorithms

Analysis of Algorithms Aalysis of Algorithms Ruig Time of a algorithm Ruig Time Upper Bouds Lower Bouds Examples Mathematical facts Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite

More information

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions

Lecturers: Sanjam Garg and Prasad Raghavendra Feb 21, Midterm 1 Solutions U.C. Berkeley CS170 : Algorithms Midterm 1 Solutios Lecturers: Sajam Garg ad Prasad Raghavedra Feb 1, 017 Midterm 1 Solutios 1. (4 poits) For the directed graph below, fid all the strogly coected compoets

More information

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits Egieerig Letters, :, EL Reversible Realizatio of Quaterary Decoder, Multiplexer, ad Demultiplexer Circuits Mozammel H.. Kha, Member, ENG bstract quaterary reversible circuit is more compact tha the correspodig

More information

. Written in factored form it is easy to see that the roots are 2, 2, i,

. Written in factored form it is easy to see that the roots are 2, 2, i, CMPS A Itroductio to Programmig Programmig Assigmet 4 I this assigmet you will write a java program that determies the real roots of a polyomial that lie withi a specified rage. Recall that the roots (or

More information

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control EE 459/500 HDL Based Digital Desig with Programmable Logic Lecture 13 Cotrol ad Sequecig: Hardwired ad Microprogrammed Cotrol Refereces: Chapter s 4,5 from textbook Chapter 7 of M.M. Mao ad C.R. Kime,

More information

MOST of the advanced signal processing algorithms are

MOST of the advanced signal processing algorithms are A edited versio of this work was publiched i IEEE TRANS. ON CIRCUITS AND SYSTEMS II, VOL. 6, NO. 9, SEPT 05 DOI:0.09/TCSII.05.35753 High-Throughput FPGA Implemetatio of QR Decompositio Sergio D. Muñoz

More information

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015

Heaps. Presentation for use with the textbook Algorithm Design and Applications, by M. T. Goodrich and R. Tamassia, Wiley, 2015 Presetatio for use with the textbook Algorithm Desig ad Applicatios, by M. T. Goodrich ad R. Tamassia, Wiley, 201 Heaps 201 Goodrich ad Tamassia xkcd. http://xkcd.com/83/. Tree. Used with permissio uder

More information

EE123 Digital Signal Processing

EE123 Digital Signal Processing Last Time EE Digital Sigal Processig Lecture 7 Block Covolutio, Overlap ad Add, FFT Discrete Fourier Trasform Properties of the Liear covolutio through circular Today Liear covolutio with Overlap ad add

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties

The Counterchanged Crossed Cube Interconnection Network and Its Topology Properties WSEAS TRANSACTIONS o COMMUNICATIONS Wag Xiyag The Couterchaged Crossed Cube Itercoectio Network ad Its Topology Properties WANG XINYANG School of Computer Sciece ad Egieerig South Chia Uiversity of Techology

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045 Oe Brookigs Drive St. Louis, Missouri 63130-4899, USA jaegerg@cse.wustl.edu

More information

Design of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT

Design of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT IOSR Joural of VLSI ad Sigal Processig IOSR-JVSP Volume Issue Ver. I May-Ju. 0 PP 88-9 e-iss: 9 00 p-iss o. : 9 97 www.iosrjourals.org Desig of Efficiet Pipelied Radi- Sigle Path Delay Feedbac FFT isha

More information

Intermediate Statistics

Intermediate Statistics Gait Learig Guides Itermediate Statistics Data processig & display, Cetral tedecy Author: Raghu M.D. STATISTICS DATA PROCESSING AND DISPLAY Statistics is the study of data or umerical facts of differet

More information

New Fuzzy Color Clustering Algorithm Based on hsl Similarity

New Fuzzy Color Clustering Algorithm Based on hsl Similarity IFSA-EUSFLAT 009 New Fuzzy Color Clusterig Algorithm Based o hsl Similarity Vasile Ptracu Departmet of Iformatics Techology Tarom Compay Bucharest Romaia Email: patrascu.v@gmail.com Abstract I this paper

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

An Efficient Implementation Method of Fractal Image Compression on Dynamically Reconfigurable Architecture

An Efficient Implementation Method of Fractal Image Compression on Dynamically Reconfigurable Architecture A Efficiet Implemetatio Method of Fractal Image Compressio o Dyamically Recofigurable Architecture Hidehisa Nagao, Akihiro Matsuura, ad Akira Nagoya NTT Commuicatio Sciece Laboratories 2-4 Hikaridai, Seika-cho,

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

Computer Systems - HS

Computer Systems - HS What have we leared so far? Computer Systems High Level ENGG1203 2d Semester, 2017-18 Applicatios Sigals Systems & Cotrol Systems Computer & Embedded Systems Digital Logic Combiatioal Logic Sequetial Logic

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations Applied Mathematical Scieces, Vol. 1, 2007, o. 25, 1203-1215 A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045, Oe

More information

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence _9.qxd // : AM Page Chapter 9 Sequeces, Series, ad Probability 9. Sequeces ad Series What you should lear Use sequece otatio to write the terms of sequeces. Use factorial otatio. Use summatio otatio to

More information

Appendix D. Controller Implementation

Appendix D. Controller Implementation COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Appedix D Cotroller Implemetatio Cotroller Implemetatios Combiatioal logic (sigle-cycle); Fiite state machie (multi-cycle, pipelied);

More information

DATA SHEET AND USER GUIDE

DATA SHEET AND USER GUIDE V L S I I P O W N E D B Y A V I R A L M I T T A L Mp3HufDec DATA SHEET AND USER GUIDE Mp3HufDec ISO 11172-3 LAYER III HUFFMAN DECODER. ISO 11172-3 Layer III or commoly called as Mp3, employs Huffma ecodi

More information

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM MATEC Web of Cofereces 79, 01014 (016) DOI: 10.1051/ mateccof/0167901014 T 016 BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM Staislav Shidlovskiy 1, 1 Natioal Research

More information

APPLICATION NOTE. Automated Gain Flattening. 1. Experimental Setup. Scope and Overview

APPLICATION NOTE. Automated Gain Flattening. 1. Experimental Setup. Scope and Overview APPLICATION NOTE Automated Gai Flatteig Scope ad Overview A flat optical power spectrum is essetial for optical telecommuicatio sigals. This stems from a eed to balace the chael powers across large distaces.

More information

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS)

CSC165H1 Worksheet: Tutorial 8 Algorithm analysis (SOLUTIONS) CSC165H1, Witer 018 Learig Objectives By the ed of this worksheet, you will: Aalyse the ruig time of fuctios cotaiig ested loops. 1. Nested loop variatios. Each of the followig fuctios takes as iput a

More information

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov

Sorting in Linear Time. Data Structures and Algorithms Andrei Bulatov Sortig i Liear Time Data Structures ad Algorithms Adrei Bulatov Algorithms Sortig i Liear Time 7-2 Compariso Sorts The oly test that all the algorithms we have cosidered so far is compariso The oly iformatio

More information

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process

Euclidean Distance Based Feature Selection for Fault Detection Prediction Model in Semiconductor Manufacturing Process Vol.133 (Iformatio Techology ad Computer Sciece 016), pp.85-89 http://dx.doi.org/10.1457/astl.016. Euclidea Distace Based Feature Selectio for Fault Detectio Predictio Model i Semicoductor Maufacturig

More information

An Algorithm to Solve Multi-Objective Assignment. Problem Using Interactive Fuzzy. Goal Programming Approach

An Algorithm to Solve Multi-Objective Assignment. Problem Using Interactive Fuzzy. Goal Programming Approach It. J. Cotemp. Math. Scieces, Vol. 6, 0, o. 34, 65-66 A Algorm to Solve Multi-Objective Assigmet Problem Usig Iteractive Fuzzy Goal Programmig Approach P. K. De ad Bharti Yadav Departmet of Mathematics

More information

INTERSECTION CORDIAL LABELING OF GRAPHS

INTERSECTION CORDIAL LABELING OF GRAPHS INTERSECTION CORDIAL LABELING OF GRAPHS G Meea, K Nagaraja Departmet of Mathematics, PSR Egieerig College, Sivakasi- 66 4, Virudhuagar(Dist) Tamil Nadu, INDIA meeag9@yahoocoi Departmet of Mathematics,

More information

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation

Task scenarios Outline. Scenarios in Knowledge Extraction. Proposed Framework for Scenario to Design Diagram Transformation 6-0-0 Kowledge Trasformatio from Task Scearios to View-based Desig Diagrams Nima Dezhkam Kamra Sartipi {dezhka, sartipi}@mcmaster.ca Departmet of Computig ad Software McMaster Uiversity CANADA SEKE 08

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

On (K t e)-saturated Graphs

On (K t e)-saturated Graphs Noame mauscript No. (will be iserted by the editor O (K t e-saturated Graphs Jessica Fuller Roald J. Gould the date of receipt ad acceptace should be iserted later Abstract Give a graph H, we say a graph

More information

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that.

CSE 2320 Notes 8: Sorting. (Last updated 10/3/18 7:16 PM) Idea: Take an unsorted (sub)array and partition into two subarrays such that. CSE Notes 8: Sortig (Last updated //8 7:6 PM) CLRS 7.-7., 9., 8.-8. 8.A. QUICKSORT Cocepts Idea: Take a usorted (sub)array ad partitio ito two subarrays such that p q r x y z x y y z Pivot Customarily,

More information

A Parallel Reconfigurable Architecture for Real-Time Stereo Vision

A Parallel Reconfigurable Architecture for Real-Time Stereo Vision 2009 Iteratioal Cofereces o Embedded Software ad Systems A Parallel Recofigurable Architecture for Real-Time Stereo Visio Lei Che Yude Jia Beijig Laboratory of Itelliget Iformatio Techology, School of

More information

arxiv: v2 [cs.ds] 24 Mar 2018

arxiv: v2 [cs.ds] 24 Mar 2018 Similar Elemets ad Metric Labelig o Complete Graphs arxiv:1803.08037v [cs.ds] 4 Mar 018 Pedro F. Felzeszwalb Brow Uiversity Providece, RI, USA pff@brow.edu March 8, 018 We cosider a problem that ivolves

More information

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA

Creating Exact Bezier Representations of CST Shapes. David D. Marshall. California Polytechnic State University, San Luis Obispo, CA , USA Creatig Exact Bezier Represetatios of CST Shapes David D. Marshall Califoria Polytechic State Uiversity, Sa Luis Obispo, CA 93407-035, USA The paper presets a method of expressig CST shapes pioeered by

More information

Counting the Number of Minimum Roman Dominating Functions of a Graph

Counting the Number of Minimum Roman Dominating Functions of a Graph Coutig the Number of Miimum Roma Domiatig Fuctios of a Graph SHI ZHENG ad KOH KHEE MENG, Natioal Uiversity of Sigapore We provide two algorithms coutig the umber of miimum Roma domiatig fuctios of a graph

More information

Lecture 1: Introduction and Strassen s Algorithm

Lecture 1: Introduction and Strassen s Algorithm 5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access

More information

Homework 1 Solutions MA 522 Fall 2017

Homework 1 Solutions MA 522 Fall 2017 Homework 1 Solutios MA 5 Fall 017 1. Cosider the searchig problem: Iput A sequece of umbers A = [a 1,..., a ] ad a value v. Output A idex i such that v = A[i] or the special value NIL if v does ot appear

More information

Automatic Generation of Polynomial-Basis Multipliers in GF (2 n ) using Recursive VHDL

Automatic Generation of Polynomial-Basis Multipliers in GF (2 n ) using Recursive VHDL Automatic Geeratio of Polyomial-Basis Multipliers i GF (2 ) usig Recursive VHDL J. Nelso, G. Lai, A. Teca Abstract Multiplicatio i GF (2 ) is very commoly used i the fields of cryptography ad error correctig

More information

CS Polygon Scan Conversion. Slide 1

CS Polygon Scan Conversion. Slide 1 CS 112 - Polygo Sca Coversio Slide 1 Polygo Classificatio Covex All iterior agles are less tha 180 degrees Cocave Iterior agles ca be greater tha 180 degrees Degeerate polygos If all vertices are colliear

More information

CS 683: Advanced Design and Analysis of Algorithms

CS 683: Advanced Design and Analysis of Algorithms CS 683: Advaced Desig ad Aalysis of Algorithms Lecture 6, February 1, 2008 Lecturer: Joh Hopcroft Scribes: Shaomei Wu, Etha Feldma February 7, 2008 1 Threshold for k CNF Satisfiability I the previous lecture,

More information

FPGA IMPLEMENTATION OF BASE-N LOGARITHM. Salvador E. Tropea

FPGA IMPLEMENTATION OF BASE-N LOGARITHM. Salvador E. Tropea FPGA IMPLEMENTATION OF BASE-N LOGARITHM Salvador E. Tropea Electróica e Iformática Istituto Nacioal de Tecología Idustrial Bueos Aires, Argetia email: salvador@iti.gov.ar ABSTRACT I this work, we preset

More information

Chapter 3. Floating Point Arithmetic

Chapter 3. Floating Point Arithmetic COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 3 Floatig Poit Arithmetic Review - Multiplicatio 0 1 1 0 = 6 multiplicad 32-bit ALU shift product right multiplier add

More information

Counting Regions in the Plane and More 1

Counting Regions in the Plane and More 1 Coutig Regios i the Plae ad More 1 by Zvezdelia Stakova Berkeley Math Circle Itermediate I Group September 016 1. Overarchig Problem Problem 1 Regios i a Circle. The vertices of a polygos are arraged o

More information

CSE 417: Algorithms and Computational Complexity

CSE 417: Algorithms and Computational Complexity Time CSE 47: Algorithms ad Computatioal Readig assigmet Read Chapter of The ALGORITHM Desig Maual Aalysis & Sortig Autum 00 Paul Beame aalysis Problem size Worst-case complexity: max # steps algorithm

More information

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO Sagwo Seo, Trevor Mudge Advaced Computer Architecture Laboratory Uiversity of Michiga at A Arbor {swseo, tm}@umich.edu Yumig Zhu, Chaitali

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

One advantage that SONAR has over any other music-sequencing product I ve worked

One advantage that SONAR has over any other music-sequencing product I ve worked *gajedra* D:/Thomso_Learig_Projects/Garrigus_163132/z_productio/z_3B2_3D_files/Garrigus_163132_ch17.3d, 14/11/08/16:26:39, 16:26, page: 647 17 CAL 101 Oe advatage that SONAR has over ay other music-sequecig

More information

Eigenimages. Digital Image Processing: Bernd Girod, Stanford University -- Eigenimages 1

Eigenimages. Digital Image Processing: Bernd Girod, Stanford University -- Eigenimages 1 Eigeimages Uitary trasforms Karhue-Loève trasform ad eigeimages Sirovich ad Kirby method Eigefaces for geder recogitio Fisher liear discrimat aalysis Fisherimages ad varyig illumiatio Fisherfaces vs. eigefaces

More information

Multiprocessors. HPC Prof. Robert van Engelen

Multiprocessors. HPC Prof. Robert van Engelen Multiprocessors Prof. Robert va Egele Overview The PMS model Shared memory multiprocessors Basic shared memory systems SMP, Multicore, ad COMA Distributed memory multicomputers MPP systems Network topologies

More information

Chapter 4 The Datapath

Chapter 4 The Datapath The Ageda Chapter 4 The Datapath Based o slides McGraw-Hill Additioal material 24/25/26 Lewis/Marti Additioal material 28 Roth Additioal material 2 Taylor Additioal material 2 Farmer Tae the elemets that

More information

condition w i B i S maximum u i

condition w i B i S maximum u i ecture 10 Dyamic Programmig 10.1 Kapsack Problem November 1, 2004 ecturer: Kamal Jai Notes: Tobias Holgers We are give a set of items U = {a 1, a 2,..., a }. Each item has a weight w i Z + ad a utility

More information

1. SWITCHING FUNDAMENTALS

1. SWITCHING FUNDAMENTALS . SWITCING FUNDMENTLS Switchig is the provisio of a o-demad coectio betwee two ed poits. Two distict switchig techiques are employed i commuicatio etwors-- circuit switchig ad pacet switchig. Circuit switchig

More information

Efficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO

Efficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO Efficiet Hardware Desig for Implemetatio of Matrix Multiplicatio by usig PPI-SO Shivagi Tiwari, Niti Meea Dept. of EC, IES College of Techology, Bhopal, Idia Assistat Professor, Dept. of EC, IES College

More information