Reconfigurable HDL Library Development Platform for Arithmetic and Matrix Operations
|
|
- Franklin Owens
- 5 years ago
- Views:
Transcription
1 IMPLEMENT Tanslate Map Place & Route Intenational Jounal of Compute Applications ( ) Reconfiguable HDL Libay Development Platfom fo Aithmetic and Matix Opeations Semih Aslan Ingam School of Engineeing Texas State Univesity S. Macos, Texas, 78666, USA ABSTRACT Embedded systems used in eal-time applications equie design tools that could be costly and may have long veification cycles. Many design tools use pedefined libaies and costly IPs duing these design and veification cycles, and most of these libaies and IPs ae static and difficult to modify. Many design equiements ae changed duing o afte design and veification cycle, and designes need to addess these changes and modify the system. This could be moe time consuming due to veification cycle and static libaies. It is impotant to have dynamic libaies that could be modified and econfigued based on the applications. This wok ceates econfiguable aithmetic design blocks that could be used fo aithmetic and matix opeations. The econfigued libay development system modifies the equied libay elements using Pel scipting language and veifies them on-the-fly using MATLAB. The development tool impoves design time and educes the veification pocess, but the key point is to use a unified design that combines some of the basic opeations with moe complex opeations to educe aea and powe consumption. The esults indicate that using the econfiguable development tool educes veification time and inceases the poductivity. These libaies include stuctual Veilog HDL codes, testbench files, and MATLAB scipt files fo local customization. Even though the econfiguable HDL libay is used fo FPGA design flow, it could be easily modified fo VLSI design flow. Geneal Tems FPGA, Veilog HDL, Pel, MATLAB Keywods Hadwae optimized, HDL, High Level Synthesis, MATLAB, Optimized Hadwae, Pel, Powe Efficient, Reconfiguable, RTL, Veilog HDL.. INTRODUCTION Designing complex systems such as image and video pocessing, compession, face ecognition, object tacking, multi-standad CODECs, and HD decoding schemes equies many basic and complex aithmetic blocks and a long veification pocess []. These complex designs ae based on many Input/put, pocessos, bus intefaces, memoies and sensos. Many times, these systems can be designed as a single chip that is known as System-On-Chip (SoC) []. Many designes use RTL design flow when SoC ae designed and veified. These design tools use design libaies that ae mostly static and difficult to configue and veify. When a new o modified libay element is needed, designes ty to ceate new libay elements fom scatch o ode new libaies fom the vendo. These could be time consuming and costly fo the designes. Classic design flow that uses static libay and RTL design flow [] [] fo both FPGA and ASIC is shown in Figue. Algoithm RTL Libay Logic Synthesis Fomal Poof Timing Fig : FPGA RTL level synthesis flow Bit File An algoithm can be conveted to RTL using the behavioal desciption model, pedefined libaies and IP coes. Afte completing this RTL code, fomal veification must be done befoe implementation. Afte implementation of the RTL code, timing veification needs to be done fo pope opeation. When a equied change o futue modification is equied by the costume, the design needs to go though same synthesis flow. This dastically inceases time-to-maket (TTM) if new libaies need to be used, because it will equie evey pat of the design to be veified. This will cause longe veification peiod of the design and it will incease design cost. The design and veification of the libaies and oveall design shown in Figue can take up 4-5% of the Time to Maket (TTM). The RTL design that is shown in Figue becomes costly and impactical fo lage system change and updates. Fo example, a base system design team that is woking on 3G wants to move to 4G design using FPGA can have shote TTM if the team uses econfiguable pe-veified libaies [3]. One method to ovecome this poblem is to intoduce high level languages to the design cycle. Because of the extensive wok done in Electonics System Level Design (ESLD), HW/SW co-design of a system and High Level Synthesis (HLS) [3][4] ae integated into FPGA and ASIC design flow. RTL desciption of a system can be implemented fom a behavioal desciption of the system in Pel, C, Python and MATLAB [5]. This will esult in a faste veification pocess and shote TTM. This HLS idea focuses on design as whole. Some designes want to contol the design and make cetain changes in HDL code, but this could have a low possibility whee HLS is used. The poposed design uses the scipting language Pel to ceate and MATLAB to veify each equied libay file. This integation is shown in Figue, and it intoduces MATLAB and Pel into synthesis flow and veifies all the libay design files. Each libay element can be ceated and tested independently without inteupting the design and veification flow. This enables softwae designes to join the design pocess duing hadwae design and veification. Afte the veification pocess, the design can be implemented using FPGA synthesis tools. 4
2 IMPLEMENT Tanslate Map Place & Route Intenational Jounal of Compute Applications ( ) Algoithm Fomal Poof Pel MATLAB Libay RTL Timing Bit File USER CONSTRAINTS PERL Scipt Veilog HDL Design File Veilog HDL Testbench File Fig. : Poposed FPGA high level synthesis flow The next section will descibe the poposed design and libay geneation pocess. Section 3 will focus on the desciption of each block, and the conclusion will descibe futue wok and impovements.. PROPOSED DESIGN The poposed design focuses on designing a econfiguable libay that could be used fo a new o updated design. The TTM will be shote with a design pinciple simila to HLS, as explained above. The main wok focuses on designing libay blocks that could be used and modified based on custome need. The libay design will be done using Pel command line inteface. The cuent libay development platfom suppots; OS platfoms such as Windows, Linux and Mac OS X Customized ange and accuacy FPGA o ASIC suppot Vendo based IP coe integation Aea and powe optimized Use defined module and file names Veilog HDL suppot n-bit Fixed point numbe system (<n<9) Signed o unsigned numbe systems Testbench geneation Automated testbench with MATLAB Modelsim.do file fo fast automation Eo compaison with MATLAB Use defined test data option The development platfom ceates the libay blocks fo desied opeations such as addition, multiplication, etc. based on use constaints. The poposed design is explained and synthesized and implemented with Xilinx FPGAs [6] fo opeational veification. Libay geneation and veification flow is shown in Figue 3. MATLAB Scipt File Iveilog/ MODELSIM Veification and Eo Files Fig. 3: Libay geneation and veification flow The poposed development platfom is witten in Pel scipting language. The platfom accepts use constaints such as file name, numbe of input bits, test vecto numbe, etc. It geneates the Veilog HDL, testbench, MATLAB and batch files, and executes the batch file. The batch file opens MATLAB to geneate testvectos and Modelsim o Iveilog fo functional veification. Afte unning Modelsim o Iveilog, the output files ae geneated fo veification. These files ae tansfeed into MATLAB to compae with the expected esults, and MATLAB displays the eo plots. This method can design and veify the design much shote time, and allows the use to pefom a full veification (veification time can incease based on platfom and compute specs) o specify his/he testvectos fo custom veification. 3. BUILDING BLOCKS Basic aithmetic and logic opeations such as addition, subtaction, multiplication, memoy elements, egistes, multiplexes, demultiplexes, s and s complements systems ae building blocks of most of the systems in DSP and communication systems. By using these basic building blocks, moe complex and useful blocks can be designed. Some of these aithmetic building blocks ae division, squae oot, invese squae oot and CORDIC (used to design tigonometic, hypebolic and exponential functions) and some matix opeations such as addition, and subtaction and multiplication. These libay blocks that ae geneated by the econfiguable development tool ae shown in Figue 4 below. In this section, these libay blocks and thei constuction by the development tool is discussed in detail. Due to the simplicity and extensive eseach done on memoy elements, multiplexes, demultiplexes, shiftes, complement systems, addes and multiplies ae not discussed in detail hee. Hadwae implementation of complex aithmetic opeations and elementay functions is moe difficult than hadwae implementation of basic aithmetic opeations. Thee ae a few algoithms used on today s computes to implement these functions. This chapte includes a bief desciption of these algoithms.the design platfom uses some of these algoithms to implement these elementay functions, as well as some of the basic aithmetic opeations. 4
3 Intenational Jounal of Compute Applications ( ) In In In In In In Shift In In In In In In In In In In In In Mux DeMux In Complement Data_In Memoy Addess CORDIC Matix Add/Sub Matix Multiplication Date_ These algoithms can pefom diffeently based on the equied sub functions. As a futhe example, division using a CORDIC algoithm may equie less hadwae [7] but it may be slow due to linea convegence; on the othe hand, division by the Newton-Raphson [8] method may be faste but it may equie moe hadwae. 3. Addition/Subtaction Thee ae two adde systems designed by the development system. Depending on speed o aea, the development tool chooses Cay-Lookahead Adde (CLA) o Ripple-Cay Adde, espectively. Based on the selection, the subtaction system is designed with the adde. 3. Multiplication The development tool can design sign and unsigned type multiplies based on Aay and Booth multiplies. Based on the design platfom, uses can choose thei multiplies fom an IP coe povided by the manufactues. This may incease the oveall pefomance and suggested method whee IPs ae available. The Pel code that geneates any bit signed multiplie [9][] Veilog is given in Listing below. Listing. Multiplie Veilog file geneation. #!/us/bin/pel # Signed Multiplie use wanings; use stict; my $multiplie; pint "Please Ente Module Name fo Signed Multiplie :"; chomp ($multiplie = <STDIN>); my $bit; pint "Please Ente numbe of bits fo Multiplie(x) :"; chomp ($bit = <STDIN>); my $bit; pint "Please Ente numbe of bits fo Multiplicand (y) :"; chomp ($bit = <STDIN>); my $i; my $taget; while () { my $taget = $multiplie; chomp $taget; if (-d $taget) { pint "$taget is a diectoy. This might ceate poblem\n"; if (-e $taget.".v") { pint "$taget.v aleady exists. \n"; pint "Ente '' to wite to a diffeent name : "; pint "\nente 'o' to ovewite \n"; pint "Ente 'b' to back up to $taget.old\n"; my $choice = <STDIN>; chomp $choice; if ($choice eq "") { elsif ($choice eq "o") { Fig. 4: The econfiguable libay blocks unless (-o $taget.".v") { pint "Can't ovewite $taget.v, it's not yous.\n"; unless (-w $taget.".v") { pint "Can't ovewite $taget: $!\n"; elsif ($choice eq "b") { if ( ename($taget.".v",$taget.".old") ) { pint "OK, moved $taget.v to $taget.old\n"; else { pint "Couldn't ename file: $!\n"; else { pint "I didn't undestand that answe.\n"; last if open OUTPUT, "> $taget.v"; pint "I couldn't wite on $taget: $!\n"; # cannot wite. pint OUTPUT "/*\n"; pint OUTPUT "Name:Signed Multiplie\n"; pint OUTPUT "Designe:Semih Aslan\n"; pint OUTPUT "*/\n"; pint OUTPUT "module $multiplie (x, y, poduct);\n"; pint OUTPUT "paamete M = ",$bit-,"; \n"; pint OUTPUT "paamete N = ",$bit-,"; \n"; pint OUTPUT "input [M-:] x;\n"; pint OUTPUT "input [N-:] y;\n"; pint OUTPUT "output [M+N-:] poduct;\n"; pint OUTPUT "wie sum [M-:][N-:];\n"; pint OUTPUT "wie cay [M-:][N-:];\n"; pint OUTPUT "genva i, j;\n"; pint OUTPUT "geneate fo (i=; i<n; i = i +) begin:\n"; pint OUTPUT " signed_multiplie\n"; pint OUTPUT " if (i==)\n"; pint OUTPUT " fo (j=;j<m;j=j+) begin: fist_ow\n"; pint OUTPUT " if (j==m-)\n"; pint OUTPUT " assign sum[j][i] =!(x[i]&y[n-]),\n"; pint OUTPUT " cay[j][i]=;\n"; pint OUTPUT " else\n"; pint OUTPUT " assign sum[j][i]=x[j]&y[i],\n"; pint OUTPUT " cay[j][i] = ; end\n"; pint OUTPUT " else if (i==n-)\n"; pint OUTPUT " fo (j=;j<m;j=j+) begin: last_ow\n"; pint OUTPUT " if (j==m-)\n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =(x[m-]&y[n- ])+cay[j][i-]+cay[j-][i];\n"; pint OUTPUT " else if (j==)\n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =!(x[m- ]&y[j])+sum[j+][i-];\n"; pint OUTPUT "else \n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =!(x[m- ]&y[j])+sum[j+][i-]+cay[j-][i];end\n"; pint OUTPUT "else \n"; pint OUTPUT " fo (j=;j<m;j=j+) begin : est_ows\n"; pint OUTPUT " if (j==)\n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =(x[j]&y[i])+sum[j+][i- ];\n"; 4
4 Intenational Jounal of Compute Applications ( ) pint OUTPUT " else if (j==m-)\n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =!(x[i]&y[n- ])+cay[m-][i-]+cay[j-][i];\n"; pint OUTPUT "else \n"; pint OUTPUT " assign {cay[j][i],sum[j][i] =(x[j]&y[i])+sum[j+][i- ]+cay[j-][i];end\n"; pint OUTPUT "end endgeneate\n"; pint OUTPUT "geneate fo (i=;i<n;i=i+)\n"; pint OUTPUT " begin: poduct_lowe_pat\n"; pint OUTPUT " assign poduct[i] = sum[][i];\n"; pint OUTPUT " end endgeneate\n"; pint OUTPUT " geneate fo (i=;i<m;i=i+)\n"; pint OUTPUT " begin: poduct_uppe_pat\n"; pint OUTPUT " assign poduct[n-+i] = sum[i][n-]; \n"; pint OUTPUT " end endgeneate\n"; pint OUTPUT " assign poduct[m+n-] = cay[m-][n-] + 'b; \n"; pint OUTPUT " endmodule \n"; 3.3 Division/Squae Root/Invese Squae Root The development tool can design division, squae oot, and invese squae oot hadwae and thei testbenches individually o all togethe using the hadwae euse pinciple. These building blocks ae designed using Newton-Raphson [9][][][] and CORDIC algoithms [3][4][5]. The CORDIC algoithm is used in many diffeent aithmetic and elementay functions except fo division, squae oot and invese squae oot. The division opeation can be witten as: N = D. Q + R () whee N is dividend, D is diviso, Q is quotient, R is emainde and R < D.ulp and sign (R) = sign(n). The unit in the last position (ulp) epesents the lowest tem whee ulp = fo intege numbes and ulp = (adix)-n fo n-bit factional numbes [6] Newton-Raphson Division The Newton-Raphson method shown in Equation () is a well-known technique to find the oot of nonlinea functions. This oot can be calculated using an initial value by appoaching the oot quadatically [8][9][7][8]. The accuacy of the division opeation doubles in each iteation. The initial value estimation can educe the iteation numbe and incease accuacy. X i+ = X i f(x i) f (X i ) The function f X = D can be defined to calculate X X i+ = whee D is the diviso. Afte applying f X and D f X to Equation (), X i+ = can be calculated as D X i+ = X i ( DX i ) (3) Afte n th iteation, the value of X i+ conveges to /D and the quotient can be calculated as Q = N (/D). Design accuacy and eo can be impoved by inceasing the numbe of iteations and bette estimate of initial value [9]. The hadwae implementation of Newton-Raphson division and its iteation steps ae descibed in Figue 5 and Table, espectively. () D N Mux_A Mux_B X Mux_C B 's Comp. x Fig. 5: Newton-Raphson division method A Table below shows the opeation of the division block. In cycle, pe-detemined values of initial appoximation X and D ae fetched into the multiplie. The poduct tem is calculated and its twos complement is stoed in B. In cycle, this complemented value is multiplied by X and the esult is stoed in A as X. Afte thee iteations o six clock cycles, the value of X 3 conveges to /D. Afte calculation of /D, the quotient can be calculated as shown in cycle 7 by multiplying X 3 by N. This algoithm and its pefomance depend on the numbe of iteations and initial appoximation. Calculation of.85/.5 when X =.5 and X =.75 fo diffeent iteations ae shown in Figues 6 and 7, espectively. Opeation Cycle Table. Newton-Raphson division cycles. Mux A Mux B Mux C A B --- -D. X X D. X 4 X D. X 6 X N.X X i+ 43
5 Intenational Jounal of Compute Applications ( ) 3.3. Newton-Raphson Squae Root and Invese Squae Root Hadwae implementation of squae oot and invese squae oot ae possible using Newton-Raphson algoithms that ae simila to division opeations. This makes it possible to have unified division, squae oot and invese squae oot opeations. Invese squae oot opeations can be geneated using the Newton-Raphson algoithm. The function f X = X can be defined to calculate X D i+ =. Afte applying D f X and f X to Equation (); X i+ = X i (3 DX i ) (4) Fig. 6: Calculation of.85/.5 when X =.5 Afte nth iteations, the value of X i+ will convege at the squae oot of D. Calculations of the squae oot can be done by multiplying the final value of Equation (4) by D. The hadwae implementation of Newton-Raphson squae oot and invese squae oot methods and thei iteation steps ae descibed in Figue 9 and Table, espectively. C Shift_R Add 's Comp. X Mux_B D Mux_A Mux_C x A B Fig. 7: Calculation of.85/.5 when X =.75 The development system detemines the numbe of iteations and calculates the optimal initial value based on that iteation as shown in Figue 8 below. Fig. 8: Calculation of.85/.5 fo diffeent initial values. Fig. 9: Newton-Raphson invese-sqt and sqt methods. Table. Newton Raphson invese sqt and sqt cycles. OC A B C X D.X (3 - D.X ) 3 - X.(3 - D.X ) - X.(3 - D.X ) X 5 D.X X (3 - D.X ) 6 - X.(3 - D.X ) - X.(3 - D.X ) X 8 D.X X (3 - D.X ) 9 - X.(3 - D.X ) - X.(3- D.X ) --- D.X Table above shows the opeation of the invese squae oot and squae oot blocks. In cycle, X is calculated. This value is stoed in A. In cycle, X is multiplied by D and the esult D X is stoed in A and - (3 - D X ) is stoed in C. In cycle 3, - (3 - D X ) is multiplied by X and esult - X (3 - D X ) is stoed in A and B. This is the end of the fist iteation. istes A and B ae holding the value of X that will be used in the second iteation. Iteations and 3 ae done in cycles 4 though 9, and 44
6 Intenational Jounal of Compute Applications ( ) the final esult X 3 will convege into calculated by multiplying D D. D. In cycle, D is 3.4 Sine and Cosine Implementations Sine and cosine functions can be implemented using the CORDIC algoithm. The CORDIC (COodinate Rotation DIgital Compute) algoithm was fist descibed in 959 by Jack E. Volde [][3] to eplace the analog esolve in the B-58 bombe s navigation system. Since then, quite a bit of eseach has been done on the topic. This algoithm is used today in digital filtes, FFT, DFT, Kalman filtes, adaptive lattice stuctue, linea algeba applications, singula value decomposition (SVD) calculations, Given s otation and QRD-RLS filteing []. The CORDIC algoithm is based on two modes; vectoing and otation. This algoithm and its deivation ove linea, cicula and hypebolic coodinate systems can compute many elementay functions descibed above. Let us assume that point A otates to B by an angle of otation angle θ, as shown in Figue. This will ceate a new point B(X,Y ), and the elation of the new point B based on pevious point A and the otation angle θ [7][4][5][]. R Y Y x = R. cos β (5) y = R. sin (β) (6) θ R β X B(X,Y ) R X A(X,Y) Fig. : Rotation of vecto A(x,y) to B(x,y ) x = R. cos θ + β (7) y = R. sin (θ + β) (8) Using the tigonometic popeties; sin θ + β = sin θ cos β + cos θ sin β (9) cos θ + β = cos θ cos β sin θ sin β () Putting (9) and () in (7) and (8) x = R. cos θ cos β R. sin θ sin β () y = R. sin θ cos β + R. cos θ sin β () Putting (5) and (6) in () and () x y = cos θ tan θ tan θ x = cos θ x y. tan θ x y (3) (4.a) y = cos θ x. tan θ + y (4.b) In this iteation, the value of tan θ can be chosen in tems of powe of. This will make it easy to implement as hadwae because i is simply an L-R shift. This tansfomation can be done by a sequence of smalle angle otations ( θ i ) in which, n θ = θ i And by using this Equation [3] can be witten as; x i+ y = cos θ i sin θ i i+ sin θ i cos θ i i (5) x i y i (6) tan θ i = i (7) θ i = tan i (8) In addition, the value of the poduct of the cos θ will be constant. This value is known as K i (scaling facto). K i = K i = n i= n i= cos θ i cos tan i (9) () Using the tigonometic popety; cos tan α = () + α Equation () scaling facto would be n K i = i= + i () This scaling facto value will be constant fo lage n (iteation numbe). Using the Equations (8) and () in Equation (6) the following may occu. and x i+ = K i. x i y i ( i ) (3) y i+ = K i. y i + x i ( i ) (4) x i+ = K i. x i d i. y i ( i ) (5) y i+ = K i. x i + d i. x i( i ) (6) The otation of the angle must be updated. This will intoduce a new set of equations: z i+ = z i d i. tan ( i ) (7) Values fo tan ( i ) can be stoed in the memoy. Vaiable d i is the diection of the otation and its value depends on z i. This may be epesented as shown below: d z i i (8) z i < Afte the desciption of the vectoing mode opeation, a geneal desciption of CORDIC algoithm fo linea, cicula and hypebolic coodinates using vectoing and otation modes can be given as shown below [4] []. z i+ = m = cicula coodinates linea coodinates hypebolic coodinates (9) x i+ = x i m i y i i (3) y i+ = y i + i x i i (3) z i i tan i z i i tanh i z i i i if m = if m = if m = (3) 45
7 . Intenational Jounal of Compute Applications ( ) Table 3. CORDIC shift sequences and scaling facto Coo. System M Shift Sequence S m,i Convegence α max Scale Facto K m (n ),,,,i, ,,,i+,... -,,3,4, Using these Equations MATLAB code to fo design and veification ae shown in Figues, and 3, espectively. Table 4. CORDIC pocesso fo thee coodinate systems Coo Sys Rot. / Vec. Rot. Vec. Initializing X = X S Y = Y S Z = β X =/K,n Y = Z = β X = X S Y = Y S Z = β Result Vectos X n = K,n (X S cos(β) - Y S sin(β)) Y n = K,n (Y S cos(β) + X S sin(β)) Z n = β X n = cos(β) Y n = sin(β) Z n = X n = K,n (sgn(x )(sqt(x +y ) Y n = Z n = β+tan - (Y S /X S ) Fig. : MATLAB sine(θ) and cos(θ) calculations Rot. X = X S Y = Y S Z = Z S X n = X S Y n = Y S + X S Y S Z n = Vec. X = X S Y = Y S Z = Z S X n = X S Y n = Z n = Z S + Y S / X S - Rot. X = X S Y = Y S Z = β X =/K -,n Y = Z = β X n = K -,n (X S cosh(β) + Y S sinh(β)) Y n = K -,n (Y S cosh(β) + X S sinh(β)) Z n = X n = cosh(β) Y n = sinh(β) Z n = - Vec. X = X S Y = Y S Z = β X n = K,n (sgn(x )(sqt(x +y ) Y n = Z n = β+tan - (Y S /X S ) Using Table 3 and Table 4, sin(θ) and cos(θ) values can be calculated. The cicula coodinate otation mode CORDIC can be used [4][]. Fig. : Sin(θ) and cos(θ) values between < θ <.5 x i+ = K. x i i y i i (33) y i+ = K. y i + i x i i (34) z i+ = K. z i i tan i (35) To stat the iteation, the following initial values need to be assigned. K =.6676 (36) x = K =.6676 (37) y =, (38) z = θ (39) Fig. 3: Sin(θ) and cos(θ) values between < θ < Sinh, Cosh and e (x) Implementations Sinh and Cosh functions can be implemented using the CORDIC povided above in Table 3 and 4. To calculate sinh(θ) and cosh(θ) values, the hypebolic coodinate otation mode CORDIC can be used [4][]. x i+ = K. x i + i y i i (4) 46
8 . Intenational Jounal of Compute Applications ( ) y i+ = K. y i + i x i i (4) z i+ = K. z i i tanh i (4) To stat the iteation, the following initial values need to be assigned. Afte n th (4). K =.8386 (43) x = K =.8386 (44) iteation, and using the Equations (4), (4) and x n = sinh(θ) (45) y = cosh(θ) (46) z = (47) An exponential function can be calculated using (45) and (46). e x = cosh(θ) + sinh(θ) (48). MATLAB epesentation of sinh, cosh, and exponential functions and eo analysis ae shown in Figues 4, 5, and 6, espectively. Fig. 4: MATLAB code fo sinh(θ), cosh(θ) and exponential function calculations Fig. 6: Sinh(θ), cosh(θ) and e θ values between < θ < 3.6 Matix Opeations The libay design platfom can ceate libay elements fo some basic matix opeations such as addition, subtaction and multiplication. The libay design matix hadwae and veification as same as explained above in Figue Matix Addition/Subtaction Hadwae implementation of matix addition is done as shown in Equations (49) and (5), espectively. C = A + B (49) c i,j = a i,j + b i,j whee i, j =,,, n (5) The conventional design is easy to undestand and implement. The design platfom uses the equation that is given in Equation (5) to calculate all elements of matix C. The matix addition/subtaction block is shown in Figue 7 below. Memoy b, a, b, a, n n n n n n n n n c, c, c i,j b i,j a i,j Memoy Fig. 5: Eo analysis of sin(θ) and cos(θ) Fig. 7: Matix addition/ subtaction block 3.6. Matix Multiplication Matix multiplications [8][] ae heavily used in many signal and image pocessing applications, such as adaptive beamfoming [6][8] and multiple-input-multiple-output (MIMO) systems [8], and factoizations [][][3][4] such as QR factoization and DCT. Matix multiplication equies opeation elements (OE) such as addition and multiplication. In a matix multiplication, the numbe of OEs depends on the matix size. The elation between matix size and the numbe of OEs is quadatic. This made it difficult to implement eal time matix multiplication libaies fo lage 47
9 Intenational Jounal of Compute Applications ( ) matices. The design platfom uses taditional matix multiplication to geneate libay element. Matix multiplication of an m matix A and n matix B poduces a m n matix C [5]. a, a, a m, a m, whee; A m, B,n = C m,n (5) b, b,n = b, b,n c i,j = k= a i,k b k,j c, c,n (5) c m, c m,n (53) This matix multiplication equies m n [M+(-)A] aithmetic opeations, whee M is fo multiplie and A is fo adde. Fo example; multiplication of A 4,3 and B 3,6 esults in C 4,6. This opeation equies 7 multiplication and 48 addition opeations. This shows that a n n matix multiplication equies n 3 multiplications and n (n ) additions. The numbe of multiplications in the matix multiplication opeation inceases by an exponent of thee (n 3 ) with the matix size. The hadwae ealization of a matix multiplication focuses on designing c i,j due to the paallel natue of the system. In geneal, calculation of c i,j is done using a bottom-up appoach. The tem c i,j in Equation (53) can be witten as; c i,j = a i, b,j + a i, b,j + + a i,n b n,j (54) Due to the matix size choice of n = k, the Equation 54 can be witten as c i,j = d + d + + d t (55) This d t calculation block shown in Figue 8Fig is one of the most basic and impotant blocks in matix multiplication. Any matix multiplication can be done using only one of these blocks, an adde and ounding scheme. Howeve, this is impactical fo lage matices due to slow oveall opeation and incease in eo boundaies. Thee ae many diffeent designs [6] that use diffeent numbes of d t calculation blocks to impove speed. The platfom geneates the libaies based on the smalle block that is shown in Figue 8 above and using this block lage blocks can be ceated as shown in Figue 9 below. Lage matix multiplications can be ealized by using these blocks. A 3x3 matix multiplication design block is shown in Figue below. a i,t- b t-,j a i,t b t,j iste iste iste iste iste iste Fig. 9: Realization of x block iste x d t whee; d t = a i,t b t,j + a i,t b t,j t n (56) The hadwae ealization of Equation (56) is shown in Figue 8. a i,t- a i,t- b t-,j b t-,j Memoy a i,t d t Memoy b t,j a i,t b t,j Fig. 8: Hadwae ealization of d t 48
10 Ci,j Intenational Jounal of Compute Applications ( ) a i,3 a i,3 a i,3 a i,9 a i,8 a i,7 a i,6 a i,5 a i,4 a i,3 a i, a i, a i, a i,9 a i,8 a i,7 a i,6 a i,5 a i,4 a i,3 a i, a i, a i, a i,9 a i,8 a i,7 a i,6 a i,5 a i,4 a i,3 a i, a i, b 3,j b 3,j b 3,j b 9,j b 8,j b 7,j b 6,j b 5,j b 4,j b 3,j b,j b,j b,j b 9,j b 8,j b 7,j b 6,j b 5,j b 4,j b 3,j b,j b,j b,j b 9,j b 8,j b 7,j b 6,j b 5,j b 4,j b 3,j b,j b,j x x x x x x x x x x x x x x x x d6 d5 d4 d3 d d d d9 d8 d7 d6 d5 d4 d3 d d x 8 8 x 8 8 x 8 8 x x 6 6 x x 3 4. CONCLUSION An aea efficient, econfiguable HDL libay development platfom fo Aithmetic and Matix Opeations is designed fo digital design and veification. The platfom geneates dynamic design libaies and veification files to impove design and veification of oveall systems. Based on design complexity and equied changes, TTM can be impoved by up to 6%. Even though this platfom is not tue HLS, it could be consideed as hybid HLS. Any designed system can be econfigued at any time in any way using the dynamic libaies without going though the same design and veification hassle. MATLAB-based veification makes it possible to use all the featues of MATLAB fo faste and moe efficient veification. The Pel-based design makes it possible to use all impotant text manipulations in Pel language. Even though command line use inteface is use fiendly, it ceates some hassle fo uses when the libay size inceases. The futue development platfom will integate a use fiendly GUI using Pel/Tk. Anothe futue goal is to make this platfom totally open souce by using only Iveilog and eplacing MATLAB with Octave. In addition, cuent matix libaies ae limited to addition, subtaction and matix multiplication. The futue development platfom will include some impotant matix factoizations such as QR and LU factoizations, and Stassen based matix multiplication fo aea optimization. 5. ACKNOWLEDGMENTS Autho thanks to Xilinx [6] and MATLAB fo thei suppot fo this eseach. 6. REFERENCES [] Andieux, J., M. Feix, G. Mougues, P. Betand, B. Iza, and V. Nguyen. "Optimum Smoothing of the Wigne Ville Distibution." IEEE Tansactions on Acoustics, Speech, and Signal Pocessing 36.5(987): [] Adle, J., K. Delgado, and B. Rao. "Compaison of Basis Selection Methods." Poceeding of IEEE Asiloma Confeence on Signals, Systems, and Compute Fequency and Time Scale Analysis (996): 5-57 [3] Aslan, S., Ouklu, E., and Saniie J., A high-level synthesis and veification tool fo fixed to floating point convesion, IEEE Intenational Midwest Symposium on Cicuits and Systems,, Pages, Fig. : Realization of 3x3 block [4] Desmoulies, C., Aslan, S., Ouklu E., Saniie, J., Vallina, F.M., HW/SW co-design platfom fo image and video pocessing applications on Vitex-5 FPGA using PICO IEEE Intenational Confeence on Electo/Infomation Technology,, Pages, -6. [5] Chen, W. The VLSI Handbook. Boca Raton: CRC Publishe, 7. [6] Xilinx. (6), [7] Kilts, S. Advanced FPGA Design Achitectue, Implementation, and Optimizations. New Yok: Wiley Inte-Science, 7. [8] Stine, J. E. "Digital Compute Aithmetic Datapath Design using Veilog HDL", Nowell, Massachusetts, Kluwe Academic Publishing, 4. [9] Lin, Ming-Bo, "Digital System Design and Pactices Using Veilog HDL and FPGAs", Singapoe, Wiley Publishing, 8. [] Joseph Cavanagh, " Compute Aithmetic and Veilog HDL Fundamentals ", Boca Raton, FL, CRC Pess, Taylo & Fancis Goup,. [] Flynn, M. J., and S. F. Obeman. "Division Algoithms and Implementations." IEEE Tansactions on Computes 46.8 (997): [] Schulte, M. J., and L. K. Wang. "Decimal floating-point squae oot using Newton-Raphson iteation." Application-Specific Systems, Achitectues and Pocessos (5): [3] Volde, J. "The CORDIC Tigonometic Computing Technique." IEEE Tansactions Electonic Computes 8.3 (959): [4] Stiling, W. C., and T. K. Moon. Mathematical Methods and algoithms fo Signal Pocessing. New Jesey: Pentice Hall,. [5] Andaka, R. "A suvey of CORDIC algoithms fo FPGAs." Poceedings of the 998 ACM/SIGDA sixth intenational symposium on Field pogammable gate aays (998): 9-. [6] Lang, T. M., and D. Ecegovac. Digital Aithmetic. San Fancisco: Mogan Kaufmann, 4. 49
11 Intenational Jounal of Compute Applications ( ) [7] Flynn, M. J., and S. F. Obeman. "Division Algoithms and Implementations." IEEE Tansactions on Computes 46.8 (997): [8] Swatzlande, E.E. J., and W.L.Gallaghe. "Fault- Toleant Newton-Raphson and Goldschmidt Divides Using Time Shaed TMR." IEEE Tansactions on Computes 49.6 (): [9] Oma, J., E. E. Swatzlande J., and M. J. Schulte. "Optimal Initial Appoximations fo the Newton- Raphson Division Algoithm." Spinge-Velag Jounal of Computing (994): [] Dehon, A., and S. Hauck. Reconfiguable Computing The Theoy and Pactice of FPGA-Based Computing. Bulington, Massachusetts: Elsevie, 8. [] Teukolsky, S. A., W. T. Vetteling, B. P. Flanney, and W. H. Pess. "Numeical Recipes: The At of Scientific Computing." Numeical Recipes: The At of Scientific Computing, 3d ed. New Yok, New Yok: Cambidge Univesity Pess, 7. [] Eisman, A. M., I. S. Duff, and J. K. Reid. Diect Methods fo Spase Matices. New Yok, United States of Ameica: Oxfod Univesity Pess, 3. [3] Fujii, A, R Suda, and A Nishida. "Paallel Matix Distibution Libay fo Spase Matix Solves." Poceedings of the Eighth Intenational Confeence on High-Pefomance Computing in Asia-Pacific ion (5): 9-6. [4] Aslan, S., E. Ouklu, and J. Saniie. "Realization of aea efficient QR factoization using unified division, squae oot, and invese squae oot hadwae." IEEE Intenational Confeence on Electo/Infomation Technology (9): [5] Watkins, David S. Fundamentals of Matix Computations, nd ed. New Yok, United States of Ameica: John Wiley & Sons,. [6] Hendy, D.C., and A.A. Duncan. "Aea Efficient DSP Datapath Synthesis." Design Automation Confeence (995): IJCA TM : 5
XFVHDL: A Tool for the Synthesis of Fuzzy Logic Controllers
XFVHDL: A Tool fo the Synthesis of Fuzzy Logic Contolles E. Lago, C. J. Jiménez, D. R. López, S. Sánchez-Solano and A. Baiga Instituto de Micoelectónica de Sevilla. Cento Nacional de Micoelectónica, Edificio
More informationAll lengths in meters. E = = 7800 kg/m 3
Poblem desciption In this poblem, we apply the component mode synthesis (CMS) technique to a simple beam model. 2 0.02 0.02 All lengths in metes. E = 2.07 10 11 N/m 2 = 7800 kg/m 3 The beam is a fee-fee
More informationIP Network Design by Modified Branch Exchange Method
Received: June 7, 207 98 IP Netwok Design by Modified Banch Method Kaiat Jaoenat Natchamol Sichumoenattana 2* Faculty of Engineeing at Kamphaeng Saen, Kasetsat Univesity, Thailand 2 Faculty of Management
More informationPositioning of a robot based on binocular vision for hand / foot fusion Long Han
2nd Intenational Confeence on Advances in Mechanical Engineeing and Industial Infomatics (AMEII 26) Positioning of a obot based on binocula vision fo hand / foot fusion Long Han Compute Science and Technology,
More informationCoordinate Systems. Ioannis Rekleitis
Coodinate Systems Ioannis ekleitis Position epesentation Position epesentation is: P p p p x y z P CS-417 Intoduction to obotics and Intelligent Systems Oientation epesentations Descibes the otation of
More informationColor Correction Using 3D Multiview Geometry
Colo Coection Using 3D Multiview Geomety Dong-Won Shin and Yo-Sung Ho Gwangju Institute of Science and Technology (GIST) 13 Cheomdan-gwagio, Buk-ku, Gwangju 500-71, Republic of Koea ABSTRACT Recently,
More informationWe are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors
We ae IntechOpen, the wold s leading publishe of Open Access books Built by scientists, fo scientists,800 6,000 0M Open access books available Intenational authos and editos Downloads Ou authos ae among
More informationSegmentation of Casting Defects in X-Ray Images Based on Fractal Dimension
17th Wold Confeence on Nondestuctive Testing, 25-28 Oct 2008, Shanghai, China Segmentation of Casting Defects in X-Ray Images Based on Factal Dimension Jue WANG 1, Xiaoqin HOU 2, Yufang CAI 3 ICT Reseach
More informationTransmission Lines Modeling Based on Vector Fitting Algorithm and RLC Active/Passive Filter Design
Tansmission Lines Modeling Based on Vecto Fitting Algoithm and RLC Active/Passive Filte Design Ahmed Qasim Tuki a,*, Nashien Fazilah Mailah b, Mohammad Lutfi Othman c, Ahmad H. Saby d Cente fo Advanced
More informationA Memory Efficient Array Architecture for Real-Time Motion Estimation
A Memoy Efficient Aay Achitectue fo Real-Time Motion Estimation Vasily G. Moshnyaga and Keikichi Tamau Depatment of Electonics & Communication, Kyoto Univesity Sakyo-ku, Yoshida-Honmachi, Kyoto 66-1, JAPAN
More informationFrequency Domain Approach for Face Recognition Using Optical Vanderlugt Filters
Optics and Photonics Jounal, 016, 6, 94-100 Published Online August 016 in SciRes. http://www.scip.og/jounal/opj http://dx.doi.og/10.436/opj.016.68b016 Fequency Domain Appoach fo Face Recognition Using
More informationAn Unsupervised Segmentation Framework For Texture Image Queries
An Unsupevised Segmentation Famewok Fo Textue Image Queies Shu-Ching Chen Distibuted Multimedia Infomation System Laboatoy School of Compute Science Floida Intenational Univesity Miami, FL 33199, USA chens@cs.fiu.edu
More informationOptical Flow for Large Motion Using Gradient Technique
SERBIAN JOURNAL OF ELECTRICAL ENGINEERING Vol. 3, No. 1, June 2006, 103-113 Optical Flow fo Lage Motion Using Gadient Technique Md. Moshaof Hossain Sake 1, Kamal Bechkoum 2, K.K. Islam 1 Abstact: In this
More informationJournal of World s Electrical Engineering and Technology J. World. Elect. Eng. Tech. 1(1): 12-16, 2012
2011, Scienceline Publication www.science-line.com Jounal of Wold s Electical Engineeing and Technology J. Wold. Elect. Eng. Tech. 1(1): 12-16, 2012 JWEET An Efficient Algoithm fo Lip Segmentation in Colo
More informationModule 6 STILL IMAGE COMPRESSION STANDARDS
Module 6 STILL IMAE COMPRESSION STANDARDS Lesson 17 JPE-2000 Achitectue and Featues Instuctional Objectives At the end of this lesson, the students should be able to: 1. State the shotcomings of JPE standad.
More informationA New and Efficient 2D Collision Detection Method Based on Contact Theory Xiaolong CHENG, Jun XIAO a, Ying WANG, Qinghai MIAO, Jian XUE
5th Intenational Confeence on Advanced Mateials and Compute Science (ICAMCS 2016) A New and Efficient 2D Collision Detection Method Based on Contact Theoy Xiaolong CHENG, Jun XIAO a, Ying WANG, Qinghai
More informationA Shape-preserving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonuniform Fuzzification Transform
A Shape-peseving Affine Takagi-Sugeno Model Based on a Piecewise Constant Nonunifom Fuzzification Tansfom Felipe Fenández, Julio Gutiéez, Juan Calos Cespo and Gacián Tiviño Dep. Tecnología Fotónica, Facultad
More informationLecture Topics ECE 341. Lecture # 12. Control Signals. Control Signals for Datapath. Basic Processing Unit. Pipelining
EE 341 Lectue # 12 Instucto: Zeshan hishti zeshan@ece.pdx.edu Novembe 10, 2014 Potland State Univesity asic Pocessing Unit ontol Signals Hadwied ontol Datapath contol signals Dealing with memoy delay Pipelining
More informationDesired Attitude Angles Design Based on Optimization for Side Window Detection of Kinetic Interceptor *
Poceedings of the 7 th Chinese Contol Confeence July 6-8, 008, Kunming,Yunnan, China Desied Attitude Angles Design Based on Optimization fo Side Window Detection of Kinetic Intecepto * Zhu Bo, Quan Quan,
More informationMulti-azimuth Prestack Time Migration for General Anisotropic, Weakly Heterogeneous Media - Field Data Examples
Multi-azimuth Pestack Time Migation fo Geneal Anisotopic, Weakly Heteogeneous Media - Field Data Examples S. Beaumont* (EOST/PGS) & W. Söllne (PGS) SUMMARY Multi-azimuth data acquisition has shown benefits
More informationA modal estimation based multitype sensor placement method
A modal estimation based multitype senso placement method *Xue-Yang Pei 1), Ting-Hua Yi 2) and Hong-Nan Li 3) 1),)2),3) School of Civil Engineeing, Dalian Univesity of Technology, Dalian 116023, China;
More informationOPTIMAL KINEMATIC SYNTHESIS OF CRANK & SLOTTED LEVER QUICK RETURN MECHANISM FOR SPECIFIC STROKE & TIME RATIO
OPTIMAL KINEMATIC SYNTHESIS OF CRANK & SLOTTED LEVER QUICK RETURN MECHANISM FOR SPECIFIC STROKE & TIME RATIO Zeeshan A. Shaikh 1 and T.Y. Badguja 2 1,2 Depatment of Mechanical Engineeing, Late G. N. Sapkal
More informationISyE 4256 Industrial Robotic Applications
ISyE 456 Industial Robotic Applications Quiz # Oct. 9, 998 Name This is a closed book, closed notes exam. Show wok fo poblem questions ) ( pts) Please cicle one choice fo each item. a) In an application,
More informationPROBABILITY-BASED OPTIMAL PATH PLANNING FOR TWO-WHEELED MOBILE ROBOTS
Poceedings of the ASME 215 Dynamic Systems and Contol Confeence DSCC215 Octobe 28-3, 215, Columbus, Ohio, USA DSCC215-999 PROBABILITY-BASED OPTIMAL PATH PLANNING FOR TWO-WHEELED MOBILE ROBOTS Jaeyeon Lee
More informationA New Finite Word-length Optimization Method Design for LDPC Decoder
A New Finite Wod-length Optimization Method Design fo LDPC Decode Jinlei Chen, Yan Zhang and Xu Wang Key Laboatoy of Netwok Oiented Intelligent Computation Shenzhen Gaduate School, Habin Institute of Technology
More informationRANDOM IRREGULAR BLOCK-HIERARCHICAL NETWORKS: ALGORITHMS FOR COMPUTATION OF MAIN PROPERTIES
RANDOM IRREGULAR BLOCK-HIERARCHICAL NETWORKS: ALGORITHMS FOR COMPUTATION OF MAIN PROPERTIES Svetlana Avetisyan Mikayel Samvelyan* Matun Kaapetyan Yeevan State Univesity Abstact In this pape, the class
More informationA Minutiae-based Fingerprint Matching Algorithm Using Phase Correlation
A Minutiae-based Fingepint Matching Algoithm Using Phase Coelation Autho Chen, Weiping, Gao, Yongsheng Published 2007 Confeence Title Digital Image Computing: Techniques and Applications DOI https://doi.og/10.1109/dicta.2007.4426801
More informationdc - Linux Command Dc may be invoked with the following command-line options: -V --version Print out the version of dc
- CentOS 5.2 - Linux Uses Guide - Linux Command SYNOPSIS [-V] [--vesion] [-h] [--help] [-e sciptexpession] [--expession=sciptexpession] [-f sciptfile] [--file=sciptfile] [file...] DESCRIPTION is a evese-polish
More informationA Full-mode FME VLSI Architecture Based on 8x8/4x4 Adaptive Hadamard Transform For QFHD H.264/AVC Encoder
20 IEEE/IFIP 9th Intenational Confeence on VLSI and System-on-Chip A Full-mode FME VLSI Achitectue Based on 8x8/ Adaptive Hadamad Tansfom Fo QFHD H264/AVC Encode Jialiang Liu, Xinhua Chen College of Infomation
More informationImage Enhancement in the Spatial Domain. Spatial Domain
8-- Spatial Domain Image Enhancement in the Spatial Domain What is spatial domain The space whee all pixels fom an image In spatial domain we can epesent an image by f( whee x and y ae coodinates along
More informationAdaptation of Motion Capture Data of Human Arms to a Humanoid Robot Using Optimization
ICCAS25 June 2-5, KINTEX, Gyeonggi-Do, Koea Adaptation of Motion Captue Data of Human Ams to a Humanoid Robot Using Optimization ChangHwan Kim and Doik Kim Intelligent Robotics Reseach Cente, Koea Institute
More informationProf. Feng Liu. Fall /17/2016
Pof. Feng Liu Fall 26 http://www.cs.pdx.edu/~fliu/couses/cs447/ /7/26 Last time Compositing NPR 3D Gaphics Toolkits Tansfomations 2 Today 3D Tansfomations The Viewing Pipeline Mid-tem: in class, Nov. 2
More informationa Not yet implemented in current version SPARK: Research Kit Pointer Analysis Parameters Soot Pointer analysis. Objectives
SPARK: Soot Reseach Kit Ondřej Lhoták Objectives Spak is a modula toolkit fo flow-insensitive may points-to analyses fo Java, which enables expeimentation with: vaious paametes of pointe analyses which
More informationHigh performance CUDA based CNN image processor
High pefomance UDA based NN image pocesso GEORGE VALENTIN STOIA, RADU DOGARU, ELENA RISTINA STOIA Depatment of Applied Electonics and Infomation Engineeing Univesity Politehnica of Buchaest -3, Iuliu Maniu
More informationAutomatically Testing Interacting Software Components
Automatically Testing Inteacting Softwae Components Leonad Gallaghe Infomation Technology Laboatoy National Institute of Standads and Technology Gaithesbug, MD 20899, USA lgallaghe@nist.gov Jeff Offutt
More informationCOSC 6385 Computer Architecture. - Pipelining
COSC 6385 Compute Achitectue - Pipelining Sping 2012 Some of the slides ae based on a lectue by David Culle, Pipelining Pipelining is an implementation technique wheeby multiple instuctions ae ovelapped
More informationControlled Information Maximization for SOM Knowledge Induced Learning
3 Int'l Conf. Atificial Intelligence ICAI'5 Contolled Infomation Maximization fo SOM Knowledge Induced Leaning Ryotao Kamimua IT Education Cente and Gaduate School of Science and Technology, Tokai Univeisity
More information4.2. Co-terminal and Related Angles. Investigate
.2 Co-teminal and Related Angles Tigonometic atios can be used to model quantities such as
More informationDEADLOCK AVOIDANCE IN BATCH PROCESSES. M. Tittus K. Åkesson
DEADLOCK AVOIDANCE IN BATCH PROCESSES M. Tittus K. Åkesson Univesity College Boås, Sweden, e-mail: Michael.Tittus@hb.se Chalmes Univesity of Technology, Gothenbug, Sweden, e-mail: ka@s2.chalmes.se Abstact:
More informationParallel processing model for XML parsing
Recent Reseaches in Communications, Signals and nfomation Technology Paallel pocessing model fo XML pasing ADRANA GEORGEVA Fac. Applied Mathematics and nfomatics Technical Univesity of Sofia, TU-Sofia
More informationSYSTEM LEVEL REUSE METRICS FOR OBJECT ORIENTED SOFTWARE : AN ALTERNATIVE APPROACH
I J C A 7(), 202 pp. 49-53 SYSTEM LEVEL REUSE METRICS FOR OBJECT ORIENTED SOFTWARE : AN ALTERNATIVE APPROACH Sushil Goel and 2 Rajesh Vema Associate Pofesso, Depatment of Compute Science, Dyal Singh College,
More informationANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS
ANALYTIC PERFORMANCE MODELS FOR SINGLE CLASS AND MULTIPLE CLASS MULTITHREADED SOFTWARE SERVERS Daniel A Menascé Mohamed N Bennani Dept of Compute Science Oacle, Inc Geoge Mason Univesity 1211 SW Fifth
More informationUCB CS61C : Machine Structures
inst.eecs.bekeley.edu/~cs61c UCB CS61C : Machine Stuctues Lectue SOE Dan Gacia Lectue 28 CPU Design : Pipelining to Impove Pefomance 2010-04-05 Stanfod Reseaches have invented a monitoing technique called
More informationOn Error Estimation in Runge-Kutta Methods
Leonado Jounal of Sciences ISSN 1583-0233 Issue 18, Januay-June 2011 p. 1-10 On Eo Estimation in Runge-Kutta Methods Ochoche ABRAHAM 1,*, Gbolahan BOLARIN 2 1 Depatment of Infomation Technology, 2 Depatment
More informationImproved Fourier-transform profilometry
Impoved Fouie-tansfom pofilomety Xianfu Mao, Wenjing Chen, and Xianyu Su An impoved optical geomety of the pojected-finge pofilomety technique, in which the exit pupil of the pojecting lens and the entance
More informationA Neural Network Model for Storing and Retrieving 2D Images of Rotated 3D Object Using Principal Components
A Neual Netwok Model fo Stong and Reteving 2D Images of Rotated 3D Object Using Pncipal Components Tsukasa AMANO, Shuichi KUROGI, Ayako EGUCHI, Takeshi NISHIDA, Yasuhio FUCHIKAWA Depatment of Contol Engineeng,
More informationHaptic Glove. Chan-Su Lee. Abstract. This is a final report for the DIMACS grant of student-initiated project. I implemented Boundary
Physically Accuate Haptic Rendeing of Elastic Object fo a Haptic Glove Chan-Su Lee Abstact This is a final epot fo the DIMACS gant of student-initiated poject. I implemented Bounday Element Method(BEM)
More informationImprovement of First-order Takagi-Sugeno Models Using Local Uniform B-splines 1
Impovement of Fist-ode Takagi-Sugeno Models Using Local Unifom B-splines Felipe Fenández, Julio Gutiéez, Gacián Tiviño and Juan Calos Cespo Dep. Tecnología Fotónica, Facultad de Infomática Univesidad Politécnica
More informationFifth Wheel Modelling and Testing
Fifth heel Modelling and Testing en Masoy Mechanical Engineeing Depatment Floida Atlantic Univesity Boca aton, FL 4 Lois Malaptias IFMA Institut Fancais De Mechanique Advancee ampus De lemont Feand Les
More informationA Recommender System for Online Personalization in the WUM Applications
A Recommende System fo Online Pesonalization in the WUM Applications Mehdad Jalali 1, Nowati Mustapha 2, Ali Mamat 2, Md. Nasi B Sulaiman 2 Abstact foeseeing of use futue movements and intentions based
More informationIntroduction to Medical Imaging. Cone-Beam CT. Introduction. Available cone-beam reconstruction methods: Our discussion:
Intoduction Intoduction to Medical Imaging Cone-Beam CT Klaus Muelle Available cone-beam econstuction methods: exact appoximate Ou discussion: exact (now) appoximate (next) The Radon tansfom and its invese
More informationMapReduce Optimizations and Algorithms 2015 Professor Sasu Tarkoma
apreduce Optimizations and Algoithms 2015 Pofesso Sasu Takoma www.cs.helsinki.fi Optimizations Reduce tasks cannot stat befoe the whole map phase is complete Thus single slow machine can slow down the
More informationLecture # 04. Image Enhancement in Spatial Domain
Digital Image Pocessing CP-7008 Lectue # 04 Image Enhancement in Spatial Domain Fall 2011 2 domains Spatial Domain : (image plane) Techniques ae based on diect manipulation of pixels in an image Fequency
More informationCOEN-4730 Computer Architecture Lecture 2 Review of Instruction Sets and Pipelines
1 COEN-4730 Compute Achitectue Lectue 2 Review of nstuction Sets and Pipelines Cistinel Ababei Dept. of Electical and Compute Engineeing Maquette Univesity Cedits: Slides adapted fom pesentations of Sudeep
More informationCSE4201. Computer Architecture
CSE 4201 Compute Achitectue Pof. Mokhta Aboelaze Pats of these slides ae taken fom Notes by Pof. David Patteson at UCB Outline MIPS and instuction set Simple pipeline in MIPS Stuctual and data hazads Fowading
More information10/29/2010. Rendering techniques. Global Illumination. Local Illumination methods. Today : Global Illumination Modules and Methods
Rendeing techniques Compute Gaphics Lectue 10 Can be classified as Local Illumination techniques Global Illumination techniques Global Illumination 1: Ray Tacing and Radiosity Taku Komua 1 Local Illumination
More informationElliptic Generation Systems
4 Elliptic Geneation Systems Stefan P. Spekeijse 4.1 Intoduction 4.1 Intoduction 4.2 Two-Dimensional Gid Geneation Hamonic Maps, Gid Contol Maps, and Poisson Systems Discetization and Solution Method Constuction
More informationEgo-Motion Estimation on Range Images using High-Order Polynomial Expansion
Ego-Motion Estimation on Range Images using High-Ode Polynomial Expansion Bian Okon and Josh Haguess Space and Naval Wafae Systems Cente Pacific San Diego, CA, USA {bian.okon,joshua.haguess}@navy.mil Abstact
More informationAny modern computer system will incorporate (at least) two levels of storage:
1 Any moden compute system will incopoate (at least) two levels of stoage: pimay stoage: andom access memoy (RAM) typical capacity 32MB to 1GB cost pe MB $3. typical access time 5ns to 6ns bust tansfe
More informationAnd Ph.D. Candidate of Computer Science, University of Putra Malaysia 2 Faculty of Computer Science and Information Technology,
(IJCSIS) Intenational Jounal of Compute Science and Infomation Secuity, Efficient Candidacy Reduction Fo Fequent Patten Mining M.H Nadimi-Shahaki 1, Nowati Mustapha 2, Md Nasi B Sulaiman 2, Ali B Mamat
More informationComputer Science 141 Computing Hardware
Compute Science 141 Computing Hadwae Fall 2006 Havad Univesity Instucto: Pof. David Books dbooks@eecs.havad.edu [MIPS Pipeline Slides adapted fom Dave Patteson s UCB CS152 slides and May Jane Iwin s CSE331/431
More informationSpiral Recognition Methodology and Its Application for Recognition of Chinese Bank Checks
Spial Recognition Methodology and Its Application fo Recognition of Chinese Bank Checks Hanshen Tang 1, Emmanuel Augustin 2, Ching Y. Suen 1, Olivie Baet 2, Mohamed Cheiet 3 1 Cente fo Patten Recognition
More informationCISC 662 Graduate Computer Architecture Lecture 6 - Hazards
CISC 662 Gaduate Compute Achitectue Lectue 6 - Hazads Michela Taufe http://www.cis.udel.edu/~taufe/teaching/cis662f07 Powepoint Lectue Notes fom John Hennessy and David Patteson s: Compute Achitectue,
More information5 4 THE BERNOULLI EQUATION
185 CHATER 5 the suounding ai). The fictional wok tem w fiction is often expessed as e loss to epesent the loss (convesion) of mechanical into themal. Fo the idealied case of fictionless motion, the last
More informationApproximating Euclidean Distance Transform with Simple Operations in Cellular Processor Arrays
00 th Intenational Wokshop on Cellula Nanoscale Netwoks and thei Applications (CNNA) Appoximating Euclidean Distance Tansfom with Simple Opeations in Cellula Pocesso Aas Samad Razmjooei and Piot Dudek
More informationIllumination methods for optical wear detection
Illumination methods fo optical wea detection 1 J. Zhang, 2 P.P.L.Regtien 1 VIMEC Applied Vision Technology, Coy 43, 5653 LC Eindhoven, The Nethelands Email: jianbo.zhang@gmail.com 2 Faculty Electical
More informationCS 2461: Computer Architecture 1 Program performance and High Performance Processors
Couse Objectives: Whee ae we. CS 2461: Pogam pefomance and High Pefomance Pocessos Instucto: Pof. Bhagi Naahai Bits&bytes: Logic devices HW building blocks Pocesso: ISA, datapath Using building blocks
More informationA Novel Parallel Deadlock Detection Algorithm and Architecture
A Novel Paallel Deadlock Detection Aloithm and Achitectue Pun H. Shiu 2, Yudon Tan 2, Vincent J. Mooney III {ship, ydtan, mooney}@ece.atech.ed }@ece.atech.edu http://codesin codesin.ece.atech.eduedu,2
More informationAugmented Reality. Integrating Computer Graphics with Computer Vision Mihran Tuceryan. August 16, 1998 ICPR 98 1
Augmented Reality Integating Compute Gaphics with Compute Vision Mihan Tuceyan August 6, 998 ICPR 98 Definition XCombines eal and vitual wolds and objects XIt is inteactive and eal-time XThe inteaction
More informationEmbeddings into Crossed Cubes
Embeddings into Cossed Cubes Emad Abuelub *, Membe, IAENG Abstact- The hypecube paallel achitectue is one of the most popula inteconnection netwoks due to many of its attactive popeties and its suitability
More informationA ROI Focusing Mechanism for Digital Cameras
A ROI Focusing Mechanism fo Digital Cameas Chu-Hui Lee, Meng-Feng Lin, Chun-Ming Huang, and Chun-Wei Hsu Abstact With the development and application of digital technologies, the digital camea is moe popula
More informationThe Processor: Improving Performance Data Hazards
The Pocesso: Impoving Pefomance Data Hazads Monday 12 Octobe 15 Many slides adapted fom: and Design, Patteson & Hennessy 5th Edition, 2014, MK and fom Pof. May Jane Iwin, PSU Summay Pevious Class Pipeline
More informationAdministrivia. CMSC 411 Computer Systems Architecture Lecture 5. Data Hazard Even with Forwarding Figure A.9, Page A-20
Administivia CMSC 411 Compute Systems Achitectue Lectue 5 Basic Pipelining (cont.) Alan Sussman als@cs.umd.edu as@csu dedu Homewok poblems fo Unit 1 due today Homewok poblems fo Unit 3 posted soon CMSC
More informationTopic -3 Image Enhancement
Topic -3 Image Enhancement (Pat 1) DIP: Details Digital Image Pocessing Digital Image Chaacteistics Spatial Spectal Gay-level Histogam DFT DCT Pe-Pocessing Enhancement Restoation Point Pocessing Masking
More informationAssessment of Track Sequence Optimization based on Recorded Field Operations
Assessment of Tack Sequence Optimization based on Recoded Field Opeations Matin A. F. Jensen 1,2,*, Claus G. Søensen 1, Dionysis Bochtis 1 1 Aahus Univesity, Faculty of Science and Technology, Depatment
More informationShortest Paths for a Two-Robot Rendez-Vous
Shotest Paths fo a Two-Robot Rendez-Vous Eik L Wyntes Joseph S B Mitchell y Abstact In this pape, we conside an optimal motion planning poblem fo a pai of point obots in a plana envionment with polygonal
More informationObstacle Avoidance of Autonomous Mobile Robot using Stereo Vision Sensor
Obstacle Avoidance of Autonomous Mobile Robot using Steeo Vision Senso Masako Kumano Akihisa Ohya Shin ichi Yuta Intelligent Robot Laboatoy Univesity of Tsukuba, Ibaaki, 35-8573 Japan E-mail: {masako,
More informationA Mathematical Implementation of a Global Human Walking Model with Real-Time Kinematic Personification by Boulic, Thalmann and Thalmann.
A Mathematical Implementation of a Global Human Walking Model with Real-Time Kinematic Pesonification by Boulic, Thalmann and Thalmann. Mashall Badley National Cente fo Physical Acoustics Univesity of
More informationQuality-of-Content (QoC)-Driven Rate Allocation for Video Analysis in Mobile Surveillance Networks
Quality-of-Content (QoC)-Diven Rate Allocation fo Video Analysis in Mobile Suveillance Netwoks Xiang Chen, Jenq-Neng Hwang, Kuan-Hui Lee, Ricado L. de Queioz Depatment of Electical Engineeing, Univesity
More information(a, b) x y r. For this problem, is a point in the - coordinate plane and is a positive number.
Illustative G-C Simila cicles Alignments to Content Standads: G-C.A. Task (a, b) x y Fo this poblem, is a point in the - coodinate plane and is a positive numbe. a. Using a tanslation and a dilation, show
More informationANN Models for Coplanar Strip Line Analysis and Synthesis
200 IJCSNS Intenational Jounal of Compute Science and Netwok Secuity, VOL.8 No.10, Octobe 2008 Models fo Coplana Stip Line Analysis and J.Lakshmi Naayana D.K.Si Rama Kishna D.L.Patap Reddy Chalapathi Institute
More informationHigh Performance Computing on GPU for Electromagnetic Logging
Intenational Confeence "Paallel and Distiuted Computing Systems" High Pefomance Computing on GPU fo lectomagnetic Logging Glinskikh V.N. Kontoovich A.. pov M.I. Tofimuk Institute of Petoleum Geology and
More informationA Hybrid DWT-SVD Image-Coding System (HDWTSVD) for Color Images
A Hybid DWT-SVD Image-Coding System (HDWTSVD) fo Colo Images Humbeto Ochoa a,b, K.R. Rao a a Univesity of Texas at Alington, Box 906 46 ates Steet, Neddeman Hall, Rm 530 Alington, TX 7609 hdo594@exchange.uta.edu
More informationPoint-Biserial Correlation Analysis of Fuzzy Attributes
Appl Math Inf Sci 6 No S pp 439S-444S (0 Applied Mathematics & Infomation Sciences An Intenational Jounal @ 0 NSP Natual Sciences Publishing o Point-iseial oelation Analysis of Fuzzy Attibutes Hao-En hueh
More informationPersistent Memory what developers need to know Mark Carlson Co-chair SNIA Technical Council Toshiba
Pesistent Memoy what developes need to know Mak Calson Co-chai SNIA Technical Council Toshiba 2018 Stoage Develope Confeence EMEA. All Rights Reseved. 1 Contents Welcome Pesistent Memoy Oveview Non-Volatile
More informationPOMDP: Introduction to Partially Observable Markov Decision Processes Hossein Kamalzadeh, Michael Hahsler
POMDP: Intoduction to Patially Obsevable Makov Decision Pocesses Hossein Kamalzadeh, Michael Hahsle 2019-01-02 The R package pomdp povides an inteface to pomdp-solve, a solve (witten in C) fo Patially
More informationHISTOGRAMS are an important statistic reflecting the
JOURNAL OF L A T E X CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 1 D 2 HistoSketch: Disciminative and Dynamic Similaity-Peseving Sketching of Steaming Histogams Dingqi Yang, Bin Li, Laua Rettig, and Philippe
More informationPerformance Optimization in Structured Wireless Sensor Networks
5 The Intenational Aab Jounal of Infomation Technology, Vol. 6, o. 5, ovembe 9 Pefomance Optimization in Stuctued Wieless Senso etwoks Amine Moussa and Hoda Maalouf Compute Science Depatment, ote Dame
More informationADDING REALISM TO SOURCE CHARACTERIZATION USING A GENETIC ALGORITHM
ADDING REALISM TO SOURCE CHARACTERIZATION USING A GENETIC ALGORITHM Luna M. Rodiguez*, Sue Ellen Haupt, and Geoge S. Young Depatment of Meteoology and Applied Reseach Laboatoy The Pennsylvania State Univesity,
More informationOPTIMUM DESIGN OF 3R ORTHOGONAL MANIPULATORS CONSIDERING ITS TOPOLOGY
Copyight by ABCM Page 38 OPTIMUM DESIGN OF 3R ORTHOGONAL MANIPULATORS CONSIDERING ITS TOPOLOGY Giovana Tindade da Silva Oliveia, gtindadeso@yahoo.com.b School of Mechanical Engineeing, Fedeal Univesity
More informationMonte Carlo Techniques for Rendering
Monte Calo Techniques fo Rendeing CS 517 Fall 2002 Compute Science Conell Univesity Announcements No ectue on Thusday Instead, attend Steven Gotle, Havad Upson Hall B17, 4:15-5:15 (efeshments ealie) Geomety
More informationA VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM
Accepted fo publication Intenational Jounal of Flexible Automation and Integated Manufactuing. A VECTOR PERTURBATION APPROACH TO THE GENERALIZED AIRCRAFT SPARE PARTS GROUPING PROBLEM Nagiza F. Samatova,
More informationComparisons of Transient Analytical Methods for Determining Hydraulic Conductivity Using Disc Permeameters
Compaisons of Tansient Analytical Methods fo Detemining Hydaulic Conductivity Using Disc Pemeametes 1,,3 Cook, F.J. 1 CSRO Land and Wate, ndoooopilly, Queensland The Univesity of Queensland, St Lucia,
More informationMultidimensional Testing
Multidimensional Testing QA appoach fo Stoage netwoking Yohay Lasi Visuality Systems 1 Intoduction Who I am Yohay Lasi, QA Manage at Visuality Systems Visuality Systems the leading commecial povide of
More informationTopological Characteristic of Wireless Network
Topological Chaacteistic of Wieless Netwok Its Application to Node Placement Algoithm Husnu Sane Naman 1 Outline Backgound Motivation Papes and Contibutions Fist Pape Second Pape Thid Pape Futue Woks Refeences
More informationDetection and Recognition of Alert Traffic Signs
Detection and Recognition of Alet Taffic Signs Chia-Hsiung Chen, Macus Chen, and Tianshi Gao 1 Stanfod Univesity Stanfod, CA 9305 {echchen, macuscc, tianshig}@stanfod.edu Abstact Taffic signs povide dives
More informationPrediction of Time Series Using RBF Neural Networks: A New Approach of Clustering
138 The Intenational Aab Jounal of Infomation Technology, Vol. 6,. 2, Apil 2009 Pediction of Time Seies Using RBF Neual Netwoks: A New Appoach of Clusteing Mohammed Awad 2, Hécto Pomaes 1, Ignacio Rojas
More informationAdaptation of TDMA Parameters Based on Network Conditions
Adaptation of TDMA Paametes Based on Netwok Conditions Boa Kaaoglu Dept. of Elect. and Compute Eng. Univesity of Rocheste Rocheste, NY 14627 Email: kaaoglu@ece.ocheste.edu Tolga Numanoglu Dept. of Elect.
More informationINCORPORATION OF ADVANCED NUMERICAL FIELD ANALYSIS TECHNIQUES IN THE INDUSTRIAL TRANSFORMER DESIGN PROCESS
INCORPORATION OF ADVANCED NUMERICAL FIELD ANALYSIS TECHNIQUES IN THE INDUSTRIAL TRANSFORMER DESIGN PROCESS M A Tsili 1, A G Kladas 1, P S Geogilakis 2, A T Souflais 3 and D G Papaigas 3 1 National Technical
More informationUsing Data Flow Diagrams for Supporting Task Models
in Companion Poc. of 5 th Euogaphics Wokshop on Design, Specification, Veification of Inteactive Systems DSV-IS 98 (Abingdon, 3-5 June 1998), P. Makopoulos & P. Johnson (Eds.), Spinge-Velag, Belin, 1998.
More information