Optimized Design Platform for High Speed Digital Filter using Folding Technique
|
|
- Bruno Patterson
- 5 years ago
- Views:
Transcription
1 Volume-2, Issue-1, January-February, 2014, pp , IASTER Online: , Print: ABSTRACT Optimized Design Platform for High Speed Digital Filter using Folding Technique 1 Shreyas Patel, 2 Prof.J.S. Rani Alex 1 Department of SENSE, VIT University, Chennai, India 2 Department of SENSE, VIT University, Chennai, India Implementation of DSP system must satisfy the sampling rate constraint and must require less space and power consumption. Thus finding a reasonable solution to optimize design platform using different algorithm is much needed. In this paper an optimized platform is designed by lifetime analysis which is one of the techniques of folding algorithm for minimizing the registers such that synthesizable RTL is obtained. Folding techniques can be used for the synthesis of DSP architecture that can be operated using single or multiple clocks with less number of registers and functional units resulting in an integrated circuit with usage of small silicon area. A technique is presented for computing the minimum number of registers, allocating the data to these registers and obtains synthesizable RTL code for folded architecture. Keywords: Folding Architecture, RTL (Register Transfer Logic), Register Minimization, Lifetime Analysis. I. INTRODUCTION In today's VLSI world, Designers had to design circuit with high performance and with less area and this to be done with a rapid design time. CAD tool play a very important role in achieving this requirement. ASIC design process start with given specification, from these high level functional block is obtained. These can be later used for obtaining circuit level device. In present work designed of 3-tap IIR filter model is design in MATLAB SIMULINK using XILINX block set, System generator which generate automatic synthesizable RTL code and design specification report of speed, area, power and registers. Folding technique provide a mean for trading area for time in a DSP architecture. DSP Architecture consists of adders and multipliers, in CMOS technology multiplier consume more power and thus structure must be implemented using one adder and multiplier using folding technique with minimum registers. The work carried out in previous paper is for reduction in clock period using retiming method [1]. In this paper, it had been reported that there is minimization of clock period but number of register is increasing. In this paper the technique is applied on folded retimed filter to reduce the registers. First, Design a 3-tap IIR folded retimed filter in MATLAB SIMULINK using XILINX block and obtain synthesizable RTL code automatic which reduce time for designer, observe the number of register that has been used from synthesize report. Next, Find iteration bound using longest path matrix (LPM) and minimum cycle mean (MCM) algorithm using MATLAB. Then, Obtain folded retimed architecture of 3-tap IIR filter(manually) and again check for iteration bound using LPM and MCM algorithm. Iteration and loop bound must remain same(matlab).required number of registers is more in folded structure so use life time analysis technique which is part of folding technique for minimization of registers(manually). Finally, Design a folded structure according to life time analysis technique and write an HDL code and synthesize report of folded structure compare with pervious synthesize result.(xilinx). 19
2 II. FOLDING TECHNIQUE Folding can be used to reduce the number of hardware functional unit by a factor of N at the expense of increasing the computational time by a factor of N. While folding transformation reduces the number of functional unit in the architecture, it may also apply to an architecture that uses a larger number of register. To avoid architecture consist of excessive amount of register, life time analysis technique can be used to compute the minimum number of register required to implement a folded DSP architecture. Using register minimization along with folding transformation not only reduce number of functional unit but also keeps the area as minimum as possible[8]. Fig-1 shows an example of 2 addition operations can be time multiplexed on a single pipelined hardware adder [9]. Fig-1 DSP program with 2 addition operation [9] y(n)=a(n)+b(n)+c(n) (1)[8] In Fig-2, the 2 addition operation are time-multiplexed on a single pipelined adder. Fig-2 A folded architecture 2 addition operation are folded to a single hardware adder with 1 stage of pipelining.[9] Table-1 operation of first six cycle of the folded hardware[8][9] In Table-1 in cycle 0,th sample a(0) and b(0) are switched into adder and in cycle 1[8], sum of (a(0)+b(0)) is switched into adder along with c(0),in cycle 2 when sum of (a(0)+b(0)+c(0)) is output and intermediate result (a(1)+b(1)) is computed by the adder[8]. This process continues as shown in table-1[8].the use of systematic folding technique is explained by folding the 2-tap retimed IIR filter, shown in Fig-4. Assume that addition and Multiplication require 1 and 2 unit the filter is folded with folding factor N=4[8],folding factor N means that iteration period of folded hardware is 4 unit i.e each node of filter is executed exactly once every 4 unit in folded architecture[8]. 20
3 For folded system to be realized D F (U V) >= 0 must hold for all of edge in DFG (data flow graph), must implies Nw r (e)-p u+ V-U >=0 (2) where Pu is processing unit time and Wr(e) is number of delay in edge Consider a one node-1 at instance (S 1 /3) doing to Node-2 with instance (S 1 /1) with one delay 4(0)-1+1-3=-3 (before folding) 4(1)-1+1-3=1 (after folding) Fig-2(A) Retimed Biquard Filter with Valid Folding Structure Fig-2(B) The Folded Biquard Filter using 1 Adder and 1 Multipier [8] As shown in Fig-2(b) number of adder and multiplier reduce, consider node 1 in Fig-2(a) at instance 4l+3 input to adder and at instance 4l+1 is output of filter compare this operation with Fig-2(b).as per equation (2) delay is 1 unit so in Fig-2(b) sample at IN,{3} enter input as adder and after 1 delay again input to adder{1},this structure give same functionality as Fig-2(a).but problem with this structure, it required more number of delay(register). III. LIFETIME ANALYSIS Lifetime analysis is one of folding technique used to compute minimum number of register require to implement a dsp algorithm in hardware[8].a data sample is live from the time it is product through the time it is consumed. After the variable is consumed it is dead[10]. A variable occupies one register during each time unit that is live[10]. In lifetime analysis, the number of live variable at any time unit is determined[10]. This is the minimum number of register required to implement the DSP program[8]. The folded architecture without lifetime analysis show in Fig-2(b) requires 6 register and 1 adder and multiplier. Since retiming for folding has already been performed,the next step is to construct the lifetime show in Table-2.In life time there is one entry for each node in DFG, that specify the lifetime(t input T output ) for a node. 21
4 T input =u+pu (3) T outout= u+pu+max v (D F (U V) (4) The time T in put for node U is u+pu where u is folding order of U and Pu is number of pipeline stage in functional unit that execute U[9]. This value of Tinput is the time unit in which the node produce data in hardware for the 0-th iteration of DSP programmed[8]. For example T input for node 1 in Fig-3 is 3+1=4.The time T output for node U is u+pu+max v (D F (U V)).where max{d F (U V)} represent longest folded path delay among all edge that begin at node U[9]. from equation T input and T ouput develop a table show in Table-2. Table-2 Lifetime for the Retimed Biquard filter NODE T in T out Fig-3 Lifetime Chart[8] Table-3 The Allocation Table for the Folded Biquard Filter[8] The linear lifetime chart can be drawn from Table-2 for the lifetime Fig-3 shown, at last the allocation of data variable to register shown in Table-3.Lifetime analysis need less number of register compare to folded technique. Same folded architecture is obtained by using lifetime analysis with 2 register shown in Fig-4[8]. 22
5 Fig-4 A Folded Biquard Filter Architecture Implementing the DFG Using Minimum Number of Registers [8][9] As from Fig-4 same biquade filter is implemented by using 1 adder and multiplier with two registers and data allocate in registers using switching activity IV. DESIGN AND ANALYSIS In this paper the main goal to reduce the number of registers used in retimed folded 3- Tap IIR filter using HDL, for comparison of designer HDL code first we are designing the retimed folded 3-Tap IIR filter in Matlab Simulink using Xilinx System Generator, show in Fig-5(a) and output for 5 discrete sample shown in Fig-5(b),System Generator is a system-level modeling tool that facilitate FPGA hardware design. It extends Simulink in many ways to provide modeling environment that is well suited to hardware design. Fig-5(a) Implementation of Retimed Folded 3-TAP IIR Filter in Matlab Simulink using System Generator Fig-5(b) Output of 3-TAP IIR Filter with 5 Sample 23
6 System Generator automatically compiles design into low-level representation. Design is compiled and simulated using the System Generator. Automatically code is generated and code is synthesis in Xilinx simulator to find number of register used in retimed folded 3-TAP IIR filter, Synthesis report is been show in Fig-5(c). Fig-5(c) Automatic Synthesis Report generated by System Generator From the synthesis report the number of registers slice generated by System generator is 48, so our aim to reduce number of Registers by writing HDL for folded structure. For folded structure we need to do calculation analytically and by using matlab. The 3-tap IIR Filter been designed by using dataflow graph.dataflow graph gives detail information without implementation of hardware and can be able to represent any algorithm. A DFG is a directed graph G(V,E) with a set of edges E. These set of nodes V are subdivided into computational nodes, input and output nodes [1]. (a) (b) Fig-6 (a) 3-TAP IIR filter (b) Dataflow graph of 3-TAP IIR filter In dataflow graph representation the node represent computational time and directed edge represent data path and each has a non-negative number of delay associated with node implementation of data flow graph represent in Fig-7.This filter is folded with folding factor N=6,means that iteration period of folded hardware is 4 U.T,each node in 3-tap IIR filter is executed exactly ones the iteration period can be founded by using LPM(longest path matrix) and MCM(Minimum cycle mean) algorithm, algorithm is implemented in matlab to check iteration period, after and before folding, the property of folding transformation that loop bound and iteration bound should not change after adding number of delay in path. 24
7 Fig-7 Dataflow Graph of 3-Tap IIR Filter Using Matlab In present paper as per eqa-2 the weight (delay) of D f (U V) is calculated, some of edge may get negative value shown in Table-8, the edge with negative D f (U V) can be made non-negative by increasing (decreasing) number of delay the D f (U V) by Nw,while adding delay property should not be effected. Table-4 Folding Equation for Folding Constraint for DFG D f (U V) Delay In the Table-4 some of edge get negative value to make non-negative value, we added a delay(register) to make them positive after adding delay to each negative value retime 3-TAP IIR filter with valid folding retimed structure is shown in Fig-8,but adding delay there is increase in latency but functionality and property 1.loop bound remain same 2.iteration bound must remain same iteration bound and Loop bound of folded architecture can check by using LPM and MCM algorithm shown in Fig-8(a),Fig-8(b). 25
8 Fig-8 Retimed 3-TAP IIR Filter with Valid Folding Architecture Fig-8(a)Verified Iteration Bound using LPM after Adding Delay Fig-8(B) Verified Iteration Bound using MCM after Adding Delay 26
9 In folded structure {(s1/0),(s1/1),(s1/2),(s1/3),(s1/4),(s1/5),(s2/0),(s2/1),(s2/2),(s2/3),(s2/4),(s2/5)}, are assumed instance at particular time. For folded structure again calculate the delay for each edge, as per mention above Life time Analysis a linear life time chart is used to graphically represent the lifetime of variable in a linear fashions be calculated as per (eqation-3 and equation-4) show in Table-5. Table-5 Lifetime Chart NODE T input T output Fig-9 Life Time Chart The vertical line in Fig-9 represent the clock cycle and horizontal line represent the activation of node at particular clock cycle. For example sample leaving from node-1(fig-8) should activate at 6 th clock cycle and must reach at node 6 with 9 delay. While writing HDL code Table-6 gives information about data allocation in registers. Table-6 Data Allocation in Register for Every Clock Cycle 27
10 Fig-10 Folded Architecture of 3-TAP IIR Filter Using Lifetime Chart Fig-10 show is folded structure of 3-TAP IIR filter now to represent this structure in digital design for writing HDL we need to replace those switches by multiplexer and need RAM to store data for filter co-efficient and to store Multiplier output which can be further used Fig-11 show the implementation of Fig-10(Folded architecture of 3-TAP IIR filter) in digital design. Fig-11 3-TAP IIR Filter Folded Digital Design 3-TAP IIR filter with folded structure using 4 register,1 adder and 1 multiplier in Xilinx with HDL code and synthesis and design summary report is to be compare the result with report generated by System Generator V. SIMULATION RESULT Fig-12 3-tap IIR Folded Filter using Xilinx Simulation Tool 28
11 Fig-13 RTL Schematic View of Folded 3-TAP IIR Filter Fig-14 Synthesis Report of Folded Digital Design in Xilinx From Fig-14 Synthesis report, registers get reduce with usage of 5 Look-up Table. In previous design work been carried out for optimizing the clock period by using retiming technique but disadvantage of paper is after doing retiming, they are able to reduce clock period but in report they shown the number of registers is increasing shown in Fig-15,so our design give reduction in register can be seen by synthesis report. Fig-15 Previous Work Simulation Result [1] 29
12 VI. CONCLUSION In this particular work a design optimized platform is developed for Digital filter. There are two ways by which optimization is performed in the current work. Firstly folding and second lifetime analysis technique but in folding functional unit and critical path is reduced but there is increasing in number of registers so lifetime analysis method is chosen which reduce the critical path, functional unit as well as registers and generates the synthesizable HDL. Since the entire process is reduce area occupied by register. VII. REFERENCES [1] Deepa Yagain,Dr. Vijaya Krishna A"Design Optimization Platform for Synthesizable High Speed Digital Filters Using Retiming Technique"IEEE-ICSE2012 Proc., 2012, Kuala Lumpur, Malaysia. [2] Daniel D. Gajski, Lognath Ramachandran IEEE Design & Test, volume 11, Issue 4 (Oct 1994), Publishers: IEEE computer society press, Los Alamitos, CA,USA,: ,pp [3] Zahra Jeddi and Esmail Amini Power optimization of Sequential Circuits by Retiming and Rewiring, IEEE, 2006 [4] Ozgur Sinanoglu and Vishwani D. Agrawal Retiming Scan Circuit to Eliminate Timing Penalty,IEEE, [5] A. Chandrakasan, S. Sheng, and R. Brodersen, Low-power CMOS digital design, IEEE J. Solid-State Circuits, vol. 27, pp , Apr [6] Zahra Jeddi and Esmail Amini Power optimization of Sequential Circuits by Retiming and Rewiring, IEEE, [7] K. K. Parhi "Synthesis of Control Circuits in Folded Pipelined DSP Architectures", IEEE Jl. of Solid-State Circuits, vol. SC-27, no. 1, pp [8] KESHAB K.PARHI "VLSI DIGITAL SIGNAL PROCESSING SYSTEM design and implementation" ISBN: , [9] Pierre COULON "Postgraduate Course on Signal Processing in Communications, FALL 99. [10] S. Srinivasan. "A novel architecture for lifting-based discrete wavelet transform for JPEG2000standard suitable for VLSI implementation", 16th International Conference on VLSI Design2003 Proceedings ICVD-03,
Chapter 6: Folding. Keshab K. Parhi
Chapter 6: Folding Keshab K. Parhi Folding is a technique to reduce the silicon area by timemultiplexing many algorithm operations into single functional units (such as adders and multipliers) Fig(a) shows
More informationSynthesis of DSP Systems using Data Flow Graphs for Silicon Area Reduction
Synthesis of DSP Systems using Data Flow Graphs for Silicon Area Reduction Rakhi S 1, PremanandaB.S 2, Mihir Narayan Mohanty 3 1 Atria Institute of Technology, 2 East Point College of Engineering &Technology,
More informationFolding. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall,
Folding ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall, 2010 ldvan@cs.nctu.edu.tw http://www.cs.nctu.tw/~ldvan/ Outline Introduction Folding Transformation
More informationHIGH-PERFORMANCE RECONFIGURABLE FIR FILTER USING PIPELINE TECHNIQUE
HIGH-PERFORMANCE RECONFIGURABLE FIR FILTER USING PIPELINE TECHNIQUE Anni Benitta.M #1 and Felcy Jeba Malar.M *2 1# Centre for excellence in VLSI Design, ECE, KCG College of Technology, Chennai, Tamilnadu
More informationFPGA Implementation of Multiplierless 2D DWT Architecture for Image Compression
FPGA Implementation of Multiplierless 2D DWT Architecture for Image Compression Divakara.S.S, Research Scholar, J.S.S. Research Foundation, Mysore Cyril Prasanna Raj P Dean(R&D), MSEC, Bangalore Thejas
More informationA SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN
A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN Xiaoying Li 1 Fuming Sun 2 Enhua Wu 1, 3 1 University of Macau, Macao, China 2 University of Science and Technology Beijing, Beijing, China
More informationDesign and Implementation of 3-D DWT for Video Processing Applications
Design and Implementation of 3-D DWT for Video Processing Applications P. Mohaniah 1, P. Sathyanarayana 2, A. S. Ram Kumar Reddy 3 & A. Vijayalakshmi 4 1 E.C.E, N.B.K.R.IST, Vidyanagar, 2 E.C.E, S.V University
More informationFOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILTER USED IN ECHO CANCELLATION
FOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILTER USED IN ECHO CANCELLATION Pradnya Zode 1 and Dr.A.Y.Deshmukh 2 1 Research Scholar, Department of Electronics Engineering
More informationFILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS. Waqas Akram, Cirrus Logic Inc., Austin, Texas
FILTER SYNTHESIS USING FINE-GRAIN DATA-FLOW GRAPHS Waqas Akram, Cirrus Logic Inc., Austin, Texas Abstract: This project is concerned with finding ways to synthesize hardware-efficient digital filters given
More informationMemory, Area and Power Optimization of Digital Circuits
Memory, Area and Power Optimization of Digital Circuits Laxmi Gupta Electronics and Communication Department Jaypee Institute of Information Technology Noida, Uttar Pradesh, India Ankita Bharti Electronics
More informationVLSI Implementation of Low Power Area Efficient FIR Digital Filter Structures Shaila Khan 1 Uma Sharma 2
IJSRD - International Journal for Scientific Research & Development Vol. 3, Issue 05, 2015 ISSN (online): 2321-0613 VLSI Implementation of Low Power Area Efficient FIR Digital Filter Structures Shaila
More informationHardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University
Hardware Design Environments Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University Outline Welcome to COE 405 Digital System Design Design Domains and Levels of Abstractions Synthesis
More informationChapter 8 Folding. VLSI DSP 2008 Y.T. Hwang 8-1. Introduction (1)
Chapter 8 olding LSI SP 008 Y.T. Hang 8- folding Introduction SP architecture here multiple operations are multiplexed to a single function unit Trading area for time in a SP architecture Reduce the number
More informationOPTIMIZATION OF FIR FILTER USING MULTIPLE CONSTANT MULTIPLICATION
OPTIMIZATION OF FIR FILTER USING MULTIPLE CONSTANT MULTIPLICATION 1 S.Ateeb Ahmed, 2 Mr.S.Yuvaraj 1 Student, Department of Electronics and Communication/ VLSI Design SRM University, Chennai, India 2 Assistant
More informationCase Study on DiaHDL: A Web-based Electronic Design Automation Tool for Education Purpose
Case Study on DiaHDL: A Web-based Electronic Design Automation Tool for Education Purpose Muhammad Shoaib Iqbal Ansari, Thomas Schumann Faculty of Electrical Engineering h da University of Applied Sciences
More informationKeywords: Soft Core Processor, Arithmetic and Logical Unit, Back End Implementation and Front End Implementation.
ISSN 2319-8885 Vol.03,Issue.32 October-2014, Pages:6436-6440 www.ijsetr.com Design and Modeling of Arithmetic and Logical Unit with the Platform of VLSI N. AMRUTHA BINDU 1, M. SAILAJA 2 1 Dept of ECE,
More informationCOE 561 Digital System Design & Synthesis Introduction
1 COE 561 Digital System Design & Synthesis Introduction Dr. Aiman H. El-Maleh Computer Engineering Department King Fahd University of Petroleum & Minerals Outline Course Topics Microelectronics Design
More informationVLSI Design and Implementation of High Speed and High Throughput DADDA Multiplier
VLSI Design and Implementation of High Speed and High Throughput DADDA Multiplier U.V.N.S.Suhitha Student Department of ECE, BVC College of Engineering, AP, India. Abstract: The ever growing need for improved
More informationUnit 2: High-Level Synthesis
Course contents Unit 2: High-Level Synthesis Hardware modeling Data flow Scheduling/allocation/assignment Reading Chapter 11 Unit 2 1 High-Level Synthesis (HLS) Hardware-description language (HDL) synthesis
More information1 Introduction Data format converters (DFCs) are used to permute the data from one format to another in signal processing and image processing applica
A New Register Allocation Scheme for Low Power Data Format Converters Kala Srivatsan, Chaitali Chakrabarti Lori E. Lucke Department of Electrical Engineering Minnetronix, Inc. Arizona State University
More informationFPGA Based FIR Filter using Parallel Pipelined Structure
FPGA Based FIR Filter using Parallel Pipelined Structure Rajesh Mehra, SBL Sachan Electronics & Communication Engineering Department National Institute of Technical Teachers Training & Research Chandigarh,
More informationLow Power Complex Multiplier based FFT Processor
Low Power Complex Multiplier based FFT Processor V.Sarada, Dr.T.Vigneswaran 2 ECE, SRM University, Chennai,India saradasaran@gmail.com 2 ECE, VIT University, Chennai,India vigneshvlsi@gmail.com Abstract-
More informationHead, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India
Mapping Signal Processing Algorithms to Architecture Sumam David S Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India sumam@ieee.org Objectives At the
More informationAn Efficient Carry Select Adder with Less Delay and Reduced Area Application
An Efficient Carry Select Adder with Less Delay and Reduced Area Application Pandu Ranga Rao #1 Priyanka Halle #2 # Associate Professor Department of ECE Sreyas Institute of Engineering and Technology,
More informationIntroduction to C and HDL Code Generation from MATLAB
Introduction to C and HDL Code Generation from MATLAB 이웅재차장 Senior Application Engineer 2012 The MathWorks, Inc. 1 Algorithm Development Process Requirements Research & Design Explore and discover Design
More informationAccelerating DSP Applications in Embedded Systems with a Coprocessor Data-Path
Accelerating DSP Applications in Embedded Systems with a Coprocessor Data-Path Michalis D. Galanis, Gregory Dimitroulakos, and Costas E. Goutis VLSI Design Laboratory, Electrical and Computer Engineering
More informationAn Efficient Implementation of Floating Point Multiplier
An Efficient Implementation of Floating Point Multiplier Mohamed Al-Ashrafy Mentor Graphics Mohamed_Samy@Mentor.com Ashraf Salem Mentor Graphics Ashraf_Salem@Mentor.com Wagdy Anis Communications and Electronics
More informationDesign of a Multiplier Architecture Based on LUT and VHBCSE Algorithm For FIR Filter
African Journal of Basic & Applied Sciences 9 (1): 53-58, 2017 ISSN 2079-2034 IDOSI Publications, 2017 DOI: 10.5829/idosi.ajbas.2017.53.58 Design of a Multiplier Architecture Based on LUT and VHBCSE Algorithm
More informationParallel FIR Filters. Chapter 5
Chapter 5 Parallel FIR Filters This chapter describes the implementation of high-performance, parallel, full-precision FIR filters using the DSP48 slice in a Virtex-4 device. ecause the Virtex-4 architecture
More informationMOJTABA MAHDAVI Mojtaba Mahdavi DSP Design Course, EIT Department, Lund University, Sweden
High Level Synthesis with Catapult MOJTABA MAHDAVI 1 Outline High Level Synthesis HLS Design Flow in Catapult Data Types Project Creation Design Setup Data Flow Analysis Resource Allocation Scheduling
More informationAn HEVC Fractional Interpolation Hardware Using Memory Based Constant Multiplication
2018 IEEE International Conference on Consumer Electronics (ICCE) An HEVC Fractional Interpolation Hardware Using Memory Based Constant Multiplication Ahmet Can Mert, Ercan Kalali, Ilker Hamzaoglu Faculty
More informationDesign and Implementation of VLSI 8 Bit Systolic Array Multiplier
Design and Implementation of VLSI 8 Bit Systolic Array Multiplier Khumanthem Devjit Singh, K. Jyothi MTech student (VLSI & ES), GIET, Rajahmundry, AP, India Associate Professor, Dept. of ECE, GIET, Rajahmundry,
More informationFPGAs: FAST TRACK TO DSP
FPGAs: FAST TRACK TO DSP Revised February 2009 ABSRACT: Given the prevalence of digital signal processing in a variety of industry segments, several implementation solutions are available depending on
More informationExercises in DSP Design 2016 & Exam from Exam from
Exercises in SP esign 2016 & Exam from 2005-12-12 Exam from 2004-12-13 ept. of Electrical and Information Technology Some helpful equations Retiming: Folding: ω r (e) = ω(e)+r(v) r(u) F (U V) = Nw(e) P
More informationInternational Journal for Research in Applied Science & Engineering Technology (IJRASET) IIR filter design using CSA for DSP applications
IIR filter design using CSA for DSP applications Sagara.K.S 1, Ravi L.S 2 1 PG Student, Dept. of ECE, RIT, Hassan, 2 Assistant Professor Dept of ECE, RIT, Hassan Abstract- In this paper, a design methodology
More informationOn the Design of High Speed Parallel CRC Circuits using DSP Algorithams
On the Design of High Speed Parallel CRC Circuits using DSP Algorithams 1 B.Naresh Reddy, 2 B.Kiran Kumar, 3 K.Mohini sirisha 1 Dept.of ECE,Kodada institute of Technology & Science for women,kodada,india
More informationEEL 4783: HDL in Digital System Design
EEL 4783: HDL in Digital System Design Lecture 4: HLS Intro* Prof. Mingjie Lin *Notes are drawn from the textbook and the George Constantinides notes 1 Course Material Sources 1) Low-Power High-Level Synthesis
More informationOverview. CSE372 Digital Systems Organization and Design Lab. Hardware CAD. Two Types of Chips
Overview CSE372 Digital Systems Organization and Design Lab Prof. Milo Martin Unit 5: Hardware Synthesis CAD (Computer Aided Design) Use computers to design computers Virtuous cycle Architectural-level,
More informationDESIGN AND IMPLEMENTATION OF VLSI SYSTOLIC ARRAY MULTIPLIER FOR DSP APPLICATIONS
International Journal of Computing Academic Research (IJCAR) ISSN 2305-9184 Volume 2, Number 4 (August 2013), pp. 140-146 MEACSE Publications http://www.meacse.org/ijcar DESIGN AND IMPLEMENTATION OF VLSI
More informationDesign of Convolution Encoder and Reconfigurable Viterbi Decoder
RESEARCH INVENTY: International Journal of Engineering and Science ISSN: 2278-4721, Vol. 1, Issue 3 (Sept 2012), PP 15-21 www.researchinventy.com Design of Convolution Encoder and Reconfigurable Viterbi
More informationVLSI DESIGN OF REDUCED INSTRUCTION SET COMPUTER PROCESSOR CORE USING VHDL
International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 (Spl.) Sep 2012 42-47 TJPRC Pvt. Ltd., VLSI DESIGN OF
More informationArea Efficient, Low Power Array Multiplier for Signed and Unsigned Number. Chapter 3
Area Efficient, Low Power Array Multiplier for Signed and Unsigned Number Chapter 3 Area Efficient, Low Power Array Multiplier for Signed and Unsigned Number Chapter 3 3.1 Introduction The various sections
More informationAn Efficient Constant Multiplier Architecture Based On Vertical- Horizontal Binary Common Sub-Expression Elimination Algorithm
Volume-6, Issue-6, November-December 2016 International Journal of Engineering and Management Research Page Number: 229-234 An Efficient Constant Multiplier Architecture Based On Vertical- Horizontal Binary
More informationLow Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm
Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm 1 A.Malashri, 2 C.Paramasivam 1 PG Student, Department of Electronics and Communication K S Rangasamy College Of Technology,
More informationInternational Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering
An Efficient Implementation of Double Precision Floating Point Multiplier Using Booth Algorithm Pallavi Ramteke 1, Dr. N. N. Mhala 2, Prof. P. R. Lakhe M.Tech [IV Sem], Dept. of Comm. Engg., S.D.C.E, [Selukate],
More informationVHDL IMPLEMENTATION OF FLOATING POINT MULTIPLIER USING VEDIC MATHEMATICS
VHDL IMPLEMENTATION OF FLOATING POINT MULTIPLIER USING VEDIC MATHEMATICS I.V.VAIBHAV 1, K.V.SAICHARAN 1, B.SRAVANTHI 1, D.SRINIVASULU 2 1 Students of Department of ECE,SACET, Chirala, AP, India 2 Associate
More informationHigh-Level Synthesis (HLS)
Course contents Unit 11: High-Level Synthesis Hardware modeling Data flow Scheduling/allocation/assignment Reading Chapter 11 Unit 11 1 High-Level Synthesis (HLS) Hardware-description language (HDL) synthesis
More informationPower Optimized Programmable Truncated Multiplier and Accumulator Using Reversible Adder
Power Optimized Programmable Truncated Multiplier and Accumulator Using Reversible Adder Syeda Mohtashima Siddiqui M.Tech (VLSI & Embedded Systems) Department of ECE G Pulla Reddy Engineering College (Autonomous)
More informationA High Speed Binary Floating Point Multiplier Using Dadda Algorithm
455 A High Speed Binary Floating Point Multiplier Using Dadda Algorithm B. Jeevan, Asst. Professor, Dept. of E&IE, KITS, Warangal. jeevanbs776@gmail.com S. Narender, M.Tech (VLSI&ES), KITS, Warangal. narender.s446@gmail.com
More informationISSN Vol.02, Issue.11, December-2014, Pages:
ISSN 2322-0929 Vol.02, Issue.11, December-2014, Pages:1208-1212 www.ijvdcs.org Implementation of Area Optimized Floating Point Unit using Verilog G.RAJA SEKHAR 1, M.SRIHARI 2 1 PG Scholar, Dept of ECE,
More information16 BIT IMPLEMENTATION OF ASYNCHRONOUS TWOS COMPLEMENT ARRAY MULTIPLIER USING MODIFIED BAUGH-WOOLEY ALGORITHM AND ARCHITECTURE.
16 BIT IMPLEMENTATION OF ASYNCHRONOUS TWOS COMPLEMENT ARRAY MULTIPLIER USING MODIFIED BAUGH-WOOLEY ALGORITHM AND ARCHITECTURE. AditiPandey* Electronics & Communication,University Institute of Technology,
More informationAdvanced Design System DSP Synthesis
Advanced Design System 2002 DSP Synthesis February 2002 Notice The information contained in this document is subject to change without notice. Agilent Technologies makes no warranty of any kind with regard
More informationHigh-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications
High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications Pallavi R. Yewale ME Student, Dept. of Electronics and Tele-communication, DYPCOE, Savitribai phule University, Pune,
More informationMOST computations used in applications, such as multimedia
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 13, NO. 9, SEPTEMBER 2005 1023 Pipelining With Common Operands for Power-Efficient Linear Systems Daehong Kim, Member, IEEE, Dongwan
More informationIMPLEMENTATION OF AN ADAPTIVE FIR FILTER USING HIGH SPEED DISTRIBUTED ARITHMETIC
IMPLEMENTATION OF AN ADAPTIVE FIR FILTER USING HIGH SPEED DISTRIBUTED ARITHMETIC Thangamonikha.A 1, Dr.V.R.Balaji 2 1 PG Scholar, Department OF ECE, 2 Assitant Professor, Department of ECE 1, 2 Sri Krishna
More informationModel-Based Design for effective HW/SW Co-Design Alexander Schreiber Senior Application Engineer MathWorks, Germany
Model-Based Design for effective HW/SW Co-Design Alexander Schreiber Senior Application Engineer MathWorks, Germany 2013 The MathWorks, Inc. 1 Agenda Model-Based Design of embedded Systems Software Implementation
More informationVLSI Implementation of Parallel CRC Using Pipelining, Unfolding and Retiming
IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 2, Issue 5 (May. Jun. 203), PP 66-72 e-issn: 239 4200, p-issn No. : 239 497 VLSI Implementation of Parallel CRC Using Pipelining, Unfolding
More informationImplementation of Two Level DWT VLSI Architecture
V. Revathi Tanuja et al Int. Journal of Engineering Research and Applications RESEARCH ARTICLE OPEN ACCESS Implementation of Two Level DWT VLSI Architecture V. Revathi Tanuja*, R V V Krishna ** *(Department
More informationA Novel Design of 32 Bit Unsigned Multiplier Using Modified CSLA
A Novel Design of 32 Bit Unsigned Multiplier Using Modified CSLA Chandana Pittala 1, Devadas Matta 2 PG Scholar.VLSI System Design 1, Asst. Prof. ECE Dept. 2, Vaagdevi College of Engineering,Warangal,India.
More informationA Low Power DDR SDRAM Controller Design P.Anup, R.Ramana Reddy
A Low Power DDR SDRAM Controller Design P.Anup, R.Ramana Reddy Abstract This paper work leads to a working implementation of a Low Power DDR SDRAM Controller that is meant to be used as a reference for
More informationEfficient Implementation of Low Power 2-D DCT Architecture
Vol. 3, Issue. 5, Sep - Oct. 2013 pp-3164-3169 ISSN: 2249-6645 Efficient Implementation of Low Power 2-D DCT Architecture 1 Kalyan Chakravarthy. K, 2 G.V.K.S.Prasad 1 M.Tech student, ECE, AKRG College
More informationCOPY RIGHT. To Secure Your Paper As Per UGC Guidelines We Are Providing A Electronic Bar Code
COPY RIGHT 2018IJIEMR.Personal use of this material is permitted. Permission from IJIEMR must be obtained for all other uses, in any current or future media, including reprinting/republishing this material
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK DESIGN OF QUATERNARY ADDER FOR HIGH SPEED APPLICATIONS MS. PRITI S. KAPSE 1, DR.
More informationDesign of Efficient Fast Fourier Transform
Design of Efficient Fast Fourier Transform Shymna Nizar N. S PG student, VLSI & Embedded Systems, ECE Department TKM Institute of Technology Karuvelil P.O, Kollam, Kerala-691505, India Abhila R Krishna
More informationIteration Bound. Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C.
Iteration Bound ( 范倫達 ) Ph. D. Department of Computer Science National Chiao Tung University Taiwan R.O.C. Fall 2 ldvan@cs.nctu.edu.tw http://www.cs.nctu.tw/~ldvan/ Outline Introduction Data Flow Graph
More informationVHDL for Synthesis. Course Description. Course Duration. Goals
VHDL for Synthesis Course Description This course provides all necessary theoretical and practical know how to write an efficient synthesizable HDL code through VHDL standard language. The course goes
More informationII. LITERATURE SURVEY
Hardware Co-Simulation of Sobel Edge Detection Using FPGA and System Generator Sneha Moon 1, Prof Meena Chavan 2 1,2 Department of Electronics BVUCOE Pune India Abstract: This paper implements an image
More informationSimulink-Hardware Flow
5/2/22 EE26B: VLSI Signal Processing Simulink-Hardware Flow Prof. Dejan Marković ee26b@gmail.com Development Multiple design descriptions Algorithm (MATLAB or C) Fixed point description RTL (behavioral,
More informationMethod We follow- How to Get Entry Pass in SEMICODUCTOR Industries for 3rd year engineering. Winter/Summer Training
Method We follow- How to Get Entry Pass in SEMICODUCTOR Industries for 3rd year engineering Winter/Summer Training Level 2 continues. 3 rd Year 4 th Year FIG-3 Level 1 (Basic & Mandatory) & Level 1.1 and
More informationRISC IMPLEMENTATION OF OPTIMAL PROGRAMMABLE DIGITAL IIR FILTER
RISC IMPLEMENTATION OF OPTIMAL PROGRAMMABLE DIGITAL IIR FILTER Miss. Sushma kumari IES COLLEGE OF ENGINEERING, BHOPAL MADHYA PRADESH Mr. Ashish Raghuwanshi(Assist. Prof.) IES COLLEGE OF ENGINEERING, BHOPAL
More informationPipelined Quadratic Equation based Novel Multiplication Method for Cryptographic Applications
, Vol 7(4S), 34 39, April 204 ISSN (Print): 0974-6846 ISSN (Online) : 0974-5645 Pipelined Quadratic Equation based Novel Multiplication Method for Cryptographic Applications B. Vignesh *, K. P. Sridhar
More informationVerilog for High Performance
Verilog for High Performance Course Description This course provides all necessary theoretical and practical know-how to write synthesizable HDL code through Verilog standard language. The course goes
More informationOPTIMIZATION OF AREA COMPLEXITY AND DELAY USING PRE-ENCODED NR4SD MULTIPLIER.
OPTIMIZATION OF AREA COMPLEXITY AND DELAY USING PRE-ENCODED NR4SD MULTIPLIER. A.Anusha 1 R.Basavaraju 2 anusha201093@gmail.com 1 basava430@gmail.com 2 1 PG Scholar, VLSI, Bharath Institute of Engineering
More informationDESIGN AND IMPLEMENTATION OF APPLICATION SPECIFIC 32-BITALU USING XILINX FPGA
DESIGN AND IMPLEMENTATION OF APPLICATION SPECIFIC 32-BITALU USING XILINX FPGA T.MALLIKARJUNA 1 *,K.SREENIVASA RAO 2 1 PG Scholar, Annamacharya Institute of Technology & Sciences, Rajampet, A.P, India.
More informationA High Speed Design of 32 Bit Multiplier Using Modified CSLA
Journal From the SelectedWorks of Journal October, 2014 A High Speed Design of 32 Bit Multiplier Using Modified CSLA Vijaya kumar vadladi David Solomon Raju. Y This work is licensed under a Creative Commons
More informationMulti Design Exploration and Register Minimization of Retimed Circuits Using GA in DSP Applications
ISSN: -965; IC Value: 5.98; SJ Impact Factor: 6.887 Volume 6 Issue IV, April 8- Available at www.ijraset.com Multi Design Exploration and Register Minimization of Retimed Circuits Using GA in DSP Applications
More informationVLSI Design Of a Novel Pre Encoding Multiplier Using DADDA Multiplier. Guntur(Dt),Pin:522017
VLSI Design Of a Novel Pre Encoding Multiplier Using DADDA Multiplier 1 Katakam Hemalatha,(M.Tech),Email Id: hema.spark2011@gmail.com 2 Kundurthi Ravi Kumar, M.Tech,Email Id: kundurthi.ravikumar@gmail.com
More informationReduction of Latency and Resource Usage in Bit-Level Pipelined Data Paths for FPGAs
Reduction of Latency and Resource Usage in Bit-Level Pipelined Data Paths for FPGAs P. Kollig B. M. Al-Hashimi School of Engineering and Advanced echnology Staffordshire University Beaconside, Stafford
More informationAnalysis of Performance and Designing of Bi-Quad Filter using Hybrid Signed digit Number System
International Journal of Electronics and Computer Science Engineering 173 Available Online at www.ijecse.org ISSN: 2277-1956 Analysis of Performance and Designing of Bi-Quad Filter using Hybrid Signed
More informationIteration Bound. Lan-Da Van ( 倫 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C.
Iteration Bound Lan-Da Van ( 倫 ) Ph. D. Department of Computer Science National Chiao Tung University Taiwan R.O.C. Spring 27 ldvan@cs.nctu.edu.tw http://www.cs.nctu.tw/~ldvan/ Outline Introduction Data
More informationPracy II Konferencji Krajowej Reprogramowalne uklady cyfrowe, RUC 99, Szczecin, 1999, pp Implementation of IIR Digital Filters in FPGA
Pracy II Konferencji Krajowej Reprogramowalne uklady cyfrowe, RUC 99, Szczecin, 1999, pp.233-239 Implementation of IIR Digital Filters in FPGA Anatoli Sergyienko*, Volodymir Lepekha*, Juri Kanevski**,
More information2 Asst Prof, Kottam College of Engineering, Chinnatekur, Kurnool, AP-INDIA,
www.semargroups.org ISSN 2319-8885 Vol.02,Issue.06, July-2013, Pages:413-418 A H/W Efficient 64-Bit Parallel CRC for High Speed Data Transactions P.ABDUL RASOOL 1, N.MOHAN RAJU 2 1 Research Scholar, Kottam
More informationAn FPGA based Implementation of Floating-point Multiplier
An FPGA based Implementation of Floating-point Multiplier L. Rajesh, Prashant.V. Joshi and Dr.S.S. Manvi Abstract In this paper we describe the parameterization, implementation and evaluation of floating-point
More informationINTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume VII /Issue 2 / OCT 2016
NEW VLSI ARCHITECTURE FOR EXPLOITING CARRY- SAVE ARITHMETIC USING VERILOG HDL B.Anusha 1 Ch.Ramesh 2 shivajeehul@gmail.com 1 chintala12271@rediffmail.com 2 1 PG Scholar, Dept of ECE, Ganapathy Engineering
More informationOptimized Design and Implementation of a 16-bit Iterative Logarithmic Multiplier
Optimized Design and Implementation a 16-bit Iterative Logarithmic Multiplier Laxmi Kosta 1, Jaspreet Hora 2, Rupa Tomaskar 3 1 Lecturer, Department Electronic & Telecommunication Engineering, RGCER, Nagpur,India,
More informationImplementation of Ripple Carry and Carry Skip Adders with Speed and Area Efficient
ISSN (Online) : 2278-1021 Implementation of Ripple Carry and Carry Skip Adders with Speed and Area Efficient PUSHPALATHA CHOPPA 1, B.N. SRINIVASA RAO 2 PG Scholar (VLSI Design), Department of ECE, Avanthi
More informationDesign and Optimized Implementation of Six-Operand Single- Precision Floating-Point Addition
2011 International Conference on Advancements in Information Technology With workshop of ICBMG 2011 IPCSIT vol.20 (2011) (2011) IACSIT Press, Singapore Design and Optimized Implementation of Six-Operand
More informationDesign of a Floating-Point Fused Add-Subtract Unit Using Verilog
International Journal of Electronics and Computer Science Engineering 1007 Available Online at www.ijecse.org ISSN- 2277-1956 Design of a Floating-Point Fused Add-Subtract Unit Using Verilog Mayank Sharma,
More informationCopyright 2007 Society of Photo-Optical Instrumentation Engineers. This paper was published in Proceedings of SPIE (Proc. SPIE Vol.
Copyright 2007 Society of Photo-Optical Instrumentation Engineers. This paper was published in Proceedings of SPIE (Proc. SPIE Vol. 6937, 69370N, DOI: http://dx.doi.org/10.1117/12.784572 ) and is made
More information: : (91-44) (Office) (91-44) (Residence)
Course: VLSI Circuits (Video Course) Faculty Coordinator(s) : Prof. S. Srinivasan Department of Electrical Engineering Indian Institute of Technology Madras Chennai 600036 Email Telephone : srinis@iitm.ac.in,
More informationSimulation & Synthesis of FPGA Based & Resource Efficient Matrix Coprocessor Architecture
Simulation & Synthesis of FPGA Based & Resource Efficient Matrix Coprocessor Architecture Jai Prakash Mishra 1, Mukesh Maheshwari 2 1 M.Tech Scholar, Electronics & Communication Engineering, JNU Jaipur,
More informationResource Efficient Multi Ported Sram Based Ternary Content Addressable Memory
IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 PP 11-18 www.iosrjen.org Resource Efficient Multi Ported Sram Based Ternary Content Addressable Memory S.Parkavi (1) And S.Bharath
More informationImplementation of Lifting-Based Two Dimensional Discrete Wavelet Transform on FPGA Using Pipeline Architecture
International Journal of Computer Trends and Technology (IJCTT) volume 5 number 5 Nov 2013 Implementation of Lifting-Based Two Dimensional Discrete Wavelet Transform on FPGA Using Pipeline Architecture
More informationMulti-Gigahertz Parallel FFTs for FPGA and ASIC Implementation
Multi-Gigahertz Parallel FFTs for FPGA and ASIC Implementation Doug Johnson, Applications Consultant Chris Eddington, Technical Marketing Synopsys 2013 1 Synopsys, Inc. 700 E. Middlefield Road Mountain
More informationIntroduction to Electronic Design Automation. Model of Computation. Model of Computation. Model of Computation
Introduction to Electronic Design Automation Model of Computation Jie-Hong Roland Jiang 江介宏 Department of Electrical Engineering National Taiwan University Spring 03 Model of Computation In system design,
More informationFPGA Design Challenge :Techkriti 14 Digital Design using Verilog Part 1
FPGA Design Challenge :Techkriti 14 Digital Design using Verilog Part 1 Anurag Dwivedi Digital Design : Bottom Up Approach Basic Block - Gates Digital Design : Bottom Up Approach Gates -> Flip Flops Digital
More informationRetiming Arithmetic Datapaths using Timed Taylor Expansion Diagrams
Retiming Arithmetic Datapaths using Timed Taylor Expansion Diagrams Daniel Gomez-Prado Dusung Kim Maciej Ciesielski Emmanuel Boutillon 2 University of Massachusetts Amherst, USA. {dgomezpr,ciesiel,dukim}@ecs.umass.edu
More informationInternational Journal of Research in Computer and Communication Technology, Vol 4, Issue 11, November- 2015
Design of Dadda Algorithm based Floating Point Multiplier A. Bhanu Swetha. PG.Scholar: M.Tech(VLSISD), Department of ECE, BVCITS, Batlapalem. E.mail:swetha.appari@gmail.com V.Ramoji, Asst.Professor, Department
More informationEvolution of Implementation Technologies. ECE 4211/5211 Rapid Prototyping with FPGAs. Gate Array Technology (IBM s) Programmable Logic
ECE 42/52 Rapid Prototyping with FPGAs Dr. Charlie Wang Department of Electrical and Computer Engineering University of Colorado at Colorado Springs Evolution of Implementation Technologies Discrete devices:
More informationPower and Area Efficient Implementation for Parallel FIR Filters Using FFAs and DA
Power and Area Efficient Implementation for Parallel FIR Filters Using FFAs and DA Krishnapriya P.N 1, Arathy Iyer 2 M.Tech Student [VLSI & Embedded Systems], Sree Narayana Gurukulam College of Engineering,
More information