RISC IMPLEMENTATION OF OPTIMAL PROGRAMMABLE DIGITAL IIR FILTER

Similar documents
VLSI Implementation of Low Power Area Efficient FIR Digital Filter Structures Shaila Khan 1 Uma Sharma 2

FPGA based Simulation of Clock Gated ALU Architecture with Multiplexed Logic Enable for Low Power Applications

FPGA Implementation of 2-D DCT Architecture for JPEG Image Compression

REAL TIME DIGITAL SIGNAL PROCESSING

Prachi Sharma 1, Rama Laxmi 2, Arun Kumar Mishra 3 1 Student, 2,3 Assistant Professor, EC Department, Bhabha College of Engineering

Implementation of a Low Power Decimation Filter Using 1/3-Band IIR Filter

An Efficient FPGA Implementation of the Advanced Encryption Standard (AES) Algorithm Using S-Box

VHDL for Synthesis. Course Description. Course Duration. Goals

DIGITAL VS. ANALOG SIGNAL PROCESSING Digital signal processing (DSP) characterized by: OUTLINE APPLICATIONS OF DIGITAL SIGNAL PROCESSING

Analysis of Radix- SDF Pipeline FFT Architecture in VLSI Using Chip Scope

MCM Based FIR Filter Architecture for High Performance

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

DESIGN AND IMPLEMENTATION OF DA- BASED RECONFIGURABLE FIR DIGITAL FILTER USING VERILOGHDL

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume 10 /Issue 1 / JUN 2018

Head, Dept of Electronics & Communication National Institute of Technology Karnataka, Surathkal, India

CAD SUBSYSTEM FOR DESIGN OF EFFECTIVE DIGITAL FILTERS IN FPGA

Parallel FIR Filters. Chapter 5

ISSN (Online), Volume 1, Special Issue 2(ICITET 15), March 2015 International Journal of Innovative Trends and Emerging Technologies

IMPLEMENTATION OF AN ADAPTIVE FIR FILTER USING HIGH SPEED DISTRIBUTED ARITHMETIC

Method We follow- How to Get Entry Pass in SEMICODUCTOR Industries for 3rd year engineering. Winter/Summer Training

International Journal for Research in Applied Science & Engineering Technology (IJRASET) IIR filter design using CSA for DSP applications

A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN

Implementation of FFT Processor using Urdhva Tiryakbhyam Sutra of Vedic Mathematics

Verilog for High Performance

Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm

A HIGH PERFORMANCE FIR FILTER ARCHITECTURE FOR FIXED AND RECONFIGURABLE APPLICATIONS

COMPARISON OF DIFFERENT REALIZATION TECHNIQUES OF IIR FILTERS USING SYSTEM GENERATOR

FPGA Implementation of 16-Point Radix-4 Complex FFT Core Using NEDA

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

Review Study on a Design of Finite Impulse Response Filter

INTEGER SEQUENCE WINDOW BASED RECONFIGURABLE FIR FILTERS.

Design of a Pipelined 32 Bit MIPS Processor with Floating Point Unit

Design of Adaptive Filters Using Least P th Norm Algorithm

The Efficient Implementation of Numerical Integration for FPGA Platforms

Design, Analysis and Processing of Efficient RISC Processor

FPGA Implementation of Discrete Fourier Transform Using CORDIC Algorithm

VHDL Design and Implementation of ASIC Processor Core by Using MIPS Pipelining

A Novel Design of 32 Bit Unsigned Multiplier Using Modified CSLA

VHDL MODEL OF SMART SENSOR

Research Article International Journal of Emerging Research in Management &Technology ISSN: (Volume-6, Issue-8) Abstract:

DESIGN AND IMPLEMENTATION OF VLSI SYSTOLIC ARRAY MULTIPLIER FOR DSP APPLICATIONS

Design and Implementation of 5 Stages Pipelined Architecture in 32 Bit RISC Processor

Binary Adders. Ripple-Carry Adder

Evaluation of High Speed Hardware Multipliers - Fixed Point and Floating point

Register Transfer Methodology II

Outline. Register Transfer Methodology II. 1. One shot pulse generator. Refined block diagram of FSMD

Fatima Michael College of Engineering & Technology

Design of Feature Extraction Circuit for Speech Recognition Applications

VHDL Implementation of DIT-FFT using CORDIC

FPGA Implementation of MIPS RISC Processor

Design and Implementation of VLSI 8 Bit Systolic Array Multiplier

Implimentation of A 16-bit RISC Processor for Convolution Application

Design of Convolution Encoder and Reconfigurable Viterbi Decoder

Implementation of Lifting-Based Two Dimensional Discrete Wavelet Transform on FPGA Using Pipeline Architecture

ASSEMBLY LANGUAGE MACHINE ORGANIZATION

FPGA Based Implementation of Pipelined 32-bit RISC Processor with Floating Point Unit

Field Programmable Gate Array (FPGA)

FPGA Based Design and Simulation of 32- Point FFT Through Radix-2 DIT Algorith

CONTACT: ,

Batchu Jeevanarani and Thota Sreenivas Department of ECE, Sri Vasavi Engg College, Tadepalligudem, West Godavari (DT), Andhra Pradesh, India

Digital Signal Processing with Field Programmable Gate Arrays

AnEfficientImplementationofDigitFIRFiltersusingMemorybasedRealization

FPGA Implementation of Multiplierless 2D DWT Architecture for Image Compression

CHAPTER 4. DIGITAL DOWNCONVERTER FOR WiMAX SYSTEM

High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications

VHDL Implementation of Multiplierless, High Performance DWT Filter Bank

FPGA for Software Engineers

Implementation of Low Power High Speed 32 bit ALU using FPGA

Vertical-Horizontal Binary Common Sub- Expression Elimination for Reconfigurable Transposed Form FIR Filter

INTRODUCTION TO CATAPULT C

DC57 COMPUTER ORGANIZATION JUNE 2013

Optimized Design and Implementation of a 16-bit Iterative Logarithmic Multiplier

DESIGN OF HIGH PERFORMANCE LOW POWER 32 BIT RISC PROCESSOR

Area And Power Efficient LMS Adaptive Filter With Low Adaptation Delay

RUN-TIME RECONFIGURABLE IMPLEMENTATION OF DSP ALGORITHMS USING DISTRIBUTED ARITHMETIC. Zoltan Baruch

Chapter 5. Hardware Software co-simulation

VLSI DESIGN OF REDUCED INSTRUCTION SET COMPUTER PROCESSOR CORE USING VHDL

Hardware Implementation of Cryptosystem by AES Algorithm Using FPGA

High Level Abstractions for Implementation of Software Radios

FPGA IMPLEMENTATION OF FLOATING POINT ADDER AND MULTIPLIER UNDER ROUND TO NEAREST

Low Power Complex Multiplier based FFT Processor

ECE 341 Midterm Exam

FPGA architecture and design technology

An introduction to DSP s. Examples of DSP applications Why a DSP? Characteristics of a DSP Architectures

Novel Design of Dual Core RISC Architecture Implementation

High Speed Special Function Unit for Graphics Processing Unit

A Dedicated Hardware Solution for the HEVC Interpolation Unit

RTL Coding General Concepts

Analytical Approach for Numerical Accuracy Estimation of Fixed-Point Systems Based on Smooth Operations

High speed Integrated Circuit Hardware Description Language), RTL (Register transfer level). Abstract:

STUDY OF A CORDIC BASED RADIX-4 FFT PROCESSOR

Introduction to Microcontrollers

CHAPTER 7 FPGA IMPLEMENTATION OF HIGH SPEED ARITHMETIC CIRCUIT FOR FACTORIAL CALCULATION

Design & Analysis of 16 bit RISC Processor Using low Power Pipelining

An Efficient Design of Vedic Multiplier using New Encoding Scheme

Design and Implementation of 3-D DWT for Video Processing Applications

Simultaneous Optimization of Delay and Number of Operations in Multiplierless Implementation of Linear Systems

Multi Cycle Implementation Scheme for 8 bit Microprocessor by VHDL

קורס VHDL for High Performance. VHDL

An Efficient Implementation of Fixed-Point LMS Adaptive Filter Using Verilog HDL

Transcription:

RISC IMPLEMENTATION OF OPTIMAL PROGRAMMABLE DIGITAL IIR FILTER Miss. Sushma kumari IES COLLEGE OF ENGINEERING, BHOPAL MADHYA PRADESH Mr. Ashish Raghuwanshi(Assist. Prof.) IES COLLEGE OF ENGINEERING, BHOPAL MADHYA PRADESH Abstract: This paper is based on the implementation of Reduced Instruction set computer with the application of Discrete Cosine transform (DCT), Inverse DCT, Discrete Fourier Transform (DFT) and Fast Fourier Transform (FFT), Digital filter are performed by DSP system. Digital filter is one of the important contents of digital signal process. The performance of the processor design is improved by using the pipeline approach. It allows the processor to work on different steps of the instruction at the same time, thus more instruction can be execute in a shorter period of time. The analysis of this processor will provide various features including arithmetic operations. The speed of operation is mainly affected by the computational complexity due to multipliers and adder modules of the digital systems. Our work will targets the computer architecture courses and presents an FPGA (Field Programmable Gate Array) implementation of a MIPS (Microprocessor without Interlocked Pipeline Stages) via VHDL (Very high speed integrated circuit Hardware Description Language) design. The latency and computational time is utmost important in microprocessor. Thus we design the multiplier and adder module with improve latency and computational time. Key Words: RISC, IIR, FPGA, DSP Processor, Direct Form I, Direct Form II, VHDL, Parallel, Cascade Structure Introduction: Conventionally, the digital filter, Discrete Fourier and Z transform applications are implemented either using general purpose DSP processors (low speed, less expensive, flexible) or using Application Specific Integrated Circuits (ASIC) which offer high speed but are expensive and less flexible. An alternate approach is to use reduce instruction set computer (RISC) as they provide solutions that maintain both the advantages of the approach based on DSP processors and the approach based on microprocessor. The RISC decodes the instruction format and execute one instruction per cycle. By pipeline approach it fetches, decoded, and execute two or three instructions at the time. Instructions are of fixed number of bytes and take fixed amount of time for execution with lesser amount of circuitry. Minimisation of hardware resources by utilisation of symmetrical coefficient FIR filter properties will reduce combinational logic signal path with balanced adder trees and pipelining. Pipelining balance the delay balancing needed in high data rate applications. It also reuses multiplication results in Direct cascade and parallel form IIR structure. Principles of IIR digital filter: Digital filter is actually a linear time-invariant discrete system using finite precision algorithms, and its function is actually achieved by a large number of addition and multiplication operations [4]. IIR digital 262

filters are recursive systems that involve fewer design parameters, less memory requirements, and lower computational complexity than finite impulse response (FIR) digital filters. It is characterized by the general linear constantcoefficient difference equation as follows: multiplications, M+N additions, and the maximum of {M,N} memory locations. Because the Direct-Form II structure requires less memory locations than the Direct- Form I structure, it is referred to as being canonical. Figure above shows an IIR digital filter in Direct-Form II format [4]. Transforming this difference equation into the z-domain by means of the z-transform, such a class of linear time-invariant discrete-time systems is also characterized by the transfer function as follows: (z) = Different structures of IIR filters are described by the difference equation in above. These structures are referred to as direct-form realizations. It should be noted that although these structures are different from one another by design, they are all functionally equivalent. Three prominent direct-form realizations are the Direct-Form I, the Direct-Form II, and the Transposed Direct-Form II structures. In terms of hardware implementation, the Direct- Form I structure requires M+ N+ 1 multiplication, M+N additions, and M+N+1 memory locations. Fig 2 Direct form II realization Implementation of Digital filters: The aim of digital filter design is to removing noise, shaping spectrum, and minimizing intersymbol interference in communication architectures and is use to find a suitable structure which passes requirements and minimizes numerical and quantization errors. There are few types of realizations and their combinations which can be used for implementation: 1. Universal processor: There operation is usually processed in a few clock cycles in universal processors so the computation performance is low. 2. Digital signal processor (DSP processor) with instruction set optimized for signal processing. 3. Special hardware such as Field programmable gate array (FPGA) or Application-specific integrated circuit (ASIC). 4. Reduce Instruction Set Computer (RISC) These realizations are described below and the advantages and disadvantages are summarized in Table below. Fig 1 Direct form I realization The Direct-Form II structures require M+N+1 263

Table I: Comparative analysis of our work with base papers. Type of Advantages Disadvantag Implementati es on Universal processor DSP processor FPGA RISC low cost Speed increases as the number of calculation cycle decreases, simple implementati on highest performance Reduce Instructions Computation of one output sample in one clock cycle. The fastest architecture Low Performance but architecture limitations Area depends on the number of calculation cycles selected more complicated implementati on, higher power consumption Though the RISC has less instruction set, as its the bit processing size increases then the test pattern becomes complicated and the structural faults are maintained high. RISC: Reduce instruction set computer (RISC) is a microprocessor with few instruction set with simple addressing modes that takes same amount of time to execute. Thus it has the faster architecture. It has superscalar and pipelining architecture. The module design of RISC includes the Register, Instruction Memory, Data Memory, instruction fetch unit, instruction decode unit, the control unit, and execution unit. Instruction unit fetches the instruction code as per the address pointed by program counter register. It includes the incrementer logic to increment the content of program counter after every instruction byte execution. The instruction decoder decodes the instruction provided from the instruction fetch unit. The control unit generates the control signals for the operation of the arithmetic and logical operations. The memory read and write signals are asserted for the control unit is use to access the data or write the data in memory. Implementation tool Xilinx and VHDL: VHDL (Very High Speed Integrated Hardware Description Language) is a very powerful, high-level, concurrent programming language. At the implementation level we can build structural models using component instantiation statements that connect and invoke subcomponents. The VHDL generate statement provides ease of block replication and control. A dataflow level of description offers a combination of the behavioral and structural levels of description. Methodology for IIR implementation in RISC First of all, we convert the model design files into corresponding VHDL design files using Signal Compiler, compile the.vhd files under Xilinx integrated environment, and conduct timing simulation to simulate and verify the design files. (1) Convert the digital filter specifications into an Binary representations. (2) Extract the filter coefficients. (3) We use a parallel-processing filter implementation, in which parallel IIR paths are formed, 264

4) We use multiplier and/or adder, in parallel approach which is faster. (5) Pipelining, a standard feature in RISC processors, is a technique used to improve both clock speed and overall performance. (6) Pipelining allows a processor to work on different steps of the instruction at the same time, thus more instruction can be executed in a shorter period of time. (7) When our design processor is pipelined, during a single clock cycle each one of design modules or stages is in use at exactly the same time executing on different instructions in parallel. Design Flow diagram: Fig 3 RTL view of Parallel structure IIR filter The fig 2 shows the RTL view of the parallel structure of IIR filter. Parallel-form realization of an IIR digital filter can be obtained by performing a partial-fraction expansion on the transfer function H(z). Performing this mathematical function produces the resulting transfer function in the form as follows: H(z) = C + The function Hk(z) is in 2nd order form as follows: (z) = s Design and its simulation: The processor implementation consists of two main types of logic elements: combinational and sequential elements. Combinational elements are elements that operate on data values, meaning that their outputs depend on the current status of inputs. Such elements in the processor implementation include the arithmetic logic unit (ALU) and memory. Sequential elements are elements that depend on current status of input and previous status of output i.e. it acts as latch. Each state element has at least two inputs and one output. The two inputs are the data value to be written and a clock signal. The output signal provides the data values that were written in an earlier clock cycle. Logic units are design and implement in VHDL. Transfer function H(z) is generally composed of poles and coefficients (residues) of the partial-fraction expansion. A more direct result of the partial-fraction expansion of H(z) yields the functional equivalent as follows H(z) = C + Fig 4 RTL view of Cascade structure IIR filter 265

Cascade-form realization of an IIR digital filter can be obtained by performing mathematical factoring on the transfer function H(z) into a cascade of 2nd order sub-blocks. The resulting transfer function can then be expressed as: H(z) = where K is the integer part of (N+1)/2 and Hk(z) has the 2nd order form as follows: (z) = Since this is a cascade system, the matter of grouping together a pair of complex-conjugate poles and a pair of complex-conjugate zeros becomes extremely critical. The output of a quantized digital filter is strongly dependent on the sequential ordering of the K sub-blocks as well as the exact way in which numerator and denominator sections are paired together. Arbitrarily grouping these terms can be performed on the part of the system designer but at the cost of a potentially high output noise variance. Fig 6 Timing simulation of coefficient multiplier unit. The multiplier module is design for coefficients multiplication in the signal flow graph of direct form structures in IIR filter design. The RTL view of the multiplier module requires 818 gate counts. The comparative analysis in tabulation format is shown below in table I, II and III. Table II: Utilization of Cells, Registers, Blocks and Slices in design. design Cell uses Regis ters Multiplic ations blocks Sli ces Multiplier 940 0 1 7 Parallel Structure 34 0 7 1 Cascade structure 34 0 8 1 Controller logic 468 49 0 3 Program counter 158 16 0 3 SRAM 2106 1170 0 9 Fig 5 Timing simulation of Z -1 Delay module By the time shifting property the time delay of signal by one sample is represented by Z -1 in Z domain. The fig 4 shows timing simulation of the delay module. 266

Table III: Latency computational time comparative analysis with related work design Multiplier Parallel Structure Cascade structure Controller logic Latency computational time Our work Reference Paper [1] Reference paper [2] 0.1us 3.0us 1.62us 0.1us 5.1us 1.62us 0.1us 5.1us 1.62us 0.1us 3.0us 1.62us Table IV: Comparative analysis of synthesis results with related work Utilization Number of slice registers Number used as logic cells Number of slice LUTs Our Reference work paper[3] 1235 2191 3740 700 700 997 Conclusion: The goal of this thesis is to enhance the simulator based approach by integrating RISC hardware implementation to help the digital IIR filter with the improved latency, computation time 0.1us and throughput. Efficiency is measured in terms of the number of multiplications required for each output sample point, for this we use RISC implementation because it have reduce Instructions Computation of one output sample in one clock cycle, thus it is fastest architecture. Utilization of number of Slice registers is 1235 and number of slice LUTs is 700. References: [1] Sheikh Md. Rabiul Islam, Robin Sarker, Shumit Saha, A. F. M. Nokib Uddin " Design of a programmable digital IIR filter based on FPGA" IEEE International Conference on Infonnatics, Electronics & Vision ICIEV year 2012. [2]Asma Hemdani,M.Wissem Naouar,Ilhem Slama Belkhodja,Eric Monmasson,"Design of a Digital IIR Filter for Active Filtering Applications"IEEE international conference in year 2012. [3]Sufeng,Semih and Jafar Saniie"FPGA based Architectures for High Performance Adaptive FIR Filter Systems"IEEE International Conference 2013. [4] ZHANG Jian-ping, PAN Ling-ling, DING Quan-fei "IIR Digital Filter Design Based on DSP and SOPC" International Symposium on Intelligence Information Processing and Trusted Computing year 2010. [5] Amit Kumar Singh Tomar, Rita Jain "20- Bit RISC & DSP System Design in an FPGA" IEEE international conference in year 2013. [6] Chaohua Dai, Weirong Chen, and Yunfang Zhu "Seeker Optimization Algorithm for Digital IIR Filter Design" IEEE Transactions On Industrial Electronics, Vol. 57, No. 5, May 2010 pp no. 1710. [7] Shing-Tai Pan, Member, IEEE" Evolutionary Computation on Programmable Robust IIR Filter Pole-Placement design"ieee transactions on instrumentation and measurement, vol.60, no.4,april2011. [8] Masayoshi Nakamoto,and Shuichi Ohno,"Design of Multi-Band Digital Filters and Full-Band Digital Differentiators Without Frequency Sampling and Iterative Optimization ieee transactions on industrial electronics, vol. 61, no. 9, september 2014. [9] Joaquín,J. Carlos, Daniel, Luis, Pedro, "Effects of Intermittent Faults on the Reliability of a Reduced Instruction Set Computing (RISC) Microprocessor ieee transactions on reliability. [10] S. Salivahanan, A. Vallavaraj And C. Gnanapriya, Digital Signal Processing,Tata McGraw - Hill, New Delhi 2009. 267