A Novel Argument to Use 8-BIT Media Processor for Low Power VLSI Design

Similar documents
HARDWARE IMPLEMENTATION OF PIPELINE BASED ROUTER DESIGN FOR ON- CHIP NETWORK

VLSI ARCHITECTURE FOR NANO WIRE BASED ADVANCED ENCRYPTION STANDARD (AES) WITH THE EFFICIENT MULTIPLICATIVE INVERSE UNIT

A Novel Pseudo 4 Phase Dual Rail Asynchronous Protocol with Self Reset Logic & Multiple Reset

Evolution of CAD Tools & Verilog HDL Definition

Low Power Testing of VLSI Circuits Using Test Vector Reordering

HIGH-PERFORMANCE RECONFIGURABLE FIR FILTER USING PIPELINE TECHNIQUE

FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC)

Design and Implementation of Hamming Code on FPGA using Verilog

Design and Implementation of VLSI 8 Bit Systolic Array Multiplier

FABRICATION TECHNOLOGIES

Lab. Course Goals. Topics. What is VLSI design? What is an integrated circuit? VLSI Design Cycle. VLSI Design Automation

Design and Verification Point-to-Point Architecture of WISHBONE Bus for System-on-Chip

A REVIEW ON LOW POWER SRAM

For a long time, programming languages such as FORTRAN, PASCAL, and C Were being used to describe computer programs that were

Low Power and Memory Efficient FFT Architecture Using Modified CORDIC Algorithm

Design of 2-Bit ALU using CMOS & GDI Logic Architectures.

CHAPTER 3 METHODOLOGY. 3.1 Analysis of the Conventional High Speed 8-bits x 8-bits Wallace Tree Multiplier

International Journal of Computer Sciences and Engineering. Research Paper Volume-6, Issue-2 E-ISSN:

Evolution of Implementation Technologies. ECE 4211/5211 Rapid Prototyping with FPGAs. Gate Array Technology (IBM s) Programmable Logic

FPGA-BASED DATA ACQUISITION SYSTEM WITH RS 232 INTERFACE

A Novel Design of High Speed and Area Efficient De-Multiplexer. using Pass Transistor Logic

Overview. CSE372 Digital Systems Organization and Design Lab. Hardware CAD. Two Types of Chips

VLSI Design Automation

RTL Coding General Concepts

RIZALAFANDE CHE ISMAIL TKT. 3, BLOK A, PPK MIKRO-e KOMPLEKS PENGAJIAN KUKUM. SYNTHESIS OF COMBINATIONAL LOGIC (Chapter 8)

International Journal of Applied Sciences, Engineering and Management ISSN , Vol. 05, No. 02, March 2016, pp

DYNAMIC CIRCUIT TECHNIQUE FOR LOW- POWER MICROPROCESSORS Kuruva Hanumantha Rao 1 (M.tech)

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

Overview of Digital Design with Verilog HDL 1

Prachi Sharma 1, Rama Laxmi 2, Arun Kumar Mishra 3 1 Student, 2,3 Assistant Professor, EC Department, Bhabha College of Engineering

FPGA PROTOTYPING BY VERILOG EXAMPLES XILINX SPARTAN 3 VERSION

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume 9 /Issue 3 / OCT 2017

Low Power Circuits using Modified Gate Diffusion Input (GDI)

VERY large scale integration (VLSI) design for power

Reconfigurable PLL for Digital System

Design Verification Lecture 01

International Journal of Advance Engineering and Research Development LOW POWER AND HIGH PERFORMANCE MSML DESIGN FOR CAM USE OF MODIFIED XNOR CELL

Chapter 1 Overview of Digital Systems Design

ELCT 501: Digital System Design

Design, Analysis and Processing of Efficient RISC Processor

FPGA Implementation of I2C and SPI Protocols using VHDL

A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding

Date Performed: Marks Obtained: /10. Group Members (ID):. Experiment # 09 MULTIPLEXERS

Design and simulation of 4-bit ALU Design using GDI Technique for Low Power Application on Microwind 2.6K

VLSI Design Automation

Analysis of Power Dissipation and Delay in 6T and 8T SRAM Using Tanner Tool

FPGA based Simulation of Clock Gated ALU Architecture with Multiplexed Logic Enable for Low Power Applications

Implementation of Ripple Carry and Carry Skip Adders with Speed and Area Efficient

A 65nm Parallel Prefix High Speed Tree based 64-Bit Binary Comparator

ISSN Vol.04,Issue.01, January-2016, Pages:

Test Data Compression Using Variable Prefix Run Length (VPRL) Code

DESIGN AND IMPLEMENTATION OF VLSI SYSTOLIC ARRAY MULTIPLIER FOR DSP APPLICATIONS

Development of a Web based Education Module for Verilog HDL

Mapping real-life applications on run-time reconfigurable NoC-based MPSoC on FPGA. Singh, A.K.; Kumar, A.; Srikanthan, Th.; Ha, Y.

VLSI Design Automation. Calcolatori Elettronici Ing. Informatica

Golomb Coding Implementation in FPGA

C Program Adventures. From C code to motion

Design and Analysis of Kogge-Stone and Han-Carlson Adders in 130nm CMOS Technology

ECE 459/559 Secure & Trustworthy Computer Hardware Design

Modeling and Verifying Mixed-Signal Designs with MATLAB and Simulink

VHDL. Chapter 1 Introduction to VHDL. Course Objectives Affected. Outline

A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN

An overview of standard cell based digital VLSI design

FPGA Implementation of Double Error Correction Orthogonal Latin Squares Codes

DESIGN, LAYOUT AND SIMULATION OF 8 BIT ARITHMETIC AND LOGIC UNIT USING C5 PROCESS TECHNOLOGY

Design Methodologies and Tools. Full-Custom Design

CHAPTER 1 INTRODUCTION

Implementation of Efficient Ternary Content Addressable Memory by Using Butterfly Technique

VLSI Based 16 Bit ALU with Interfacing Circuit

EE 466/586 VLSI Design. Partha Pande School of EECS Washington State University

Jung-Lin Yang. Ph.D. and M.S. degree in the Dept. of Electrical and Computer Engineering University of Utah expected spring 2003

Anisha Rani et al., International Journal of Computer Engineering In Research Trends Volume 2, Issue 11, November-2015, pp.

DESIGN STRATEGIES & TOOLS UTILIZED

An Efficient FPGA Implementation of the Advanced Encryption Standard (AES) Algorithm Using S-Box

Memory-Efficient and High-Speed Line-Based Architecture for 2-D Discrete Wavelet Transform with Lifting Scheme

DESIGN AND IMPLEMENTATION 0F 64-BIT PARALLEL PREFIX BRENTKUNG ADDER

Power Gated Match Line Sensing Content Addressable Memory

Design and Implementation of High Security Optical Shaft Encoder for an Artificial Limb using Xilinx on FPGA

ISSN Vol.03,Issue.29 October-2014, Pages:

DESIGN AND IMPLEMENTATION OF UNIVERSAL GATE USING DG-FinFET 32NM TECHNOLOGY

Power Efficient Register Assignment

Hardware Description Languages. Introduction to VHDL

Introduction to VHDL. Module #5 Digilent Inc. Course

Efficient Comparator based Sum of Absolute Differences Architecture for Digital Image Processing Applications

Analysis of 8T SRAM with Read and Write Assist Schemes (UDVS) In 45nm CMOS Technology

Bus Matrix Synthesis Based On Steiner Graphs for Power Efficient System on Chip Communications

A Novel Architecture of SRAM Cell Using Single Bit-Line

A Comprehensive Digital Cross Connect (DCS) for Buffered and Unbuffered Switching Applications

ORIGINAL ARTICLE. Abstract. ISSN: (print) ISSN: (online)

DESIGN AND SIMULATION OF 1 BIT ARITHMETIC LOGIC UNIT DESIGN USING PASS-TRANSISTOR LOGIC FAMILIES

VLSI Design and Implementation of High Speed and High Throughput DADDA Multiplier

iphone Noise Filtration Hardware

Hardware implementation of sorting algorithm using FPGA

DOWNLOAD OR READ : VHDL FOR DIGITAL DESIGN VAHID SOLUTION BING PDF EBOOK EPUB MOBI

DUE to the high computational complexity and real-time

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

DEVELOPMENT AND VERIFICATION OF AHB2APB BRIDGE PROTOCOL USING UVM TECHNIQUE

System Verification of Hardware Optimization Based on Edge Detection

HARDWARE IMPLEMENTATION OF LOSSLESS LZMA DATA COMPRESSION ALGORITHM

Implementation of Reduced Power Open Core Protocol Compliant Memory System using VHDL

Transcription:

A Novel Argument to Use 8-BIT Media Processor for Low Power VLSI Design P. Karthigai Kumar, Member IAENG*, Dr. K. Baskaran, MIEEE**, C. Praveen Babu + ABSTRACT: A novel idea pertaining to the selection of a processor size for low power IC applications has been proposed. 8-bit processors were introduced in the beginning by Intel. Later, technology has rapidly advanced forcing the designers to go for 32-bit and 64-bit processors for edge cutting, high performance technologies. In this paper, it has been proposed that an 8 bit media processor can perform the same applications that the others do, at reduced power consumption and hence is more advantageous than other processor sizes. The main concept of the paper is that, once a VHDL or a Verilog code is designed for any media processing application, its Xilinx synthesis report is converted to transistor level net list by careful observation. This transistor level net list is implemented in Tanner Tools EDA and the power can be easily calculated. By this method, the experimental results have proved that 8-bit processor consumes less power than others and hence can be used in place of 16-bit and 32-bit processor chips. Index Terms: Low Power, Media Processor, 8-Bit Processor, Synthesis Report, Tanner Tools, Power Calculation I. INTRODUCTION Low power has always been the main and foremost goal of any chip designer. When the chip is designed for a particular application, a Hardware Description Language Code is first written which can be dumped into the IC, which then works as per the designed application. Then the IC works as per the designed application. Before dumping the code into the IC, it has to be ensured that the code which was designed consumes the least amount of power and least memory [5]. It should also be ensured that there are no hotspots which would otherwise cause uneven distribution of heat There are many methods to find the power consumption of a VHDL or a Verilog code. Some of the software s are Synopsis and Mentor Graphics. Tanner Tools is an Electronic Automation Tool, which provides a schematic design entry, whose power [7] can be calculated. It is simple and can be downloaded easily through the internet. Manuscript received December 21, 2007. This work was supported financially in part by the Karunya University. * Faculty in Electronics and Communication Engineering Department, Karunya University, Coimbatore (Email: karthi_kumar_p@rediffmail.com) ** Faculty in Computer Science and engineering, Government college of Technology, Coimbatore + PG Student in Electronics and Communication Engineering, Karunya University, Coimbatore A. Media encryption processors at the chip level Media processing has seen the most rapid development in recent years, especially in media encryption for data security. Hence media processors [1] are now being designed at the chip level on a large scale. 8-bit processors were introduced in the beginning by Intel. Later, technology has rapidly advanced forcing the designers to go for 32-bit and 64-bit processors for edge cutting, high performance [6] media technologies [4]. This paper proposes that an 8 bit media processor [3] can perform the same applications that the others do, at reduced power consumption and hence is much more advantageous than other processor sizes. The proposal of this paper is that once a VHDL or a Verilog code is designed for any media processing [8] application, its Xilinx synthesis report is converted to transistor level net list by careful observation. This transistor level net list is implemented in Tanner Tools EDA and the power can be easily calculated. Our results have shown that an 8-bit processor consumes lesser power, while performing the same functions and hence highly efficient than the other processor sizes. II. PROBLEM FORMULATION 32-bit and 16-bit processors are used these days to lure people towards high technology [2] and edge cutting media applications. But these processors consume high amount of power. Even though software s like Mentor Graphics Synopsis are available to find the power consumption of a VHDL or a Verilog Code, they are very costly and cannot be readily available for a normal student for his research. Hence this paper aims to bring out an argument that 8 bit processor can be used instead and it consumes less amount of power than the others. An easy method of power calculation is also introduced. III. METHODOLOGY OF THE PAPER An 8 bit audio and a 16 bit video data in binary form are the incoming signals. The media processor has to reconfigure itself to the incoming bit stream and invert the bits and send them. Firstly, the Verilog code is designed on a 8 bit chip, for the simple application, operating on customized wordlength audio and video data. The main concept is that all operations take place on 8 bit signals and 8 bit buses.

V=5.0 video_in1 video_in2 video_in3 video_in4 video_in5 video_in6 video_in7 video_in8 audio_in_11 audio_in_12 audio_in_13 audio_in_14 audio_in_15 audio_in_16 audio_in_17 audio_in_18 audio_in_21 video_out audio_in_22 audio_in_23 audio_in_24 audio_in_25 audio_in_26 audio_in_27 audio_in_28 audio_out Proceedings of the International MultiConference of Engineers and Computer Scientists 2008 Vol I The 16 bit audio data is handled by two separate 8 bit data path s in this case. The 8 bit video data is handled directly. Secondly, the Verilog code is designed on a 16 bit chip, for the simple application, operating on customized word-length audio and video data. The main concept is that all operations take place on 16 bit signals and busses. The 16 bit audio data is handled directly. For the 8 bit video data, 8 bits of a data-path are used and the rest are used for other purposes like carrying control signals. Thirdly, the Verilog code is designed on a 32 bit chip, for the simple application, operating on customized word-length audio and video data. The main concept is that all operations take place on 32 bit signals and busses. Both 8 bit video and 16 bit audio are handled by 32 bit busses, the free bits are used for control purposes. The synthesis reports of all the three codes are studied carefully and their corresponding transistor level net lists are drawn. These transistor level net lists are implemented in Tanner Tool EDA and the schematic entry is done for all the three codes. Once the schematic entry is done the T-Spice is run which will give the net-list. The power consumption for each circuit is calculated by the command IV. EXPERIMENTAL RESULTS The application selected in this paper is related to media processing. We consider an 8 bit video and a 16 bit audio digital data coming together. Normally, sound travels faster than video. Hence audio data reaches the processor first. Once the processor detects the audio data, it starts operating on the data. A simple media operation Inversion of the bits is taken in our application. Once video signal is received, the processor has to switch immediately to the video and perform both audio and video inversion simultaneously. Once a particular data stops, the processor has to use its free resources to service the other data. The main concept that has been introduced in the coding was that the 8 bit processor handles the 8 bit video data directly. The 16 bit audio data is handled by two 8 bit data paths. It has been thus proved that 8 bit processor can be used for the same high-end media operations with the added advantage that it consumes less amount of power. Hence the paper makes an argument that instead of using higher processor sizes, 8 bit processor could be used to perform 16 or even higher bit operations as any other processor size with efficiency and reduced power consumption. This code is designed for an 8 bit, 16 bit and a 32 bit chip separately and the corresponding tanner implementations for 8 bit, 16 bit and 32 bit processor are shown in Figure.1, Figure.2, and Figure.3 respectively. Also the power consumption waveforms for 8 bit, 16 bit and 32 bit processor is shown in Figure.4, Figure.5, Figure.6 respectively and the comparison results are shown in Table.1..model pmos pmos.model nmos nmos.tran 4n 400n.print p( / output node number / ) Converting the net lists into circuit level entry This is done by designing a Verilog program for a simple DSP application of Inversion of bits on an 8 bit, 16 bit and a 32 bit processor assuming that the incoming audio signal is of 16 bits and video signal is of 8 bit length and converting the Xilinx synthesis report to transistor level net list and implementing it in Tanner tool and hence select which processor size gives low power. Experimental results show that an 8 bit processor consumes less power than the others. Figure 1. Tanner implementation for an 8 bit processor inverting the 8 bit video and 16 bit audio data

audio_in_1 audio_in_2 audio_in_3 audio_in_4 audio_in_5 audio_in_6 audio_in_7 audio_in_8 audio_in_9 audio_in_10 audio_in_11 audio_in_12 audio_in_13 audio_in_14 audio_in_15 audio_in_16 audio_out video_in1 video_in2 video_in3 video_in4 video_in5 video_in6 video_in7 video_in8 padbits_1 padbits_2 padbits_3 padbits_4 padbits_5 padbits_6 padbits_7 padbits_8 V= 5.0 video_out Figure 2. Tanner implementation for a 16 bit processor inverting the 8 bit video and 16 bit audio data audio_in_1 audio_in_2 audio_in_3 audio_in_4 audio_in_5 audio_in_6 audio_in_7 audio_in_8 audio_in_9 audio_in_10 audio_in_11 audio_in_12 audio_in_13 audio_in_14 audio in 15 padbits1 padbits2 padbits3 padbits4 padbits5 padbits6 padbits7 padbits8 pad1 pad2 pad3 pad4 pad5 pad6 pad7 pad8 audio_out video_in_1 video_in_2 video_in_3 video_in_4 video_in_5 video_in_6 video_in_7 video_in_8 video_out V=5.0 Proceedings of the International MultiConference of Engineers and Computer Scientists 2008 Vol I Figure 3. Tanner implementation for a 32 bit processor inverting the 8 bit video and 16 bit audio data

Figure 4. Power waveform for an 8 bit chip inverting the 8 bit video and 16 bit audio data Figure 5. Power waveform for a 16 bit chip inverting the 8 bit video and 16 bit audio data

Figure 6. Power waveform for a 32 bit chip inverting the 8 bit video and 16 bit audio data Table 1. Experimental results for different processor Size to invert 8 bit video and 16 bit audio data PROCESSOR SIZE POWER CONSUMPTION(mW) 8 BIT 118 16 BIT 125 32 BIT 135

From the Table 1, it can be proved that an 8 bit chip consumes less amount of power than the other processor sizes. But using 8 bit data-paths would slow the system. Hence techniques like pipelining and parallel processing can be used. At the outset, this paper makes a strong argument to use an 8 bit processor size for high end VLSI Low Power media applications. V. CONCLUSIONS The paper brings to light a novel argument of the use of a 8 bit processor in place of 32 bit or 64 bit processors, for low power chip design. The main point of consideration is that the designer has to carefully extract the net list information so that his circuit level interpretation is correct and as per the designed HDL Code. But it is advisable to implement this method for small codes of small experimental applications, since as the size of the code increases, the schematic size increases enormously. This method, even though is complex, is reliable, simple and cheap. The results have proved that an 8 bit processor chip can do all the functions at good speed, but has the least power consumption. Hence it can be concluded that 8 bit processor can still be used for high edge media applications. ACKNOWLEDGEMENT The authors would like to thank the University management for providing all the tools necessary to carry out the research successfully. REFERENCES [1] Jer Min Jou, Yun Lung Lee, Chen Yen Lin and Chien Ming Sun, A Novel Reconfigurable computation unit for DSP applications, IEEE comp. society annual symp. on VLSI, pp 439-444, ISVLSI 07. [2] Navid Lashkarian, Ed Hemphi, Helen Tarn, Hemang Parekh and Chris Dick, Reconfigurable Digital Front End Hardware for wire less base-station transmitters: Analysis, Design and FPGA implementation, IEEE transactions on circuits and systems, vol 54, No. 8, pp 1666-1677, Aug 2007. [3] Huijuan Yang and Alex.C.Kot, Pattern based data hiding for Binary Image Authentication by Connectivity- Preserving, IEEE Transactions on multimedia, vol 9, no. 3, pp 475-486, APRIL 2007. [4] Giorgos Dimitrakopoulos, Christos Mavrokefalidis, Kostas Galanopoulos and Dimitris Niolos, Sortor based permutation units for Media-Enhanced Processors IEEE Transactions on VLSI systems, vol 15, no. 6, pp 711-715, June 2007. [5] Florin Balasa, Hongwei Zhu and Ilie.I.Lucian, Computation of storage requirements for Multi dimensional signal processing applications, IEEE transactions on VLSI systems, vol 13, no 5, pp 447-460, April 2007. [6] Ismail Kadayif, Partho Nath, Mahmut Kandemir and Anand Sivasubramaniam, Reducing data TLB power via compiler directed Address Generation, IEEE transactions on Comp. Aided design of IC s and systems, vol 26, no 2, pp 312-324, Feb 2007. [7] Alexandru Andrei, Petru Elas, Zebo Peng, Marcus.Y.Schmitz and Bashir M Al Hashni, Energy optimization of multiprocessor systems on chip by voltage selection, IEEE transactions on VLSI systems, vol 15, no 3, pp 262-275, Mar 2007. [8] Frank Zhigang Wang, Sining Wu, Na Helian, Michael Andrew Parker, Yike Guo and Yuhui Deng and Vineet.R.Khare, Grain Oriented Storage: A single-image cross domain. High bandwidth architecture, IEEE transactions on computers, vol 56, no 4, pp 474-487, April 2007.