Design and Implementation o 64 bit RISC Processor on FPGA

Size: px
Start display at page:

Download "Design and Implementation o 64 bit RISC Processor on FPGA"

Transcription

1 Design and Implementation o 64 bit RISC Processor on FPGA Mr. Mohammad Gousuddin H Maniyar 1 Department of Electronic & Communication Engineering R V College of Engineering College,Bengulur Mrs. Sujatha Hiremath 2 Assistant Professor Department of Electronic & Communication Engineering R V College of Engineering College,Bengulur ABSTRACT In the early years this computer was stack architecture, later replaced by a RISC architecture. Now the intent is to replace the hypothetical, emulated computer by a real one. This idea was made realistic by the advent of programmable hardware components called field programmable gate arrays (FPGA). Complex Instruction Set Computer (CISC) and Reduced Instruction Set Computer (RISC). The CISC concept is an approach to the Instruction Set Architecture (ISA) design that emphasizes doing more with each instruction using a wide variety of addressing modes, number of operands in various locations in its Instruction Set. As a result, the instructions are of widely varying lengths and execution times thus demanding a very complex Control Unit, which occupies a large real estate on chip. On the other hand, the RISC Processor have reduced number of Instructions, fixed instruction length, more general purpose registers, load-store architecture and simplified addressing modes which makes individual instructions execute faster, achieve a net gain in performance and an overall simpler design with less silicon consumption as compared to CISC. The fact that real processors are far more complex than the one presented here. It is concentrate on the fundamental concepts rather than on their elaboration. The strive for a fair degree of completeness of facilities, but refrain from their optimization. In fact, the dominant part of the vast size and complexity of modern processors and software is due to speed-up called optimization (improvement would be a more honest word). It is the main culprit in obfuscating the basic principles, making them hard, if not impossible to study. In this light, the choice of a RISC (Reduced Instruction Set Computer) is obvious. The use of an FPGA provides a substantial amount of freedom for design. Yet, the hardware designer must be much more aware of availability of resources and of limitations than the software developer. Also, timing is a concern that usually does not occur in software, but pops up unavoidably in circuit design. I. INTRODUCTION RISC Processor is increasing widely used in every field. Most microprocessor in today s market is based on either RISC or CISC architecture technologies. The RISC architecture boost the computer speed also used in control algorithms.using of RISC Processor the time required to execute each instruction can be shortened and the number of cycles reduces. The depicts a RISC building design in which 2 cycle operation is gotten utilizing a pipelined outline. The pipelined architecture is used which minimizes the latency and increases the speed and in the new innovation the 2 stage pipelining which works on the positive edge and as well as in the negative edge minimizes the latency and increases the speed and also reduce the stalling in instruction. The whole architecture of RISC processor work on the 2 cycle.the fixed size of instruction allows the given instruction to be easily piped. RISC processor has a flexible architecture. The clock gating technique is used to minimize the Power. The CISC concept is an approach to the Instruction Set Architecture (ISA) design that emphasizes doing more with each instruction using a wide variety of addressing modes, number of operands in various locations in its Instruction Set. As a result, the instructions are of widely varying lengths and execution times thus demanding a very complex Control Unit, which occupies a large real estate on chip. On the other hand, the RISC Processor have reduced number of Instructions, NCICT-2016 Special Issue 2 Page 167

2 fixed instruction length, more general purpose registers, load-store architecture and simplified addressing modes which makes individual instructions execute faster, achieve a net gain in performance and an overall simpler design with less silicon consumption as compared to CISC. The fact that real processors are far more complex than the one presented here. It is concentrate on the fundamental concepts rather than on their elaboration. The strive for a fair degree of completeness of facilities, but refrain from their optimization.this is a most Popular method to reduce the dynamic Power consumption. Power is devoured by the combinational rationale whose qualities are changing on each one clock edge so the gating rationale comes into the Picture and clock is turned off. In the present research work, the outline of 16 bit RISC processor is introduced; implemented for high efficiency and low power [2]. The architecture supports 33 instructions. The instruction cycle consist of 2 stage pipelining and perform fetch, decode, execute, write back operation simultaneously. The control unit Generate signals from the given instructions. The architecture supports arithmetic, logical, shifting and rotation operations. On the other hand, RISC processor requires very few data types and performs the simple operations. It also supports very few addressing modes and mostly based on registers. Many of the instructions operate on data which are present in internal registers. LOAD and STORE instructions are the only instructions which access data from external memory. Here decoding becomes easier, since the instruction length is fixed. Execution of instructions in parallel using the pipelined stages will improves the overall throughput of the processor but it will introduces some of the hazards in its working operation. Data hazards are those hazards which are generated due to sharing of source and destination resources in succeeding instructions, this will happen in the case when the source for an instruction is destination for previous instruction [1]. This can be prevented by using the forwarding method. Structural hazards are generated when the program and data memory is used commonly. By designing the prefetch queue processor, structural hazards can be removed. RISC architecture s analysis gives many important issues in computer architecture. Most RISC computers have the same key elements: A limited and simple instruction set. A large number of GPR s (General Purpose Registers). Optimization of the instruction pipeline. Organization of RISC computers can be represented by three main components: ALU (Arithmetic Logic Unit): performs the actual computation and processing of data. CU (Control Unit): controls the movement of data and instructions into and out of the CPU and controls the operation of the ALU. RS (Register Set): a minimal internal memory, which consists of a set of storage locations. Comparing to CISC, RISC CPU have more advantages, such as faster speed, simplified structure easier implementation. RISC CPU is extensive use in embedded system. Therefore, designing of RISC CPU based on MIPS is the necessary choice II. LITERATURE SURVEY Most microprocessor in today s market is based on either RISC or CISC architecture technologies. The RISC architecture boost the computer speed also used in control algorithms.using of RISC Processor the time required to execute each instruction can be shortened and the number of cycles reduces. In [1] paper depicts a RISC building design in which 2 cycle operation is gotten utilizing a pipelined outline. The pipelined architecture is used which minimizes the latency and increases the speed and in the new innovation the 2 stage pipelining which works on the positive edge and as well as in the negative edge minimizes the latency and increases the speed and also reduce the stalling in instruction. The whole architecture of RISC processor work on the 2 cycle.the fixed size of instruction allows the given instruction to be easily piped. RISC processor has a flexible architecture. In [2] the second approach clock gating technique is used to minimize the Power. This is a most Popular method to reduce the dynamic Power consumption. Power is devoured by the combinational rationale whose qualities are changing on each one clock edge so the gating rationale comes into the Picture and clock is turned off. In the present research work, the outline of 16 bit RISC processor is introduced implemented for high efficiency and low power. The architecture supports 33 instructions. The instruction cycle consist of 2 stage pipelining and perform fetch, decode, execute, write back operation simultaneously. The control unit Generate signals from the given instructions. The architecture NCICT-2016 Special Issue 2 Page 168

3 supports arithmetic, logical, shifting and rotation operations. In [3] this approach proposed RISC processor based on MIPS designed here is an effort towards efficient processor suitable for various applications. CISC processors have received the marketplace over the years. They support various addressing modes and various data types like others complex processors. Length of instruction varies from instruction to instruction. They generally access data from external memory. They are basically implemented using micro programmed control. There is little gap between instructions of CISC processor and higher-level language statements. However, they may save memory space, its design is complicated and instructions are variable in length; special hardware is required for boundary marking of instruction. After an deep study it is proved that simple instructions has been used 80% of the time and complex instructions has been replaced by group of simple instructions. In [4] this approach a lot many new amount of Processor's are also into the market. Out of all these processor's a few of them were designed using processor cores i.e. Hardware Description Languages like Verilog-HDL and VHDL ( Very High Speed Integrated Circuit Hardware Description Language ), is used for writing a particular version of processor. This helps the designer to use them in any of the embedded applications. These can be used in the processor just by embedding a particular application in the processor. RISC (Reduced Instruction Set Computer) is an efficient Computer Architecture which can be used for the Low power and high speed applications of the processor RISC Processors are important in application of pipelining. The curb of the processor is the Instruction Set Architecture used for developing it. The total worthiness of the processor depends on utilizing the Instruction Set Architecture. Instruction Set Architecture is a metaphysical interface between Low level system of the machine and the hardware, that contain all the information about the machine, required to However a lot of research is being carried out in the field of processor's to satisfy the performance issues. But now a days it is mandatory to use a machine which is efficient in the terms of speed, power, performance and size. Though there are tradeoffs between all the performance parameter's, Research is being carried out to satisfy all the above performance parameters. Though a more number of instruction sets are available in the market, an instruction set which can handle less power, high speed low area needs to be selected for the efficient functioning. To satisfy all the above requirements we consider the MIPS (Microprocessor without Interlocked Pipelining Stages) Instruction Set Architecture. This paper also concentrates on reducing the power utilized by the processor in order to satisfy the Low Power constraint of the developed Processor. III. PROBLEM STATEMENT The RISC architecture boost the computer speed also used in control algorithms.using of RISC Processor the time required to execute each instruction can be shortened and the number of cycles reduces the RISC processor consists of the block mainly ALU, Universal shift register and Barrel Shifter. We have used modified Harvard architecture that uses separate memories for its instruction & data memory. The processor is the Instruction Set Architecture used for developing it. The total worthiness of the processor. Depends on utilizing the Instruction Set Architecture. However a lot of research is being carried out in the field of processor's to satisfy the performance issues. But now a days it is mandatory to use a machine which is efficient in the terms of speed, power, performance and size. Though there are tradeoffs between all the performance parameter's, Research is being carried out to satisfy all the above performance parameters. IV. OBJECTIVE Through this approach a lot many new amount of Processor's are also into the market. Out of all these processor's a few of them were designed using processor cores i.e. Hardware Description Languages like Verilog-HDL and VHDL ( Very High Speed Integrated Circuit Hardware Description Language ), is used for writing a particular version of processor. This helps the designer to use them in any of the embedded applications. These can be used in the processor just by embedding a particular application in the processor. RISC (Reduced Instruction Set Computer) is an efficient Computer Architecture which can be used for the Low power and high speed applications of the processor RISC Processors are important in application of pipelining. The curb of the processor is the Instruction Set Architecture used for developing it. The total worthiness of the processor depends on utilizing the Instruction Set Architecture. Instruction Set Architecture is a NCICT-2016 Special Issue 2 Page 169

4 metaphysical interface between Low level system of the machine and the hardware, that contain all the information about the machine, required to However a lot of research is being carried out in the field of processor's to satisfy the performance issues. 1) To develop and implement RISC processor algorithm, with high efficiency and less power consumption. 2) To maximize the performance and operations of a processors for many application. 3) Software scheduling and optimizing compilers of RISC architectures try to maximize the synergy between hardware and software 4) Develop with a suitable technique and dump it on FPGA board. 5) To carry out study the RICS processors for many further application V. METHODOLOGY In the present research work, the outline of 64 bit RISC processor is introduced; implemented for high efficiency and low power. The architecture supports minimum 35 instructions. The instruction cycle consist of single stage pipelining and perform fetch, decode, execute, write back operation simultaneously. The control unit Generate signals from the given instructions. The architecture supports arithmetic, logical, shifting and rotation operations. On the other hand, RISC processor requires very few data types and performs the simple operations. It also supports very few addressing modes and mostly based on registers. Many of the instructions operate on data which are present in internal registers. LOAD and STORE instructions are the only instructions which access data from external memory. Here decoding becomes easier, since the instruction length is fixed. Execution of instructions in parallel using the pipelined stages will improves the overall throughput of the processor but it will introduces some of the hazards in its working operation. Data hazards are those hazards which are generated due to sharing of source and destination resources in succeeding instructions, this will happen in the case when the source for an instruction is destination for previous instruction. This can be prevented by using the forwarding method. Structural hazards are generated when the program and data memory is used commonly. By designing the prefetch queue processor, structural hazards can be removed. Control hazards are generated in non-sequential executed circuits and to remove this hazards flushing method is used. RISC architecture s analysis gives many important issues in computer architecture. Most RISC computers have the same key elements: A limited and simple instruction set. A large number of GPR s (General Purpose Registers). Optimization of the instruction pipeline. Organization of RISC computers can be represented by three main components: ALU (Arithmetic Logic Unit): performs the actual computation and processing of data. CU (Control Unit): controls the movement of data and instructions into and out of the CPU and controls the operation of the ALU. RS (Register Set): a minimal internal memory, which consists of a set of storage location. FLOATING POINT UNIT: FPGA structures are optimized for fixed point computations. Implementation of floating point (FP) arithmetic requires large amount of resources, so maximum clock frequency becomes relatively low.to overcome this in part, efficient implementations of FP arithmetic have been designed, for example, sequential and pipelined divider and square root block [14], logarithm and exponential function [8], and a set of specific libraries [9]. The ALU of the CPU mentioned in the previous section handles integer numbers only. REALs in the CPD. tool are single precision floating point numbers of IEEE 754 standard. Apart from basic operations like addition, subtraction, multiplication and division, the controller FPU also performs comparisons (equal, more than, etc.), integer to floating point two-way conversion with rounding and truncating, determination of absolute value, sign inversion. Multiplication of two floating point numbers is executed by multiplication of mantissas and addition of exponents. Simplified architecture of the floating point multiplier is shown in Fig. 1. Normalization block on the right side is a simple shift register, which conditionally right-shifts the result of mantissas multiplication. The result is rounded using round to nearest even mode. Rounding requires three additional bits determined in A3 block, i.e. guard, round and sticky bits. Four versions of the multiplier have been investigated. They differ in the design of fixed point multiplier block A3, whose structure has essential influence on final implementation parameters, such as FPGA resource requirement and calculation speed. Standard serial paper-and-pencil multiplication algorithm as well as parallel, pipelined, fast array multipliers with or without FPGA embedded NCICT-2016 Special Issue 2 Page 170

5 hardware multipliers have been examined. Performance tests of the multiplier versions and implementation parameter evaluations have shown that the paper-and-pencil algorithm is a good compromise between speed and resources requirements. Therefore, this version has been chosen for the FPU floating point multiplier. Similar evaluations have been carried out for other components of the FPU before final implementation. In this project the architecture of the proposed RISC CPU is a uniform 64- bit instruction format, single cycle non-pipelined processor. It has a load/store architecture, where the operations will only be performed on registers, and not on memory locations. It follows the classical von-neumann architecture with just one common memory bus for both instructions and data. A total of 35 instructions are designed as a first step in the process of development of the processor. The instruction set consists of Logical, Immediate, Jump, Load, store and HALT type of instructions. The Halt instruction acts as a border line between the instruction and data memory. This offers the flexibility to the programmer, who uses this processor core to define their own instruction and data memory within the allotted 128 memory registers. Each of the register is of 64-bits width capacity. Program Counter: The Program Counter (PC) is a 64-bit latch that holds the memory address of location, from which the next machine language instruction will be fetched by the processor. It is a 12-bit pointer to indicate the instruction memory. It additionally uses a 12-bit pointer to point to the data memory, which will be used only when a Load/Store instruction is encountered for execution. Fig. 1. Architecture of floating point multiplier. Pre-fetching: The process of fetching next instruction or instructions into an event queue before the current instruction is complete is called pre-fetching. The earliest 64-bit microprocessor, the Intel 8086/8, pre-fetches into a non-board queue up to six bytes following the byte currently being executed thereby making them immediately available for decoding and execution, without latency. Pipelining: Pipelining instructions means starting or issuing an instruction prior to the completion of the currently executing one. The current generation of machines carries this to a considerable extent. The PowerPC 601 has 20 separate pipeline stages in which various portions of various instructions are executing simultaneously. VI. WORKING Arithmetic and Logic unit: The arithmetic and logic unit (ALU) performs arithmetic and logic operations. It also performs the bit operations such as rotate and shift by a defined number of bit positions. The proposed ALU contains three submodules, viz. arithmetic, logic and shift modules. The arithmetic unit involves the execution of addition and multiplication operations and generates Sign flag and Zero flag. The shift module is used for executing instructions such as rotation and shift operations [9]. Register File: The register file consists of 8 general purpose registers of 64-bits capacity each. These register files are utilized during the execution of arithmetic and data-centric instructions. The load instruction is used to load the values into the registers and store instruction is used to retrieve the values back to the memory to obtain the processed outputs back from the processor. Instruction fetch unit: The function of the instruction fetch unit is to obtain an NCICT-2016 Special Issue 2 Page 171

6 instruction from the instruction memory using the current value of the PC and increment the PC value for the next instruction as shown in Figure 2 Since this design uses an 64-bit data width we had to implement byte addressing to access the registers and word address to access the instruction memory. The instruction fetch component contains the following logic elements that are implemented in VHDL 64-bit program counter (PC) register, an adder to increment the PC by four, the instruction memory, a multiplexor, and an AND gate used to select the value of the next PC. Architecture of 64 bit RlSC Processor: Fig.2 Architecture of 64 bit RlSC Processor Floating Point Unit: A floating point (FPU), also known as a math co-processor or numeric processor is a specialized co-processor that manipulates numbers more quickly than the basic microprocessor circuitry. The FPU does this by means of instructions that focus entirely on large mathematical operations. This FPGA implementation of 64-bit double precision floating point has been shown which performs certain operations like addition, subtraction, multiplication and division. This kind of unit can be tremendously useful in the FPGA implementation of complex systems that benefits from the parallelism of the FPGA device [9]. FP Add: In the module FP Add, the inputs operands are separated into their mantissa and exponent components. Then the exponents are compared to check which variable is larger. The larger variable goes into "mantissa Jarge" and exponent_large". Similarly the smaller variable goes into "mantissa_small" and "exponent_small". The sign and exponent of the output will be determined the smaller exponent can be right shifted before performing the addition. FP Sub: The input variables are separated into two components namely mantissa and exponent. Subtraction is similar to that of addition such that the mantissa of the smaller exponent is shifted to the right before performing the subtraction [10]. FP Mul: Multiplying all 53 bits of varl by 53 bits of var2 would result in a 106-bit product. 53 bit by 53 bit multipliers are not available in the Altera FPGAs, so the multiply would be broken down into smaller multiplies and the results would be added together to give the final 106-bit product. The module (FP Mul) breaks up the multiply which can perform 24-bit by 17-bit. FP Div: Division is performed in FP Div. The exponent is obtained by adding 1023 with the exponent of varl and then by subtracting the exponent of var2 from this sum. Then, the mantissa of varl is the dividend and the mantissa of var2 is the divisor. F. Memory Unit: The load and store instructions are used to access this module. Finally, the memory access stage is where, if necessary, system memory is accessed for data. Also if a write to the data memory is required by the instruction it is done in this stage. In order to avoid additional complications it is assumed that a single read or write is accomplished within a single CPU clock cycle. G. Instruction Set: The instruction set used in this architecture consists of arithmetic, logical, memory and branch instructions. It will have short (8-bit) and long (16-bit) instructions, which are shown in Table 1. For all arithmetic & logical operations, 8- bit instructions are used. For all memory transactions and jump instructions, 16-bit instructions are used. It will have special instructions to access external ports. The architecture will also have 64-bit general purpose NCICT-2016 Special Issue 2 Page 172

7 registers that can be used in all operations. For all the jump instruction, the processor architecture will automatically flush the data in the pipeline, so as to avoid any misbehavior. Fig.3 Clock Pulses of Low Power Unit VII. SIMULATION RESULTS TABLE I. INSTRUCTION SET Short Instruction Format: Long Instruction Format: H Low Power Technique :There are several different RTL and gate-level design strategies for reducing power. In the present work, Clock Gating design is used for reducing dynamic power. In this method, clock is applied to only the modules that are working at that instant [11]. Clock gating is a dynamic power reduction method in which the clock signals are stopped for selected registers banks during the time when the stored logic values are not changing. The clock pulse for low power technique is shown in Fig. 3. The input to low power unit is global clock and its output is gated clock, since the module will block the main clock in the following conditions. 1. When instruction is halt. 2. When there is a continuous Nop operation. 3. When program counter fails to increment. Fig. 4 Simulation Waveform of Double Precision Floating Point Fig. 5 Simulation Waveforms of 64-bit RlSC Processor VIII. CONCLUSION In this paper, we proposed an FPGA based low power pipelined 64-bit RISC processor with Double Precision Floating Point is designed. Modelsim is used to verify the simulation results. The design is implemented on Vertex 5 FPGA on which Arithmetic, Branch operations and Logical NCICT-2016 Special Issue 2 Page 173

8 functions are verified. Pipelining would not flush when branch instruction occurs as it is implemented using dynamic branch prediction. Branch predictions will increase flow in instruction pipeline and achieve high effective performance. The proposed architecture is able to prevent pipeline to multiple executions with a single instruction. Whenever the processor enters in sleep mode, then it disables the clock enable signal so this saves some power by using low power technique. The proposed design can access more data processing for data intensive applications like packet processing. This 64-bit RISC processor consumes only 1 instruction, whereas 32-bit RISC processor needs more than 1 instruction. The Carry select adder structures are employed verified through exhaustive simulation and lower power dissipation and it can increases the speed. It is expandable up to 35 instructions. Clock gating technique disabling the portion which is not required so the flip flop don t change their state and save the power. This processor with floating point operations is used in many applications like Signal processing, Graphics and Medical equipments, mathematical computation. It is also able to apply any control algorithm. REFERENCES [I] Preetam Bhosle, Hari Krishna Moorthy,"FPGA Implementation of Low Power Pipelined 32-bit RlSC Processor", Proceedings of International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: , Vol-I, Issue-3, August [2] Galani Tina G,Riya Saini and R.D.Daruwala,"Design and Implementation of 32-bit RlSC Processor using Xilinx",lnternational Journal of Emerging Trends in Electrical and Electronics(IJETEE),ISNN: ,Vol- 5,lssue I,July [3] Imran Mohammad, Ramananjaneyulu, "FPGA Implementation of a 64-bit RlSC Processor Using VHDL", Proceedings of International Journal of Reconfigurable and Embedded Systems(IJRES),ISSN: ,Vol-l, No.2, July [4] Aboobacker Sidheeq.V.M,"Four Stage Pipelined 16 bit RlSC on Xilinx Sparatn 3AN FPGA", Proceedings of International Journal of Computer Applications, ISNN: , Vol-48, June [5] Tashfia.Afreen, Minhaz. Uddin Md Ikram, Aqib. AI Azad, and Iqbalur Rahman Rokon," Efficient FPGA Implementation of Double Precision Floating Point Unit Using Verilog HDL", International Conference on Innovations in Electrical and Electronics Engineering (ICIEE'20 12),October 20 12,Dubai (UAE). [6] Addanki Purna Ramesh,Ch.Pradeep,"FPGA Based Implementation of Double Precision Floating point AdderlSubtarctor Using Verilog", Proceedings of International Journal of Emerging Technology and Advanced EngineeringISSN ,Vol-2,lssue 7,July [7] Indu, Arun Kumar, Design of Low Power Pipelined RISC Processor, International Journal of Advanced Research in Electrical & electronics & instrumentation Engineering, vol.2, no.3, pp , August [8] Priyanka Trivedi, Rajan Prasad Tripathi low Power pipelined RISC processor: A Review, IJSRD vol.2, no.4, pp , July [9] Jagrit Kathuria, M. Ayoubkhan, Arti Noor, A Review of Clock Gating Techniques, MIT International Journal of Electronics and Communication Engineering vol 1,no. 2, August [10] B. Ramkumar and Harish M Kittur Low- Power and Area- Efficient Carry Select Adder IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 20, no.2, 2012, pp [11] J.Ravindra, T.Anuradha,"Design of Low Power RlSC Processor by Applying Clock gating Technique", International Journal of Engineering Research and Applications, ISSN , Vol-2, Issue-3, May-Jun [12] R.uma, Design and Performance analysis of 8 bit RISC Processor Using Xilinx Tool, International Journal of Engineering Research and Application, vol.2, no.2,pp , April [13] Samiappa Sakthikumaran,S. Salivahanan,V.S,kanchan. Bhaaskaran, 16 bit RISC Processor Design For Convolution Application IEEE - International. Conference on Recent Trends in Information technology pp , June [14] Li Li and Ken Choi SeSCG: Selective Sequential Clock Gating for Ultra - low-power Multimedia Mobile Processor Design, IEEE EIT Conference, May NCICT-2016 Special Issue 2 Page 174

9 [15] Arora H,Gupta A,Singhai,R, Purwar D, Design space Exploration of RISC Architecture Using Retargetability IEEE, pp. 1-3,Jan [16] Geun-young Jeong, Ju-sung Park, Design of 32-BIT RISC Processor and Efficient Verification Proceeding -of the 7 th Korea - Russia International Symposium, Korws,2003. [17] Hai Li, Swarup Bhunia, Yiran Chen, Kaushik Roy, DCG: Deterministic Clock - Gating for Low - Power Microprocessor Design, IEEE Trans. On VLSI Systems, vol. 12, no. 3, March [18] Kumar J, v Nagaraju B,swapana C, Ramanjappa T, Deisgn & Development of FPGA based low Power Pipelined 64 Bit RISC Processor with double Precision Floating Point unit, IEEE, pp NCICT-2016 Special Issue 2 Page 175

Design & Analysis of 16 bit RISC Processor Using low Power Pipelining

Design & Analysis of 16 bit RISC Processor Using low Power Pipelining International OPEN ACCESS Journal ISSN: 2249-6645 Of Modern Engineering Research (IJMER) Design & Analysis of 16 bit RISC Processor Using low Power Pipelining Yedla Venkanna 148R1D5710 Branch: VLSI ABSTRACT:-

More information

FPGA Based Implementation of Pipelined 32-bit RISC Processor with Floating Point Unit

FPGA Based Implementation of Pipelined 32-bit RISC Processor with Floating Point Unit RESEARCH ARTICLE OPEN ACCESS FPGA Based Implementation of Pipelined 32-bit RISC Processor with Floating Point Unit Jinde Vijay Kumar 1, Chintakunta Swapna 2, Boya Nagaraju 3, Thogata Ramanjappa 4 1,2Research

More information

AN INNOVATIVE APPROACH FOR ASYNCHRONOUS MICROPROCESSOR DESIGN BASED ON FPGA

AN INNOVATIVE APPROACH FOR ASYNCHRONOUS MICROPROCESSOR DESIGN BASED ON FPGA AN INNOVATIVE APPROACH FOR ASYNCHRONOUS MICROPROCESSOR DESIGN BASED ON FPGA Archana Rani 1, Dr. Naresh Grover 2 Faculty of Engineering and Technology, Manav Rachna International University, Faridabad,

More information

Novel Design of Dual Core RISC Architecture Implementation

Novel Design of Dual Core RISC Architecture Implementation Journal From the SelectedWorks of Kirat Pal Singh Spring May 18, 2015 Novel Design of Dual Core RISC Architecture Implementation Akshatha Rai K, VTU University, MITE, Moodbidri, Karnataka Basavaraj H J,

More information

Novel Design of 32-bit Asynchronous (RISC) Microprocessor & its Implementation on FPGA

Novel Design of 32-bit Asynchronous (RISC) Microprocessor & its Implementation on FPGA I.J. Information Engineering and Electronic Business, 2018, 1, 39-47 Published Online January 2018 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijieeb.2018.01.06 Novel Design of 32-bit Asynchronous

More information

International Journal Of Global Innovations -Vol.6, Issue.II Paper Id: SP-V6-I1-P01 ISSN Online:

International Journal Of Global Innovations -Vol.6, Issue.II Paper Id: SP-V6-I1-P01 ISSN Online: IMPLEMENTATION OF LOW POWER PIPELINED 64-BIT RISC PROCESSOR WITH DOUBLE PRECISION FLOATING POINT UNIT #1 CH.RAVALI, M.Tech student, #2 T.S.GAGANDEEP, Assistant Professor, Dept of ECE, DRK INSTITUTE OF

More information

Design of a Pipelined 32 Bit MIPS Processor with Floating Point Unit

Design of a Pipelined 32 Bit MIPS Processor with Floating Point Unit Design of a Pipelined 32 Bit MIPS Processor with Floating Point Unit P Ajith Kumar 1, M Vijaya Lakshmi 2 P.G. Student, Department of Electronics and Communication Engineering, St.Martin s Engineering College,

More information

Implimentation of A 16-bit RISC Processor for Convolution Application

Implimentation of A 16-bit RISC Processor for Convolution Application Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 4, Number 5 (2014), pp. 441-446 Research India Publications http://www.ripublication.com/aeee.htm Implimentation of A 16-bit RISC

More information

Implementation of RISC Processor for Convolution Application

Implementation of RISC Processor for Convolution Application Implementation of RISC Processor for Convolution Application P.Siva Nagendra Reddy 1, A.G.Murali Krishna 2 1 P.G. Scholar (M. Tech), Dept. of ECE, Intell Engineering College, Anantapur, A.P, India 2 Asst.Professor,

More information

ASSEMBLY LANGUAGE MACHINE ORGANIZATION

ASSEMBLY LANGUAGE MACHINE ORGANIZATION ASSEMBLY LANGUAGE MACHINE ORGANIZATION CHAPTER 3 1 Sub-topics The topic will cover: Microprocessor architecture CPU processing methods Pipelining Superscalar RISC Multiprocessing Instruction Cycle Instruction

More information

DESIGN OF HIGH PERFORMANCE LOW POWER 32 BIT RISC PROCESSOR

DESIGN OF HIGH PERFORMANCE LOW POWER 32 BIT RISC PROCESSOR DESIGN OF HIGH PERFORMANCE LOW POWER 32 BIT RISC PROCESSOR K. Maneesh 1,A.Uday Kumar 2 1 PG Student, Dept. of ECE, SVCET, Srikakulam, AP, (India) 2 Associate Professor,Dept. of ECE, SVCET, Srikakulam,

More information

Embedded Soc using High Performance Arm Core Processor D.sridhar raja Assistant professor, Dept. of E&I, Bharath university, Chennai

Embedded Soc using High Performance Arm Core Processor D.sridhar raja Assistant professor, Dept. of E&I, Bharath university, Chennai Embedded Soc using High Performance Arm Core Processor D.sridhar raja Assistant professor, Dept. of E&I, Bharath university, Chennai Abstract: ARM is one of the most licensed and thus widespread processor

More information

Design and Implementation of 5 Stages Pipelined Architecture in 32 Bit RISC Processor

Design and Implementation of 5 Stages Pipelined Architecture in 32 Bit RISC Processor Design and Implementation of 5 Stages Pipelined Architecture in 32 Bit RISC Processor Abstract The proposed work is the design of a 32 bit RISC (Reduced Instruction Set Computer) processor. The design

More information

An FPGA Implementation of 8-bit RISC Microcontroller

An FPGA Implementation of 8-bit RISC Microcontroller An FPGA Implementation of 8-bit RISC Microcontroller Krishna Kumar mishra 1, Vaibhav Purwar 2, Pankaj Singh 3 1 M.Tech scholar, Department of Electronic and Communication Engineering Kanpur Institute of

More information

Design and Simulation of Pipelined Double Precision Floating Point Adder/Subtractor and Multiplier Using Verilog

Design and Simulation of Pipelined Double Precision Floating Point Adder/Subtractor and Multiplier Using Verilog Design and Simulation of Pipelined Double Precision Floating Point Adder/Subtractor and Multiplier Using Verilog Onkar Singh (1) Kanika Sharma (2) Dept. ECE, Arni University, HP (1) Dept. ECE, NITTTR Chandigarh

More information

ISSN Vol.02, Issue.11, December-2014, Pages:

ISSN Vol.02, Issue.11, December-2014, Pages: ISSN 2322-0929 Vol.02, Issue.11, December-2014, Pages:1208-1212 www.ijvdcs.org Implementation of Area Optimized Floating Point Unit using Verilog G.RAJA SEKHAR 1, M.SRIHARI 2 1 PG Scholar, Dept of ECE,

More information

REALIZATION OF AN 8-BIT PROCESSOR USING XILINX

REALIZATION OF AN 8-BIT PROCESSOR USING XILINX REALIZATION OF AN 8-BIT PROCESSOR USING XILINX T.Deepa M.E (Applied Electronics) Department of Electronics and Communication Engineering, Sri Venkateswara College of Engineering, Sriperumbudur, Chennai,

More information

Design and Implementation of a Super Scalar DLX based Microprocessor

Design and Implementation of a Super Scalar DLX based Microprocessor Design and Implementation of a Super Scalar DLX based Microprocessor 2 DLX Architecture As mentioned above, the Kishon is based on the original DLX as studies in (Hennessy & Patterson, 1996). By: Amnon

More information

Prachi Sharma 1, Rama Laxmi 2, Arun Kumar Mishra 3 1 Student, 2,3 Assistant Professor, EC Department, Bhabha College of Engineering

Prachi Sharma 1, Rama Laxmi 2, Arun Kumar Mishra 3 1 Student, 2,3 Assistant Professor, EC Department, Bhabha College of Engineering A Review: Design of 16 bit Arithmetic and Logical unit using Vivado 14.7 and Implementation on Basys 3 FPGA Board Prachi Sharma 1, Rama Laxmi 2, Arun Kumar Mishra 3 1 Student, 2,3 Assistant Professor,

More information

VLSI DESIGN OF REDUCED INSTRUCTION SET COMPUTER PROCESSOR CORE USING VHDL

VLSI DESIGN OF REDUCED INSTRUCTION SET COMPUTER PROCESSOR CORE USING VHDL International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 (Spl.) Sep 2012 42-47 TJPRC Pvt. Ltd., VLSI DESIGN OF

More information

FPGA IMPLEMENTATION OF FLOATING POINT ADDER AND MULTIPLIER UNDER ROUND TO NEAREST

FPGA IMPLEMENTATION OF FLOATING POINT ADDER AND MULTIPLIER UNDER ROUND TO NEAREST FPGA IMPLEMENTATION OF FLOATING POINT ADDER AND MULTIPLIER UNDER ROUND TO NEAREST SAKTHIVEL Assistant Professor, Department of ECE, Coimbatore Institute of Engineering and Technology Abstract- FPGA is

More information

CHAPTER 5 A Closer Look at Instruction Set Architectures

CHAPTER 5 A Closer Look at Instruction Set Architectures CHAPTER 5 A Closer Look at Instruction Set Architectures 5.1 Introduction 199 5.2 Instruction Formats 199 5.2.1 Design Decisions for Instruction Sets 200 5.2.2 Little versus Big Endian 201 5.2.3 Internal

More information

VHDL implementation of 32-bit floating point unit (FPU)

VHDL implementation of 32-bit floating point unit (FPU) VHDL implementation of 32-bit floating point unit (FPU) Nikhil Arora Govindam Sharma Sachin Kumar M.Tech student M.Tech student M.Tech student YMCA, Faridabad YMCA, Faridabad YMCA, Faridabad Abstract The

More information

VHDL Design and Implementation of ASIC Processor Core by Using MIPS Pipelining

VHDL Design and Implementation of ASIC Processor Core by Using MIPS Pipelining Journal From the SelectedWorks of Journal April, 2014 VHDL Design and Implementation of ASIC Processor Core by Using MIPS Pipelining G. Triveni Aswini Kumar Gadige This work is licensed under a Creative

More information

CHAPTER 5 A Closer Look at Instruction Set Architectures

CHAPTER 5 A Closer Look at Instruction Set Architectures CHAPTER 5 A Closer Look at Instruction Set Architectures 5.1 Introduction 293 5.2 Instruction Formats 293 5.2.1 Design Decisions for Instruction Sets 294 5.2.2 Little versus Big Endian 295 5.2.3 Internal

More information

Design of 16-bit RISC Processor Supraj Gaonkar 1, Anitha M. 2

Design of 16-bit RISC Processor Supraj Gaonkar 1, Anitha M. 2 Design of 16-bit RISC Processor Supraj Gaonkar 1, Anitha M. 2 1 M.Tech student, Sir M Visvesvaraya Institute of Technology Bangalore. Karnataka, India 2 Associate Professor Department of Telecommunication

More information

Implementation of Low Power Pipelined 64-bit RISC Processor with Unbiased FPU on CPLD

Implementation of Low Power Pipelined 64-bit RISC Processor with Unbiased FPU on CPLD Indian Journal of Science and Technology, Vol 9(33), DOI: 0.7485/ijst/06/v9i33/8985, September 06 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Implementation of Low Power Pipelined 64-bit RISC Processor

More information

Functional Verification of Enhanced RISC Processor

Functional Verification of Enhanced RISC Processor Functional Verification of Enhanced RISC Processor SHANKER NILANGI 1 1 Assistant Professor, Dept of ECE, Bheemanna Khandre Institute of Technology, Bhalki, Karnataka, India s.nilangi@gmail.com 1 SOWMYA

More information

Advanced Computer Architecture

Advanced Computer Architecture Advanced Computer Architecture Chapter 1 Introduction into the Sequential and Pipeline Instruction Execution Martin Milata What is a Processors Architecture Instruction Set Architecture (ISA) Describes

More information

INTEL Architectures GOPALAKRISHNAN IYER FALL 2009 ELEC : Computer Architecture and Design

INTEL Architectures GOPALAKRISHNAN IYER FALL 2009 ELEC : Computer Architecture and Design INTEL Architectures GOPALAKRISHNAN IYER FALL 2009 GBI0001@AUBURN.EDU ELEC 6200-001: Computer Architecture and Design Silicon Technology Moore s law Moore's Law describes a long-term trend in the history

More information

Design, Analysis and Processing of Efficient RISC Processor

Design, Analysis and Processing of Efficient RISC Processor Design, Analysis and Processing of Efficient RISC Processor Ramareddy 1, M.N.Pradeep 2 1M-Tech., VLSI D& Embedded Systems, Dept of E&CE, Dayananda Sagar College of Engineering, Bangalore. Karnataka, India

More information

Implementation of IEEE754 Floating Point Multiplier

Implementation of IEEE754 Floating Point Multiplier Implementation of IEEE754 Floating Point Multiplier A Kumutha 1 Shobha. P 2 1 MVJ College of Engineering, Near ITPB, Channasandra, Bangalore-67. 2 MVJ College of Engineering, Near ITPB, Channasandra, Bangalore-67.

More information

Understand the factors involved in instruction set

Understand the factors involved in instruction set A Closer Look at Instruction Set Architectures Objectives Understand the factors involved in instruction set architecture design. Look at different instruction formats, operand types, and memory access

More information

[Sahu* et al., 5(7): July, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

[Sahu* et al., 5(7): July, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY SPAA AWARE ERROR TOLERANT 32 BIT ARITHMETIC AND LOGICAL UNIT FOR GRAPHICS PROCESSOR UNIT Kaushal Kumar Sahu*, Nitin Jain Department

More information

Advanced processor designs

Advanced processor designs Advanced processor designs We ve only scratched the surface of CPU design. Today we ll briefly introduce some of the big ideas and big words behind modern processors by looking at two example CPUs. The

More information

Design and Implementation of IEEE-754 Decimal Floating Point Adder, Subtractor and Multiplier

Design and Implementation of IEEE-754 Decimal Floating Point Adder, Subtractor and Multiplier International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249 8958, Volume-4 Issue 1, October 2014 Design and Implementation of IEEE-754 Decimal Floating Point Adder, Subtractor and Multiplier

More information

FPGA Implementation of MIPS RISC Processor

FPGA Implementation of MIPS RISC Processor FPGA Implementation of MIPS RISC Processor S. Suresh 1 and R. Ganesh 2 1 CVR College of Engineering/PG Student, Hyderabad, India 2 CVR College of Engineering/ECE Department, Hyderabad, India Abstract The

More information

DC57 COMPUTER ORGANIZATION JUNE 2013

DC57 COMPUTER ORGANIZATION JUNE 2013 Q2 (a) How do various factors like Hardware design, Instruction set, Compiler related to the performance of a computer? The most important measure of a computer is how quickly it can execute programs.

More information

Module 5 - CPU Design

Module 5 - CPU Design Module 5 - CPU Design Lecture 1 - Introduction to CPU The operation or task that must perform by CPU is: Fetch Instruction: The CPU reads an instruction from memory. Interpret Instruction: The instruction

More information

Application of Power-Management Techniques for Low Power Processor Design

Application of Power-Management Techniques for Low Power Processor Design 1 Application of Power-Management Techniques for Low Power Processor Design Sivaram Gopalakrishnan, Chris Condrat, Elaine Ly Department of Electrical and Computer Engineering, University of Utah, UT 84112

More information

VHDL Implementation of a MIPS-32 Pipeline Processor

VHDL Implementation of a MIPS-32 Pipeline Processor Journal From the SelectedWorks of Kirat Pal Singh Winter November 9, 2012 VHDL Implementation of a MIPS-32 Pipeline Processor Kirat Pal Singh Shivani Parmar This work is licensed under a Creative Commons

More information

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor Introduction CPU performance factors Instruction count Determined by ISA and compiler CPI and Cycle

More information

The von Neumann Architecture. IT 3123 Hardware and Software Concepts. The Instruction Cycle. Registers. LMC Executes a Store.

The von Neumann Architecture. IT 3123 Hardware and Software Concepts. The Instruction Cycle. Registers. LMC Executes a Store. IT 3123 Hardware and Software Concepts February 11 and Memory II Copyright 2005 by Bob Brown The von Neumann Architecture 00 01 02 03 PC IR Control Unit Command Memory ALU 96 97 98 99 Notice: This session

More information

Math 230 Assembly Programming (AKA Computer Organization) Spring MIPS Intro

Math 230 Assembly Programming (AKA Computer Organization) Spring MIPS Intro Math 230 Assembly Programming (AKA Computer Organization) Spring 2008 MIPS Intro Adapted from slides developed for: Mary J. Irwin PSU CSE331 Dave Patterson s UCB CS152 M230 L09.1 Smith Spring 2008 MIPS

More information

VHDL IMPLEMENTATION OF IEEE 754 FLOATING POINT UNIT

VHDL IMPLEMENTATION OF IEEE 754 FLOATING POINT UNIT VHDL IMPLEMENTATION OF IEEE 754 FLOATING POINT UNIT Ms. Anjana Sasidharan Student, Vivekanandha College of Engineering for Women, Namakkal, Tamilnadu, India. Abstract IEEE-754 specifies interchange and

More information

ARCHITECTURAL DESIGN OF 8 BIT FLOATING POINT MULTIPLICATION UNIT

ARCHITECTURAL DESIGN OF 8 BIT FLOATING POINT MULTIPLICATION UNIT ARCHITECTURAL DESIGN OF 8 BIT FLOATING POINT MULTIPLICATION UNIT Usha S. 1 and Vijaya Kumar V. 2 1 VLSI Design, Sathyabama University, Chennai, India 2 Department of Electronics and Communication Engineering,

More information

Chapter 5. A Closer Look at Instruction Set Architectures

Chapter 5. A Closer Look at Instruction Set Architectures Chapter 5 A Closer Look at Instruction Set Architectures Chapter 5 Objectives Understand the factors involved in instruction set architecture design. Gain familiarity with memory addressing modes. Understand

More information

UNIT-II. Part-2: CENTRAL PROCESSING UNIT

UNIT-II. Part-2: CENTRAL PROCESSING UNIT Page1 UNIT-II Part-2: CENTRAL PROCESSING UNIT Stack Organization Instruction Formats Addressing Modes Data Transfer And Manipulation Program Control Reduced Instruction Set Computer (RISC) Introduction:

More information

Electronics Engineering, DBACER, Nagpur, Maharashtra, India 5. Electronics Engineering, RGCER, Nagpur, Maharashtra, India.

Electronics Engineering, DBACER, Nagpur, Maharashtra, India 5. Electronics Engineering, RGCER, Nagpur, Maharashtra, India. Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Design and Implementation

More information

Hardware Implementation High Speed RISC Processor for Convolution

Hardware Implementation High Speed RISC Processor for Convolution Hardware Implementation High Speed RISC Processor for Convolution Mr. Prashant Bhirange Department of Electronics Engineering, S.D.C.O College of Engineering, Wardha Maharashtra, India prashantbhirange@gmail.com

More information

FPGA Implementation of A Pipelined MIPS Soft Core Processor

FPGA Implementation of A Pipelined MIPS Soft Core Processor FPGA Implementation of A Pipelined MIPS Soft Core Processor Lakshmi S.S 1, Chandrasekhar N.S 2 P.G. Student, Department of Electronics and Communication Engineering, DBIT, Bangalore, India 1 Assistant

More information

Processing Unit CS206T

Processing Unit CS206T Processing Unit CS206T Microprocessors The density of elements on processor chips continued to rise More and more elements were placed on each chip so that fewer and fewer chips were needed to construct

More information

A 32-Bit RISC Processor for Convolution Application

A 32-Bit RISC Processor for Convolution Application IJISET - International Journal of Innovative Science, Engineering & Technology, Vol. 3 Issue 7, July 2016 A 32-Bit RISC rocessor for Convolution Application 1 2 3 1 2 Mrs. Ussra Fatima, Dr. Baswaraj Gadgay

More information

Chapter 4. Instruction Execution. Introduction. CPU Overview. Multiplexers. Chapter 4 The Processor 1. The Processor.

Chapter 4. Instruction Execution. Introduction. CPU Overview. Multiplexers. Chapter 4 The Processor 1. The Processor. COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor The Processor - Introduction

More information

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 4. The Processor COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition Chapter 4 The Processor COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface 5 th Edition The Processor - Introduction

More information

Typical Processor Execution Cycle

Typical Processor Execution Cycle Typical Processor Execution Cycle Instruction Fetch Obtain instruction from program storage Instruction Decode Determine required actions and instruction size Operand Fetch Locate and obtain operand data

More information

EMBEDDED SYSTEM DESIGN (10EC74)

EMBEDDED SYSTEM DESIGN (10EC74) UNIT 2 The Hardware Side: An Introduction, The Core Level, Representing Information, Understanding Numbers, Addresses, Instructions, Registers-A First Look, Embedded Systems-An Instruction Set View, Embedded

More information

CS 101, Mock Computer Architecture

CS 101, Mock Computer Architecture CS 101, Mock Computer Architecture Computer organization and architecture refers to the actual hardware used to construct the computer, and the way that the hardware operates both physically and logically

More information

Instruction Set Architecture. "Speaking with the computer"

Instruction Set Architecture. Speaking with the computer Instruction Set Architecture "Speaking with the computer" The Instruction Set Architecture Application Compiler Instr. Set Proc. Operating System I/O system Instruction Set Architecture Digital Design

More information

ARM ARCHITECTURE. Contents at a glance:

ARM ARCHITECTURE. Contents at a glance: UNIT-III ARM ARCHITECTURE Contents at a glance: RISC Design Philosophy ARM Design Philosophy Registers Current Program Status Register(CPSR) Instruction Pipeline Interrupts and Vector Table Architecture

More information

Review on Floating Point Adder and Converter Units Using VHDL

Review on Floating Point Adder and Converter Units Using VHDL Review on Floating Point Adder and Converter Units Using VHDL Abhishek Kumar 1, Mayur S. Dhait 2 1 Research Scholar, Agnihotri College of Engineering, Nagthana Road, Wardha (M.S), India 2 Professor, Department

More information

Architectures & instruction sets R_B_T_C_. von Neumann architecture. Computer architecture taxonomy. Assembly language.

Architectures & instruction sets R_B_T_C_. von Neumann architecture. Computer architecture taxonomy. Assembly language. Architectures & instruction sets Computer architecture taxonomy. Assembly language. R_B_T_C_ 1. E E C E 2. I E U W 3. I S O O 4. E P O I von Neumann architecture Memory holds data and instructions. Central

More information

Keywords: Soft Core Processor, Arithmetic and Logical Unit, Back End Implementation and Front End Implementation.

Keywords: Soft Core Processor, Arithmetic and Logical Unit, Back End Implementation and Front End Implementation. ISSN 2319-8885 Vol.03,Issue.32 October-2014, Pages:6436-6440 www.ijsetr.com Design and Modeling of Arithmetic and Logical Unit with the Platform of VLSI N. AMRUTHA BINDU 1, M. SAILAJA 2 1 Dept of ECE,

More information

Chapter 5. A Closer Look at Instruction Set Architectures. Chapter 5 Objectives. 5.1 Introduction. 5.2 Instruction Formats

Chapter 5. A Closer Look at Instruction Set Architectures. Chapter 5 Objectives. 5.1 Introduction. 5.2 Instruction Formats Chapter 5 Objectives Understand the factors involved in instruction set architecture design. Chapter 5 A Closer Look at Instruction Set Architectures Gain familiarity with memory addressing modes. Understand

More information

Power Optimized Programmable Truncated Multiplier and Accumulator Using Reversible Adder

Power Optimized Programmable Truncated Multiplier and Accumulator Using Reversible Adder Power Optimized Programmable Truncated Multiplier and Accumulator Using Reversible Adder Syeda Mohtashima Siddiqui M.Tech (VLSI & Embedded Systems) Department of ECE G Pulla Reddy Engineering College (Autonomous)

More information

CS311 Lecture: Pipelining, Superscalar, and VLIW Architectures revised 10/18/07

CS311 Lecture: Pipelining, Superscalar, and VLIW Architectures revised 10/18/07 CS311 Lecture: Pipelining, Superscalar, and VLIW Architectures revised 10/18/07 Objectives ---------- 1. To introduce the basic concept of CPU speedup 2. To explain how data and branch hazards arise as

More information

COPROCESSOR APPROACH TO ACCELERATING MULTIMEDIA APPLICATION [CLAUDIO BRUNELLI, JARI NURMI ] Processor Design

COPROCESSOR APPROACH TO ACCELERATING MULTIMEDIA APPLICATION [CLAUDIO BRUNELLI, JARI NURMI ] Processor Design COPROCESSOR APPROACH TO ACCELERATING MULTIMEDIA APPLICATION [CLAUDIO BRUNELLI, JARI NURMI ] Processor Design Lecture Objectives Background Need for Accelerator Accelerators and different type of parallelizm

More information

Universität Dortmund. ARM Architecture

Universität Dortmund. ARM Architecture ARM Architecture The RISC Philosophy Original RISC design (e.g. MIPS) aims for high performance through o reduced number of instruction classes o large general-purpose register set o load-store architecture

More information

Chapter 5. A Closer Look at Instruction Set Architectures

Chapter 5. A Closer Look at Instruction Set Architectures Chapter 5 A Closer Look at Instruction Set Architectures Chapter 5 Objectives Understand the factors involved in instruction set architecture design. Gain familiarity with memory addressing modes. Understand

More information

Design of Low Power Pipelined RISC Processor

Design of Low Power Pipelined RISC Processor Design of Low Power Pipelined RISC Processor Indu.M 1, Arun Kumar.M 2 PG Student[VLSI Design and Embedded System],Dept of ECE, East Point College of Engineering and Technology,Bangalore,Karnataka,India

More information

Architecture and Design of Generic IEEE-754 Based Floating Point Adder, Subtractor and Multiplier

Architecture and Design of Generic IEEE-754 Based Floating Point Adder, Subtractor and Multiplier Architecture and Design of Generic IEEE-754 Based Floating Point Adder, Subtractor and Multiplier Sahdev D. Kanjariya VLSI & Embedded Systems Design Gujarat Technological University PG School Ahmedabad,

More information

RISC & Superscalar. COMP 212 Computer Organization & Architecture. COMP 212 Fall Lecture 12. Instruction Pipeline no hazard.

RISC & Superscalar. COMP 212 Computer Organization & Architecture. COMP 212 Fall Lecture 12. Instruction Pipeline no hazard. COMP 212 Computer Organization & Architecture Pipeline Re-Cap Pipeline is ILP -Instruction Level Parallelism COMP 212 Fall 2008 Lecture 12 RISC & Superscalar Divide instruction cycles into stages, overlapped

More information

ISSN: [Bilani* et al.,7(2): February, 2018] Impact Factor: 5.164

ISSN: [Bilani* et al.,7(2): February, 2018] Impact Factor: 5.164 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A REVIEWARTICLE OF SDRAM DESIGN WITH NECESSARY CRITERIA OF DDR CONTROLLER Sushmita Bilani *1 & Mr. Sujeet Mishra 2 *1 M.Tech Student

More information

EE 3170 Microcontroller Applications

EE 3170 Microcontroller Applications EE 3170 Microcontroller Applications Lecture 4 : Processors, Computers, and Controllers - 1.2 (reading assignment), 1.3-1.5 Based on slides for ECE3170 by Profs. Kieckhafer, Davis, Tan, and Cischke Outline

More information

Chapter 4. The Processor

Chapter 4. The Processor Chapter 4 The Processor Introduction CPU performance factors Instruction count Determined by ISA and compiler CPI and Cycle time Determined by CPU hardware We will examine two MIPS implementations A simplified

More information

CS146 Computer Architecture. Fall Midterm Exam

CS146 Computer Architecture. Fall Midterm Exam CS146 Computer Architecture Fall 2002 Midterm Exam This exam is worth a total of 100 points. Note the point breakdown below and budget your time wisely. To maximize partial credit, show your work and state

More information

Design of Double Precision Floating Point Multiplier Using Vedic Multiplication

Design of Double Precision Floating Point Multiplier Using Vedic Multiplication Design of Double Precision Floating Point Multiplier Using Vedic Multiplication 1 D.Heena Tabassum, 2 K.Sreenivas Rao 1, 2 Electronics and Communication Engineering, 1, 2 Annamacharya institute of technology

More information

PART A (22 Marks) 2. a) Briefly write about r's complement and (r-1)'s complement. [8] b) Explain any two ways of adding decimal numbers.

PART A (22 Marks) 2. a) Briefly write about r's complement and (r-1)'s complement. [8] b) Explain any two ways of adding decimal numbers. Set No. 1 IV B.Tech I Semester Supplementary Examinations, March - 2017 COMPUTER ARCHITECTURE & ORGANIZATION (Common to Electronics & Communication Engineering and Electronics & Time: 3 hours Max. Marks:

More information

Computer Systems. Binary Representation. Binary Representation. Logical Computation: Boolean Algebra

Computer Systems. Binary Representation. Binary Representation. Logical Computation: Boolean Algebra Binary Representation Computer Systems Information is represented as a sequence of binary digits: Bits What the actual bits represent depends on the context: Seminar 3 Numerical value (integer, floating

More information

CPE300: Digital System Architecture and Design

CPE300: Digital System Architecture and Design CPE300: Digital System Architecture and Design Fall 2011 MW 17:30-18:45 CBC C316 Pipelining 11142011 http://www.egr.unlv.edu/~b1morris/cpe300/ 2 Outline Review I/O Chapter 5 Overview Pipelining Pipelining

More information

Real instruction set architectures. Part 2: a representative sample

Real instruction set architectures. Part 2: a representative sample Real instruction set architectures Part 2: a representative sample Some historical architectures VAX: Digital s line of midsize computers, dominant in academia in the 70s and 80s Characteristics: Variable-length

More information

An Implementation of Double precision Floating point Adder & Subtractor Using Verilog

An Implementation of Double precision Floating point Adder & Subtractor Using Verilog IOSR Journal of Electrical and Electronics Engineering (IOSR-JEEE) e-issn: 2278-1676,p-ISSN: 2320-3331, Volume 9, Issue 4 Ver. III (Jul Aug. 2014), PP 01-05 An Implementation of Double precision Floating

More information

Pipelining and Vector Processing

Pipelining and Vector Processing Pipelining and Vector Processing Chapter 8 S. Dandamudi Outline Basic concepts Handling resource conflicts Data hazards Handling branches Performance enhancements Example implementations Pentium PowerPC

More information

International Journal of Digital Application & Contemporary research Website: (Volume 1, Issue 2, September 2012)

International Journal of Digital Application & Contemporary research Website:   (Volume 1, Issue 2, September 2012) Area Optimized 32-Bit Pipeline RISC Processor in VHDL Swati Joshi swati_joshi20@yahoo.co.in Puran Gour purangour@rediffmail.com Abstract RISC architecture is a solution of recent era for complex computation

More information

Hardware Design I Chap. 10 Design of microprocessor

Hardware Design I Chap. 10 Design of microprocessor Hardware Design I Chap. 0 Design of microprocessor E-mail: shimada@is.naist.jp Outline What is microprocessor? Microprocessor from sequential machine viewpoint Microprocessor and Neumann computer Memory

More information

Computer Architecture V Fall Practice Exam Questions

Computer Architecture V Fall Practice Exam Questions Computer Architecture V22.0436 Fall 2002 Practice Exam Questions These are practice exam questions for the material covered since the mid-term exam. Please note that the final exam is cumulative. See the

More information

Latches. IT 3123 Hardware and Software Concepts. Registers. The Little Man has Registers. Data Registers. Program Counter

Latches. IT 3123 Hardware and Software Concepts. Registers. The Little Man has Registers. Data Registers. Program Counter IT 3123 Hardware and Software Concepts Notice: This session is being recorded. CPU and Memory June 11 Copyright 2005 by Bob Brown Latches Can store one bit of data Can be ganged together to store more

More information

Embedded Systems Ch 15 ARM Organization and Implementation

Embedded Systems Ch 15 ARM Organization and Implementation Embedded Systems Ch 15 ARM Organization and Implementation Byung Kook Kim Dept of EECS Korea Advanced Institute of Science and Technology Summary ARM architecture Very little change From the first 3-micron

More information

Chapter 5. A Closer Look at Instruction Set Architectures

Chapter 5. A Closer Look at Instruction Set Architectures Chapter 5 A Closer Look at Instruction Set Architectures Chapter 5 Objectives Understand the factors involved in instruction set architecture design. Gain familiarity with memory addressing modes. Understand

More information

Chapter 4. MARIE: An Introduction to a Simple Computer

Chapter 4. MARIE: An Introduction to a Simple Computer Chapter 4 MARIE: An Introduction to a Simple Computer Chapter 4 Objectives Learn the components common to every modern computer system. Be able to explain how each component contributes to program execution.

More information

CO Computer Architecture and Programming Languages CAPL. Lecture 15

CO Computer Architecture and Programming Languages CAPL. Lecture 15 CO20-320241 Computer Architecture and Programming Languages CAPL Lecture 15 Dr. Kinga Lipskoch Fall 2017 How to Compute a Binary Float Decimal fraction: 8.703125 Integral part: 8 1000 Fraction part: 0.703125

More information

VLSI DESIGN OF FLOATING POINT ARITHMETIC & LOGIC UNIT

VLSI DESIGN OF FLOATING POINT ARITHMETIC & LOGIC UNIT VLSI DESIGN OF FLOATING POINT ARITHMETIC & LOGIC UNIT 1 DHANABAL R, 2 BHARATHI V, 3 G.SRI CHANDRAKIRAN, 4 BHARATH BHUSHAN REDDY.M 1 Assistant Professor (Senior Grade), VLSI division, SENSE, VIT University,

More information

Structure of Computer Systems

Structure of Computer Systems 288 between this new matrix and the initial collision matrix M A, because the original forbidden latencies for functional unit A still have to be considered in later initiations. Figure 5.37. State diagram

More information

A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN

A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN A SIMULINK-TO-FPGA MULTI-RATE HIERARCHICAL FIR FILTER DESIGN Xiaoying Li 1 Fuming Sun 2 Enhua Wu 1, 3 1 University of Macau, Macao, China 2 University of Science and Technology Beijing, Beijing, China

More information

COMPUTER STRUCTURE AND ORGANIZATION

COMPUTER STRUCTURE AND ORGANIZATION COMPUTER STRUCTURE AND ORGANIZATION Course titular: DUMITRAŞCU Eugen Chapter 4 COMPUTER ORGANIZATION FUNDAMENTAL CONCEPTS CONTENT The scheme of 5 units von Neumann principles Functioning of a von Neumann

More information

Introduction to Microcontrollers

Introduction to Microcontrollers Introduction to Microcontrollers Embedded Controller Simply an embedded controller is a controller that is embedded in a greater system. One can define an embedded controller as a controller (or computer)

More information

MARIE: An Introduction to a Simple Computer

MARIE: An Introduction to a Simple Computer MARIE: An Introduction to a Simple Computer 4.2 CPU Basics The computer s CPU fetches, decodes, and executes program instructions. The two principal parts of the CPU are the datapath and the control unit.

More information

EC 413 Computer Organization

EC 413 Computer Organization EC 413 Computer Organization Review I Prof. Michel A. Kinsy Computing: The Art of Abstraction Application Algorithm Programming Language Operating System/Virtual Machine Instruction Set Architecture (ISA)

More information

CISC Attributes. E.g. Pentium is considered a modern CISC processor

CISC Attributes. E.g. Pentium is considered a modern CISC processor What is CISC? CISC means Complex Instruction Set Computer chips that are easy to program and which make efficient use of memory. Since the earliest machines were programmed in assembly language and memory

More information

Digital System Design Using Verilog. - Processing Unit Design

Digital System Design Using Verilog. - Processing Unit Design Digital System Design Using Verilog - Processing Unit Design 1.1 CPU BASICS A typical CPU has three major components: (1) Register set, (2) Arithmetic logic unit (ALU), and (3) Control unit (CU) The register

More information