Soft processors as a prospective platform of the future

Similar documents
Microelettronica. J. M. Rabaey, "Digital integrated circuits: a design perspective" EE141 Microelettronica

CS310 Embedded Computer Systems. Maeng

CAD for VLSI. Debdeep Mukhopadhyay IIT Madras

EITF35: Introduction to Structured VLSI Design

System-on-Chip Architecture for Mobile Applications. Sabyasachi Dey

INTRODUCTION TO FPGA ARCHITECTURE

EE586 VLSI Design. Partha Pande School of EECS Washington State University

VLSI Design Automation

Altera SDK for OpenCL

Computers: Inside and Out

Digital Integrated Circuits

VLSI Design Automation

FPGA Based Digital Design Using Verilog HDL

Embedded Computing Platform. Architecture and Instruction Set

Power Solutions for Leading-Edge FPGAs. Vaughn Betz & Paul Ekas

Reduce Your System Power Consumption with Altera FPGAs Altera Corporation Public

VLSI Design Automation. Calcolatori Elettronici Ing. Informatica

ECE520 VLSI Design. Lecture 1: Introduction to VLSI Technology. Payman Zarkesh-Ha

Chapter 5: ASICs Vs. PLDs

Elettronica T moduli I e II

Design Metrics. A couple of especially important metrics: Time to market Total cost (NRE + unit cost) Performance (speed latency and throughput)

DIGITAL DESIGN TECHNOLOGY & TECHNIQUES

ELCT 503: Semiconductors. Fall Lecture 01: Introduction

Using ASIC circuits. What is ASIC. ASIC examples ASIC types and selection ASIC costs ASIC purchasing Trends in IC technologies

A VARIETY OF ICS ARE POSSIBLE DESIGNING FPGAS & ASICS. APPLICATIONS MAY USE STANDARD ICs or FPGAs/ASICs FAB FOUNDRIES COST BILLIONS

2011 Signal Processing CoDR: Technology Roadmap W. Turner SPDO. 14 th April 2011

HOME :: FPGA ENCYCLOPEDIA :: ARCHIVES :: MEDIA KIT :: SUBSCRIBE

Basic FPGA Architectures. Actel FPGAs. PLD Technologies: Antifuse. 3 Digital Systems Implementation Programmable Logic Devices

Introduction 1. GENERAL TRENDS. 1. The technology scale down DEEP SUBMICRON CMOS DESIGN

Field Programmable Gate Array (FPGA) Devices

Welcome. Altera Technology Roadshow 2013

EECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs)

An Introduction to Programmable Logic

VLSI Digital Signal Processing

OUTLINE. System-on-Chip Design ( ) System-on-Chip Design for Embedded Systems ( ) WHAT IS A SYSTEM-ON-CHIP?

Outline. EECS150 - Digital Design Lecture 6 - Field Programmable Gate Arrays (FPGAs) FPGA Overview. Why FPGAs?

Modeling Arbitrator Delay-Area Dependencies in Customizable Instruction Set Processors

Introduction to ICs and Transistor Fundamentals

ECE 2162 Intro & Trends. Jun Yang Fall 2009

ECE 637 Integrated VLSI Circuits. Introduction. Introduction EE141

Lecture 1: Welcome, why are you here? James C. Hoe Department of ECE Carnegie Mellon University

What is This Course About? CS 356 Unit 0. Today's Digital Environment. Why is System Knowledge Important?

Computer Architecture s Changing Definition

TABLE OF CONTENTS III. Section 1. Executive Summary

ECE 4514 Digital Design II. Spring Lecture 22: Design Economics: FPGAs, ASICs, Full Custom

RUN-TIME RECONFIGURABLE IMPLEMENTATION OF DSP ALGORITHMS USING DISTRIBUTED ARITHMETIC. Zoltan Baruch

ECE 471 Embedded Systems Lecture 2

ADVANCED FPGA BASED SYSTEM DESIGN. Dr. Tayab Din Memon Lecture 3 & 4

COE 561 Digital System Design & Synthesis Introduction

More Course Information

L2: FPGA HARDWARE : ADVANCED DIGITAL DESIGN PROJECT FALL 2015 BRANDON LUCIA

Research Challenges for FPGAs

CMPEN 411 VLSI Digital Circuits. Lecture 01: Introduction

Introduction. Summary. Why computer architecture? Technology trends Cost issues

Spiral 2-8. Cell Layout

ECE 486/586. Computer Architecture. Lecture # 2

Introduction CHAPTER IN THIS CHAPTER

EE241 - Spring 2004 Advanced Digital Integrated Circuits

What is this class all about?

VLSI Design Automation. Maurizio Palesi

What is this class all about?

What is this class all about?

Functional Programming in Hardware Design

Teaching Computer Architecture with FPGA Soft Processors

ECE484 VLSI Digital Circuits Fall Lecture 01: Introduction

CS Computer Architecture

Hardware Design Environments. Dr. Mahdi Abbasi Computer Engineering Department Bu-Ali Sina University

Trends in the Infrastructure of Computing

Moore s Law: Alive and Well. Mark Bohr Intel Senior Fellow

Computer Architecture

Hardware Design with VHDL PLDs I ECE 443. FPGAs can be configured at least once, many are reprogrammable.

Microprocessor Soft-Cores: An Evaluation of Design Methods and Concepts on FPGAs

Lecture #1. Teach you how to make sure your circuit works Do you want your transistor to be the one that screws up a 1 billion transistor chip?

Bringing the benefits of Cortex-M processors to FPGA

Embedded Systems. 7. System Components

CMPE 415 Programmable Logic Devices Introduction

EE141- Spring 2007 Introduction to Digital Integrated Circuits

Beyond Moore. Beyond Programmable Logic.

Introduction to ASIC Design

Microprocessor Systems

Programmable Logic Devices Introduction CMPE 415. Programmable Logic Devices

Now we are going to speak about the CPU, the Central Processing Unit.

Actel s SX Family of FPGAs: A New Architecture for High-Performance Designs

Advanced FPGA Design Methodologies with Xilinx Vivado

CSE : Introduction to Computer Architecture

Higher Level Programming Abstractions for FPGAs using OpenCL

EE241 - Spring 2000 Advanced Digital Integrated Circuits. Practical Information

Design Space Exploration Using Parameterized Cores

Advanced Computer Architecture (CS620)

Embedded System Design

FPGA for Complex System Implementation. National Chiao Tung University Chun-Jen Tsai 04/14/2011

The QR code here provides a shortcut to go to the course webpage.

Hardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

Heterogenous Computing

C Program Adventures. From C code to motion

Lecture 1: Introduction

Embedded Systems. 8. Hardware Components. Lothar Thiele. Computer Engineering and Networks Laboratory

A 3-D CPU-FPGA-DRAM Hybrid Architecture for Low-Power Computation

All MSEE students are required to take the following two core courses: Linear systems Probability and Random Processes

Introduction to Embedded Systems

Transcription:

Procedia Computer Science Volume 88, 2016, Pages 294 299 7th Annual International Conference on Biologically Inspired Cognitive Architectures, BICA 2016 Soft processors as a prospective platform of the future Dmitry Efanov 1, Konstantin Grigoryev 1, Pavel Roschin 1, and Vassili Leonov 2 1 National Research Nuclear University MEPhI, Moscow, Russian Federation 2 Scientific Research Institute of Rheumatology named after V. A. Nasonovoy, RAMS, Moscow, Russian Federation Abstract Modern field-programmable gate arrays (FPGA) play a very important role in designing of new soft CPUs and integrated systems-on-chips. Compared to an application-specific integrated circuit (ASIC), FPGAs provide the highest degree of flexibility being almost fully application neutral. The price of such flexibility is higher usage of basic logic gates and decrease in circuit operating frequency caused by the use of switched interconnect fabric as opposed to fixed metal interconnect defined by masks at manufacturing for ASIC. However due to more regular and less complex FPGA structure they lead in terms of new IC manufacturing technologies adoption. Therefore we can expect next generation of general purpose FPGA devices based on 14 nm norms in 2017. This article examines the applicability of Moore s Law to general purpose FPGAs, using datum for Xilinx Spartan and Altera Cyclone families. On the basis of the obtained data the conclusion that in the near future FPGAs become a prospective platform for general purpose CPUs implementation is drawn. Keywords: Moore s law, soft CPU, FPGA, Spartan, Xilinx, Cyclone, Altera, OpenRISC 1 Introduction Soft CPU is a microprocessor kernel implemented on an FPGA device using methods of logical synthesis from standard logic elements (LE), arithmetic units (AU) and distributed memory cells. There is always an engineering choice between general purpose and application specific digital devices. FPGA device, compared to an ASIC, provides the highest degree of flexibility being almost fully application neutral. The price of such flexibility though is higher usage of basic logic gates (transistors) and decrease in maximum circuit operating frequency caused by the use of switched interconnect fabric as opposed to fixed metal interconnect defined by masks at manufacturing for ASIC. But if we look at the history of computing, modern ASIC CPUs are less efficient compared to special purpose CPUs, still widely used for time critical or special applications. Nevertheless 294 Selection and peer-review under responsibility of the Scientific Programme Committee of BICA 2016 c The Authors. Published by Elsevier B.V. doi:10.1016/j.procs.2016.07.438

the reduction in difficulty of device implementing such functions justifies the use of general purpose CPUs (MPUs). Similarly the use of FPGA-based soft-cpus allows speeding up the development of such device features that can be primarily implemented only in hardware security, special functions, time critical and deterministic functions that require exact and predictable timing. In contrast to ASIC solutions soft-cpus allow: improve or fully replace CPU architecture and eliminate defects in the field; implement hard logic solutions with maximum reliability and information security; implement functions that can be efficiently implemented only with hard logic, for instance coding/decoding and hashing algorithms; implement functions that require hard deterministic and fixed timing. In particular hardware interfaces protocols implementation: significantly reduce components nomenclature by avoiding interface ASICs; standardize hardware component of digital control system for different products; avoid undocumented black box components in the design (full control over technologies used). 2 Applicability of Moore s law for general purpose FPGA devices 2.1 Main metrics Main metrics used to compare ASIC and FPGA-based soft-cpus implementations that determine their performance are the number of gates (transistors) and maximum internal clock frequency. Modern FPGAs also include distributed memory blocks and embedded blocks such as ALU, DSP and communication protocol engines. Elementary switch is single of multiple gate FET or complimentary pair, whereas LE corresponds to low-level integration IC, and consists of 20..30 elementary gates. Static memory cell consists of 3..6 gates. We use the following equation to estimate the equivalent gates counts for ASIC corresponding to FPGA metrics: N eq 2 N LE +4 N bit (1) N LE number of FPGA logic elements. N bit number of FPGA distributed RAM bits. Maximum clock freq for FPGA based soft-cpu is usually 3..5 times lower then for ASIC implementation, since in FPGA clock signal must pass through several statically controlled FET switches with resistance considerably higher then of ASIC s metal interconnect. 295

2.2 Moore s law Moore s law (named after the Intel Corp. co-founder Gordon E. Moore) as originally stated says that maximum amount of integrated circuit gates increases by a factor of two every 24 months [10]. In other words the number of elements grows exponentially until reaching the technology limits. Currently Intel s 14 nm FinFET looks like the limit of silicon planar process, being though the transitional towards future 3D technologies. We expect introduction of new physical principles base approaches, like graphene circuits that have inner predisposition towards dynamic reconfiguration in the field, as well as significant increase in the speed of signal propagation between LE, while decreasing power consumption by several orders of magnitude [1, 9]. Generation Year LE cost, 10 3 USD LE, thousands Technology, nm Spartan1 1998 22,68 1,86 350 Spartan2 2000 4,96 5,29 180 Spartan2E 2001 5,67 6,91 150 Cyclone1 2002 2,93 20,06 130 Spartan3 2003 1,19 62,21 90 Cyclone2 2004 2,28 18,75 90 Cyclone3 2007 1,6 24,62 65 Cyclone4 2009 1,38 28,85 60 Spartan6 LX 2009 1,08 147,44 45 Cyclone5 2013 0,7 301 28 Table 1: Cost and number of LE by general purpose FPGA generations We investigated the applicability of Moore s law for general purpose FPGA families Spartan (by Xilinx) and Cyclone (by Altera) that exhibit industry s best price/performance and are targeted at both general and special purpose devices markets, including embedded and mobile computing fields. There are 10 generations of general purpose FPGA devices passed since 1998, when first Spartan device was announced, to 2013, when Cyclone V became available. IC manufacturing norms improved from 350 to 28 nm, LE densities for these devices went from 2 to 300 thousands. Cost per one LE changed from 0.022 to 0.0007 USD (in 2013 prices) [2, 3, 4, 5, 7, 6]. We analyzed number of LE and cost of one LE for minimal LE cost devices in each of the FPGA generations, over 1.6 thousands part numbers (958 for Altera, 594 for Xilinx). We performed liner regression over base 2 logarithm of obtained datum, the liner regression coefficient then represents the inverse of time over which data changes by a factor of!2 (or 0.5 for negative values). We see then that Moore s law is true with considerable precision time for doubling the number of LE on a chip is close to 2.5 years (Figure 1), time for price reduction by a factor of 2 is close to 3 years. Since price depends on the inflation, we speculate that 0.5 year difference between LE doubling and cost reduction corresponds to inflation. Correlation coefficients: for LE quantity per chip R 2 =0.77, for LE cost R 2 =0.75. The model suitability was checked using Fisher s criteria. For confidence level α =0.05, degrees of freedom number ν =8(forn = 10 data points) the critical level is F cirt =5.3. Values obtained from data F actual =27.5 for LE quantity, and F actual =23.9 for LE cost, are in both cases above F crit. Thus our hypothesis that Moore s law is true for both LE quantity and LE cost versus time is statistically valid. 296

Figure 1: Quantity and LE cost for general purpose FPGAs, logarithmic scale 2.3 FPGA devices technologies are going 1-2 generations ahead of general purpose CPUs Due to more regular and less complex FPGA structure they lead in terms of new IC manufacturing technologies adoption. In particular Altera s Cyclone V series, that started shipping during the second quarter of 2013 is using 28 nm technologies, whereas none of the general purpose CPUs are using 28 nm technologies at the same time. In 2013 Altera started development of Stratix-10 series that will use 14 nm FinFET technology, with samples expected in this year. In the past delay from certain technology adoption in Stratix series to its use in Cyclone general purpose series was from 2 to 3 years. Thus we can expect next generation of general purpose FPGA devices based on 14 nm norms in 2017. Using regression relationships obtained above expected density should be in the 600-800 thousands LE, and cost of about $300 for such device, which relates well with prior transitions to next generation IC manufacturing technologies. 3 Economics of soft-cpu adoption Custom made ASICs development and NRE costs are much higher, while the unit cost for huge production volumes is much lower. In contrast to ASIC, FPGAs development and NRE costs are much lower, but the final product unit cost is higher, even for huge volumes. Since IC development is not end in itself, but is pursuing economic goal, the costs for different approaches should be considered. There are 3 types of costs the unit cost for final product (C prod ), the development cost (C devel ) and the cost (losses) of time to market delay (C ttm ). Product cost (C prod ) is the cost of production for one device unit and consists of cost of preparation for production one-time non-recurring engineering costs (C NRE ), that are incurred for each new device type manufactured, and cost of one unit production (C unit ), that consists of 297

Figure 2: Dependence of product unit cost on the number of units produced cost of materials used and cost of device (chip) packaging. C NRE costs are spread over all units produced and their share in the end product cost goes down with production volume increase. Therefore the cost of one device unit for production run of Nunits (Equation 2 and Figure 2): C prod = C unit + ( CNRE N units ) (2) The development cost (C devel ) for ASIC based product is much higher, due to the need for comprehensive logic and physical modeling that requires good understanding of specific foundry process used and manufacturing files (tape out) production and high cost of samples production (0.5 to 2 millions USD). C devel for FPGA-based product is much lower, mainly due to the ability to use successive approximations method, since the changes and the requirements and configuration debugging is reduced to well known for software development recompile/load/test cycle. With FPGA solution cost reduction the number of product units (N units ) making FPGA based product profitable increases and currently, for soft-cpus and systems on the chip (SOC) is over 10 thousand units. 4 Comparison of soft-cpus solutions with well known ASIC CPUs Equation 1 may be used to evaluate the feasibility of FPGA-based soft-cpus solutions. For instance FPGA with 300K LE and 15Mbit block memory is equivalent to an ASIC with approximately 50M gates. Using this approach we can estimate which well known CPUs and GPUs [8] may be implemented using modern low cost FPGA devices (Table 2). RISC architecture should be used for soft-cpu implementation, in particular, OpenRISC is optimized for FPGA implementation and achieves CISC performance at much lower gates counts. 298

Processor Year Gate, Millions Clock frequency, Ghz Technology, nm ARM Cortex A9 2007 26 1.4 40 Intel Pentium Mobile 2003 55 1.7 90 Intel Atom 2008 47 1.8 45 Intel Pentium 4 2000 42 1.5 180 Nvidia NV25 2002 63 1.6 150 UltraSPARC T1 2005 279 1 90 5 Conclusions Table 2: Number of gates for some well known CPUs and GPUs The dynamics of the development of technology and the characteristics of modern FPGA families of FPGA devices allow to conclude that today soft-cpus is the most promising hardware platform for the next generation of information technology. Also soft-cpus are most prospective for use in the unified digital platform aimed at: general purpose products with digital control; tablet and mobile computers; computer terminals of various kinds; personal computers with improved security and reliability. References [1] S. Chun-Yung and L. Ji Ung. Graphene: The ultimate switch. IEEE Spectrum, 49(1):32 59, 2012. [2] Altera Coporation. Cyclone device handbook. [3] Altera Coporation. Cyclone II device handbook. [4] Altera Coporation. Cyclone III device handbook. [5] Altera Coporation. Cyclone IV device handbook. [6] Altera Coporation. Cyclone V device datasheet. [7] Altera Coporation. Cyclone V device handbook. [8] Intel Coporation. Transistors to transformations. [9] Kyeong-Jae Lee, Hyesung Park, Jing Kong, and Anantha P Chandrakasan. Demonstration of a subthreshold fpga using monolithically integrated graphene interconnects. Electron Devices, IEEE Transactions on, 60(1):383 390, 2013. [10] Gordon E Moore. Cramming more components onto integrated circuits, reprinted from electronics, volume 38, number 8, april 19, 1965, pp. 114 ff. IEEE Solid-State Circuits Newsletter, 3(20):33 35, 2006. 299