DRAM Fab partnership for Intelligent RAM (IRAM)

Similar documents
IRAM: A Microprocessor for the Post-PC Era

Intelligent RAM (IRAM): Chips that remember and compute

FABRICATION TECHNOLOGIES

The Microprocessor as a Microcosm:

An Overview of Standard Cell Based Digital VLSI Design

An overview of standard cell based digital VLSI design

The Future of Microprocessors Embedded in Memory

Digital System Design Lecture 2: Design. Amir Masoud Gharehbaghi

CHAPTER 1 INTRODUCTION

Design Objectives of the 0.35µm Alpha Microprocessor (A 500MHz Quad Issue RISC Microprocessor)

Memory Systems IRAM. Principle of IRAM

Design Solutions in Foundry Environment. by Michael Rubin Agilent Technologies

HOMEWORK 10 CMPEN 411 Due: 4/28/ :30pm

CMOS VLSI Design. Final Project

Intelligent RAM (IRAM): Chips that remember and compute

Introduction to CMOS VLSI Design (E158) Project 2 Spring 2008

TSBCD025 High Voltage 0.25 mm BCDMOS

HP PA-8000 RISC CPU. A High Performance Out-of-Order Processor

On GPU Bus Power Reduction with 3D IC Technologies

Overview of Digital Design Methodologies

EECS150 - Digital Design Lecture 17 Memory 2

1. Designing a 64-word Content Addressable Memory Background

Spiral 2-8. Cell Layout

Processor Architectures At A Glance: M.I.T. Raw vs. UC Davis AsAP

Chip Core. Final Assembly. Core Sizes. Connecting Core to Pads. Core. Pad Ring The connection to the outside world

CMOS VLSI Design. MIPS Processor Example. Outline

CMPEN411 Memory Chip Design Project Report

ECE410 Design Project Spring 2013 Design and Characterization of a CMOS 8-bit pipelined Microprocessor Data Path

Physical Implementation

The Design of the KiloCore Chip

HOMEWORK 7 CMPEN 411 Due: 3/22/ :30pm

Physical Design of a 3D-Stacked Heterogeneous Multi-Core Processor

IMAGINE: Signal and Image Processing Using Streams

Design Methodologies and Tools. Full-Custom Design

0.35um design verifications

Design and Implementation of a Super Scalar DLX based Microprocessor

The Fujitsu ASIC Platform:

MOSAID Semiconductor

COE 561 Digital System Design & Synthesis Introduction

CS654 Advanced Computer Architecture. Lec 3 - Introduction

ECE232: Hardware Organization and Design

CS 152 Computer Architecture and Engineering. Lecture 6 - Memory

UCLA 3D research started in 2002 under DARPA with CFDRC

DRC and LVS checks using Cadence Virtuoso Version 3.0

Lecture 18: DRAM Technologies

HOMEWORK 9 CMPEN 411 Due: 4/12/ :30pm

Using ASIC circuits. What is ASIC. ASIC examples ASIC types and selection ASIC costs ASIC purchasing Trends in IC technologies

Main Memory. EECC551 - Shaaban. Memory latency: Affects cache miss penalty. Measured by:

Intelligent RAM (IRAM): Chips that remember and compute

Graphics: Alexandra Nolte, Gesine Marwedel, Universität Dortmund. RTL Synthesis

EE 330 Laboratory Experiment Number 11

ECE 459/559 Secure & Trustworthy Computer Hardware Design

CMSC 411 Computer Systems Architecture Lecture 2 Trends in Technology. Moore s Law: 2X transistors / year

HOMEWORK 2 CMPEN 411 Due: 1/31/ :30pm

take control of your photonics design flow Photonic-Electronic IC design and implementation Pieter Dumon 27/09/2016

TEXAS INSTRUMENTS ANALOG UNIVERSITY PROGRAM DESIGN CONTEST MIXED SIGNAL TEST INTERFACE CHRISTOPHER EDMONDS, DANIEL KEESE, RICHARD PRZYBYLA SCHOOL OF

EE 330 Laboratory 3 Layout, DRC, and LVS Fall 2015

File: 'ReportV37P-CT89533DanSuo.doc' CMPEN 411, Spring 2013, Homework Project 9 chip, 'Tiny Chip' fabricated through MOSIS program

electronic lab 11 Fedora Electronic Lab empowers hardware engineers and universities with opensource solutions for micro nano electronics engineering.

A Non-Volatile Microcontroller with Integrated Floating-Gate Transistors

Professor Lee, Yong Surk. References 고성능마이크로프로세서구조의개요. Topics Microprocessor & microcontroller

10 Gbit/s Challenge inside the Openlab framework

CS 152, Spring 2011 Section 8

Design of a Low Density Parity Check Iterative Decoder

NVIDIA GTX200: TeraFLOPS Visual Computing. August 26, 2008 John Tynefield

Implementing Tile-based Chip Multiprocessors with GALS Clocking Styles

EE 330 Laboratory 3 Layout, DRC, and LVS

VLSI Lab Tutorial 3. Virtuoso Layout Editing Introduction

RTL Coding General Concepts

Three-Dimensional Integrated Circuits: Performance, Design Methodology, and CAD Tools

ECE331: Hardware Organization and Design

Serial Adapter for I 2 C / APFEL and 8 channel DAC ASIC

VLSI Design Automation

3D TECHNOLOGIES: SOME PERSPECTIVES FOR MEMORY INTERCONNECT AND CONTROLLER

Lecture 14: Multithreading

UNIVERSITY OF CALIFORNIA College of Engineering Department of Electrical Engineering and Computer Sciences Lab #2: Layout and Simulation

Laboratory 6. - Using Encounter for Automatic Place and Route. By Mulong Li, 2013

Chapter 5: ASICs Vs. PLDs

Jim Keller. Digital Equipment Corp. Hudson MA

CS61C Machine Structures. Lecture 1 Introduction. 8/27/2006 John Wawrzynek (Warzneck)

CS 152 Computer Architecture and Engineering. Lecture 7 - Memory Hierarchy-II

3D-IC is Now Real: Wide-IO is Driving 3D-IC TSV. Samta Bansal and Marc Greenberg, Cadence EDPS Monterey, CA April 5-6, 2012

Allegro Sigrity SI Streamlining the creation of high-speed interconnect on digital PCBs and IC packages

Verifying the Correctness of the PA 7300LC Processor

The Alpha and Microprocessors:

ECE520 VLSI Design. Lecture 1: Introduction to VLSI Technology. Payman Zarkesh-Ha

PrimeTime: Introduction to Static Timing Analysis Workshop

CALIFORNIA INSTITUTE OF TECHNOLOGY VIBA Correlator Project. VLSI TECHNOLOGY REPORT M. S. Ewing December 28, 1984

Vector Architectures Vs. Superscalar and VLIW for Embedded Media Benchmarks

TUTORIAL II ECE 555 / 755 Updated on September 11 th 2006 CADENCE LAYOUT AND PARASITIC EXTRACTION

Application of Power-Management Techniques for Low Power Processor Design

Implementing Multi-Gigabit Serial Links in a System of PCBs

Ting Wu, Chi-Ying Tsui, Mounir Hamdi Hong Kong University of Science & Technology Hong Kong SAR, China

RISECREEK: From RISC-V Spec to 22FFL Silicon

Evolution of Implementation Technologies. ECE 4211/5211 Rapid Prototyping with FPGAs. Gate Array Technology (IBM s) Programmable Logic

EE115C Digital Electronic Circuits. Tutorial 4: Schematic-driven Layout (Virtuoso XL)

Lab. Course Goals. Topics. What is VLSI design? What is an integrated circuit? VLSI Design Cycle. VLSI Design Automation

Introduction 1. GENERAL TRENDS. 1. The technology scale down DEEP SUBMICRON CMOS DESIGN

Chapter 2: Memory Hierarchy Design (Part 3) Introduction Caches Main Memory (Section 2.2) Virtual Memory (Section 2.4, Appendix B.4, B.

COMPUTER ARCHITECTURE

Transcription:

DRA Fab partnership for Intelligent RA (IRA) David Patterson and John Wawrzynek patterson@cs.berkeley.edu http://iram.cs.berkeley.edu/ EECS, University of California Berkeley, CA 94720-1776 1

Outline Overview of V-IRA-1 Tentative Schedule Phases and Deliverables V-IRA-1 SW/HW/Fab Tasks Requirements for Test Chips Requirements for V-IRA-1 Prototype Where do we go from here? 2

V-IRA-1: 0.25 µm, Fast Logic, 500 hz 4 GFLOPS(64b)/32 GOPS(8b)/24B 2-way Superscalar Processor Vector Instruction Queue + x Load/Store 4 x 64 or 8 x 32 or 16 x 16 or 32 x 8 Net Int 8K I cache 8K D cache Vector Registers 4 x 64 emory Crossbar 4 x 64 4 x 64 4 x 64 4 x 64 4 x 64 4 x 64 4 x 64 3

V-IRA-1 Floorplan emory (96 bits) emory Crossbar 4 Vector Units emory Crossbar CPU +$ I/O 0.25 µm, 256 bdra Die size = DRA die 256 Xtors: 80% emory, 8% Vector, 6% CPU regular design emory (96 bits) 4

Goals for Vector IRA Generations V-IRA-1 ( 1999) 256 bit generation (0.25) Die size = 256 b DRA die 1.5-2.0 v logic, 0.5-2.0 watts 300-500 Hz 4 64-bit pipes/lanes 4 GFLOPS(64b)/32GOPS(8b) 30-50 GB/sec em. BW 24 B capacity + DRA bus PCI bus/ FC-AL (serial SCSI) V-IRA-2 ( 2002) 1 Gbit generation (0.18) Die size = 1 Gb DRA die 1.0-1.5 v logic, 0.5-2.0 w 500-1000 Hz 8 64-bit pipes/lanes 16 GFLOPS/128GOPS 100-200 GB/sec em. BW 96 B cap. + DRA bus any Gbit Ethernet/FC-AL 5

V-IRA-1 Tentative Plan Phase I: Feasibility stage ( H1 98) Test chip, CAD agreement, architecture defined Phase 2: Design Stage ( H2 98) Simulated design Phase 3: Layout & Verification ( H2 99) Tape-out Phase 4: Fabrication,Testing, and Demonstration ( H1 00) Functional integrated circuit 6

Phase I: Feasibility Stage Berkeley Deliverables White Paper, Architecture, Project Schedule Partner Deliverables HSPICE, Design Rules, PAD Design, Cell Library? Prototype runs for test chip (2 times?) ilestones Test chip working, architecture defined, CAD agreement (Test chip to exercise tools and partner relationship as well as DRA/logic circuits) 7

Phase 2: Design Stage Berkeley Deliverables HDL description of logic portion of IRA, compilers, software libraries Partner Deliverables DRA Design, PAD design, Cell Library Prototype runs for test chip in 256bit process? ilestones Simulated design 8

Phase 3: Layout and Verification Berkeley Deliverables Test vectors Partner Deliverables DRA layout, PAD layout ilestone Tape-out 9

Phase 4: Fabrication,Testing, and Demonstration Berkeley Deliverables Prototype PCB, Testing Partner Deliverables Packaged parts, Test rig ilestone Functional integrated circuit 10

V-IRA-1 Software Tasks Libraries Vector/Graphic/Image BLAS I/O Kernel OS Compilers Scalar C compiler Vector C compiler Instruction set level simulator Who does it? Berkeley Berkeley Berkeley Berkeley Compilers Partner? Berkeley Berkeley 11

V-IRA-1 Hardware Tasks Reg. Trans. Level simulator Scalar Processor Vector Unit/emory System Circuits/Layout/Schematics Processor Caches Vector Unit» Fl. Point Unit/Datapath Generator? Crossbar DRA odules Fast Serial I/O Who does it? Partner? Berkeley Partner? Partner? Berkeley» Partner? Berkeley Partner Berkeley 12

V-IRA-1 Validation/Fab Tasks Simulation/Validation Fabrication Wafer test DRA Logic Packaging Testing packaged parts Board design (copies for both) Demo (copies for both) ISSCC paper Berkeley Partner Joint plan Partner Berkeley? Partner? Berkeley? Berkeley Berkeley Joint! 13

Phase I: Requirements for Test Chips CAD Design Flow General Design Information DRA acrocell description Testing plan/strategy Work on together? (logic and DRA) Someone to answer questions/visitor at UCB? Letter of understanding and wafer foundry agreement or prototype service agreement 14

Requirements for Test Chips CAD Design Flow What is it at Partner s company? What CAD tools should we buy? What CAD tools should we borrow? One example set: Layout: Cadence Dracula Schematic Entry: Cadence ICDE (Integrated Circuit Design Environment) Circuit: HSPICE Logic Synthesis/Simulations: Synopsys Cell Library:?? 15

Requirements for Test Chips General Design Information: 1. HSPICE device model parameters (typical, fast, slow) 2. Process Parameters: Cadence DRACULA LPE file 3. Design rules file: Cadence DRACULA DRC file 4. Electrical rules file: Cadence DRACULA ERC command file 5. Cadence DRACULA LVS command file 6. I/O pads layout 16

Requirements for Test Chips DRA acrocell 1. Available configuration» Layout area, number of inputs/outputs, location of pins 2. Power requirements» Routing requirement, peak and average current 3. Interface timing» Access time, clock timing, signal level 4. Guidelines of metal routing over array 5. Refresh rate and cycles 17

Desired for V-IRA-1 Prototype 0.25 micron, fast transistor, 4 metal layers Fast turnaround/testing Flexible interface to DRA modules Scalar processor in that process? Group (not at UCB) to help with design? Testing plan/strategy Someone to answer questions/visitor at UCB? Letter of understanding and wafer foundry agreement or prototype service agreement 18

Specific Questions Scalar processor available for V-IRA-1? Clock rate, power, area, pipeline, floating point, superscalar, performance on benchmarks Technology for test chips, V-IRA-1 Logic: L effective, V t, V cc ; C cc x V dd / I sat DRA: cell size, V t, V cc Scaling logic across generations? (drop-in DRA) Simplified logic design rules span DRA generations? e.g., Test chip 1 (Spring 98) in 4th Gen. 64b; Test chip 2 (Spr 99)/V-IRA-1 (Fall 99) in 2G 256b 19

Specific Questions (cont d) What is maturity of proposed technology? Engineering samples, first customer ship, mass production? How long is process available to fab in? Technology roadmap for DRA, merged logic? Test chip die size limits? Larger than DRA die? V-IRA=1 die size limit? Larger than DRA die? Packaging options? Ball grid array? 20

Where do we go from here? Partner determines if is interested or not Partner proposes what resources/technology can provide UCB, plan for prototype Pick 2 partners for test chips Depending on results/relationship/business plans, pick 1 partner for V-IRA-1 prototype Even if not prototype partner, companies can participate in IRA project, supporting students, attending two retreats per year,... Other issues? 21