ECE 486/586. Computer Architecture. Lecture # 2

Similar documents
Copyright 2012, Elsevier Inc. All rights reserved.

Fundamentals of Quantitative Design and Analysis

Computer Architecture A Quantitative Approach, Fifth Edition. Chapter 1. Copyright 2012, Elsevier Inc. All rights reserved. Computer Technology

EECS4201 Computer Architecture

Transistors and Wires

TDT 4260 lecture 2 spring semester 2015

Lecture 1: Introduction

CS/EE 6810: Computer Architecture

Fundamentals of Computer Design

CSE 502 Graduate Computer Architecture

Advanced Computer Architecture (CS620)

Lecture 2: Performance

Introduction to Computer Architecture II

PERFORMANCE METRICS. Mahdi Nazm Bojnordi. CS/ECE 6810: Computer Architecture. Assistant Professor School of Computing University of Utah

Advanced Computer Architecture Week 1: Introduction. ECE 154B Dmitri Strukov

Microelettronica. J. M. Rabaey, "Digital integrated circuits: a design perspective" EE141 Microelettronica

Computer Architecture. Minas E. Spetsakis Dept. Of Computer Science and Engineering (Class notes based on Hennessy & Patterson)

ECE 2162 Intro & Trends. Jun Yang Fall 2009

When and Where? Course Information. Expected Background ECE 486/586. Computer Architecture. Lecture # 1. Spring Portland State University

The Computer Revolution. Classes of Computers. Chapter 1

Exercise 1 Due 02.November 2010, 12:15pm

Lecture 1: CS/ECE 3810 Introduction

CAD for VLSI. Debdeep Mukhopadhyay IIT Madras

Advanced Computer Architecture Week 1: Introduction. ECE 154B Dmitri Strukov

Introduction. Summary. Why computer architecture? Technology trends Cost issues

CS Computer Architecture Spring Lecture 01: Introduction

1.13 Historical Perspectives and References

Problem 2 If the cost of a 12 inch wafer (actually 300mm) is $3500, what is the cost/die for the circuit in Problem 1.

Microprocessor Trends and Implications for the Future

LECTURE 1. Introduction

CMSC 411 Computer Systems Architecture Lecture 2 Trends in Technology. Moore s Law: 2X transistors / year

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 1. Computer Abstractions and Technology

Contents Slide Set 9. Final Notes on Textbook Chapter 7. Outline of Slide Set 9. More about skipped sections in Chapter 7. Outline of Slide Set 9

Advanced Computer Architecture Week 1: Introduction. ECE 154B Dmitri Strukov

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. 5 th. Edition. Chapter 1. Computer Abstractions and Technology

Chapter 1. The Computer Revolution

Computer Architecture Lecture 1: Fundamentals of Quantitative Design and Analysis (Chapter 1)

EE282 Computer Architecture. Lecture 1: What is Computer Architecture?

ECE484 VLSI Digital Circuits Fall Lecture 01: Introduction

Outline Marquette University

Slide Set 8. for ENCM 369 Winter 2018 Section 01. Steve Norman, PhD, PEng

Using ASIC circuits. What is ASIC. ASIC examples ASIC types and selection ASIC costs ASIC purchasing Trends in IC technologies

EE586 VLSI Design. Partha Pande School of EECS Washington State University

EN105 : Computer architecture. Course overview J. CRENNE 2015/2016

ECE 154A. Architecture. Dmitri Strukov

The Memory Hierarchy 1

CSCI 402: Computer Architectures. Computer Abstractions and Technology (4) Fengguang Song Department of Computer & Information Science IUPUI.

EECS2021E EECS2021E. The Computer Revolution. Morgan Kaufmann Publishers September 12, Chapter 1 Computer Abstractions and Technology 1

Overview of Today s Lecture: Cost & Price, Performance { 1+ Administrative Matters Finish Lecture1 Cost and Price Add/Drop - See me after class

Power dissipation! The VLSI Interconnect Challenge. Interconnect is the crux of the problem. Interconnect is the crux of the problem.

Thomas Polzer Institut für Technische Informatik

Advanced Computer Architecture Week 1: Introduction. ECE 154B Dmitri Strukov

Performance of computer systems

ECE520 VLSI Design. Lecture 1: Introduction to VLSI Technology. Payman Zarkesh-Ha

ECE 637 Integrated VLSI Circuits. Introduction. Introduction EE141

CSE 548 Computer Architecture. Clock Rate vs IPC. V. Agarwal, M. S. Hrishikesh, S. W. Kechler. D. Burger. Presented by: Ning Chen

DEPARTMENT OF ECE IV YEAR ECE EC6009 ADVANCED COMPUTER ARCHITECTURE LECTURE NOTES

Computer Architecture s Changing Definition

EE5780 Advanced VLSI CAD

3D systems-on-chip. A clever partitioning of circuits to improve area, cost, power and performance. The 3D technology landscape

ENIAC - background. ENIAC - details. Structure of von Nuemann machine. von Neumann/Turing Computer Architecture

Large and Fast: Exploiting Memory Hierarchy

Chapter 1. Computer Abstractions and Technology. Lesson 2: Understanding Performance

Multi-Core Microprocessor Chips: Motivation & Challenges

Energy Efficient Computing Systems (EECS) Magnus Jahre Coordinator, EECS

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 1. Computer Abstractions and Technology

ELE 455/555 Computer System Engineering. Section 1 Review and Foundations Class 5 Computer System Performance

Computer Architecture. R. Poss

IMPROVING ENERGY EFFICIENCY THROUGH PARALLELIZATION AND VECTORIZATION ON INTEL R CORE TM

ECE 152 Introduction to Computer Architecture

Microelectronics. Moore s Law. Initially, only a few gates or memory cells could be reliably manufactured and packaged together.

Parallelism and Concurrency. COS 326 David Walker Princeton University

Moore s s Law, 40 years and Counting

Parallelism: The Real Y2K Crisis. Darek Mihocka August 14, 2008

Organization. 5.1 Semiconductor Main Memory. William Stallings Computer Organization and Architecture 6th Edition

Chapter 2. Perkembangan Komputer

CS24: INTRODUCTION TO COMPUTING SYSTEMS. Spring 2017 Lecture 13

What is this class all about?

Last Time. Making correct concurrent programs. Maintaining invariants Avoiding deadlocks

Computer Architecture

Advanced Computer Architecture

CS/COE0447: Computer Organization

OVERCOMING THE MEMORY WALL FINAL REPORT. By Jennifer Inouye Paul Molloy Matt Wisler

Reducing DRAM Latency at Low Cost by Exploiting Heterogeneity. Donghyuk Lee Carnegie Mellon University

Lecture-14 (Memory Hierarchy) CS422-Spring

Computer & Microprocessor Architecture HCA103

Cutting Power Consumption in HDD Electronics. Duncan Furness Senior Product Manager

How What When Why CSC3501 FALL07 CSC3501 FALL07. Louisiana State University 1- Introduction - 1. Louisiana State University 1- Introduction - 2

Core 2 vs I-series. How Far Have We Really Come?

Phase Change Memory An Architecture and Systems Perspective

Many-Core Computing Era and New Challenges. Nikos Hardavellas, EECS

Course overview Computer system structure and operation

Low-Power Technology for Image-Processing LSIs

What is this class all about?

Part 1 of 3 -Understand the hardware components of computer systems

Silicon Memories. Why store things in silicon? It s fast!!! Compatible with logic devices (mostly)

Addressing the Memory Wall

BREAKING THE MEMORY WALL

Emergence of Segment-Specific DDRn Memory Controller and PHY IP Solution. By Eric Esteve (PhD) Analyst. July IPnest.

Computer and Information Sciences College / Computer Science Department CS 207 D. Computer Architecture

Transcription:

ECE 486/586 Computer Architecture Lecture # 2 Spring 2015 Portland State University

Recap of Last Lecture Old view of computer architecture: Instruction Set Architecture (ISA) design Real computer architecture: Design to maximize performance within constraints: cost, power, and availability Includes ISA, microarchitecture, hardware Tasks of a computer architect: Determine features needed by a market and incorporate those features in the computer Identify important technology scaling trends and adapt computer design accordingly

Lecture Topics Trends in Semiconductor Technology Logic, DRAM and Storage Technology Bandwidth and Latency scaling Transistors and Wires Trends in power and Energy Trends in Cost Impact of time, volume and commoditization Cost of an Integrated Circuit Reference: Chapter 1: Sections 1.4, 1.5 and 1.6 (Pages 17 32)

Technology Trends: Transistor Counts Transistor density increases by ~ 35% per year Die size increase less predictable, ranging from 10% to 20% per year Combined effect: On-chip transistor count increases by 40% to 55% per year (2x increase every 18 to 24 months) Popularly known as Moore s Law What can we do with all these extra transistors: More complex processors (e.g., deeper pipelines, SIMD units) More general-purpose cores on a chip More special-purpose cores (e.g., graphics) on a chip Larger on-chip caches

Technology Trends: DRAM In 1990s, DRAM capacity was increasing by 60% per year (quadrupling every 3 years) More recently, DRAM capacity increase has slowed down to 25% -- 40% (doubling every 2 to 3 years) Capacity increases will continue to slow down due to charge storage limits of the DRAM capacitors DRAM may stop scaling before the end of this decade Impact on Architecture: Need to explore other memory technologies to replace DRAM as the main memory

Technology Trends: Storage Flash Memory: Capacity increasing by 50 to 60% per year (doubling in < 2 years) 15-20X cheaper/bit than DRAM Increasing use as SSD in Laptops and as the only storage device in tablets and smartphones Magnetic Disk Technology: Capacity increasing by 40% per year (doubling every 2 to 3 years) 15-25X cheaper/bit then Flash 300-500X cheaper/bit than DRAM Most widely used in server and warehouse-scale storage

Why care about Technology Trends? Architect needs to be aware of technology trends to make the correct design choices and trade-offs Product design begins 3 5 years before the product is expected to hit the market Consistent technology trends help in knowing how a design would behave in the future

Bandwidth vs. Latency Bandwidth or throughput Total work done in a given time 10,000-25,000X improvement for processors 300-1200X improvement for memory and disks Latency or response time Time between start and completion of an event 30-80X improvement for processors 6-8X improvement for memory and disks

Bandwidth vs. Latency Bandwidth improvements significantly outpace latency improvements

Technology Trends: Transistors Scaling refers to reduction in integrated circuit feature size (in x or y dimension) 1971: 10 micrometer 2011: 32 nanometers Transistor density (count per unit area) increases quadratically with feature size reductions (e.g. 2x density increase from 32 nm to 22 nm process technology) Transistor speed typically increases linearly with decreasing feature size Complex for multiple reasons including decrease in supply voltage

Technology Trends: Wires Wire delays do not scale well with technology Delay proportional to Resistance * Capacitance Wire lengths decrease with feature size reductions But, resistance and capacitance per unit length increase Poor wire delay scaling compared to transistor scaling creates design challenges Larger fractions of processor clock cycle consumed by signal propagation delay on wires

Power & Energy: A Systems Perspective System Needs: Power needs to brought in and distributed around the chip Power dissipated as heat needs to be removed Considerations: Peak Power Power supply system needs to provision for peak power needs If a processor attempts to draw more current than the supply can provide, there is a voltage droop, which leads to malfunction Thermal Design power (TDP) Characterizes sustained power consumption Determined by the capabilities of the cooling system Lower than peak power, higher than average power consumption Energy Efficiency Considers power and execution time (Energy = Power * Time)

Power vs. Energy Power poses operating constraints: can only execute fast enough to max out the power delivery or the cooling solution Energy is the ultimate metric measures the true cost of performing a task has a direct impact on battery lives (portable devices) and electricity bills (servers) Example: If processor A consumes 20% more power than processor B but finishes the task in 30% less time, its relative energy is 1.2 * 0.7 = 0.84. Processor A is better, provided its higher power can be supported by the power delivery and cooling systems

Dynamic Energy and Power Dynamic energy Transistor switches from 0 -> 1 or 1 -> 0 ½ x Capacitive load x Voltage 2 Dynamic power ½ x Capacitive load x Voltage 2 x Frequency switched Reducing clock rate reduces power, not energy Reducing voltage reduces both power and energy Voltages have dropped from 5V to < 1V in 20 years

Example Question: Consider a processor which can operate in two different modes: (i) Mode-1: a high voltage mode (1V, 3GHz), and (ii) Mode-2: a low voltage mode (0.75V, 2 GHz). How much savings in dynamic energy and dynamic power will the processor achieve by operating in Mode-2 as compared to Mode-1? Answer: (Energy) mode2 /(ENERGY) mode1 = (0.75) 2 /(1) 2 = 0.56 44% energy savings (Power) mode2 /(Power) mode1 = (0.75) 2 (2) / (1) 2 (3)= 0.375 62.5% power savings

Power Wall Until 2003, increases in transistor count and frequency dominated reductions in voltage => net increase in power Intel 80386 consumed ~ 2 W 3.3 GHz Intel Core i7 consumes 130 W Heat must be dissipated from 1.5 x 1.5 cm chip This is the limit of what can be cooled by air Result: Clock speeds became stagnant from 2003 onwards

Techniques to Reduce Power Do nothing well: Turn off the clock of inactive modules, e.g., idle FP units or cores Dynamic Voltage-Frequency Scaling: Multiple operating modes with different voltages and frequencies In periods of low activity, switch to lower voltage (frequency) Low power memory states DRAMs have a series of increasingly lower power modes Switching from a low power mode to active mode consumes extra latency Overclocking Turbo mode in Intel processors

Static Power Power consumed when the system is idle Current static x Voltage Proportional to the number of transistors Increasing rapidly with larger on-chip SRAM caches To cut down static power: Need to turn off the power supply to inactive modules (power gating)

Cost and Price Trends Impacted by time, volume, and commodification Cost decreases with time due to: Learning curve resulting in improved yields Recognized opportunities for cost reductions Cost decreases with volume due to: Learning curve reached much faster Purchasing and manufacturing efficiency Reduction in amortized per unit cost of R&D Cost decreases with commodification due to: Multiple vendors producing large volumes of mostly identical products, leading to competition and price reduction Intense competition results in lower profit margins and reduced prices

Intel Pentium 4 and Pentium M Pricing Price reduces with time, as the manufacturing process matures and volume increases

IC Manufacturing Integrated circuit manufacturing starts from the production of silicon wafers Silicon ingot is sliced into silicon wafers Wafers go through multiple processing steps Patterned wafers are tested and bad wafers are removed from the population Good wafers are chopped into dies which go through another testing process Good dies are packaged, re-tested and then sent to customers

Example: Intel Sandy Bridge 280 dies/300 mm wafer, 32nm process technology

Cost of an Integrated Circuit

Cost of an Integrated Circuit Bose-Einstein formula: Wafer yield: accounts for wafers that are completely bad Defects per unit area: Measure of random manufacturing defects 0.016-0.057 defects per square cm (2010) N (Process complexity factor) Measure of manufacturing difficulty 11.5-15.5 (40 nm, 2010)

Yield Example Problem: Assume Wafer yield = 100% Assume Defect Density = 0.031 per cm 2 Assume N= 13.5 Compare die yields for a die that is 1.5 cm on a side with a die that is 1 cm on a side?

Yield Example Problem: Assume Wafer yield = 100% Assume Defect Density = 0.031 per cm 2 Assume N= 13.5 Compare die yields for a die that is 1.5 cm on a side with a die that is 1 cm on a side? Solution: Die A: 1.5 cm on a side Area = 1.5 2 = 2.25cm 2 Die Yield = 100% * 1/(1 + 0.031 * 2.25) 13.5 = 0.4

Yield Example Problem: Assume Wafer yield = 100% Assume Defect Density = 0.031 per cm 2 Assume N= 13.5 Compare die yields for a die that is 1.5 cm on a side with a die that is 1 cm on a side? Solution: Die A: 1.5 cm on a side Area = 1.5 2 = 2.25cm 2 Die Yield = 100% * 1/(1 + 0.031 * 2.25) 13.5 = 0.4 Die B: 1 cm on a side Area = 1 2 = 1cm 2 Die Yield = 100% * 1/(1 + 0.031 * 1) 13.5 = 0.66

Yield Example (cont.) Problem: For the problem on previous page, compare the number of good dies for a 30 cm wafer in each case.

Yield Example (cont.) Problem: For the problem on previous page, compare the number of good dies for a 30 cm wafer in each case. Solution: Die A: 1.5 cm on a side Dies per 30 cm wafer = π * ((30/2) 2 /(1.5) 2 30/(2*(1.5) 2 ) 0.5 ) = 270 Die Yield = 0.4 Number of good dies per wafer = 270 * 0.4 = 108

Yield Example (cont.) Problem: For the problem on previous page, compare the number of good dies for a 30 cm wafer in each case. Solution: Die A: 1.5 cm on a side Dies per 30 cm wafer = π * ((30/2) 2 /(1.5) 2 30/(2*(1.5) 2 ) 0.5 ) = 270 Die Yield = 0.4 Number of good dies per wafer = 270 * 0.4 = 108 Die B: 1 cm on a side Dies per 30 cm wafer = π * ((30/2) 2 /(1) 2 30/(2*(1) 2 ) 0.5 ) = 640 Die Yield = 0.66 Number of good dies per wafer = 640 * 0.66 = 422

Cost Summary Total cost is determined by many factors: raw material cost, wafer yield, defect density, die size, testing costs, packaging costs etc. Manufacturing process dictates Wafer cost Wafer yield Defect density Architect/Designer controls Die size Larger die less dies per wafer Higher cost Larger die Less die yield Higher cost Package Pins I/Os

Cost vs. Price For commodity products (DRAM, disks, embedded processors), price closely tracks cost (low profit margin) Lots of competitors Higher volumes manufacturing costs easier to amortize For more specialized products (HPC, servers), price is significantly higher than cost (high profit margins) Fewer competitors Lower volumes With the proliferation of low-end consumer devices (such as smartphones), profit margins will continue to shrink in near-future