The RM9150 and the Fast Device Bus High Speed Interconnect

Similar documents
The CoreConnect Bus Architecture

Introduction Electrical Considerations Data Transfer Synchronization Bus Arbitration VME Bus Local Buses PCI Bus PCI Bus Variants Serial Buses

ECE 485/585 Microprocessor System Design

nforce 680i and 680a

EE108B Lecture 17 I/O Buses and Interfacing to CPU. Christos Kozyrakis Stanford University

The Nios II Family of Configurable Soft-core Processors

Hardware Design. MicroBlaze 7.1. This material exempt per Department of Commerce license exception TSU Xilinx, Inc. All Rights Reserved

Buses. Disks PCI RDRAM RDRAM LAN. Some slides adapted from lecture by David Culler. Pentium 4 Processor. Memory Controller Hub.

SEMICON Solutions. Bus Structure. Created by: Duong Dang Date: 20 th Oct,2010

Quality-of-Service for a High-Radix Switch

Achieving UFS Host Throughput For System Performance

Challenges for Future Interconnection Networks Hot Interconnects Panel August 24, Dennis Abts Sr. Principal Engineer

ECE 551 System on Chip Design

Qsys and IP Core Integration

Product Technical Brief S3C2413 Rev 2.2, Apr. 2006

2. System Interconnect Fabric for Memory-Mapped Interfaces

SONICS, INC. Sonics SOC Integration Architecture. Drew Wingard. (Systems-ON-ICS)

Buses. Maurizio Palesi. Maurizio Palesi 1

KeyStone C665x Multicore SoC

Chapter 11: Input/Output Organisation. Lesson 15: Standard I/O bus PCI

Product Technical Brief S3C2412 Rev 2.2, Apr. 2006


Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

IO System. CP-226: Computer Architecture. Lecture 25 (24 April 2013) CADSL

Where We Are in This Course Right Now. ECE 152 Introduction to Computer Architecture Input/Output (I/O) Copyright 2012 Daniel J. Sorin Duke University

CHAPTER 6 FPGA IMPLEMENTATION OF ARBITERS ALGORITHM FOR NETWORK-ON-CHIP

System Buses Ch 3. Computer Function Interconnection Structures Bus Interconnection PCI Bus. 9/4/2002 Copyright Teemu Kerola 2002

INT 1011 TCP Offload Engine (Full Offload)

Multi-core microcontroller design with Cortex-M processors and CoreSight SoC

Chapter 8. A Typical collection of I/O devices. Interrupts. Processor. Cache. Memory I/O bus. I/O controller I/O I/O. Main memory.

D Demonstration of disturbance recording functions for PQ monitoring

ECE 485/585 Microprocessor System Design

FCQ2 - P2020 QorIQ implementation

Migrating RC3233x Software to the RC32434/5 Device

INPUT/OUTPUT DEVICES Dr. Bill Yi Santa Clara University

A Closer Look at the Epiphany IV 28nm 64 core Coprocessor. Andreas Olofsson PEGPUM 2013

Applying the Benefits of Network on a Chip Architecture to FPGA System Design

The von Neuman architecture characteristics are: Data and Instruction in same memory, memory contents addressable by location, execution in sequence.

Chapter 6 Storage and Other I/O Topics

Novel Intelligent I/O Architecture Eliminating the Bus Bottleneck

Simplify System Complexity

Lecture 18: Communication Models and Architectures: Interconnection Networks

QUIZ Ch.6. The EAT for a two-level memory is given by:

Lecture 23. Finish-up buses Storage

AMBA Protocol for ALU

Copyright 2016 Xilinx

Storage Systems. Storage Systems

FPGA memory performance

Zatara Series ARM ASSP High-Performance 32-bit Solution for Secure Transactions

PHY-Less Ethernet Implementation Using Freescale Power Architecture Based Microprocessors

C900 PowerPC G4+ Rugged 3U CompactPCI SBC

DESIGN OF EFFICIENT ROUTING ALGORITHM FOR CONGESTION CONTROL IN NOC

Replacement Policy: Which block to replace from the set?

VLSI Design of Multichannel AMBA AHB

A Single Chip Shared Memory Switch with Twelve 10Gb Ethernet Ports

COMMUNICATION AND I/O ARCHITECTURES FOR HIGHLY INTEGRATED MPSoC PLATFORMS OUTLINE

Embedded Systems: Hardware Components (part II) Todor Stefanov

Chapter 8: Input and Output. Principles of Computer Architecture. Principles of Computer Architecture by M. Murdocca and V.

The S6000 Family of Processors

(Advanced) Computer Organization & Architechture. Prof. Dr. Hasan Hüseyin BALIK (3 rd Week)

AT-501 Cortex-A5 System On Module Product Brief

Ten Reasons to Optimize a Processor

C6100 Ruggedized PowerPC VME SBC

PCI and PCI Express Bus Architecture

Chapter 6. I/O issues

Overcoming the Memory System Challenge in Dataflow Processing. Darren Jones, Wave Computing Drew Wingard, Sonics

Lecture 13. Storage, Network and Other Peripherals

Zilog Real-Time Kernel

Asynchronous on-chip Communication: Explorations on the Intel PXA27x Peripheral Bus

JBus Architecture Overview

Lectures More I/O

William Stallings Computer Organization and Architecture 10 th Edition Pearson Education, Inc., Hoboken, NJ. All rights reserved.

Input/Output Problems. External Devices. Input/Output Module. I/O Steps. I/O Module Function Computer Architecture

Simplify System Complexity

KeyStone C66x Multicore SoC Overview. Dec, 2011

SoC Platforms and CPU Cores

Chapter 3. Top Level View of Computer Function and Interconnection. Yonsei University

FlexRay The Hardware View

Product Technical Brief S3C2440X Series Rev 2.0, Oct. 2003

Implementing Flexible Interconnect Topologies for Machine Learning Acceleration

A Reconfigurable Crossbar Switch with Adaptive Bandwidth Control for Networks-on

NVIDIA nforce IGP TwinBank Memory Architecture

Typical System Implementation

Optimizing the PCI Interface Configuration for the RC32336

Lecture 25: Busses. A Typical Computer Organization

SoC FPGAs. Your User-Customizable System on Chip Altera Corporation Public

Lecture 23: Storage Systems. Topics: disk access, bus design, evaluation metrics, RAID (Sections )

This page intentionally left blank

Administrivia. CMSC 411 Computer Systems Architecture Lecture 19 Storage Systems, cont. Disks (cont.) Disks - review

Nitro240/260 CPU Board Scalable 680x0 VME board for I/O intensive applications

Synchronous Bus. Bus Topics

Bus System. Bus Lines. Bus Systems. Chapter 8. Common connection between the CPU, the memory, and the peripheral devices.

Course Introduction. Purpose: Objectives: Content: Learning Time:

Unit 1. Chapter 3 Top Level View of Computer Function and Interconnection

An Ultra High Performance Scalable DSP Family for Multimedia. Hot Chips 17 August 2005 Stanford, CA Erik Machnicki

Sparrowhawk FX FPGA Video Processing Board

Computer Organization and Microprocessors SYLLABUS CHAPTER - 1 : BASIC STRUCTURE OF COMPUTERS CHAPTER - 3 : THE MEMORY SYSTEM

ARM Processors for Embedded Applications

INT G bit TCP Offload Engine SOC

SoC Design Lecture 11: SoC Bus Architectures. Shaahin Hessabi Department of Computer Engineering Sharif University of Technology

Transcription:

The RM9150 and the Fast Device High Speed Interconnect John R. Kinsel Principal Engineer www.pmc -sierra.com 1 August 2004 Agenda CPU-based SOC Design Challenges Fast Device (FDB) Overview Generic Device FDB Block Diagram Architectural Advantages Summary 2 August 2004

Typical CPU-based SOC Building Blocks CPU Core Needs low latency Parallelism Interconnect DDR Memory Local Hyper Needs high throughput System bus architecture is becoming increasingly important CPU frequency increase must be matched by bus and memory speeds 3 August 2004 SoC Design Issues 1 GHz CPU Core 66MHz Multiple clock domains needed Interconnect DDR Memory 200MHz Local Hyper 50 MHz 600 MHz 1 Gbps x2 1 Gbps x2 (Tx / Rx) (Tx / Rx) Custom bus interface to each peripheral - Little design re-use Performance scalability issues 4 August 2004

Typical Bandwidth of Various Blocks CPU Core Bandwidth 1.6 GBytes/s Interconnect 0.25 GB/s DDR Memory 2.6 GB/s Local Hyper 2.4 GB/s 0.25 GB/s 0.25 GB/s Bandwidth 1.6 GBytes/s < 0.25 GB/s () 2.6 GB/s (DDR) 2.4 GB/s (HT) 0.25 GB/s (GE) 0.25 GB/s (GE) 5.75 GB/s peripheral bandwidth Typical standard buses can t keep up with memory controller and other high-bandwidth peripherals 5 August 2004 Fast Device / Generic Device port port port port port Request Request Core (128-bits) port port port port Physical Address Map Common design interface for all peripherals interface specification available to PMC customers 6 August 2004

Fast Device Features port port port port port Request Request Core (128-bits) port port port port Physical Address Map arbitration logic and physical address map Modular design, ports can be easily added and subtracted (2-12) 7 August 2004 Fast Device Features (cont.) port port port port port Request Request Core (128-bits) port port port port Physical Address Map 128-bits, multiplexed address and data Separate grant signal, addr./data grant per port 400 MHz 500 MHz multiple master, shared on-chip bus 3-bit credit release bus, one per port connected from each port to the central arbiter 8 August 2004

Operation / Advantages Employs a two-stage arbiter - Request : Round robin priority - Second stage: Choose next owner of A/D bus based on request priority / target transaction resource availability - Both arbiters are pipelined, selecting a new owner every cycle The is for efficient scheduling of main bus usage - Main bus is never granted unless the target can accept the transaction, therefore no not acknowledges and no retries The guarantees fair access to high-contention targets port port port port port Request Request Core (128-bits) port port port port Physical Address Map 9 August 2004 First Device Based on the FDB: RM9150 DDRI/2 Local Hyper 128-bit 64-bit 64-bit 64-bit DMA Fast Device (FDB) 64-bit 64-bit 64-bit 64-bit E9000 On-chip INT XDMA Ctrler CPU Mem. Packet FIFO GE Port 0 GE Port 1 GE Subsystem Proc. Int. DUART SCMB SCMB SCMB 2-wire MDIO/ MDC FDB is implemented using a single core clock domain 64-bit Each port contains a fixed number of buffers to accept a fixed number of transactions Number of buffers optimized to meet each device s specific performance requirements Built-in re- synchronization logic to support different frequency modes - Asynchronous - Semi-synchronous - Full synchronous 128-bit to FDB interface for DDR1/2 memory controller for optimum bandwidth 10 August 2004

FDB/ Methodology Greatly Reduces Verification Effort Only need to design new Intellectual Property (IP) blocks with interface (or modify existing IP to support ) Once this IP with interface has been verified, then this ensures that the IP block can communicate with the FDB and all devices connected through to the FDB Without / FDB, would need to design a custom interface to each IP block and verify that it can communicate with the system interconnect (for each peripheral device in the SOC) Solves an order (n 2 ) verification problem by reducing it to an order (n) effort 11 August 2004 RM9150 Feature Summary First PMC-Sierra SOC to implement the FDB/ E9000 processor (<500 MHz to 1GHz capability) - 64-bit, dual-issue, superscalar, 7-stage pipeline - 16KB L1 caches / 256KB L2 cache (parity, ECC) FDB Trace and Performance Monitor 8-bit 600 MHz DDR Hyper v1.03 Two 32-bit buses, 66 MHz, v2.3 Dual 10/100/1000 MII/GMII/TBI 2BI/MDIO EJTAG Interrupts Clocks RM9150 Local DUART GPIO 167MHz/ 200 MHz 64-bit DDRI/II SDRAM - Up to 4GB of memory DDRI/II 72- bit SDRAM Dual 32- bit 8-bit Hyper Two 10/100/1000 s (802.3), TBI, GMII, MII 4-channel DMA controller 896 BGA package 0.13um TSMC process Local controller to external ROM, Flash, Compact Flash, PCMCIA, USB 2.0 devices Serial Communications Master Block - DUART, 2-wire interfaces, MDIO/MDC, 64 GPIO 12 August 2004

PMC s Solution: New Design Methodology - Performance flexibility & quick-turn SOC designs High End RM9000x2GL Non-Blocking Crossbar Switch Mid Range RM9150 Fast Device Low End Future Lite Fast Device 13 August 2004 Fast Device / Generic Device Architectural Advantages allows each peripheral to interface to FDB fully synchronously, semi-synchronously, or totally asynchronously, essentially eliminating global clocking issues Standard interface allows development of re-usable IP blocks, ability to use different future bus interconnects allows replacement of the interconnect behind it without redesigning the peripheral devices -> scalability Allows customers to integrate their IP or leverage PMC s IP to quickly design complex SOC s at various price/performance points Greatly reduces device verification effort (factor of n) for fast time-tomarket 14 August 2004

Summary Innovative Fast Device / bus architecture - Tuned for optimal performance (latency, bandwidth to critical peripherals) - High utilization due to efficient arbitration - Enables easy design re-use (common interface) E9000 CPU and robust set of RM9150 peripherals address multiple market segments - Networking, Storage, Industrial Control, and High-end Printer and Consumer First instance of PMC s platform-based high-performance, highly integrated SOCs using the FDB 15 August 2004 Thinking You Can Build On www.pmc-sierra.com 16 August 2004