PUSHING THE LIMITS, A PERSPECTIVE ON ROUTER ARCHITECTURE CHALLENGES

Similar documents
100 GBE AND BEYOND. Diagram courtesy of the CFP MSA Brocade Communications Systems, Inc. v /11/21

Disruptive Innovation in ethernet switching

Network Processors. Nevin Heintze Agere Systems

The iflow Address Processor Forwarding Table Lookups using Fast, Wide Embedded DRAM

High-Performance Network Data-Packet Classification Using Embedded Content-Addressable Memory

Multi-gigabit Switching and Routing

100 GBE - WHAT'S NEW AND WHAT'S NEXT

Cisco Nexus 9508 Switch Power and Performance

High-Speed Network Processors. EZchip Presentation - 1

Cisco ASR 1000 Series Aggregation Services Routers: QoS Architecture and Solutions

A 400Gbps Multi-Core Network Processor

IOS XR Platform Hardware Architectures

Last Lecture: Network Layer

Professor Yashar Ganjali Department of Computer Science University of Toronto.

Network Superhighway CSCD 330. Network Programming Winter Lecture 13 Network Layer. Reading: Chapter 4

ECE 697J Advanced Topics in Computer Networks

IOS XR Platform Hardware Architectures

TOC: Switching & Forwarding

Chapter 1. Introduction

BROCADE CLOUD-OPTIMIZED NETWORKING: THE BLUEPRINT FOR THE SOFTWARE-DEFINED NETWORK

Switch and Router Design. Packet Processing Examples. Packet Processing Examples. Packet Processing Rate 12/14/2011

Forwarding Architecture

Scaling routers: Where do we go from here?

Deploying Data Center Switching Solutions

Cubro Packetmaster EX48600

Topics for Today. Network Layer. Readings. Introduction Addressing Address Resolution. Sections 5.1,

A Single Chip Shared Memory Switch with Twelve 10Gb Ethernet Ports

Big Buffer of Switch

CSE 123A Computer Networks

Production OpenFlow Switches Now Available -Building CORD Using OpenFlow Switches CORD Build

Cubro Packetmaster EX32100

Cisco ASR 1000 Series Routers Embedded Services Processors

The Evolution Path from Frames to Services

Routing in Geoff Huston APNIC

Lecture 11: Packet forwarding

TOC: Switching & Forwarding

CSCD 330 Network Programming

Accelerating the Mass Transition of 100 GbE in the Data Center. Siddharth Sheth, Vice President of Marketing, High-Speed Connectivity

FM4000. A Scalable, Low-latency 10 GigE Switch for High-performance Data Centers

Routers Technologies & Evolution for High-Speed Networks

Centec V350 Product Introduction. Centec Networks (Suzhou) Co. Ltd R

Managing and Securing Computer Networks. Guy Leduc. Chapter 2: Software-Defined Networks (SDN) Chapter 2. Chapter goals:

Enabling Technology for the Cloud and AI One Size Fits All?

CN-100 Network Analyzer Product Overview

P51: High Performance Networking

Arista 7020R Series: Q&A

Router Architectures

SwitchX Virtual Protocol Interconnect (VPI) Switch Architecture

Introduction. Network Architecture Requirements of Data Centers in the Cloud Computing Era

Cisco Nexus 9300-EX Platform Switches Architecture

Routers: Forwarding EECS 122: Lecture 13

Cisco Nexus 9500 Series Switches Architecture

fleximac A MICROSEQUENCER FOR FLEXIBLE PROTOCOL PROCESSING

The Memory Hierarchy 1

Objectives for HSSG. Peter Tomaszewski Scientist, Signal Integrity & Packaging. rev3 IEEE Knoxville Interim Meeting, September 2006

Higher Speed Ethernet

Advancing high performance heterogeneous integration through die stacking

High Performance Ethernet for Grid & Cluster Applications. Adam Filby Systems Engineer, EMEA

100G and Beyond: high-density Ethernet interconnects

Cisco 7600 Series Route Switch Processor 720

SPARTA: Scalable Per-Address RouTing Architecture

EECS 122: Introduction to Computer Networks Switch and Router Architectures. Today s Lecture

Shim6: Network Operator Concerns. Jason Schiller Senior Internet Network Engineer IP Core Infrastructure Engineering UUNET / MCI

15-744: Computer Networking. Routers

DPX19000 Next Generation Cloud-Ready Service Core Platform

Scaling and Energy Efficiency in Next Generation Core Networks and Switches

The Network Processor Revolution

Nokia 7950 Extensible Routing System

Five Emerging DRAM Interfaces You Should Know for Your Next Design

Arista 7170 series: Q&A

Routers: Forwarding EECS 122: Lecture 13

MOSAID Semiconductor

Network Processors and their memory

Building Core Networks and Routers in the 2002 Economy

Decision Forest: A Scalable Architecture for Flexible Flow Matching on FPGA

Switch shall have 4 SFP 1000 Mb/s ports (2 Port dual-personality ports; 10/100/1000BASE-T or SFP and 2 Fixed 1G SFP port)

Hybrid Memory Cube (HMC)

Arista 7010 Series: Q&A

Routing Geoff Huston APNIC

Introduction. Executive Summary. Test Highlights

Cubro Sessionmaster EXA32100

Line Cards and Physical Layer Interface Modules Overview, page 1

Introduction to Routers and LAN Switches

Broadcom Trident 3 Programmable, Varied And Volume

White Paper. OCP Enabled Switching. SDN Solutions Guide

Configuring QoS CHAPTER

Industry Collaboration and Innovation

Barcelona: a Fibre Channel Switch SoC for Enterprise SANs Nital P. Patwa Hardware Engineering Manager/Technical Leader

ENTERPRISE MPLS. Kireeti Kompella

1 Copyright 2013 Oracle and/or its affiliates. All rights reserved.

Themes. The Network 1. Energy in the DC: ~15% network? Energy by Technology

Configuring OpenFlow 1

Configuring NetFlow Statistics Collection

Arista EOS Central Drop Counters

Lecture 5: Router Architecture. CS 598: Advanced Internetworking Matthew Caesar February 8, 2011

Scalable Name-Based Packet Forwarding: From Millions to Billions. Tian Song, Beijing Institute of Technology

PARALLEL ALGORITHMS FOR IP SWITCHERS/ROUTERS

Hardware Evolution in Data Centers

TAP Aggregation-Network Visibility and Security

PacketShader: A GPU-Accelerated Software Router

Transcription:

PUSHING THE LIMITS, A PERSPECTIVE ON ROUTER ARCHITECTURE CHALLENGES Greg Hankins <ghankins@brocade.com> APRICOT 2012 2012 Brocade Communications Systems, Inc. 2012/02/28

Lookup Capacity and Forwarding Complexity Lookup and Forwarding Hardware Design Challenges Solutions are Good, Fast, or Cheap Pick any Two System Design Component Technology $ Lookup/ Forwarding Port Density Complex multiprotocol forwarding Growing IPv4/IPv6 Internet and VPN routing tables IPv4 deaggregation and IPv6 adoption make future growth hard to predict High-density 100 GbE Line-rate 100 GbE is one packet every 6.72 ns Port Density Diagrams courtesy of the CFP MSA and bgp.potaroo.net. 2

Media Basic Router Forwarding Architecture Packet Lookup Memory 100 GbE: n x 150 Mpps Lookups Packet Buffer Memory 100 GbE: n x 200 Gbps RX+TX Line Card TCAM RAM DRAM Switching Fabric Fabric Card PHY Network Processor Traffic Manager Chip to Chip Interfaces Multiple 3.125 to 28 Gbps Lanes CPU DRAM Fabric Card Management Interconnect CPU System RAM DRAM Management Card 3

Agenda Introduction Memory Technology ASIC Technology Summary 4

Key Memory Challenges Packet rates gave greatly exceeded memory random read rates Lookup: 1 ns random read rates needed yesterday Buffer: 1 ns random read and writes rates needed yesterday Dynamic memory technology characteristics impose significant constraints on lookup and buffering architectures Inherent restrictions and non-random access Bank blocking due to previous read/write events Applies to both on-chip and off-chip solutions Adds forwarding latency ~2 ns On-Chip Embedded Memory ~20 ns Commodity Off-Chip RLDRAM ~48 ns Commodity Off-Chip DRAM 0 10 20 30 40 50 Need 1 ns 6.72 ns Line-Rate 100 GbE Memory Random Read Rates (ns) 5

Lookup and Buffering Memory Requirements Fast! Have to store everything needed for packet lookups in hardware to forward at line rate with all features enabled 10 GbE: 15 Mpps or one packet every 67 ns 100 GbE: 150 Mpps or one packet every 6.72 ns Multiple ports on a network processor Multiple lookups are needed per packet I m Fast and Big, But I Don t Exist Big! Packet lookup tables hold MAC address table (L2, VPN) IPv4 FIB (unicast and multicast, VPN) IPv6 FIB (unicast and multicast, VPN) VLAN tags, MPLS labels ACLs (L2, IPv4, IPv6, ingress and egress) QoS policies (PHB, rewrite, rate limiting/shaping) Deep buffering, queuing and shaping Multiple 100 Gbps of sustained throughput into buffer memory 1 GB buffer is only 80 ms at 100 GbE rates 1000s of queues per port 6

Lookup and Buffering Memory Technology Overview TCAM SRAM DRAM DDR RLDRAM GDDR What is it? Ternary Content Addressable Memory Static RAM Double Data Rate Dynamic RAM Reduced- Latency Dynamic RAM Graphics Double Data Rate Dynamic RAM Primary Function Wildcard Search High-Speed Storage High-Capacity Storage High-Speed Storage High- Bandwidth Storage Industry Usage Specialty Commodity Commodity Specialty Specialty Access Speed Lower Lowest Highest Higher Higher Density Lower Lower Much Higher Higher Much Higher Cost per Bit Much Higher High Much Lower Lower Lower Power Consumption Mass Production Capacities Highest Lower Higher Higher Higher 40 Mbit Today, 80 Mbit Soon 72 Mbit Today, 144 Mbit Soon 2 Gbit DDR3 Today, DDR4 Soon 576 Mbit Today, 1 Gbit Soon 2 Gbit GDDR5 Today 7

Longest Prefix Matching (LPM) Mechanisms TCAM RAM What is it? IP Destination Address? Index to Next Hop Table TCAM IP Destination Address? Index to Next Hop Table ASIC *RAM *RAM RAM Wildcard Hardware Data Search Ordered Tree Data Structure Search in Hardware Cost Much Higher Lower Power Consumption Higher Lower Search Latency Fixed Variable Throughput Higher Lower, Must Parallelize Add/Delete Time Fixed Variable Prefix Capacity Fixed, Lower Variable, Higher Search Algorithm Software Complexity Ordering of Data in TCAM Versatile Data Storage No Need to Worry About How You Write the Data Lots of Different (Patented) Algorithms with Tradeoffs Need to Implement Lookup and Data Structure Mechanisms Separately Must Optimize Layout for Data Structure 8

LPM Memory Architecture Solutions Divide and conquer parallel architecture Deterministic search using a combination of SRAM/DRAM Large number of banks allows parallel searching in reasonable time Integrate lookup memory into packet processing ASICs Combine embedded memories for higher performance and reasonable density Component technology must be available for 5+ years at a minimum Many specialized memory technologies have a window of production that is too small 9

Buffering Memory Solutions DRAM read and write times are the limiting factor in guaranteeing buffering performance Commodity off-chip DDR4 DRAM will give us higher memory throughput Embedded on-chip memory limits buffer sizes but offers higher performance Proprietary buffer memory management techniques Economically and technically feasible to design a custom buffer memory chip, instead of using commodity DRAM 10

Agenda Introduction Memory Technology ASIC Technology 11

Key ASIC Challenges Multi-protocol packet forwarding requires complex packet parsing and deep header inspection L2, IPv4/IPv6, unicast/multicast, broadcast, 802.1Q, 802.1ah, TRILL, 802.1aq, MPLS LAG load balancing requires complex hashing functions and looking deep into multiple packet headers sflow, OAM (hardware timestamps) Tons of counters (virtual interfaces, aggregate VLAN) It s all about integration the more functionality you can put on one chip, the more scalable you can make the system Available process geometry, packaging and chip interconnects limit what we can do today Using more chips in parallel means higher cost, higher power consumption, higher heat dissipation and a lower MTBF Current generation ASICs use 45 nm and 32 nm technology Industry is moving to 22 nm and smaller as soon as it s technically and commercially feasible Current SERDES and chip interconnects are limited to ~6.25 Gbps, 10 Gbps between chip and optics, 28 Gbps available soon Our 2-port 100 GbE card uses around 100 ICs, need to reduce the number of components! 12

Semiconductor Processes Geometries Moore's Law: The Number of Transistors on a Chip Will Double Approximately Every Two Years Process Geometry 180 nm 130 nm 90 nm 65 nm 45 nm 32 nm 22 nm 16 nm 11 nm Availability 1999 2000 2002 2006 2008 2010 2011 ~2013 ~2015 Transistors (Billions) 0.06 0.18 0.52 0.73 1.77 4.8 Gates (Millions) 2 6 15 21 30 50 Frequency (MHz) 106 212 360 400 450 750 Memory (MB) 6 49 164 72 315 900 In Use Today Process geometry refers to the smallest dimension that can be drawn into the silicon to define a transistor Transistors are the building blocks used to make gates Maximum capacities are often proprietary to a vendor, examples are from actual Brocade ASICs 13

ASIC Technology Evolution 2D Single Chip Package 2D Multi-chip Package 3D Silicon Interposer 3D Package What is it? Single Large Chip Package Multiple Chips or Components in the Same Package Multiple Chips in the Same Package With a Silicon Interconnect Two Chips in a Stack in the Same Package Size Larger Larger Smaller Smaller Interconnect Longest, External Off-chip Shorter, Copper/PCB Shorter, Silicon Shortest, Vertical Power Highest Lower Lower Lowest Heat Density Lowest Higher Higher Highest Signal Density Lowest Higher Higher Highest IBM 3D Package Images courtesy of IBM. 14

Chip Interconnect Signaling Evolutionary step in SERDES technology standardized by the OIF 25/28 Gbps electrical signaling will make newer chip to chip interfaces and pluggable media modules possible 1. Backplane: CEI-25G-LR 30 2. Chip to chip: CEI-28G-SR 12 3. Chip to module: CEI-28G-VSR 4 (used by 2nd generation 100 GbE media modules) Line Card Line Card Line Card 1 2 Backplane Chip Chip Chip Daughter Card Chip Chip 2 Chip 3 Media Module 15

ASIC Technology Integration Current process technology is optimized for either logic or memory requirements ASIC technology is logic driven (network processors) Memory technology is memory driven (RAM) 3D multi-chip packaging allows us to do both in the same package with optimized chips for each function When a new process geometry is developed, component vendors focus on transistor density first The rest of the technologies (SERDES, on-die memories, etc.) in the logic library are developed afterwards Typically used for general-purpose CPU production first Even though a process geometry may be commercially available, we have to be able to integrate it 16

ASIC Technology Integration Board layout architecture must be carefully planned together with memory and ASIC components Board real estate Power consumption Heat dissipation Signal integrity and routing between components Advanced 2D and new 3D ASIC technology will simplify the board layout challenges Expected to be mainstream within the next few years Combine multiple ASICs or memories within the same package Shorter interconnect and lower latency between components (especially memory) Lower board power consumption and heat dissipation Uses less board real estate 8-Port 10 GbE Oversubscribed L2/L3 Switch Card The Integration Challenge Main Board and Daughter Board Component Density vs. Real Estate 8-Port 10 GbE Line-Rate IP/MPLS Router Card 17

Agenda Introduction Memory Technology ASIC Technology Summary 18

Alternatives to FIB Scaling Challenge is using available technology vs router density and complexity to continue to scale Moore s Law still holds true Software FIB aggregation or suppression mechanisms On the Aggregatability of Router Forwarding Tables http://www.cs.arizona.edu/~bzhang/paper/10-infocom-aggregate.pdf Simple Virtual Aggregation (S-VA) https://datatracker.ietf.org/doc/draft-ietf-grow-simple-va/ OpenFlow/Software Defined Networking (SDN) https://www.opennetworking.org/ Locator/ID Separation Protocol (LISP) https://datatracker.ietf.org/wg/lisp/charter/ 19

Summary Want to design hardware with a minimum of 7+ years of lifetime that supports Multiprotocol forwarding High-density 100 GbE with CFP2 optics Scalable and flexible FIB (global routing table + VPN) Requirements dictate solution complexity and cost Technology capabilities Functionality and port density Design tradeoffs have to be chosen carefully Higher capacity and faster components are coming that will allow us to continue to scale router density, lookup capacity and memory requirements Multi-100 Gbps packet processing ASICs Custom SRAM/DRAM lookup memory Custom DRAM packet buffering memory 3D technology with ASIC and memory chips in the same package 20

Questions?