EE384Y: Packet Switch Architectures Part II Scaling Crossbar Switches

Similar documents
Parallelism in Network Systems

Packet Switch Architectures Part 2

The Arbitration Problem

Routers with a Single Stage of Buffering *

Themes. The Network 1. Energy in the DC: ~15% network? Energy by Technology

Routers with a Single Stage of Buffering *

Crossbar - example. Crossbar. Crossbar. Combination: Time-space switching. Simple space-division switch Crosspoints can be turned on or off

Routers with a Single Stage of Buffering * Sigcomm Paper Number: 342, Total Pages: 14

CSE 123A Computer Networks

Outline. The demand The San Jose NAP. What s the Problem? Most things. Time. Part I AN OVERVIEW OF HARDWARE ISSUES FOR IP AND ATM.

Switching. An Engineering Approach to Computer Networking

Internet Routers Past, Present and Future

Network Processors and their memory

Multi-gigabit Switching and Routing

Scaling routers: Where do we go from here?

EE/CSCI 451: Parallel and Distributed Computation

ECE 697J Advanced Topics in Computer Networks

INF5050 Protocols and Routing in Internet (Friday ) Subject: IP-router architecture. Presented by Tor Skeie

Switching CHAPTER 8. Solutions to Review Questions and Exercises. Review Questions

Generic Architecture. EECS 122: Introduction to Computer Networks Switch and Router Architectures. Shared Memory (1 st Generation) Today s Lecture

EECS 122: Introduction to Computer Networks Switch and Router Architectures. Today s Lecture

Sample Routers and Switches. High Capacity Router Cisco CRS-1 up to 46 Tb/s thruput. Routers in a Network. Router Design

CMPSCI611: The SUBSET-SUM Problem Lecture 18

Routers Technologies & Evolution for High-Speed Networks

Lecture 16: Router Design

Professor Yashar Ganjali Department of Computer Science University of Toronto.

Ting Wu, Chi-Ying Tsui, Mounir Hamdi Hong Kong University of Science & Technology Hong Kong SAR, China

Lecture 17: Router Design

Chapter 8 Switching 8.1

Lecture 2: Topology - I

Switch Fabrics. Switching Technology S P. Raatikainen Switching Technology / 2006.

Adaptive Routing Strategies for Modern High Performance Networks

Network layer (addendum) Slides adapted from material by Nick McKeown and Kevin Lai

Lecture 3: Topology - II

Routers: Forwarding EECS 122: Lecture 13

Discrete Wiskunde II. Lecture 6: Planar Graphs

Routers: Forwarding EECS 122: Lecture 13

Switch Fabric. Switch Fabric Overview

Topics for Today. Network Layer. Readings. Introduction Addressing Address Resolution. Sections 5.1,

Definition 1.1. A matching M in a graph G is called maximal if there is no matching M in G so that M M.

A Four-Terabit Single-Stage Packet Switch with Large. Round-Trip Time Support. F. Abel, C. Minkenberg, R. Luijten, M. Gusat, and I.

Lecture 12: Interconnection Networks. Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E)

Literature Survey of nonblocking network topologies

Building a Fast, Virtualized Data Plane with Programmable Hardware. Bilal Anwer Nick Feamster

Joint Transmission in Cellular Networks: Scheduling and Stability

Network Performance: Queuing

TOC: Switching & Forwarding

Outline. Issues with the Memory System Loop Transformations Data Transformations Prefetching Alias Analysis

Future of Interconnect Fabric A Contrarian View. Shekhar Borkar June 13, 2010 Intel Corp. 1

TOC: Switching & Forwarding

CS-534 Packet Switch Architecture

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security

Hybrid On-chip Data Networks. Gilbert Hendry Keren Bergman. Lightwave Research Lab. Columbia University

The GLIMPS Terabit Packet Switching Engine

Planar graphs. Chapter 8

CSE 3214: Computer Network Protocols and Applications Network Layer

A Fast Switched Backplane for a Gigabit Switched Router

Interconnection Networks: Topology. Prof. Natalie Enright Jerger

Advanced and parallel architectures. Part B. Prof. A. Massini. June 13, Exercise 1a (3 points) Exercise 1b (3 points) Exercise 2 (8 points)

Network Performance: Queuing

Challenges for Future Interconnection Networks Hot Interconnects Panel August 24, Dennis Abts Sr. Principal Engineer

Switching CHAPTER 8. Figure 8.1

Scaling Internet Routers Using Optics Producing a 100TB/s Router. Ashley Green and Brad Rosen February 16, 2004

Sizing Router Buffers

Solutions to the Second Midterm Exam, Math 170, Section 002 Spring 2012

The IP Data Plane: Packets and Routers

CSC 4900 Computer Networks: Network Layer

Lecture 17: Router Design

15-744: Computer Networking. Routers

Meet in the Middle: Leveraging Optical Interconnection Opportunities in Chip Multi Processors

Module 17: "Interconnection Networks" Lecture 37: "Introduction to Routers" Interconnection Networks. Fundamentals. Latency and bandwidth

CS244a: An Introduction to Computer Networks

Graphs. Pseudograph: multiple edges and loops allowed

From control of networks to networked control. Wing Shing Wong Department of Information Engineering The Chinese University of Hong Kong

Chapter 1. Introduction

Homework Assignment #1: Topology Kelly Shaw

Link-State Routing OSPF

Interconnection Network Project EE482 Advanced Computer Organization May 28, 1999

CS-534 Packet Switch Architecture

Greedy algorithms is another useful way for solving optimization problems.

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G))

CSC 401 Data and Computer Communications Networks

HW Graph Theory Name (andrewid) - X. 1: Draw K 7 on a torus with no edge crossings.

Space-division switch fabrics. Copyright 2003, Tim Moors

CS 552 Computer Networks

BROADBAND AND HIGH SPEED NETWORKS

Memory Hierarchies. Instructor: Dmitri A. Gusev. Fall Lecture 10, October 8, CS 502: Computers and Communications Technology

Dartmouth Computer Science Technical Report TR Chain Match: An Algorithm for Finding a Perfect Matching of a Regular Bipartite Multigraph

Switch Fabrics. Switching Technology S Recursive factoring of a strict-sense non-blocking network

DATA CENTER FABRIC COOKBOOK

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Sorting lower bound and Linear-time sorting Date: 9/19/17

List of Theorems. Mat 416, Introduction to Graph Theory. Theorem 1 The numbers R(p, q) exist and for p, q 2,

An O(log 2 N) parallel algorithm for output queuing

Buffer Sizing in a Combined Input Output Queued (CIOQ) Switch

Using Euler s Theorem

Shared-Memory Combined Input-Crosspoint Buffered Packet Switch for Differentiated Services

Introduction. Router Architectures. Introduction. Introduction. Recent advances in routing architecture including

EE/CSCI 451: Parallel and Distributed Computation

Topic 4a Router Operation and Scheduling. Ch4: Network Layer: The Data Plane. Computer Networking: A Top Down Approach

Lessons learned from MPI

Transcription:

High Performance Switching and Routing Telecom Center Workshop: Sept 4, 997. EE384Y: Packet Switch Architectures Part II Scaling Crossbar Switches Nick McKeown Professor of Electrical Engineering and Computer Science, Stanford University nickm@stanford.edu http://www.stanford.edu/~nickm

Outline Up until now, we have focused on high performance packet switches with:. A crossbar switching fabric, 2. Input queues (and possibly output queues as well), 3. Virtual output queues, and 4. Centralized arbitration/scheduling algorithm. Today we ll talk about the implementation of the crossbar switch fabric itself. How are they built, how do they scale, and what limits their capacity? 2

Crossbar switch Limiting factors. N 2 crosspoints per chip, or N x N-to- multiplexors 2. It s not obvious how to build a crossbar from multiple chips, 3. Capacity of I/O s per chip. State of the art: About 300 pins each operating at 3.25Gb/s ~= Tb/s per chip. About /3 to /2 of this capacity available in practice because of overhead and speedup. Crossbar chips today are limited by I/O capacity. 3

Scaling number of outputs: Trying to build a crossbar from multiple chips Building Block: 6x6 crossbar switch: 4 inputs 4 outputs Eight inputs and eight outputs required! 4

Scaling line-rate: Bit-sliced parallelism Linecard Cell Cell Cell k 8 7 6 5 4 3 2 Cell is striped across multiple identical planes. Crossbar switched bus. Scheduler makes same decision for all slices. Scheduler 5

Scaling line-rate: Time-sliced parallelism Linecard Cell Cell Cell Cell Cell Cell k 8 7 6 5 4 3 2 Cell carried by one plane; takes k cell times. Scheduler is unchanged. Scheduler makes decision for each slice in turn. Scheduler 6

Scaling a crossbar Conclusion: scaling the capacity is relatively straightforward (although the chip count and power may become a problem). What if we want to increase the number of ports? Can we build a crossbar-equivalent from multiple stages of smaller crossbars? If so, what properties should it have? 7

3-stage Clos Network m x m n x k k x n n 2 n 2 2 N m m N k N = n x m k >= n 8

With k = n, is a Clos network nonblocking like a crossbar? Consider the example: scheduler chooses to match (,), (2,4), (3,3), (4,2) 9

With k = n is a Clos network nonblocking like a crossbar? Consider the example: scheduler chooses to match (,), (2,2), (4,4), (5,3), By rearranging matches, the connections could be added. Q: Is this Clos network rearrangeably non-blocking? 0

With k = n a Clos network is rearrangeably non-blocking Routing matches is equivalent to edge-coloring in a bipartite multigraph. Colors correspond to middle-stage switches. (,), (2,4), (3,3), (4,2) Each vertex corresponds to an n x k or k x n switch. No two edges at a vertex may be colored the same. Vizing 64: a D-degree bipartite graph can be colored in D colors. Therefore, if k = n, a 3-stage Clos network is rearrangeably non-blocking (and can therefore perform any permutation).

How complex is the rearrangement? Method : Find a maximum size bipartite matching for each of D colors in turn, O(DN 2.5 ). Method 2: Partition graph into Euler sets, O(N.logD) [Cole et al. 00] 2

Edge-Coloring using Euler sets Make the graph regular: Modify the graph so that every vertex has the same degree, D. [combine vertices and add edges; O(E)]. For D=2 i, perform i Euler splits and -color each resulting graph. This is logd operations, each of O(E). 3

Euler partition of a graph Euler partiton of graph G:. Each odd degree vertex is at the end of one open path. 2. Each even degree vertex is at the end of no open path. 4

G Euler split of a graph G Euler split of G into G and G 2 :. Scan each path in an Euler partition. 2. Place each alternate edge into G and G 2 G 2 5

Edge-Coloring using Euler sets Make the graph regular: Modify the graph so that every vertex has the same degree, D. [combine vertices and add edges; O(E)]. For D=2 i, perform i Euler splits and -color each resulting graph. This is logd operations, each of O(E). 6

Implementation Request graph Scheduler Permutation Route connections Paths 7

Implementation Pros A rearrangeably non-blocking switch can perform any permutation A cell switch is time-slotted, so all connections are rearranged every time slot anyway Cons Rearrangement algorithms are complex (in addition to the scheduler) Can we eliminate the need to rearrange? 8

Strictly non-blocking Clos Network Clos Theorem: If k >= 2n, then a new connection can always be added without rearrangement. 9

m x m n x k M k x n n I M 2 O n I 2 O 2 N I m O m N M k N = n x m k >= n 20

Clos Theorem x x + n I a n n already in use at input and output. n O b k k. Consider adding the n-th connection between st stage I a and 3 rd stage O b. 2. We need to ensure that there is always some center-stage M available. 3. If k > (n ) + (n ), then there is always an M available. i.e. we need k >= 2n. 2

Scaling Crossbars: Summary Scaling capacity through parallelism (bitslicing and time-slicing) is straightforward. Scaling number of ports is harder Clos network: Rearrangeably non-blocking with k = n, but routing is complicated, Strictly non-blocking with k >= 2n, so routing is simple. But requires more bisection bandwidth. 22