Lecture 28: Networks & Interconnect Architectural Issues Professor Randy H. Katz Computer Science 252 Spring 1996

Similar documents
Lecture 15: Networks & Interconnect Interface, Switches, Routing, Examples Professor David A. Patterson Computer Science 252 Fall 1996

CMSC 611: Advanced. Interconnection Networks

Interconnection Network

CS 258, Spring 99 David E. Culler Computer Science Division U.C. Berkeley Wide links, smaller routing delay Tremendous variation 3/19/99 CS258 S99 2

4. Networks. in parallel computers. Advances in Computer Architecture

Lecture 16: Networks & Interconnect (Routing, Examples, Protocols) + Intro to Parallel Processing

Chapter 9 Multiprocessors

Lecture: Interconnection Networks

Lecture 26: Interconnects. James C. Hoe Department of ECE Carnegie Mellon University

Communication Performance in Network-on-Chips

Recall: The Routing problem: Local decisions. Recall: Multidimensional Meshes and Tori. Properties of Routing Algorithms

Lecture 12: Interconnection Networks. Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E)

Lecture 24: Interconnection Networks. Topics: topologies, routing, deadlocks, flow control

Interconnection topologies (cont.) [ ] In meshes and hypercubes, the average distance increases with the dth root of N.

CS252 Graduate Computer Architecture Lecture 14. Multiprocessor Networks March 9 th, 2011

Internet II. CS10 : Beauty and Joy of Computing. cs10.berkeley.edu. !!Senior Lecturer SOE Dan Garcia!!! Garcia UCB!

Outline: Connecting Many Computers

Networks: Routing, Deadlock, Flow Control, Switch Design, Case Studies. Admin

Review 1/2 Operating System started as shared I/O library. CS61C Anatomy of I/O Devices: Networks. Lecture 14

Lecture 2: Topology - I

Interconnection Networks: Topology. Prof. Natalie Enright Jerger

Multiprocessor Interconnection Networks

Interconnection Networks

Network Properties, Scalability and Requirements For Parallel Processing. Communication assist (CA)

Lecture 12: Interconnection Networks. Topics: dimension/arity, routing, deadlock, flow control

Network Properties, Scalability and Requirements For Parallel Processing. Communication assist (CA)

CS152 Computer Architecture and Engineering Lecture 21 Buses and Networks Dave Patterson (

CS575 Parallel Processing

EEC-484/584 Computer Networks

CS61C : Machine Structures

CS152 Computer Architecture and Engineering Lecture 21 Buses and Networks Dave Patterson (

Interconnection Networks

CS61C : Machine Structures

TDT Appendix E Interconnection Networks

CS61C : Machine Structures

CS610- Computer Network Solved Subjective From Midterm Papers

Lecture 2: Internet Structure

Interconnection networks

Lecture 6 Datalink Framing, Switching. From Signals to Packets

1: Review Of Semester Provide an overview of encapsulation.

BROADBAND AND HIGH SPEED NETWORKS

Parallel Computer Architecture II

Lecture 13: Interconnection Networks. Topics: lots of background, recent innovations for power and performance

Revision of Previous Lectures

Communication Networks - 3 general areas: data communications, networking, protocols

1/29/2008. From Signals to Packets. Lecture 6 Datalink Framing, Switching. Datalink Functions. Datalink Lectures. Character and Bit Stuffing.

Routing Algorithm. How do I know where a packet should go? Topology does NOT determine routing (e.g., many paths through torus)

ECE 4750 Computer Architecture, Fall 2017 T06 Fundamental Network Concepts

INTERCONNECTION NETWORKS LECTURE 4

Networks 15.2 Multiplexing Technologies Access Networks 15.5 Common Peripheral Interfaces

Next Steps Spring 2011 Lecture #18. Multi-hop Networks. Network Reliability. Have: digital point-to-point. Want: many interconnected points

NOW Handout Page 1. Outline. Networks: Routing and Design. Routing. Routing Mechanism. Routing Mechanism (cont) Properties of Routing Algorithms

Module 17: "Interconnection Networks" Lecture 37: "Introduction to Routers" Interconnection Networks. Fundamentals. Latency and bandwidth

Integrating Information Systems: Technology, Strategy, and Organizational Factors

Module 2 Overview of Computer Networks

Module 2 Overview of. Computer Networks

Introduction to Wireless Networking ECE 401WN Spring 2008

CCM 4300 Lecture 5 Computer Networks, Wireless and Mobile Communications. Dr Shahedur Rahman. Room: T115

Homework Assignment #1: Topology Kelly Shaw

CS 5520/ECE 5590NA: Network Architecture I Spring Lecture 13: UDP and TCP

A closer look at network structure:

ET4254 Communications and Networking 1

CS 498 Hot Topics in High Performance Computing. Networks and Fault Tolerance. 9. Routing and Flow Control

Data Link Layer, Part 5. Medium Access Control

Lecture: Interconnection Networks. Topics: TM wrap-up, routing, deadlock, flow control, virtual channels

Local Area Networks (LANs): Packets, Frames and Technologies Gail Hopkins. Part 3: Packet Switching and. Network Technologies.

Parallel Systems Prof. James L. Frankel Harvard University. Version of 6:50 PM 4-Dec-2018 Copyright 2018, 2017 James L. Frankel. All rights reserved.

Announcements. CS 5565 Network Architecture and Protocols. Outline for today. The Internet: nuts and bolts view. The Internet: nuts and bolts view

Interconnection Networks

Digital Communication Networks

BlueGene/L. Computer Science, University of Warwick. Source: IBM

Network-on-chip (NOC) Topologies

ECE 669 Parallel Computer Architecture

Communicating over the Network

Interconnection Networks and Clusters. Interconnection Networks and Clusters. Interconnection Networks and Clusters

Internetworking is connecting two or more computer networks with some sort of routing device to exchange traffic back and forth, and guide traffic on

Communication has significant impact on application performance. Interconnection networks therefore have a vital role in cluster systems.

Lecture 16: On-Chip Networks. Topics: Cache networks, NoC basics

WAN Technologies CCNA 4

Data Communication & Computer Networks INFO

ITP 140 Mobile Applications Technologies. Networks

Chapter 2 - Part 1. The TCP/IP Protocol: The Language of the Internet

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 20: Networks and Distributed Systems

Reminder: Datalink Functions Computer Networking. Datalink Architectures

Local Area Network(LAN)

Lecture 2 Parallel Programming Platforms

IP Packet Switching. Goals of Todayʼs Lecture. Simple Network: Nodes and a Link. Connectivity Links and nodes Circuit switching Packet switching

Network.... communication system for connecting end- systems. End-systems a.k.a. hosts PCs, workstations dedicated computers network components

EE382 Processor Design. Illinois

Interconnect Technology and Computational Speed

Introduction to computer networking

Packet Switch Architecture

Packet Switch Architecture

Distributed Memory Machines. Distributed Memory Machines

Networks 15.2 Multiplexing Technologies Access Networks 15.5 Common Peripheral Interfaces

TCP/IP THE TCP/IP ARCHITECTURE

Chapter 8 LAN Topologies

CS 428/528 Computer Networks Lecture 01. Yan Wang

Computer Networks Medium Access Control. Mostafa Salehi Fall 2008

Chapter 2 Network Models 2.1

Transcription:

Lecture 28: Networks & Interconnect Architectural Issues Professor Randy H. Katz Computer Science 252 Spring 1996 RHK.S96 1

Review: ABCs of Networks Starting Point: Send bits between 2 computers Queue on each end Can send both ways ( Full Duplex ) Rules for communication? protocol Inside a computer? Loads/Stores: Request(Address) & Response (Data) Need Request & Response Name for standard group of bits sent: Packet RHK.S96 2

Review: Questions about Simple Network What if more than 2 computers want to communicate? Need computer address field in packet What if packet is garbled in transit? Add error detection field in packet What if packet is lost? More elaborate protocols to detect loss What if multiple processes/machine? Queue per process Questions such as these lead to more complex protocols and packet formats RHK.S96 3

Review: Implementation Issues Interconnect MPP LAN WAN Example CM-5 Ethernet ATM Maximum length 25 m 500 m; copper: 100 m between nodes 5 repeaters optical(multimode): 2 km Number data lines 4 1 1 optical(single mode): 25 km Clock Rate 40 MHz 10 MHz 155.5 MHz Shared vs. Switch Switch Shared Switch Maximum number 2048 254 > 10,000 of nodes Media Material Copper Twisted pair Twisted pair copper wire copper wire or or Coaxial optical fiber cable RHK.S96 4

Review: Network Performance Measures Overhead: latency of interface vs. Latency: network RHK.S96 5

Review: Impact of Overhead on Delivered BW 1000.00 1 Delivered BW (MB/sec) 100.00 10.00 1.00 10 100 1000 0.10 1 10 100 1000 MinTime one-way µsecs Peak BW (MB/sec) BW model: Time = overhead + msg size/peak BW > 50% data transfered in packets = 8KB RHK.S96 6

Review: Interconnect Issues Implementation Issues Performance Measures Architectual Issues Practical Issues RHK.S96 7

Example Architecture Measures Interconnect MPP LAN WAN Example CM-5 Ethernet ATM Topology Fat tree Line, Bus Variable, constructed from multistage switches Connection based? No No Yes Data Transfer Size Variable: Variable: Fixed: 4 to 20B 0 to 1500B 48B Store & Forward? No n.a. Yes Congestion control At source: At source: Rate based Flow Listen for via choke control E-net idle packets via back pressure RHK.S96 8

Topology Structure of the interconnect Determines Degree: number of links from a node Diameter: max number of links crossed between nodes Average distance: number of hops to random destination Bisection: minimum number of links that separate the network into two halves Warning: these three-dimensional drawings must be mapped onto chips and boards which are essentially two-dimensional media Elegant when sketched on the blackboard may look awkward when constructed from chips, cables, boards, and boxes RHK.S96 9

Important Topologies N = 1024 Type Degree Diameter Ave Dist Bisection Diam Ave D 1D mesh 2 N-1 N/3 1 2D mesh 4 2(N 1/2-1) 2N 1/2 / 3 N 1/2 63 21 3D mesh 6 3(N 1/3-1) 3N 1/3 / 3 N 2/3 ~30 ~10 nd mesh 2n n(n 1/n - 1) nn 1/n / 3 N (n-1) / n (N = k n ) Ring 2 N / 2 N/4 2 2D torus 4 N 1/2 N 1/2 / 2 2N 1/2 32 16 k-ary n-cube 2n n(n 1/n ) nn 1/n /2 15 8 (3D) (N = k n ) nk/2 nk/4 2k n-1 Hypercube n n = LogN n/2 N/2 10 5 Cube-Connected Cycles RHK.S96 10

Topologies (cont) N = 1024 Type Degree Diameter Ave Dist Bisection Diam Ave D 2D Tree 3 2Log 2 N ~2Log 2 N 1 20 ~20 4D Tree 5 2Log 4 N 2Log 4 N - 2/3 1 10 9.33 kd k+1 Log k N 2D fat tree 4 Log 2 N N 2D butterfly 4 Log 2 N N/2 20 20 CM-5 Thinned Fat Tree RHK.S96 11

Butterfly Multistage: nodes at ends, switches in middle N/2 Butterfly N/2 Butterfly All paths equal length Unique path from any input to any output Conflicts Benes Network N Butterfly Reversed N Butterfly Routes all permutations w/o conflict Notice similarity to Fat Tree Randomization is major breakthrough RHK.S96 12

Multistage Fat Tree Randomly assign packets to different paths on way up to spread the load RHK.S96 13

Example Networks Name Number Topology Bits Clock Link Bisect. Year ncube/ten 1-1024 10-cube 1 10 MHz 1.2 640 1987 ipsc/2 16-128 7-cube 1 16 MHz 2 345 1988 MP-1216 32-512 2D grid 1 25 MHz 3 1,300 1989 Delta 540 2D grid 16 40 MHz 40 640 1991 CM-5 32-2048 fat tree 4 40 MHz 20 10,240 1991 CS-2 32-1024 fat tree 8 70 MHz 50 50,000 1992 Paragon 4-1024 2D grid 16 100 MHz 200 6,400 1992 T3D 16-1024 3D Torus 16 150 MHz 300 19,200 1993 No standard topology! MBytes/second RHK.S96 14

Connection-Based vs. Connectionless Telephone: operator sets up connection between the caller and the receiver Once the connection is established, conversation can continue for hours Share transmission lines over long distances by using switches to multiplex several conversations on the same lines Time division multiplexing divide B/W transmission line into a fixed number of slots, with each slot assigned to a conversation Problem: lines busy based on number of conversations, not amount of information sent Advantage: reserved bandwidth RHK.S96 15

Connection-Based vs. Connectionless Connectionless: every package of information must have an address => packets Each package is routed to its destination by looking at its address, e.g., the postal system Split phase buses are send packets Statistical multiplexing RHK.S96 16

Packet Formats Fields: Destination, Checksum(C), Length(L), Type(T) Data/Header Sizes in bytes: (4 to 20)4, (0 to 1500)/26, 48/5 RHK.S96 17

Congestion Control Packet switched networks do not reserve bandwidth; this leads to contention Solution: prevent packets from entering until contention is reduced (e.g., metering lights) Options: Packet discarding: If a packet arrives at a switch and there is no room in the buffer, the packet is discarded Flow control: between pairs of receivers and senders; use feedback to tell the sender when it is allowed to send the next packet» Back-pressure: separate wires to tell to stop» Window: give the original sender the right to send N packets before getting permission to send more (overlap the latency of the interconnection with the overhead to send and receive a packet) Choke packets: aka rate-based ; Each packet received by busy switch in warning state sent back to the source via choke packet. Source reduces traffic to that destination by a fixed % (ATM Forum) RHK.S96 18

Store and Forward vs. Cut-Through Store-and-forward policy: each switch waits for the full packet to arrive in the switch before it is sent on to the next switch Cut-through routing or worm hole routing: switch examines the header, decides where to send the message, and then starts forwarding it immediately In worm hole routing, when the head of the message is blocked the message stays strung out over the network, potentially blocking other messages (needs only buffer the piece of the packet that is sent between switches). CM-5 uses it, with each switch buffer being 4 bits per port. Cut through routing lets the tail continue when the head is blocked, accordioning the whole message into a single switch. (Requires a buffer large enough to hold the largest packet). RHK.S96 19

Store and Forward vs. Cut-Through Advantage Latency reduces from function of: number of intermediate switches X by the size of the packet to time for 1st part of the packet to negotiate the switches + the packet size interconnect BW RHK.S96 20

Switching w = wire width, m = message size b = m/w, H = number of hops R = delay per router Circuit switching Establish end-to-end route, then transmit data Packet switching Route data as it moves forward L = HR c + b Store-and-forward: L = H(b + R sf ) Cut-through: L = HR ct + b Typically, R ct = 32/w Most modern networks are about 1 small packet deep Link transfer time RHK.S96 21

Examples Machine Network W R rate MHz MB/s per link ncube/2 13-D hypercube 1 40 40 2.2 CM-5 4-D Thinned Fat Tree 4 8 40 20 Delta 2-D Mesh 8 2 ~40 40 T3D 3-D Torus 16? 150 300 What about performance? Degree vs. Distance Generally, topologies with larger distance have better physical layout, so they have shorter, wider links and smaller routing delays. Hence, latencies are similar: L = HR ct + m/w (H increases, b decreases)» Low degree networks have few links per processor» If each message travels a larger number of links => few outstanding messages per processor» With skinny wires, message occupies path for a long time RHK.S96 22

Routing Deterministic follows a prespecified route mesh: dimension-order routing» (x 1, y 1 ) -> (x 2, y 2 )» first x = x 2 - x 1,» then y = y 2 - y 1, hypercube: edge-cube routing» X = x o x 1 x 2... x n -> Y = y o y 1 y 2... y n» R = X xor Y» Traverse dimensions of differing address in order tree: common ancestor Adaptive route based on network state (e.g., contention) Deadlock free 010 110 111 011 100 000 001 101 RHK.S96 23

Practical Issues Interconnection MPP LAN WAN Example CM-5 Ethernet ATM Standard No Yes Yes Fault Tolerance? No Yes Yes Hot Insert? No Yes Yes Standards: required for WAN, LAN! Fault Tolerance: Can nodes fail and still deliver messages to other nodes? required for WAN, LAN! Hot Insert: If the interconnection can survive a failure, can it also continue operation while a new node is added to the interconnection? required for WAN, LAN! RHK.S96 24

Protocols: HW/SW Interface Internetworking: allows computers on independent and incompatible networks to communicate reliably and efficiently; Enabling technologies: SW standards that allow reliable communications without reliable networks Hierarchy of layers, giving each layer responsibility for portion of overall communications task, called protocol families or protocol suites Transmission Control Protocol/Internet Protocol (TCP/IP) This protocol family is the basis of the Internet RHK.S96 25

Protocol Key to protocol families is that communication occurs logically at the same level of the protocol, called peer-topeer, but is implemented via services at the lower level Danger is each level increases latency RHK.S96 26

FTP From Stanford to Berkeley FDDI Ethernet Hennessy T1 FDDI Patterson Ethernet Ethernet BARRNet is WAN for Bay Area T1 is 1.5 mbps leased line; T3 is 45 mbps; FDDI is 100 mbps LAN IP sets up connection, TCP sends file RHK.S96 27

Summary: Interconnections Communication between computers Packets for standards, protocols to cover normal and abnormal events Implementation issues: length, width, media Performance issues: overhead, latency, bisection BW Topologies: many to chose from, but (SW) overheads make them look alike; cost issues in topologies RHK.S96 28