Simulation Analysis of Permutation Passibility behavior of Multi-stage Interconnection Networks

Size: px
Start display at page:

Download "Simulation Analysis of Permutation Passibility behavior of Multi-stage Interconnection Networks"

Transcription

1 Simulation Analysis of Permutation Passibility behavior of Multi-stage Interconnection Networks A Thesis Report Submitted in the partial fulfillment of the requirements for the award of the degree of ME in Software Engineering Submitted by Yukti Jindal M.E. (Software Engineering) Under the supervision of: Mrs. Rinkle Aggarwal Lecturer Computer Science and Engineering Department Thapar Institute of Engineering and Technology Computer Science and Engineering Department Thapar Institute of Engineering and Technology Deemed University Patiala 1

2 DECLARATION I declare that the work presented in this thesis is, to the best of my knowledge, original and my own work, except as acknowledged in the text, and that it has not been submitted, either in whole or part, for a degree at this, or any other, university. Yukti Jindal Regd. No CERTIFICATE

3 This is to certify that the thesis work entitled, Simulation Analysis of Permutation Passibility behavior of Multi-stage Interconnection Networks submitted by Yukti Jindal, in the partial fulfillment of the requirement for the award of degree of Master of Engineering in Software Engineering at Thapar Institute of Engineering & Technology (Deemed University), Patiala, is a record of candidate s own work carried out by her under my supervision and guidance. Mrs. Rinkle Aggarwal Lecturer, CSED TIET, Patiala Ms. Seema Bawa Head, CSED TIET, Patiala Dr. D.S. Bawa Dean, Academic Affairs TIET, Patiala The M.E. (Thesis) viva-voce examination of Yukti Jindal, Regd. No. 8011, M.E. (Software Engineering), Thapar Institute of Engineering & Technology (Deemed University), Patiala has been held on Supervisor External Examiner ACKNOWLEDGEMENTS I wish to express my sincere gratitude Mrs. Rinkle Aggarwal, Lecturer, Computer Science and Engg. Department, for providing invaluable guidance, suggestions and

4 sympathetic attitude, which inspired me to submit this thesis report on time. I also wish to express my gratitude to Mr. Rajesh Bhatia, Lecturer, Computer Science and Engg. Department, for her valuable advises and suggestions. I am also thankful to Ms. Seema Bawa, Head, Computer Science and Engg. Department, for her kind help and cooperation. I would also like to thank all the staff members of Computer Science and Engineering Department for providing me all the facilities required for the completion of this work. I would like to extend my special thanks to Mr. Himanshu Aggarwal, Lecturer, Computer Science and Engg. Department in Punjabi Univ for providing me invaluable support. I am deeply indebted to my parents,sister and brother for the inspiration and ever encouraging moral support, which enabled me to pursue my studies. Yukti Jindal CONTENTS Declaration (ii) Certificate.....(iii) Acknowledgment.....(iv) Page no.

5 Contents....(v) List of Figures.. (ix) Abstract.....(x) Chapter 1.1 Introduction Introduction to parallel systems Classification of parallel systems Multicomputer Multiprocessors Tightly coupled systems Loosely coupled systems Network Concepts Problem definition Organization of thesis Chapter Interconnection Networks Introduction

6 . Interconnection Networks Design dimensions Switching methodology Operating Modes Control strategy Network topology Chapter Multistage Interconnection Networks Introduction 1. Classification of Multistage interconnection networks Classification according to Path Multipath networks Unique path Classification according to Switches Regular networks Irregular networks Classification according to Control Flip controlled networks Distributed control networks Other classification Blocking networks Non blocking networks Type of connections in MINs Routing in MINs Routing tags

7 .. Type of routing in MINs Non adaptive routing Adaptive routing Dynamic Routing Fault tolerance.... Chapter A survey of MINs Introduction.... Unique- path MINs Crossbar Network Omega Network Generalized Cube Network Shuffle Exchange network Multi-path Static Regular MINs Benes Network Multipath Regular Dynamic MINs Augmented Shuffle Exchange MINs Irregular MINs Four-Tree Network (FT)..8.. Modified Four tree network..0 Chapter Permutation Passabiltiy in MINs...1 Introduction..

8 . Simulation Analysis.... Simulation of Permutation Passability in Regular Networks..1 ASEN network... FT Network.... Modified FT Network..0 Chapter Algorithm for Calculating the Permutation Passabiltiy Behavior of MINs..1 Introduction Algorithm for ASEN Network Virtual Inputs..... Graph Matrix... Outline of input, Computation and Output Parameters..... Illustrated Example of an Algorithm Algorithm for the ASEN network.... Algorithm for FT Network..1 Virtual Inputs Graph Matrix.8.. Outline of Input, Computation and Output Parameters Illustrated Example of an Algorithm Algorithm for the FT network..... Algorithm for Modified FT Network Virtual Inputs Graph Matrix. 8.. Outline of input, Computation and Output Parameters Illustrated Example of an Algorithm Algorithm for the Modified FT network..8. Pseudocode..89 8

9 ..1 Pseudocode for ASEN Network...89 Chapter Conclusion and Further Scope Conclusions Further Scope.9 References. 9 9

10 LIST OF FIGURES Figure no. Figure name Page no..1. Fully connected network 10. Mesh (a) -D mesh network with width 1 (b) -D mesh network with width (c) -D mesh network. Simple Ring network 1. (b) Chordal Ring network 1. Hypercubes of dimension zero through four 1.1 A multistage interconnection network 1. Communication in a bi-directional MIN. The communication indicated at 18.1 Conceptual view of a single stage interconnection network. Classification of MINs. Crossbar Network 9. Omega Network 0. Generalized Cube Network 1. A perfect shuffle. A Inverse shuffle.8. Benes Network.9 ASEN 10

11 .10. FT Modified FT 1 ABSTRACT There are many different ways to organize computational structures to exploit parallelism. Many research efforts around the world are being conducted with the purpose of determining that hardware and software organizations that are best suited for general purpose parallel processing. The communication subsystems linking processors/memory modules and input/output controllers in a parallel processing system is one of its most important architecture features and has a profound impact on system capabilities, performance, size and cost. An interconnection network of the processors provides the desired connectivity and performance at minimum cost is required for communication in parallel processing systems with a large number of components. Multistage interconnection networks play an important role in the parallel computing systems. In multistage interconnection networks the fixed inter stage connections between adjacent stages exist with a number of switches at each stage that are dynamically set to establish the desired connection to route the requests from the inputs to outputs. The distribution of switches as well as their complexity is very important in designing multistage interconnection networks. The important features in the study of multistage interconnection networks are estimation of complexity, fault tolerant, communication efficiency, performance and 11

12 cost. In this thesis, a survey of various regular and irregular multistage interconnection networks is made. A new algorithm has been developed for finding the permutation passiability behavior of some regular as well as irregular networks. The permutation passibility behavior of existing regular and irregular networks is analyzed and also the simulation of permutation possibility behavior of some regular as well as irregular networks is defined. 1

13 CHAPTER 1 Introduction 1.1 Introduction to parallel systems High performance Computing Systems can be designed using parallel or distributed processing. There is considerable research in these areas due to following reasons: œ Technological development in the area of VLSI chips. œ Significant software parallelism is inherent in many scientific and engineering applications. A parallel computer is a collection of processing elements that cooperate to solve large problems fast. A parallel system means a set of different processors or computers interconnected together to improve performance. Use of more than one processing units at a time to enhance the overall output is called multiprocessing or multicomputing. The performance of a computer depends directly on the time required to perform a basic operation and the number of these basic operations that can be performed concurrently. High performance computing systems can be designed using parallel processing. The principle target in designing any parallel system is to reduce the communication overheads between both, the processors as well as the memory. Following are the desirable characteristics for the communication between processors and memory units: œ High speed œ High reliability œ On-line reparability œ Good performance even under faults œ Low cost œ Simple routing control œ Full access capability 1

14 However, multiple processor systems have a number of potential disadvantages, probably the most important being the very real problem of using the processing power in an efficient manner. This involves a number of factors such as, the ability to decompose a problem into an optimum number of modules, their size, and to define them in such a way that communication between processors may be carried out with the absolute minimum wait time and minimum delay time across the communication network. This communication network is a link between different processors as well as memory modules. 1. Classification of parallel systems Generally parallel systems[1] are of two categories, multicomputers and multiprocessors 1..1 Multicomputer A multicomputer comprises a number of computers, or nodes, linked by an interconnection network. Each computer executes its own program. This program may access local memory or may send and receive messages over the network i.e. message passing. Messages are used to communicate with other computers or, equivalently, to read and write remote memories. In the idealized network, the cost of messages passing between two nodes is independent of both, node location and other network traffic, but does depend on message length. Therefore, multicomputers means different computers, not sharing memory, connected by some communication link, with ring, linear bus, linear star, linear bus and tree topologies, used generally for those applications which require less inter-processor interaction because the interaction among the processors ids through messages passing between source and destination processors or computers. Store and forward techniques are to pass on the message from a source to a destination. Thus, there are a lot of communication overhead involved. 1

15 1.. Multiprocessors In a computer system it is common to connect multiple components, such as processors and memory. Except for the memory with embedded processors (called PIM), processors do not have memory does not include a processor in it. Multiprocessors are a common architecture today, so there is also the need to connect more than one processor with more than one-memory modules in an efficient manner. Multiprocessor[] computer architectures consisting of interconnected, multiple processors, are basically of two types Tightly coupled systems: In these systems, all the processors share primary memory and any communication usually take place through their shared global memory. The access time from any of the processors to memory is the same. In addition to the primary memory each processor has its own local memory Loosely coupled systems: In these systems, the processors do not share memory but each processor has its own memory. In these systems, all the communication between processors is done by passing messages across the network those interconnect processors. If a processor wants to access data in another processor s memory, it must send message through the communication subsystem, requesting the other processor to send these data. Partitioning and allocation of program and data into segment play crucial role in overall performance of an application in case of loosely coupled systems. Usually, tightly coupled systems are referred to as parallel processing systems and loosely coupled systems are referred to as distributed computing systems or simply distributed systems. 1

16 1. Network Concepts œ Network Topology : The lay-out of the network. The common topologies are Static (point-to-point networks) Dynamic (indirect networks) œ Node Neighborhood : Two nodes are neighbors if there is a link connecting them. œ Node degree : Number of channels connected to one node œ Diameter of network : Maximum shortest path between two nodes œ Network redundancy (fault tolerance) : Amount of alternative paths between two nodes œ Network scalability : Measure for expandability of the network œ Node symmetry : A node symmetric network has no distinguished node, that is, the view of the rest of the network is the same from any node. Rings, fully connected networks, and hypercubes are all node symmetric. Trees and stars are not. œ Network functionality : Measure for support of routing, fault tolerance, synchronization, message combining etc. œ Network throughput : Amount of transferred data/unit time œ Network latency : Worst case delay for transfer of a unit (empty) message through the network. 1. Problem definition Multistage interconnection networks play an important role in the parallel computing systems. A Multistage interconnection networks consist of more than one stages of interconnection elements and links. Reliability and efficiency in terms of speed of operations and the costs are major considerations in the design of multistage interconnection networks. A lot of work has already been done in the design and analysis of regular and irregular networks. Since irregular networks, in general, are less costly and multipath in nature as compared to regular multistage interconnection networks. So 1

17 analysis of irregular multistage interconnection networks is important.a study of different multistage interconnection networks is made here. Permutation possibility is an important parameter to study the behavior of any multistage interconnection networks. Permutation passibility means how many input requests are simultaneously able to pass through a given network and reach successfully at the intended destination. Some of the requests will pass through the most favorable path, others have to be routed through an alternative paths are available or paths are busy in serving some other requests. In this thesis, permutation passibility behavior of existing regular and irregular networks is analyzed and also the simulation of permutation possibility behavior of some regular as well as irregular networks is defined. 1. Organization of thesis Chapter covers interconnection networks. The details of multistage interconnection networks are given in Chapter. A survey of existing regular and irregular multistage interconnection networks is covered in Chapter. Chapter deals with permutation passibilty of irregular networks and Chapter covers the algorithm for calculating the permutation passibiltiy behavior of MINs. The last Chapter is regarding the Conclusion and future scope of the work. 1

18 CHAPTER Interconnection Networks.1 Introduction There is one way for processors to communicate data is to use a shared memory and shared variables. However this is unrealistic for large numbers of processors. A more realistic assumption is that each processor has its own private memory and data communication takes place-using message passing via an Interconnection networks.. Interconnection Networks Interconnection networks[] make a major factor to differentiate modern multiprocessor architectures. They can be categorized according to a number of criteria such as topology, routing strategy and switching technique. Interconnection network is a complex connection of switches and links permitting processors in a multiprocessor system to communicate among themselves or with memory modules. It is the path, the data must travel in order to access memory in a shred memory computer or to communicate with another processor in distributed memory environment. The performance criteria for interconnection networks is: œ Fast communication œ Low cost œ Reliability during failures œ Efficiency œ Bandwidth 18

19 However, there is no network that would optimize all the performance criteria. But there are several different configurations, which provide trade-offs in terms of cost and efficiency.. Design dimensions The interconnection network plays a central role in determining the overall performance of a multicomputer system. If the network cannot provide adequate performance, for a particular application, nodes will frequently be forced to wait for data to arrive. The key design dimensions for interconnection networks are switching methodology, operational mode, control strategy and network topology, The space of interconnection networks can be represented by Cartesian product of all these dimensions i.e. operational mode, control strategy, switching methodology and network topology. The choice of a particular interconnection network depends upon the application demands, technology supports and cost effectiveness...1 Switching methodology Two major switching methodologies are circuiting switching and packet switching. In circuit switching a physical path is actually established between a source and a destination. This path exists as long as the data transmission of data. In packet switching, data is broken into packets and routed through the interconnection network without establishing a physical connection path. These packets are transmitted from source to the destination in store and forward manner, which introduce delay at each of the switching point. This delay depends on the traffic in the network along its path to the destination. Packet switching is more efficient for many short data messages. Combining the capabilities of both circuit switching and packet switching, the third type of switching methodology can be called integrated switching. 19

20 .. Operating Modes Operating modes can be either synchronous or asynchronous or a combination of the two. Synchronous communication is needed for either a data manipulating function or for a data instruction broadcast. Synchronous control techniques are characterized by a global clock, which broadcast clock signals to all devices in a system so that the entire system operates in a lock-step fashion. Asynchronous communication is needed for multi processing in which connection request is issued dynamically. Asynchronous techniques do not utilize a single global clock, but rather distribute the control function throughout the system, often utilizing many individual clocks for timing. Control and coordination of the various parts of the system are accomplished via some form of communication. Thus the interconnection network can operate synchronously-off of a global clock or it may have distributed control down to the level of the individual switches. The advantage of a single global clock for control is expandability and flexibility... Control strategy A typical interconnection network consists of a no. of switching elements and interconnecting links. Interconnection functions are realized by properly setting control of switching elements. The control setting function, for interconnection networks, can be managed by centralized controller or by individual switching element. The control strategy in the first case is centralized control and in the second case it is a distributed control strategy. 0

21 .. Network topology A network can be represented by a graph in which nodes indicate switches and edges represent communication link. Interconnection networks are built up of switching elements topology is the pattern in which the individual switches are connected to other elements, like processors, memories and other switches. Direct topologies connect each switch directly to a node, while in indirect topologies at least some of switches connect to other. The topologies can be categorized into two groups, static and dynamic. In static topology, links between two processors are passive and dedicated buses cannot be reconfigured for direct connection to other processors. Static interconnection networks are mainly used in message-passing architectures Some of the more important networks include œ Fully connected or all-to all œ Mesh œ Rings œ Hypercube œ Shuffle Exchange 1

22 œ Fully connected or all-to-all This is the most powerful interconnection network (topology): each node is directly connected to all other nodes. Fig..1.Fully connected network Each node has N-1 connections (N-1 nearest neighbors) giving a total of N (N-1) / connections for the network. Even though this is the best network to have the high number of connections per node mean this network can only be implemented for small values of N. Therefore some form of limited interconnection network must be used.

23 œ Mesh (Torus) In a mesh network, the nodes are arranged in a k dimensional lattice of width w, giving a total of w k nodes. Fig...(a) -D mesh network with width Fig. (b) -D mesh network with width

24 Fig.. (c) -D mesh network Communication is allowed only between neighboring nodes. All interior nodes are connected to k other nodes. œ Ring Fig.(a) Simple Ring network A simple ring is just a linear array with the end nodes linked.

25 It is equivalent to a 1D mesh with wraparound connections. A chordal ring is a simple ring with cross or chordal links between nodes on opposite sides. Fig.(b) Chordal Ring network œ Hypercube Connection ( Binary n-cube ) Hypercube networks consist of N = ^k nodes arranged in a k dimensional hypercube. The nodes are numbered 0, 1,...^k -1 and two nodes are connected if their binary labels differ by exactly one bit.

26 Fig. Hypercubes of dimension zero through four

27 K dimensional hypercube is formed by combining two k-1 dimensional hypercubes and connecting corresponding nodes i.e. hypercubes are recursive. each node is connected to k other nodes i.e. each is of degree k. The departmental NCUBE is based on this topology i.e. a dimensional hypercube ( nodes). K dimensional hypercube is formed by combining two k-1 dimensional hypercubes and connecting corresponding nodes i.e. hypercubes are recursive. Each node is connected to k other nodes i.e. each is of degree k. The departmental NCUBE is based on this topology i.e. a dimensional hypercube ( nodes).

28 CHAPTER Multistage Interconnection Networks.1 Introduction Multistage Interconnection networks (MINs) consist of more than one stages of small interconnection elements called switching elements and links interconnecting them. Multistage networks are described by three characteristic features, the switching elements, and network topology and control structures. A multistage interconnection network is actually a compromise between crossbar and shared bus networks. Multistage interconnection networks: œ Attempt to reduce cost œ Attempt to decrease diameter, diameter is the longest path any two nodes In a multistage interconnection network[], as in a crossbar, switching elements are distinct from processors. However, fewer than O(P ) switches are used to connect P processors.instead, messages pass through a series of switches stages. Figure.1. illustrates two MINs, which are representatives of a general class of networks characterized by parameters dimension n and radix k. Shaded circles represent processors and unshaded circles represent crossbar switches.the network on the left has k= and n=. On the right, k= and n=.the network can be constructed from unidirectional switches and links. In which case it is folded so that the processors on the left and right are the same. Alternatively, it can be constructed from bi-directional switches and links, in which case processors on the left and right are distinct. 8

29 Fig.1 A multistage interconnection network These networks are sometimes referred to as radix k, dimension n butterflies, or k-ay n- flies. Either n stages of k (n-1)k*k unidirectional crossbar switches connect p=k n processors or n stages of k (n-1)k*k bi-directional crossbar switches connect p=k n processors. In the latter case, each link comprises two channels that carry data in opposite directions, and each crossbar switch can route data arriving on any of k inputs to any of k outputs. Each stage of these networks connects p inputs with p outputs, although not every input is directly connected to every output in each stage. In a unidirectional MIN, all messages must traverse the same number of wires, and so the cost of sending a message is independent of processor location. In effect, all processors are equidistant. In a bi-directional MIN, the number of wires traversed depends to some extent on processor location, although to a lesser extent than in a mesh or hypercube. 9

30 (a) (b) Fig.. Communication in a bi-directional MIN. The communication indicated at (a) involves processors connected to the same crossbar; it takes just two hops and passes through a single switch. The communication at (b) takes three hops and passes through two switches. 0

31 The fact that messages destined for different destinations may need to pass over the same wire means that MINs are not immune to competition for bandwidth. Nevertheless, a MIN connecting P processors typically provides P wires at each stage, so in principle we should be able to organize communications so that little competition occurs. Switching elements The switching elements[11] may be viewed as a very small network. These switches are the devices having multiple input and outputs.the number of inputs/outputs and the input-to-output connections supported within a switch vary among the various designs. A two-function switch can assume either the straight or the exchange states i.e. straight, exchange, upper broadcast and lower broadcast. Switching Element Straight Exchange Lower Broadcast Upper broadcast 1

32 . Classification of Multistage interconnection networks Multistage interconnection networks can be classified according to different categories. The main classification categories are path, switches and control...1 Classification according to Path..1.1 Unique path networks: These networks provide unique path between every source and destination. The failure of any switching elements along the path disconnects some source-destination pairs. So adversely affecting the capabilities of the existing network. These are not reliable for a large multiprocessor system, as they cannot tolerate even a single fault. In case of multiple requests, a source destination connection any be blocked by a previously established connection. Thereby providing a poor performance...1. Multipath networks: These provide more than one paths between source and destination.in case, there is a failure of one switching element in the path, the request is routed through some other alternative path. Unique path multistage interconnection networks can be made multipath by adding redundancy in the form of extra switching elements, links, stages, sub networks, by increasing the size of switching elements or using multipath networks. Multipath multistage interconnection networks[8] can be either static or dynamic. For static networks, if a fault is encountered, then data has to backtrack, to the source or some fixed point to select an alternative path in the network. The implementation of backtracking is expensive in terms of the hardware. In dynamic networks, if a fault is encountered in a particular stage, a switching element in proceeding stage will re-route data through an alternative available path.

33 .. Classification according to Switches...1 Regular networks: Regular multistage interconnection networks have got an equal number of switching elements per stage; as a result they may impose equal time delay to all requests passing through them.... Irregular networks: Irregular multistage interconnection networks have unequal number of switching elements per stage and thus they are inherently multipaths in nature. For a given source-destination pair, there are different path lengths available... Classification according to Control...1 Flip controlled networks: Flip controlled multistage interconnection networks have a common control signal for switching in various switching elements at a given stage. This network is less complicated due to lesser number of control signals but has lesser bandwidth.... Distributed control networks: Distributed control multistage interconnection networks have a separate signal for every switching element. These have higher bandwidth due to selection of source destination pair at a given time and are quite complex... Other classification...1 Blocking networks: In blocking network, simultaneous connections of more than one terminal pair may result in conflict in use of network communication links. For example Omega networks.... Non blocking networks: A network is called non blocking if it is possible to route from any source to any destination, in presence of other established sourcedestination routes, provided no two sources have same destination. In other words, a

34 network is called rearrangeble if it can perform all possible connections between inputs and outputs by rearranging its existing connections so that a connection path for a new input output pair can always be established. For example Benes network.. Type of connections in MINs There are four types of connections[9], which are commonly used, in multistage interconnection networks. These are: œ One to one connections: A one to one connection passes information from a source to a destination. The exact route, taken by the information, is determined by path itself. œ Multipath connection: Multipath means many one to one connections are active simultaneously. œ Permutation connection: A set of one to one connections such that no two connections have the same sources and destinations. œ Broadcast connection: Information flow from source to various destinations either some or all. Thus a number of destinations simultaneously receive the information.. Routing in MINs No decision regarding the routing in a network is a perfect one. To acquire nearly complete knowledge for routing would require so many overheads that traffic throughout would be simultaneously reduced. For example, if there is minimal traffic, the network path with minimum number of links will normally be the best. If a node or switching elements fails, then the path with minimum number of links, which bypass the failures, will be the best. As traffic builds up, however this simple routing strategy can give poor results at times because the shortest path may happen to be congested. So the network as a whole should employ a routing strategy that would by pass areas of congestion.

35 There are several objectives of routing strategy: œ Minimize the transmit times œ Minimize the costs œ Maximize the network through put capability. To minimize the transmit times under conditions changing load, many control signals or overheads would be sent so that network throughput would be reduced. On the other hand, maximizing the throughput could be done at the expense of packet transmit times...1 Routing tags Routing tag is a way of describing the path though the network. For multistage interconnection networks, these tags are generally expressed as a multi-digit integer expressed by the destination. Each successive digit in this integer encodes the settings for the switch in the next stage along a desired path. The control is called distributed if the devices, using the network switches can be set their own based on the tag information... Type of routing in MINs The routing in multistage interconnection networks take place through the generation of routing tags, which specify a fault free path. There is basically three type of routing that is commonly used in multistage interconnection networks....1 Non adaptive routing: In this method a source leans a fault when a path it is attempting to establish reaches the faulty network component. A notice of fault is sent to the source, which tries next alternative path. This method has poor performance though it requires little hardware.

36 ... Adaptive routing: The adaptive routing can be of the following types: Notification on demand: with notification on demand, a source maintains a table of faults it encountered in attempting to establish paths and uses this information to guide the future routing. Broadcast routing: With broadcast notification of a fault, all the sources are notified of the fault components as they diagnosed.... Dynamic Routing: A fault free path need not be specified by a source if the routing tags are modified in response to the faults as a path is followed or established. The dynamic routing can be accomplished in multistage interconnection networks constructed of switches, which are capable of performing the necessary tag revision.. Fault tolerance A fault tolerance[10] multistage interconnection network provides service routing even under the faults. Fault can be permanent or transient in nature. Fault tolerance is a criteria that must be met for the network which has tolerated a given fault or faults. A network is a single fault tolerant if it can function as specified by its fault-tolerance criteria despite any single fault conforming to its faults models. In general, if any set of I- faults can be tolerated by a network, then network is I-fault tolerant. A network that can tolerate some instances of I-faults is robust although not I-fault tolerant.

37 CHAPTER A survey of MINs.1 Introduction To solve the problem of providing fast, reliable and efficient communication at a reasonable cost in large parallel processing systems, many different networks between the extremes of single bus and the cross bar have been proposed. Such interconnection networks can be constructed from single or multiple stages of switches. In a single stage network, data may have to be passed through the switches several times before reaching the final destination. In multistage network, one pass of multiple stages of switches is usually sufficient. Sources Destination 0 i u 0 O u i u 0 O u N-1 I u N O u N N-1 Fig..1 Conceptual view of a single stage interconnection network

38 A conceptual model of a single stage network to interconnect an N component parallel processing system is shown in fig.1, which indicates that the single stage network can be viewed as an intra-connected set of N input and N output units. The way input units are connected with the output units, determine the functional characteristics of the network i.e. the allowable interconnections. The single stage network is also called a recirculating network. Data items may have to recirculate through the single stage several times before reaching their final destination. Number of recirculations needed depends upon connectivity in a single stage network. In general, the higher is the hardware connectivity, the lesser is the number of recirculations. Multistage interconnection networks are built from the stages of the basic from stages of the basic single stage networks. Communication through MINs varies in multiprocessor systems and therefore a number of MINs have been proposed, analyzed and notified by various researchers. MINs have different design depending upon the number of stages, number of SEs per stage and number of links etc. The design also determines the control strategy and routing algorithms. Therefore, different MINs offer varying degrees of reliability, efficiency, cost and fault-tolerant. 8

39 MINS UNIQUE PATH MULTI PATH REGULAR MINS IRREGULAR MINS STATIC DYNAMIC STATIC DYNAMIC Fig.. Classification of MINs 9

40 . Unique- path MINs Unique path MINs is characterized by the presence of a single path between any sourcedestination pair. They are not even single fault tolerant and failure of even one of the switches disconnects some of the source-destination pairs. Further, distinct sourcedestination pairs may have common link, therefore, blocking of link due to preoccupation may occur. Hence, they are unreliable, as they are non-fault tolerant. Some of the examples of these MINs are described below:..1 Crossbar Network The crossbar[1] MIN is shown in Fig..It is the simplest and cost intensive method for connecting N sources to N destinations. Total No. of Stages =N Main features: For a N network No. of stages = N No. of Switches/ Stage = N Total No. of stages =N Limitations: Cost intensive way of connection, which becomes highly prohibitive for a large network. 0

41 Fig.. Crossbar Network.. Omega Network The Omega[] network is shown in Fig... In Omega network the interconnection pattern is often called as Perfect Shuffle. The routing in this network takes place based on the destination tag. If tag bit is 0 signal goes to the upper destination and if it is 1 then the request is routed to the lower destination. Unlike Crossbar it is a Blocking network, because not all set of requests are processed simultaneously. Conflicts can occur over the use of an interconnection or the switch as well as between the requests and replies. Main Features: For a N network: No. of stages=log N No.of switches per stage = N/ Total No. of switches= N/ log N 1

42 Fig..Omega Network

43 Fig..Generalized Cube Network

44 .. Generalized Cube Network The generalized cube network is shown Fig... It is very much similar to the Omega network explained above and is also an example of the Unique path MINs. Main Features: For an N*N cube No. of stages= log N No.of switches per stage = N/ Total No. of switches = N/ log N.. Shuffle Exchange network Shuffle Exchange network is based on routing functions that are Shuffle and Exchange. A perfect shuffle of N=8 is shown in Fig. Perfect shuffle[1] cuts the deck into two halves from the center and intermixes them evenly. Inverse perfect shuffle does the opposite to restore the original ordering as shown in Fig (0) (1) () 011 () 100 () 101 () 110 () 111 () Fig. A perfect shuffle 000 (0) (1) () 011 () 100 () 101 () 110 () 111 () Fig. A Inverse shuffle

45 The unique paths MINs have many important properties 1) The hardware cost is of O (N log N), providing N simultaneous paths having pathlengths O (log N). ) They provide simple and distributed routing algorithms making the central routing unnecessary and thus result in low communication latency times. ) A unique path exists between any source to any destination, and distinct source/ destination paths may have common links. There are few problems with this kind of configuration. They are 1) A source/destination connection may be blocked by a previously established connection thereby providing a poor performance in a random access environment. ) The failures of even a single link or switch will cause disconnection of several source/destination paths. Such a MIN with 1 source and 1 destinations, built from * switching elements is the shuffle-exchange MIN that, can be easily depicted in the Fig.. Multi-path Static Regular MINs..1 Benes Network The Benes network is shown in Fig..8. It is also similar to the Omega Network but is no blocking problem in the Benes network as it has more number of stages compared to the Omega network. Unlike the Omega network, in which there is exactly one path from any source to the destination, in Benes network there are many alternatives. For example source 0 can be connected to the destination to the destination 0 via 1A, A, C, A, A, but another route is through 1A, C, C, C, A.If the complete list of source-destination pairs is available, it is possible to find a set of routes beforehand provided n o two sources want to use the same destination simultaneously. However, if the requests come in sequential order, it is not always possible to satisfy al of them due to the sequential order,

46 it is not always possible to satisfy all of them due to the choices already made. The routing in this network is much more complex compared to the Omega Network as the choice for the selection of a route is made at every step.. Multipath Regular Dynamic MINs In these MINs, a fork at every SE re-routes the data to an alternate available path without resorting to the backtracking. It avoids the unnecessary time delay and the fault-tolerance is generally achieved by providing extra links in the network.. Multipath Regular Dynamic MINs..1 Augmented Shuffle Exchange MINs The Augmented Shuffle Exchange[] MINs provide the fault-tolerance using links between the Conjugate pairs of switches. At a stage I, two switches k and 1 belong to the same conjugate subset if k =1 mod I. In each conjugate subset of switches there are several pairs of switches called Conjugate pair of switches. For example in Fig.9 the switches (1,0) and (,0) form conjugate pair since both are connected to the switches (,1) and (,1). Conjugate subsets and Conjugate pairs of switches play a fundamental role in the augmentation scheme. The scheme for creating multiple paths is based on connecting the switches to a conjugate subset by using additional links to form loops for this purpose the * switches by 8 switches. The vertical links that are used to connect switches in the same stage are called auxiliary links. The network of Fig.9 shows this augmentation. By using these auxiliary links fault-tolerant routing can be accomplished. If a switch is not able to process a request because o a faulty switch in the next stage or because of a busy link, it can re-route that request via its auxiliary destination link to the conjugate switch in the loop. The next switch in turn will make connection to a different (non-faulty) switch in the following stage. For example, suppose a connection is made from source link 8 to destination link. Under the normal conditions, the path will be (,0), (0,1), (1,) and (,). Now suppose that switch (1,) and (,). Now, suppose that switch (1,) is faulty,

47 Fig.8.Benes Network

48 using an auxiliary link whenever a fault is encountered allows any source to be connected to any destination while tolerating any single faulty switch in any stage other than the initial stage and the final stage. If a switch failure occurs at the initial stage then it disconnects those sources from rest of the network. Similarly, a switch failure in the final stage disconnects the destinations attached to that switch. To attain fault tolerance in these stages (Initial and final), each source and destination must be provided with at least switches. Schemes for connecting sources and destination must be provided with at least one additional input/output link so that they can be connected to two switches. Schemes for connecting sources and destinations equipped with two ports each are shown in Fig..9 A *1 multiplexer is placed at each source link of stage 0, and each source is connected to two distinct multiplexers. To provide additional connections to the destinations, N 1* demultiplexers replace the N/ switches in the final stage of MIN. Each destination is directly connected to two demultiplexers. In this Fig..9 various loops are so formed subject to the condition that no two switches in a given loop form the conjugate pair, in order to achieve the property of on-line repair and maintenance in addition to single fault-tolerance. Such a network is called an augmented shuffle exchange network or ASEN. The ASEN-Max with maximum number of switches connected in a loop is shown in fig. Similarly ASEN- and other networks can be formed. In general, an ASEN with number of switches in a loop in stage I is equal to min ( n-i-, K) is called ASEN-K. 8

49 Fig.9 ASEN 9

50 . Irregular MINs..1 Four-Tree Network (FT) FT[] network is represented in Fig.10 FT is an irregular Dynamic MIN that supports multiple paths of different lengths. Two subgroups of identical sub networks are used in its formation each having the size ^n-1 * ^n-1 are used. In this network * switches are used instead of * switches. Ever * switch in first subnetwork forms a loop with the corresponding switch in the other subnetwork. Connecting both the subgroups by means of the Multiplexers and Demultiplexers provides the Fault Tolerance in the first and the last stages. Every request is first tried to be routed through the primary path, if unavailable due fault or if it is busy, the request is routed through the path, if unavailable due fault or if it is busy, the request is routed through the other subnetwork. Thus a fork is available at every stage except the last sage which makes alternate routing possible. FT is a single-fault tolerant in nature. For a N*N FT network Main Features: No. of stages= log N/-1 No. of to 1 Multiplexers= N No. of 1 to Demultiplexers= N No. of SEs in stage i and log N/-1= n-i-1 0

51 Fig.10 FT 1

52 .. Modified Four tree network MFT represented in Fig.11.which uses one stage less, compared to the FT and uses two switches less compared to the FT. It connects all sources to the destinations and provides fault-tolerance and reliability features well as FT. Here the maximum path length is, which is one less compared to FT. Thus there is benefit in terms of cost and path-length. Also, as it uses lesser number of paths and switches, therefore, more fault tolerant. More the number of links and switches more are the number of faults. For a N*N MFT network Main Features: No. of stages = log N/- No. of to 1 Multiplexers = N No. of 1 to Demultiplexers = N No. of SEs in stage i and log N/-= n-i-1 If the number of switches at different stages in an network are not the same then such networks are called Irregular networks. Irregular networks are inherently multi-path in nature but, they may or may not be fault tolerant i nature. We will discuss both the Unique and Multi-path Irregular MINs.

53 Fig..11 Modified FT

54 Chapter Permutation Passabiltiy in MINs.1 Introduction In unique path multistage interconnection networks there is a unique path between a source and a destination. While multipath multistage interconnection networks provide more than one paths from a source to a destination. In multipath multistage interconnection networks, the request is routed through an alternative path, if the most favorable path is not amiable due to the reasons: œ Some switch(es) in that path may be faulty œ Some other request is occupying that path at that instant of time In the multipath MINs a very important performance parameter is the Permutation passability (which means that if a number of requests simultaneously occur at source at a particular moment of time then how many of them successfully mature after being routed through the network). The network tries to send the request firstly through the most favorable path if such path is not available then the request is routed through the alternative path. If there is no alternative path available, the request is dropped. The desirable character of any network is that the network should be such that it allows maximum requests through it with minimum path length. A minimum path length means a request will: œ Take less time to reach destination œ Have less probability of intermediate link failures So problem can be defined as to find permutation passability of regular and irregular networks under fault is calculated. This will show that at a particular moment of time, if a number of requests simultaneously occur at source, how many of them successfully mature will i.e. will reach their destination.

55 . Simulation Analysis The simulation of permutation passability behavior of a network will generate randomly the number of requests that will appear at source side at a particular instant of time. The same numbers of source/destination pairs are formulated. Each request is tried routed through its most favorable path, others have to be routed through an alternative path. In case the former is already busy in serving to some other request, the remaining will have to be simply dropped, due to unavailability of an alternative path between the source and destination. This simulation analysis of permutation passability behavior is checked for 0 times, for each network, taking different number of inputs appearing at source side. Then total number of request matured and average path length is calculated. The regular networks which are simulated are: œ ASEN And the irregular networks are: œ œ Four-Tree Networks (FT) Modified-Four Tree networks The following regular and irregular multistage interconnection networks to be analyzed on the basis of two parameters i.e. number of requests reaching the destination and average path length.. Simulation of Permutation Passability in Regular Networks No. of Requests = N Request Matured = RM Average Path Length = APL

56 N Source/Destination Pairs [,][,][10,1][1,11][,1][,] [,1][1,10][1,1][11,][,] [1,0][11,][0,][1,1][10,0][9,] [,][8,0][1,] [,8][1,1][,11][11,][1,1] [1,1][8,10][10,1] [11,9][,][1,][10,] [0,9] [10,][,1][1,][1,][,1][1,0] [0,0][,][,][8,][1,1] [1,][10,][1,1][1,1][,1][11,9][1,][8,8] [9,11][1,1][,][10,1][11,1][1,11] [1,11][,9][1,1] [,1][1,][11,][0,][10,][,][, 1][9,][1,9][1,1][,] [1,][1,1][0,][,][9,1][,][1, 0][1,1][,][,10] [1,][9,][1,][1,1][,8][,9] [,][1,1][8,1] [10,][8,][,9][,0][,][1,1] [,1][,1][,1][1,1][1,] [,1][1,][10,1][,1][11,] [9,][,8][1,][,10][,8] [9,1][,1][,8][1,][,][8,11] [,8][8,8][11,1][1,][1,1] [1,0][1,][1,] [8,8] [1,9][,9][,1][,9][9,1][,1] [1,0][,0][8,10][11,] [,1][11,][8,][1,1][1,8][,1] [,8][1,] RM APL N Source/Destination Pairs [,][,][10,1][1,11][,1][,] [,1][1,10][1,1][11,][,] [1,0][11,][0,][1,1][10,0][9,] [,][8,0][1,] [,8][1,1][,11][11,][1,1] [1,1][8,10][10,1] [11,9][,][1,][10,] [0,9] [10,][,1][1,][1,][,1][1,0] [0,0][,][,][8,][1,1] [1,][10,][1,1][1,1][,1] [11,9][1,][8,8] [9,11][1,1][,][10,1][11,1][1,11] [1,11][,9][1,1] [,1][1,][11,][0,][10,][,] [,1][9,][1,9][1,1][,] [1,][1,1][0,][,][9,1][,] [1,0][1,1][,][,10] [1,][9,][1,][1,1][,8][,9] [,][1,1][8,1] [10,][8,][,9][,0][,][1,1] [,1][,1][,1][1,1][1,] [,1][1,][10,1][,1][11,] [9,][,8][1,][,10][,8] [9,1][,1][,8][1,][,][8,11] [,8][8,8][11,1][1,][1,1] [1,0][1,][1,] [8,8] [1,9][,9][,1][,9][9,1][,1] [1,0][,0][8,10][11,] [,1][11,][8,][1,1][1,8][,1] [,8][1,] RM APL ASEN network When no switch is failed When switch is failed

This chapter provides the background knowledge about Multistage. multistage interconnection networks are explained. The need, objectives, research

This chapter provides the background knowledge about Multistage. multistage interconnection networks are explained. The need, objectives, research CHAPTER 1 Introduction This chapter provides the background knowledge about Multistage Interconnection Networks. Metrics used for measuring the performance of various multistage interconnection networks

More information

INTERCONNECTION NETWORKS LECTURE 4

INTERCONNECTION NETWORKS LECTURE 4 INTERCONNECTION NETWORKS LECTURE 4 DR. SAMMAN H. AMEEN 1 Topology Specifies way switches are wired Affects routing, reliability, throughput, latency, building ease Routing How does a message get from source

More information

Interconnection networks

Interconnection networks Interconnection networks When more than one processor needs to access a memory structure, interconnection networks are needed to route data from processors to memories (concurrent access to a shared memory

More information

Advanced Parallel Architecture. Annalisa Massini /2017

Advanced Parallel Architecture. Annalisa Massini /2017 Advanced Parallel Architecture Annalisa Massini - 2016/2017 References Advanced Computer Architecture and Parallel Processing H. El-Rewini, M. Abd-El-Barr, John Wiley and Sons, 2005 Parallel computing

More information

Dr e v prasad Dt

Dr e v prasad Dt Dr e v prasad Dt. 12.10.17 Contents Characteristics of Multiprocessors Interconnection Structures Inter Processor Arbitration Inter Processor communication and synchronization Cache Coherence Introduction

More information

Multiprocessor Interconnection Networks- Part Three

Multiprocessor Interconnection Networks- Part Three Babylon University College of Information Technology Software Department Multiprocessor Interconnection Networks- Part Three By The k-ary n-cube Networks The k-ary n-cube network is a radix k cube with

More information

4. Networks. in parallel computers. Advances in Computer Architecture

4. Networks. in parallel computers. Advances in Computer Architecture 4. Networks in parallel computers Advances in Computer Architecture System architectures for parallel computers Control organization Single Instruction stream Multiple Data stream (SIMD) All processors

More information

Physical Organization of Parallel Platforms. Alexandre David

Physical Organization of Parallel Platforms. Alexandre David Physical Organization of Parallel Platforms Alexandre David 1.2.05 1 Static vs. Dynamic Networks 13-02-2008 Alexandre David, MVP'08 2 Interconnection networks built using links and switches. How to connect:

More information

Interconnection Networks: Topology. Prof. Natalie Enright Jerger

Interconnection Networks: Topology. Prof. Natalie Enright Jerger Interconnection Networks: Topology Prof. Natalie Enright Jerger Topology Overview Definition: determines arrangement of channels and nodes in network Analogous to road map Often first step in network design

More information

Lecture 12: Interconnection Networks. Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E)

Lecture 12: Interconnection Networks. Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) Lecture 12: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) 1 Topologies Internet topologies are not very regular they grew

More information

Chapter 8 : Multiprocessors

Chapter 8 : Multiprocessors Chapter 8 Multiprocessors 8.1 Characteristics of multiprocessors A multiprocessor system is an interconnection of two or more CPUs with memory and input-output equipment. The term processor in multiprocessor

More information

Network-on-chip (NOC) Topologies

Network-on-chip (NOC) Topologies Network-on-chip (NOC) Topologies 1 Network Topology Static arrangement of channels and nodes in an interconnection network The roads over which packets travel Topology chosen based on cost and performance

More information

Characteristics of Mult l ip i ro r ce c ssors r

Characteristics of Mult l ip i ro r ce c ssors r Characteristics of Multiprocessors A multiprocessor system is an interconnection of two or more CPUs with memory and input output equipment. The term processor in multiprocessor can mean either a central

More information

Interconnect Technology and Computational Speed

Interconnect Technology and Computational Speed Interconnect Technology and Computational Speed From Chapter 1 of B. Wilkinson et al., PARAL- LEL PROGRAMMING. Techniques and Applications Using Networked Workstations and Parallel Computers, augmented

More information

Lecture: Interconnection Networks

Lecture: Interconnection Networks Lecture: Interconnection Networks Topics: Router microarchitecture, topologies Final exam next Tuesday: same rules as the first midterm 1 Packets/Flits A message is broken into multiple packets (each packet

More information

Interconnection Network

Interconnection Network Interconnection Network Recap: Generic Parallel Architecture A generic modern multiprocessor Network Mem Communication assist (CA) $ P Node: processor(s), memory system, plus communication assist Network

More information

CSC630/CSC730: Parallel Computing

CSC630/CSC730: Parallel Computing CSC630/CSC730: Parallel Computing Parallel Computing Platforms Chapter 2 (2.4.1 2.4.4) Dr. Joe Zhang PDC-4: Topology 1 Content Parallel computing platforms Logical organization (a programmer s view) Control

More information

Lecture 26: Interconnects. James C. Hoe Department of ECE Carnegie Mellon University

Lecture 26: Interconnects. James C. Hoe Department of ECE Carnegie Mellon University 18 447 Lecture 26: Interconnects James C. Hoe Department of ECE Carnegie Mellon University 18 447 S18 L26 S1, James C. Hoe, CMU/ECE/CALCM, 2018 Housekeeping Your goal today get an overview of parallel

More information

EE/CSCI 451: Parallel and Distributed Computation

EE/CSCI 451: Parallel and Distributed Computation EE/CSCI 451: Parallel and Distributed Computation Lecture #11 2/21/2017 Xuehai Qian Xuehai.qian@usc.edu http://alchem.usc.edu/portal/xuehaiq.html University of Southern California 1 Outline Midterm 1:

More information

EE/CSCI 451: Parallel and Distributed Computation

EE/CSCI 451: Parallel and Distributed Computation EE/CSCI 451: Parallel and Distributed Computation Lecture #4 1/24/2018 Xuehai Qian xuehai.qian@usc.edu http://alchem.usc.edu/portal/xuehaiq.html University of Southern California 1 Announcements PA #1

More information

Lecture 2: Topology - I

Lecture 2: Topology - I ECE 8823 A / CS 8803 - ICN Interconnection Networks Spring 2017 http://tusharkrishna.ece.gatech.edu/teaching/icn_s17/ Lecture 2: Topology - I Tushar Krishna Assistant Professor School of Electrical and

More information

MULTIPROCESSORS. Characteristics of Multiprocessors. Interconnection Structures. Interprocessor Arbitration

MULTIPROCESSORS. Characteristics of Multiprocessors. Interconnection Structures. Interprocessor Arbitration MULTIPROCESSORS Characteristics of Multiprocessors Interconnection Structures Interprocessor Arbitration Interprocessor Communication and Synchronization Cache Coherence 2 Characteristics of Multiprocessors

More information

EE/CSCI 451: Parallel and Distributed Computation

EE/CSCI 451: Parallel and Distributed Computation EE/CSCI 451: Parallel and Distributed Computation Lecture #5 1/29/2017 Xuehai Qian Xuehai.qian@usc.edu http://alchem.usc.edu/portal/xuehaiq.html University of Southern California 1 From last class Outline

More information

CS575 Parallel Processing

CS575 Parallel Processing CS575 Parallel Processing Lecture three: Interconnection Networks Wim Bohm, CSU Except as otherwise noted, the content of this presentation is licensed under the Creative Commons Attribution 2.5 license.

More information

Interconnection Network. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

Interconnection Network. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University Interconnection Network Jinkyu Jeong (jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu Topics Taxonomy Metric Topologies Characteristics Cost Performance 2 Interconnection

More information

Interconnection topologies (cont.) [ ] In meshes and hypercubes, the average distance increases with the dth root of N.

Interconnection topologies (cont.) [ ] In meshes and hypercubes, the average distance increases with the dth root of N. Interconnection topologies (cont.) [ 10.4.4] In meshes and hypercubes, the average distance increases with the dth root of N. In a tree, the average distance grows only logarithmically. A simple tree structure,

More information

Scalable crossbar network: a non-blocking interconnection network for large-scale systems

Scalable crossbar network: a non-blocking interconnection network for large-scale systems J Supercomput DOI 10.1007/s11227-014-1319-2 Scalable crossbar network: a non-blocking interconnection network for large-scale systems Fathollah Bistouni Mohsen Jahanshahi Springer Science+Business Media

More information

Lecture 13: Interconnection Networks. Topics: lots of background, recent innovations for power and performance

Lecture 13: Interconnection Networks. Topics: lots of background, recent innovations for power and performance Lecture 13: Interconnection Networks Topics: lots of background, recent innovations for power and performance 1 Interconnection Networks Recall: fully connected network, arrays/rings, meshes/tori, trees,

More information

Parallel Architectures

Parallel Architectures Parallel Architectures CPS343 Parallel and High Performance Computing Spring 2018 CPS343 (Parallel and HPC) Parallel Architectures Spring 2018 1 / 36 Outline 1 Parallel Computer Classification Flynn s

More information

EE/CSCI 451 Midterm 1

EE/CSCI 451 Midterm 1 EE/CSCI 451 Midterm 1 Spring 2018 Instructor: Xuehai Qian Friday: 02/26/2018 Problem # Topic Points Score 1 Definitions 20 2 Memory System Performance 10 3 Cache Performance 10 4 Shared Memory Programming

More information

Interconnection Network

Interconnection Network Interconnection Network Jinkyu Jeong (jinkyu@skku.edu) Computer Systems Laboratory Sungkyunkwan University http://csl.skku.edu SSE3054: Multicore Systems, Spring 2017, Jinkyu Jeong (jinkyu@skku.edu) Topics

More information

Multiprocessors Interconnection Networks

Multiprocessors Interconnection Networks Babylon University College of Information Technology Software Department Multiprocessors Interconnection Networks By Interconnection Networks Taxonomy An interconnection network could be either static

More information

Lecture 3: Topology - II

Lecture 3: Topology - II ECE 8823 A / CS 8803 - ICN Interconnection Networks Spring 2017 http://tusharkrishna.ece.gatech.edu/teaching/icn_s17/ Lecture 3: Topology - II Tushar Krishna Assistant Professor School of Electrical and

More information

A MULTIPROCESSOR SYSTEM. Mariam A. Salih

A MULTIPROCESSOR SYSTEM. Mariam A. Salih A MULTIPROCESSOR SYSTEM Mariam A. Salih Multiprocessors classification. interconnection networks (INs) Mode of Operation Control Strategy switching techniques Topology BUS-BASED DYNAMIC INTERCONNECTION

More information

Lecture 7: Parallel Processing

Lecture 7: Parallel Processing Lecture 7: Parallel Processing Introduction and motivation Architecture classification Performance evaluation Interconnection network Zebo Peng, IDA, LiTH 1 Performance Improvement Reduction of instruction

More information

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks

Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks Performance of Multihop Communications Using Logical Topologies on Optical Torus Networks X. Yuan, R. Melhem and R. Gupta Department of Computer Science University of Pittsburgh Pittsburgh, PA 156 fxyuan,

More information

Lecture 24: Interconnection Networks. Topics: topologies, routing, deadlocks, flow control

Lecture 24: Interconnection Networks. Topics: topologies, routing, deadlocks, flow control Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control 1 Topology Examples Grid Torus Hypercube Criteria Bus Ring 2Dtorus 6-cube Fully connected Performance Bisection

More information

Fault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies. Mohsin Y Ahmed Conlan Wesson

Fault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies. Mohsin Y Ahmed Conlan Wesson Fault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies Mohsin Y Ahmed Conlan Wesson Overview NoC: Future generation of many core processor on a single chip

More information

Interconnection Networks

Interconnection Networks Lecture 17: Interconnection Networks Parallel Computer Architecture and Programming A comment on web site comments It is okay to make a comment on a slide/topic that has already been commented on. In fact

More information

CSE Introduction to Parallel Processing. Chapter 4. Models of Parallel Processing

CSE Introduction to Parallel Processing. Chapter 4. Models of Parallel Processing Dr Izadi CSE-4533 Introduction to Parallel Processing Chapter 4 Models of Parallel Processing Elaborate on the taxonomy of parallel processing from chapter Introduce abstract models of shared and distributed

More information

SMD149 - Operating Systems - Multiprocessing

SMD149 - Operating Systems - Multiprocessing SMD149 - Operating Systems - Multiprocessing Roland Parviainen December 1, 2005 1 / 55 Overview Introduction Multiprocessor systems Multiprocessor, operating system and memory organizations 2 / 55 Introduction

More information

Overview. SMD149 - Operating Systems - Multiprocessing. Multiprocessing architecture. Introduction SISD. Flynn s taxonomy

Overview. SMD149 - Operating Systems - Multiprocessing. Multiprocessing architecture. Introduction SISD. Flynn s taxonomy Overview SMD149 - Operating Systems - Multiprocessing Roland Parviainen Multiprocessor systems Multiprocessor, operating system and memory organizations December 1, 2005 1/55 2/55 Multiprocessor system

More information

Lecture 7: Parallel Processing

Lecture 7: Parallel Processing Lecture 7: Parallel Processing Introduction and motivation Architecture classification Performance evaluation Interconnection network Zebo Peng, IDA, LiTH 1 Performance Improvement Reduction of instruction

More information

The Effect of Adaptivity on the Performance of the OTIS-Hypercube under Different Traffic Patterns

The Effect of Adaptivity on the Performance of the OTIS-Hypercube under Different Traffic Patterns The Effect of Adaptivity on the Performance of the OTIS-Hypercube under Different Traffic Patterns H. H. Najaf-abadi 1, H. Sarbazi-Azad 2,1 1 School of Computer Science, IPM, Tehran, Iran. 2 Computer Engineering

More information

Topologies. Maurizio Palesi. Maurizio Palesi 1

Topologies. Maurizio Palesi. Maurizio Palesi 1 Topologies Maurizio Palesi Maurizio Palesi 1 Network Topology Static arrangement of channels and nodes in an interconnection network The roads over which packets travel Topology chosen based on cost and

More information

ECE 4750 Computer Architecture, Fall 2017 T06 Fundamental Network Concepts

ECE 4750 Computer Architecture, Fall 2017 T06 Fundamental Network Concepts ECE 4750 Computer Architecture, Fall 2017 T06 Fundamental Network Concepts School of Electrical and Computer Engineering Cornell University revision: 2017-10-17-12-26 1 Network/Roadway Analogy 3 1.1. Running

More information

Homework Assignment #1: Topology Kelly Shaw

Homework Assignment #1: Topology Kelly Shaw EE482 Advanced Computer Organization Spring 2001 Professor W. J. Dally Homework Assignment #1: Topology Kelly Shaw As we have not discussed routing or flow control yet, throughout this problem set assume

More information

Three parallel-programming models

Three parallel-programming models Three parallel-programming models Shared-memory programming is like using a bulletin board where you can communicate with colleagues. essage-passing is like communicating via e-mail or telephone calls.

More information

Scalability and Classifications

Scalability and Classifications Scalability and Classifications 1 Types of Parallel Computers MIMD and SIMD classifications shared and distributed memory multicomputers distributed shared memory computers 2 Network Topologies static

More information

Parallel Systems Prof. James L. Frankel Harvard University. Version of 6:50 PM 4-Dec-2018 Copyright 2018, 2017 James L. Frankel. All rights reserved.

Parallel Systems Prof. James L. Frankel Harvard University. Version of 6:50 PM 4-Dec-2018 Copyright 2018, 2017 James L. Frankel. All rights reserved. Parallel Systems Prof. James L. Frankel Harvard University Version of 6:50 PM 4-Dec-2018 Copyright 2018, 2017 James L. Frankel. All rights reserved. Architectures SISD (Single Instruction, Single Data)

More information

Detection and Removal of Black Hole Attack in Mobile Ad hoc Network

Detection and Removal of Black Hole Attack in Mobile Ad hoc Network Detection and Removal of Black Hole Attack in Mobile Ad hoc Network Harmandeep Kaur, Mr. Amarvir Singh Abstract A mobile ad hoc network consists of large number of inexpensive nodes which are geographically

More information

Fundamentals of Networking Types of Topologies

Fundamentals of Networking Types of Topologies Fundamentals of Networking Types of Topologies Kuldeep Sonar 1 Bus Topology Bus topology is a network type in which every computer and network device is connected to single cable. When it has exactly two

More information

Introduction to Multiprocessors (Part I) Prof. Cristina Silvano Politecnico di Milano

Introduction to Multiprocessors (Part I) Prof. Cristina Silvano Politecnico di Milano Introduction to Multiprocessors (Part I) Prof. Cristina Silvano Politecnico di Milano Outline Key issues to design multiprocessors Interconnection network Centralized shared-memory architectures Distributed

More information

Interconnection Networks. Issues for Networks

Interconnection Networks. Issues for Networks Interconnection Networks Communications Among Processors Chris Nevison, Colgate University Issues for Networks Total Bandwidth amount of data which can be moved from somewhere to somewhere per unit time

More information

CS252 Graduate Computer Architecture Lecture 14. Multiprocessor Networks March 9 th, 2011

CS252 Graduate Computer Architecture Lecture 14. Multiprocessor Networks March 9 th, 2011 CS252 Graduate Computer Architecture Lecture 14 Multiprocessor Networks March 9 th, 2011 John Kubiatowicz Electrical Engineering and Computer Sciences University of California, Berkeley http://www.eecs.berkeley.edu/~kubitron/cs252

More information

Interconnection Networks: Routing. Prof. Natalie Enright Jerger

Interconnection Networks: Routing. Prof. Natalie Enright Jerger Interconnection Networks: Routing Prof. Natalie Enright Jerger Routing Overview Discussion of topologies assumed ideal routing In practice Routing algorithms are not ideal Goal: distribute traffic evenly

More information

Static Interconnection Networks Prof. Kasim M. Al-Aubidy Computer Eng. Dept.

Static Interconnection Networks Prof. Kasim M. Al-Aubidy Computer Eng. Dept. Advanced Computer Architecture (0630561) Lecture 17 Static Interconnection Networks Prof. Kasim M. Al-Aubidy Computer Eng. Dept. INs Taxonomy: An IN could be either static or dynamic. Connections in a

More information

Chapter 9 Multiprocessors

Chapter 9 Multiprocessors ECE200 Computer Organization Chapter 9 Multiprocessors David H. lbonesi and the University of Rochester Henk Corporaal, TU Eindhoven, Netherlands Jari Nurmi, Tampere University of Technology, Finland University

More information

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2

B.H.GARDI COLLEGE OF ENGINEERING & TECHNOLOGY (MCA Dept.) Parallel Database Database Management System - 2 Introduction :- Today single CPU based architecture is not capable enough for the modern database that are required to handle more demanding and complex requirements of the users, for example, high performance,

More information

Lecture 2 Parallel Programming Platforms

Lecture 2 Parallel Programming Platforms Lecture 2 Parallel Programming Platforms Flynn s Taxonomy In 1966, Michael Flynn classified systems according to numbers of instruction streams and the number of data stream. Data stream Single Multiple

More information

Recall: The Routing problem: Local decisions. Recall: Multidimensional Meshes and Tori. Properties of Routing Algorithms

Recall: The Routing problem: Local decisions. Recall: Multidimensional Meshes and Tori. Properties of Routing Algorithms CS252 Graduate Computer Architecture Lecture 16 Multiprocessor Networks (con t) March 14 th, 212 John Kubiatowicz Electrical Engineering and Computer Sciences University of California, Berkeley http://www.eecs.berkeley.edu/~kubitron/cs252

More information

6. Parallel Volume Rendering Algorithms

6. Parallel Volume Rendering Algorithms 6. Parallel Volume Algorithms This chapter introduces a taxonomy of parallel volume rendering algorithms. In the thesis statement we claim that parallel algorithms may be described by "... how the tasks

More information

Fundamentals of. Parallel Computing. Sanjay Razdan. Alpha Science International Ltd. Oxford, U.K.

Fundamentals of. Parallel Computing. Sanjay Razdan. Alpha Science International Ltd. Oxford, U.K. Fundamentals of Parallel Computing Sanjay Razdan Alpha Science International Ltd. Oxford, U.K. CONTENTS Preface Acknowledgements vii ix 1. Introduction to Parallel Computing 1.1-1.37 1.1 Parallel Computing

More information

Networks, Routers and Transputers:

Networks, Routers and Transputers: This is Chapter 1 from the second edition of : Networks, Routers and Transputers: Function, Performance and applications Edited M.D. by: May, P.W. Thompson, and P.H. Welch INMOS Limited 1993 This edition

More information

Introduction. The fundamental purpose of data communications is to exchange information between user's computers, terminals and applications programs.

Introduction. The fundamental purpose of data communications is to exchange information between user's computers, terminals and applications programs. Introduction The fundamental purpose of data communications is to exchange information between user's computers, terminals and applications programs. Simplified Communications System Block Diagram Intro-1

More information

EE/CSCI 451 Spring 2018 Homework 2 Assigned: February 7, 2018 Due: February 14, 2018, before 11:59 pm Total Points: 100

EE/CSCI 451 Spring 2018 Homework 2 Assigned: February 7, 2018 Due: February 14, 2018, before 11:59 pm Total Points: 100 EE/CSCI 45 Spring 08 Homework Assigned: February 7, 08 Due: February 4, 08, before :59 pm Total Points: 00 [0 points] Explain the following terms:. Diameter of a network. Bisection width of a network.

More information

Parallel Architecture. Sathish Vadhiyar

Parallel Architecture. Sathish Vadhiyar Parallel Architecture Sathish Vadhiyar Motivations of Parallel Computing Faster execution times From days or months to hours or seconds E.g., climate modelling, bioinformatics Large amount of data dictate

More information

Unit 9 : Fundamentals of Parallel Processing

Unit 9 : Fundamentals of Parallel Processing Unit 9 : Fundamentals of Parallel Processing Lesson 1 : Types of Parallel Processing 1.1. Learning Objectives On completion of this lesson you will be able to : classify different types of parallel processing

More information

Design of a System-on-Chip Switched Network and its Design Support Λ

Design of a System-on-Chip Switched Network and its Design Support Λ Design of a System-on-Chip Switched Network and its Design Support Λ Daniel Wiklund y, Dake Liu Dept. of Electrical Engineering Linköping University S-581 83 Linköping, Sweden Abstract As the degree of

More information

TDT Appendix E Interconnection Networks

TDT Appendix E Interconnection Networks TDT 4260 Appendix E Interconnection Networks Review Advantages of a snooping coherency protocol? Disadvantages of a snooping coherency protocol? Advantages of a directory coherency protocol? Disadvantages

More information

Multi-Processor / Parallel Processing

Multi-Processor / Parallel Processing Parallel Processing: Multi-Processor / Parallel Processing Originally, the computer has been viewed as a sequential machine. Most computer programming languages require the programmer to specify algorithms

More information

CS 498 Hot Topics in High Performance Computing. Networks and Fault Tolerance. 9. Routing and Flow Control

CS 498 Hot Topics in High Performance Computing. Networks and Fault Tolerance. 9. Routing and Flow Control CS 498 Hot Topics in High Performance Computing Networks and Fault Tolerance 9. Routing and Flow Control Intro What did we learn in the last lecture Topology metrics Including minimum diameter of directed

More information

Lecture 12: Interconnection Networks. Topics: dimension/arity, routing, deadlock, flow control

Lecture 12: Interconnection Networks. Topics: dimension/arity, routing, deadlock, flow control Lecture 12: Interconnection Networks Topics: dimension/arity, routing, deadlock, flow control 1 Interconnection Networks Recall: fully connected network, arrays/rings, meshes/tori, trees, butterflies,

More information

SHARED MEMORY VS DISTRIBUTED MEMORY

SHARED MEMORY VS DISTRIBUTED MEMORY OVERVIEW Important Processor Organizations 3 SHARED MEMORY VS DISTRIBUTED MEMORY Classical parallel algorithms were discussed using the shared memory paradigm. In shared memory parallel platform processors

More information

Chapter 3 : Topology basics

Chapter 3 : Topology basics 1 Chapter 3 : Topology basics What is the network topology Nomenclature Traffic pattern Performance Packaging cost Case study: the SGI Origin 2000 2 Network topology (1) It corresponds to the static arrangement

More information

A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing

A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing 727 A Dynamic NOC Arbitration Technique using Combination of VCT and XY Routing 1 Bharati B. Sayankar, 2 Pankaj Agrawal 1 Electronics Department, Rashtrasant Tukdoji Maharaj Nagpur University, G.H. Raisoni

More information

Extended Junction Based Source Routing Technique for Large Mesh Topology Network on Chip Platforms

Extended Junction Based Source Routing Technique for Large Mesh Topology Network on Chip Platforms Extended Junction Based Source Routing Technique for Large Mesh Topology Network on Chip Platforms Usman Mazhar Mirza Master of Science Thesis 2011 ELECTRONICS Postadress: Besöksadress: Telefon: Box 1026

More information

Module 16: Distributed System Structures

Module 16: Distributed System Structures Chapter 16: Distributed System Structures Module 16: Distributed System Structures Motivation Types of Network-Based Operating Systems Network Structure Network Topology Communication Structure Communication

More information

Outline. Distributed Shared Memory. Shared Memory. ECE574 Cluster Computing. Dichotomy of Parallel Computing Platforms (Continued)

Outline. Distributed Shared Memory. Shared Memory. ECE574 Cluster Computing. Dichotomy of Parallel Computing Platforms (Continued) Cluster Computing Dichotomy of Parallel Computing Platforms (Continued) Lecturer: Dr Yifeng Zhu Class Review Interconnections Crossbar» Example: myrinet Multistage» Example: Omega network Outline Flynn

More information

CS 6143 COMPUTER ARCHITECTURE II SPRING 2014

CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 DUE : April 9, 2014 HOMEWORK IV READ : - Related portions of Chapter 5 and Appendces F and I of the Hennessy book - Related portions of Chapter 1, 4 and 6 of

More information

MIMD Overview. Intel Paragon XP/S Overview. XP/S Usage. XP/S Nodes and Interconnection. ! Distributed-memory MIMD multicomputer

MIMD Overview. Intel Paragon XP/S Overview. XP/S Usage. XP/S Nodes and Interconnection. ! Distributed-memory MIMD multicomputer MIMD Overview Intel Paragon XP/S Overview! MIMDs in the 1980s and 1990s! Distributed-memory multicomputers! Intel Paragon XP/S! Thinking Machines CM-5! IBM SP2! Distributed-memory multicomputers with hardware

More information

Concurrent/Parallel Processing

Concurrent/Parallel Processing Concurrent/Parallel Processing David May: April 9, 2014 Introduction The idea of using a collection of interconnected processing devices is not new. Before the emergence of the modern stored program computer,

More information

Packet Switch Architecture

Packet Switch Architecture Packet Switch Architecture 3. Output Queueing Architectures 4. Input Queueing Architectures 5. Switching Fabrics 6. Flow and Congestion Control in Sw. Fabrics 7. Output Scheduling for QoS Guarantees 8.

More information

Packet Switch Architecture

Packet Switch Architecture Packet Switch Architecture 3. Output Queueing Architectures 4. Input Queueing Architectures 5. Switching Fabrics 6. Flow and Congestion Control in Sw. Fabrics 7. Output Scheduling for QoS Guarantees 8.

More information

Understanding the Routing Requirements for FPGA Array Computing Platform. Hayden So EE228a Project Presentation Dec 2 nd, 2003

Understanding the Routing Requirements for FPGA Array Computing Platform. Hayden So EE228a Project Presentation Dec 2 nd, 2003 Understanding the Routing Requirements for FPGA Array Computing Platform Hayden So EE228a Project Presentation Dec 2 nd, 2003 What is FPGA Array Computing? Aka: Reconfigurable Computing Aka: Spatial computing,

More information

Reliability Analysis of Modified Irregular Augmented Shuffle Exchange Network (MIASEN)

Reliability Analysis of Modified Irregular Augmented Shuffle Exchange Network (MIASEN) www.ijcsi.org https://doi.org/10.20943/01201703.5964 59 Reliability Analysis of Modified Irregular Augmented Shuffle Exchange Network (MIASEN) Shobha Arya 1 and Nipur Singh 2 1,2 Department of Computer

More information

Achieving Distributed Buffering in Multi-path Routing using Fair Allocation

Achieving Distributed Buffering in Multi-path Routing using Fair Allocation Achieving Distributed Buffering in Multi-path Routing using Fair Allocation Ali Al-Dhaher, Tricha Anjali Department of Electrical and Computer Engineering Illinois Institute of Technology Chicago, Illinois

More information

Problem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets.

Problem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Clock Routing Problem Formulation Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Better to develop specialized routers for these nets.

More information

A Hybrid Interconnection Network for Integrated Communication Services

A Hybrid Interconnection Network for Integrated Communication Services A Hybrid Interconnection Network for Integrated Communication Services Yi-long Chen Northern Telecom, Inc. Richardson, TX 7583 kchen@nortel.com Jyh-Charn Liu Department of Computer Science, Texas A&M Univ.

More information

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions

Data Partitioning. Figure 1-31: Communication Topologies. Regular Partitions Data In single-program multiple-data (SPMD) parallel programs, global data is partitioned, with a portion of the data assigned to each processing node. Issues relevant to choosing a partitioning strategy

More information

Resource Deadlocks and Performance of Wormhole Multicast Routing Algorithms

Resource Deadlocks and Performance of Wormhole Multicast Routing Algorithms IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 9, NO. 6, JUNE 1998 535 Resource Deadlocks and Performance of Wormhole Multicast Routing Algorithms Rajendra V. Boppana, Member, IEEE, Suresh

More information

Introduction to Mobile Ad hoc Networks (MANETs)

Introduction to Mobile Ad hoc Networks (MANETs) Introduction to Mobile Ad hoc Networks (MANETs) 1 Overview of Ad hoc Network Communication between various devices makes it possible to provide unique and innovative services. Although this inter-device

More information

IV. PACKET SWITCH ARCHITECTURES

IV. PACKET SWITCH ARCHITECTURES IV. PACKET SWITCH ARCHITECTURES (a) General Concept - as packet arrives at switch, destination (and possibly source) field in packet header is used as index into routing tables specifying next switch in

More information

CS 258, Spring 99 David E. Culler Computer Science Division U.C. Berkeley Wide links, smaller routing delay Tremendous variation 3/19/99 CS258 S99 2

CS 258, Spring 99 David E. Culler Computer Science Division U.C. Berkeley Wide links, smaller routing delay Tremendous variation 3/19/99 CS258 S99 2 Real Machines Interconnection Network Topology Design Trade-offs CS 258, Spring 99 David E. Culler Computer Science Division U.C. Berkeley Wide links, smaller routing delay Tremendous variation 3/19/99

More information

JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS

JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS 1 JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS Shabnam Badri THESIS WORK 2011 ELECTRONICS JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS

More information

Data Communication. Chapter # 1: Introduction. By: William Stalling

Data Communication. Chapter # 1: Introduction. By: William Stalling Data Communication Chapter # 1: By: Introduction William Stalling Data Communication The exchange of data between two devices via some form of transmission medium such as cable wire. For data communications

More information

Importance of Interoperability in High Speed Seamless Redundancy (HSR) Communication Networks

Importance of Interoperability in High Speed Seamless Redundancy (HSR) Communication Networks Importance of Interoperability in High Speed Seamless Redundancy (HSR) Communication Networks Richard Harada Product Manager RuggedCom Inc. Introduction Reliable and fault tolerant high speed communication

More information

Multiprocessor Interconnection Networks

Multiprocessor Interconnection Networks Multiprocessor Interconnection Networks Todd C. Mowry CS 740 November 19, 1998 Topics Network design space Contention Active messages Networks Design Options: Topology Routing Direct vs. Indirect Physical

More information

Topology basics. Constraints and measures. Butterfly networks.

Topology basics. Constraints and measures. Butterfly networks. EE48: Advanced Computer Organization Lecture # Interconnection Networks Architecture and Design Stanford University Topology basics. Constraints and measures. Butterfly networks. Lecture #: Monday, 7 April

More information

Multiple Processor Systems. Lecture 15 Multiple Processor Systems. Multiprocessor Hardware (1) Multiprocessors. Multiprocessor Hardware (2)

Multiple Processor Systems. Lecture 15 Multiple Processor Systems. Multiprocessor Hardware (1) Multiprocessors. Multiprocessor Hardware (2) Lecture 15 Multiple Processor Systems Multiple Processor Systems Multiprocessors Multicomputers Continuous need for faster computers shared memory model message passing multiprocessor wide area distributed

More information