Centralized (Indirect) switching networks. Computer Architecture AMANO, Hideharu

Similar documents
Cloud Connector 徹底解説. 多様な基盤への展開を可能にするための Citrix Cloud のキーコンポーネント A-5 セールスエンジニアリング本部パートナー SE 部リードシステムズエンジニア. 哲司 (Satoshi Komiyama) Citrix

Snoop cache. AMANO, Hideharu, Keio University Textbook pp.40-60

今日の予定 1. 展開図の基礎的な知識 1. 正多面体の共通の展開図. 2. 複数の箱が折れる共通の展開図 :2 時間目 3. Rep-Cube: 最新の話題 4. 正多面体に近い立体と正 4 面体の共通の展開図 5. ペタル型の紙で折るピラミッド型 :2 時間目 ~3 時間目

携帯電話の 吸収率 (SAR) について / Specific Absorption Rate (SAR) of Mobile Phones

携帯電話の 吸収率 (SAR) について / Specific Absorption Rate (SAR) of Mobile Phones

Lecture 4 Branch & cut algorithm


Relaxed Consistency models and software distributed memory. Computer Architecture Textbook pp.79-83

DürrConnect the clever connection. The quick connection with the Click

フラクタル 1 ( ジュリア集合 ) 解説 : ジュリア集合 ( 自己平方フラクタル ) 入力パラメータの例 ( 小さな数値の変化で模様が大きく変化します. Ar や Ai の数値を少しずつ変化させて描画する. ) プログラムコード. 2010, AGU, M.

JASCO-HPLC Operating Manual. (Analytical HPLC)

Preparing Information Design-Oriented. Posters. easy to. easy to. See! Understand! easy to. Convey!

Synchronization with shared memory. AMANO, Hideharu Textbook pp.60-68

API サーバの URL. <?xml version="1.0" encoding="utf-8"?> <!DOCTYPE COMPLIANCE_SCAN SYSTEM "

Googleの強みは ささえるのは世界一のインフラ. Google File System 2008年度後期 情報システム構成論2 第10回 クラウドと協調フィルタリング. 初期(1999年)の Googleクラスタ. 最近のデータセンタ Google Chrome Comicより

A. 展開図とそこから折れる凸立体の研究 1. 複数の箱が折れる共通の展開図 2 通りの箱が折れる共通の展開図 3 通りの箱が折れる共通の展開図そして. 残された未解決問題たち 2. 正多面体の共通の展開図 3. 正多面体に近い立体と正 4 面体の共通の展開図 ( 予備 )

Androidプログラミング 2 回目 迫紀徳

Introduction to Information and Communication Technology (a)

Yamaha Steinberg USB Driver V for Mac Release Notes

アルゴリズムの設計と解析 (W4022) 教授 : 黄潤和 広野史明 (A4/A8)

UB-U01III/U02III/U03II User s Manual

本書について... 7 本文中の表記について... 7 マークについて... 7 MTCE をインストールする前に... 7 ご注意... 7 推奨 PC 仕様... 8 MTCE をインストールする... 9 MTCE をアンインストールする... 11

Chapter 1 Videos Lesson 61 Thrillers are scary ~Reading~

電脳梁山泊烏賊塾 構造体のサイズ. Visual Basic

Certificate of Accreditation

楽天株式会社楽天技術研究所 Autumn The Seasar Foundation and the others all rights reserved.

Methods to Detect Malicious MS Document File using File Structure Inspection

Unofficial Redmine Cooking - QA #782 yaml_db を使った DB のマイグレーションで失敗する

Computer Programming I (Advanced)

Zabbix ログ解析方法. 2018/2/14 サイバートラスト株式会社 Linux/OSS 事業部技術統括部花島タケシ. Copyright Cybertrust Japan Co., Ltd. All rights reserved.

PSLT Adobe Typekit Service (2016v1.1)

サーブレットと Android との連携. Generated by Foxit PDF Creator Foxit Software For evaluation only.

Rechargeable LED Work Light

MySQL Cluster 7.3 リリース記念!! 5 分で作る MySQL Cluster 環境

J の Lab システムの舞台裏 - パワーポイントはいらない -

マルチビットアップセット耐性及びシングルビットアップセット耐性を備えた

WD/CD/DIS/FDIS stage

さまざまなニーズにお応えできるラインナップ!

Yamaha Steinberg USB Driver V for Windows Release Notes

IP Network Technology

Peering 101. August 2017 TPF. Walt Wollny, Director Interconnection Strategy Hurricane Electric AS6939

Yamaha Steinberg USB Driver V for Windows Release Notes

PRODUCT DESCRIPTIONS AND METRICS

IRS16: 4 byte ASN. Version: 1.0 Date: April 22, 2008 Cisco Systems 2008 Cisco, Inc. All rights reserved. Cisco Systems Japan

暗い Lena トーンマッピング とは? 明るい Lena. 元の Lena. tone mapped. image. original. image. tone mapped. tone mapped image. image. original image. original.

2. 集団の注目位置推定 提案手法では 複数の人物が同一の対象を注視している状況 置 を推定する手法を検討する この状況下では 図 1 のよう. 顔画像からそれぞれの注目位置を推定する ただし f は 1 枚 この仮説に基づいて 複数の人物を同時に撮影した低解像度顔

Studies of Large-Scale Data Visualization: EXTRAWING and Visual Data Mining

Saki is a Japanese high school student who/ has just started to study/ in the US.//

Certificate of Accreditation

Dream the sky. Make it yours.

サンプル. NI TestStand TM I: Introduction Course Manual

Online Meetings with Zoom

Infrared Data Association Trademark and Brand Guidelines

Quick Install Guide. Adaptec SCSI RAID 2120S Controller

Industrial Solar Power PoE Switch

MathWorks Products and Prices Japan September 2016

TestsDumps. Latest Test Dumps for IT Exam Certification

Quick Installation Manual

URL IO オブジェクト指向プログラミング特論 2018 年度只木進一 : 工学系研究科

OPTICAL TALK SET 光トークセット MODEL 415/430/450/450XL INSTRUCTION MANUAL 取扱説明書

NI TB Introduction. Conventions INSTALLATION INSTRUCTIONS Wire Terminal Block for the NI PXI-2529

Software Prototyping ( プロトタイピング ) Animating and demonstrating system requirements

Agilent. IO Libraries Suite 16.3/16.2 簡易取扱説明書. [ IO Libraries Suite 最新版 ]

DSK8AD1DA. 8ch A/D & 1ch D/A for DSK/EVM.

~ ソフトウエア認証への取り組みと課題 ~

Interdomain Routing Security Workshop 21 BGP, 4 Bytes AS. Brocade Communications Systems, K.K.

BMW Head Up Display (HUD) Teardown BMW ヘッドアップディスプレイティアダウン

PCIe SSD PACC EP P3700 Intel Solid-State Drive Data Center Tool

Motion Path Searches for Maritime Robots

振込依頼書記入要領 Entry Guide for Direct Deposit Request Form

びんぼうでいいの による LCD シールド aitendo UL024TF のオシロスコープモード試験

BraindumpStudy. BraindumpStudy Exam Dumps, High Pass Rate!

Interconnection Networks: Topology. Prof. Natalie Enright Jerger

PNRGOV/Ver11.1/ 旅客氏名表予約情報報告 (PNR01)

Ritsu-Mate Registration Manual (for Undergraduate Programs)

NUC and its Applications

Video Annotation and Retrieval Using Vague Shot Intervals

Vehicle Calibration Techniques Established and Substantiated for Motorcycles

Unified System Management Technology for Data Centres

Nonfinancial Reporting Track:03 Providing non-financial information to reporters, analysts and asset managers; the EDINET Case

IPv6 関連 WG の状況 (6man, v6ops, softwire)

tp.responsewriter, r *http.request) { /* Hmmm, I wonder if this main */ hosttokens := strings.split(r.host, ":"); if len(hosttok

MetaSMIL : A Description Language for Dynamic Integration of Multimedia Content

2007/10/17 ソフトウェア基礎課題布広. /** * SOFT * AppletTest01.java * 文字表示 */

Modern editor-independent development environment for PHP

SteelEye Protection Suite for Linux

Operational Precaution

Web Billing User Guide

Analysis on the Multi-stakeholder Structure in the IGF Discussions

トランスポートレイヤ技術. - TCP; Transmission Control Protocol -

TOOLS for MR V1.7.7 for Mac Release Notes

PGroonga 2. Make PostgreSQL rich full text search system backend!

MITSUBISHI Mitsubishi Industrial Robot

7zip android. 7zip android

Kb xp download. Мобильный портал WAP версия: wap.altmaster.ru

Book Template. GGerry.

2-H Windows on AWS. Amazon Web Services. 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Manufacturing that s good for people and good for the environment

4. 今回のプログラム 4.2 解決のクラス SearchNumber.java

present on a local docker container to Kong. This special event is open only to Cisco Customers and Partners. This project contains sample code to be

Transcription:

Centralized (Indirect) switching networks Computer Architecture AMANO, Hideharu Textbook pp.92~130

Centralized interconnection networks Symmetric: MIN (Multistage Interconnection Networks) Each node is connected with equal latency and bandwidth Asymmetric: Fat-tree, base-m n-cube, etc. Locality of communication can be used.

Properties of MIN Throughput for random communication Permutation capability Partition capability Fault tolerance Routing

MIN(Multistage Interconnection Network) Multistage connected switching elements form a large switch. Symmetric Smaller number of cross-points, high degree of expandability Bandwidth is often degraded Latency is stretched

Classification of MIN Blocking network:conflict may occur for destination is different:nlogn type standard MIN,πnetwork, Re-arrangeable:Conflict free scheduling is possible:benes network Clos network (rearrangeable configuration) Non-blocking:Conflict free without scheduling:clos network (non-blocking configuration) Batcher-Banyan network

Blocking Networks Standard NlogN networks Omega network Generalized Cube Baseline Pass through ratio (throughput) is the same. Π network

Omega network The number of switching element (2x2, in this case) is 1/2NxLogN

Perfect Shuffle Rotate to left Inverse Shuffle Rotate to right

Destination Routing Check the destination tag from MSB If 0 use upper link, else use lower link. 1 3 5 6 0 1 1 1 1 0

Blocking Property 0 0 4 2 X For different destination, multiple paths conflict

For using large switching elements (Delta network) 00 01 10 11 0 1 2 3 2 Shuffle connection is also used. 00 01 10 11 20 21 30 31 0 1 2 3 1 20 21 30 31 In the current art of technology, 8x8 (4x4) crossbars are advantageous.

Omega network The same connection is used for all stages. Destination routing A lot of useful permutations are available. Problems on partitioning and expandability.

Generalized Cube Links labeled with 1bit distance are connected to the same switching element.

Routing in Generalized Cube 0 1 0 The source label and destination label is compared (Ex-Or): Same(0):Straight Different (1):Exchange

Partitioning The communication in the upper half never disturbs the lower half.

Expandability A size of network can be used as an element of larger size networks

Generalized Cube Destination routing cannot be applied. The routing tag is generated by exclusive or of source label and destination label. Partitioning Expandability

Baseline Network 3bit shuffle 2bit shuffle The area of shuffling is changed.

Destination Routing in Baseline network 1 Just like Omega network 1 0

Partitioning in Baseline

Baseline network Providing both benefits of Omega and Generalized Cube Destination Routing Partitioning Expandability Used in NEC s Cenju

Quiz Assume that 2x2 crossbar Is used for a switching element of 32inputs Omega network. For making the calculation simple, only 1bit is used for each input. Calculate the number of cross-points used in the network, and compare with 32inputs crossbar switch.

Π network Tandem connection of two Omega networks

Bit reversal permutation (Used in FFT) 0 1 2 3 0 4 2 6 4 5 6 7 1 5 3 7 Conflicts occur in Omega network.

Bit reversal permutation in Π network 0 5 2 7 0 4 2 6 1 4 3 6 1 5 3 7 The first Omega:Upper input has priority. The next Omega:Destination Routing Conflict free

Permutation capacity All possible permutation is conflict free = Rearrangeable networks Three tandem connection of Omega network is rearrangeable. The tandem connection of Omega and Inverse Omega (Baseline and Inverse Baseline) is rearrangeable. Benes network

Benes Network Note that the center of stage is shared. The rearrangeable network with the smallest hardware requirement.

Non-blocking network Clos network m>n1+n2-1: Non-blocking m>=n2: Rearrangeable Else: Blocking

Clos network n1xm r1xr2 mxn2 3-stage...... r1 m r2 m=n1+n2-1: Non-blocking m=n2:rearrangeable m<n2:blocking The number of intermediate stage dominates the permutation capability.

Batcher network 5 7 0 4 5 7 4 0 0 4 5 7 0 1 2 3 2 1 3 6 1 2 6 3 6 3 2 1 4 5 6 7 Bitonic sorting network

Batcher-Banyan Omega Baseline 5 7 0 4 5 7 4 0 0 4 5 7 0 1 2 3 2 1 3 6 1 2 6 3 6 3 2 1 4 5 6 7 Sorted input is conflict free in the banyan network

Banyan networks Only a path is provided between source and destination. The number of intermediate stages is flexible. Approach from graph theory SW-Banyan,CC-Banyan,Barrel Shifter Irregular structure is allowed.

Batcher-banyan If there are multiple packets to the same destination, the conflict free condition is broken The other packets may conflict. The extension of banyan network is required. The number of stages is large. Large pass through time The structure of sorting network is simple.

Classification of MINs Clos Omega Banyan Benes Batcher Banyan Nonblocking Baseline Generalized Cube π Rearrageble Blocking

Fault tolerant MINs Multiple paths Redundant structure is required. On-the-fly fault recovery is difficult. Improving chip yield.

Extra Stage Cube (ESC) X An extra stage+bypass mechanism If there is a fault on stages or links, another path is used.

The buffer in switching element Conflicting packets are stored into buffers.

Hot spot contention Hot spot Buffer is saturated in the figure of tree (Tree Saturation)

Relaxing the hot spot contention Wormhole routing with Virtual channels Direct network Message Combining Multiple packets are combining to a packet inside a switching element (IBM RP3) Implementation is difficult (Implemented in SNAIL)

Other issues in MINs MIN with cache control mechanism Directory on MIN Cache Controller on MIN MINs with U-turn path Fat tree

Glossary 1 Rearrange-able: スケジュールすることにより 出力が重ならなければ内部で衝突しないようにできる構成 Perfect shuffle: シャッフルは トランプの札を切る時に使う単語だが ここでは 配線のつなぎ方の方式のひとつ Inverse shuffle は逆シャッフルと呼ばれ 逆接続方式 Destination routing: 目的地のラベルだけで経路を決める方法 Permutation: 並び替え 順列のことだが ここでは目的地ラベルが重ならない経路を無衝突で生成することができる能力のこと Partitioning: ネットワークを分離して独立に使える能力のこと Fault tolerance: 耐故障性 一部が故障しても全体がダウンしないような性質 Fault tolerant MIN は複数経路を持たせた MIN Expandability: 拡張性 小さなものからサイズを大きくしていくことのできる性質 Hot spot contention: 局所的に交信が集中して これが全体に波及すること Tree saturation: Hot spot contention によりネットワークが木の形で飽和していく現象 特に MIN で起きる Message Combining は メッセージをくっつけてまとめることによりこれを防止する方法の一つ

Summary Recently, practical new topologies are not proposed. A lot of made-in-japan networks Asymmetric indirect networks will be widely used.

Centralized interconnection networks Symmetric: MIN (Multistage Interconnection Networks) Each node is connected with equal latency and bandwidth Asymmetric: Fat-tree, base-m n-cube, etc. Locality of communication can be used.

Asymmetric indirect networks Intermediate position between direct and indirect networks High communication capability considering cost base-m n-cube(hyper crossbar) SR2 CP-PACS Fat Tree CM-5,Some WS Clusters Hyper-cross ADENART

base-m n-cube (Hyper crossbar) crossbar router PU Used in Toshiba s Prodigy and Hitachi s SR8

HyperCross (pi,pj) (pj,*),(*,pi) 0,0 0,3 Xbar Xbar Xbar 3,0 3,3 Used in ADENART by Matsushita

Fat Tree Used in CM-5 and PC Clusters( QsNet, Autonet ) Myrinet-Clos is actually a type of Fat-tree

Return paths with MIN = Fat tree

Myrinet-Clos(1/2) 128nodes(Clos128)

Clos64+64

Myrinet-Clos(2/2) 512nodes

Dragonfly global channels Intra-group int. network R0 R1... Rn-1 terminal channels Inter-group int. network G0 G1 Gg P0 P1 Pk-1 Pk Pk+1 P2k-1 PN-k-1PN-k PN-1

G1 G2 G8 An example of Dragonfly (N=72) G0 R0 R1 R2 R3 The interconnection of this part can be Flatten Butterfly P0 P1 P2 P3 P4 P5 P6 P7

k-ary n-cube vs. high radix Sorry. This figure is not based on accurate data. supercomputers k-ary n-cube high radix clusters/data centers high radix NoCs k-ary n-cube Now, they dominate the world of Interconnection Networks

Exercise Every path between source and destination is determined with the destination routing in Omega network. Prove (or explain) the above theory in Omega network with 8-input/output.