Zooming in on the QNoC Architecture
|
|
- June Bruce
- 5 years ago
- Views:
Transcription
1 Technion s NoC esearch: Israel Cidon Department of Electrical Engineering Technion Israel Institute of Technology Zooming in on the QNoC Architecture PIs Israel Cidon (Networking) an Ginosar (VLSI) Idit Keidar (Dist. Systems) Isaac Kesslassy (Networking) Avinoam Kolodny (VLSI) Uri Weiser (Architecture) Students: Evgeny Bolotin, oman Gindin euven Dobkin, Zvika Guz an Manevich Arkadiy Morgenshtein, Zigi Walter May One NoC does not fits all! NoC for ASICs Traffic uncertainty un time FPGA CMP Design envelop / constraints Well define inter-modules traffic Automatic synthesis Variable QoS requirement Configuration ASSP Main cost Power and area Chip design SOC single application General purpose computer Flexibility I. Cidon and K. Goossens, in Networks on Chips, G. De Micheli and L. Benini, Morgan Kaufmann,
2 QNoC Quality-of-service NoC architecture for ASICs Message routing path Traffic requirements are known a-priori Overall approach Wormhole switching QoS based on priority classes Small buffer/vc budget In-order SP XY routing Irregular topology Optimized link capacities (0,0) (1,0) (2,0) (2,1) (0,2) (0,3) (1,4) (0,4) (5,0) (2,2) (2,3) (2,4) (4,3) (3,4) (4,4) Fixed shortest-path routing (X-Y) Simple outer No deadlock scenario No reordering of messages SP is power-efficient * E. Bolotin, I. Cidon,. Ginosar and A. Kolodny., QNoC: QoS architecture and design process for Network on Chip, JSA special issue on NoC, Wormhole Switching Quality-of-Service in QNoC Small number of buffers Low latency IP2 Multiple priority classes Define latency Preemptive Possible ASIC classes Signaling eal Time Stream N ead-write DMA Block Transfer Statistical guarantees E.g. <0.01% arrive later then required T IP1 * E. Bolotin, I. Cidon,. Ginosar and A. Kolodny., QNoC: QoS architecture and design process for Network on Chip, JSA special issue on NOC,
3 outer structure Virtual Channels or another router outer Flits stored in input ports Output port schedules transmission of flits 9 10 QNoC router with Virtual Channels Simulation Model Flit-accurate simulations VC 5 SL COSS-BA 5 Delay of high-priority service levels is not affected by load 11 12
4 QNoC Design Flow QNoC Design Flow Extract intermodule traffic Extract intermodule traffic Place modules Place modules Allocate link capacities Allocate link capacities Verify QoS and cost Verify QoS and cost QNoC Design Flow Extract intermodule traffic Place modules Allocate link capacities Verify QoS and cost NoC for FPGA Design envelope / constraints Many ASIC like applications per a given FPGA Hard NoC infrastructure efficient but inflexible Soft logic is reusable but has inferior performance Architecture of choice: egular and uniform grid In-order/load balanced routing Hard logic for links, routers Soft logic for routing algorithms, headers, CNIs Soft NoC tuning (routing, CNI) for a given implementation Optimize capacity for performance/power tradeoff Capacity allocation is a traditional WAN problem Never solved for wormhole routing and NoC *. Gindin, I. Keidar and, I.Cidon, NoC Architecture for Future FPGAs., NoCs
5 NoC Based FPGA Architecture FPGA outing Optimization Problem Functional unit NoC for interrouting Common efficient NoC outers Configurable region User logic Set of Applications Different Architectures Different Traffic Patterns Configurable network interface Implemented on the same chip Ordered outing Algorithms Source Toggle XY One route per source-destination (S-D) pair No traffic splitting Unlike TXY, traffic to same destination is not split Maximum capacity similar to TXY The route is a bitwise XO of source and destination ID Can be extended to weighted source toggle (WOT) XY YX XY YX XY YX YX XY YX XY YX XY YX XY YX XY YX XY YX XY YX XY YX XY Unordered outing Ordered outing 19 20
6 Two Hotspots Maximum Capacity C a p a c i t y 20 XY TXY STXY WTXY WOT Design Envelope for various distances between the hotspots for WOT NoC and CMP Uniprocessors cannot provide Power-efficient performance growth Interconnect dominates dynamic power Global wire delay doesn t scale Interconnect Instruction-level parallelism is limited (Polack) Progress to parallel computations Chip Multi Processors (CMP) Thread-Level Parallelism (TLP) Built around NoC Uniprocessor dynamic power (Magen et al., SLIP 2004) Gate Diff Minimum Distance between the hotspots NoC for Shared Memory CMP NoC can facilitate system services Constraints Multiple access to shared cache equires cache coherency Heterogeneous traffic pattern QoS requirements (fetch, pre-fetch) Architecture of choice: Wormhole switching In-order routing Simple QoS mechanisms Embedded functions (multicast, search ) P6 P7 P0 P1 0 7 Distributed L P5 P4 P3 P2 * E.Bolotin, Z. Guz, I.Cidon,. Ginosar and A. Kolodny, The Power of Priority: NoC based Distributed Cache Coherency, NoCs
7 Delay eduction [%] Priority NoC: esults L2 Access Delay eduction by Priority-based NoC ead ead Exclusive apache zeus fft ocean radix Speedup [%] P6 P7 P0 P1 0 7 Distributed L P5 P4 Total Program Speedup by Priority-based NoC apache zeus fft ocean radix P3 P2 Generic NoC Issues Many shared problems across design spectrum, examples: Providing an extremely low latency class of service Providing broadcast/multicast functions Analysis and solutions for hot modules outing in irregular networks eliability and fault tolerance Network verification Power control of NoCs NoC physical layout Circuits for NoC lines Bus-Enhanced NoC (BENoC) Dynamic Non-Uniform Cache Access L2$ Low latency bus and high-throughput NoC Multicast/broadcast services Low latency module to module signaling NoC management & control CMP Cache example *. Manevich, I, Walter, I.Cidon and A. Kolodny, Best of Both Worlds: Bus Enhanced NoC (BENoC), NoCs Non uniform cache access (NUCA) - split into independent banks Dynamic NUCA - cache lines are moved to shorten access time Before fetching into, needs to find the L2$ storing the line When a modifies the data, all L1 copies should be invalidated 27 28
8 Bus-Enhanced NoC (BENoC) Bus-Enhanced NoC ecent Work Main messages: BENoC facilitates advanced services BENoC s broadcast is energy efficient Model NoC and bus broadcast latency and energy analytically BENoC is good for performance Use simulator to run benchmarks Short latency signaling NoC management Multicast services CMP Cache (focus of recent research) Analyzing Latency and Energy Bus-Enhanced NoC ecent Work NoC links Input C in C load Model NoC and bus broadcast latency ( P C0 Cin ) Tnet n Tlink n τ τ ( Clink Cin ) τ n + = = + + = + τ Cld Cld (( ) 0 ) τ τ T = C + τ = n+ n PC + nc + τ bus bus in Cbd Cbd Bus wire Approximate broadcast power Latency: Energy: τ T = Cload +τ C 2 in E= V ( C + C ) load in Enet = V K( Cld + Clink + Cin ) τ( npc0+ npc0+ nc in) Ebus = V( ( n+ n) PC0+ ncin) + V n τ ( P C0+ Cin ) + τ τ β C ld 31 32
9 BENoC saves energy BENoC is Good for Performance Network broadcast Bus transaction (beta=1) Bus transaction (beta=10) Network unicast ead transaction time performance gain Energy per bit [pj/bit] Split cache traffic Broadcast over Bus Unicast over NoC n Generic NoC Issues NoC Clogging Hot modules cannot absorb traffic from NoC Off chip systems Shared memory subsystems Expensive functional units IP1 IP2 IP3 HM is not a local problem Transparent to NoC performance Walter, Cidon, Ginosar and Kolodny, Access egulation to Hot-s in Wormhole NoCs, NOCS
10 Source Fairness No fairness is guarantied since routers arbitration is based on local state The further is the source from the destination, its worm has to win more arbitrations The HM module bandwidth isn t fairly shared IP (HM) Hot Distributed Arbitration Control is part of interface or a core serving multiple HM eceives all requests and employs a scheduling scheme equests and grants sent in a high service level or bus equests and grants includes additional data: requested quota, source queue size, priority, deadline, etc. Granted quota, scheduling of transmission's, etc. Initial credits hides light load request-grant latency Hot vs. non-hot Traffic Optimizing routing on Irregular Mesh HM Traffic With Control Around the Block Dead End HM Traffic Without Control Other Traffic Without Control Goal: Minimize the total size of routing tables Other Traffic With Control 39 E. Bolotin, I. Cidon,. Ginosar and A. Kolodny, "outing Table Minimization for Irregular Mesh NoCs", DATE
11 Saving Table Hardware Traditional solutions - full routing tables Destination Based outing - at router Source outing at sources Solution idea: Use educed Tables Store only relevant destinations (PLA) Default function ( Go XY or Don t turn ) + Table for deviations outing Cost [gates] outing Heuristics for Irregular Mesh 90,000 80,000 70,000 60,000 50,000 40,000 30,000 20,000 10,000 0 outing Cost in 12x12 NoC (many andom holes, high problem hotspost instances probability) Hotspot Number D XYDT S SDP Distributed outing (full tables) X-Y outing with Deviation Tables Source outing Source outing for Deviation Points Log ( outing Cost ) outing Systems Cost with eduction real applications in eal Aplications D TT XYDT S SDP 1 MPEG4 VOPD Efficient outing esults Scaling of Savings Savings Summary NoC became a major research field Key to SoC, FPGA and CMP future Different NoC architectures required NoC is not a miniature off-chip net Power major optimization factor Latency critical and solvable May integrate fine grain operations and control May integrate multiple best-of-breed architectures Network Size 43 44
Architecture and Routing in NOC Based FPGAs
Architecture and Routing in NOC Based PGAs Roman Gindin rgindin@tx.technion.ac.il Israel Cidon cidon@ee.technion.ac.il Idit Keidar idish@ee.technion.ac.il ABSTRACT We present a novel network-on-chip architecture
More informationRouting Algorithms, Process Model for Quality of Services (QoS) and Architectures for Two-Dimensional 4 4 Mesh Topology Network-on-Chip
Routing Algorithms, Process Model for Quality of Services (QoS) and Architectures for Two-Dimensional 4 4 Mesh Topology Network-on-Chip Nauman Jalil, Adnan Qureshi, Furqan Khan, and Sohaib Ayyaz Qazi Abstract
More informationDeadlock-free XY-YX router for on-chip interconnection network
LETTER IEICE Electronics Express, Vol.10, No.20, 1 5 Deadlock-free XY-YX router for on-chip interconnection network Yeong Seob Jeong and Seung Eun Lee a) Dept of Electronic Engineering Seoul National Univ
More informationQNoC: QoS architecture and design process for network on chip
Journal of Systems Architecture xxx (23) xxx xxx www.elsevier.com/locate/sysarc QNoC: QoS architecture and design process for network on chip Evgeny Bolotin *, Israel Cidon, Ran Ginosar, Avinoam Kolodny
More informationNetSpeed ORION: A New Approach to Design On-chip Interconnects. August 26 th, 2013
NetSpeed ORION: A New Approach to Design On-chip Interconnects August 26 th, 2013 INTERCONNECTS BECOMING INCREASINGLY IMPORTANT Growing number of IP cores Average SoCs today have 100+ IPs Mixing and matching
More informationOfficial Agenda Review the most commonly used tools in CMP µarch research. Unofficial Agenda Convince you to use Simics
Simics and Friends Modeling Tools for CMP Research Zvika Guz, Isask har (Zigi Zigi) Walter The Technion Israel Institute of Technology Official Agenda Agenda Review the most commonly used tools in CMP
More informationEfficient Throughput-Guarantees for Latency-Sensitive Networks-On-Chip
ASP-DAC 2010 20 Jan 2010 Session 6C Efficient Throughput-Guarantees for Latency-Sensitive Networks-On-Chip Jonas Diemer, Rolf Ernst TU Braunschweig, Germany diemer@ida.ing.tu-bs.de Michael Kauschke Intel,
More informationLecture 16: On-Chip Networks. Topics: Cache networks, NoC basics
Lecture 16: On-Chip Networks Topics: Cache networks, NoC basics 1 Traditional Networks Huh et al. ICS 05, Beckmann MICRO 04 Example designs for contiguous L2 cache regions 2 Explorations for Optimality
More information4. Networks. in parallel computers. Advances in Computer Architecture
4. Networks in parallel computers Advances in Computer Architecture System architectures for parallel computers Control organization Single Instruction stream Multiple Data stream (SIMD) All processors
More informationA Thermal-aware Application specific Routing Algorithm for Network-on-chip Design
A Thermal-aware Application specific Routing Algorithm for Network-on-chip Design Zhi-Liang Qian and Chi-Ying Tsui VLSI Research Laboratory Department of Electronic and Computer Engineering The Hong Kong
More informationCAD System Lab Graduate Institute of Electronics Engineering National Taiwan University Taipei, Taiwan, ROC
QoS Aware BiNoC Architecture Shih-Hsin Lo, Ying-Cherng Lan, Hsin-Hsien Hsien Yeh, Wen-Chung Tsai, Yu-Hen Hu, and Sao-Jie Chen Ying-Cherng Lan CAD System Lab Graduate Institute of Electronics Engineering
More informationAn Analysis of Blocking vs Non-Blocking Flow Control in On-Chip Networks
An Analysis of Blocking vs Non-Blocking Flow Control in On-Chip Networks ABSTRACT High end System-on-Chip (SoC) architectures consist of tens of processing engines. These processing engines have varied
More informationQoS architecture and design process for cost effective Network on Chip
1 QoS architecture and design process for cost effective Network on Chip Evgeny Bolotin, Israel Cidon, Ran Ginosar and Avinoam Kolodny Electrical Engineering Department, Technion Israel Institute of Technology
More informationDesign and Test Solutions for Networks-on-Chip. Jin-Ho Ahn Hoseo University
Design and Test Solutions for Networks-on-Chip Jin-Ho Ahn Hoseo University Topics Introduction NoC Basics NoC-elated esearch Topics NoC Design Procedure Case Studies of eal Applications NoC-Based SoC Testing
More informationTDT Appendix E Interconnection Networks
TDT 4260 Appendix E Interconnection Networks Review Advantages of a snooping coherency protocol? Disadvantages of a snooping coherency protocol? Advantages of a directory coherency protocol? Disadvantages
More informationThe Power of Priority: NoC based Distributed Cache Coherency
The Power of Priority: NoC based Distributed Cache Coherency Evgeny Bolotin, Zvika Guz, Israel Cidon, Ran Ginosar and Avinoam Kolodny Electrical Engineering Department,Technion - Israel Institute of Technology,
More informationSoC Design Lecture 13: NoC (Network-on-Chip) Department of Computer Engineering Sharif University of Technology
SoC Design Lecture 13: NoC (Network-on-Chip) Department of Computer Engineering Sharif University of Technology Outline SoC Interconnect NoC Introduction NoC layers Typical NoC Router NoC Issues Switching
More informationPower and Performance Efficient Partial Circuits in Packet-Switched Networks-on-Chip
2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing Power and Performance Efficient Partial Circuits in Packet-Switched Networks-on-Chip Nasibeh Teimouri
More informationInterconnection Networks
Lecture 17: Interconnection Networks Parallel Computer Architecture and Programming A comment on web site comments It is okay to make a comment on a slide/topic that has already been commented on. In fact
More informationOpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel
OpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel Hyoukjun Kwon and Tushar Krishna Georgia Institute of Technology Synergy Lab (http://synergy.ece.gatech.edu) hyoukjun@gatech.edu April
More informationDesign and Implementation of Low Complexity Router for 2D Mesh Topology using FPGA
Design and Implementation of Low Complexity Router for 2D Mesh Topology using FPGA Maheswari Murali * and Seetharaman Gopalakrishnan # * Assistant professor, J. J. College of Engineering and Technology,
More informationFlow Control can be viewed as a problem of
NOC Flow Control 1 Flow Control Flow Control determines how the resources of a network, such as channel bandwidth and buffer capacity are allocated to packets traversing a network Goal is to use resources
More informationSwizzle Switch: A Self-Arbitrating High-Radix Crossbar for NoC Systems
1 Swizzle Switch: A Self-Arbitrating High-Radix Crossbar for NoC Systems Ronald Dreslinski, Korey Sewell, Thomas Manville, Sudhir Satpathy, Nathaniel Pinckney, Geoff Blake, Michael Cieslak, Reetuparna
More informationAchieving Lightweight Multicast in Asynchronous Networks-on-Chip Using Local Speculation
Achieving Lightweight Multicast in Asynchronous Networks-on-Chip Using Local Speculation Kshitij Bhardwaj Dept. of Computer Science Columbia University Steven M. Nowick 2016 ACM/IEEE Design Automation
More informationTraffic Generation and Performance Evaluation for Mesh-based NoCs
Traffic Generation and Performance Evaluation for Mesh-based NoCs Leonel Tedesco ltedesco@inf.pucrs.br Aline Mello alinev@inf.pucrs.br Diego Garibotti dgaribotti@inf.pucrs.br Ney Calazans calazans@inf.pucrs.br
More informationLecture 3: Flow-Control
High-Performance On-Chip Interconnects for Emerging SoCs http://tusharkrishna.ece.gatech.edu/teaching/nocs_acaces17/ ACACES Summer School 2017 Lecture 3: Flow-Control Tushar Krishna Assistant Professor
More informationDynamic Routing of Hierarchical On Chip Network Traffic
System Level Interconnect Prediction 2014 Dynamic outing of Hierarchical On Chip Network Traffic Julian Kemmerer jvk@drexel.edu Baris Taskin taskin@coe.drexel.edu Electrical & Computer Engineering Drexel
More informationTHE INTERPLAY OF CACHES AND THREADS IN CHIP-MULTIPROCESSORS
THE INTERPLAY OF CACHES AND THREADS IN CHIP-MULTIPROCESSORS Zvika Guz THE INTERPLAY OF CACHES AND THREADS IN CHIP-MULTIPROCESSORS Research Thesis Submitted in Partial Fulfillment of the Requirements for
More informationNetworks-on-Chip Router: Configuration and Implementation
Networks-on-Chip : Configuration and Implementation Wen-Chung Tsai, Kuo-Chih Chu * 2 1 Department of Information and Communication Engineering, Chaoyang University of Technology, Taichung 413, Taiwan,
More informationInterconnection Networks: Topology. Prof. Natalie Enright Jerger
Interconnection Networks: Topology Prof. Natalie Enright Jerger Topology Overview Definition: determines arrangement of channels and nodes in network Analogous to road map Often first step in network design
More informationReal-Time Mixed-Criticality Wormhole Networks
eal-time Mixed-Criticality Wormhole Networks Leandro Soares Indrusiak eal-time Systems Group Department of Computer Science University of York United Kingdom eal-time Systems Group 1 Outline Wormhole Networks
More informationNoC Test-Chip Project: Working Document
NoC Test-Chip Project: Working Document Michele Petracca, Omar Ahmad, Young Jin Yoon, Frank Zovko, Luca Carloni and Kenneth Shepard I. INTRODUCTION This document describes the low-power high-performance
More informationFitting the Router Characteristics in NoCs to Meet QoS Requirements
Fitting the Router Characteristics in NoCs to Meet QoS Requirements Edgard de Faria Corrêa Superintendência de Informática - UFRN edgard@info.ufrn.br Leonardo A.de P. e Silva lapys@inf.ufrgs.br Flávio
More informationHARDWARE IMPLEMENTATION OF PIPELINE BASED ROUTER DESIGN FOR ON- CHIP NETWORK
DOI: 10.21917/ijct.2012.0092 HARDWARE IMPLEMENTATION OF PIPELINE BASED ROUTER DESIGN FOR ON- CHIP NETWORK U. Saravanakumar 1, R. Rangarajan 2 and K. Rajasekar 3 1,3 Department of Electronics and Communication
More informationStudy of Network on Chip resources allocation for QoS Management
Journal of Computer Science 2 (10): 770-774, 2006 ISSN 1549-3636 2006 Science Publications Study of Network on Chip resources allocation for QoS Management Abdelhamid HELALI, Adel SOUDANI, Jamila BHAR
More informationPower dissipation! The VLSI Interconnect Challenge. Interconnect is the crux of the problem. Interconnect is the crux of the problem.
The VLSI Interconnect Challenge Avinoam Kolodny Electrical Engineering Department Technion Israel Institute of Technology VLSI Challenges System complexity Performance Tolerance to digital noise and faults
More informationThomas Moscibroda Microsoft Research. Onur Mutlu CMU
Thomas Moscibroda Microsoft Research Onur Mutlu CMU CPU+L1 CPU+L1 CPU+L1 CPU+L1 Multi-core Chip Cache -Bank Cache -Bank Cache -Bank Cache -Bank CPU+L1 CPU+L1 CPU+L1 CPU+L1 Accelerator, etc Cache -Bank
More informationImplementing Flexible Interconnect Topologies for Machine Learning Acceleration
Implementing Flexible Interconnect for Machine Learning Acceleration A R M T E C H S Y M P O S I A O C T 2 0 1 8 WILLIAM TSENG Mem Controller 20 mm Mem Controller Machine Learning / AI SoC New Challenges
More informationPseudo-Circuit: Accelerating Communication for On-Chip Interconnection Networks
Department of Computer Science and Engineering, Texas A&M University Technical eport #2010-3-1 seudo-circuit: Accelerating Communication for On-Chip Interconnection Networks Minseon Ahn, Eun Jung Kim Department
More informationMulti-level Fault Tolerance in 2D and 3D Networks-on-Chip
Multi-level Fault Tolerance in 2D and 3D Networks-on-Chip Claudia usu Vladimir Pasca Lorena Anghel TIMA Laboratory Grenoble, France Outline Introduction Link Level outing Level Application Level Conclusions
More informationEECS 598: Integrating Emerging Technologies with Computer Architecture. Lecture 12: On-Chip Interconnects
1 EECS 598: Integrating Emerging Technologies with Computer Architecture Lecture 12: On-Chip Interconnects Instructor: Ron Dreslinski Winter 216 1 1 Announcements Upcoming lecture schedule Today: On-chip
More informationComputer Systems Architecture
Computer Systems Architecture Lecture 23 Mahadevan Gomathisankaran April 27, 2010 04/27/2010 Lecture 23 CSCE 4610/5610 1 Reminder ABET Feedback: http://www.cse.unt.edu/exitsurvey.cgi?csce+4610+001 Student
More informationIncorporating DMA into QoS Policies for Maximum Performance in Shared Memory Systems. Scott Marshall and Stephen Twigg
Incorporating DMA into QoS Policies for Maximum Performance in Shared Memory Systems Scott Marshall and Stephen Twigg 2 Problems with Shared Memory I/O Fairness Memory bandwidth worthless without memory
More informationLecture: Interconnection Networks
Lecture: Interconnection Networks Topics: Router microarchitecture, topologies Final exam next Tuesday: same rules as the first midterm 1 Packets/Flits A message is broken into multiple packets (each packet
More informationUnderstanding the Routing Requirements for FPGA Array Computing Platform. Hayden So EE228a Project Presentation Dec 2 nd, 2003
Understanding the Routing Requirements for FPGA Array Computing Platform Hayden So EE228a Project Presentation Dec 2 nd, 2003 What is FPGA Array Computing? Aka: Reconfigurable Computing Aka: Spatial computing,
More informationFCUDA-NoC: A Scalable and Efficient Network-on-Chip Implementation for the CUDA-to-FPGA Flow
FCUDA-NoC: A Scalable and Efficient Network-on-Chip Implementation for the CUDA-to-FPGA Flow Abstract: High-level synthesis (HLS) of data-parallel input languages, such as the Compute Unified Device Architecture
More informationLecture 12: Interconnection Networks. Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E)
Lecture 12: Interconnection Networks Topics: communication latency, centralized and decentralized switches, routing, deadlocks (Appendix E) 1 Topologies Internet topologies are not very regular they grew
More informationComparison of Deadlock Recovery and Avoidance Mechanisms to Approach Message Dependent Deadlocks in on-chip Networks
Comparison of Deadlock Recovery and Avoidance Mechanisms to Approach Message Dependent Deadlocks in on-chip Networks Andreas Lankes¹, Soeren Sonntag², Helmut Reinig³, Thomas Wild¹, Andreas Herkersdorf¹
More informationBasic Low Level Concepts
Course Outline Basic Low Level Concepts Case Studies Operation through multiple switches: Topologies & Routing v Direct, indirect, regular, irregular Formal models and analysis for deadlock and livelock
More informationConnection-oriented Multicasting in Wormhole-switched Networks on Chip
Connection-oriented Multicasting in Wormhole-switched Networks on Chip Zhonghai Lu, Bei Yin and Axel Jantsch Laboratory of Electronics and Computer Systems Royal Institute of Technology, Sweden fzhonghai,axelg@imit.kth.se,
More informationLecture: Transactional Memory, Networks. Topics: TM implementations, on-chip networks
Lecture: Transactional Memory, Networks Topics: TM implementations, on-chip networks 1 Summary of TM Benefits As easy to program as coarse-grain locks Performance similar to fine-grain locks Avoids deadlock
More informationNETWORK AND TRANSPORT LAYERS IN NETWORKS ON CHIP
C H A P T E R 5 NETWORK AND TRANSPORT LAYERS IN NETWORKS ON CHIP The main goal of the network and transport layers is to support the endto-end communication between the modules at the specified quality
More informationInterconnection Networks
Lecture 18: Interconnection Networks Parallel Computer Architecture and Programming CMU 15-418/15-618, Spring 2015 Credit: many of these slides were created by Michael Papamichael This lecture is partially
More informationComputer Systems Architecture
Computer Systems Architecture Lecture 24 Mahadevan Gomathisankaran April 29, 2010 04/29/2010 Lecture 24 CSCE 4610/5610 1 Reminder ABET Feedback: http://www.cse.unt.edu/exitsurvey.cgi?csce+4610+001 Student
More informationEfficient Latency Guarantees for Mixed-criticality Networks-on-Chip
Platzhalter für Bild, Bild auf Titelfolie hinter das Logo einsetzen Efficient Latency Guarantees for Mixed-criticality Networks-on-Chip Sebastian Tobuschat, Rolf Ernst IDA, TU Braunschweig, Germany 18.
More informationA Novel Energy Efficient Source Routing for Mesh NoCs
2014 Fourth International Conference on Advances in Computing and Communications A ovel Energy Efficient Source Routing for Mesh ocs Meril Rani John, Reenu James, John Jose, Elizabeth Isaac, Jobin K. Antony
More informationMultiprocessors and Thread-Level Parallelism. Department of Electrical & Electronics Engineering, Amrita School of Engineering
Multiprocessors and Thread-Level Parallelism Multithreading Increasing performance by ILP has the great advantage that it is reasonable transparent to the programmer, ILP can be quite limited or hard to
More informationOn-chip Monitoring Infrastructures and Strategies for Many-core Systems
On-chip Monitoring Infrastructures and Strategies for Many-core Systems ussell Tessier, Jia Zhao, Justin Lu, Sailaja Madduri, and Wayne Burleson esearch supported by the Semiconductor esearch Corporation
More informationCCNoC: Specializing On-Chip Interconnects for Energy Efficiency in Cache-Coherent Servers
CCNoC: Specializing On-Chip Interconnects for Energy Efficiency in Cache-Coherent Servers Stavros Volos, Ciprian Seiculescu, Boris Grot, Naser Khosro Pour, Babak Falsafi, and Giovanni De Micheli Toward
More informationFPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC)
FPGA based Design of Low Power Reconfigurable Router for Network on Chip (NoC) D.Udhayasheela, pg student [Communication system],dept.ofece,,as-salam engineering and technology, N.MageshwariAssistant Professor
More informationNOC: Networks on Chip SoC Interconnection Structures
NOC: Networks on Chip SoC Interconnection Structures COE838: Systems-on-Chip Design http://www.ee.ryerson.ca/~courses/coe838/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrical and Computer Engineering
More informationCentralized Adaptive Routing for NoCs
Centralized Adaptive Routing for NoCs Authors Name/s per 1st Affiliation (Author) line 1 (of Affiliation): dept. name of organization line 2: name of organization, acronyms acceptable line 3: City, Country
More informationAddress InterLeaving for Low- Cost NoCs
Address InterLeaving for Low- Cost NoCs Miltos D. Grammatikakis, Kyprianos Papadimitriou, Polydoros Petrakis, Marcello Coppola, and Michael Soulie Technological Educational Institute of Crete, GR STMicroelectronics,
More informationOverlaid Mesh Topology Design and Deadlock Free Routing in Wireless Network-on-Chip. Danella Zhao and Ruizhe Wu Presented by Zhonghai Lu, KTH
Overlaid Mesh Topology Design and Deadlock Free Routing in Wireless Network-on-Chip Danella Zhao and Ruizhe Wu Presented by Zhonghai Lu, KTH Outline Introduction Overview of WiNoC system architecture Overlaid
More informationMapping of Real-time Applications on
Mapping of Real-time Applications on Network-on-Chip based MPSOCS Paris Mesidis Submitted for the degree of Master of Science (By Research) The University of York, December 2011 Abstract Mapping of real
More informationInterconnection Network
Interconnection Network Recap: Generic Parallel Architecture A generic modern multiprocessor Network Mem Communication assist (CA) $ P Node: processor(s), memory system, plus communication assist Network
More informationOASIS Network-on-Chip Prototyping on FPGA
Master thesis of the University of Aizu, Feb. 20, 2012 OASIS Network-on-Chip Prototyping on FPGA m5141120, Kenichi Mori Supervised by Prof. Ben Abdallah Abderazek Adaptive Systems Laboratory, Master of
More informationModule 17: "Interconnection Networks" Lecture 37: "Introduction to Routers" Interconnection Networks. Fundamentals. Latency and bandwidth
Interconnection Networks Fundamentals Latency and bandwidth Router architecture Coherence protocol and routing [From Chapter 10 of Culler, Singh, Gupta] file:///e /parallel_com_arch/lecture37/37_1.htm[6/13/2012
More informationSoC Design. Prof. Dr. Christophe Bobda Institut für Informatik Lehrstuhl für Technische Informatik
SoC Design Prof. Dr. Christophe Bobda Institut für Informatik Lehrstuhl für Technische Informatik Chapter 5 On-Chip Communication Outline 1. Introduction 2. Shared media 3. Switched media 4. Network on
More informationReNoC: A Network-on-Chip Architecture with Reconfigurable Topology
1 ReNoC: A Network-on-Chip Architecture with Reconfigurable Topology Mikkel B. Stensgaard and Jens Sparsø Technical University of Denmark Technical University of Denmark Outline 2 Motivation ReNoC Basic
More informationLecture 24: Interconnection Networks. Topics: topologies, routing, deadlocks, flow control
Lecture 24: Interconnection Networks Topics: topologies, routing, deadlocks, flow control 1 Topology Examples Grid Torus Hypercube Criteria Bus Ring 2Dtorus 6-cube Fully connected Performance Bisection
More informationChapter 9 Multiprocessors
ECE200 Computer Organization Chapter 9 Multiprocessors David H. lbonesi and the University of Rochester Henk Corporaal, TU Eindhoven, Netherlands Jari Nurmi, Tampere University of Technology, Finland University
More informationA TDM NoC supporting QoS, multicast, and fast connection set-up
A TDM NoC supporting QoS, multicast, and fast connection set-up Radu Stefan TU Eindhoven R.Stefan@tue.nl Anca Molnos TU Delft A.M.Molnos@tudelft.nl Angelo Ambrose UNSW ajangelo@cse.unsw.edu.au Kees Goossens
More informationSome issues in Application Specific Network on Chip Design
Some issues in Application Specific Network on Chip Design Embedded Systems Group Department of Electronics and Computer Engineering School of Engineering, Jönköping University 1 Outline Introduction:
More informationLecture: Cache Hierarchies. Topics: cache innovations (Sections B.1-B.3, 2.1)
Lecture: Cache Hierarchies Topics: cache innovations (Sections B.1-B.3, 2.1) 1 Types of Cache Misses Compulsory misses: happens the first time a memory word is accessed the misses for an infinite cache
More informationTrends in Embedded System Design
Trends in Embedded System Design MPSoC design gets increasingly complex Moore s law enables increased component integration Digital convergence creates a market for highly integrated devices The resulting
More informationUltra-Fast NoC Emulation on a Single FPGA
The 25 th International Conference on Field-Programmable Logic and Applications (FPL 2015) September 3, 2015 Ultra-Fast NoC Emulation on a Single FPGA Thiem Van Chu, Shimpei Sato, and Kenji Kise Tokyo
More informationPhastlane: A Rapid Transit Optical Routing Network
Phastlane: A Rapid Transit Optical Routing Network Mark Cianchetti, Joseph Kerekes, and David Albonesi Computer Systems Laboratory Cornell University The Interconnect Bottleneck Future processors: tens
More informationLecture 18: Communication Models and Architectures: Interconnection Networks
Design & Co-design of Embedded Systems Lecture 18: Communication Models and Architectures: Interconnection Networks Sharif University of Technology Computer Engineering g Dept. Winter-Spring 2008 Mehdi
More informationTrade Offs in the Design of a Router with Both Guaranteed and Best-Effort Services for Networks on Chip
Trade Offs in the Design of a Router with Both Guaranteed and BestEffort Services for Networks on Chip E. Rijpkema, K. Goossens, A. R dulescu, J. Dielissen, J. van Meerbergen, P. Wielage, and E. Waterlander
More informationLecture 25: Interconnection Networks, Disks. Topics: flow control, router microarchitecture, RAID
Lecture 25: Interconnection Networks, Disks Topics: flow control, router microarchitecture, RAID 1 Virtual Channel Flow Control Each switch has multiple virtual channels per phys. channel Each virtual
More informationMapping and Physical Planning of Networks-on-Chip Architectures with Quality-of-Service Guarantees
Mapping and Physical Planning of Networks-on-Chip Architectures with Quality-of-Service Guarantees Srinivasan Murali 1, Prof. Luca Benini 2, Prof. Giovanni De Micheli 1 1 Computer Systems Lab, Stanford
More informationEvaluation of NOC Using Tightly Coupled Router Architecture
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 18, Issue 1, Ver. II (Jan Feb. 2016), PP 01-05 www.iosrjournals.org Evaluation of NOC Using Tightly Coupled Router
More informationJUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS
1 JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS Shabnam Badri THESIS WORK 2011 ELECTRONICS JUNCTION BASED ROUTING: A NOVEL TECHNIQUE FOR LARGE NETWORK ON CHIP PLATFORMS
More informationEECS 570 Final Exam - SOLUTIONS Winter 2015
EECS 570 Final Exam - SOLUTIONS Winter 2015 Name: unique name: Sign the honor code: I have neither given nor received aid on this exam nor observed anyone else doing so. Scores: # Points 1 / 21 2 / 32
More informationLecture 7: Flow Control - I
ECE 8823 A / CS 8803 - ICN Interconnection Networks Spring 2017 http://tusharkrishna.ece.gatech.edu/teaching/icn_s17/ Lecture 7: Flow Control - I Tushar Krishna Assistant Professor School of Electrical
More informationA DRAM Centric NoC Architecture and Topology Design Approach
11 IEEE Computer Society Annual Symposium on VLSI A DRAM Centric NoC Architecture and Topology Design Approach Ciprian Seiculescu, Srinivasan Murali, Luca Benini, Giovanni De Micheli LSI, EPFL, Lausanne,
More informationBursty Communication Performance Analysis of Network-on-Chip with Diverse Traffic Permutations
International Journal of Soft Computing and Engineering (IJSCE) Bursty Communication Performance Analysis of Network-on-Chip with Diverse Traffic Permutations Naveen Choudhary Abstract To satisfy the increasing
More informationNetwork on Chip Architecture: An Overview
Network on Chip Architecture: An Overview Md Shahriar Shamim & Naseef Mansoor 12/5/2014 1 Overview Introduction Multi core chip Challenges Network on Chip Architecture Regular Topology Irregular Topology
More informationFast Flexible FPGA-Tuned Networks-on-Chip
This work was funded by NSF. We thank Xilinx for their FPGA and tool donations. We thank Bluespec for their tool donations. Fast Flexible FPGA-Tuned Networks-on-Chip Michael K. Papamichael, James C. Hoe
More informationES1 An Introduction to On-chip Networks
December 17th, 2015 ES1 An Introduction to On-chip Networks Davide Zoni PhD mail: davide.zoni@polimi.it webpage: home.dei.polimi.it/zoni Sources Main Reference Book (for the examination) Designing Network-on-Chip
More informationConquering Memory Bandwidth Challenges in High-Performance SoCs
Conquering Memory Bandwidth Challenges in High-Performance SoCs ABSTRACT High end System on Chip (SoC) architectures consist of tens of processing engines. In SoCs targeted at high performance computing
More informationBrief Background in Fiber Optics
The Future of Photonics in Upcoming Processors ECE 4750 Fall 08 Brief Background in Fiber Optics Light can travel down an optical fiber if it is completely confined Determined by Snells Law Various modes
More informationDesign of Router Architecture Based on Wormhole Switching Mode for NoC
International Journal of Scientific & Engineering Research Volume 3, Issue 3, March-2012 1 Design of Router Architecture Based on Wormhole Switching Mode for NoC L.Rooban, S.Dhananjeyan Abstract - Network
More information3D WiNoC Architectures
Interconnect Enhances Architecture: Evolution of Wireless NoC from Planar to 3D 3D WiNoC Architectures Hiroki Matsutani Keio University, Japan Sep 18th, 2014 Hiroki Matsutani, "3D WiNoC Architectures",
More informationDesign and Implementation of a Packet Switched Dynamic Buffer Resize Router on FPGA Vivek Raj.K 1 Prasad Kumar 2 Shashi Raj.K 3
IJSRD - International Journal for Scientific Research & Development Vol. 2, Issue 02, 2014 ISSN (online): 2321-0613 Design and Implementation of a Packet Switched Dynamic Buffer Resize Router on FPGA Vivek
More information15-744: Computer Networking. Routers
15-744: Computer Networking outers Forwarding and outers Forwarding IP lookup High-speed router architecture eadings [McK97] A Fast Switched Backplane for a Gigabit Switched outer Optional [D+97] Small
More informationECE 551 System on Chip Design
ECE 551 System on Chip Design Introducing Bus Communications Garrett S. Rose Fall 2018 Emerging Applications Requirements Data Flow vs. Processing µp µp Mem Bus DRAMC Core 2 Core N Main Bus µp Core 1 SoCs
More informationCommunication Services for Networks on Chip
SAMOS, II(), 275 299 (2002) Communication Services for Networks on Chip Andrei Rădulescu and Kees Goossens Philips Research Laboratories, Eindhoven, The Netherlands Email: andrei.radulescu kees.goossens@philips.com
More informationFault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies. Mohsin Y Ahmed Conlan Wesson
Fault Tolerant and Secure Architectures for On Chip Networks With Emerging Interconnect Technologies Mohsin Y Ahmed Conlan Wesson Overview NoC: Future generation of many core processor on a single chip
More information