%URDGFRP&DOLVWR Π$0XOWL&KDQQHO0XOWL6HUYLFH &RPPXQLFDWLRQV3ODWIRUP

Size: px
Start display at page:

Download "%URDGFRP&DOLVWR Π$0XOWL&KDQQHO0XOWL6HUYLFH &RPPXQLFDWLRQV3ODWIRUP"

Transcription

1 %URDGFRP&DOLVWR Π$0XOWL&KDQQHO0XOWL6HUYLFH &RPPXQLFDWLRQV3ODWIRUP John Nickolls L.J. Madar III Scott Johnson Viresh Rustagi Ken Unger Mustafiz Choudhury Broadcom Corporation 1 Calisto Hot Chips 2002

2 &DOLVWR Π2XWOLQH Communications gateways Multi-channel, multi-service, real-time Platform architecture Cluster architecture SpiceEngine vector DSPs Shared memory Multi-channel I/O Implementation Performance Summary 2 Calisto Hot Chips 2002

3 0XOWL&KDQQHO&RPPXQLFDWLRQV Cable, xdsl Access Gateways Universal Port Gateway (RAS) VoDSLAM PSTN IP Class 4 Switch Packet Network ATM/FR/IP VoIP/DSL/DLC CO/PBX Trunking Gateway Enterprise Voice/Video 3G Wireless 3 Calisto Hot Chips 2002

4 &DOLVWR Œ *DWHZD\%ODGH Packet Backplane Interconnect Backplane Interface Packet Switch Circuit Interface *DWHZD\ %ODGH Circuit Backplane Interconnect Carrier-class/enterprise gateways Voice over packet, media, fax, data, wireless, trunking, remote access Require many diverse services Require 1000 s of channels per blade Severely limited power budget Prior gateways were low density Complex designs, many different chips ÎCalisto is a gateway on a chip Integrates DSPs, Us, RAMs, I/O Programmable multi-service platform Supports channels per chip Simple Calisto chip array Enables over 2,016 channels per blade ÎEfficient low-power design 4 Calisto Hot Chips 2002

5 0XOWL6HUYLFH*DWHZD\6WDFN Real-Time OS Packet Network RTP/UDP or AAL2 Jitter Buffer Voice-band Signaling Voice Detect/Noise Gen Voice Coder Fax/Data Relay Fax/Data Modem Conference/Mix Service Tone Service Call Discriminator Echo Cancel PCM or AAL1 Circuit Network Circuit Network MCycles/S OS: Complex service suite Minimize assembler ÎFlexible compiled C software Any service, any port, any time Total program image KB ÎSingle-image shared memory Compute + memory demand DSP/RISC MS/channel 4 20 KB inter-frame data/channel ÎBalanced compute and memory Mix of DSP, packet, OS work No time for DSP task + packet task ÎHybrid DSP/RISC processors ÎDedicated OS processors Total: MS required per channel 5 Calisto Hot Chips 2002

6 0DQ\5HDOWLPH3HULRGLF7DVNV Process frames periodically for all channels Service-specific frame period: 5, 6, 10, 16.7, 20, or 30 msec 800 to 3,000 tasks/sec per DSP, or 4 to 15 tasks per frame period ÎEvery service stack can be executed any time Requires low latency processing delay with low jitter Incremental latency of 1 2 msec, jitter under 250 µsec ÎCombine all processing in single per-frame task, minimize overhead Packet ι Circuit and Packet η Circuit Get ι pkt 1 JB ι pkt 1 Accumulate η frame f1 Get ι pkt 2 Task ο f1 Jitter Buffer ι pkt 2 Put η pkt 1 Accumulate η frame f2 Put η pkt 2 Accumulate η frame f3 Channel task flow 0 msec 5 msec 10 msec 15 msec Task ο f2 Generate ι frame f1 Get ι pkt 3 JB ι pkt 3 Task ο f3 Generate ι frame f2 6 Calisto Hot Chips 2002

7 0DSSLQJ$SSOLFDWLRQ$WWULEXWHV $SSOLFDWLRQ$WWULEXWH Many independent channels Many diverse services High channel density Digital signal processing Network packet processing Mix of DSP and packet work Low latency, low jitter Î Î Î Î Î Î Î Î 3ODWIRUP$UFKLWHFWXUH Many processors, many parallel per-channel tasks Programmable, primarily in C; large instruction memory, fast cache fill Minimize power, share resources Vector data parallelism, operand throughput, memory bandwidth Task parallelism, sequential protocols Hybrid DSP/RISC processors Consistent task run time, minimize task overhead and contention 7 Calisto Hot Chips 2002

8 &DOLVWR%&0$UFKLWHFWXUH Clusters partition functions CM MB 1 MB Total Cluster Memory Task data, I/O buffers, stacks 4 Cluster Processors Run OS and supervise cluster 16 SpiceEngine DSPs Run DSP/packet tasks 5 Memory and I/O Bridges Share resources efficiently SM 768 KB Shared Memory Single OS & application image 4 Multi-channel I/O interfaces Main Processor I/O MP Hub Runs OS and supervises chip Chip synchronization Hub JTAG BOOT 2 Packet I/O 2 TDM I/O Cluster Memory 256 KB Cluster Memory 256 KB MB MB Shared Memory 768 KB Hub MP SDRAM MB MB Cluster Memory 256 KB Cluster Memory 256 KB 8 Calisto Hot Chips 2002

9 &OXVWHU$UFKLWHFWXUH 256 KB Cluster Memory Shared by, s, MB, I/O Task, OS, I/O communication Pipelined CM switch CM Low latency load pipeline Minimal memory contention Cluster synch hub Hub 4 SpiceEngine DSP pool Next available runs task CM No channel affinity Æ min latency RISC Cluster Processor Schedules tasks and output Services interrupts and input Frees s of overhead I/O Bus IOB CM KB Bank 0 32 D$ Chip Hub BIF 32 Shared Memory Switch Port MB 192 Cluster Memory (CM) Pipelined Crossbar Switch CM 32 KB Bank 1 32 Hub CM 32 KB Bank 2 CM 32 KB Bank 3 CM 32 KB Bank 4 CM 32 KB Bank 5 CM 32 KB Bank 6 32 CM KB Bank 7 9 Calisto Hot Chips 2002

10 6SLFH(QJLQH Π9HFWRU'63 MUL 17 x 17 Instruction Cache Fill Interface Vector Registers 1 KB MReg 33 Configuration Registers 8 x 128 Vector Address Units ALU 40/32/16 VAU Regs 16 x 10 Vector Load/Store SHIFT 40/32/16 Accumulators 2 x 40 Instruction Cache 384 x 24 Registers 16 x 32 BITU 32 Cluster Memory Interface Load/ Store ALU 32 MMU Vector registers Fast structured data access Reduce cluster memory demand Fixed-point DSP pipeline Configurable 16/32/40 bits 10 Parallel operations/cycle ITU saturation arithmetic Efficient C compiler target Simple single-mac datapath Narrow 24-bit instruction issue Wide configuration register issue for loops and vector operations Fewer cycles than comparable DSPs, using primarily compiled C J. Nickolls, Broadcom SpiceEngine: A Vector Signal Processor, Microprocessor Forum, Oct. 18, Calisto Hot Chips 2002

11 :K\9HFWRU5HJLVWHUV" VAU vw VAU va vw Write 32/16/8 y[0] y[2] y[4] y[6] x[0] x[2] x[4] x[6] c[0] c[2] c[4] va Read vm Load/Store m[0] m[1] m[2] 32/16/8 y[1] y[3] y[5] y[7] x[1] x[3] x[5] x[7] c[1] c[3] c[5] 32/16/8 32/16/8 vb Read VAU vm VAU vb Fast structured data access Vectors, arrays, tables, lists Multi-port vector register access Compiler-scheduled preload Efficient DSP performance Vector preload covers memory latency, unlike data cache miss Reduce DSP memory accesses and power: O(N T) Æ O(N + T) Vector address unit (VAU) Low overhead strided address Vector registers & configurations Sustain 1 cycle per MAC with minimal memory bandwidth 11 Calisto Hot Chips 2002

12 9HFWRUL]LQJ&&RPSLOHU 7RROV Multi-level loop vectorization Automatic and manual vector register allocation Vector register load scheduling Multi-level vector stride detection Dynamic configuration generation Minimum-cost configuration generation, load scheduling Signal processing, cross-module optimizations Value range optimization, saturation algebra Intrinsic deduction, inlining, constant propagation Redundant store elimination (e.g. ITU overflow flag) High-efficiency signal/packet processing in C Typical ITU reference C within 2x cycles of hand-coded assembler ÎEnables complete C service stacks, including DSP and network code Multi-channel debugging, profiling, performance tuning tools 12 Calisto Hot Chips 2002

13 &DFKH)LOOIURP6KDUHG0HPRU\ SDRAM Controller IOB I/O Bus IOB 32 Shared Memory 192 KB Bank 0 Shared Memory Pipelined Cross Bar Switch 128 MB MP Cluster 0 D$ BIF Hub MB Shared Memory 192 KB Bank Cluster 1 CM 1 Cluster 3 MB Shared Memory 192 KB Bank Cluster Memory (CM) Shared Memory 192 KB Bank Cluster 2 Cluster 4 CM 2 MB MB CM 4 Shared Memory SM Single program image, 768 KB Fills & instruction caches Interleaved 4 banks x 128 bits Pipelined crossbar switch Delivers full 512 bits per cycle Typical utilization 50% 80% Supports I/O bus and SDRAM instruction cache fill Memory Bridge DMA engine Cache reduces memory demand 192 byte line amortizes latency Average miss cost: I SM MB 13 Calisto Hot Chips 2002

14 0XOWL&KDQQHO6FDWWHU*DWKHU,2 Boot Port IOB IOB Cluster 2 D$ Packet I/O Port CM 2 Ingress BIF Hub Packet I/O Port IOB I/O Arbiter Cluster 3 Cluster 4 CM 3 Cluster 1 TDM I/O Port IOB CM 4 Cluster Memory (CM) 1 TDM I/O Port Egress IOB SM MB Packet and cell network I/O IP and ATM networks 100 Mbps full duplex, pin-efficient TDM sample stream I/O Circuit and AAL1 networks Bit-serial ports Multi-channel scatter/gather Ingress/egress ring buffers in Cluster or Shared Memory Classify ingress packet/cell, DMA to ring buffer Assemble egress packets from chained DMA buffers Ring buffer synchronization 14 Calisto Hot Chips 2002

15 (IILFLHQF\IRUORZSRZHU Scalable multi-processor parallelism Many small DSPs exploit channel task parallelism Higher efficiency than fewer large VLIW DSPs Shared program memory Significant reduction in area and power vs. DSP farms Instruction caches reduce memory demand Cluster memory shared by DSPs and Enables scalable DSP array, pool scheduling Vector DSPs reduce memory demand Avoid task preemption, minimize overhead No cycles wasted on DSP task preemption Balanced DSP/RISC MS and memory Channel density typically limited equally 15 Calisto Hot Chips 2002

16 &DOLVWR Œ %&0,PSOHPHQWDWLRQ Implementation 130 M transistors.13µm CMOS -G 166 MHz clock 1.2 V core, 2.5 V I/O 1.2 Watts 239 BGA 19x19 mm Q1/2002 production Low power design Extensive clock gating Low power pipelined busses RAM defect repair Redundant rows and columns Manufacturing test, laser repair 16 SpiceEngine DSPs 2.7 GIPS 2.7 GMACS, 26.6 GOPS 16 KB Vector registers 18 KB Cache 5 RISC Processors 830 MIPS 40 KB Cache 768 KB Shared Memory 10.6 GB/sec bandwidth 1 MB Cluster Memory 18.6 GB/sec bandwidth Multi-channel I/O 16 Calisto Hot Chips 2002

17 17 Calisto Hot Chips 2002 &DOLVWR Π%&0

18 26/RZ/DWHQF\6FKHGXOLQJ Multi-channel real-time OS Develop application service stack for single channel OS maps each channel to a periodic per-frame channel task OS/MMU restricts tasks to access only assigned channel memory Schedule per-frame channel tasks in a periodic window Combine up/down, DSP/packet processing in one per-frame task Run each per-frame task to completion, within known deadline Zero cycles wasted on task preemption OS schedule minimizes longest processing latency Stagger channel tasks relative to input frames, schedule output Highest priority task gets next available SpiceEngine ÎAchieves less than 250 µsec output packet jitter OS distributed across clusters Each manages its cluster resources 18 Calisto Hot Chips 2002

19 %&0*DWHZD\3HUIRUPDQFH Calisto Gateway xchange software ÎComplete carrier-class VoP gateway service suite ITU bit-exact vocoders, G.7xx, GSM, BroadVoice, 64/128 msec adaptive line echo cancellation (ECAN) Voice activity detection, comfort noise generation DTMF tone detect/generate, tone relay, fax/modem detect Fax relay, Fax modems, T.38 FoIP, AAL2 Annex M Packet interface: IP/RTP/UDP, ATM AAL1/2/5, adaptive jitter buffer Conference mixing, in-band signaling, remote test, encryption Breakthrough density for VoP Gateways 240/220 channels G.711 VoP, 32/64 msec ECAN, 5 mw/channel 184 channels G.711 VoP, 128 msec ECAN 120 channels G.729a VoP, 64 msec ECAN 104 channels G.729ab VoP, 128 msec ECAN, T.38 Fax relay ÎEnables 2,016 channel OC-3 gateway blade with 10 chips 19 Calisto Hot Chips 2002

20 &DOLVWR Œ 2&*DWHZD\%ODGH Simple array of 10 Calisto chips Carrier-class multi-service Gateway xchange software ÎEnables 2,016 channel OC-3 packet voice gateway blade 20 Calisto Hot Chips 2002

21 &DOLVWR Œ 6XPPDU\ Multi-channel multi-service communications platform Integrated multi-processor system Balanced performance and memory Consistent real-time performance Flexible C programmable platform Efficient low-power architecture Appropriate use of parallelism Shared resources Vector DSPs reduce memory usage Low power design and implementation Tightly coupled OS and software Enables carrier-class gateways Calisto : 16 SpiceEngine DSPs 5 Control Processors Multi-channel I/O 1.75 MB Memory.13µm CMOS 166 MHz 1.2V, 1.2 Watts 239 BGA 19x19 mm Q1/2002 production 21 Calisto Hot Chips 2002

Networking Infrastructure Markets

Networking Infrastructure Markets Module Introduction PURPOSE: To discuss the target markets and applications for the MSC8101. OBJECTIVES: Describe the use of the MSC8101 in EDGE and 3G wireless baseband applications. Describe the use

More information

TNETV3010 Infrastructure VOP Gateway Solution

TNETV3010 Infrastructure VOP Gateway Solution Technology for Innovators TM Product Bulletin TNETV3010 Infrastructure VOP Gateway Solution Telogy Software Products Integrated with TI s DSP-Based Access Communication Processor Texas Instruments (TIs)

More information

ARM Processors for Embedded Applications

ARM Processors for Embedded Applications ARM Processors for Embedded Applications Roadmap for ARM Processors ARM Architecture Basics ARM Families AMBA Architecture 1 Current ARM Core Families ARM7: Hard cores and Soft cores Cache with MPU or

More information

The Nios II Family of Configurable Soft-core Processors

The Nios II Family of Configurable Soft-core Processors The Nios II Family of Configurable Soft-core Processors James Ball August 16, 2005 2005 Altera Corporation Agenda Nios II Introduction Configuring your CPU FPGA vs. ASIC CPU Design Instruction Set Architecture

More information

The S6000 Family of Processors

The S6000 Family of Processors The S6000 Family of Processors Today s Design Challenges The advent of software configurable processors In recent years, the widespread adoption of digital technologies has revolutionized the way in which

More information

MIMD Overview. Intel Paragon XP/S Overview. XP/S Usage. XP/S Nodes and Interconnection. ! Distributed-memory MIMD multicomputer

MIMD Overview. Intel Paragon XP/S Overview. XP/S Usage. XP/S Nodes and Interconnection. ! Distributed-memory MIMD multicomputer MIMD Overview Intel Paragon XP/S Overview! MIMDs in the 1980s and 1990s! Distributed-memory multicomputers! Intel Paragon XP/S! Thinking Machines CM-5! IBM SP2! Distributed-memory multicomputers with hardware

More information

General Purpose Signal Processors

General Purpose Signal Processors General Purpose Signal Processors First announced in 1978 (AMD) for peripheral computation such as in printers, matured in early 80 s (TMS320 series). General purpose vs. dedicated architectures: Pros:

More information

An Ultra High Performance Scalable DSP Family for Multimedia. Hot Chips 17 August 2005 Stanford, CA Erik Machnicki

An Ultra High Performance Scalable DSP Family for Multimedia. Hot Chips 17 August 2005 Stanford, CA Erik Machnicki An Ultra High Performance Scalable DSP Family for Multimedia Hot Chips 17 August 2005 Stanford, CA Erik Machnicki Media Processing Challenges Increasing performance requirements Need for flexibility &

More information

Classification of Semiconductor LSI

Classification of Semiconductor LSI Classification of Semiconductor LSI 1. Logic LSI: ASIC: Application Specific LSI (you have to develop. HIGH COST!) For only mass production. ASSP: Application Specific Standard Product (you can buy. Low

More information

ELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 7: Memory Organization Part II

ELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 7: Memory Organization Part II ELEC 5200/6200 Computer Architecture and Design Spring 2017 Lecture 7: Organization Part II Ujjwal Guin, Assistant Professor Department of Electrical and Computer Engineering Auburn University, Auburn,

More information

Vector IRAM: A Microprocessor Architecture for Media Processing

Vector IRAM: A Microprocessor Architecture for Media Processing IRAM: A Microprocessor Architecture for Media Processing Christoforos E. Kozyrakis kozyraki@cs.berkeley.edu CS252 Graduate Computer Architecture February 10, 2000 Outline Motivation for IRAM technology

More information

PRODUCT PREVIEW TNETV1050 IP PHONE PROCESSOR. description

PRODUCT PREVIEW TNETV1050 IP PHONE PROCESSOR. description C55x DSP Operating at 125 MHz, Providing up to 250 MIPS MIPS32 4KEc 32-Bit RISC Processor, Operating at 165 MHz, Providing up to 223 Dhrystone MIPS On-Chip Peripherals Include: External Memory Interface

More information

Course Introduction. Purpose: Objectives: Content: Learning Time:

Course Introduction. Purpose: Objectives: Content: Learning Time: Course Introduction Purpose: This course provides an overview of the Renesas SuperH series of 32-bit RISC processors, especially the microcontrollers in the SH-2 and SH-2A series Objectives: Learn the

More information

Jim Keller. Digital Equipment Corp. Hudson MA

Jim Keller. Digital Equipment Corp. Hudson MA Jim Keller Digital Equipment Corp. Hudson MA ! Performance - SPECint95 100 50 21264 30 21164 10 1995 1996 1997 1998 1999 2000 2001 CMOS 5 0.5um CMOS 6 0.35um CMOS 7 0.25um "## Continued Performance Leadership

More information

Memory Systems IRAM. Principle of IRAM

Memory Systems IRAM. Principle of IRAM Memory Systems 165 other devices of the module will be in the Standby state (which is the primary state of all RDRAM devices) or another state with low-power consumption. The RDRAM devices provide several

More information

The T0 Vector Microprocessor. Talk Outline

The T0 Vector Microprocessor. Talk Outline Slides from presentation at the Hot Chips VII conference, 15 August 1995.. The T0 Vector Microprocessor Krste Asanovic James Beck Bertrand Irissou Brian E. D. Kingsbury Nelson Morgan John Wawrzynek University

More information

SA-1500: A 300 MHz RISC CPU with Attached Media Processor*

SA-1500: A 300 MHz RISC CPU with Attached Media Processor* and Bridges Division SA-1500: A 300 MHz RISC CPU with Attached Media Processor* Prashant P. Gandhi, Ph.D. and Bridges Division Computing Enhancement Group Intel Corporation Santa Clara, CA 95052 Prashant.Gandhi@intel.com

More information

Introduction to Quality of Service

Introduction to Quality of Service Introduction to Quality of Service The use of IP as a foundation for converged networks has raised several issues for both enterprise IT departments and ISPs. IP and Ethernet are connectionless technologies

More information

CS650 Computer Architecture. Lecture 9 Memory Hierarchy - Main Memory

CS650 Computer Architecture. Lecture 9 Memory Hierarchy - Main Memory CS65 Computer Architecture Lecture 9 Memory Hierarchy - Main Memory Andrew Sohn Computer Science Department New Jersey Institute of Technology Lecture 9: Main Memory 9-/ /6/ A. Sohn Memory Cycle Time 5

More information

Media Resources CHAPTER

Media Resources CHAPTER CHAPTER 6 Last revised on: October 30, 2008 A media resource is a software-based or hardware-based entity that performs media processing functions on the data streams to which it is connected. Media processing

More information

Embedded Systems. 7. System Components

Embedded Systems. 7. System Components Embedded Systems 7. System Components Lothar Thiele 7-1 Contents of Course 1. Embedded Systems Introduction 2. Software Introduction 7. System Components 10. Models 3. Real-Time Models 4. Periodic/Aperiodic

More information

Eliminating the Last Mile Bottleneck

Eliminating the Last Mile Bottleneck Eliminating the Last Mile Bottleneck P. Michael Henderson michael.henderson@cox.net 1 Agenda Technology and Market Drivers Performance Requirements VoDSL System Architecture Closed Form Analysis Non-Closed

More information

Current and Projected Digital Complexity of DMT VDSL

Current and Projected Digital Complexity of DMT VDSL June 1, 1999 1 Standards Project: T1E1.4:99-268 VDSL Title: Current and Projected Digital Complexity of DMT VDSL Source: Texas Instruments Author: C. S. Modlin J. S. Chow Texas Instruments 2043 Samaritan

More information

4. Hardware Platform: Real-Time Requirements

4. Hardware Platform: Real-Time Requirements 4. Hardware Platform: Real-Time Requirements Contents: 4.1 Evolution of Microprocessor Architecture 4.2 Performance-Increasing Concepts 4.3 Influences on System Architecture 4.4 A Real-Time Hardware Architecture

More information

4. Networks. in parallel computers. Advances in Computer Architecture

4. Networks. in parallel computers. Advances in Computer Architecture 4. Networks in parallel computers Advances in Computer Architecture System architectures for parallel computers Control organization Single Instruction stream Multiple Data stream (SIMD) All processors

More information

Latches. IT 3123 Hardware and Software Concepts. Registers. The Little Man has Registers. Data Registers. Program Counter

Latches. IT 3123 Hardware and Software Concepts. Registers. The Little Man has Registers. Data Registers. Program Counter IT 3123 Hardware and Software Concepts Notice: This session is being recorded. CPU and Memory June 11 Copyright 2005 by Bob Brown Latches Can store one bit of data Can be ganged together to store more

More information

Routers Technologies & Evolution for High-Speed Networks

Routers Technologies & Evolution for High-Speed Networks Routers Technologies & Evolution for High-Speed Networks C. Pham Université de Pau et des Pays de l Adour http://www.univ-pau.fr/~cpham Congduc.Pham@univ-pau.fr Router Evolution slides from Nick McKeown,

More information

Tandem Switch Application (VoIP Mode)

Tandem Switch Application (VoIP Mode) CHAPTER 4 This chapter provides a general description of an application in which a VISM equipped Cisco MGX 8850 and a Cisco VSC 3000 Virtual Switch Controller (Call Agent) combine to provide the functions

More information

Programmable Logic Design Grzegorz Budzyń Lecture. 15: Advanced hardware in FPGA structures

Programmable Logic Design Grzegorz Budzyń Lecture. 15: Advanced hardware in FPGA structures Programmable Logic Design Grzegorz Budzyń Lecture 15: Advanced hardware in FPGA structures Plan Introduction PowerPC block RocketIO Introduction Introduction The larger the logical chip, the more additional

More information

Vertex Shader Design I

Vertex Shader Design I The following content is extracted from the paper shown in next page. If any wrong citation or reference missing, please contact ldvan@cs.nctu.edu.tw. I will correct the error asap. This course used only

More information

A 400Gbps Multi-Core Network Processor

A 400Gbps Multi-Core Network Processor A 400Gbps Multi-Core Network Processor James Markevitch, Srinivasa Malladi Cisco Systems August 22, 2017 Legal THE INFORMATION HEREIN IS PROVIDED ON AN AS IS BASIS, WITHOUT ANY WARRANTIES OR REPRESENTATIONS,

More information

Buses. Disks PCI RDRAM RDRAM LAN. Some slides adapted from lecture by David Culler. Pentium 4 Processor. Memory Controller Hub.

Buses. Disks PCI RDRAM RDRAM LAN. Some slides adapted from lecture by David Culler. Pentium 4 Processor. Memory Controller Hub. es > 100 MB/sec Pentium 4 Processor L1 and L2 caches Some slides adapted from lecture by David Culler 3.2 GB/sec Display Memory Controller Hub RDRAM RDRAM Dual Ultra ATA/100 24 Mbit/sec Disks LAN I/O Controller

More information

With Fixed Point or Floating Point Processors!!

With Fixed Point or Floating Point Processors!! Product Information Sheet High Throughput Digital Signal Processor OVERVIEW With Fixed Point or Floating Point Processors!! Performance Up to 14.4 GIPS or 7.7 GFLOPS Peak Processing Power Continuous Input

More information

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Computer Organization and Structure. Bing-Yu Chen National Taiwan University Computer Organization and Structure Bing-Yu Chen National Taiwan University Storage and Other I/O Topics I/O Performance Measures Types and Characteristics of I/O Devices Buses Interfacing I/O Devices

More information

SurfExpress/PCIe TM. Modular PCI Express DSP Multimedia Processing Board for Enterprise and CTI Applications. Overview.

SurfExpress/PCIe TM. Modular PCI Express DSP Multimedia Processing Board for Enterprise and CTI Applications. Overview. S U R F Main Features» PCI Express (PCIe) form-factor farm with 2x Gigabit Ethernet ports and CT bus» Complete media processing package for audio, video, modem and fax» Flexible and scalable modular design

More information

The Alpha Microprocessor: Out-of-Order Execution at 600 Mhz. R. E. Kessler COMPAQ Computer Corporation Shrewsbury, MA

The Alpha Microprocessor: Out-of-Order Execution at 600 Mhz. R. E. Kessler COMPAQ Computer Corporation Shrewsbury, MA The Alpha 21264 Microprocessor: Out-of-Order ution at 600 Mhz R. E. Kessler COMPAQ Computer Corporation Shrewsbury, MA 1 Some Highlights z Continued Alpha performance leadership y 600 Mhz operation in

More information

POWER7: IBM's Next Generation Server Processor

POWER7: IBM's Next Generation Server Processor POWER7: IBM's Next Generation Server Processor Acknowledgment: This material is based upon work supported by the Defense Advanced Research Projects Agency under its Agreement No. HR0011-07-9-0002 Outline

More information

Dialogic Diva Analog Media Boards by Sangoma

Dialogic Diva Analog Media Boards by Sangoma Dialogic Diva Analog Media Boards by Sangoma The Dialogic Diva Analog Media Boards provide two, four, and eight ports and serve as an excellent communication platform, which scales from 2 to 64 channels

More information

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02

Fujitsu System Applications Support. Fujitsu Microelectronics America, Inc. 02/02 Fujitsu System Applications Support 1 Overview System Applications Support SOC Application Development Lab Multimedia VoIP Wireless Bluetooth Processors, DSP and Peripherals ARM Reference Platform 2 SOC

More information

Organic Computing. Dr. rer. nat. Christophe Bobda Prof. Dr. Rolf Wanka Department of Computer Science 12 Hardware-Software-Co-Design

Organic Computing. Dr. rer. nat. Christophe Bobda Prof. Dr. Rolf Wanka Department of Computer Science 12 Hardware-Software-Co-Design Dr. rer. nat. Christophe Bobda Prof. Dr. Rolf Wanka Department of Computer Science 12 Hardware-Software-Co-Design 1 Reconfigurable Computing Platforms 2 The Von Neumann Computer Principle In 1945, the

More information

Implementation of DSP Algorithms

Implementation of DSP Algorithms Implementation of DSP Algorithms Main frame computers Dedicated (application specific) architectures Programmable digital signal processors voice band data modem speech codec 1 PDSP and General-Purpose

More information

Lecture 15: Multiplexing (2)

Lecture 15: Multiplexing (2) Lecture 15: Multiplexing (2) Last Lecture Multiplexing (1) Source: chapter 8 This Lecture Multiplexing (2) Source: chapter8 Next Lecture Circuit switching (1) Source: chapter9 Digital Carrier Systems Hierarchy

More information

ECE 1160/2160 Embedded Systems Design. Midterm Review. Wei Gao. ECE 1160/2160 Embedded Systems Design

ECE 1160/2160 Embedded Systems Design. Midterm Review. Wei Gao. ECE 1160/2160 Embedded Systems Design ECE 1160/2160 Embedded Systems Design Midterm Review Wei Gao ECE 1160/2160 Embedded Systems Design 1 Midterm Exam When: next Monday (10/16) 4:30-5:45pm Where: Benedum G26 15% of your final grade What about:

More information

Vector Architectures Vs. Superscalar and VLIW for Embedded Media Benchmarks

Vector Architectures Vs. Superscalar and VLIW for Embedded Media Benchmarks Vector Architectures Vs. Superscalar and VLIW for Embedded Media Benchmarks Christos Kozyrakis Stanford University David Patterson U.C. Berkeley http://csl.stanford.edu/~christos Motivation Ideal processor

More information

PowerPC 740 and 750

PowerPC 740 and 750 368 floating-point registers. A reorder buffer with 16 elements is used as well to support speculative execution. The register file has 12 ports. Although instructions can be executed out-of-order, in-order

More information

ConnX D2 DSP Engine. A Flexible 2-MAC DSP. Dual-MAC, 16-bit Fixed-Point Communications DSP PRODUCT BRIEF FEATURES BENEFITS. ConnX D2 DSP Engine

ConnX D2 DSP Engine. A Flexible 2-MAC DSP. Dual-MAC, 16-bit Fixed-Point Communications DSP PRODUCT BRIEF FEATURES BENEFITS. ConnX D2 DSP Engine PRODUCT BRIEF ConnX D2 DSP Engine Dual-MAC, 16-bit Fixed-Point Communications DSP FEATURES BENEFITS Both SIMD and 2-way FLIX (parallel VLIW) operations Optimized, vectorizing XCC Compiler High-performance

More information

The Alpha Microprocessor: Out-of-Order Execution at 600 MHz. Some Highlights

The Alpha Microprocessor: Out-of-Order Execution at 600 MHz. Some Highlights The Alpha 21264 Microprocessor: Out-of-Order ution at 600 MHz R. E. Kessler Compaq Computer Corporation Shrewsbury, MA 1 Some Highlights Continued Alpha performance leadership 600 MHz operation in 0.35u

More information

Storage I/O Summary. Lecture 16: Multimedia and DSP Architectures

Storage I/O Summary. Lecture 16: Multimedia and DSP Architectures Storage I/O Summary Storage devices Storage I/O Performance Measures» Throughput» Response time I/O Benchmarks» Scaling to track technological change» Throughput with restricted response time is normal

More information

Virtex-II Architecture. Virtex II technical, Design Solutions. Active Interconnect Technology (continued)

Virtex-II Architecture. Virtex II technical, Design Solutions. Active Interconnect Technology (continued) Virtex-II Architecture SONET / SDH Virtex II technical, Design Solutions PCI-X PCI DCM Distri RAM 18Kb BRAM Multiplier LVDS FIFO Shift Registers BLVDS SDRAM QDR SRAM Backplane Rev 4 March 4th. 2002 J-L

More information

1. Microprocessor Architectures. 1.1 Intel 1.2 Motorola

1. Microprocessor Architectures. 1.1 Intel 1.2 Motorola 1. Microprocessor Architectures 1.1 Intel 1.2 Motorola 1.1 Intel The Early Intel Microprocessors The first microprocessor to appear in the market was the Intel 4004, a 4-bit data bus device. This device

More information

Chapter 5B. Large and Fast: Exploiting Memory Hierarchy

Chapter 5B. Large and Fast: Exploiting Memory Hierarchy Chapter 5B Large and Fast: Exploiting Memory Hierarchy One Transistor Dynamic RAM 1-T DRAM Cell word access transistor V REF TiN top electrode (V REF ) Ta 2 O 5 dielectric bit Storage capacitor (FET gate,

More information

A Scalable Multiprocessor for Real-time Signal Processing

A Scalable Multiprocessor for Real-time Signal Processing A Scalable Multiprocessor for Real-time Signal Processing Daniel Scherrer, Hans Eberle Institute for Computer Systems, Swiss Federal Institute of Technology CH-8092 Zurich, Switzerland {scherrer, eberle}@inf.ethz.ch

More information

Digital Signal Processor 2010/1/4

Digital Signal Processor 2010/1/4 Digital Signal Processor 1 Analog to Digital Shift 2 Digital Signal Processing Applications FAX Phone Personal Computer Medical Instruments DVD player Air conditioner (controller) Digital Camera MP3 audio

More information

CS 856 Latency in Communication Systems

CS 856 Latency in Communication Systems CS 856 Latency in Communication Systems Winter 2010 Latency Challenges CS 856, Winter 2010, Latency Challenges 1 Overview Sources of Latency low-level mechanisms services Application Requirements Latency

More information

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security Input/Output Today Principles of I/O hardware & software I/O software layers Disks Next Protection & Security Operating Systems and I/O Two key operating system goals Control I/O devices Provide a simple,

More information

Main Memory Systems. Department of Electrical Engineering Stanford University Lecture 5-1

Main Memory Systems. Department of Electrical Engineering Stanford University   Lecture 5-1 Lecture 5 Main Memory Systems Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee282 Lecture 5-1 Announcements If you don t have a group of 3, contact us ASAP HW-1 is

More information

Chapter 8: Multiplexing

Chapter 8: Multiplexing NET 456 High Speed Networks Chapter 8: Multiplexing Dr. Anis Koubaa Reformatted slides from textbook Data and Computer Communications, Ninth Edition by William Stallings, 1 (c) Pearson Education - Prentice

More information

1/29/2008. From Signals to Packets. Lecture 6 Datalink Framing, Switching. Datalink Functions. Datalink Lectures. Character and Bit Stuffing.

1/29/2008. From Signals to Packets. Lecture 6 Datalink Framing, Switching. Datalink Functions. Datalink Lectures. Character and Bit Stuffing. /9/008 From Signals to Packets Lecture Datalink Framing, Switching Peter Steenkiste Departments of Computer Science and Electrical and Computer Engineering Carnegie Mellon University Analog Signal Digital

More information

Meet in the Middle: Leveraging Optical Interconnection Opportunities in Chip Multi Processors

Meet in the Middle: Leveraging Optical Interconnection Opportunities in Chip Multi Processors Meet in the Middle: Leveraging Optical Interconnection Opportunities in Chip Multi Processors Sandro Bartolini* Department of Information Engineering, University of Siena, Italy bartolini@dii.unisi.it

More information

Commercial Network Processors

Commercial Network Processors Commercial Network Processors ECE 697J December 5 th, 2002 ECE 697J 1 AMCC np7250 Network Processor Presenter: Jinghua Hu ECE 697J 2 AMCC np7250 Released in April 2001 Packet and cell processing Full-duplex

More information

Performance Tuning on the Blackfin Processor

Performance Tuning on the Blackfin Processor 1 Performance Tuning on the Blackfin Processor Outline Introduction Building a Framework Memory Considerations Benchmarks Managing Shared Resources Interrupt Management An Example Summary 2 Introduction

More information

Tutorial Introduction

Tutorial Introduction Tutorial Introduction PURPOSE: This tutorial describes the key features of the DSP56300 family of processors. OBJECTIVES: Describe the main features of the DSP 24-bit core. Identify the features and functions

More information

ASSEMBLY LANGUAGE MACHINE ORGANIZATION

ASSEMBLY LANGUAGE MACHINE ORGANIZATION ASSEMBLY LANGUAGE MACHINE ORGANIZATION CHAPTER 3 1 Sub-topics The topic will cover: Microprocessor architecture CPU processing methods Pipelining Superscalar RISC Multiprocessing Instruction Cycle Instruction

More information

An Evaluation of an Energy Efficient Many-Core SoC with Parallelized Face Detection

An Evaluation of an Energy Efficient Many-Core SoC with Parallelized Face Detection An Evaluation of an Energy Efficient Many-Core SoC with Parallelized Face Detection Hiroyuki Usui, Jun Tanabe, Toru Sano, Hui Xu, and Takashi Miyamori Toshiba Corporation, Kawasaki, Japan Copyright 2013,

More information

Hybrid On-chip Data Networks. Gilbert Hendry Keren Bergman. Lightwave Research Lab. Columbia University

Hybrid On-chip Data Networks. Gilbert Hendry Keren Bergman. Lightwave Research Lab. Columbia University Hybrid On-chip Data Networks Gilbert Hendry Keren Bergman Lightwave Research Lab Columbia University Chip-Scale Interconnection Networks Chip multi-processors create need for high performance interconnects

More information

Computer and Hardware Architecture I. Benny Thörnberg Associate Professor in Electronics

Computer and Hardware Architecture I. Benny Thörnberg Associate Professor in Electronics Computer and Hardware Architecture I Benny Thörnberg Associate Professor in Electronics Hardware architecture Computer architecture The functionality of a modern computer is so complex that no human can

More information

Linköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing

Linköping University Post Print. epuma: a novel embedded parallel DSP platform for predictable computing Linköping University Post Print epuma: a novel embedded parallel DSP platform for predictable computing Jian Wang, Joar Sohl, Olof Kraigher and Dake Liu N.B.: When citing this work, cite the original article.

More information

Advance CPU Design. MMX technology. Computer Architectures. Tien-Fu Chen. National Chung Cheng Univ. ! Basic concepts

Advance CPU Design. MMX technology. Computer Architectures. Tien-Fu Chen. National Chung Cheng Univ. ! Basic concepts Computer Architectures Advance CPU Design Tien-Fu Chen National Chung Cheng Univ. Adv CPU-0 MMX technology! Basic concepts " small native data types " compute-intensive operations " a lot of inherent parallelism

More information

Embedded Management Functions. Signal backhaul xgcp (CAS), Q.931 (PRI) * Call setup/ teardown xgcp * Connection Handling *

Embedded Management Functions. Signal backhaul xgcp (CAS), Q.931 (PRI) * Call setup/ teardown xgcp * Connection Handling * CHAPTER 5 This chapter describes VISM operation. Figure 5-1 shows the functional blocks of VISM. Items marked with a single asterisk apply to VoIP switching and switched AAL2 PVC modes only. Items marked

More information

Lecture #6 Multiplexing and Switching

Lecture #6 Multiplexing and Switching SPRING 2015 Integrated Technical Education Cluster At AlAmeeria E-626-A Data Communication and Industrial Networks (DC-IN) Lecture #6 Multiplexing and Switching Instructor: Dr. Ahmad El-Banna 1 Agenda

More information

SwitchX Virtual Protocol Interconnect (VPI) Switch Architecture

SwitchX Virtual Protocol Interconnect (VPI) Switch Architecture SwitchX Virtual Protocol Interconnect (VPI) Switch Architecture 2012 MELLANOX TECHNOLOGIES 1 SwitchX - Virtual Protocol Interconnect Solutions Server / Compute Switch / Gateway Virtual Protocol Interconnect

More information

Lecture 18: DRAM Technologies

Lecture 18: DRAM Technologies Lecture 18: DRAM Technologies Last Time: Cache and Virtual Memory Review Today DRAM organization or, why is DRAM so slow??? Lecture 18 1 Main Memory = DRAM Lecture 18 2 Basic DRAM Architecture Lecture

More information

Cell Switching (ATM) Commonly transmitted over SONET other physical layers possible. Variable vs Fixed-Length Packets

Cell Switching (ATM) Commonly transmitted over SONET other physical layers possible. Variable vs Fixed-Length Packets Cell Switching (ATM) Connection-oriented packet-switched network Used in both WAN and LAN settings Signaling (connection setup) Protocol: Q2931 Specified by ATM forum Packets are called cells 5-byte header

More information

2008/12/23. System Arch 2008 (Fire Tom Wada) 1

2008/12/23. System Arch 2008 (Fire Tom Wada) 1 Digital it Signal Processor System Arch 2008 (Fire Tom Wada) 1 Analog to Digital Shift System Arch 2008 (Fire Tom Wada) 2 Digital Signal Processing Applications FAX Phone Personal Computer Medical Instruments

More information

Computer Architecture CS 355 Busses & I/O System

Computer Architecture CS 355 Busses & I/O System Computer Architecture CS 355 Busses & I/O System Text: Computer Organization & Design, Patterson & Hennessy Chapter 6.5-6.6 Objectives: During this class the student shall learn to: Describe the two basic

More information

What Are The Main Differences Between Program Counter Pc And Instruction Register Ir

What Are The Main Differences Between Program Counter Pc And Instruction Register Ir What Are The Main Differences Between Program Counter Pc And Instruction Register Ir and register-based instructions - Anatomy on a CPU - Program Counter (PC): holds memory address of next instruction

More information

A Media-Enhanced Vector Architecture for Embedded Memory Systems

A Media-Enhanced Vector Architecture for Embedded Memory Systems A Media-Enhanced Vector Architecture for Embedded Memory Systems Christoforos Kozyrakis Report No. UCB/CSD-99-1059 July 1999 Computer Science Division (EECS) University of California Berkeley, California

More information

CENG3420 Lecture 08: Memory Organization

CENG3420 Lecture 08: Memory Organization CENG3420 Lecture 08: Memory Organization Bei Yu byu@cse.cuhk.edu.hk (Latest update: February 22, 2018) Spring 2018 1 / 48 Overview Introduction Random Access Memory (RAM) Interleaving Secondary Memory

More information

Toward a unified architecture for LAN/WAN/WLAN/SAN switches and routers

Toward a unified architecture for LAN/WAN/WLAN/SAN switches and routers Toward a unified architecture for LAN/WAN/WLAN/SAN switches and routers Silvano Gai 1 The sellable HPSR Seamless LAN/WLAN/SAN/WAN Network as a platform System-wide network intelligence as platform for

More information

POWER7: IBM's Next Generation Server Processor

POWER7: IBM's Next Generation Server Processor Hot Chips 21 POWER7: IBM's Next Generation Server Processor Ronald Kalla Balaram Sinharoy POWER7 Chief Engineer POWER7 Chief Core Architect Acknowledgment: This material is based upon work supported by

More information

Chapter 8. A Typical collection of I/O devices. Interrupts. Processor. Cache. Memory I/O bus. I/O controller I/O I/O. Main memory.

Chapter 8. A Typical collection of I/O devices. Interrupts. Processor. Cache. Memory I/O bus. I/O controller I/O I/O. Main memory. Chapter 8 1 A Typical collection of I/O devices Interrupts Cache I/O bus Main memory I/O controller I/O controller I/O controller Disk Disk Graphics output Network 2 1 Interfacing s and Peripherals I/O

More information

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info. A FPGA based development platform as part of an EDK is available to target intelop provided IPs or other standard IPs. The platform with Virtex-4 FX12 Evaluation Kit provides a complete hardware environment

More information

Smart ATA Product Bulletin

Smart ATA Product Bulletin Smart ATA Product Bulletin Smart ATA is an ATA that not only offers the service provider a high-function voicefax ATA with FXO capability, but also includes patented technology (US patent 9,094,419) that

More information

Embedded Computing Platform. Architecture and Instruction Set

Embedded Computing Platform. Architecture and Instruction Set Embedded Computing Platform Microprocessor: Architecture and Instruction Set Ingo Sander ingo@kth.se Microprocessor A central part of the embedded platform A platform is the basic hardware and software

More information

The Network Processor Revolution

The Network Processor Revolution The Network Processor Revolution Fast Pattern Matching and Routing at OC-48 David Kramer Senior Design/Architect Market Segments Optical Mux Optical Core DWDM Ring OC 192 to OC 768 Optical Mux Carrier

More information

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE) UNIT-I

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE) UNIT-I SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road 517583 QUESTION BANK (DESCRIPTIVE) Subject with Code : CO (16MC802) Year & Sem: I-MCA & I-Sem Course & Branch: MCA Regulation:

More information

EEM870 Embedded System and Experiment Lecture 3: ARM Processor Architecture

EEM870 Embedded System and Experiment Lecture 3: ARM Processor Architecture EEM870 Embedded System and Experiment Lecture 3: ARM Processor Architecture Wen-Yen Lin, Ph.D. Department of Electrical Engineering Chang Gung University Email: wylin@mail.cgu.edu.tw March 2014 Agenda

More information

PCI Express: Evolution, Deployment and Challenges

PCI Express: Evolution, Deployment and Challenges PCI Express: Evolution, Deployment and Challenges Nick Ma 马明辉 Field Applications Engineer, PLX Freescale Technology Forum, Beijing Track: Enabling Technologies Freescale Technology Forum, Beijing - November

More information

Introduction to Input and Output

Introduction to Input and Output Introduction to Input and Output The I/O subsystem provides the mechanism for communication between the CPU and the outside world (I/O devices). Design factors: I/O device characteristics (input, output,

More information

Embedded Systems. 8. Hardware Components. Lothar Thiele. Computer Engineering and Networks Laboratory

Embedded Systems. 8. Hardware Components. Lothar Thiele. Computer Engineering and Networks Laboratory Embedded Systems 8. Hardware Components Lothar Thiele Computer Engineering and Networks Laboratory Do you Remember? 8 2 8 3 High Level Physical View 8 4 High Level Physical View 8 5 Implementation Alternatives

More information

ECE7995 (4) Basics of Memory Hierarchy. [Adapted from Mary Jane Irwin s slides (PSU)]

ECE7995 (4) Basics of Memory Hierarchy. [Adapted from Mary Jane Irwin s slides (PSU)] ECE7995 (4) Basics of Memory Hierarchy [Adapted from Mary Jane Irwin s slides (PSU)] Major Components of a Computer Processor Devices Control Memory Input Datapath Output Performance Processor-Memory Performance

More information

A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications

A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications Ju-Ho Sohn, Jeong-Ho Woo, Min-Wuk Lee, Hye-Jung Kim, Ramchan Woo, Hoi-Jun Yoo Semiconductor System

More information

Universität Dortmund. ARM Architecture

Universität Dortmund. ARM Architecture ARM Architecture The RISC Philosophy Original RISC design (e.g. MIPS) aims for high performance through o reduced number of instruction classes o large general-purpose register set o load-store architecture

More information

The ARM10 Family of Advanced Microprocessor Cores

The ARM10 Family of Advanced Microprocessor Cores The ARM10 Family of Advanced Microprocessor Cores Stephen Hill ARM Austin Design Center 1 Agenda Design overview Microarchitecture ARM10 o o Memory System Interrupt response 3. Power o o 4. VFP10 ETM10

More information

Product Technical Brief S3C2440X Series Rev 2.0, Oct. 2003

Product Technical Brief S3C2440X Series Rev 2.0, Oct. 2003 Product Technical Brief S3C2440X Series Rev 2.0, Oct. 2003 S3C2440X is a derivative product of Samsung s S3C24XXX family of microprocessors for mobile communication market. The S3C2440X s main enhancement

More information

The Memory Hierarchy & Cache

The Memory Hierarchy & Cache Removing The Ideal Memory Assumption: The Memory Hierarchy & Cache The impact of real memory on CPU Performance. Main memory basic properties: Memory Types: DRAM vs. SRAM The Motivation for The Memory

More information

4.1 Introduction 4.3 Datapath 4.4 Control 4.5 Pipeline overview 4.6 Pipeline control * 4.7 Data hazard & forwarding * 4.

4.1 Introduction 4.3 Datapath 4.4 Control 4.5 Pipeline overview 4.6 Pipeline control * 4.7 Data hazard & forwarding * 4. Chapter 4: CPU 4.1 Introduction 4.3 Datapath 4.4 Control 4.5 Pipeline overview 4.6 Pipeline control * 4.7 Data hazard & forwarding * 4.8 Control hazard 4.14 Concluding Rem marks Hazards Situations that

More information

Hercules ARM Cortex -R4 System Architecture. Processor Overview

Hercules ARM Cortex -R4 System Architecture. Processor Overview Hercules ARM Cortex -R4 System Architecture Processor Overview What is Hercules? TI s 32-bit ARM Cortex -R4/R5 MCU family for Industrial, Automotive, and Transportation Safety Hardware Safety Features

More information

Reconfigurable Cell Array for DSP Applications

Reconfigurable Cell Array for DSP Applications Outline econfigurable Cell Array for DSP Applications Chenxin Zhang Department of Electrical and Information Technology Lund University, Sweden econfigurable computing Coarse-grained reconfigurable cell

More information

Parallel Computer Architectures. Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam

Parallel Computer Architectures. Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam Parallel Computer Architectures Lectured by: Phạm Trần Vũ Prepared by: Thoại Nam Outline Flynn s Taxonomy Classification of Parallel Computers Based on Architectures Flynn s Taxonomy Based on notions of

More information