Computer Organization

Similar documents
Computer Organization

Chapter 18 Parallel Processing

Memory Systems in Pipelined Processors

CSCI 4717 Computer Architecture

Parallel Processing. Computer Architecture. Computer Architecture. Outline. Multiple Processor Organization

Multi-Processor / Parallel Processing

SMP and ccnuma Multiprocessor Systems. Sharing of Resources in Parallel and Distributed Computing Systems

Shared Memory Architecture Part One

Lecture 9: MIMD Architectures

Lecture 9: MIMD Architectures

10 Parallel Organizations: Multiprocessor / Multicore / Multicomputer Systems

Multiple Processor Systems. Lecture 15 Multiple Processor Systems. Multiprocessor Hardware (1) Multiprocessors. Multiprocessor Hardware (2)

WHY PARALLEL PROCESSING? (CE-401)

Organisasi Sistem Komputer

Computer Organization. Chapter 16

Computer Organization

SMD149 - Operating Systems - Multiprocessing

Overview. SMD149 - Operating Systems - Multiprocessing. Multiprocessing architecture. Introduction SISD. Flynn s taxonomy

Non-uniform memory access machine or (NUMA) is a system where the memory access time to any region of memory is not the same for all processors.

Chapter 18. Parallel Processing. Yonsei University

Computer Organization

Lecture 9: MIMD Architecture

Computer Organization

MULTIPROCESSORS. Characteristics of Multiprocessors. Interconnection Structures. Interprocessor Arbitration

Generic Model of I/O Module Interface to CPU and Memory Interface to one or more peripherals

Computer Organization

INSTITUTO SUPERIOR TÉCNICO. Architectures for Embedded Computing

Distributed Systems. Thoai Nam Faculty of Computer Science and Engineering HCMC University of Technology

Chapter Seven Morgan Kaufmann Publishers

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE) UNIT-I

Comp. Org II, Spring

Computer parallelism Flynn s categories

Serial. Parallel. CIT 668: System Architecture 2/14/2011. Topics. Serial and Parallel Computation. Parallel Computing

Multiprocessors and Thread-Level Parallelism. Department of Electrical & Electronics Engineering, Amrita School of Engineering

Parallel Processing & Multicore computers

Shared Memory and Distributed Multiprocessing. Bhanu Kapoor, Ph.D. The Saylor Foundation

Comp. Org II, Spring

Multiprocessor Systems Continuous need for faster computers Multiprocessors: shared memory model, access time nanosec (ns) Multicomputers: message pas

Operating Systems, Fall Lecture 9, Tiina Niklander 1

Computer Architecture

Operating Systems: Internals and Design Principles, 7/E William Stallings. Chapter 1 Computer System Overview

Computer Organization

Operating Systems: Internals and Design Principles. Chapter 1 Computer System Overview Seventh Edition By William Stallings

Chapter 7. Multicores, Multiprocessors, and Clusters. Goal: connecting multiple computers to get higher performance

Computer Systems Architecture

Convergence of Parallel Architecture

Chapter 17 - Parallel Processing

Computer Systems Architecture

EN164: Design of Computing Systems Lecture 34: Misc Multi-cores and Multi-processors

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers

Spring 2011 Parallel Computer Architecture Lecture 4: Multi-core. Prof. Onur Mutlu Carnegie Mellon University

Multiple Issue and Static Scheduling. Multiple Issue. MSc Informatics Eng. Beyond Instruction-Level Parallelism

Dr e v prasad Dt

MULTIPROCESSORS AND THREAD-LEVEL. B649 Parallel Architectures and Programming

MULTIPROCESSORS AND THREAD-LEVEL PARALLELISM. B649 Parallel Architectures and Programming

Part I Overview Chapter 1: Introduction

Multiprocessor Synchronization

Computer Organization ECE514. Chapter 5 Input/Output (9hrs)

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers

East Tennessee State University Department of Computer and Information Sciences CSCI 4717 Computer Architecture TEST 2 for Fall Semester, 2007

Online Course Evaluation. What we will do in the last week?

Dheeraj Bhardwaj May 12, 2003

6.1 Multiprocessor Computing Environment

Lecture 24: Virtual Memory, Multiprocessors

Multiprocessors & Thread Level Parallelism

Lecture 18: Coherence Protocols. Topics: coherence protocols for symmetric and distributed shared-memory multiprocessors (Sections

Parallel Architectures

CS 590: High Performance Computing. Parallel Computer Architectures. Lab 1 Starts Today. Already posted on Canvas (under Assignment) Let s look at it

2. Parallel Architectures

EN164: Design of Computing Systems Topic 08: Parallel Processor Design (introduction)

Introduction. Chapter 1

DISTRIBUTED SHARED MEMORY

William Stallings Computer Organization and Architecture. Chapter 11 CPU Structure and Function

COMPUTER ORGANISATION CHAPTER 1 BASIC STRUCTURE OF COMPUTERS

Objective. We will study software systems that permit applications programs to exploit the power of modern high-performance computers.

Parallel Computing Platforms. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

Course Description: This course includes concepts of instruction set architecture,

Design with Microprocessors

Chapter 1 Computer System Overview

Multiprocessing and Scalability. A.R. Hurson Computer Science and Engineering The Pennsylvania State University

Module 5 Introduction to Parallel Processing Systems

CPU Structure and Function. Chapter 12, William Stallings Computer Organization and Architecture 7 th Edition

Chapter 2: Computer-System Structures. Hmm this looks like a Computer System?

Uniprocessor Computer Architecture Example: Cray T3E

Portland State University ECE 588/688. Directory-Based Cache Coherence Protocols

Parallel Computing Platforms

Honorary Professor Supercomputer Education and Research Centre Indian Institute of Science, Bangalore

Alternate definition: Instruction Set Architecture (ISA) What is Computer Architecture? Computer Organization. Computer structure: Von Neumann model

CSE 392/CS 378: High-performance Computing - Principles and Practice

Computer and Information Sciences College / Computer Science Department CS 207 D. Computer Architecture. Lecture 9: Multiprocessors

Unit 3 and Unit 4: Chapter 4 INPUT/OUTPUT ORGANIZATION

3/24/2014 BIT 325 PARALLEL PROCESSING ASSESSMENT. Lecture Notes:

Keywords and Review Questions

Computer Organization. Submitted By: Dalvir Hooda

Number of processing elements (PEs). Computing power of each element. Amount of physical memory used. Data access, Communication and Synchronization

Handout 3 Multiprocessor and thread level parallelism

Computer-System Organization (cont.)

Chapter 1: Perspectives

SRM ARTS AND SCIENCE COLLEGE SRM NAGAR, KATTANKULATHUR

Transcription:

University of Pune S.E. I.T. Subject code: 214442 Computer Organization Part 44 Cluster Processors, UMA, NUMA UNIT VI Tushar B. Kute, Department of Information Technology, Sandip Institute of Technology & Research Centre, Nashik. http://tusharkute.com Clusters Computer cluster is a group of linked computers, working together closely so that in many respect they form a single computer. The components of a cluster are commonly but not always connected to each other through fast LAN. Computer means a system that run its own, a part from the cluster. Such a computer in cluster is typically referred as a node.

Advantages of clustering Absolute scalability Incremental scalability High availability Cost effective Cluster configurations

Cluster configurations Homogenous clusters Every single node is exactly the same,

Heterogeneous Cluster Made from different kinds of computers. For example: a few Sun SPARC station IPXs, a few Intel 486 machines, and a DEC alpha. Made from different machines in the same architecture family. For example: a collection of Intel boxes where the machines are of different generations such as mixture of 486, Pentium I, and Pentium II. Operating System Design Issues Failure management Load balancing Parallelizing computation Parallelizing compiler Parallelized applications Parametric computing

Cluster Computer Architecture Cluster middleware services and functions Single entry point Single file hierarchy Single control unit Single virtual networking Single memory space Single job management system Single I/O space Single Process Space Check Pointing Process Migration

Comparison Uniform Memory Access It is a shared memory architecture used in parallel computers. All the processors in the UMA model share physical memory uniformly. In a UMA architecture, access time to memory location is independent of which processor makes the request or which memory chip contains the transferred data.

Types of UMA UMA using bus-based SMP architectures UMA using crossbar switches UMA using multistage switching networks Example: UMA

Non-Uniform Memory Access It is a computer memory design used in multiprocessors, where the memory access time depends on the memory location relative to a processor. Under NUMA, a processor can access its own local memory faster than non-local memory, that is, memory local to another processor or memory shared between processors.

Cache Coherence NUMA The system runs only one OS and shows only a single memory image to the user even though the memory is physically distributed over processors. Single processors can access their own memory much faster than that of other processors, the memory access is nonuniform. CC-NUMA

Vector Processing It is a CPU design where the instruction set includes operations that can perform mathematical operations on multiple data elements simultaneously. This is in contrast to scalar processor which handles one element at a time using multiple instructions. Examples and Applications Radar and Signal processing for detection of space/underwater targets. Remote sensing for earth resource exploration. Computational wind tunnel experiments. 3D stop action computer assisted tomography. Weather forecasting Medical diagnosis

Vector Processing Approach Instead of pipelining just the instructions, they also pipeline the data itself. They are fed instructions that say not just to add A to B, but Illustrations Programming language Execute this loop for 10 times Read the next instruction and decode it Fetch first number Fetch second number Add them Put the result here End loop Vector Processing Read instructions and decode it. Fetch 10 numbers Fetch 10 numbers Add them Put the results here

Vector computations Pipelined ALU Parallel ALU Parallel Processors Pipelined ALU

Bus Arbitration The device that is allowed to initiate data transfers on the bus at any given time is called bus master. There may be more than one bus master such as processor, DMA controller etc. They share the system bus. When the current master relinquishes control of the bus, another master acquire control of bus. Bus arbitration is the process by which the next device to become the bus master is selected and bus mastership is transferred to it. the selection of bus master is usually done on the priority basis. Centralized arbitration A single bus arbiter performs the required arbitration. The bus arbiter may be the processor or a separate controller connected to the bus. Methods: Daisy chaining Polling Independent request

Daisy chaining Polling

Independent request References Computer Architecture and Organization By A. P. Godse (from books.google.com ) Computer Organization By Hamacher and Zaky Computer Organization and Architecture By William Stallings