CS61C : Machine Structures

Similar documents
Review. Motivation for Input/Output. What do we need to make I/O work?

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 36: IO Basics. Instructor: Dan Garcia h<p://inst.eecs.berkeley.

Review. Manage memory to disk? Treat as cache. Lecture #26 Virtual Memory II & I/O Intro

CS61C : Machine Structures

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 36: IO Basics

61C In the News. Review. Memory Management Today. Impact of Paging on AMAT

ECE331: Hardware Organization and Design

CS 61C: Great Ideas in Computer Architecture. Input/Output

October, Saeid Nooshabadi. Overview COMP 3221

ECE232: Hardware Organization and Design

Page 1, 5/4/99 8:22 PM

CS61C : Machine Structures

Systems Architecture II

Raspberry Pi ($40 on Amazon)

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Opera&ng Systems, Interrupts, Virtual Memory

CPS104 Computer Organization and Programming Lecture 17: Interrupts and Exceptions. Interrupts Exceptions and Traps. Visualizing an Interrupt

Exceptions and Interrupts

Assembly labs start this week. Don t forget to submit your code at the end of your lab section. Download MARS4_5.jar to your lab PC or laptop.

EECS150 - Digital Design Lecture 9 Project Introduction (I), Serial I/O. Announcements

Raspberry Pi ($40 on Amazon)

EECS150 - Digital Design Lecture 08 - Project Introduction Part 1

11/13/17. CS61C so far. Adding I/O C Programs. So How is a Laptop Any Different? It s a Real Computer! Raspberry Pi (Less than $40 on Amazon in 2017)

CS 61C: Great Ideas in Computer Architecture Lecture 22: Opera&ng Systems, Interrupts, and Virtual Memory Part 1

COMP I/O, interrupts, exceptions April 3, 2016

CS 61C: Great Ideas in Computer Architecture. Lecture 22: Operating Systems. Krste Asanović & Randy H. Katz

UCB CS61C : Machine Structures

Motivation for Input/Output

Computer Architecture CS 355 Busses & I/O System

Hardware OS & OS- Application interface

The Operating System (OS) MicroComputer Engineering OperatingSystem slide 1!

Example Networks on chip Freescale: MPC Telematics chip

Last 2 Classes: Introduction to Operating Systems & C++ tutorial. Today: OS and Computer Architecture

Virtual Machines & the OS Kernel

EE 109 Unit 17 - Exceptions

Computer Science 2500 Computer Organization Rensselaer Polytechnic Institute Spring Topic Notes: MIPS Programming

Chapter 6. Storage & Other I/O

CS 134. Operating Systems. April 8, 2013 Lecture 20. Input/Output. Instructor: Neil Rhodes. Monday, April 7, 14

Computer System Overview OPERATING SYSTEM TOP-LEVEL COMPONENTS. Simplified view: Operating Systems. Slide 1. Slide /S2. Slide 2.

Operating System: Chap13 I/O Systems. National Tsing-Hua University 2016, Fall Semester

SPIM Instruction Set

CS 537 Lecture 2 Computer Architecture and Operating Systems. OS Tasks

COMP2421 COMPUTER ORGANZATION. Lab 3

Computer System Overview

Module 6: INPUT - OUTPUT (I/O)

Anne Bracy CS 3410 Computer Science Cornell University

CS 61C: Great Ideas in Computer Architecture Virtual Memory. Instructors: John Wawrzynek & Vladimir Stojanovic

Chapter 5 - Input / Output

19: I/O. Mark Handley. Direct Memory Access (DMA)

Lecture 7: Examples, MARS, Arithmetic

by I.-C. Lin, Dept. CS, NCTU. Textbook: Operating System Concepts 8ed CHAPTER 13: I/O SYSTEMS

Discussion Week 8. TA: Kyle Dewey. Tuesday, November 15, 11

Module 11: I/O Systems

Instruction Set Architecture part 1 (Introduction) Mehran Rezaei

ECEN 449 Microprocessor System Design. Hardware-Software Communication. Texas A&M University

I/O - input/output. system components: CPU, memory, and bus -- now add I/O controllers and peripheral devices. CPU Cache

Last class: Today: Course administration OS definition, some history. Background on Computer Architecture

Chapter 5 Input/Output. I/O Devices

5.b Principles of I/O Hardware CPU-I/O communication

I/O Handling. ECE 650 Systems Programming & Engineering Duke University, Spring Based on Operating Systems Concepts, Silberschatz Chapter 13

The control of I/O devices is a major concern for OS designers

CSE 120. Overview. July 27, Day 8 Input/Output. Instructor: Neil Rhodes. Hardware. Hardware. Hardware

John Wawrzynek & Nick Weaver

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Architecture and OS. To do. q Architecture impact on OS q OS impact on architecture q Next time: OS components and structure

Lecture 11: Interrupt and Exception. James C. Hoe Department of ECE Carnegie Mellon University

EECS 373 Design of Microprocessor-Based Systems

UC Santa Barbara. Operating Systems. Christopher Kruegel Department of Computer Science UC Santa Barbara

Inf2C - Computer Systems Lecture 16 Exceptions and Processor Management

Chapter 8. A Typical collection of I/O devices. Interrupts. Processor. Cache. Memory I/O bus. I/O controller I/O I/O. Main memory.

Faculty of Science FINAL EXAMINATION

CS61C : Machine Structures

ELEC / Computer Architecture and Design Fall 2013 Instruction Set Architecture (Chapter 2)

EE 457 Unit 8. Exceptions What Happens When Things Go Wrong

[08] IO SUBSYSTEM 1. 1

Operating Systems CMPSCI 377 Spring Mark Corner University of Massachusetts Amherst

What are Exceptions? EE 457 Unit 8. Exception Processing. Exception Examples 1. Exceptions What Happens When Things Go Wrong

CMSC 313 COMPUTER ORGANIZATION & ASSEMBLY LANGUAGE PROGRAMMING LECTURE 09, SPRING 2013

MIPS R-format Instructions. Representing Instructions. Hexadecimal. R-format Example. MIPS I-format Example. MIPS I-format Instructions

OPERATING SYSTEMS CS136

UC Berkeley CS61C : Machine Structures

CSE 378 Final Exam 3/14/11 Sample Solution

We will study the MIPS assembly language as an exemplar of the concept.

CS61C : Machine Structures

Lecture 23. I/O, Interrupts, exceptions

QtSPIM and MARS : MIPS Simulators

Course Administration

UC Berkeley CS61C : Machine Structures

Virtual Machines & the OS Kernel

CSE 153 Design of Operating Systems

CS 104 Computer Organization and Design

Virtual Memory Input/Output. Admin

Input / Output. School of Computer Science G51CSA

CSE 153 Design of Operating Systems Fall 18

Input/Output Interfaces: Ch

Computer Architecture Instruction Set Architecture part 2. Mehran Rezaei

Review 1/2 Operating System started as shared I/O library. CS61C Anatomy of I/O Devices: Networks. Lecture 14

Architectural Support for Operating Systems

EE 109 Unit 15 Subroutines and Stacks

ECE468 Computer Organization and Architecture. OS s Responsibilities

Chapter 8 :: Topics. Chapter 8 :: Memory Systems. Introduction Memory System Performance Analysis Caches Virtual Memory Memory-Mapped I/O Summary

Transcription:

inst.eecs.berkeley.edu/~cs61c/su05 CS61C : Machine Structures Lecture #25: I/O 2005-08-03 Andy Carle CS61C L25 I/O (1)

Review Virtual memory to Physical Memory Translation too slow? Add a cache of Virtual to Physical Address Translations, called a TLB Spatial Locality means Working Set of Pages is all that must be in memory for process to run fairly well Virtual Memory allows protected sharing of memory between processes with less swapping to disk CS61C L25 I/O (2)

Recall : 5 components of any Computer Earlier Lectures Current Lectures Computer Processor (active) Control ( brain ) Datapath ( brawn ) Memory (passive) (where programs, data live when running) Devices Input Output Keyboard, Mouse Disk, Network Display, Printer CS61C L25 I/O (3)

Motivation for Input/Output I/O is how humans interact with computers I/O gives computers long-term memory. I/O lets computers do amazing things: Read pressure of synthetic hand and control synthetic arm and hand of fireman Control propellers, fins, communicate in BOB (Breathable Observable Bubble) Computer without I/O like a car without wheels; great technology, but won t get you anywhere CS61C L25 I/O (4)

I/O Device Examples and Speeds I/O Speed: bytes transferred per second (from mouse to Gigabit LAN: 10-million-to-1) Device Behavior Partner Data Rate (KBytes/s) Keyboard Input Human 0.01 Mouse Input Human 0.02 Voice output Output Human 5.00 Floppy disk Storage Machine 50.00 Laser Printer Output Human 100.00 Magnetic Disk Storage Machine 10,000.00 Wireless Network I or O Machine 10,000.00 Graphics Display Output Human 30,000.00 Wired LAN Network I or O Machine 125,000.00 When discussing transfer rates, use 10 x CS61C L25 I/O (5)

What do we need to make I/O work? A way to connect many types of devices to the Proc-Mem A way to control these devices, respond to them, and transfer data A way to present them to user programs so they are useful Files APIs Operating System Proc SCSI Bus PCI Bus Mem cmd reg. data reg. CS61C L25 I/O (6)

Instruction Set Architecture for I/O What must the processor do for I/O? Input: reads a sequence of bytes Output: writes a sequence of bytes Some processors have special input and output instructions Alternative model (used by MIPS): Use loads for input, stores for output Called Memory Mapped Input/Output A portion of the address space dedicated to communication paths to Input or Output devices (no memory there) CS61C L25 I/O (7)

Memory Mapped I/O Certain addresses are not regular memory Instead, they correspond to registers in I/O devices address 0xFFFFFFFF 0xFFFF0000 cntrl reg. data reg. 0 CS61C L25 I/O (8)

Processor-I/O Speed Mismatch 1GHz microprocessor can execute 1 billion load or store instructions per second, or 4,000,000 KB/s data rate I/O devices data rates range from 0.01 KB/s to 125,000 KB/s Input: device may not be ready to send data as fast as the processor loads it Also, might be waiting for human to act Output: device not be ready to accept data as fast as processor stores it What to do? CS61C L25 I/O (9)

Processor Checks Status before Acting Path to device generally has 2 registers: Control Register, says it s OK to read/write (I/O ready) [think of a flagman on a road] Data Register, contains data Processor reads from Control Register in loop, waiting for device to set Ready bit in Control reg (0 1) to say its OK Processor then loads from (input) or writes to (output) data register Load from or Store into Data Register resets Ready bit (1 0) of Control Register CS61C L25 I/O (10)

SPIM I/O Simulation SPIM simulates 1 I/O device: memorymapped terminal (keyboard + display) Read from keyboard (receiver); 2 device regs Writes to terminal (transmitter); 2 device regs Receiver Control 0xffff0000 Receiver Data 0xffff0004 Transmitter Control 0xffff0008 Transmitter Data 0xffff000c (IE) Unused (00...00) Unused Unused (00...00) Unused (00...00) Ready (I.E.) Received Byte Ready (I.E.) Transmitted Byte CS61C L25 I/O (11)

SPIM I/O Control register rightmost bit (0): Ready Receiver: Ready==1 means character in Data Register not yet been read; 1 0 when data is read from Data Reg Transmitter: Ready==1 means transmitter is ready to accept a new character; 0 Transmitter still busy writing last char - I.E. bit discussed later Data register rightmost byte has data Receiver: last char from keyboard; rest = 0 Transmitter: when write rightmost byte, writes char to display CS61C L25 I/O (12)

I/O Example Input: Read from keyboard into $v0 lui $t0, 0xffff #ffff0000 Waitloop: lw $t1, 0($t0) #control andi $t1,$t1,0x1 beq $t1,$zero, Waitloop lw $v0, 4($t0) #data Output: Write to display from $a0 lui $t0, 0xffff #ffff0000 Waitloop: lw $t1, 8($t0) #control andi $t1,$t1,0x1 beq $t1,$zero, Waitloop sw $a0, 12($t0) #data Processor waiting for I/O called Polling Ready bit from processor s point of view! CS61C L25 I/O (13)

Administrivia Project 3 Due Friday Project 4 Out Soon Final Exam will be Next Friday! CS61C L25 I/O (14)

Cost of Polling? Assume for a processor with a 1GHz clock it takes 400 clock cycles for a polling operation (call polling routine, accessing the device, and returning). Determine % of processor time for polling Mouse: polled 30 times/sec so as not to miss user movement Floppy disk: transfers data in 2-Byte units and has a data rate of 50 KB/second. No data transfer can be missed. Hard disk: transfers data in 16-Byte chunks and can transfer at 16 MB/second. Again, no transfer can be missed. CS61C L25 I/O (15)

% Processor time to poll [p. 677 in book] Mouse Polling, Clocks/sec = 30 [polls/s] * 400 [clocks/poll] = 12K [clocks/s] % Processor for polling: 12*10 3 [clocks/s] / 1*10 9 [clocks/s] = 0.0012% Polling mouse little impact on processor Frequency of Polling Floppy = 50 [KB/s] / 2 [B/poll] = 25K [polls/s] Floppy Polling, Clocks/sec = 25K [polls/s] * 400 [clocks/poll] = 10M [clocks/s] % Processor for polling: 10*10 6 [clocks/s] / 1*10 9 [clocks/s] = 1% OK if not too many I/O devices CS61C L25 I/O (16)

% Processor time to poll hard disk Frequency of Polling Disk = 16 [MB/s] / 16 [B] = 1M [polls/s] Disk Polling, Clocks/sec = 1M [polls/s] * 400 [clocks/poll] = 400M [clocks/s] % Processor for polling: 400*10 6 [clocks/s] / 1*10 9 [clocks/s] = 40% Unacceptable CS61C L25 I/O (17)

What is the alternative to polling? Wasteful to have processor spend most of its time spin-waiting for I/O to be ready Would like an unplanned procedure call that would be invoked only when I/O device is ready Solution: use exception mechanism to help I/O. Interrupt program when I/O ready, return when done with data transfer CS61C L25 I/O (18)

I/O Interrupt An I/O interrupt is like overflow exceptions except: An I/O interrupt is asynchronous More information needs to be conveyed An I/O interrupt is asynchronous with respect to instruction execution: I/O interrupt is not associated with any instruction, but it can happen in the middle of any given instruction I/O interrupt does not prevent any instruction from completion CS61C L25 I/O (19)

Definitions for Clarification Exception: signal marking that something out of the ordinary has happened and needs to be handled Interrupt: asynchronous exception Trap: synchronous exception Note: Many systems folks say interrupt to mean what we mean when we say exception. CS61C L25 I/O (20)

Interrupt Driven Data Transfer Memory (1) I/O interrupt (2) save PC (3) jump to interrupt service routine (5) (4) perform transfer add sub and or read store... jr user program interrupt service routine CS61C L25 I/O (21)

SPIM I/O Simulation: Interrupt Driven I/O I.E. stands for Interrupt Enable Set Interrupt Enable bit to 1 have interrupt occur whenever Ready bit is set Receiver Control 0xffff0000 Receiver Data 0xffff0004 Transmitter Control 0xffff0008 Transmitter Data 0xffff000c (IE) Unused (00...00) Unused Unused (00...00) Unused (00...00) Ready (I.E.) Received Byte Ready (I.E.) Transmitted Byte CS61C L25 I/O (22)

Benefit of Interrupt-Driven I/O Find the % of processor consumed if the hard disk is only active 5% of the time. Assuming 500 clock cycle overhead for each transfer, including interrupt: Disk Interrupts/s = 16 MB/s / 16B/interrupt = 1M interrupts/s Disk Interrupts, clocks/s = 1M interrupts/s * 500 clocks/interrupt = 500,000,000 clocks/s % Processor for during transfer: 500*10 6 / 1*10 9 = 50% Disk active 5% 5% * 50% 2.5% busy CS61C L25 I/O (23)

Generalizing Interrupts We can handle all sorts of exceptions with interrupts. Big idea: jump to handler that knows what to do with each interrupt, then jump back Our types: syscall, overflow, mmio ready. CS61C L25 I/O (24)

OS: I/O Requirements The OS must be able to prevent: The user program from communicating with the I/O device directly If user programs could perform I/O directly: No protection to the shared I/O resources 3 types of communication are required: The OS must be able to give commands to the I/O devices The I/O device notify OS when the I/O device has completed an operation or an error Data transfers between memory and I/O device CS61C L25 I/O (25)

Instruction Set Support for OS (1/2) How to turn off interrupts during interrupt routine? Bit in Status Register determines whether or not interrupts enabled: Interrupt Enable bit (IE) (0 off, 1 on) (described later) IE Status Register CS61C L25 I/O (26)

Instruction Set Support for OS (2/2) How to prevent user program from turning off interrupts (forever)? Bit in Status Register determines whether in user mode or OS (kernel) mode: Kernel/User bit (KU) (0 kernel, 1 user) Assume Unused KU IE Status Register On exception/interrupt disable interrupts (IE=0) and go into kernel mode (KU=0) CS61C L25 I/O (27)

Kernel/User Mode Generally restrict device access to OS HOW? Add a mode bit to the machine: K/U Only allow SW in kernel mode to access device registers If user programs could access device directly? could destroy each others data,... might break the devices, CS61C L25 I/O (28)

Crossing the System Boundary System loads user program into memory and gives it use of the processor Switch back User SYSCALL - request service System Proc Mem - I/O I/O Bus TRAP (overflow) Interrupt cmd reg. data reg. CS61C L25 I/O (29)

Syscall How does user invoke the OS? syscall instruction: invoke the kernel (Go to 0x80000080, change to kernel mode) By software convention, $v0 has system service requested: OS performs request CS61C L25 I/O (30)

SPIM OS Services via Syscall Service Code Args Result (put in $v0) print_int 1 $a0 = integer print_float 2 $f12 = float print_double 3 $f12 = double print_string 4 $a0 = string read_int 5 integer (in $v0) read_float 6 float (in $f0) read_double 7 double (in $f0) read_string 8 $a0 = buffer, $a1 = length sbrk 9 $a0 = amount address(in $v0) exit 10 Note: most OS services deal with I/O CS61C L25 I/O (31)

Example: User invokes OS (SPIM) Print the answer = 42 First print the answer = :.data str:.asciiz "the answer = ".text li $v0,4 # 4=code for print_str la $a0,str # address of string syscall # print the string Now print 42 li $v0,1 # 1=code for print_int li $a0,42 # integer to print syscall # print int CS61C L25 I/O (32)

Handling a Single Interrupt (1/3) An interrupt has occurred, then what? Automatically, the hardware copies PC into EPC ($14 on cop0) and puts correct code into Cause Reg ($13 on cop0) Automatically, PC is set to 0x80000080, process enters kernel mode, and interrupt handler code begins execution Interrupt Handler code: Checks Cause Register (bits 5 to 2 of $13 in cop0) and jumps to portion of interrupt handler which handles the current exception CS61C L25 I/O (33)

Handling a Single Interrupt (2/3) Sample Interrupt Handler Code.text 0x80000080 mfc0 $k0,$13 # $13 is Cause Reg sll $k0,$k0,26 # isolate srl $k0,$k0,28 # Cause bits Notes: Don t need to save $k0 or $k1 CS61C L25 I/O (34) - MIPS software convention to provide temp registers for operating system routines - Application software cannot use them Can only work on CPU, not on cop0

Handling a Single Interrupt (3/3) When the interrupt is handled, copy the value from EPC to the PC. Call instruction rfe (return from exception), which will return process to user mode and reset state to the way it was before the interrupt What about multiple interrupts? CS61C L25 I/O (35)

Multiple Interrupts Problem: What if we re handling an Overflow interrupt and an I/O interrupt (printer ready, for example) comes in? Options: drop any conflicting interrupts: unrealistic, they may be important simultaneously handle multiple interrupts: unrealistic, may not be able to synchronize them (such as with multiple I/O interrupts) queue them for later handling: sounds good CS61C L25 I/O (36)

Prioritized Interrupts (1/3) Question: Suppose we re dealing with a computer running a nuclear facility. What if we re handling an Overflow interrupt and a Nuclear Meltdown Imminent interrupt comes in? Answer: We need to categorize and prioritize interrupts so we can handle them in order of urgency: emergency vs. luxury. CS61C L25 I/O (37)

Prioritized Interrupts (2/3) OS convention to simplify software: Process cannot be preempted by interrupt at same or lower "level" Return to interrupted code as soon as no more interrupts at a higher level When an interrupt is handled, take the highest priority interrupt on the queue - may be partially handled, may not, so we may need to save state of interrupts(!) CS61C L25 I/O (38)

Prioritized Interrupts (3/3) To implement, we need an Exception Stack: portion of address space allocated for stack of Exception Frames each frame represents one interrupt: contains priority level as well as enough info to restart handling it if necessary CS61C L25 I/O (39)

Modified Interrupt Handler (1/3) Problem: When an interrupt comes in, EPC and Cause get overwritten immediately by hardware. Lost EPC means loss of user program. Solution: Modify interrupt handler. When first interrupt comes in: disable interrupts (in Status Register) save EPC, Cause, Status and Priority Level on Exception Stack re-enable interrupts continue handling current interrupt CS61C L25 I/O (40)

Modified Interrupt Handler (2/3) When next (or any later) interrupt comes in: interrupt the first one disable interrupts (in Status Register) save EPC, Cause, Status and Priority Level (and maybe more) on Exception Stack determine whether new one preempts old one - if no, re-enable interrupts and continue with old one - if yes, may have to save state for the old one, then re-enable interrupts, then handle new one CS61C L25 I/O (41)

Modified Interrupt Handler (3/3) Notes: Disabling interrupts is dangerous So we disable them for as short a time as possible: long enough to save vital info onto Exception Stack This new scheme allows us to handle many interrupts effectively. CS61C L25 I/O (42)

Interrupt Levels in MIPS? What are they? It depends what the MIPS chip is inside of: differ by app Casio PalmPC, Sony Playstation, HP LaserJet printer MIPS architecture enables priorities for different I/O events CS61C L25 I/O (43)

Interrupt Levels in MIPS Architecture Conventionally, from highest level to lowest level exception/interrupt levels: Bus error Illegal Instruction/Address trap High priority I/O Interrupt (fast response) Low priority I/O Interrupt (slow response) Others CS61C L25 I/O (44)

Improving Data Transfer Performance Thus far: OS give commands to I/O, I/O device notify OS when the I/O device completed operation or an error What about data transfer to I/O device? Processor busy doing loads/stores between memory and I/O Data Register Ideal: specify the block of memory to be transferred, be notified on completion? Direct Memory Access (DMA) : a simple computer transfers a block of data to/from memory and I/O, interrupting upon done CS61C L25 I/O (45)

Example: code in DMA controller DMA code from Disk Device to Memory.data Count:.word 4096 Start:.space 4096.text Initial: lw $s0, Count # No. chars la $s1, Start # @next char Wait: lw $s2, DiskControl andi $s2,$s2,1 # select Ready beq $s2,$0,wait # spinwait lb $t0, DiskData # get byte sb $t0, 0($s1) # transfer addiu $s0,$s0,-1 # Count-- addiu $s1,$s1,1 # Start++ bne $s0,$0,wait # next char DMA computer in parallel with CPU CS61C L25 I/O (46)

Details not covered MIPS has a field to record all pending interrupts so that none are lost while interrupts are off; in Cause register The Interrupt Priority Level that the CPU is running at is set in memory MIPS has a field in that can mask interrupts of different priorities to implement priority levels; in Status register MIPS has limited nesting of saving KU,IE bits to recall in case higher priority interrupts; in Status Register CS61C L25 I/O (47)

Implementation? Take 150 & 152 CS61C L25 I/O (48)

Peer Instruction A. A faster CPU will result in faster I/O. B. Hardware designers handle mouse input with interrupts since it is better than polling in almost all cases. C. Low-level I/O is actually quite simple, as it s really only reading and writing bytes. CS61C L25 I/O (49)

And in conclusion I/O gives computers their 5 senses I/O speed range is 100-million to one Processor speed means must synchronize with I/O devices before use Polling works, but expensive processor repeatedly queries devices Interrupts works, more complex devices causes an exception, causing OS to run and deal with the device I/O control leads to Operating Systems CS61C L25 I/O (50)