COMP 321: Introduction to Computer Systems

Similar documents
CSCI Computer Systems Fundamentals Shell Lab: Writing Your Own Unix Shell

CS 213, Fall 2002 Lab Assignment L5: Writing Your Own Unix Shell Assigned: Oct. 24, Due: Thu., Oct. 31, 11:59PM

CS 224 Spring 2015 Lab Assignment Shell Lab: Writing Your Own Unix Shell

CS 213, Fall 2001 Lab Assignment L5: Writing Your Own Unix Shell Assigned: Oct. 25, Due: Fri., Nov. 2, 11:59PM

CS 4410, Fall 2017 Project 1: My First Shell Assigned: August 27, 2017 Due: Monday, September 11:59PM

CSC374, Spring 2010 Lab Assignment 1: Writing Your Own Unix Shell

COMP 321: Introduction to Computer Systems

COMP 321: Introduction to Computer Systems

COMP 321: Introduction to Computer Systems

HW 1: Shell. Contents CS 162. Due: September 18, Getting started 2. 2 Add support for cd and pwd 2. 3 Program execution 2. 4 Path resolution 3

CSCI0330 Intro Computer Systems Doeppner. Project Shell 2. Due: November 8, 2017 at 11:59pm. 1 Introduction 2

Computer Science 330 Operating Systems Siena College Spring Lab 5: Unix Systems Programming Due: 4:00 PM, Wednesday, February 29, 2012

Processes, Signals, I/O, Shell Lab : Introduc>on to Computer Systems Recita>on 9: Monday, Oct. 21, 2013 Marjorie Carlson Sec>on A

Multitasking. Programmer s model of multitasking. fork() spawns new process. exit() terminates own process

Programming Assignment #1: A Simple Shell

COMP 321: Introduction to Computer Systems

Signals. Jin-Soo Kim Computer Systems Laboratory Sungkyunkwan University

Programming Assignments will be.. All the PAs are continuous 3 major factors that you should consider

Project 1: Implementing a Shell

Exceptions, Processes and Signals

CS 470 Operating Systems Spring 2013 Shell Project

Each terminal window has a process group associated with it this defines the current foreground process group. Keyboard-generated signals are sent to

CPSC 341 OS & Networks. Processes. Dr. Yingwu Zhu

Processes & Signals. System Runs Many Processes Concurrently. State consists of memory image + register values + program counter

CSci 4061 Introduction to Operating Systems. Processes in C/Unix

Project 2: Shell with History1

Lecture 24: Multitasking and Signals

Reading Assignment 4. n Chapter 4 Threads, due 2/7. 1/31/13 CSE325 - Processes 1

Lab 2: Implementing a Shell COMPSCI 310: Introduction to Operating Systems

Shell and Signals. Computer Organization 3/17/2015. CSC252 - Spring The World of Multiprogramming or Multitasking. Unix Process Hierarchy

CS24: INTRODUCTION TO COMPUTING SYSTEMS. Spring 2018 Lecture 20

UNIVERSITY OF NEBRASKA AT OMAHA Computer Science 4500/8506 Operating Systems Summer 2016 Programming Assignment 1 Introduction The purpose of this

Lab 4. Out: Friday, February 25th, 2005

The Shell, System Calls, Processes, and Basic Inter-Process Communication

PROCESS CONTROL BLOCK TWO-STATE MODEL (CONT D)

CSE 410: Computer Systems Spring Processes. John Zahorjan Allen Center 534

Recitation Processes, Signals,UNIX error handling

Processes & Threads. (Chapter 3) CS 4410 Operating Systems. [R. Agarwal, L. Alvisi, A. Bracy, M. George, E. Sirer, R. Van Renesse]

Basic OS Programming Abstractions (and Lab 1 Overview)

CSCE Operating Systems Interrupts, Exceptions, and Signals. Qiang Zeng, Ph.D. Fall 2018

Getting to know you. Anatomy of a Process. Processes. Of Programs and Processes

SE350: Operating Systems

(In columns, of course.)

Recitation 8 Processes, Signals, Tshlab

Process management. What s in a process? What is a process? The OS s process namespace. A process s address space (idealized)

CSE 451: Operating Systems Winter Module 4 Processes. Mark Zbikowski Allen Center 476

fork System-Level Function

API Interlude: Process Creation (DRAFT)

UNIVERSITY OF NEBRASKA AT OMAHA Computer Science 4500/8506 Operating Systems Fall Programming Assignment 1 (updated 9/16/2017)

Project #1: Tracing, System Calls, and Processes

What is a Process? Processes and Process Management Details for running a program

Operating System Structure

Process Management! Goals of this Lecture!

Most of the work is done in the context of the process rather than handled separately by the kernel

OS lpr. www. nfsd gcc emacs ls 1/27/09. Process Management. CS 537 Lecture 3: Processes. Example OS in operation. Why Processes? Simplicity + Speed

Exceptional Control Flow Exists at All Levels of a System. Systemprogrammering 2007 Föreläsning 3 Exceptional Control Flow Part II

CS 201. Processes. Gerson Robboy Portland State University

OS Structure, Processes & Process Management. Don Porter Portions courtesy Emmett Witchel

Processes. CS439: Principles of Computer Systems January 24, 2018

Section 1: Tools. Kaifei Chen, Luca Zuccarini. January 23, Make Motivation How... 2

bash, part 3 Chris GauthierDickey

PROCESS CONTROL: PROCESS CREATION: UNIT-VI PROCESS CONTROL III-II R

SOFTWARE ARCHITECTURE 3. SHELL

Programming Assignments

Advanced Unix Concepts. Satyajit Rai

Windows architecture. user. mode. Env. subsystems. Executive. Device drivers Kernel. kernel. mode HAL. Hardware. Process B. Process C.

Processes. Operating System CS 217. Supports virtual machines. Provides services: User Process. User Process. OS Kernel. Hardware

ECF Exists at All Levels of a System Exceptional Control Flow Part II Oct. 22, 2002 Topics! Process Hierarchy! Shells! Signals!

Introduction to OS Processes in Unix, Linux, and Windows MOS 2.1 Mahmoud El-Gayyar

Assignment 1. Teaching Assistant: Michalis Pachilakis (

Project 3 Students Choice of Native Apps. each milestone s deadline is noon see cs164.net/expectations for each milestone s expectations

Using the Zoo Workstations

Operating Systems and Networks Assignment 2

OS lpr. www. nfsd gcc emacs ls 9/18/11. Process Management. CS 537 Lecture 4: Processes. The Process. Why Processes? Simplicity + Speed

The Software Stack: From Assembly Language to Machine Code

Linux Operating System

Operating Systemss and Multicore Programming (1DT089)

Midterm Exam CPS 210: Operating Systems Spring 2013

Exceptional Control Flow Part II


Processes. Processes (cont d)

Processes: Introduction. CS 241 February 13, 2012

Giving credit where credit is due

CS213. Exceptional Control Flow Part II. Topics Process Hierarchy Signals

Process Management 1

Mon Sep 17, 2007 Lecture 3: Process Management

Getting started with UNIX/Linux for G51PRG and G51CSA

Unix Processes. What is a Process?

Creating a Shell or Command Interperter Program CSCI411 Lab

Operating Systems and Networks Assignment 9

Problem Set 1 Due: 11:59pm Wednesday, February 7

Signals and Inter-Process Communica.on

CSE 506: Opera.ng Systems Signals and Inter-Process Communica.on

Princeton University Computer Science 217: Introduction to Programming Systems. Process Management

CS Fundamentals of Programming II Fall Very Basic UNIX

CS 356, Fall 2018 Data Lab (Part 1): Manipulating Bits Due: Wednesday, Sep. 5, 11:59PM

Killing Zombies, Working, Sleeping, and Spawning Children

CSC 252: Computer Organization Spring 2018: Lecture 19

CS153: Process 2. Chengyu Song. Slides modified from Harsha Madhyvasta, Nael Abu-Ghazaleh, and Zhiyun Qian

CS Operating Systems Lab 3: UNIX Processes

Transcription:

Assigned: 2/15/18, Due: 3/8/18, 11:55 PM Important: This project may be done individually or in pairs. Be sure to carefully read the course policies for assignments (including the honor code policy) on the assignments page of the course web site: http://www.clear.rice.edu/comp321/html/assignments.html Overview You will write a simple Unix shell program that supports job control. The goals of this project are as follows: Work with a more realistic system application. Become familiar with the concepts of process control. Become familiar with the concepts of signalling. General Overview of Unix Shells A shell is an interactive command-line interpreter that runs programs on behalf of the user. A shell repeatedly prints a prompt, waits for a command line onstdin, and then carries out some action, as directed by the contents of the command line. The command line is a sequence of ASCII text words delimited by whitespace. The first word in the command line is either the name of a built-in command or the name of an executable file. The remaining words are command-line arguments. If the first word is a built-in command, the shell immediately executes the command in the current process. Otherwise, the shell assumes that the first word is the name of a program to execute. In this case, the shell forks a child process, then attempts to load and run the program in the context of the child. The child processes created as a result of interpreting a single command line are known collectively as a job. In general, a job can consist of multiple child processes connected by Unix pipes. If the first word in the command line starts with a directory, such as., /, or the name of a subdirectory in the current directory, the word is assumed to be the path name of an executable program, for example,./tsh, /usr/bin/ls, or comp321/count/count. Otherwise, the first word is assumed to be the name of an executable that is contained in one of the directories in the shell s search path, which is an ordered list of directories that the shell searches for executables. If the command line ends with an ampersand ( & ), then the job runs in the background, which means that the shell does not wait for the job to terminate before printing the prompt and awaiting the next command line. Otherwise, the job runs in the foreground, which means that the shell waits for the job to terminate before awaiting the next command line. Thus, at any point in time, at most one job can be running in the foreground. However, an arbitrary number of jobs can run in the background. For example, typing the command line 1

tsh> jobs causes the shell to execute the built-injobs command. In contrast, typing the command line tsh> /bin/ls -l -d runs the ls program in the foreground. By convention, the shell ensures that when the program begins executing its main routine int main(int argc, char *argv[]) the argc and argv arguments have the following values: argc == 3, argv[0] == "/bin/ls", argv[1] == "-l", argv[2] == "-d". Alternatively, typing the command line tsh> /bin/ls -l -d & runs thels program in the background. Unix shells support the notion of job control, which allows users to move jobs back and forth between the background and foreground, and to change the process state (running, stopped, or terminated) of the processes in a job. Typing ctrl-c causes a SIGINT signal to be delivered to each process in the foreground job. The default action for SIGINT is to terminate the process. Similarly, typing ctrl-z causes a SIGTSTP signal to be delivered to each process in the foreground job. The default action for SIGTSTP is to place a process in the stopped state, where it remains until it is awakened by the receipt of a SIGCONT signal. Unix shells also provide various built-in commands that support job control. For example: jobs: List the running and stopped background jobs. bg <job>: Change a stopped job to a running background job. fg <job>: Change a stopped or running background job to a running job in the foreground. The Tiny Shell (tsh) Specification Yourtsh shell should have the following features: The prompt should be the string tsh>. The search path should be the string contained by the PATH environmental variable. The search path is a string that contains a list of directories, where each directory is separated by the : character. If the search path contains an empty string, this should be interpreted as the current directory being located in the search path. The search path can contain an empty string if the entire path is an empty string, if there are two : characters in a row, or if the path starts or ends with the : character. The search path should be loaded when the program is initialized. 2

The command line typed by the user should consist of aname and zero or more arguments, all separated by one or more spaces. If name is a built-in command, then tsh should handle it immediately and wait for the next command line. Otherwise, tsh should assume that name is the name of an executable file, which it loads and runs in the context of an initial child process. This initial child process is called a job. Ifname does not start with a directory and the search path is notnull, thentsh should assume that name is the name of an executable file contained in one of the directories in the search path. In this case, tsh should search the directories in the search path, in order, for an executable named name. The first executable named name found in the search path that tsh can execute is the program that tsh loads and runs. Ifname starts with a directory or the search path isnull,tsh should assume thatname is the path name of an executable file. tsh must use theexecve function. In other words,tsh may not use any other member of theexec family, such as execvp. tsh need not support pipes ( ) or I/O redirection (< and>). Typing ctrl-c (ctrl-z) should cause a SIGINT (SIGTSTP) signal to be sent to the current foreground job, as well as any descendents of that job (e.g., any child processes that it forked). If there is no foreground job, then the signal should have no effect. If the command line ends with an ampersand (&), then tsh should run the job in the background. Otherwise, it should run the job in the foreground. Each job can be identified by either a process ID (PID) or a job ID (JID), which is a positive integer assigned bytsh. JIDs should be denoted on the command line by the prefix %. For example, %5 denotes JID 5, and 5 denotes PID 5. (We have provided you with all of the routines you need for manipulating the job list.) tsh should support the following built-in commands: The quit command terminates the shell. Thejobs command lists all background jobs. Thebg <job> command restarts<job> by sending it a SIGCONT signal, and then runs it in the background. The<job> argument can be either a PID or a JID. Thefg <job> command restarts<job> by sending it a SIGCONT signal, and then runs it in the foreground. The<job> argument can be either a PID or a JID. tsh should reap all of its zombie children. If any job terminates because it receives a signal that it didn t catch, then tsh should recognize this event and print a message with the job s PID and a description of the offending signal of the form Job [1] (11639) terminated by signal SIGINT Where the job ID is 1 and the process id is 11639 and the job terminated because of a SIGINT signal. The provided signame array is useful for converting signal numbers into names. Specifically, use the signal number as the index for selecting an element of the array. The element will be the address of a string containing the symbol for that signal, without the SIG prefix. 3

Getting Started If you choose to work in a pair, only one of you should follow the below instructions for creating a private repository. In a later section, we will explain how to share this repository with the partner who didn t create it. Please visit the web page at https://classroom.github.com/a/pxxhmbke This page should be titled RICE-COMP321-S18. Moreover, it should say, Accept the Shell assignment, and have a button labeled Accept the invitation 1. Please accept the invitation. Upon accepting the invitation, you will be redirected to another web page. This page confirms that you have accepted the assignment and provides a link to your personal repository for the assignment. Click the link to go to your personal repository. The web page for your personal repository has a button labeled Clone or download. Click this button. You should now see text field with a URL. Copy or remember this URL. Login to theclear system if you have not already done so. Type the following: git clone [Copy the URL for your repo here] You will be prompted for yourgithub username and password. Once the clone operation is complete, you will have a directory named shell-[your github ID] Pleasecd into this directory, and do the following: Type the commandmake to compile and link a skeleton shell and some simple programs you can run from within your shell to test it. Type your name(s) and NetID(s) in the header comment at the top oftsh.c. Looking at the tsh.c (tiny shell) file, you will see that it contains a functional skeleton of a simple Unix shell. To help you get started, we have already implemented the less interesting functions. Your assignment is to complete the remaining empty functions listed below. As a sanity check for you, we ve listed the approximate number of lines of code for each of these functions in our reference solution (which includes lots of comments). main: Performs the read-evaluate/execute loop. Most of this function s implementation is provided. The only changes that you should need to make are to thesigaction calls. [6-12 lines] initpath: Performs any necessary initialization of the search path. The complexity of this function affects the complexity of the eval function. Specifically, a more complicated initpath function may allow for a simplereval function. [1 20 lines] eval: Main routine that parses and interprets the command line. [90 110 lines] builtin cmd: Recognizes and interprets the built-in commands: quit, fg, bg, and jobs. [25 lines] 1 You may have to login togithub to see the invitation. If so, you will be prompted for your username and password at the top of the page. 4

do bgfg: Implements the bg and fg built-in commands. [50 lines] waitfg: Waits for a foreground job to complete. [20 lines] sigchld handler: Catches SIGCHILD signals. [80 lines] sigint handler: Catches SIGINT (ctrl-c) signals. [15 lines] sigtstp handler: Catches SIGTSTP (ctrl-z) signals. [15 lines] Each time you modify yourtsh.c file, typemake to recompile it. To run your shell, typetsh to the command line: UNIX%./tsh tsh> [type commands to your shell here] Checking Your Work We have provided some tools to help you check your work. You may assume that MAXLINE, MAXARGS, MAXJOBS, and MAXJID are the maximum number of characters in any command line, the maximum number of arguments on any command line, the maximum number of jobs at any time, and the maximum job ID, respectively. You do not need to check to ensure that these maximums will be met, rather you may assume that no one will type any input into the shell that would cause them to be exceeded. Reference solution. The executable tshref is the reference solution for the shell. Run this program to resolve any questions you have about how your shell should behave. Your shell should emit output that is identical to the reference solution (except for PIDs, of course, which change from run to run). Shell driver. The sdriver.pl program executes a shell as a child process, sends it commands and signals as directed by a trace file, and captures and displays the output from the shell. Use the -h argument to find out the usage ofsdriver.pl: UNIX%./sdriver.pl -h Usage: sdriver.pl [-hv] -t <trace> -s <shellprog> -a <args> Options: -h Print this message -v Be more verbose -t <trace> Trace file -s <shell> Shell program to test -a <args> Shell arguments -g Generate output for autograder We have also provided sample trace files (trace*.txt) that you can use in conjunction with the shell driver to test the correctness of your shell. The lower-numbered trace files do very simple tests, and the higher-numbered tests do more complicated tests. You can run the shell driver on your shell using trace filetrace01.txt (for instance) by typing: UNIX%./sdriver.pl -t trace01.txt -s./tsh -a "-p" (the-a "-p" argument tells your shell not to emit a prompt), or UNIX% make test01 Similarly, to compare your result with the reference shell, you can run the trace driver on the reference shell by typing: 5

UNIX%./sdriver.pl -t trace01.txt -s./tshref -a "-p" or UNIX% make rtest01 For your reference, tshref.out gives the output of the reference solution for the provided traces. This might be more convenient for you than manually running the shell driver on all trace files. The neat thing about the trace files is that they generate the same output you would have gotten had you run your shell interactively (except for an initial comment that identifies the trace). For example: UNIX% make test07./sdriver.pl -t trace07.txt -s./tsh -a "-p" # # trace07.txt - Forward SIGINT only to foreground job. # tsh>./myspin 4 & [1] (27778)./myspin 4 & tsh>./myspin 5 Job [2] (27780) terminated by signal SIGINT tsh> jobs [1] (27778) Running./myspin 4 & UNIX% Notes Read every word of Chapter 8 (Exceptional Control Flow) in your textbook. Significant changes were made to this chapter in the third edition of the textbook. In particular, the material in Section 8.5.5 on safe signal handling did not exist in the previous edition. Read every word of this project handout. Use the trace files to guide the development of your shell. Starting with trace01.txt, make sure that your shell produces the identical output as the reference shell. Then move on to trace file trace02.txt, and so on. The functions waitpid, kill, fork, execve, setpgid, and sigprocmask will be the fundamental building blocks of your shell. Make sure that you understand what these functions do and how to use them. The WUNTRACED and WNOHANG options towaitpid will also be useful. When you implement your signal handlers, be sure to sendsigint andsigtstp signals to the entire foreground process group, using -pid instead of pid in the argument to thekill function. Thesdriver.pl program tests for this error. One of the tricky parts of the assignment is deciding on the allocation of work between thewaitfg and sigchld handler functions. We recommend the following approach: Inwaitfg, use a loop around thesleep function that tests whether or not the specified process is still running in the foreground. Insigchld handler, usewaitpid to reap zombie children. While other solutions are possible, such as calling waitpid in both waitfg and sigchld handler, these can be very confusing. It is simpler to do all reaping in the handler. 6

When you implement your signal handlers, be sure to follow the rules described in Section 8.5.5. In particular, per rule G1, you must not call functions such asprintf orfprintf to print messages. The textbook is too quick to dismiss the direct use of sigaction by applications, like your shell, using the weak argument that having to initialize and pass a structure to sigaction makes it unwieldy to use. (See page 775.) However, that structure ssa mask field provides for a much simpler approach to addressing the problem discussed in Section 8.5.5 s rule G3. Rather than explicitly calling sigprocmask within a handler, as suggested by rule G3, you should usesigaction ssa mask to automatically block and unblock any signals that should not be allowed to preempt the signal handler. In eval, the parent must use sigprocmask to block SIGCHLD signals before it forks the child, and then unblock these signals, again using sigprocmask after it adds the child to the job list by calling addjob. Since children inherit the blocked vectors of their parents, the child must be sure to then unblocksigchld signals before it execs the new program. The parent needs to block thesigchld signals in this way in order to avoid the race condition where the child is reaped by sigchld handler (and thus removed from the job list) before the parent calls addjob. Programs such as more, less, vi, and emacs do strange things with the terminal settings. Don t run these programs from your shell. Stick with simple text-based programs such as /bin/ls, /bin/ps,/bin/echo, and the included test programs (such asmyspin). When you run your shell from the standard Unix shell, your shell is running in the foreground process group. If your shell then creates a child process, by default that child will also be a member of the foreground process group. Since typing ctrl-c sends a SIGINT to every process in the foreground group, typing ctrl-c will send a SIGINT to your shell, as well as to every process that your shell created, which obviously isn t correct. Here is the workaround: After thefork, but before the call toexecve, the child process should call setpgid(0, 0), which puts the child in a new process group whose group ID is identical to the child s PID. This ensures that there will be only one process, your shell, in the foreground process group. When you type ctrl-c, the shell should catch the resulting SIGINT and then forward it to the appropriate foreground job (or more precisely, the process group that contains the foreground job). As described, your shell will not be able to handle programs that read from the standard input, and you are not required to do so. Note that this means you will not be able to run your shell from within your shell. Handling programs that read from the standard input requires your shell to give control of the terminal to its foreground child, which is beyond the scope of this project. We suggest you stick to the simple text based programs mentioned above and to the provided test programs (such as myspin). A string is guaranteed to start with a directory if the string contains a /. Turning in Your Assignment If you choose to work in a pair, your code and writeup must include both partners names and NetIDs. To turn in your code and writeup, you must usegit push to copy your work to thegithub remote repository. We will only look at the last version that you pushed before the deadline, taking into account your use of slip days. If you are working with a partner, it doesn t matter which partner performs the 7

final push. As a precaution against accidental loss of your code or writeup, we encourage you to push periodically. Please note, the only files that you need to turn in aretsh.c and writeup.txt. In other words, these are the only two files on which you should ever performgit add. For grading your submission, we will use the Makefile that was originally provided to compile your code. Therefore, your code should not rely on any modifications to themakefile for correct compilation. As a sanity check, you should use your web browser to visit your assignment repo. Make sure that what you see in the browser is consistent what you think you have pushed. Evaluation The project will be graded as follows: Writeup: 15% Style: 15% Correctness: 70% Your solution will be tested for correctness on a CLEAR machine, using the same shell driver and trace files that were provided (along with additional traces). Your shell should produce identical output on these traces, with only two exceptions: The PIDs can (and will) be different. The output of the /bin/ps commands will be different from run to run. However, the running states of any processes run from within your shell in the output of the/bin/ps command should be identical. For each test, you will only receive credit for that test if the output of your shell is identical to the output of the reference shell (other than the exceptions listed above). How to add your partner as a collaborator on your repository To work with a partner, the assignment acceptor from the section Getting Started must add the partner as a collaborator by following these steps: 1. Navigate your browser to the assignment repository page, using the URL you received when you initially accepted the assignment. 2. Near the top of the github repository page, there is a tab labeled Settings. There is a small gear icon directly to the left of the Settings title. Click on the Settings tab. 3. Once on the Settings page, you should find a group of operations labeled Options ( Options is in boldface.) The first such option is labeled Collaborators and teams. Click on the Collaborators and teams operation. 4. The Options pane will still be visible on the left. On the right of the Options pane, you will find two panes stacked vertically. The top right pane is labeled Teams. The lower right pane is labeled Collaborators. At the bottom of the Collaborators pane, there is a text box paired with a button labeled Add collaborator. The Add collaborator button will be slightly greyed out. 8

5. Click inside the Collaborators text box. You should see a text cursor appear inside the text box. 6. Enter your partner s github ID in the text box. Presumably, github will recognize the ID. When recognized, the github ID and user s actual name appear just below the text box. At this point, you should click the Add collaborator button. 7. You will be informed that you are Awaiting a response from your collaborator. 8. At this point, your partner should check email for the invitation. When your partner receives the invitation, please click the provided acceptance link. [The acceptance step requires that the partner be logged into github]. 9. A short time later, the original repo acceptor will see confirmation that the invitation has been accepted. Congratulations! You now have a shared repository for your assignment. Both partners will be able to clone, push, and pull from the assignment repository. (The last of which, i.e., pull, we will explain in lab.) 9