CMPS 12M
Introduction to Data Structures Lab
Lab Assignment 3

The purpose of this lab assignment is to introduce the C programming language, including standard input-output functions, command line arguments, file IO, and compilation with Makefiles.

Introduction to C

If you are not already familiar with C (or even if you are) it is recommended that you purchase a good C reference such as C for Java Programmers: a Primer by Charlie McDowell (Lulu.com 2007). The C programming language is, in a certain sense, the grandparent of Java (C++ being its parent). Java is known as an Object Oriented Programming (OOP) language, which means that data structures and the procedures which operate on them are grouped together into one language construct, namely the class. Common behavior amongst classes is specified explicitly through the mechanism of inheritance. The C programming language on the other hand does not directly support OOP, although an object oriented style can be implemented with some effort. C is known as a procedural programming language, which means that data structures and functions (i.e. procedures) are separate language constructs. There are no classes, no objects, and no inheritance. New data types in C are created using the typedef and struct constructs, which will be illustrated in future lab assignments.

There is however much common syntax between Java and C. Many control structures such as loops (while, do-while, for) and branching (if, if-else, switch) are virtually identical in the two languages. One major difference is in the way program input and output is handled, both to and from standard IO devices (keyboard and terminal window), and to and from files. The following is an example of a "Hello World!" program in C.

Example

    /*
    * hello.c
    * Prints "Hello World!" to stdout
    */
    #include <stdio.h>

    int main(){
       printf("Hello World!\n");
       return 0;
    }

Comments in C are specified by bracketing them between the strings /* and */, and may span several lines. For instance

    /* comment */

or

    /* comment
       comment */

or

    /*
    * comment
    * comment
    */

are all acceptable. With the right compiler flags, Java/C++ style comments are also acceptable.

    // comment
    // comment

You may use any style you like, but throughout this document we will use the older C style comments.

Any line beginning with # is known as a preprocessor directive. The preprocessor performs the first phase of compilation, in which these directives (which are literal text substitutions) are carried out, making the program ready for the later stages of compilation. The line #include <stdio.h> inserts the standard library header file stdio.h, which declares functions for performing standard input-output operations. Notice that preprocessor directives in C do not end in a semicolon. One can also specify constant macros using the #define preprocessor directive as follows.

    /*
    * hello.c
    * Prints "Hello World!" to stdout
    */
    #include <stdio.h>
    #define HELLO_STRING "Hello World!\n"

    int main(){
       printf(HELLO_STRING);
       return 0;
    }

Every occurrence of the constant macro HELLO_STRING is replaced by the string literal "Hello World!\n". Although you can name a C program anything you want, each C program must contain exactly one function called main(). As in Java, function main() is the entry point for program execution. Typically main() will call other functions, which may in turn call other functions. Function definitions in C look very much like they do in Java.

    returntype functionname(datatype variablename, datatype variablename, ...){
       declarations of local variables
       executable statements
       return statement (provided return-type is not void)
    }

Recall that Java allows variable declarations to be interspersed among executable statements. Whether this is allowable in C depends on which version of the language you are using. In particular the original ANSI standard (C89) requires that local variables be declared at the beginning of a block, before any other statements, which in practice usually means at the top of the function body. The version of the C language we will use (C99) is not so strict on this point, but I recommend that students adhere to the earlier standard (even in Java programs), since it helps organize one's thinking about program variables.
The function printf() prints formatted text to stdout. Its first argument is known as a format string and consists of two types of items. The first type is made up of characters that will be printed to the screen. The second type contains format commands that define the way the remaining arguments are displayed. A format command begins with a percent sign and is followed by the format code. There must be exactly the same number of format commands as there are remaining arguments (not counting %%, which takes no argument), and the format commands are matched with the remaining arguments in order. For example

    printf("There are %d days in %s\n", 30, "April");

prints the text "There are 30 days in April". Some common format commands are:

    %c    character
    %d    signed decimal integer
    %f    decimal floating point number
    %s    string (same as char array)
    %e    scientific notation
    %%    prints a percent sign

See a good reference on C for other format commands. (Note Java contains a method System.out.printf that behaves much like printf in C. There is a lengthy discussion of this function in the documentation for java.io.PrintStream.)

Observe that the above main() function has return type int. A return value of 0 indicates to the caller (i.e. the operating system) that execution was nominal and without errors. Actually the proper exit values for main are system dependent. For the sake of portability one should use the exit codes EXIT_SUCCESS and EXIT_FAILURE, which are predefined constants found in the library header file stdlib.h. The exit code allows the status of the main program to be tested at the level of the operating system. (In Unix the exit code is stored in the shell variable $status under csh and tcsh, and in $? under sh and bash.)

    /*
    * hello.c
    * Prints "Hello World!" to stdout
    */
    #include <stdio.h>
    #include <stdlib.h>
    #define HELLO_STRING "Hello World!\n"

    int main(){
       printf(HELLO_STRING);
       return EXIT_SUCCESS;
    }

Compiling a C program

A C program may be composed of any number of source files. Each source file name must end with the .c extension. There are four components to the compilation process:

    Preprocessor
    Compiler
    Assembler
    Linker

The preprocessor performs only pure text substitutions, i.e. expanding symbolic constants in macro definitions and inserting library header files. The compiler creates assembly language code corresponding to the instructions in the source file. The assembler translates the assembly code into native machine readable binary object code. One object file is created for each source file. Each object file has the same name as the corresponding source file with the .c extension replaced by .o. The linker searches specified libraries for functions that the program uses (such as printf() above) and combines pre-compiled object code for those functions with the program's object code. The finished product is a complete executable binary file. Most Unix systems provide several C compilers.
We will use the gcc compiler exclusively in this course. To create the object file hello.o do

    % gcc -c -std=c99 -Wall hello.c

(Remember % stands for the Unix prompt.) The -c option means compile only (i.e. do not link), -std=c99 means use the C99 language standard, and -Wall turns on a broad set of compiler warnings. To link hello.o with the standard library functions declared in stdio.h and stdlib.h do

    % gcc -o hello hello.o

The -o option specifies hello as the name of the executable binary file to be created. If this option is left out the executable will be named a.out. To run the program type its name at the command prompt.

    % hello
    Hello World!

(If the current directory is not in your command search path, type ./hello instead.) The whole four step process can be automated by doing one command

    % gcc -std=c99 -Wall -o hello hello.c

which automatically deletes the object file hello.o after linking. If the program resides in just one source file, this may be the simplest way to compile. If the program is spread throughout several source files (as will often be the case in this course) a Makefile should be used to automate compilation. Here is a simple Makefile for hello.c. (In an actual Makefile each command line under a target must begin with a tab character.)

    # Makefile for hello.c

    hello : hello.o
            gcc -o hello hello.o

    hello.o : hello.c
            gcc -c -std=c99 -Wall hello.c

    clean :
            rm -f hello hello.o

Here is an equivalent Makefile that uses macros.

    # Makefile for hello.c with macros

    FLAGS   = -std=c99 -Wall
    SOURCES = hello.c
    OBJECTS = hello.o
    EXEBIN  = hello

    all: $(EXEBIN)

    $(EXEBIN) : $(OBJECTS)
            gcc -o $(EXEBIN) $(OBJECTS)

    $(OBJECTS) : $(SOURCES)
            gcc -c $(FLAGS) $(SOURCES)

    clean :
            rm -f $(EXEBIN) $(OBJECTS)

See the examples page for a fancy hello world program in C that prints out operating system environment variables.

Command Line Arguments

The main function in a C program may have two possible prototypes:

    int main();
    int main(int argc, char* argv[]);

The second prototype is used for C programs that use command line arguments. The first argument argc specifies the number of string arguments on the command line (including the program name). The second argument is an array containing the string arguments themselves. (Note that in C a string is just a char array and is specified by the type char*.) The parameter argc is necessary since arrays in C do not know their own lengths, unlike in Java. The following program behaves similarly to its namesake in lab2.

    /*
    * CommandLineArguments.c
    */
    #include <stdio.h>
    #include <stdlib.h>

    int main(int argc, char* argv[]){
       int i;
       printf("argc = %d\n", argc);
       for(i = 0; i<argc; i++ ){
          printf("%s\n", argv[i]);
       }
       return EXIT_SUCCESS;
    }

Compile this program and run it with several different sets of arguments. Notice that there is one important difference between this program and the Java version from lab2. If the executable jar file is called CLA for brevity, then the Java version gives

    % CLA x y z
    args.length = 3
    x
    y
    z

whereas the C version above gives

    % CLA x y z
    argc = 4
    CLA
    x
    y
    z

We see that in C, argv[0] is the name of the command that invoked the program, i.e. CLA. In Java the array of strings holding the command line arguments does not include the name of the program.

Console Input and Output

The function printf() discussed above sends its output to stdout, which is the data stream normally associated with the terminal window. The library stdio.h also includes a function called scanf(), which is a general purpose input routine that reads the stream stdin (the data stream normally associated with the keyboard) and stores the information in the variables pointed to by its argument list. It can read all the built-in data types and automatically convert them into the proper internal format. The first argument to scanf() is a format string consisting of three types of characters: format specifiers, whitespace characters, and non-whitespace characters. The format specifiers are preceded by a % sign and tell scanf() what type of data is to be read. For example %s reads a string while %d reads an integer. Some common format codes are:

    %c    character
    %d    signed decimal integer
    %f    decimal floating point
    %s    string (i.e. char array)

See a good C reference for other scanf() codes, which are similar but not identical to printf()'s codes. The format string is read left to right and the format codes are matched, in order, with the remaining arguments that comprise the argument list of scanf(). A whitespace character (i.e. space, newline, or tab) in the format string causes scanf() to skip over whitespace characters in the input stream. In other words, one whitespace character in the format string will cause scanf() to read, but not to store, any number (including zero) of whitespace characters up to the first non-whitespace character. A non-whitespace character in the format string causes scanf() to read and discard a single matching character in the input stream. For example, the control string " %d, %s" causes scanf() to first skip any number of leading whitespace characters, then read one integer, then read and discard one comma, then skip any number of whitespace characters, then read one string. If the specified input is not found, scanf() stops reading and returns.

All the variables receiving values through scanf() must be passed by reference, i.e. the address of each variable receiving a value must be placed in the argument list. We use the "address-of" operator & to specify the address of a variable. For instance the following program asks the user for three integers, then echoes them back to the screen.
    #include<stdio.h>
    #include<stdlib.h>

    int main(){
       int x, y, z;
       printf("Enter three integers separated by commas, then press return: ");
       scanf(" %d, %d, %d", &x, &y, &z);
       printf("The integers entered were %d, %d, %d\n", x, y, z);
       return EXIT_SUCCESS;
    }

A sample run of this program looks like

    Enter three integers separated by commas, then press return: 12, -7, 13
    The integers entered were 12, -7, 13

Running again, but this time leaving out the separating commas in the input line gives

    Enter three integers separated by commas, then press return: 12 -7 13
    The integers entered were 12, 4, -4261060

Since the comma separating the first and second integers was left out, scanf() read the first integer, then expected to read and discard a comma but failed to do so, then just returned without reading anything else. The values printed for variables y and z are whatever happened to be stored in uninitialized memory and will no doubt be different when you run the program. Thus we see that scanf() is intended for reading formatted input. This is what the "f" in its name stands for. The scanf() function returns a number equal to the number of fields that were successfully assigned. That number can be tested and used within the program, as the following example illustrates.

    #include<stdio.h>
    #include<stdlib.h>

    int main(){
       int n, i;
       int x[3];
       printf("Enter three integers separated by spaces, then press return: ");
       n = scanf(" %d %d %d", &x[0], &x[1], &x[2]);
       printf("%d numbers were successfully read: ", n);
       for(i=0; i<n; i++) printf("%d ", x[i]);
       printf("\n");
       return EXIT_SUCCESS;
    }

Some sample runs of this program follow:

    Enter three integers separated by spaces, then press return: 1 2 G
    2 numbers were successfully read: 1 2

    Enter three integers separated by spaces, then press return: monkey at the keyboard!
    0 numbers were successfully read:

If the input ends (or a read error occurs) before the first field is assigned, then scanf() returns the pre-defined constant EOF. Reading the end-of-file character is considered such an error. You can place an end-of-file character into the input stream by typing control-d. The value of EOF is always a negative integer and is defined in the header file stdio.h. Evidently the value of EOF on my system is -1, as the following test run illustrates.

    Enter three integers separated by spaces, then press return: ^D
    -1 numbers were successfully read:
    %

File Input and Output

The library header file stdio.h defines many functions for doing various types of input and output. See any good C reference for a listing of these functions (e.g. getc(), getchar(), gets(), putc(), putchar(), and puts() are commonly used by C programmers, although gets() is unsafe, was removed from the language in C11, and should be avoided in favor of fgets()). Most of these functions have both a console version and a file version. The functions fprintf() and fscanf() are the file equivalents of printf() and scanf() respectively. They operate exactly as do printf() and scanf() except that they have an additional first argument that specifies a file to write to or read from. This file "handle", as it is sometimes called, is a variable of type FILE*, which is defined in stdio.h. Files are opened and closed using the fopen() and fclose() functions. The following program illustrates the use of functions fopen(), fclose(), fscanf(), and fprintf().

    /*
    * FileIO.c
    * Reads input file and prints each word on a separate line of
    * the output file.
    */
    #include<stdio.h>
    #include<stdlib.h>

    int main(int argc, char* argv[]){
       FILE* in;        /* file handle for input */
       FILE* out;       /* file handle for output */
       char word[256];  /* char array to store words from input file */

       /* check command line for correct number of arguments */
       if( argc != 3 ){
          printf("Usage: %s <input file> <output file>\n", argv[0]);
          exit(EXIT_FAILURE);
       }

       /* open input file for reading */
       in = fopen(argv[1], "r");
       if( in==NULL ){
          printf("Unable to read from file %s\n", argv[1]);
          exit(EXIT_FAILURE);
       }

       /* open output file for writing */
       out = fopen(argv[2], "w");
       if( out==NULL ){
          printf("Unable to write to file %s\n", argv[2]);
          exit(EXIT_FAILURE);
       }

       /* read words from input file, print on separate lines to output file */
       while( fscanf(in, " %s", word) != EOF ){
          fprintf(out, "%s\n", word);
       }

       /* close input and output files */
       fclose(in);
       fclose(out);

       return(EXIT_SUCCESS);
    }

Notice that by inserting the leading space in the format string for fscanf(), we skip all intervening whitespace characters and thus parse the tokens in the file. (The %s code also skips leading whitespace on its own, so the space serves mainly to make the intent explicit.) Recall a token is a substring that (1) contains no white space characters, and (2) is maximal with respect to (1).

What to turn in

Write a C program called FileReverse.c that behaves exactly like the program FileReverse.java that you wrote in lab 2. Thus FileReverse.c will take two command line arguments naming the input and output files respectively (following the FileIO.c example above). Your program will read each word in the input file, then print it backwards on a line by itself. For example given a file called in containing the text:

    abc def ghij klm nopq rstuv w xyz

the command

    % FileReverse in out

will create a file called out containing the lines:

    cba
    fed
    jihg
    mlk
    qpon
    vutsr
    w
    zyx

Your program will contain a function called stringreverse() with the heading

    void stringreverse(char* s)

which reverses its string argument. Place the definition of this function after all preprocessor directives but before the function main(). Your main function will be almost identical to FileIO.c above, except that the while loop will contain a call to stringreverse(word). Although it is possible to write stringreverse() as a recursive function, it is not recommended that you do so unless you are very familiar with C strings and the string handling functions in the standard library string.h. Instead it is recommended that you implement a simple iterative algorithm to reverse the string s (see below). There is one function from string.h that you will need however, and that is strlen(), which returns the length of its string argument. For example the following program prints out the length of a string entered on the command line.

    /*
    * charcount.c
    * prints the number of characters in a string on the command line
    */
    #include<stdio.h>
    #include<stdlib.h>
    #include<string.h>

    int main(int argc, char* argv[]){
       if(argc<2){
          printf("Usage: %s some-string\n", argv[0]);
          exit(EXIT_FAILURE);
       }
       /* strlen() returns type size_t, so cast to int to match %d */
       printf("%s contains %d characters\n", argv[1], (int)strlen(argv[1]) );
       return EXIT_SUCCESS;
    }

Remember that a string in C is a char array, not a separate data type as it is in Java. Recall also that arrays in C cannot be queried as to their length. How then does strlen() work? Actually a string in C is a little bit more than a char array. By convention, C strings are terminated by the null character '\0'. This character acts as a sentinel, telling the functions in string.h where the end of the string is. Function strlen() counts and returns the number of characters in its char* (char array) argument up to (but not including) the null character.

Here is an algorithm in pseudo-code that will perform the string reversal:

    1. Set two variables i and j to i=0 and j=strlen(s)-1
    2. Swap the characters s[i] and s[j]
    3. Increment i and decrement j
    4. Stop if i>=j
    5. Otherwise go back to (2)

One more question should be answered here. Are arrays in C passed by value or by reference? Obviously by reference, for otherwise functions like stringreverse() would have no effect on their array arguments outside the function call.
In fact, an array name in C is treated, in most contexts, as the address of (i.e. a pointer to) the first element in the array, so what a function receives is that address rather than a copy of the array. Arrays, strings, and pointer variables in C will be discussed in subsequent lab assignments.

Write a Makefile that creates an executable binary file called FileReverse, and includes a clean utility. Submit the files README, Makefile, and FileReverse.c to the assignment name lab3. Start early and ask for help if you are at all confused.