Princeton University Computer Science 217: Introduction to Programming Systems. Goals of this Lecture. A Taste of C. Agenda.

Similar documents
Goals of this Lecture

Princeton University Computer Science 217: Introduction to Programming Systems. A Taste of C

Reminder. Sign up for ee209 mailing list. Precept. If you haven t received any from ee209 yet Follow the link from our class homepage

C Examples. Goals of this Lecture. Overview of this Lecture

o Echo the input directly to the output o Put all lower-case letters in upper case o Put the first letter of each word in upper case

Fundamentals of Programming. Lecture 11: C Characters and Strings

LESSON 4. The DATA TYPE char

SWEN-250 Personal SE. Introduction to C

Chapter 8 C Characters and Strings

Chapter 8 - Characters and Strings

Muntaser Abulafi Yacoub Sabatin Omar Qaraeen. C Data Types

today cs3157-fall2002-sklar-lect05 1

Strings. Steven R. Bagley

LECTURE 15. String I/O and cstring library

Chapter 8: Character & String. In this chapter, you ll learn about;

Contents. Preface. Introduction. Introduction to C Programming

Program Building. Goals for this Lecture. Help you learn about: Program structure. Detecting and reporting failure

C Style Strings. Lecture 11 COP 3014 Spring March 19, 2018

C: How to Program. Week /May/28

Features of C. Portable Procedural / Modular Structured Language Statically typed Middle level language

C Libraries. Bart Childs Complementary to the text(s)

Scientific Programming in C V. Strings

cs3157: another C lecture (mon-21-feb-2005) C pre-processor (3).

C Fundamentals! C vs. Java: Design Goals. q Java design goals! q Implications for Java! language, machine language, hardware!

Object Oriented Programming COP3330 / CGS5409

Strings and Library Functions

C mini reference. 5 Binary numbers 12

Princeton University Computer Science 217: Introduction to Programming Systems. Assembly Language: Function Calls

Introduction to string

scanf erroneous input Computer Programming: Skills & Concepts (CP) Characters Last lecture scanf error-checking our input This lecture

Fundamental Data Types. CSE 130: Introduction to Programming in C Stony Brook University

CS3157: Advanced Programming. Outline

Draft. Chapter 1 Program Structure. 1.1 Introduction. 1.2 The 0s and the 1s. 1.3 Bits and Bytes. 1.4 Representation of Numbers in Memory

Converting a Lowercase Letter Character to Uppercase (Or Vice Versa)

c) Comments do not cause any machine language object code to be generated. d) Lengthy comments can cause poor execution-time performance.

Computers Programming Course 5. Iulian Năstac

Introduction to Algorithms and Data Structures. Lecture 6 - Stringing Along - Character and String Manipulation

Characters and Strings

Main Program. C Programming Notes. #include <stdio.h> main() { printf( Hello ); } Comments: /* comment */ //comment. Dr. Karne Towson University

Goals of C "" The Goals of C (cont.) "" Goals of this Lecture"" The Design of C: A Rational Reconstruction"

UNIT - I. Introduction to C Programming. BY A. Vijay Bharath

Full file at C How to Program, 6/e Multiple Choice Test Bank

C Concepts - I/O. Lecture 19 COP 3014 Fall November 29, 2017

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Arrays, Strings, & Pointers

C-programming. Goal To give basic knowledge of the C language. Some previous experiences in programming are assumed. Litterature

Computer Science 217 Final Exam January 25, :30-4:30pm

The Design of C: A Rational Reconstruction (cont.)

Strings in C. Professor Hugh C. Lauer CS-2303, System Programming Concepts

upper and lower case English letters: A-Z and a-z digits: 0-9 common punctuation symbols special non-printing characters: e.g newline and space.

Multiple Choice Questions ( 1 mark)

Tail recursion. Decision. Assignment. Iteration

C and Programming Basics

CIS 2107 Computer Systems and Low-Level Programming Fall 2011 Midterm

The Design of C: A Rational Reconstruction (cont.)" Jennifer Rexford!

Chapter 7 C Pointers

C PROGRAMMING. Characters and Strings File Processing Exercise

These are reserved words of the C language. For example int, float, if, else, for, while etc.

Assembly Language: Function Calls

The Compilation Process

ch = argv[i][++j]; /* why does ++j but j++ does not? */

Characters, c-strings, and the string Class. CS 1: Problem Solving & Program Design Using C++

Appendix A Developing a C Program on the UNIX system

Chapter 2 Basic Elements of C++

THE FUNDAMENTAL DATA TYPES

UNIT-I Input/ Output functions and other library functions

Mid-term Exam. Fall Semester 2017 KAIST EE209 Programming Structures for Electrical Engineering. Name: Student ID:

Mid-term Exam. Fall Semester 2017 KAIST EE209 Programming Structures for Electrical Engineering. Name: Student ID:

CS102: Variables and Expressions

printf( Please enter another number: ); scanf( %d, &num2);

Arithmetic Expressions in C

6.096 Introduction to C++ January (IAP) 2009

THE C STANDARD LIBRARY & MAKING YOUR OWN LIBRARY. ISA 563: Fundamentals of Systems Programming

AIR FORCE SCHOOL,BAMRAULI COMPUTER SCIENCE (083) CLASS XI Split up Syllabus (Session ) Contents

In this Chapter, we tie up some loose ends concerning some of the built in functions provided

Fundamentals of Computer Programming Using C

Approximately a Test II CPSC 206

Work relative to other classes

Variables in C. Variables in C. What Are Variables in C? CMSC 104, Fall 2012 John Y. Park

Debugging (Part 1) The material for this lecture is drawn, in part, from The Practice of Programming (Kernighan & Pike) Chapter 5

CS165 Computer Security. Understanding low-level program execution Oct 1 st, 2015

Princeton University. Computer Science 217: Introduction to Programming Systems. Data Types in C

Reserved Words and Identifiers

1/25/2018. ECE 220: Computer Systems & Programming. Write Output Using printf. Use Backslash to Include Special ASCII Characters

sends the formatted data to the standard output stream (stdout) int printf ( format_string, argument_1, argument_2,... ) ;

C Language, Token, Keywords, Constant, variable

CS102: Standard I/O. %<flag(s)><width><precision><size>conversion-code

Arrays. Lecture 9 COP 3014 Fall October 16, 2017

C: How to Program. Week /Mar/05

Softprocessor SP3: Subprograms and Header

program structure declarations and definitions expressions and statements more standard I/O

Villanova University CSC 2400: Computer Systems I

x86-64 Programming III & The Stack

Fundamental of Programming (C)

SU 2017 May 11/16 LAB 2: Character and integer literals, number systems, character arrays manipulation, relational operator

ITC213: STRUCTURED PROGRAMMING. Bhaskar Shrestha National College of Computer Studies Tribhuvan University

Section 3: Library Functions

C How to Program, 6/e by Pearson Education, Inc. All Rights Reserved.

BIL 104E Introduction to Scientific and Engineering Computing. Lecture 1

Writing an ANSI C Program Getting Ready to Program A First Program Variables, Expressions, and Assignments Initialization The Use of #define and

Transcription:

Princeton University Computer Science 217: Introduction to Programming Systems Goals of this Lecture A Taste of C C Help you learn about: The basics of C Deterministic finite-state automata (DFA) Expectations for programming assignments Why? Help you get started with Assignment 1 Required readings + coverage of programming environment in precepts + minimal coverage of C in this lecture = enough info to start Assignment 1 DFAs are useful in many contexts E.g. Assignment 1, Assignment 7 1 2 Agenda The charcount Program The charcount program The upper program The upper1 program Functionality: Read all chars from stdin (standard input stream) Write to stdout (standard output stream) the number of chars read stdin Line 1 Line 2 charcount stdout 14 3 4 The charcount Program charcount Building and Running The program: $ gcc217 o charcount $./charcount Line 1 Line 2 ^D 14 $ What is this? What is the effect? 5 6

charcount Building and Running charcount Building and Running $ cat somefile Line 1 Line 2 $./charcount < somefile 14 $ What is this? What is the effect? $./charcount > someotherfile Line 1 Line 2 ^D $ cat someotherfile 14 What is this? What is the effect? 7 8 charcount Building and Running in Detail charcount Building and Running in Detail Question: Exactly what happens when you issue the command gcc217 o charcount Answer: Four steps Preprocess Compile Assemble Link The starting point C language Missing definitions of getchar() and printf() 9 10 Preprocessing charcount Preprocessing charcount Command to preprocess: gcc217 E > charcount.i Preprocessor functionality Removes comments Handles preprocessor directives Preprocessor replaces with contents of /usr/include/stdio.h Preprocessor replaces EOF with -1 11 12

Preprocessing charcount Preprocessing charcount while (c!= -1) Preprocessor removes comment The result charcount.i int getchar(); int printf(char *fmt, ); while (c!= -1) Why int instead of char? C language Missing comments Missing preprocessor directives Contains code from stdio.h Declarations of getchar() and printf() Missing definitions of getchar() and printf() 13 14 Compiling charcount Compiling charcount Command to compile: gcc217 S charcount.i Compiler functionality Translate from C to assembly language Use function declarations to check calls of getchar() and printf() charcount.i int getchar(); int printf(char *fmt, ); while (c!= -1) Compiler sees function declarations So compiler has enough information to check subsequent calls of getchar() and printf() 15 16 Compiling charcount Compiling charcount charcount.i int getchar(); int printf(char *fmt, ); while (c!= -1) Definition of main() function Compiler checks calls of getchar() and printf() when encountered Compiler translates to assembly language 17 The result: charcount.s.section ".rodata" format:.string "%d\n".section ".text".globl main.type main,@function main: pushq %rbp movq %rsp, %rbp subq $4, %rsp call getchar loop: cmpl $-1, %eax je endloop incl -4(%rbp) call getchar jmp loop endloop: movq $format, %rdi movl -4(%rbp), %esi movl $0, %eax call printf movl $0, %eax movq %rbp, %rsp popq %rbp ret Assembly language Missing definitions of getchar() and printf() 18

Assembling charcount Assembling charcount The result: Command to assemble: gcc217 c charcount.s charcount.o Assembler functionality Translate from assembly language to machine language Machine language version of the program No longer human readable Machine language Missing definitions of getchar() and printf() 19 20 Linking charcount Linking charcount The result: Command to link: gcc217 charcount.o o charcount charcount Linker functionality Resolve references Fetch machine language code from the standard C library (/usr/lib/libc.a) to make the program complete Machine language version of the program No longer human readable Machine language Contains definitions of getchar() and printf() Complete! Executable! 21 22 Command to run:./charcount < somefile Computer allocates space for c and charcount in the stack section of memory Why int instead of char? 23 24

Computer calls getchar() getchar() tries to read char from stdin Success returns char (within an int) Failure returns EOF EOF is a special non-char value that getchar() returns to indicate failure 25 Assuming c EOF, computer increments charcount 26 Computer calls getchar() again, and repeats Eventually getchar() returns EOF Computer breaks out of loop Computer calls printf() to write charcount 27 28 Other Ways to charcount Computer executes return stmt Return from main() terminates program Normal execution return 0 or EXIT_SUCCESS Abnormal execution return EXIT_FAILURE 29 1 2 3 for (c=getchar(); c!=eof; c=getchar()) charcount++; while ((c=getchar())!=eof) charcount++; for (;;) { if (c == EOF) charcount++; 4 Which way is best? while (c!=eof) 30

Review of Example 1 Input/Output Including stdio.h Functions getchar() and printf() Representation of a character as an integer Predefined constant EOF Program control flow The for and while statements The break statement The return statement Operators Assignment: = Increment: ++ Relational: ==!= Agenda The charcount program The upper program The upper1 program 31 32 Example 2: upper upper Building and Running Functionality Read all chars from stdin Convert each lower case alphabetic char to upper case Leave other kinds of chars alone Write result to stdout stdin Does this work? It seems to work. upper stdout DOES THIS WORK? IT SEEMS TO WORK. $ gcc217 upper.c o upper $ cat somefile Does this work? It seems to work. $./upper < somefile DOES THIS WORK? IT SEEMS TO WORK. $ 33 34 ASCII upper Version 1 American Standard Code for Information Interchange 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 0 NUL HT LF 16 32 SP! " # $ % & ' ( ) * +, -. / 48 0 1 2 3 4 5 6 7 8 9 : ; < = >? 64 @ A B C D E F G H I J K L M N O 80 P Q R S T U V W X Y Z [ \ ] ^ _ 96 ` a b c d e f g h i j k l m n o 112 p q r s t u v w x y z { ~ Partial map Note: Lower case and upper case letters are 32 apart { if ((c >= 97) && (c <= 122)) c -= 32; putchar(c); What s wrong? 35 36

EBCDIC Extended Binary Coded Decimal Interchange Code 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 0 NUL HT 16 32 LF 48 64 SP. < ( + 80 &! $ * ) ; 96 - /, % _ >? 112 ` : # @ ' = " 128 a b c d e f g h i { 144 j k l m n o p q r 160 ~ s t u v w x y z 176 192 A B C D E F G H I 208 J K L M N O P Q R 224 \ S T U V W X Y Z 240 0 1 2 3 4 5 6 7 8 9 Note: Lower case not contiguous; same for upper case Partial map 37 Character Literals Examples 'a' '\n' '\t' '\\' '\'' '\0' the a character 97 on ASCII systems 129 on EBCDIC systems newline 10 on ASCII systems 37 on EBCDIC systems horizontal tab 9 on ASCII systems 5 on EBCDIC systems backslash 92 on ASCII systems 224 on EBCDIC systems single quote 39 on ASCII systems 125 on EBCDIC systems the null character (alias NUL) 0 on all systems 38 upper Version 2 ctype.h Functions { if ((c >= 'a') && (c <= 'z')) c += 'A' - 'a'; putchar(c); Arithmetic on chars? What s wrong? 39 $ man islower NAME isalnum, isalpha, isascii, isblank, iscntrl, isdigit, isgraph, islower, isprint, ispunct, isspace, isupper, isxdigit character classification routines SYNOPSIS int isalnum(int c); int isalpha(int c); int isascii(int c); int isblank(int c); int iscntrl(int c); int isdigit(int c); int isgraph(int c); int islower(int c); int isprint(int c); int ispunct(int c); int isspace(int c); int isupper(int c); int isxdigit(int c); These functions check whether c falls into a certain character class ctype.h Functions upper Final Version $ man toupper NAME toupper, tolower - convert letter to upper or lower case SYNOPSIS int toupper(int c); int tolower(int c); DESCRIPTION toupper() converts the letter c to upper case, if possible. tolower() converts the letter c to lower case, if possible. If c is not an unsigned char value, or EOF, the behavior of these functions is undefined. RETURN VALUE The value returned is that of the converted letter, or c if the conversion was not possible. { if (islower(c)) c = toupper(c); putchar(c); Is the if statement really necessary? 42

Review of Example 2 Agenda Representing characters ASCII and EBCDIC character sets Character literals (e.g., A or a ) Manipulating characters Arithmetic on characters Functions such as islower() and toupper() The charcount program The upper program The upper1 program 43 44 Example 3: upper1 upper1 Building and Running Functionality Read all chars from stdin Capitalize the first letter of each word cos 217 rocks Cos 217 Rocks Write result to stdout stdin cos 217 rocks Does this work? It seems to work. upper1 stdout Cos 217 Rocks Does This Work? It Seems To Work. $ gcc217 upper1.c o upper1 $ cat somefile cos 217 rocks Does this work? It seems to work. $./upper1 < somefile Cos 217 Rocks Does This Work? It Seems To Work. $ 45 46 upper1 Challenge Deterministic Finite Automaton Problem Must remember where you are Capitalize c in cos, but not o in cos or c in rocks Solution Maintain some extra information In a word vs not in a word Deterministic Finite State Automaton (DFA)!isalpha (print) NORMAL isalpha (print uppercase equiv)!isalpha (print) INWORD isalpha (print) States, one of which is denoted the start state Transitions labeled by chars or char categories Optionally, actions on transitions 47 48

upper1 Version 1 upper1 Toward Version 2 int state = 0; { switch (state) { case 0: { putchar(toupper(c)); state = 1; state = 0; case 1: state = 1; state = 0;!isalpha isalpha 0 1!isalpha That s a B. What s wrong? isalpha 49 Problem: The program works, but States should have names Solution: Define your own named constants enum Statetype {NORMAL, INWORD; Define an enumeration type enum Statetype state; Define a variable of that type 50 upper1 Version 2 upper1 Toward Version 3 enum Statetype {NORMAL, INWORD; enum Statetype state = NORMAL; { switch (state) { case NORMAL: { putchar(toupper(c)); state = INWORD; state = NORMAL; case INWORD: state = INWORD; state = NORMAL; That s a B+. What s wrong? 51 Problem: The program works, but Deeply nested statements No modularity Solution: Handle each state in a separate function 52 upper1 Version 3 upper1 Toward Final Version enum Statetype {NORMAL, INWORD; enum Statetype handlenormalstate(int c) { enum Statetype state; { putchar(toupper(c)); state = INWORD; state = NORMAL; return state; enum Statetype handleinwordstate(int c) { enum Statetype state; if (!isalpha(c)) state = NORMAL; state = INWORD; return state; enum Statetype state = NORMAL; { switch (state) { case NORMAL: state = handlenormalstate(c); case INWORD: state = handleinwordstate(c); That s an A-. What s wrong? 53 Problem: The program works, but No comments Solution: Add (at least) function-level comments 54

Function Comments Function comment should describe what the function does (from the caller s viewpoint) Input to the function Parameters, input streams Output from the function Return value, output streams, (call-by-reference parameters) Function comment should not describe how the function works Function Comment Examples Bad main() function comment Read a character from stdin. Depending upon the current DFA state, pass the character to an appropriate state-handling function. The value returned by the state-handling function is the next DFA state. Repeat until end-of-file. Describes how the function works Good main() function comment Read text from stdin. Convert the first character of each "word" to uppercase, where a word is a sequence of letters. Write the result to stdout. Return 0. 55 Describes what the function does from caller s viewpoint 56 upper1 Final Version upper1 Final Version /*------------------------------------------------------------*/ /* upper1.c */ /* Author: Bob Dondero */ /*------------------------------------------------------------*/ enum Statetype {NORMAL, INWORD; Continued on next page /*----------------------------------------------------------*/ /* Implement the NORMAL state of the DFA. c is the current DFA character. Write c or its uppercase equivalent to stdout, as specified by the DFA. Return the next state. */ enum Statetype handlenormalstate(int c) { enum Statetype state; { putchar(toupper(c)); state = INWORD; state = NORMAL; return state; Continued on next page 57 58 upper1 Final Version upper1 Final Version /*----------------------------------------------------------*/ /*----------------------------------------------------------*/ /* Implement the INWORD state of the DFA. c is the current DFA character. Write c to stdout, as specified by the DFA. Return the next state. */ enum Statetype handleinwordstate(int c) { enum Statetype state; if (!isalpha(c)) state = NORMAL; state = INWORD; return state; Continued on next page /* Read text from stdin. Convert the first character of each "word" to uppercase, where a word is a sequence of letters. Write the result to stdout. Return 0. */ /* Use a DFA approach. state indicates the DFA state. */ enum Statetype state = NORMAL; { switch (state) { case NORMAL: state = handlenormalstate(c); case INWORD: state = handleinwordstate(c); 59 60

Review of Example 3 Deterministic finite-state automaton Two or more states Transitions between states Next state is a function of current state and current character Actions can occur during transitions Expectations for COS 217 assignments Readable Meaningful names for variables and literals Reasonable max nesting depth Modular Multiple functions, each of which does one well-defined job Function-level comments Should describe what function does See K&P book for style guidelines specification Summary The C programming language Overall program structure Control statements (if, while, for, and switch) Character I/O functions (getchar() and putchar()) Deterministic finite state automata (DFA) Expectations for programming assignments Especially Assignment 1 Start Assignment 1 soon! 61 62 Another DFA Example Appendix: Additional DFA Examples Does the string have nano in it? banano yes nnnnnnnanofff yes banananonano yes bananananashanana no other n n Double circle is accepting state Single circle is rejecting state other n a n o start n na nan other a nano other other 63 64 Yet Another DFA Example Old Exam Question Compose a DFA to identify whether or not a string is a floating-point literal Valid literals -34 78.1 +298.3-34.7e-1 34.7E-1 7..7 999.99e99 Invalid literals abc -e9 1e + 17.9A 0.38+. 38.38f9 65