Computer Science & Engineering 150A Problem Solving Using Computers. Chapter 9. Strings. Notes. Notes. Notes. Lecture 07 - Strings

Similar documents
CSCE150A. Introduction. Basics. String Library. Substrings. Line Scanning. Sorting. Command Line Arguments. Misc CSCE150A. Introduction.

Computer Science & Engineering 150A Problem Solving Using Computers

Arrays, Strings, & Pointers

Strings. Chuan-Ming Liu. Computer Science & Information Engineering National Taipei University of Technology Taiwan

Pointers, Arrays, and Strings. CS449 Spring 2016

Chapter 8: Character & String. In this chapter, you ll learn about;

8. Characters, Strings and Files

Strings(2) CS 201 String. String Constants. Characters. Strings(1) Initializing and Declaring String. Debzani Deb

Computer Programming: Skills & Concepts (CP) Strings

Characters and Strings

Strings. Arrays of characters. Pallab Dasgupta Professor, Dept. of Computer Sc & Engg INDIAN INSTITUTE OF TECHNOLOGY

CSC209H Lecture 4. Dan Zingaro. January 28, 2015

Introduction to string

C Programming. Unit 9. Manipulating Strings File Processing.

Chapter 8 - Characters and Strings

Lecture07: Strings, Variable Scope, Memory Model 4/8/2013

C Characters and Strings

Lecture 17. Strings II. CptS 121 Summer 2016 Armen Abnousi

Arrays and Strings. CS449 Fall 2017

C: How to Program. Week /May/28

COMP1917: 09 Arrays and Strings

Reading Assignment. Strings. K.N. King Chapter 13. K.N. King Sections 23.4, Supplementary reading. Harbison & Steele Chapter 12, 13, 14

Create a Program in C (Last Class)

CCE1111 Programming for Engineers [C Programming Language]

Chapter 8 C Characters and Strings

Fundamentals of Programming. Lecture 11: C Characters and Strings

Using Character Arrays. What is a String? Using Character Arrays. Using Strings Life is simpler with strings. #include <stdio.

C mini reference. 5 Binary numbers 12

Fundamentals of Programming & Procedural Programming

CSCI 6610: Intermediate Programming / C Chapter 12 Strings

Intermediate Programming, Spring 2017*

Iosif Ignat, Marius Joldoș Laboratory Guide 9. Character strings CHARACTER STRINGS

Array Initialization

upper and lower case English letters: A-Z and a-z digits: 0-9 common punctuation symbols special non-printing characters: e.g newline and space.

C: Arrays, and strings. Department of Computer Science College of Engineering Boise State University. September 11, /16

Lecture 05 Pointers ctd..

Computers Programming Course 11. Iulian Năstac

Overview. Concepts this lecture String constants Null-terminated array representation String library <strlib.h> String initializers Arrays of strings

ONE DIMENSIONAL ARRAYS

C: Pointers, Arrays, and strings. Department of Computer Science College of Engineering Boise State University. August 25, /36

LECTURE 15. String I/O and cstring library

gcc hello.c a.out Hello, world gcc -o hello hello.c hello Hello, world

Course organization. Course introduction ( Week 1)

CS 137 Part 6. ASCII, Characters, Strings and Unicode. November 3rd, 2017

1 Pointer Concepts. 1.1 Pointer Examples

CSE 230 Intermediate Programming in C and C++ Arrays, Pointers and Strings

C Style Strings. Lecture 11 COP 3014 Spring March 19, 2018

Characters, Character Strings, and string-manipulation functions in C

211: Computer Architecture Summer 2016

C PROGRAMMING. Characters and Strings File Processing Exercise

Programming in C. Session 7. Seema Sirpal Delhi University Computer Centre

CS 261 Fall Mike Lam, Professor. Structs and I/O

Arrays and Pointers (part 2) Be extra careful with pointers!

Converting a Lowercase Letter Character to Uppercase (Or Vice Versa)

C BOOTCAMP DAY 2. CS3600, Northeastern University. Alan Mislove. Slides adapted from Anandha Gopalan s CS132 course at Univ.

Strings. Steven R. Bagley

Introduction to C/C++ Lecture 5 - String & its Manipulation

Arrays and Pointers (part 2) Be extra careful with pointers!

C-String Library Functions

CS 241 Data Organization Pointers and Arrays

C strings. (Reek, Ch. 9) 1 CS 3090: Safety Critical Programming in C

ECE551 Midterm Version 1

Chapter 9 Strings. With this array declaration: char s[10];

C Programming Language Review and Dissection III

EECS402 Lecture 08. Intro To The Standard string Class. Some string Functionality

Scientific Programming in C V. Strings

CSC 2400: Computer Systems. Arrays and Strings in C

Lecture 12. We have already used strings. Strings. Hello Class is a string.

Floating-point lab deadline moved until Wednesday Today: characters, strings, scanf Characters, strings, scanf questions clicker questions

Strings in C++ Dr. Ferdin Joe John Joseph Kamnoetvidya Science Academy

Announcements. Strings and Pointers. Strings. Initializing Strings. Character I/O. Lab 4. Quiz. July 18, Special character arrays

Grade Distribution. Exam 1 Exam 2. Exams 1 & 2. # of Students. Total: 17. Total: 17. Total: 17

Procedural Programming

File IO and command line input CSE 2451

Computer Security. Robust and secure programming in C. Marius Minea. 12 October 2017

CS 220: Introduction to Parallel Computing. Input/Output. Lecture 7

CSC 2400: Computer Systems. Arrays and Strings in C

String constants. /* Demo: string constant */ #include <stdio.h> int main() {

Simple Output and Input. see chapter 7

Chapter 5. Section 5.4 The Common String Library Functions. CS 50 Hathairat Rattanasook

Input/Output and the Operating Systems

Chapter 13. Strings. Introduction

EE 355 Lab 4 - Party Like A Char Star

CSC 270 Survey of Programming Languages. C-String Values

Slide Set 3. for ENCM 339 Fall 2017 Section 01. Steve Norman, PhD, PEng

Model Viva Questions for Programming in C lab

ELEC / COMP 177 Fall Some slides from Kurose and Ross, Computer Networking, 5 th Edition

Chapter 8 Character Arrays and Strings

CS1100 Introduction to Programming

Module: Safe Programming. Professor Trent Jaeger. CSE543 - Introduction to Computer and Network Security

SYSC 2006 C Winter String Processing in C. D.L. Bailey, Systems and Computer Engineering, Carleton University

CMPSC 311- Introduction to Systems Programming Module: Strings

Approximately a Test II CPSC 206

CSE 333 Lecture 7 - final C details

Why arrays? To group distinct variables of the same type under a single name.

Computer Programming

Should you know scanf and printf?

C Input/Output. Before we discuss I/O in C, let's review how C++ I/O works. int i; double x;

Text Output and Input; Redirection

C Libraries. Bart Childs Complementary to the text(s)

Transcription:

Computer Science & Engineering 150A Problem Solving Using Computers Lecture 07 - Strings Scanning Stephen Scott (Adapted from Christopher M. Bourke) 1 / 51 Fall 2009 cbourke@cse.unl.edu Chapter 9 Scanning 9.1 String 9.2 Functions: Assignment and 9.3 Longer Strings: and Whole- Input 9.4 String Comparison 9.6 Character Operations 9.7 String-to-Number and Number-to-String Conversion 9.8 Common Programming Errors 2 / 51 Strings Scanning Until now we have only dealt with single characters char mychar = A, \n Processing and manipulating single characters is too limiting Need a way for dealing with groups of characters 3 / 51

Strings Scanning A collection of characters is called a string C has no string data type Instead, strings are arrays of characters, char mystring[], char myname[20] Necessary to represent textual data, communicate with users in a readable manner 4 / 51 String Scanning Calls to scanf or printf used a string constant as the first argument. We have also dealt with static strings: "Hello World!" printf("a = %d\n", a) printf("average = %.2f", avg) Each string above is a string of 12, 7, and 14 characters, respectively It s possible to use a preprocessor directive: #define INSUFF_DATA "Insufficient Data" 5 / 51 Static Strings Scanning Static strings cannot be changed during the execution of the program They cannot be manipulated or processed May only be changed by recompiling Stored in an array of a fixed size 6 / 51

Declaring and Initializing String Variables Scanning Strings are character arrays Declaration is the same, just use char char string_var[100]; char myname[30]; myname will hold strings anywhere from 0 to 29 characters long Individual characters can be accessed/set using indices 1 myname [0] = B ; 2 myname [1] = r ; 3 myname [2] = i ; 4 myname [3] = a ; 5 myname [4] = n ; 6 printf (" First initial : %c.\ n", myname [0]); 7 / 51 Declaring and Initializing String Variables Scanning You can declare and initialize in one line Be sure to use the double quotes char myname[30] = "Brian"; You need not specify the size of the array when declaring-initializing in one line: char myname[] = "Brian"; C will create a character array large enough to hold the string 8 / 51 Null Terminating Character Scanning C needs a way to tell where the end of a string is With arrays, it is your responsibility to ensure you do not access memory outside the array To determine where the string ends, C uses the null-terminating character: \0 Character with ASCII code 0 9 / 51

Null Terminating Character Example Scanning char str[20] = "Initial value"; will produce the following in memory: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] I n i t i a l v a [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] l u e \0?????? 10 / 51 Arrays of Strings Scanning Without the null terminating character, C would not know where the string ends Many functions parse a string until it sees \0 Without it, the program would run into memory space that doesn t belong to it char str[20] can only hold 19 characters: at least one character is reserved for \0 In declarations, char myname[] = "Brian", C automatically inserts the null-terminating character 11 / 51 Printing Strings Scanning You can use printf to print strings Use %s as a placeholder: printf("my Name is %s.\n", myname); printf prints the string until the first null-terminating character Can specify minimum field width, as with e.g. int: printf("my Name is %20s.\n", myname); A negative field width will left justify instead of right justify 12 / 51

Arrays of Strings Scanning One string is an array of characters; an array of strings is a two-dimensional array of characters 1 # define NUM_PEOPLE 30 2 # define NAME_LEN 25 3... 4 char names [ NUM_PEOPLE ][ NAME_LEN ]; names can hold 30 names, each of up to 24 characters long 13 / 51 Arrays of Strings Scanning We can initialize an array of strings at declaration in the following manner: 1 char month [12][10] = {" January ", " February ", 2 " March "," April ", " May ", " June ", " July ", 3 " August ", " September "," October ", 4 " November ", " December "}; As with other arrays, the [12] is optional Why [10]? September is the longest string with 9 characters Needs an additional character for the null-terminating character 14 / 51 Reading Strings I Scanning You can use scanf and %s to read strings printf("enter Topic: "); scanf("%s", string_var); scanf skips leading whitespace characters such as blanks, newlines, and tabs Starting with the first non-whitespace character, scanf copies the characters it encounters into successive memory cells of its character array argument When a whitespace character is reached, scanning stops, and scanf places the null character at the end of the string in its array argument 15 / 51

Reading Strings II Scanning Note: no & is used The array is already represented by a memory address Dangerous: the user can put as many characters as they want If they input more characters than the string can hold: overflow Segmentation fault (if you re lucky), or may not even crash Rest of the program may produce garbage results 16 / 51 Functions: Assignment and Scanning The assignment operator, = works for simple data types For strings, = only works in the declaration 1 char message [30]; 2 message = " Hello!"; Illegal This is because arrays point to a memory location Cannot assign arbitrary values to memory pointers Must use library functions to do so 17 / 51 Scanning C provides a standard string library Use #include<string.h> Table 9.1 summarizes which functions are provided Copy, concatenation, comparison, length, tokenizer, etc. 18 / 51

String Assignment I Scanning To assign a value to a string, we actually copy it char *strcpy(char *dest, const char *src) copies string src (source) into dest (destination) Note: Second argument has the keyword const: guarantees the source string is not modified First argument must point to a memory location large enough to handle the size of dest This is your responsibility; C does not do it for you Returns a pointer to the first character of dest 19 / 51 String Assignment II Scanning 1 char myemail [30]; 2 strcpy ( myemail, " bgriffin@cse. unl. edu "); Be very careful: 1 char myemail [10]; 2 strcpy ( myemail, " bgriffin@cse. unl. edu "); In this case, se.unl.edu would overwrite adjacent memory cells 20 / 51 String Assignment I Byte-wise Scanning C provides another copying function called strncpy: char *strncpy(char *dest, const char *src, size_t n); size_t is an unsigned integer (no negative value) Copies (up to) n character values of src to dest Actually copies n bytes, but 1 char is one byte 1 char myemail [] = " bgriffin@cse. unl. edu "; 2 char mylogin [30]; 3 // copy first 8 characters : 4 strncpy ( mylogin, myemail, 8); 21 / 51

String Assignment II Byte-wise Scanning Pitfall: If there is no null-terminating character in the first n bytes of src, strncpy will not insert one for you You must add the null terminating character yourself 1 char myemail [] = " bgriffin@cse. unl. edu "; 2 char mylogin [30]; 3 // copy first 8 characters : 4 strncpy ( mylogin, myemail, 8); 5 mylogin [8] = \0 ; 22 / 51 String Assignment III Byte-wise Scanning If n is larger than src, the null-terminating character is copied multiple times: strncpy(astring, "Test", 8); Four null terminating characters will be copied Thus, astring contains "Test\0\0\0\0" 23 / 51 I Scanning is the operation of appending two strings C provides concatenation functions: char *strcat(char *dest, const char *src); char *strncat(char *dest, const char *src, size_t n); Both append src onto the end of dest 24 / 51

II Scanning 1 char fullname [80]; 2 char firstname [30] = " Brian "; 3 char lastname [30] = " Griffin "; 4 strcpy ( fullname, lastname ); 5 strcat ( fullname,", "); 6 strcat ( fullname, firstname ); 7 printf (" My name is %s\n", fullname ); Result: My name is Griffin, Brian 25 / 51 III Scanning strncat copies at most n bytes From the documentation (man pages): If src contains n or more characters, strncat() writes n+1 characters to dest (n from src plus the terminating null byte). Therefore, the size of dest must be at least the length of dest+n+1 26 / 51 I Scanning We can do character comparisons, A < a We can also do string comparisons (lexicographic order), but not with the usual operators <, > <=, etc. Strings (arrays of characters) are memory addresses string_1 < string_2 would compare the memory locations 27 / 51

II Scanning String library provides several comparison functions: int strcmp(const char *s1, const char *s2); int strncmp(const char *s1, const char *s2, size_t n); Both compare s1, s2 If s1 < s2, returns a negative integer If s1 > s2, returns a positive integer If s1 == s2 returns zero strncmp compares only the first n characters 28 / 51 III Scanning 1 char namea [] = " Alpha "; 2 char nameb [] = " Beta "; 3 char namec [] = " Alphie "; 4 char named [] = " BetaFish "; 5 if( strcmp ( namea, nameb ) < 0) 6 printf ("%s comes before %s\n", namea, nameb ); 7 if( strncmp ( namea, namec,4) == 0) 8 printf (" Almost the same!\ n"); 9 if( strcmp ( nameb, named ) < 0) 10 printf ("%s comes before %s\n", nameb, named ); 29 / 51 String Scanning The string library also provides a function to count the number of characters in a string: size_t strlen(const char *s); Returns the number of characters (bytes) appearing before the null terminating character Does not count the size of the array! 1 char message [50] = " You have mail "; 2 int n = strlen ( message ); 3 printf (" message has %d characters \n",n); Result: message has 13 characters 30 / 51

I Scanning A substring is a portion of a string, not necessarily from the beginning strncpy can be used to extract a substring (of n characters), but only from the beginning However, we can use referencing to get the memory address of a character &astring[3] is the memory address of the 4th character in astring We can exploit this fact to copy an arbitrary substring 31 / 51 II Scanning 1 char astring [100] = 2 " Please Email me at the address bgriffin@cse. unl.edu, thnx "; 3 char myemail [20]; 4 // copy a substring 5 strncpy (myemail, &astring [31], 20); 6 printf (" email is %s\n",myemail ); Result: email is bgriffin@cse.unl.edu 32 / 51 Pitfalls & Strategies Scanning Two most important questions when dealing with strings: 1 Is there enough room to perform the given operation? 2 Does the created string end in \0? Read the documentation (man pages) Each string function has its own expectations and guarantees 33 / 51

Scanning a Full I Scanning scanf only gets non-whitespace characters Sometimes it is necessary to get everything, including whitespace Standard function (in stdio library): char *gets(char *s); char *fgets(char *s, int size, FILE *stream); gets works with the standard input, fgets works with any buffer (more in Chapter 12) gets (get a string) 34 / 51 Scanning a Full II Scanning 1 char read_line [80]; 2 gets ( read_line ); 3 printf ("I read your line as \"% s \"\ n", read_line ); Dangerous: If the user enters more than 79 characters, no room for null-terminating character If user enters more than 80 characters: overflow Can actually be a security hazard Compiler message: (.text+0x2c5): warning: the gets function is dangerous and should not be used. 35 / 51 Scanning a Full III Scanning fgets is safer since you can limit the number of bytes it reads: char read_line[80]; fgets(read_line,80,stdin) Reads at most size-1 characters (automatically inserts null-terminating character) Takes the endline character out of the standard input, but retains it in the string 36 / 51

Comparison and Swapping Scanning We can perform a sorting algorithm to a list of strings: 1 for (i =0; i< num_string -1; i ++) 2 { 3 for (j=i; j< num_string ; j ++) 4 { 5 if( strcmp ( list [j], list [j +1]) > 0) 6 Swap ( list [j], list [j +1]); 7 } 8 } What would Swap look like? 37 / 51 Comparison and Swapping Scanning Swapping two strings: 1 strcpy (tmp, list [j ]); 2 strcpy ( list [j], list [j +1]); 3 strcpy ( list [j+1], tmp ); Careful: how big does tmp need to be? 38 / 51 I Scanning Up to now, your int main(void) functions have not taken any parameters. To read parameters (delimited by white space) in from the command line, you can use int main(int argc, char *argv[]) argc gives you a count of the number of arguments which are stored in argv argv is an array of strings (two-dimensional array of characters) 39 / 51

II Scanning argv: the first element is the program name (ex: argv[0] = a.out) Subsequent elements of argv contain strings read from the command line are delimited by whitespace You can encapsulate multiple words from the command line using the double quotes 40 / 51 III Scanning cse> a.out hello world abc 123 "hi everyone" would result in: argc = 6 argv[0] = a.out argv[1] = hello argv[2] = world argv[3] = abc argv[4] = 123 argv[5] = hi everyone 41 / 51 IV Scanning 1 /* 2 commandargs.c 3 4 Demonstrates the usage of command line arguments 5 by printing the arguments back to the command 6 line. 7 8 */ 9 10 # include < stdio.h> 11 # include < string.h> 12 13 int main ( int argc, char * argv []) 14 { 15 printf (" You entered %d arguments.\n",argc -1); 16 printf (" Program Name : %s\n",argv [0]); 17 int i; 18 for (i =1; i< argc ; i ++) 19 printf ("\ targv [%d] = %s\n",i, argv [i ]); 20 21 return 0; 22 } 42 / 51

Character Analysis and Conversion Scanning 43 / 51 The C library ctype.h provides several useful functions on characters isalpha(char ch) is true if ch is an alphabetic character (upper or lower case) isdigit(char ch) is true if ch is a character representing a digit islower(char ch) is true if ch is a lower-case character isupper(char ch) (guess) toupper and tolower convert alphabetic characters (no effect otherwise) ispunct(char ch) isspace(char ch) true if ch is any whitespace character stdio.h has getchar(void) and getc(file *inp), which read in one character at a time (use to build scanline in Fig 9.15) String-to-Number and Number-to-String Conversions I Scanning stdlib.h provides several functions for converting between strings and numbers String to numbers: int atoi(const char *nptr); double atof(const char *nptr); Returns the value of the number represented in the string nptr a (alpha-numeric) to integer, floating point Does not handle errors well: returns zero if it fails (see strtol for advanced behavior) 44 / 51 String-to-Number and Number-to-String Conversions II Scanning 1 # include <stdlib.h> 2 # include <stdio.h> 3 4 int main ( int argc, char * argv []) 5 { 6 if( argc!= 3) 7 { 8 printf (" Usage : %s integer double \n", argv [0]); 9 exit ( -1); 10 } 11 int a = atoi ( argv [1]); 12 double b = atof ( argv [2]); 13 printf (" You gave a = %d, b = %f ",a,b); 14 printf ("as command line args \n"); 15 return 0; 16 } 45 / 51

String-to-Number and Number-to-String Conversions I Scanning sprintf takes numbers, doubles, characters, and strings and concatenates them into one large string. sprintf(string_1, "%d integer %c - %s", int_val, char_val, string_2); If int_val = 42, char_val = a, and string_2 = "Stewie" then string_1 would be "42 integer a - Stewie" sscanf takes a string and parses it into integer, doubles, characters, and strings 46 / 51 String-to-Number and Number-to-String Conversions II Scanning 1 int num ; 2 double pi; 3 char a [50], b [50]; 4 sscanf ("42 3.141592 Stewie Griffin ", "%d %lf %s %s", &num, 5 &pi, 6 a, 7 b); 8 printf (" num = %d\n", num ); 9 printf ("pi = %f\n", pi ); 10 printf ("a = %s\n", a); 11 printf ("b = %s\n", b); 47 / 51 String-to-Number and Number-to-String Conversions III Scanning Result: 1 num = 42 2 pi = 3.141592 3 a = Stewie 4 b = Griffin 48 / 51

Common Programming Errors I Scanning We usually use functions to compute some value and use the return to send that value back to the main function. However, functions are not allowed to return strings, so we must use what we learned about input/output parameters Know when to use & and when not to Use them for simple data types: int, char, and double Do not use them for whole arrays (strings) 49 / 51 Common Programming Errors II Scanning Be careful not to overflow strings Always follow expected formats Read the documentation! Most important: make sure all strings are null-terminated (a \0 at the end) Just because your program seems to work, doesn t mean it always does (ex: add & to a, b in the sscanf snippet above) 50 / 51 Exercises I Scanning 1 Write a program that takes command line arguments and prints them out one by one. Then sort them in lexicographic order and print them out again. 2 A palindrome is a string that is the same backwards and forwards (example: tenet, level). Write a program that reads a string from the command line and determines if it is a palindrome or not. In the case that it is not, make the string a palindrome by concatenating a reversed copy to the end. 51 / 51