Data Structures Lecture 8

Similar documents
Elementary maths for GMT. Algorithm analysis Part I

LECTURE 9 Data Structures: A systematic way of organizing and accessing data. --No single data structure works well for ALL purposes.

Analysis of Algorithms

Outline and Reading. Analysis of Algorithms 1

Algorithms and Data Structures

9/10/2018 Algorithms & Data Structures Analysis of Algorithms. Siyuan Jiang, Sept

Data Structures and Algorithms

ANALYSIS OF ALGORITHMS

Choice of C++ as Language

Algorithm Analysis. Applied Algorithmics COMP526. Algorithm Analysis. Algorithm Analysis via experiments

Algorithm Analysis. October 12, CMPE 250 Algorithm Analysis October 12, / 66

Chapter 6 INTRODUCTION TO DATA STRUCTURES AND ALGORITHMS

Data Structures and Algorithms

Running Time. Analysis of Algorithms. Experimental Studies. Limitations of Experiments

Analysis of Algorithms

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies. Limitations of Experiments

(Refer Slide Time: 1:27)

[ 11.2, 11.3, 11.4] Analysis of Algorithms. Complexity of Algorithms. 400 lecture note # Overview

Scientific Computing. Algorithm Analysis

Plotting run-time graphically. Plotting run-time graphically. CS241 Algorithmics - week 1 review. Prefix Averages - Algorithm #1

PROGRAM EFFICIENCY & COMPLEXITY ANALYSIS

Data Structures and Algorithms. Analysis of Algorithms

asymptotic growth rate or order compare two functions, but ignore constant factors, small inputs

Introduction to Data Structure

Outline and Reading. Analysis of Algorithms. Running Time. Experimental Studies. Limitations of Experiments. Theoretical Analysis

Algorithm Analysis. (Algorithm Analysis ) Data Structures and Programming Spring / 48

Assignment 1 (concept): Solutions

CH ALGORITHM ANALYSIS CH6. STACKS, QUEUES, AND DEQUES

Partha Sarathi Mandal

Algorithm. Lecture3: Algorithm Analysis. Empirical Analysis. Algorithm Performance

CS240 Fall Mike Lam, Professor. Algorithm Analysis

Chapter 2: Complexity Analysis

Analysis of Algorithm. Chapter 2

CSCA48 Winter 2018 Week 10:Algorithm Analysis. Marzieh Ahmadzadeh, Nick Cheng University of Toronto Scarborough

CS:3330 (22c:31) Algorithms

L6,7: Time & Space Complexity

CS240 Fall Mike Lam, Professor. Algorithm Analysis

CS302 Topic: Algorithm Analysis. Thursday, Sept. 22, 2005

CSE 373 APRIL 3 RD ALGORITHM ANALYSIS

Algorithm. Algorithm Analysis. Algorithm. Algorithm. Analyzing Sorting Algorithms (Insertion Sort) Analyzing Algorithms 8/31/2017

Asymptotic Analysis of Algorithms

Why study algorithms? CS 561, Lecture 1. Today s Outline. Why study algorithms? (II)

Algorithms and Data Structures

Complexity Analysis of an Algorithm

UNIT 1 ANALYSIS OF ALGORITHMS

Algorithm Analysis. College of Computing & Information Technology King Abdulaziz University. CPCS-204 Data Structures I

Lecture 5: Running Time Evaluation

Theory and Algorithms Introduction: insertion sort, merge sort

CS171:Introduction to Computer Science II. Algorithm Analysis. Li Xiong

Data Structures and Algorithms

How much space does this routine use in the worst case for a given n? public static void use_space(int n) { int b; int [] A;

Introduction to Computers & Programming

CS/ENGRD 2110 Object-Oriented Programming and Data Structures Spring 2012 Thorsten Joachims. Lecture 10: Asymptotic Complexity and

University of Toronto Department of Electrical and Computer Engineering. Midterm Examination. ECE 345 Algorithms and Data Structures Fall 2010

Data Structures and Algorithms. Part 2

Data structure and algorithm in Python

ASYMPTOTIC COMPLEXITY

RUNNING TIME ANALYSIS. Problem Solving with Computers-II

Data Structures and Algorithms Chapter 2

CSE 146. Asymptotic Analysis Interview Question of the Day Homework 1 & Project 1 Work Session

CS 310: Order Notation (aka Big-O and friends)

Module 1: Asymptotic Time Complexity and Intro to Abstract Data Types

INTRODUCTION. Analysis: Determination of time and space requirements of the algorithm

Introduction to Computer Science

Unit 1 Chapter 4 ITERATIVE ALGORITHM DESIGN ISSUES

Another Sorting Algorithm

DATA STRUCTURES AND ALGORITHMS. Asymptotic analysis of the algorithms

10/5/2016. Comparing Algorithms. Analyzing Code ( worst case ) Example. Analyzing Code. Binary Search. Linear Search

0.1 Welcome. 0.2 Insertion sort. Jessica Su (some portions copied from CLRS)

ASYMPTOTIC COMPLEXITY

CSE 332 Winter 2015: Midterm Exam (closed book, closed notes, no calculators)

Introduction to Algorithms 6.046J/18.401J

Introduction to the Analysis of Algorithms. Algorithm

University of Toronto Department of Electrical and Computer Engineering. Midterm Examination. ECE 345 Algorithms and Data Structures Fall 2012

CSE373: Data Structures and Algorithms Lecture 4: Asymptotic Analysis. Aaron Bauer Winter 2014

Introduction to Computer Science

Today s Outline. CSE 326: Data Structures Asymptotic Analysis. Analyzing Algorithms. Analyzing Algorithms: Why Bother? Hannah Takes a Break

DESIGN AND ANALYSIS OF ALGORITHMS. Unit 1 Chapter 4 ITERATIVE ALGORITHM DESIGN ISSUES

Data Structure Lecture#5: Algorithm Analysis (Chapter 3) U Kang Seoul National University

Introduction to Algorithms 6.046J/18.401J/SMA5503

Data Structure and Algorithm Homework #1 Due: 1:20pm, Tuesday, March 21, 2017 TA === Homework submission instructions ===

csci 210: Data Structures Program Analysis

Complexity of Algorithms

Programming II (CS300)

Analysis of Algorithms & Big-O. CS16: Introduction to Algorithms & Data Structures Spring 2018

Agenda. The worst algorithm in the history of humanity. Asymptotic notations: Big-O, Big-Omega, Theta. An iterative solution

O(n): printing a list of n items to the screen, looking at each item once.

Algorithm Analysis. Part I. Tyler Moore. Lecture 3. CSE 3353, SMU, Dallas, TX

MergeSort, Recurrences, Asymptotic Analysis Scribe: Michael P. Kim Date: September 28, 2016 Edited by Ofir Geri

How do we compare algorithms meaningfully? (i.e.) the same algorithm will 1) run at different speeds 2) require different amounts of space

Algorithms in Systems Engineering IE172. Midterm Review. Dr. Ted Ralphs

MergeSort, Recurrences, Asymptotic Analysis Scribe: Michael P. Kim Date: April 1, 2015

Complexity of Algorithms. Andreas Klappenecker

Classic Data Structures Introduction UNIT I

Big-O-ology 1 CIS 675: Algorithms January 14, 2019

CMPSCI 187: Programming With Data Structures. Lecture 5: Analysis of Algorithms Overview 16 September 2011

Algorithm Analysis. Spring Semester 2007 Programming and Data Structure 1

Data Structures Question Bank Multiple Choice

COT 5407: Introduction to Algorithms. Giri Narasimhan. ECS 254A; Phone: x3748

CS 6402 DESIGN AND ANALYSIS OF ALGORITHMS QUESTION BANK

Transcription:

Fall 2017 Fang Yu Software Security Lab. Dept. Management Information Systems, National Chengchi University Data Structures Lecture 8

Recap What should you have learned? Basic java programming skills Object-oriented programming Classes and objects Inheritance, exception handling, generics Java class library Basic data structures and their applications Linear data structure: linked list, array, stack, queue Hierarchical data structures: tree and heap

Midterm on Dec. 7 Lec 1-9, TextBook Ch1-8, 11,12 大勇樓 106, 9:00-12:00am, Dec 7. How to prepare your midterm: Understand ALL the materials mentioned in the slides Discuss with me, your TAs, or classmates Read the text book to help you understand the materials You are allowed to bring an A4 size note Prepare your own note; write whatever you think that may help you get better scores in the midterm

Wrap up What are you going to learn in the rest of this semester? Algorithms Analysis of algorithms Brute force, divide and conquer, dynamic programming Sorting Advanced data structures Hash table Map and dictionary Graph

Input Algorithm Output Analysis of Algorithms How good is your program?

Running Time Most algorithms transform input objects into output objects. The running time of an algorithm typically grows with the input size. 120 100 best case average case worst case Average case time is often difficult to determine. We focus on the worst case running time. Easier to analyze 20 Crucial to applications such as games, finance and robotics 0 Running Time 80 60 40 1000 2000 3000 4000 Input Size

Experimental Studies Write a program implementing the algorithm Run the program with inputs of varying size and composition Use a method like System.currentTimeMillis() to get an accurate measure of the actual running time Plot the results Time (ms) 9000 8000 7000 6000 5000 4000 3000 2000 1000 0 0 50 100 Input Size

Limitations of Experiments It is necessary to implement the algorithm, which may be difficult Results may not be indicative of the running time on other inputs not included in the experiment. In order to compare two algorithms, the same hardware and software environments must be used

Theoretical Analysis Uses a high-level description of the algorithm instead of an implementation Characterizes running time as a function of the input size, n. Takes into account all possible inputs Allows us to evaluate the speed of an algorithm independent of the hardware/software environment

Pseudo code High-level description of an algorithm More structured than English prose Less detailed than a program Preferred notation for describing algorithms Hides program design issues Example: find the max element of an array Algorithm arraymax(a, n) Input array A of n integers Output the maximum element of A currentmax A[0] for i 1 to n - 1 do if A[i] > currentmax then currentmax A[i] return currentmax

Pseudo code Example: find the max element of an array Find the min element of an array Algorithm arraymax(a, n) Input array A of n integers Output the maximum element of A currentmax A[0] for i 1 to n - 1 do if A[i] > currentmax then currentmax A[i] return currentmax Algorithm arraymin(a, n) Input array A of n integers Output the minimum element of A currentmin A[0] for i 1 to n - 1 do if A[i] < currentmin then currentmin A[i] return currentmin

Pseudo code Find the min element of an array Sum all the elements of an array Algorithm arraymin(a, n) Input array A of n integers Output the minimum element of A currentmin A[0] for i 1 to n - 1 do if A[i] < currentmin then currentmin A[i] return currentmin Algorithm arraysum(a, n) Input array A of n integers Output sum of all the elements of A currentsum 0 for i 0 to n - 1 do currentsum currentsum+a[i] return currentsum

Pseudo code Sum all the elements of an array Multiply all the elements of an array Algorithm arraysum(a, n) Input array A of n integers Output sum of all the elements of A currentsum 0 for i 0 to n - 1 do currentsum currentsum+a[i] return currentsum Algorithm arraymultiply(a, n) Input array A of n integers Output Multiply all the elements of A current 1 for i 0 to n - 1 do current current*a[i] return current

Pseudo code Details Control flow if then [else ] while do repeat until for do Indentation replaces braces Method declaration Algorithm method (arg [, arg ]) Input Output

Pseudo code Details Method call var.method (arg [, arg ]) Return value return expression Expressions Assignment (like = in Java) = Equality testing (like == in Java) n 2 Superscripts and other mathematical formatting allowed

The Random Access Machine (RAM) Model A CPU An potentially unbounded bank of memory cells, each of which can hold an arbitrary number or character Memory cells are numbered and accessing any cell in memory takes unit time. 0 1 2

Seven Important Functions Seven functions that often appear in algorithm analysis: Constant 1 Logarithmic log n Linear n N-Log-N n log n Quadratic n 2 Cubic n 3 Exponential 2 n

Functions Graphed Using Normal Scale g(n) = n lg n g(n) = 1 g(n) = lg n g(n) = n 2 g(n) = 2 n g(n) = n g(n) = n 3

Primitive Operations Basic computations performed by an algorithm Identifiable in pseudocode Largely independent from the programming language Exact definition not important (we will see why later) Assumed to take a constant amount of time in the RAM model Examples: Evaluating an expression Assigning a value to a variable Indexing into an array Calling a method Returning from a method

Counting Primitive Operations By inspecting the pseudocode, we can determine the maximum number of primitive operations executed by an algorithm, as a function of the input size Algorithm arraymax(a, n) # operations currentmax A[0] 2 for i 1 to n - 1 do 2n if A[i] > currentmax then 2(n - 1) currentmax A[i] 2(n - 1) { increment counter i } 2(n - 1) return currentmax 1 Total 8n - 2

Counting Primitive Operations By inspecting the pseudocode, we can determine the maximum number of primitive operations executed by an algorithm, as a function of the input size Algorithm arraymultiply(a, n) #operations current 1 1 for i 0 to n - 1 do 2(n+1) current current*a[i] 3n { increment counter i } 2n return current 1 Total 7n+4 =>O(n)

Counting Primitive Operations By inspecting the pseudocode, we can determine the maximum number of primitive operations executed by an algorithm, as a function of the input size Algorithm arrayaverage(a, n) #operations current 0 1 for i 0 to n - 1 do 2(n+1) current current+a[i] 3n { increment counter i } 2n return current/n 2 Total 7n+5 =>O(n)

Estimating Running Time Algorithm arraymax executes 8n - 2 primitive operations in the worst case. Define: a = Time taken by the fastest primitive operation b = Time taken by the slowest primitive operation Let T(n) be worst-case time of arraymax. Then a (8n - 2) T(n) b(8n - 2) Hence, the running time T(n) is bounded by two linear functions

Growth Rate of Running Time Changing the hardware/ software environment Affects T(n) by a constant factor, but Does not alter the growth rate of T(n) The linear growth rate of the running time T(n) is an intrinsic property of algorithm arraymax

Why Growth Rate Matters if runtime is..." time for n + 1" time for 2 n" time for 4 n" c lg n" c lg (n + 1)" c (lg n + 1)" c(lg n + 2)" c n" c (n + 1)" 2c n" 4c n" c n lg n" ~ c n lg n" + c n" 2c n lg n + 2cn" 4c n lg n + 4cn" c n 2 " ~ c n 2 + 2c n" 4c n 2! 16c n 2" runtime quadruples when problem size doubles c n 3" ~ c n 3 + 3c n 2 " 8c n 3 " 64c n 3" c 2 n " c 2 n+1 " c 2 2n " c 2 4n "

Comparison of Two Algorithms insertion sort is n 2 / 4 merge sort is 2 n lg n sort a million items? insertion sort takes roughly 70 hours while merge sort takes roughly 40 seconds This is a slow machine, but if 100 x as fast then it s 40 minutes versus less than 0.5 seconds

Constant Factors The growth rate is not affected by constant factors or lower-order terms Examples 10 2 n + 10 5 is a linear function 10 5 n 2 + 10 8 n is a quadratic function T(n) 1E+25 1E+23 1E+21 1E+19 1E+17 1E+15 1E+13 1E+11 1E+9 1E+7 1E+5 1E+3 1E+1 Quadratic Quadratic Linear Linear 1E-1 1E-1 1E+1 1E+3 1E+5 1E+7 1E+9 n

Big-Oh Notation Given functions f(n) and g(n), we say that f(n) is O(g(n)) if there are positive constants c and n 0 such that 10,000 1,000 3n 2n+10 n f(n) cg(n) for n n 0 100 Example: 2n + 10 is O(n) 2n + 10 cn (c - 2) n 10 10 n 10/(c - 2) Pick c = 3 and n 0 = 10 1 1 10 100 1,000 n

Big-Oh Example Example: the function n 2 is not O(n) n 2 cn n c The above inequality cannot be satisfied since c must be a constant 1,000,000 100,000 10,000 1,000 100 n^2 100n 10n n 10 1 1 10 100 1,000 n

More Big-Oh Example 7n-2 is O(n) need c > 0 and n 0 1 such that 7n-2 c n for n n 0 this is true for c = 7 and n 0 = 1 3n 3 + 20n 2 + 5 is O(n 3 ) need c > 0 and n 0 1 such that 3n 3 + 20n 2 + 5 c n 3 for n n 0 this is true for c = 4 and n 0 = 21 3 log n + 5 is O(log n) need c > 0 and n 0 1 such that 3 log n + 5 c log n for n n 0 this is true for c = 8 and n 0 = 2

Big-Oh and Growth Rate The big-oh notation gives an upper bound on the growth rate of a function The statement f(n) is O(g(n)) means that the growth rate of f(n) is no more than the growth rate of g(n) We can use the big-oh notation to rank functions according to their growth rate f(n) is O(g(n)) g(n) is O(f(n)) g(n) grows more Yes No f(n) grows more No Yes Same growth Yes Yes

Big-Oh Rules If f(n) is a polynomial of degree d, then f(n) is O(n d ), i.e., 1. Drop lower-order terms 2. Drop constant factors Use the smallest possible class of functions Say 2n is O(n) instead of 2n is O(n 2 ) Use the simplest expression of the class Say 3n + 5 is O(n) instead of 3n + 5 is O(3n)

Asymptotic Algorithm Analysis The asymptotic analysis of an algorithm determines the running time in big-oh notation To perform the asymptotic analysis We find the worst-case number of primitive operations executed as a function of the input size We express this function with big-oh notation

Asymptotic Algorithm Analysis Example: We determine that algorithm arraymax executes at most 8n - 2 primitive operations We say that algorithm arraymax runs in O(n) time Since constant factors and lower-order terms are eventually dropped anyhow, we can disregard them when counting primitive operations

Computing Prefix Averages We further illustrate asymptotic analysis with two algorithms for prefix averages The i-th prefix average of an array X is average of the first (i + 1) elements of X: A[i] = (X[0] + X[1] + + X[i])/(i+1) Computing the array A of prefix averages of another array X has applications to financial analysis 35 30 25 20 15 10 5 0 X A 1 2 3 4 5 6 7

Exercise Implement prefixaverage Input: Get n integers from a txt file The first integer indicates the number of integers (the size of X) Input: 4 1 2 3 5 Output: 1 2 2 Output: Print out a sequence of integers The ith integer indicates the average of the first i+1 input numbers (starting from the second input)

Prefix Average (Quadratic) The following algorithm computes prefix averages in quadratic time by applying the definition Algorithm prefixaverages1(x, n) Input array X of n integers Output array A of prefix averages of X #operations A new array of n integers n for i 0 to n - 1 do n s X[0] n for j 1 to i do 1 + 2 + + (n - 1) s s + X[j] 1 + 2 + + (n - 1) A[i] s / (i + 1) n return A 1

Arithmetic Progression The running time of prefixaverages1 is O(1 + 2 + + n) The sum of the first n integers is n(n + 1) / 2 There is a simple visual proof of this fact 2 Thus, algorithm prefixaverages1 runs in O(n 2 ) 1 time 0 7 6 5 4 3 1 2 3 4 5 6

Prefix Average (Linear) The following algorithm computes prefix averages in linear time by keeping a running sum Algorithm prefixaverages2 runs in O(n) time Algorithm prefixaverages2(x, n) Input array X of n integers Output array A of prefix averages of X #operations A new array of n integers n s 0 1 for i 0 to n - 1 do n s s + X[i] n A[i] s / (i + 1) n return A 1

Relatives of Big-Oh big-omega f(n) is Ω(g(n)) if there is a constant c > 0 and an integer constant n 0 1 such that f(n) c g(n) for n n 0 big-theta f(n) is Θ(g(n)) if there are constants c > 0 and c > 0 and an integer constant n 0 1 such that c g(n) f(n) c g(n) for n n 0

Intuition for Asymptotic Notation Big-Oh f(n) is O(g(n)) if f(n) is asymptotically less than or equal to g(n) big-omega f(n) is Ω(g(n)) if f(n) is asymptotically greater than or equal to g(n) big-theta f(n) is Θ(g(n)) if f(n) is asymptotically equal to g(n)

Examples of Using Relatives of Big-Oh 5n 2 is Ω(n 2 ) f(n) is Ω(g(n)) if there is a constant c > 0 and an integer constant n 0 1 such that f(n) c g(n) for n n 0 let c = 5 and n 0 = 1 5n 2 is Ω(n) f(n) is Ω(g(n)) if there is a constant c > 0 and an integer constant n 0 1 such that f(n) c g(n) for n n 0 let c = 1 and n 0 = 1 5n 2 is Θ(n 2 ) f(n) is Θ(g(n)) if it is Ω(n 2 ) and O(n 2 ). We have already seen the former, for the latter recall that f(n) is O(g(n)) if there is a constant c > 0 and an integer constant n 0 1 such that f(n) < c g(n) for n n 0 Let c = 5 and n 0 = 1

Coming Up Big O: Read Text Book 4 Divide and Conquer/Sorting: Read Text Book 11