CSE 332: Data Structures & Parallelism Lecture 12: Comparison Sorting. Ruth Anderson Winter 2019


Today Sorting Comparison sorting 2/08/2019 2

Introduction to sorting Stacks, queues, priority queues, and dictionaries all focused on providing one element at a time, but often we know we want all the data items in some order. Anyone can sort, but a computer can sort faster. It is very common to need data sorted somehow: an alphabetical list of people, a population list of countries, search engine results by relevance. Different algorithms have different asymptotic and constant-factor trade-offs. No single best sort works for all scenarios; knowing one way to sort just isn't enough. 2/08/2019 3

More reasons to sort General technique in computing: Preprocess (e.g., sort) data to make subsequent operations faster. Example: Sort the data so that you can find the k-th largest in constant time for any k, or perform binary search to find an element in logarithmic time. Whether the preprocessing is worth it depends on how often the data will change and how much data there is. 2/08/2019 4

The main problem, stated carefully For now we will assume we have n comparable elements in an array and we want to rearrange them to be in increasing order. Input: An array A of data records, a key value in each data record, and a comparison function (consistent and total): given keys a & b, what is their relative ordering? <, =, >? Ex: keys that implement Comparable or have a Comparator that can handle them. Effect: Reorganize the elements of A such that for any i and j, if i < j then A[i] ≤ A[j]. Usually unspoken assumption: A must have all the same data it started with. Could also sort in reverse order, of course. An algorithm doing this is a comparison sort. 2/08/2019 5

Variations on the basic problem 1. Maybe the elements are in a linked list (could convert to an array and back in linear time, but some algorithms needn't do so). 2. Maybe in the case of ties we should preserve the original ordering. Sorts that do this naturally are called stable sorts. One use is sorting twice on two keys, e.g., sort movies by year and, for ties, alphabetically. 3. Maybe we must not use more than O(1) auxiliary space. Sorts meeting this requirement are called in-place sorts. Not allowed to allocate an extra array (at least not of size O(n)), but can allocate an O(1) number of variables; all work is done by swapping around in the array. 4. Maybe we can do more with elements than just compare them. Comparison sorts assume we work using a binary compare operator; in special cases we can sometimes get faster algorithms. 5. Maybe we have too much data to fit in memory: use an external sorting algorithm. 2/08/2019 6
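
To make the stable-sort trick concrete, here is a small sketch (my own illustration, not from the slides) that sorts on two keys by sorting twice. Movie and sortByYearThenTitle are hypothetical names, and a Java 16+ record is used for brevity; java.util.Arrays.sort on objects is documented to be stable, so ties in year keep their alphabetical order.

import java.util.Arrays;
import java.util.Comparator;

record Movie(String title, int year) {}   // hypothetical record type, for illustration only

static void sortByYearThenTitle(Movie[] movies) {
    // First sort by the secondary key (title)...
    Arrays.sort(movies, Comparator.comparing(Movie::title));
    // ...then stably sort by the primary key (year); movies with the
    // same year remain in alphabetical order.
    Arrays.sort(movies, Comparator.comparingInt(Movie::year));
}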

Sorting: The Big Picture Simple algorithms, O(n²): insertion sort, selection sort, shell sort. Fancier algorithms, O(n log n): heap sort, merge sort, quick sort (avg). Comparison lower bound: Ω(n log n). Specialized algorithms, O(n): bucket sort, radix sort. Handling huge data sets: external sorting. 2/08/2019 7

Insertion Sort Idea: At step k, put the k-th element in the correct position among the first k elements. Alternate way of saying this: Sort the first two elements. Now insert the 3rd element in order. Now insert the 4th element in order. ... Loop invariant: when the loop index is i, the first i elements are sorted. Time? Best-case? Worst-case? Average case? 2/08/2019 8

Insertion Sort Idea: At step k, put the k-th element in the correct position among the first k elements. Alternate way of saying this: Sort the first two elements. Now insert the 3rd element in order. Now insert the 4th element in order. ... Loop invariant: when the loop index is i, the first i elements are sorted. Time? Best-case: O(n) (start sorted). Worst-case: O(n²) (start reverse sorted). Average case: O(n²) (see text). 2/08/2019 9
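
For concreteness, a minimal insertion sort sketch on an int array (the name and code are my own, not the course's reference implementation). When the outer index is k, arr[0..k) is already sorted, matching the loop invariant above; on sorted input the inner loop never runs, which is where the O(n) best case comes from.

static void insertionSort(int[] arr) {
    for (int k = 1; k < arr.length; k++) {
        int val = arr[k];          // element to insert into the sorted prefix arr[0..k)
        int i = k - 1;
        while (i >= 0 && arr[i] > val) {
            arr[i + 1] = arr[i];   // shift larger elements one slot to the right
            i--;
        }
        arr[i + 1] = val;          // drop the element into the hole
    }
}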

Selection sort Idea: At step k, find the smallest element among the not-yet-sorted elements and put it at position k. Alternate way of saying this: Find the smallest element, put it 1st. Find the next smallest element, put it 2nd. Find the next smallest element, put it 3rd. ... Loop invariant: when the loop index is i, the first i elements are the i smallest elements in sorted order. Time? Best-case? Worst-case? Average case? 2/08/2019 10

Selection sort Idea: At step k, find the smallest element among the not-yet-sorted elements and put it at position k. Alternate way of saying this: Find the smallest element, put it 1st. Find the next smallest element, put it 2nd. Find the next smallest element, put it 3rd. ... Loop invariant: when the loop index is i, the first i elements are the i smallest elements in sorted order. Time? Best-case: O(n²). Worst-case: O(n²). Average case: O(n²). Always: T(1) = 1 and T(n) = n + T(n-1). 2/08/2019 11
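
A matching selection sort sketch (again an illustration, not the slides' code). The inner scan always runs to the end of the array, which is why the best, worst, and average cases are all O(n²).

static void selectionSort(int[] arr) {
    for (int k = 0; k < arr.length - 1; k++) {
        int minIndex = k;
        for (int i = k + 1; i < arr.length; i++) {
            if (arr[i] < arr[minIndex]) minIndex = i;        // smallest not-yet-sorted element
        }
        int tmp = arr[k]; arr[k] = arr[minIndex]; arr[minIndex] = tmp;   // put it at position k
    }
}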

Insertion Sort vs. Selection Sort Different algorithms that solve the same problem and have the same worst-case and average-case asymptotic complexity. Insertion sort has better best-case complexity and is preferable when the input is mostly sorted. Other algorithms are more efficient for non-small arrays that are not already almost sorted; insertion sort may do well on small arrays. 2/08/2019 12

Aside: We won't cover Bubble Sort It doesn't have good asymptotic complexity: O(n²). It's not particularly efficient with respect to constant factors. Basically, almost everything it is good at, some other algorithm is at least as good at. Some people seem to teach it just because someone taught it to them. For fun see: Bubble Sort: An Archaeological Algorithmic Analysis, Owen Astrachan, SIGCSE 2003, http://www.cs.duke.edu/~ola/bubble/bubble.pdf 2/08/2019 13

Sorting: The Big Picture Simple algorithms, O(n²): insertion sort, selection sort, shell sort. Fancier algorithms, O(n log n): heap sort, merge sort, quick sort (avg). Comparison lower bound: Ω(n log n). Specialized algorithms, O(n): bucket sort, radix sort. Handling huge data sets: external sorting. 2/08/2019 14

Heap sort Sorting with a heap is easy: insert each arr[i] (better yet, use buildheap), then: for (i = 0; i < arr.length; i++) arr[i] = deletemin(); Worst-case running time? We have the array-to-sort and the heap, so this is not an in-place sort. There's a trick to make it in-place. 2/08/2019 15

Heap sort Sorting with a heap is easy: insert each arr[i] (better yet, use buildheap), then: for (i = 0; i < arr.length; i++) arr[i] = deletemin(); Worst-case running time: O(n log n). Why? We have the array-to-sort and the heap, so this is not an in-place sort. There's a trick to make it in-place. 2/08/2019 16
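
A sketch of this easy (not in-place) version using java.util.PriorityQueue as the heap. Note that it builds the heap by repeated insertion (O(n log n)) rather than with buildheap (O(n)); either way the n deletemins dominate, giving O(n log n) overall.

import java.util.PriorityQueue;

static void heapSort(int[] arr) {
    PriorityQueue<Integer> heap = new PriorityQueue<>();
    for (int x : arr) {
        heap.add(x);                 // insert each element (buildheap would be faster)
    }
    for (int i = 0; i < arr.length; i++) {
        arr[i] = heap.remove();      // repeated deleteMin fills arr in increasing order
    }
}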

In-place heap sort Treat the initial array as a heap (via buildheap). When you delete the i-th element, put it at arr[n-i]; it's not part of the heap anymore! 4 7 5 9 8 6 10 | 3 2 1 (heap part | sorted part). After arr[n-i] = deletemin(): 5 7 6 9 8 10 | 4 3 2 1 (heap part | sorted part). But this reverse sorts; how would you fix that? 2/08/2019 17

AVL sort How? 2/08/2019 18

AVL sort We can also use a balanced tree to sort: insert each element (total time O(n log n)), then do an in-order traversal (O(n)). But this cannot be made in-place, and it has worse constant factors than heap sort. Both are O(n log n) in the worst, best, and average case; neither parallelizes well; heap sort is better. Don't even think about trying to sort with a hash table! 2/08/2019 19
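
For illustration, a sketch of the balanced-tree idea using java.util.TreeMap (a red-black tree rather than an AVL tree, but the same O(log n) insert applies); duplicates are handled by counting. As noted above, this is not in-place and heap sort is the better choice.

import java.util.Map;
import java.util.TreeMap;

static void treeSort(int[] arr) {
    TreeMap<Integer, Integer> counts = new TreeMap<>();
    for (int x : arr) {
        counts.merge(x, 1, Integer::sum);                        // n inserts, O(log n) each
    }
    int i = 0;
    for (Map.Entry<Integer, Integer> e : counts.entrySet()) {    // in-order traversal, O(n)
        for (int c = 0; c < e.getValue(); c++) {
            arr[i++] = e.getKey();
        }
    }
}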

Divide and conquer Very important technique in algorithm design 1. Divide problem into smaller parts 2. Solve the parts independently Think recursion Or potential parallelism 3. Combine solution of parts to produce overall solution Ex: Sort each half of the array, combine together; to sort each half, split into halves 2/08/2019 20

Divide-and-conquer sorting Two great sorting methods are fundamentally divide-and-conquer 1. Mergesort: Sort the left half of the elements (recursively) Sort the right half of the elements (recursively) Merge the two sorted halves into a sorted whole 2. Quicksort: Pick a pivot element Divide elements into those less-than pivot and those greater-than pivot Sort the two divisions (recursively on each) Answer is [sorted-less-than then pivot then sorted-greater-than] 2/08/2019 21

Mergesort Array a (indices 0 to 7): 8 2 9 4 5 3 1 6, with lo and hi marking the range. To sort the array from position lo to position hi: If the range is 1 element long, it's sorted! (Base case) Else, split into two halves: sort from lo to (hi+lo)/2, sort from (hi+lo)/2 to hi, and merge the two halves together. Merging takes two sorted parts and sorts everything: O(n), but it requires auxiliary space. 2/08/2019 22

Example, focus on merging Start with: a = 8 2 9 4 5 3 1 6. After we return from the left and right recursive calls (pretend it works for now): 2 4 8 9 | 1 3 5 6. Merge: use 3 fingers and 1 more (auxiliary) array. (After the merge, copy back to the original array.) 2/08/2019 23

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 (After merge, copy back to original array) 2/08/2019 24

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 (After merge, copy back to original array) 2/08/2019 25

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 (After merge, copy back to original array) 2/08/2019 26

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 4 (After merge, copy back to original array) 2/08/2019 27

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 4 5 (After merge, copy back to original array) 2/08/2019 28

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 4 5 6 (After merge, copy back to original array) 2/08/2019 29

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 4 5 6 8 (After merge, copy back to original array) 2/08/2019 30

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array 1 2 3 4 5 6 8 9 (After merge, copy back to original array) 2/08/2019 31

Example, focus on merging Start with: 8 2 9 4 5 3 1 6 After recursion: (not magic ) 2 4 8 9 1 3 5 6 Merge: Use 3 fingers and 1 more array (After merge, copy back to original array) 1 2 3 4 5 6 8 9 1 2 3 4 5 6 8 9 2/08/2019 32

Mergesort example: Recursively splitting the list in half Divide: 8 2 9 4 5 3 1 6. Divide: 8 2 9 4 | 5 3 1 6. Divide: 8 2 | 9 4 | 5 3 | 1 6. 1 element: 8 | 2 | 9 | 4 | 5 | 3 | 1 | 6. 2/08/2019 33

Mergesort example: Merge as we return from recursive calls Divide: 8 2 9 4 5 3 1 6. Divide: 8 2 9 4 | 5 3 1 6. Divide: 8 2 | 9 4 | 5 3 | 1 6. 1 element: 8 | 2 | 9 | 4 | 5 | 3 | 1 | 6. Merge: 2 8 | 4 9 | 3 5 | 1 6. Merge: 2 4 8 9 | 1 3 5 6. Merge: 1 2 3 4 5 6 8 9. When a recursive call ends, its sub-arrays are each in order; we just need to merge them together in order. 2/08/2019 34

Mergesort example: Merge as we return from recursive calls Divide: 8 2 9 4 5 3 1 6. Divide: 8 2 9 4 | 5 3 1 6. Divide: 8 2 | 9 4 | 5 3 | 1 6. 1 element: 8 | 2 | 9 | 4 | 5 | 3 | 1 | 6. Merge: 2 8 | 4 9 | 3 5 | 1 6. Merge: 2 4 8 9 | 1 3 5 6. Merge: 1 2 3 4 5 6 8 9. We need another array in which to do each merging step; merge the results into there, then copy back to the original array. 2/08/2019 35

Mergesort, some details: saving a little time What if the final steps of our merging looked like the following: main array 2 4 5 6 1 3 8 9, auxiliary array 1 2 3 4 5 6 (so far). Seems kind of wasteful to copy 8 & 9 to the auxiliary array just to copy them immediately back. 2/08/2019 36

Mergesort, some details: saving a little time It is unnecessary to copy the dregs over to the auxiliary array. If the left side finishes first, just stop the merge and copy the auxiliary array back. If the right side finishes first, copy the dregs directly into the right side, then copy the auxiliary array back. 2/08/2019 37

Some details: saving space / copying Simplest / worst approach: Use a new auxiliary array of size (hi-lo) for every merge (returning from a recursive call? allocate a new array!). Better: Reuse the same auxiliary array of size n for every merging stage; allocate it at the beginning and use it throughout. Best (but a little tricky): Don't copy back; at the 2nd, 4th, 6th, ... merging stages, use the original array as the auxiliary array and vice versa. Need one copy at the end if the number of stages is odd. 2/08/2019 38
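
A sketch of mergesort using the "Better" option above: one auxiliary array of size n, allocated once and reused for every merge. Method and parameter names are mine; it sorts arr[lo..hi) and assumes aux has length at least arr.length.

static void mergesort(int[] arr, int[] aux, int lo, int hi) {
    if (hi - lo <= 1) return;                    // base case: 0 or 1 element is already sorted
    int mid = (lo + hi) / 2;
    mergesort(arr, aux, lo, mid);                // sort left half
    mergesort(arr, aux, mid, hi);                // sort right half
    int i = lo, j = mid, k = lo;
    while (i < mid && j < hi) {                  // merge with two "fingers"
        aux[k++] = (arr[i] <= arr[j]) ? arr[i++] : arr[j++];   // <= keeps the sort stable
    }
    while (i < mid) aux[k++] = arr[i++];         // copy the left-side dregs, if any
    while (j < hi)  aux[k++] = arr[j++];         // copy the right-side dregs, if any
    for (int m = lo; m < hi; m++) arr[m] = aux[m];   // copy back to the original array
}
// Usage: mergesort(arr, new int[arr.length], 0, arr.length);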

Picture of the best from previous slide: Allocate one auxiliary array and switch roles at each step. First recurse down to lists of size 1. As we return from the recursion, alternate between the two arrays: merge by 1, merge by 2, merge by 4, merge by 8, merge by 16, ..., then copy if needed. Arguably easier to code up without recursion at all. 2/08/2019 39

Linked lists and big data We defined the sorting problem as over an array, but sometimes you want to sort linked lists. One approach: convert to an array O(n), sort O(n log n), convert back to a list O(n). Or: mergesort works very nicely on linked lists directly; heapsort and quicksort do not; insertion sort and selection sort do, but they're slower. Mergesort is also the sort of choice for external sorting: linear merges minimize disk accesses. 2/08/2019 40

Mergesort Analysis Having defined an algorithm and argued it is correct, we should analyze its running time (and space): To sort n elements, we: Return immediately if n=1 Else do 2 subproblems of size n/2 and then an O(n) merge Recurrence relation? 2/08/2019 41

Mergesort Analysis Having defined an algorithm and argued it is correct, we should analyze its running time (and space): To sort n elements, we: Return immediately if n=1. Else do 2 subproblems of size n/2 and then an O(n) merge. Recurrence relation: T(1) = c₁, T(n) = 2T(n/2) + c₂·n 2/08/2019 42

MergeSort Recurrence (For simplicity let constants be 1; no effect on the asymptotic answer)
T(1) = 1
T(n) = 2T(n/2) + n
     = 2(2T(n/4) + n/2) + n = 4T(n/4) + 2n
     = 4(2T(n/8) + n/4) + 2n = 8T(n/8) + 3n
     ... (after k expansions)
     = 2^k T(n/2^k) + kn
So the total is 2^k T(n/2^k) + kn where n/2^k = 1, i.e., k = log n.
That is, 2^(log n) T(1) + n log n = n + n log n = O(n log n). 2/08/2019 43

Or more intuitively This recurrence comes up often enough that you should just know it's O(n log n). Merge sort is relatively easy to intuit (best, worst, and average): the recursion tree has log n height, and at each level we do a total amount of merging equal to n. 2/08/2019 44

Quicksort Also uses divide-and-conquer: recursively chop into halves. But instead of doing all the work as we merge together, we'll do all the work as we recursively split into halves. Also unlike MergeSort, it does not need auxiliary space. O(n log n) on average, but O(n²) worst-case; MergeSort is always O(n log n). So why use QuickSort? It can be faster than mergesort, and is often believed to be faster. Quicksort does fewer copies and more comparisons, so it depends on the relative cost of these two operations! 2/08/2019 45

Quicksort Overview 1. Pick a pivot element, hopefully an element near the median. Good QuickSort performance depends on a good choice of pivot; we'll see why later, and talk about good pivot selection later. 2. Partition all the data into: A. the elements less than the pivot, B. the pivot, C. the elements greater than the pivot. 3. Recursively sort A and C. 4. The answer is, as simple as A, B, C. (Alas, there are some details lurking in this algorithm.) 2/08/2019 46

Quicksort: Think in terms of sets S = {13, 81, 92, 43, 65, 31, 57, 26, 75, 0}. Select a pivot value (65). Partition: S1 = {13, 0, 43, 31, 26, 57}, S2 = {92, 75, 81}. QuickSort(S1) and QuickSort(S2): {0, 13, 26, 31, 43, 57} and {75, 81, 92}. Presto! S is sorted: 0 13 26 31 43 57 65 75 81 92. [Weiss] 2/08/2019 47

Quicksort Example, showing recursion Divide: 8 2 9 4 5 3 1 6. Divide (pivot 5): 2 4 3 1 | 5 | 8 9 6. Divide: 2 1 | 3 | 4 and 6 | 8 9. 1 element: 1, 2. Conquer: 1 2. Conquer: 1 2 3 4 and 6 8 9. Conquer: 1 2 3 4 5 6 8 9. 2/08/2019 48

MergeSort Recursion Tree vs. QuickSort Recursion Tree (the same example as on the previous slide): Divide: 8 2 9 4 5 3 1 6. Divide (pivot 5): 2 4 3 1 | 5 | 8 9 6. Divide: 2 1 | 3 | 4 and 6 | 8 9. 1 element: 1, 2. Conquer: 1 2. Conquer: 1 2 3 4 and 6 8 9. Conquer: 1 2 3 4 5 6 8 9. 2/08/2019 49

Quicksort Details We have not yet explained: How to pick the pivot element Any choice is correct: data will end up sorted But as analysis will show, want the two partitions to be about equal in size How to implement partitioning In linear time In place 2/08/2019 50

Pivots Best pivot? The median: halves the problem each time. Worst pivot? The greatest/least element: reduces to a problem only one element smaller each time, giving O(n²). (Recursion-tree example as on the previous slides: 8 2 9 4 5 3 1 6 with pivot 5.) 2/08/2019 51

Quicksort: Potential pivot rules While sorting arr from lo (inclusive) to hi (exclusive): Pick arr[lo] or arr[hi-1]: fast, but the worst case is (mostly) sorted input. Pick a random element in the range: does as well as any technique, but (pseudo)random number generation can be slow (still probably the most elegant approach). Median of 3, e.g., of arr[lo], arr[hi-1], arr[(hi+lo)/2]: a common heuristic that tends to work well. 2/08/2019 52
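
One possible median-of-3 helper for the last rule (an illustrative sketch; the name is not from the slides). It returns the index of whichever of arr[lo], arr[(hi+lo)/2], and arr[hi-1] holds the median value, so the caller can swap that element into the pivot position.

static int medianOf3Index(int[] arr, int lo, int hi) {
    int mid = (lo + hi) / 2;
    int a = arr[lo], b = arr[mid], c = arr[hi - 1];
    if ((a <= b && b <= c) || (c <= b && b <= a)) return mid;    // b is the median
    if ((b <= a && a <= c) || (c <= a && a <= b)) return lo;     // a is the median
    return hi - 1;                                               // otherwise c is the median
}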

Partitioning Divide the elements into a left half and a right half based on the pivot; that is, given 8, 4, 2, 9, 3, 5, 7 and pivot 5, separate the elements less than 5 from those greater than 5. Conceptually simple, but the hardest part to code up correctly. After picking the pivot, we need to partition, ideally in linear time and ideally in place. Ideas? 2/08/2019 53

Partitioning One approach (there are slightly fancier ones): 1. Swap the pivot with arr[lo] to move it out of the way. 2. Use two fingers i and j, starting at lo+1 and hi-1 (the start and end of the range, apart from the pivot). 3. Move j from the right until we hit something less than or equal to the pivot (it belongs on the left side); move i from the left until we hit something greater than the pivot (it belongs on the right side); swap these two and keep moving inward: while (i < j) { if (arr[j] > pivot) j--; else if (arr[i] <= pivot) i++; else swap arr[i] with arr[j]; } 4. Put the pivot back in the middle (swap with arr[i]). 2/08/2019 54
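
Here is one way the partition step described above might be coded (a sketch, not the only correct version). It assumes the pivot has already been swapped into arr[lo] (step 1) and that the range has at least two elements, and it returns the pivot's final index.

static int partition(int[] arr, int lo, int hi) {
    int pivot = arr[lo];
    int i = lo + 1, j = hi - 1;                  // two fingers, skipping the pivot slot
    while (i < j) {
        if (arr[j] > pivot) {
            j--;                                 // arr[j] already belongs on the right
        } else if (arr[i] <= pivot) {
            i++;                                 // arr[i] already belongs on the left
        } else {
            int t = arr[i]; arr[i] = arr[j]; arr[j] = t;   // both out of place: swap them
        }
    }
    // Now i == j: everything left of i is <= pivot, everything right of i is > pivot.
    int pivotPos = (arr[i] <= pivot) ? i : i - 1;
    arr[lo] = arr[pivotPos];                     // swap the pivot into the boundary position
    arr[pivotPos] = pivot;
    return pivotPos;
}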

Quicksort Example Step one: pick the pivot as the median of 3 (lo = 0, hi = 10). Indices: 0 1 2 3 4 5 6 7 8 9. Array: 8 1 4 9 0 3 5 2 7 6. Step two: move the pivot to the lo position. Array: 6 1 4 9 0 3 5 2 7 8. 2/08/2019 55

Quicksort Example Now partition in place (often there is more than one swap during the partition; this is a short example): 6 1 4 9 0 3 5 2 7 8. Move fingers: 6 1 4 9 0 3 5 2 7 8. Swap: 6 1 4 2 0 3 5 9 7 8. Move fingers: 6 1 4 2 0 3 5 9 7 8. Move pivot: 5 1 4 2 0 3 6 9 7 8. 2/08/2019 56

Quicksort Analysis Best-case? Worst-case? Average-case? 2/08/2019 57

Quicksort Analysis Best-case: the pivot is always the median. T(0) = T(1) = 1; T(n) = 2T(n/2) + n (linear-time partition). Same recurrence as mergesort: O(n log n). Worst-case: the pivot is always the smallest or largest element. T(0) = T(1) = 1; T(n) = 1T(n-1) + n. Basically the same recurrence as selection sort: O(n²). Average-case (e.g., with a random pivot): O(n log n), not responsible for the proof (in the text). 2/08/2019 58
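
For reference, the worst-case recurrence above unrolls as follows (constants taken to be 1, as on the earlier MergeSort recurrence slide; written in LaTeX notation):

T(n) = T(n-1) + n
     = n + (n-1) + (n-2) + \cdots + 2 + 1
     = \frac{n(n+1)}{2} \in \Theta(n^2)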

Quicksort Cutoffs For small n, all that recursion tends to cost more than doing a quadratic sort. Remember: asymptotic complexity is for large n. Also, recursive calls add a lot of overhead for small n. Common engineering technique: switch to a different algorithm for subproblems below a cutoff. Reasonable rule of thumb: use insertion sort for n < 10. Notes: could also use a cutoff for merge sort; cutoffs are also the norm with parallel algorithms (switch to a sequential algorithm below the cutoff); none of this affects asymptotic complexity. 2/08/2019 59

Quicksort Cutoff skeleton void quicksort(int[] arr, int lo, int hi) { if (hi - lo < CUTOFF) insertionsort(arr, lo, hi); else ... } Notice how this cuts out the vast majority of the recursive calls: think of the recursive calls to quicksort as a tree; the cutoff trims out the bottom layers of the tree. 2/08/2019 60
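
Filling in the skeleton, a sketch under the assumption that the partition helper sketched earlier and a range-based insertionSort(arr, lo, hi) are available; CUTOFF = 10 follows the rule of thumb on the previous slide. It sorts arr[lo..hi); pivot selection is left out of the sketch.

static final int CUTOFF = 10;

static void quicksort(int[] arr, int lo, int hi) {
    if (hi - lo < CUTOFF) {
        insertionSort(arr, lo, hi);       // small subproblem: switch to a quadratic sort
    } else {
        // pivot selection (e.g., median of 3) and the swap into arr[lo] omitted here
        int p = partition(arr, lo, hi);   // pivot ends up at index p
        quicksort(arr, lo, p);            // elements <= pivot
        quicksort(arr, p + 1, hi);        // elements > pivot
    }
}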