Introduction to Concurrent Programming

Similar documents
Introduction to Concurrent Programming

Introduction. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Introduction. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit. Art of Multiprocessor Programming

Programming Paradigms for Concurrency Introduction

COMP 322: Principles of Parallel Programming Lecture 11: Parallel Programming Issues (Chapter 6) Fall 2009

Introduction to Multiprocessor Synchronization

Introduction. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Introduction. Chapter 1

Υλοποίηση γλωσσών συναρτησιακού προγραμματισμού

ARM Compiler and assembly on QEMU

Lecture 7: Mutual Exclusion 2/16/12. slides adapted from The Art of Multiprocessor Programming, Herlihy and Shavit

Locks Chapter 4. Overview. Introduction Spin Locks. Queue locks. Roger Wattenhofer. Test-and-Set & Test-and-Test-and-Set Backoff lock

Introduction to Concurrency and Multicore Programming. Slides adapted from Art of Multicore Programming by Herlihy and Shavit

Spin Locks and Contention. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Locking. Part 2, Chapter 11. Roger Wattenhofer. ETH Zurich Distributed Computing

Solution: a lock (a/k/a mutex) public: virtual void unlock() =0;

Spin Locks and Contention

Linked Lists: Locking, Lock-Free, and Beyond. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Programming Paradigms for Concurrency Lecture 3 Concurrent Objects

Mutual Exclusion. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Spin Locks and Contention. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Chapter 5 Concurrency: Mutual Exclusion. and. Synchronization. Operating Systems: Internals. and. Design Principles

THREADS & CONCURRENCY

Computer Architecture

An Introduction to Parallel Programming

Multiprocessors & Thread Level Parallelism

Operating Systems. Designed and Presented by Dr. Ayman Elshenawy Elsefy

Chapter 5 Concurrency: Mutual Exclusion and Synchronization

Multi-core Architectures. Dr. Yingwu Zhu

CMSC 330: Organization of Programming Languages. Threads Classic Concurrency Problems

Concurrent Computing

The Dining Philosophers Problem CMSC 330: Organization of Programming Languages

Spin Locks and Contention. Companion slides for Chapter 7 The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

CMSC 330: Organization of Programming Languages. The Dining Philosophers Problem

Computer and Information Sciences College / Computer Science Department CS 207 D. Computer Architecture. Lecture 9: Multiprocessors

Programming Paradigms for Concurrency Lecture 2 - Mutual Exclusion

Concurrent Objects. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Chapter 5: Thread-Level Parallelism Part 1

Online Course Evaluation. What we will do in the last week?

Process Management And Synchronization

CS370 Operating Systems

Kernel Synchronization I. Changwoo Min

Design and Evaluation of Scalable Software

CPSC/ECE 3220 Fall 2017 Exam Give the definition (note: not the roles) for an operating system as stated in the textbook. (2 pts.

Lecture 13: Memory Consistency. + a Course-So-Far Review. Parallel Computer Architecture and Programming CMU , Spring 2013

Other consistency models

Synchronization in Concurrent Programming. Amit Gupta

COL 380: Introduc1on to Parallel & Distributed Programming. Lecture 1 Course Overview + Introduc1on to Concurrency. Subodh Sharma

Principles of Software Construction: Objects, Design, and Concurrency. The Perils of Concurrency Can't live with it. Cant live without it.

What We'll Cover Today

Concurrency Terminology

Programming at Scale: Concurrency

CSE 392/CS 378: High-performance Computing - Principles and Practice

Concurrent Skip Lists. Companion slides for The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit

Parallel Processing: Performance Limits & Synchronization

Operating Systems. Synchronization

10/17/ Gribble, Lazowska, Levy, Zahorjan 2. 10/17/ Gribble, Lazowska, Levy, Zahorjan 4

Agenda Process Concept Process Scheduling Operations on Processes Interprocess Communication 3.2

The Art of Parallel Processing

Midterm Exam March 13, 2013 CS162 Operating Systems

Parallel Algorithm Engineering

CS 136: Advanced Architecture. Review of Caches

Εισαγωγή στον Προγραμματισμό Introduction to Programming

THREADS AND CONCURRENCY

Computer Architecture Crash course

Multi-core Architectures. Dr. Yingwu Zhu

Lesson 1. Concurrency. Terminology Concurrency in Systems Problem Examples Copyright Teemu Kerola 2009

IT 540 Operating Systems ECE519 Advanced Operating Systems

Lecture 10: Multi-Object Synchronization

Parallel Programming: Background Information

Parallel Programming: Background Information

Concurrency: Mutual Exclusion and Synchronization

CSE Traditional Operating Systems deal with typical system software designed to be:

Semaphores. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

Introduction to Concurrency (Processes, Threads, Interrupts, etc.)

CS377P Programming for Performance Multicore Performance Multithreading

CS 571 Operating Systems. Midterm Review. Angelos Stavrou, George Mason University

1. Introduction to Concurrent Programming

Issues in Parallel Processing. Lecture for CPSC 5155 Edward Bosworth, Ph.D. Computer Science Department Columbus State University

Chapter 6: Synchronization. Chapter 6: Synchronization. 6.1 Background. Part Three - Process Coordination. Consumer. Producer. 6.

Threads and Parallelism in Java

Advanced Topic: Efficient Synchronization

CMSC Computer Architecture Lecture 12: Multi-Core. Prof. Yanjing Li University of Chicago

Lecture notes for CS Chapter 4 11/27/18

CS370 Operating Systems Midterm Review

Sharing Objects Ch. 3

Chapter 6: Process [& Thread] Synchronization. CSCI [4 6] 730 Operating Systems. Why does cooperation require synchronization?

Chapter 6: Process Synchronization


Cache Coherence and Atomic Operations in Hardware

Copyright 2010, Elsevier Inc. All rights Reserved

Concurrent Preliminaries

Agenda. Lecture. Next discussion papers. Bottom-up motivation Shared memory primitives Shared memory synchronization Barriers and locks

THREADS & CONCURRENCY

SPIN, PETERSON AND BAKERY LOCKS

Whatever can go wrong will go wrong. attributed to Edward A. Murphy. Murphy was an optimist. authors of lock-free programs LOCK FREE KERNEL

Week 7. Concurrent Programming: Thread Synchronization. CS 180 Sunil Prabhakar Department of Computer Science Purdue University

Now we are going to speak about the CPU, the Central Processing Unit.

Coherence and Consistency

Lecture Topics. Announcements. Today: Concurrency (Stallings, chapter , 5.7) Next: Exam #1. Self-Study Exercise #5. Project #3 (due 9/28)

Transcription:

Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών Τομέας Τεχνολογίας Πληροφορικής και Υπολογιστών Εθνικό Μετσόβιο Πολυτεχνείο Γλώσσες Προγραμματισμού ΙΙ Διδάσκοντες: Νικόλαος Παπασπύρου, Κωστής Σαγώνας Introduction to Concurrent Programming Based on the companion slides for the book The Art of Multiprocessor Programming by Maurice Herlihy & Nir Shavit, 2008

Άδεια Χρήσης Το παρόν εκπαιδευτικό υλικό υπόκειται σε άδειες χρήσης Creative Commons. Για εκπαιδευτικό υλικό, όπως εικόνες, που υπόκειται σε άδεια χρήσης άλλου τύπου, αυτή πρέπει να αναφέρεται ρητώς.

From the New York Times SAN FRANCISCO, 7 May 2004 - Intel said on Friday that it was scrapping its development of two microprocessors, a move that is a shift in the company's business strategy. Introduction to Concurrent Programming 2

Moore s Law Transistor count still rising Clock speed flattening sharply Introduction to Concurrent Programming 3

Your Old Desktop: A Uniprocesor cpu memory Introduction to Concurrent Programming 4

Till recently in the Enterprise: The Shared Memory Multiprocessor (SMP) cache cache Bus cache Bus shared memory Introduction to Concurrent Programming 5

On Your New Desktop: The Multicore processor (CMP) All on the same chip cache cache cache Bus Bus shared memory Sun T2000 Niagara Introduction to Concurrent Programming 6

Multicores Are Here Intel's Intel ups ante with 4-core chip. New microprocessor, due this year, will be faster, use less electricity... [San Fran Chronicle] AMD will launch a dual-core version of its Opteron server processor at an event in New York on April 21. [PC World] Sun s Niagara will have eight cores, each core capable of running 4 threads in parallel, for 32 concurrently running threads.. [The Inquierer] Introduction to Concurrent Programming 7

Why do we care? Time no longer cures software bloat The free ride is over When you double your program s path length You can t just wait 6 months Your software must somehow exploit twice as much concurrency Introduction to Concurrent Programming 8

Traditional Scaling Process Speedup 1.8x 3.6x 7x User code Traditional Uniprocessor Time: Moore s law Introduction to Concurrent Programming 9

Multicore Scaling Process Speedup 1.8x 3.6x 7x User code Multicore Unfortunately, not so simple Introduction to Concurrent Programming 10

Real-World Scaling Process Speedup 1.8x 2x 2.9x User code Multicore Parallelization and Synchronization require great care Introduction to Concurrent Programming 11

Sequential Computation thread memory object object Introduction to Concurrent Programming 12

Concurrent Computation threads memory object object Introduction to Concurrent Programming 13

Asynchrony Sudden unpredictable delays Cache misses (short) Page faults (long) Scheduling quantum used up (really long) Introduction to Concurrent Programming 14

Model Summary Multiple threads Sometimes called processes Single shared memory Objects live in memory Unpredictable asynchronous delays Introduction to Concurrent Programming 15

Road Map We are going to focus on principles first, then practice Start with idealized models Look at simplistic problems Emphasize correctness over pragmatism Correctness may be theoretical, but incorrectness has practical impact Introduction to Concurrent Programming 16

Concurrency Jargon Hardware Processors, cores Software Threads, processes Sometimes OK to confuse them, sometimes not Introduction to Concurrent Programming 17

Parallel Primality Testing Challenge Print primes from 1 to 10 10 Given Ten-processor multiprocessor One thread per processor Goal Get ten-fold speedup (or close) Introduction to Concurrent Programming 18

Load Balancing 1 10 9 2 10 9 10 10 P 0 P 1 P 9 Split the work evenly Each thread tests range of 10 9 Introduction to Concurrent Programming 19

Procedure for Thread i void primeprint { int i = ThreadID.get(); // IDs in {0..9} for (j = i*10 9 +1, j < (i+1)*10 9 ; j++) { if (isprime(j)) print(j); } } Introduction to Concurrent Programming 20

Issues Higher ranges have fewer primes Yet larger numbers harder to test Thread workloads Uneven Hard to predict rejected Need dynamic load balancing Introduction to Concurrent Programming 21

Shared Counter 19 18 each thread takes a number 17 Introduction to Concurrent Programming 22

Procedure for Thread i int counter = new Counter(1); void primeprint { long j = 0; while (j < 10 10 ) { j = counter.getandincrement(); if (isprime(j)) print(j); } } Introduction to Concurrent Programming 23

Procedure for Thread i Counter counter = new Counter(1); void primeprint { long j = 0; while (j < 10 10 ) { j = counter.getandincrement(); } } if (isprime(j)) print(j); Shared counter object Introduction to Concurrent Programming 24

Where Things Reside int counter = new Counter(1); void primeprint { long j = 0; while (j < 10 10 ) { j = counter.getandincrement(); if (isprime(j)) print(j); } } code Local variables cache cache Bus cache Bus 1 shared memory shared counter Introduction to Concurrent Programming 25

Procedure for Thread i Counter counter = new Counter(1); void primeprint { Stop when every long j = 0; value taken while (j < 10 10 ) { j = counter.getandincrement(); if (isprime(j)) print(j); } } Introduction to Concurrent Programming 26

Procedure for Thread i Counter counter = new Counter(1); void primeprint { long j = 0; while (j < 10 10 ) { j = counter.getandincrement(); if (isprime(j)) print(j); } } Increment & return each new value Introduction to Concurrent Programming 27

Counter Implementation public class Counter { private long value; } public long getandincrement() { return value++; } OK for single thread, not for concurrent threads Introduction to Concurrent Programming 28

What It Means public class Counter { private long value; } public long getandincrement() { return value++; temp = value; } value = value + 1; return temp; Introduction to Concurrent Programming 29

Not so good Value 1 2 3 2 read 1 write 2 read 2 write 3 read 1 write 2 time Introduction to Concurrent Programming 30

Is this problem inherent? read write write read If we could only glue reads and writes Introduction to Concurrent Programming 31

Challenge public class Counter { private long value; } public long getandincrement() { temp = value; value = temp + 1; return temp; } Make these steps atomic (indivisible) Introduction to Concurrent Programming 32

Hardware Solution public class Counter { private long value; public long getandincrement() { temp = value; value = temp + 1; return temp; } } ReadModifyWrite() instruction Introduction to Concurrent Programming 33

An Aside: Java public class Counter { private long value; } public long getandincrement() { synchronized { temp = value; value = temp + 1; } return temp; } Synchronized block Introduction to Concurrent Programming 34

An Aside: Java public class Counter { private long value; } public long getandincrement() { synchronized { temp = value; value = temp + 1; } return temp; } Mutual Exclusion Introduction to Concurrent Programming 35

Mutual Exclusion or Alice & Bob share a pond A B Introduction to Concurrent Programming 36

Alice has a pet A B Introduction to Concurrent Programming 37

Bob has a pet A B Introduction to Concurrent Programming 38

The Problem A B The pets don t get along Introduction to Concurrent Programming 39

Formalizing the Problem Two types of formal properties in asynchronous computation: Safety Properties Nothing bad happens ever Liveness Properties Something good happens eventually Introduction to Concurrent Programming 40

Formalizing our Problem Mutual Exclusion Both pets never in pond simultaneously This is a safety property No Deadlock if only one wants in, it gets in if both want in, one gets in. This is a liveness property Introduction to Concurrent Programming 41

Simple Protocol Idea Just look at the pond Gotcha Trees obscure the view Introduction to Concurrent Programming 42

Interpretation Threads can t see what other threads are doing Explicit communication required for coordination Introduction to Concurrent Programming 43

Mobile Phone Protocol Idea Bob sends SMS to Alice (or vice-versa) Gotcha Bob takes shower Alice recharges battery Bob out shopping for pet food Introduction to Concurrent Programming 44

Interpretation Message-passing doesn t work Recipient might not be Listening There at all Communication must be Persistent (like writing) Not transient (like speaking) Introduction to Concurrent Programming 45

Can Protocol cola cola Introduction to Concurrent Programming 46

Bob conveys a bit A B cola Introduction to Concurrent Programming 47

Bob conveys a bit A B cola Introduction to Concurrent Programming 48

Can Protocol Idea Cans on Alice s windowsill Strings lead to Bob s house Bob pulls strings, knocks over cans Gotcha Cans cannot be reused Bob runs out of cans Introduction to Concurrent Programming 49

Interpretation Cannot solve mutual exclusion with interrupts Sender sets fixed bit in receiver s space Receiver resets bit when ready Requires unbounded number of interrupt bits Introduction to Concurrent Programming 50

Flag Protocol A B Introduction to Concurrent Programming 51

Alice s Protocol (sort of) A B Introduction to Concurrent Programming 52

Bob s Protocol (sort of) A B Introduction to Concurrent Programming 53

Alice s Protocol Raise flag Wait until Bob s flag is down Unleash pet Lower flag when pet returns Introduction to Concurrent Programming 54

Bob s Protocol Raise flag Wait until Alice s flag is down Unleash pet Lower flag when pet returns danger! Introduction to Concurrent Programming 55

Bob s Protocol (2 nd try) Bob defers Raise flag to Alice While Alice s flag is up Lower flag Wait for Alice s flag to go down Raise flag Unleash pet Lower flag when pet returns Introduction to Concurrent Programming 56

The Flag Principle Raise the flag Look at other s flag Flag Principle: If each raises and looks, then Last to look must see both flags up Introduction to Concurrent Programming 57

Proof of Mutual Exclusion Assume both pets in pond Derive a contradiction By reasoning backwards Consider the last time Alice and Bob each looked before letting the pets in Without loss of generality assume Alice was the last to look Introduction to Concurrent Programming 58

Proof Bob last raised flag Alice last raised her flag Alice s last look Bob s last looked time QED Alice must have Introduction seen to Concurrent Bob s Programming flag. A contradiction 59

Proof of No Deadlock If only one pet wants in, it gets in. Deadlock requires both continually trying to get in. If Bob sees Alice s flag, he gives her priority (a gentleman ) QED Introduction to Concurrent Programming 60

Remarks Protocol is unfair Bob s pet might never get in Protocol uses waiting If Bob is eaten by his pet, Alice s pet might never get in Introduction to Concurrent Programming 61

Moral of Story Mutual Exclusion cannot be solved by transient communication (mobile phones) interrupts (cans) It can be solved by one-bit shared variables that can be read or written Introduction to Concurrent Programming 62

The Arbiter Problem (an aside) Pick a point Pick a point Introduction to Concurrent Programming 63

The Fable Continues Alice and Bob fall in love & marry Then they fall out of love & divorce She gets the pets (who now get along fine with each other) He has to feed them Leading to a new coordination problem: Producer-Consumer Introduction to Concurrent Programming 64

Bob Puts Food in the Pond A Introduction to Concurrent Programming 65

Alice releases her pets to Feed mmm mmm B Introduction to Concurrent Programming 66

Producer/Consumer Alice and Bob can t meet Each has restraining order on other So he puts food in the pond And later, she releases the pets Avoid Releasing pets when there s no food Putting out food if uneaten food remains Introduction to Concurrent Programming 67

Producer/Consumer Need a mechanism so that Bob lets Alice know when food has been put out Alice lets Bob know when to put out more food Introduction to Concurrent Programming 68

Surprise Solution A B cola Introduction to Concurrent Programming 69

Bob puts food in Pond A B cola Introduction to Concurrent Programming 70

Bob knocks over Can A B cola Introduction to Concurrent Programming 71

Alice Releases Pets A yum yum B cola Introduction to Concurrent Programming 72

Alice Resets Can when Pets are Fed A B cola Introduction to Concurrent Programming 73

Pseudocode while (true) { while (can.isup()){}; pet.release(); Bob s code pet.recapture(); can.reset(); } while (true) { while (can.isdown()){}; pond.stockwithfood(); can.knockover(); } Alice s code Introduction to Concurrent Programming 74

Correctness Mutual Exclusion Pets and Bob never together in pond No Starvation safety liveness if Bob always willing to feed, and pets always famished, then pets eat infinitely often. Producer/Consumer The pets never enter pond unless there is food, and Bob never provides food if there is unconsumed food. safety Introduction to Concurrent Programming 75

Could Also Solve Using Flags A B Introduction to Concurrent Programming 76

Waiting Both solutions use waiting while(mumble){} Waiting is problematic If one participant is delayed So is everyone else But delays are common & unpredictable Introduction to Concurrent Programming 77

The Fable drags on Bob and Alice still have issues But they need to communicate So they agree to use billboards Introduction to Concurrent Programming 78

Billboards are Large B A 3 C 1 3 D 2 E 1 Letter Tiles From Scrabble box Introduction to Concurrent Programming 79

Write One Letter at a Time W 4 A 1 S 1 H4 A 1 B 3 C 3 D 2 E 1 Introduction to Concurrent Programming 80

To post a message W A S H T H E C A R 4 1 1 4 1 4 1 3 1 1 whe w Introduction to Concurrent Programming 81

Let s send another message L S E L L L A V A 1 A 1 1 1 1 1 1 1 4 1 M P3 3 S 1 Introduction to Concurrent Programming 82

Uh-Oh S 1 E 1 L 1 L 1 T 1 H 4 E 1 C A R 3 1 1 L 1 OK Introduction to Concurrent Programming 83

Readers/Writers Devise a protocol so that Writer writes one letter at a time Reader reads one letter at a time Reader sees Old message or new message No mixed messages Introduction to Concurrent Programming 84

Readers/Writers (cont d) Easy with mutual exclusion But mutual exclusion requires waiting One waits for the other Everyone executes sequentially Remarkably We can solve R/W without mutual exclusion Introduction to Concurrent Programming 85

Why do we care? We want as much of the code as possible to execute concurrently (in parallel) A larger sequential part implies reduced performance Amdahl s law: this relation is not linear Introduction to Concurrent Programming 86

Amdahl s Law Speedup = OldExecutionTime NewExecutionTime of computation given n CPUs instead of 1 Introduction to Concurrent Programming 87

Amdahl s Law Sequential fraction Speedup = Number of processors 1 1 p + p n Parallel fraction Introduction to Concurrent Programming 88

Example Ten processors 60% concurrent, 40% sequential How close to 10-fold speedup? Speedup=2.17= 1 1 0.6 0.6 10 Introduction to Concurrent Programming 89

Example Ten processors 80% concurrent, 20% sequential How close to 10-fold speedup? Speedup=3.57= 1 1 0.8 0.8 10 Introduction to Concurrent Programming 90

Example Ten processors 90% concurrent, 10% sequential How close to 10-fold speedup? Speedup=5.26= 1 1 0.9 0. 9 10 Introduction to Concurrent Programming 91

Example Ten processors 99% concurrent, 01% sequential How close to 10-fold speedup? Speedup=9.17= 1 1 0. 99 0. 99 10 Introduction to Concurrent Programming 92

The Moral Making good use of our multiple processors (cores) means Finding ways to effectively parallelize our code Minimize sequential parts Reduce idle time: time in which threads wait without work Introduction to Concurrent Programming 93

Άδεια Χρήσης Το παρόν εκπαιδευτικό υλικό υπόκειται σε άδειες χρήσης Creative Commons. Για εκπαιδευτικό υλικό, όπως εικόνες, που υπόκειται σε άδεια χρήσης άλλου τύπου, αυτή πρέπει να αναφέρεται ρητώς.