AC: COMPOSABLE ASYNCHRONOUS IO FOR NATIVE LANGUAGES. Tim Harris, Martín Abadi, Rebecca Isaacs & Ross McIlroy

1 AC: COMPOSABLE ASYNCHRONOUS IO FOR NATIVE LANGUAGES Tim Harris, Martín Abadi, Rebecca Isaacs & Ross McIlroy

2 Synchronous IO in the Windows API
Read the contents of h, and compute a result:
    BOOL ProcessFile(HANDLE h, int *output) {
      BOOL ok = TRUE;
      DWORD size = FileSize(h);
      LPVOID buf = malloc(size);
      for (int i = 0; ok && i<(size/1024); i++) {
        ok &= ReadFile(h, buf+(i*1024), 1024, NULL, NULL);
      }
      if (ok) {
        *output = ...;
      }
      free(buf);
      return ok;
    }
Implemented using synchronous calls to ReadFile to read each block in turn.
It exposes reads one by one to the IO system: no pipelining, poor performance.
If we want to process multiple files we must either (i) handle them serially, or (ii) introduce threading.
15/11/2011
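To make the block-at-a-time pattern concrete outside Windows, here is a minimal sketch in portable C using stdio. The names process_file_sync and make_test_file, the byte-sum "result", and the use of fread in place of ReadFile are all assumptions made for illustration; they are not from the slides. Like the slide's loop, it reads only whole 1024-byte blocks, and each read must return before the next one is issued.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>
#include <stdlib.h>

enum { BLOCK_SIZE = 1024 };

/* Hypothetical portable analogue of the slide's ProcessFile: read the
 * stream one block at a time and sum the bytes that were read.
 * Only one read is ever outstanding, so nothing is pipelined. */
static bool process_file_sync(FILE *f, size_t size, unsigned long *output)
{
    assert(f != NULL && output != NULL);

    unsigned char *buf = malloc(size ? size : 1);
    if (buf == NULL)
        return false;

    bool ok = true;
    size_t nblocks = size / BLOCK_SIZE;   /* trailing partial block ignored */
    for (size_t i = 0; ok && i < nblocks; i++) {
        /* Blocks until this block has been read (or the read fails). */
        ok = fread(buf + i * BLOCK_SIZE, 1, BLOCK_SIZE, f) == BLOCK_SIZE;
    }

    if (ok) {
        unsigned long sum = 0;
        for (size_t i = 0; i < nblocks * BLOCK_SIZE; i++)
            sum += buf[i];
        *output = sum;                    /* illustrative "result" only */
    }

    free(buf);
    return ok;
}

/* Helper for trying the sketch: a temporary file holding `size` bytes,
 * each with value 1, rewound to the start. Returns NULL on failure. */
static FILE *make_test_file(size_t size)
{
    FILE *f = tmpfile();
    if (f == NULL)
        return NULL;
    for (size_t i = 0; i < size; i++)
        fputc(1, f);
    rewind(f);
    return f;
}
```

Calling process_file_sync on a file produced by make_test_file(4096) performs four reads strictly one after another, which is the behaviour the slide criticises.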

3 Asynchronous IO
    static OVERLAPPED o[pd];
    BOOL ProcessFile(HANDLE h, int *output) {
      DWORD size = FileSize(h);
      LPVOID buf = malloc(size);
      // Create completion port to synchronize with IOs
      HANDLE cp = CreateIoCompletionPort(h, NULL, 0, 0);
      BOOL ok = TRUE;
      ULONG_PTR idx;
      LPOVERLAPPED op;
      for (int i = 0; i<pd; i++) {
        PostQueuedCompletionStatus(cp, 0, i, &o[i]);
      }
      // Main loop, issuing IO operations
      for (int i = 0; i<size/1024; i++) {
        GetQueuedCompletionStatus(cp, NULL, &idx, &op, INFINITE);
        op->Offset = i*1024;
        ReadFile(h, buf+(i*1024), 1024, NULL, op);
      }
      // Drain the completion port, waiting for all IOs to be done
      for (int i = 0; i<pd; i++) {
        GetQueuedCompletionStatus(cp, NULL, &idx, &op, INFINITE);
      }
      if (ok) {
        *output = ...;
      }
      CloseHandle(cp);
      free(buf);
      return ok;
    }
...and asynchronous IO is not composable: ProcessFile is just a normal synchronous function.
(And this is with a lot of error handling code omitted.)

4 AC: asynchronous C
Synchronous IO: easy to use, poor perf.
Async IO: difficult to use, good perf.
AC: similar programming model to synchronous, similar performance to async.
1. Focus on performance and ability for use in low level code (e.g., monitor process in Barrelfish research OS)
2. Small number of constructs: starting async work, synchronize with async work, stop waiting for async work
3. Enable higher level features to be built over this: join patterns, futures, ...

5 AC: almost the same as synchronous
    BOOL ProcessFileAC(HANDLE h, int *output) {
      BOOL ok = TRUE;
      DWORD size = FileSize(h);
      LPVOID buf = malloc(size);
      do {
        for (int i = 0; ok && i<n; i++) {
          async { ok &= ReadFileAC(h, i*1024, buf+(i*1024), 1024); }
        }
      } finish;
      if (ok) {
        *output = ...;
      }
      free(buf);
      return ok;
    }
1. Use the AC primitives for data access
2. Use async to identify work that can continue while waiting: { async X ; Y } starts X and, if it blocks, continues with Y
3. Add finish to identify synchronization: do { X } finish does not go past finish until all the async work in X is done

6 AC is composable
We can build layered abstractions using AC: user code can be called asynchronously, not just primitives:
    BOOL ProcessTwoFilesAC(HANDLE h1, HANDLE h2, int *output) {
      int o1, o2;
      BOOL ok = TRUE;
      do {
        async { ok &= ProcessFileAC(h1, &o1); }
        async { ok &= ProcessFileAC(h2, &o2); }
      } finish;
      if (ok) *output = ...;
      return ok;
    }

7 Making multiple requests
    // Counting semaphore to limit requests
    SemAC_t sem;
    SemACInit(&sem, 100);
    // Loop potentially starting lots of work
    do {
      for (int i = 0; i < ...; i++) {
        P_SemAC(&sem);
        async {
          // Do work
          V_SemAC(&sem);
        }
      }
    } finish;
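The effect of the counting semaphore can be illustrated with a single-threaded toy model in plain C. This is only a sketch under stated assumptions: struct limiter, acquire, complete_one and issue_all are hypothetical names, requests are assumed to complete in FIFO order, and "waiting" is modelled by processing one simulated completion. It is not how SemAC is implemented.

```c
#include <assert.h>

/* Toy model of a counting semaphore that bounds outstanding requests. */
struct limiter {
    int permits;        /* free slots, like the semaphore count */
    int in_flight;      /* requests issued but not yet completed */
    int max_in_flight;  /* high-water mark of in_flight */
    int completed;      /* requests that have finished */
};

static void limiter_init(struct limiter *l, int limit)
{
    assert(limit > 0);
    l->permits = limit;
    l->in_flight = 0;
    l->max_in_flight = 0;
    l->completed = 0;
}

/* Simulated completion: one outstanding request finishes and releases
 * its slot (the V operation in the slide). */
static void complete_one(struct limiter *l)
{
    assert(l->in_flight > 0);
    l->in_flight--;
    l->completed++;
    l->permits++;
}

/* The P operation: if no slot is free, wait for a completion first. */
static void acquire(struct limiter *l)
{
    while (l->permits == 0)
        complete_one(l);
    l->permits--;
}

/* Issue `total` requests, then drain (the role played by finish). */
static void issue_all(struct limiter *l, int total)
{
    for (int i = 0; i < total; i++) {
        acquire(l);
        l->in_flight++;
        if (l->in_flight > l->max_in_flight)
            l->max_in_flight = l->in_flight;
    }
    while (l->in_flight > 0)
        complete_one(l);
}
```

With a limit of 100, the loop can start any number of requests while never having more than 100 outstanding at once.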

8 Cancellation
Suppose we want to add a timeout:
    BOOL TryProcessFileAC(HANDLE h, int timeout, int *result) {
      BOOL ok = TRUE;
      with_timeout: do {
        async { ok = ProcessFileAC(h, result); cancel with_timeout; }
        async { SleepAC(timeout); cancel with_timeout; }
      } finish;
      return ok;
    }
1. Suppose we execute the first cancel statement (after ProcessFileAC completes)
2. It marks the named do..finish as cancelled, and pokes blocked operations dynamically within it
3. SleepAC is woken and terminates early with FALSE
4. The second cancellation (after SleepAC) has no effect
5. The finish works as before: wait until both asyncs are done
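The five steps on this slide can be mimicked with a small model in plain C. It is a sketch only: struct scope, scope_cancel and sleep_model are hypothetical names, and the model assumes that a cancellable operation polls a flag on its enclosing block, which is a simplification of how AC pokes blocked operations.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical model of the cancellation state of a named do..finish. */
struct scope {
    bool cancelled;  /* set by the first cancel */
    int  pokes;      /* times blocked operations were poked */
};

/* cancel: the first call marks the scope and pokes blocked operations;
 * later calls have no effect. Returns true if this call had an effect. */
static bool scope_cancel(struct scope *s)
{
    if (s->cancelled)
        return false;
    s->cancelled = true;
    s->pokes++;
    return true;
}

/* Co-operative sleep: counts `ticks` steps, checking the scope before
 * each one. Returns true after a full sleep, false if woken early.
 * *slept reports how many steps were actually taken. */
static bool sleep_model(const struct scope *s, int ticks, int *slept)
{
    *slept = 0;
    while (*slept < ticks) {
        if (s->cancelled)
            return false;
        (*slept)++;
    }
    return true;
}
```

In the slide's scenario, the file operation finishes first and calls scope_cancel, sleep_model then returns false without sleeping, and the second scope_cancel returns false because the scope is already marked.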

9 Cancellation
Suppose we want to add a timeout:
    BOOL ProcessFileAC(HANDLE h, int *output) {
      BOOL ok = TRUE;
      DWORD size = FileSize(h);
      LPVOID buf = malloc(size);
      do {
        for (int i = 0; ok && i<n; i++) {
          async { ok &= ReadFileAC(h, i*1024, buf+(i*1024), 1024); }
        }
      } finish;
      if (ok) {
        *output = ...;
      }
      free(buf);
      return ok;
    }
    BOOL TryProcessFileAC(HANDLE h, int timeout, int *result) {
      BOOL ok = TRUE;
      with_timeout: do {
        async { ok = ProcessFileAC(h, result); cancel with_timeout; }
        async { SleepAC(timeout); cancel with_timeout; }
      } finish;
      return ok;
    }
1. Suppose we execute the cancel statement after SleepAC (the timeout fires first)
2. It pokes blocked operations in the named do..finish
3. Cancellation propagates into ProcessFileAC
4. The result of ProcessFileAC forms the overall result

10 Exact cancellation
A convention: provided by built-in primitives, recommended for user code.
An *AC function either:
    Returns TRUE if it completes successfully without cancellation
    Returns FALSE and undoes its partial side effects if it is cancelled
Easier said than done:
1. Remember cancellation is co-operative
2. *NAC non-cancellable primitives
3. For responsiveness, push clean-up work to a thread pool
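As an illustration of the convention, here is a minimal sketch in plain C of an operation that either completes and returns true, or returns false with its partial side effects undone. Everything here is hypothetical: copy_exact, the polling callback and struct countdown are illustrative names, and cancellation is modelled as a callback checked between chunks rather than as AC's mechanism.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical cancellation source: reports "cancelled" once
 * polls_left polls have been answered with "not cancelled". */
struct countdown { int polls_left; };

static bool cancelled_after(void *ctx)
{
    struct countdown *c = ctx;
    if (c->polls_left == 0)
        return true;
    c->polls_left--;
    return false;
}

static bool never_cancelled(void *ctx)
{
    (void)ctx;
    return false;
}

/* Overwrite dst[0..n) with src[0..n) in chunks of `step` bytes, polling
 * for cancellation before each chunk. Returns true when the copy is
 * complete. Returns false, with dst restored to its original contents,
 * if cancelled midway; also returns false, with dst untouched, if the
 * backup buffer cannot be allocated. */
static bool copy_exact(unsigned char *dst, const unsigned char *src,
                       size_t n, size_t step,
                       bool (*is_cancelled)(void *), void *ctx)
{
    assert(step > 0);

    unsigned char *saved = malloc(n ? n : 1);
    if (saved == NULL)
        return false;
    memcpy(saved, dst, n);              /* remember state to undo to */

    for (size_t off = 0; off < n; off += step) {
        if (is_cancelled(ctx)) {
            memcpy(dst, saved, n);      /* undo partial side effects */
            free(saved);
            return false;
        }
        size_t len = (n - off < step) ? n - off : step;
        memcpy(dst + off, src + off, len);
    }

    free(saved);
    return true;
}
```

The cost of the convention is visible even in this toy: the function has to keep enough state to roll back, which is part of why the slide calls it easier said than done.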

11 Implementation
[Diagram: a thread with its run queue and current FB pointer; a finish block (FB) holding ACTV, enclosing FB, count = 0, completion AWI, and first/last pointers; the stack for main]
Techniques:
    Per-thread run queue of atomic work items ("AWIs")
    Wait for next IO completion when AWI queue empty
    Book-keeping data all stack allocated
    Cactus stack implementation, creating a new stack lazily when blocking (rather than eagerly on entering an async)
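The run queue and finish-block counter can be approximated with a toy model in plain C. This is a sketch under loud assumptions, not the Barrelfish implementation: the struct layouts and the names rq_push, rq_pop, finish_wait and add_one are hypothetical, every work item is queued up front (AC instead runs an async body directly and only creates a new stack when it blocks), and there are no stacks or IO completions at all.

```c
#include <assert.h>
#include <stddef.h>

struct finish_block;

/* Atomic work item (AWI): a unit of work that runs to completion. */
struct awi {
    void (*fn)(void *arg);
    void *arg;
    struct finish_block *fb;   /* finish block this work belongs to */
    struct awi *next;
};

/* Finish block: counts asyncs started inside it and not yet done. */
struct finish_block {
    struct finish_block *enclosing;
    int count;
};

/* Per-thread FIFO run queue of AWIs. */
struct run_queue {
    struct awi *first;
    struct awi *last;
};

/* Start async work: account for it in fb and append it to the queue. */
static void rq_push(struct run_queue *q, struct awi *w,
                    struct finish_block *fb)
{
    w->fb = fb;
    w->next = NULL;
    fb->count++;
    if (q->last != NULL)
        q->last->next = w;
    else
        q->first = w;
    q->last = w;
}

static struct awi *rq_pop(struct run_queue *q)
{
    struct awi *w = q->first;
    if (w != NULL) {
        q->first = w->next;
        if (q->first == NULL)
            q->last = NULL;
    }
    return w;
}

/* finish: run queued items until every async in fb is done. A real
 * runtime would wait for the next IO completion when the queue is
 * empty; this model just asserts that work is available. */
static void finish_wait(struct run_queue *q, struct finish_block *fb)
{
    while (fb->count > 0) {
        struct awi *w = rq_pop(q);
        assert(w != NULL);
        w->fn(w->arg);
        w->fb->count--;
    }
}

/* Example work item body: increment the int that arg points to. */
static void add_one(void *arg)
{
    ++*(int *)arg;
}
```

Because the AWIs and the finish block are ordinary structs, a caller can place all of them on its own stack, which echoes the slide's point that the book-keeping data is stack allocated.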

12 Performance
[Table: function call latency (cycles) for three cases]
    Direct (normal function call)
    async X() (X does not block)
    async X() (X blocks)
Do not fear async. Think about correctness: if the callee doesn't block then perf is basically unchanged.

13 Performance: Barrelfish ping pong test
[Chart: latency (cycles) for each configuration]
    Using ring-buffer channel directly
    Using portable event-based interface
    Using AC (one side only)
    Using AC (both sides)
    MPI (Visual Studio HPC-Pack 2008 SDK)
Ping pong test, minimum sized messages, AMD 4 * 4-core machine, using cores sharing L3 cache.

14 2-phase commit between cores
[Chart comparing AC, Async and Sync]
19 event handlers, 5 temporary structures

15 Software RAID0 model, 64KB transfers
2 * Intel X25-M SSDs, each 250MB/s
[Chart comparing AC, Async and Sync]

16 Performance (embedded web server)
Apache benchmark workload driver, running across WAN between CA and UK.

17 Summary
AC: composable asynchronous IO for native languages
    Similar programming style to synchronous IO APIs
    Similar performance to asynchronous IO APIs
Also in the paper:
    Operational semantics for core language
    Integration of IO with runtime system
Available as part of the Barrelfish research OS.
We're hiring:

18 Microsoft Corporation. All rights reserved. This material is provided for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. Microsoft is a registered trademark or trademark of Microsoft Corporation in the United States and/or other countries.

Extensions to Barrelfish Asynchronous C Michael Quigley michaelforrquigley@gmail.com School of Computing, University of Utah October 27, 2016 1 Abstract The intent of the Microsoft Barrelfish Asynchronous