Pay Migration Tax to Homeland: Anchor-based Scalable Reference Counting for Multicores

1 Pay Migration Tax to Homeland: Anchor-based Scalable Reference Counting for Multicores
Seokyong Jung, Jongbin Kim, Minsoo Ryu, Sooyong Kang, Hyungsoo Jung
Hanyang University

2 Reference counting
Reference counting is a general technique for tracking the number of references to a resource, mainly used to reclaim resources in a timely manner.
Scalability is the most important challenge in a multicore environment.
1. REF (increase counter)
2. Use resource
3. UNREF (decrease counter)
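
The REF, use, UNREF sequence on this slide can be sketched as a small single-threaded model. The class name `Resource` and its fields are hypothetical, chosen only for illustration; the sketch assumes the resource is reclaimed as soon as the counter returns to zero.

```python
class Resource:
    """A resource whose lifetime is governed by a reference counter."""

    def __init__(self, name):
        self.name = name
        self.refcount = 0
        self.reclaimed = False

    def ref(self):
        # Step 1: REF increases the counter before the resource is used.
        if self.reclaimed:
            raise RuntimeError("resource already reclaimed")
        self.refcount += 1

    def unref(self):
        # Step 3: UNREF decreases the counter; at zero the resource is reclaimed.
        if self.refcount == 0:
            raise RuntimeError("unref without a matching ref")
        self.refcount -= 1
        if self.refcount == 0:
            self.reclaimed = True


page = Resource("page")
page.ref()                  # 1. REF
in_use = page.refcount      # 2. use the resource while the reference is held
page.unref()                # 3. UNREF: the last reference is gone
```

The model ignores concurrency entirely; it only shows the bookkeeping that every scheme in the rest of the deck has to preserve.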

3 Known scalability issues of reference counting in Linux
Traditional counting in Linux: every thread updates one shared atomic counter.
Increment: CAS(counter, v, v+1)
Decrement: CAS(counter, v, v-1)
Figure: read throughput for the same page in a single file (Min et al., ATC 16).
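
A minimal sketch of the CAS-based increment and decrement named on this slide. Python has no hardware compare-and-swap, so the hypothetical `AtomicCounter` class below emulates one with a lock; that emulation is an assumption of the sketch, not how the kernel does it. What carries over is the retry loop: every thread reads `v`, then tries to install `v+1` or `v-1`, and must retry if another thread changed the counter in between.

```python
import threading


class AtomicCounter:
    """Emulated atomic integer; the lock stands in for hardware atomicity."""

    def __init__(self, value=0):
        self._value = value
        self._lock = threading.Lock()

    def load(self):
        return self._value

    def compare_and_swap(self, expected, new):
        # Succeeds only if nobody changed the value since it was read.
        with self._lock:
            if self._value != expected:
                return False
            self._value = new
            return True


def increment(counter):
    while True:
        v = counter.load()
        if counter.compare_and_swap(v, v + 1):   # CAS(counter, v, v+1)
            return


def decrement(counter):
    while True:
        v = counter.load()
        if counter.compare_and_swap(v, v - 1):   # CAS(counter, v, v-1)
            return


def worker(counter):
    for _ in range(1000):
        increment(counter)
    for _ in range(500):
        decrement(counter)


shared = AtomicCounter()
threads = [threading.Thread(target=worker, args=(shared,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
final_value = shared.load()
```

All four workers hammer the same counter, which is the contention pattern the slide identifies as the scalability problem.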

4 Four performance metrics we established
Counting overhead: cost of updating a reference counter
Query overhead: cost of checking whether a reference counter is zero
Space overhead: space required for the reference counter itself
Time overhead: time spent synchronizing the internal structures used for reference counting
Figure: chart with axes Counting, Query, Space, and Time, with the ideal design marked.

5 Overhead analysis of prior proposals
Traditional counting (figure: threads share a single atomic counter as the reference counter).
Low query, low space, low time, high counting.

6 Overhead analysis of prior proposals (cont.)
Contention distribution (figure: threads update several atomic counters that together form the reference counter).
Better counting, worse space, worse query.

7 Overhead analysis of prior proposals (cont.)
Prior proposals based on contention distribution:
SNZI (Ellen et al., PODC 07)
Carrefour (Dashti et al., SIGPLAN Notices (2013))
Doppel (Narula et al., OSDI 14)
Dynamic SNZI (Acar et al., PPoPP 17)
Better counting, worse space, worse query.

8 Overhead analysis of prior proposals (cont.)
Cache affinity (figure: each of four cores has its own atomic counter for the reference counter).
Better counting, worse space, worse query.

9 Overhead analysis of prior proposals (cont.)
Prior proposals based on cache affinity:
percpu_counter structure in Linux (2006)
Sloppy counter (Boyd-Wickizer et al., OSDI 10)
percpu_ref structure in Linux (2013)
Better counting, worse space, worse query.
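
The cache-affinity tradeoff can be illustrated with a generic per-core counter. This is a single-threaded model with hypothetical names (`PerCoreRefCounter`, `ref`, `unref`, `is_zero`); it is not the API of `percpu_counter`, `percpu_ref`, or the sloppy counter. It assumes one counter slot per core per object, which is where the extra space goes, and a query that has to visit every slot, which is where the extra query cost goes.

```python
class PerCoreRefCounter:
    """One counter slot per core; updates touch only the caller's slot."""

    def __init__(self, num_cores):
        self.local = [0] * num_cores    # space grows with the core count

    def ref(self, core):
        self.local[core] += 1           # cheap: stays in the core's own cache

    def unref(self, core):
        self.local[core] -= 1           # may go negative after a migration

    def is_zero(self):
        # Query must read every core's slot; no single slot is meaningful.
        return sum(self.local) == 0


counter = PerCoreRefCounter(num_cores=4)
counter.ref(core=0)
counter.ref(core=1)
counter.unref(core=3)            # the thread that REFed on core 0 now runs on core 3
snapshot = list(counter.local)   # individual slots: 1, 1, 0, -1
busy = not counter.is_zero()     # one reference (taken on core 1) is still held
counter.unref(core=1)
idle = counter.is_zero()
```

Note how a slot can be negative on its own; only the sum over all cores says whether the object is still referenced.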

10 Overhead analysis of prior proposals (cont.)
Per-core hash (figure: Obj A, Obj B, and Obj C each have a central counter; per-core hashes on four cores hold entries for Obj A, Obj A, Obj B, and Obj C).
Better counting, worse query, worse time.

11 Overhead analysis of prior proposals (cont.)
Prior proposals based on a per-core hash:
Reference counting using Linux RCU (McKenney et al., TR (2013))
RadixVM (Clements et al., EuroSys 13)
OpLog (Boyd-Wickizer, PhD thesis (2014))
ScaleFS (Bhat et al., SOSP 17)
Better counting, worse query, worse time.

12 Overhead analysis of prior proposals (cont.)
Per-core hash (figure: a core's hash holds entries Obj A +1, Obj B +1, Obj C +1, and Obj D +1; the central counters shown read 10 and 0. A +1 for Obj X arrives at the slot held by Obj A, and Obj A's entry is evicted.)

13 Overhead analysis of prior proposals (cont.)
Per-core hash (figure: after the eviction, Obj X +1 occupies the slot that Obj A held, alongside the entries for Obj B, Obj C, and Obj D.)
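
The eviction shown on these slides can be sketched as a fixed-size per-core hash of pending deltas. This is a single-threaded model under stated assumptions: the names `CountedObject`, `PerCoreHash`, `update`, and `flush_all` are hypothetical, the slot is chosen by `obj_id % num_slots`, and an evicted delta is added to the object's central counter. It is not the RefCache implementation.

```python
class CountedObject:
    def __init__(self, obj_id):
        self.obj_id = obj_id
        self.central = 0            # central counter, updated only on flush


class PerCoreHash:
    """Fixed number of slots; each slot caches one object's pending delta."""

    def __init__(self, num_slots):
        self.slots = [None] * num_slots     # each slot is [obj, delta] or None

    def _flush_slot(self, idx):
        obj, delta = self.slots[idx]
        obj.central += delta                # pay the cost of a shared update
        self.slots[idx] = None

    def update(self, obj, delta):
        idx = obj.obj_id % len(self.slots)
        slot = self.slots[idx]
        if slot is not None and slot[0] is not obj:
            self._flush_slot(idx)           # collision: evict the old entry
            slot = None
        if slot is None:
            self.slots[idx] = [obj, delta]
        else:
            slot[1] += delta

    def flush_all(self):
        for idx, slot in enumerate(self.slots):
            if slot is not None:
                self._flush_slot(idx)


obj_a, obj_b, obj_x = CountedObject(0), CountedObject(1), CountedObject(2)
cache = PerCoreHash(num_slots=2)
cache.update(obj_a, +1)
cache.update(obj_b, +1)
cache.update(obj_x, +1)          # same slot as obj_a, so obj_a is evicted
a_after_evict = obj_a.central    # obj_a's delta reached its central counter
x_before_flush = obj_x.central   # obj_x's delta is still only in the hash
cache.flush_all()
```

Until `flush_all` runs, the central counter of `obj_x` does not reflect its reference, which is one way to read the "worse query, worse time" labels on the preceding slides.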

14 Overhead analysis of prior proposals (cont.)
Figure: throughput (Mops/sec) of the per-core hash (RefCache) against the number of shared objects (pages).

15 Summarizing all these...

16 Summarizing all these...

17 Challenges for the space-time tradeoff
Figure: cache affinity and per-core hash compared side by side for objects Obj A through Obj H on four cores, with a central counter at 0.

18 Challenges for the space-time tradeoff (cont.)
Figure: among objects Obj A through Obj H, Obj A has a +1 entry on one core and a -1 entry on another core, while the central counter reads 0.

19 Our solution to this issue...
Anchoring (figure: REF A on Core 0 records Obj A +1 there; UNREF A from another core is applied to the same entry on Core 0, returning it to 0. The central counter reads 0.)

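20 Our solution to this issue...
PayGo: Pay Migration Tax to Homeland
Figure: the entry for Obj A on Core 0 has a non-atomic local counter and an atomic anchor counter, showing +1 and -1. REF A runs on Core 0; UNREF A runs on another core. The central counter reads 0.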

21 Issue for a single local counter
Figure legend: local counter; lock;addl (atomic op.); lock;subl (atomic op.).
Figure: a thread migrates to core 2 and then to core 3.

22 Anchoring in action
Figure legend: addl/subl; local counter / anchor counter; lock;subl (atomic op.).
Figure: a thread migrates to core 2 and then to core 3.
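
The anchoring idea can be illustrated with a small single-threaded model. Everything here is a sketch under assumptions drawn from the slides: REF bumps a non-atomic local counter on the current core and the task remembers that core as the anchor; UNREF on the anchor core is a plain decrement of the local counter, while UNREF after a migration is an atomic decrement of the anchor counter in the anchor core's entry. The names `Entry`, `PayGoModel`, `ref`, `unref`, and `count` are hypothetical, and `atomic_ops` merely counts the operations that would need a `lock` prefix.

```python
class Entry:
    def __init__(self):
        self.local = 0     # updated only by the owning core (addl/subl)
        self.anchor = 0    # updated by migrated threads (lock;subl)


class PayGoModel:
    def __init__(self, num_cores):
        self.tables = [dict() for _ in range(num_cores)]   # per-core hash
        self.atomic_ops = 0

    def ref(self, task_anchors, obj, core):
        entry = self.tables[core].setdefault(obj, Entry())
        entry.local += 1
        # The task records its homeland: the core where the REF happened.
        task_anchors.setdefault(obj, []).append(core)

    def unref(self, task_anchors, obj, core):
        home = task_anchors[obj].pop()
        entry = self.tables[home][obj]
        if home == core:
            entry.local -= 1          # still at home: no atomic operation
        else:
            entry.anchor -= 1         # migrated: pay the tax to the homeland
            self.atomic_ops += 1

    def count(self, obj):
        return sum(t[obj].local + t[obj].anchor
                   for t in self.tables if obj in t)


model = PayGoModel(num_cores=4)
task1, task2 = {}, {}

model.ref(task1, "A", core=0)
model.unref(task1, "A", core=0)          # never migrated
atomics_without_migration = model.atomic_ops

model.ref(task2, "A", core=0)
model.unref(task2, "A", core=2)          # migrated from core 0 to core 2
home_entry = model.tables[0]["A"]
```

In this model a migrated UNREF never creates an entry on the destination core; both the +1 and the -1 live in the entry on the anchoring core, so the entry's net count returns to zero without any merge step.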

23 PayGo (Pay migration tax as you go to other core)
Low counting overhead: local counters scale.
Low space overhead (per-core hash): proportional to the number of CPU cores.
Query overhead is still high: escaping the counting-query tradeoff is beyond the scope of this work.

24 Overhead Analysis for a Reference Counter

25 Data structures in PayGo
PAYGO entry (per-core): object pointer, local counter, anchor counter. The figure annotates the entry with the cache line size (byte).
PAYGO (per-process): the task struct holds anchor info, namely the anchor core IDs.
The local counter increases the local count on a REF operation.
A process records the core ID along with the object pointer when a REF operation is performed.

26 Extending PayGo design to support user-level objects
User mode: user threads call sys_ref(void* obj) or sys_unref(void* obj) on an object.
Kernel mode: REF(obj, pid) or UNREF(obj, pid) performs the referencing or unreferencing between preempt_disable() and preempt_enable().

27 Experimental Setup
Kernel: Linux
CPU: four 24-core Intel Xeon E v4 CPUs
RAM: 1 TiB DDR4 DRAM
Storage: Samsung SM1725 NVMe SSD

28 Scalability Comparison of the Linux Page Cache
Strongly contending workloads: FxMark DRBH workload
Weakly contending workloads: filebench modified fileserver workload

29 Performance Spectrum on Varying Contention Levels

30 Scalability of User-level PayGo (at 96 threads)

31 Conclusion
Designing a scalable reference counting technique requires considering the space-time tradeoff as well as the counting-query tradeoff.
PayGo escapes the space-time tradeoff by using an anchoring technique.
PayGo provides scalable counting and space efficiency, with negligible time delay for reclaiming obsolete hash entries.
