RAMP: RDMA Migration Platform

Size: px

Start display at page:

Download "RAMP: RDMA Migration Platform"

Kimberly Hopkins
5 years ago
Views:

1 RAMP: RDMA Migration Platform Babar Naveed Memon, Xiayue Charles Lin, Arshia Mufti, Arthur Scott Wesley, Tim Brecht, Kenneth Salem, Bernard Wong, and Benjamin Cassell firstname.lastname@uwaterloo.ca

2 RDMA and RDMA-based Systems What and why? 2

3 RDMA and RDMA-based Systems What and why? What is the right programming model? 3

4 RDMA and RDMA-based Systems What and why? What is the right programming model? Shared Memory FaRM NAM-DB 4

5 Motivation 5

6 Motivation Support Configuration Operations in Loosely Coupled Distributed Systems 6

7 Motivation Support Configuration Operations in Loosely Coupled Distributed Systems Loosely Coupled Applications 7

8 Cluster of Servers Motivation Server 1 Server 2 Server 3 Partition1 Partition4 Partition7 Support Configuration Operations in Loosely Coupled Distributed Systems Partition2 Partition3 Partition5 Partition6 Partition8 Partition9 Loosely Coupled Applications Client 1 Client 2 Client 3 Client n 8

9 Motivation Support Configuration Operations in Loosely Coupled Distributed Systems Cluster of Servers Server 1 Server 2 Server 3 Partition1 Partition2 Partition3 Partition4 Partition5 Partition6 Partition7 Partition8 Partition9 Loosely Coupled Applications Configuration Operations Scale out, scale in or load balance Client 1 Client 2 Client 3 Client n 9

Motivation Support Configuration Operations in Loosely Coupled Distributed Systems Cluster of Servers Server 1 Server 2 Server 3 Partition1 Partition2 Partition3 Partition4 Partition5

10 Motivation Support Configuration Operations in Loosely Coupled Distributed Systems Cluster of Servers Server 1 Server 2 Server 3 Partition1 Partition2 Partition3 Partition4 Partition5 Partition6 Partition7 Partition8 Partition9 Loosely Coupled Applications Configuration Operations Scale out, scale in or load balance Is shared memory overkill? Client 1 Client 2 Client 3 Client n 10

11 Desired Properties for RAMP 11

12 Desired Properties for RAMP On-The-Fly Bulk Data Movement Minimize interference with on-going application workload Particularly at the source 12

13 Desired Properties for RAMP On-The-Fly Bulk Data Movement Minimize interference with on-going application workload Particularly at the source Non-Intrusive Stay out of the way, except during configuration options Avoid shared storage approach Local memory access faster than RDMA 13

14 Desired Properties for RAMP On-The-Fly Bulk Data Movement Minimize interference with on-going application workload Particularly at the source Non-Intrusive Stay out of the way, except during configuration options Avoid shared storage approach Local memory access faster than RDMA Application-Managed Application controls when data moves, and where it moves 14

15 RAMP: The Big Picture 15

16 RAMP: The Big Picture Coordinated memory segments Single reader/writer Contains application data 16

17 RAMP: The Big Picture Coordinated memory segments Single reader/writer Contains application data Segments are migratable 17

18 RAMP: The Big Picture Coordinated memory segments Single reader/writer Contains application data Segments are migratable No serialization/deserialization of application data during migration 18

19 RAMP Functionality 19

20 RAMP Functionality Cluster of Servers Server 1 Server 2 Server 3 Partition1 Partition2 Partition3 Partition4 Partition5 Partition6 Partition7 Partition8 Partition9 Client 1 Client 2 Client 3 Client n 20

21 RAMP Memory Segment Allocation Server 1 Server 2 Server 3 21

22 RAMP Memory Segment Allocation Server 1 Server 2 Server 3 RAMP Arena 22

23 RAMP Memory Segment Allocation RAMP Segment Server 1 Server 2 Server 3 RAMP Arena 23

24 RAMP Memory Segment Allocation RAMP Segment Server 1 Server 2 Server 3 RAMP Arena RAMP Segment 24

25 Using RAMP Segments 25

26 Using RAMP Segments Server 1 Application Data Structures 26

27 Using RAMP Segments Server 1 Application Data Structures Optional RAMP-provided in-segment C++ containers (vectors, maps) using custom memory allocators 27

28 Normal Operation Cluster of Servers Server 1 Server 2 Server 3 Partition1 Partition2 Partition3 Partition4 Partition5 Partition6 Partition7 Partition8 Partition9 Client 1 Client 2 Client 3 Client n 28

29 RAMP Segment Migration (Phase 1) Server 1 Server 2 29

30 RAMP Segment Migration (Phase 1) CONNECT Server 1 Server 2 30

31 RAMP Segment Migration (Phase 1) CONNECT Server 1 Server 2 ACCEPT 31

32 RAMP Segment Migration (Phase 1) CONNECT Server 1 Server 2 ACCEPT Registration has a high latency cost (100 s ms) but segment remains available 32

33 RAMP Segment Migration (Phase 2) Server 1 Server 2 33

34 RAMP Segment Migration (Phase 2) Server TRANSFER 1 Server 2 34

35 RAMP Segment Migration (Phase 2) Server TRANSFER 1 Server Transfers segment 2 ownership (not data) Low latency operation (20 microseconds) 35

36 RAMP Segment Migration (Phase 3) Server 1 Server 2 36

37 RAMP Segment Migration (Phase 3) Server 1 Server 2 PULL 37

38 RAMP Segment Migration (Phase 3) Server 1 Server 2 PULL PULL 38

39 RAMP Segment Migration (Phase 3) Server 1 Server 2 Implemented using one-sided RDMA reads Application managed vs RAMP managed PULL PULL 39

40 RAMP Segment Migration (Phase 4) CLOSE Server 1 Server 2 40

41 RAMP Segment Migration (Phase 4) CLOSE Server 1 Server 2 Clean up 41

42 Segment Migration Performance 42

43 Segment Migration Performance STL map with 8B keys and 128B values in 256MB segment 43

44 Segment Migration Performance STL map with 8B keys and 128B values in 256MB segment Single thread at receiver starts using the map immediately after TRANSFER 44

45 Segment Migration Performance STL map with 8B keys and 128B values in 256MB segment Single thread at receiver starts using the map immediately after TRANSFER Latency of map get operations, as a function of time since TRANSFER 45

46 Segment Migration Performance STL map with 8B keys and 128B values in 256MB segment Single thread at receiver starts using the map immediately after TRANSFER Latency of map get operations, as a function of time since TRANSFER 46

47 Segment Migration Performance Paging first access 40 µs Stop-and-copy first access 310 ms 47

48 Conclusion Overview of RAMP, lightweight support for configuration operations in loosely coupled systems 48

49 Conclusion Overview of RAMP, lightweight support for configuration operations in loosely coupled systems Coordinated allocation of segments, fast ownership transfer, application-managed data movement 49

50 Conclusion Overview of RAMP, lightweight support for configuration operations in loosely coupled systems Coordinated allocation of segments, fast ownership transfer, application-managed data movement In the paper: 50

51 Conclusion Overview of RAMP, lightweight support for configuration operations in loosely coupled systems Coordinated allocation of segments, fast ownership transfer, application-managed data movement In the paper: Many more details 51

52 Conclusion Overview of RAMP, lightweight support for configuration operations in loosely coupled systems Coordinated allocation of segments, fast ownership transfer, application-managed data movement In the paper: Many more details Rcached: memcached-like in-memory k/v store, using RAMP for load balancing 52

53 Feedback The right abstraction for the application Is shared memory abstraction overkill for loosely coupled data intensive applications? 53

54 Thank You 54

55 Rcached Memcached with RAMP based Hash-Maps Ability to migrate partitions 128 partitions hashed across 4 servers 40 million keys (key = 8 Bytes, Value = 128 Byte) 100 closed loop clients Per server latency noted over request windows 55

56 Rcached (2) 56

RAMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications

RAMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications Babar Naveed Memon, Xiayue Charles Lin, Arshia Mufti, Arthur Scott Wesley Tim Brecht, Kenneth Salem, Bernard Wong, Benjamin Cassell