Network Architecture Laboratory

Size: px

Start display at page:

Download "Network Architecture Laboratory"

Joseph Patterson
5 years ago
Views:

1 Automated Synthesis of Adversarial Workloads for Network Functions Luis Pedrosa, Rishabh Iyer, Arseniy Zaostrovnykh, Jonas Fietz, Katerina Argyraki Network Architecture Laboratory

2 Software NFs The good: The flexibility of software The software development cycle The bad: The reliability of software Inconsistent performance The ugly: Adversarial traffic / DoS / Slowdowns 2

3 We need better tools... Dynamic analysis: profiling Reasons about known inputs Helps find root cause / debug Only as good as the inputs used 3

4 We need better tools... Static analysis Reasons about potential inputs in abstract Over-approximating: WCET Under-approximating: adversarial inputs 0 Typical Adversarial MAX WCET Latency (not to scale) 4

5 CASTAN Cycle Approximating Symbolic Timing Analysis for NFs Statically analyze NF Analyze code Generate PCAP file with adversarial workload Exploit The CPU cache hierarchy Algorithmic complexity It works! Increased NF latency up to 3 5

6 Outline Introduction SymbEx in a Nutshell CASTAN Evaluation Conclusion 6

7 SymbEx in a Nutshell Procedure Interpret code with symbolic values 01: int var = input(); // α 02: return var++; // α+1 J. C. King, Symbolic Execution and Program Testing,

8 SymbEx in a Nutshell Procedure Interpret code with symbolic values 01: 02: 03: 04: 05: 06: int var = input(); // α if (var >= 0) { return var; } else { return -var; } 8

9 SymbEx in a Nutshell Procedure Interpret code with symbolic values Fork execution on symbolic conditions Keep track of path constraints 01: 02: 03: 04: 05: 06: int var = input(); // α if (var >= 0) { return var; // α if α>=0 } else { return -var; // -α if α<0 } 9

10 SymbEx in a Nutshell Procedure Interpret code with symbolic values Fork execution on symbolic conditions Keep track of path constraints SMT solver finds concrete inputs 01: 02: 03: 04: 05: 06: int var = input(); // α if (var >= 0) { return var; // α if α>=0, e.g. α=0 } else { return -var; // -α if α<0, e.g. α=-1 } 10

11 SymbEx in a Nutshell Challenges Path Explosion! Typically exponential # of paths / branch Unbounded with loops Impractical to SymbEx exhaustively 11

12 SymbEx in a Nutshell Mitigation Can t do everything: prioritize! Directed Symbolic Execution Prioritize executing relevant paths over others Graph search with heuristic Try to reach a bug / increase coverage / etc. Stop SEE when satisfied (or impatient) 12

13 CASTAN Overview Generate adversarial NF workloads Packet sequence more CPU cycles / packet Under-approximate: not WCET Largely automated 13

14 CASTAN Approach Exploits performance variation 1. CPU cache: +DRAM accesses 2. Algorithmic complexity: +instructions 3. Hashing: reverse to expose internals 14

15 CASTAN Attacking the CPU Cache Symbolic Pointers Index into memory with packet: array[packet.dst_addr] Find packets memory addresses DRAM access CPU Cache Model Simple 1-tier model of the LLC Models contention, associativity, write-back Empirical contention set model 15

16 CASTAN Attacking Algorithmic Complexity Maximize Instructions / Packet Find packets longer code paths Guide SymbEx with a Heuristic Maximize cycles w/o inducing breadth-first-search Estimate cycles / packet Receive Packet Receive Packet 16

17 CASTAN Attacking Algorithmic Complexity CFG Distance Heuristic max(successors)+cost<current> cost = cycles conservatively assuming an L1 hit 17

18 CASTAN Attacking Algorithmic Complexity CFG Distance Heuristic max(successors)+cost<current> cost = cycles conservatively assuming an L1 hit 0 18

19 CASTAN Attacking Algorithmic Complexity CFG Distance Heuristic max(successors)+cost<current> cost = cycles conservatively assuming an L1 hit

20 CASTAN Attacking Algorithmic Complexity CFG Distance Heuristic max(successors)+cost<current> cost = cycles conservatively assuming an L1 hit

21 CASTAN Attacking Algorithmic Complexity Handling Loops Distance vector algorithm Limit repeats to 2 (unrolls loops once) 21

22 CASTAN Attacking Algorithmic Complexity Handling Loops Distance vector algorithm Limit repeats to 2 (unrolls loops once) 0 22

23 CASTAN Attacking Algorithmic Complexity Handling Loops Distance vector algorithm Limit repeats to 2 (unrolls loops once)

24 CASTAN Attacking Algorithmic Complexity Handling Loops Distance vector algorithm Limit repeats to 2 (unrolls loops once)

25 CASTAN Attacking Algorithmic Complexity Handling Loops Distance vector algorithm Limit repeats to 2 (unrolls loops once)

26 CASTAN Handling Hash Functions SymbExing hash functions is hard Complex expression / Path explosion Reason about hash value, without computing it? 26

27 CASTAN Handling Hash Functions SymbExing hash functions is hard Complex expression / Path explosion Reason about hash value, without computing it? Havocing Annotate and disable hash function Assign hash value a new symbol Analyze data structure internals unencumbered Find packet hash value expected behavior 27

28 CASTAN Handling Hash Functions Packets Solve Packets Hash Inputs Reverse Hashes Hashes Solve Hashes 28

29 Evaluation Setup Network Measurement Campaign E2E Latency / Throughput Intel Xeon E5-2667v2 3.3GHz 25.6MB LLC / 32GB RAM Intel 82599ES 10Gb NICs Tester DUT 29

30 Evaluation NFs 11 NF Implementations 3 types, different data structures Unbalanced Tree Red-Black Tree Hash Ring Hash Table Hierarchical Lookup (DPDK) Single Lookup Patricia Trie NAT LB LPM 30

31 Evaluation NFs 11 NF Implementations 3 types, different data structures Unbalanced Tree Red-Black Tree Hash Ring Hash Table Hierarchical Lookup (DPDK) Single Lookup Patricia Trie NAT Algorithmic Complexity LB Cache LPM 31

32 Evaluation Workloads Baseline NOP Adversarial CASTAN (~50 flows), Manual (~50 flows) Random UniRand (1Mflows) Zipf (100kpkts, 6.7kflows) UniRand CASTAN (# flows = CASTAN) 32

33 Evaluation CDF LPM / Single Lookup Table 3 33

34 Evaluation LPM / Single Lookup Table CDF CASTAN induces DRAM accesses 3 Latency 3 UniRand; fewer flows 34

35 Evaluation LPM / Single Lookup Table -19% 35

36 Evaluation CDF NAT / Unbalanced Tree

37 Evaluation NAT / Unbalanced Tree CDF CASTAN skews the tree +70% Latency / -7% Throughput 1.7 Manual; without intuition 37

38 Conclusion CASTAN Attacks complexity, CPU cache, hash functions Little developer input Adversarial Workloads Manual when available > Uniform random for same number of flows Up to +201% latency / -19% throughput 38

39 Find out more! Look for our poster! Get the source and more: 39

40 Backup Slides 40

41 Cache Structure L3 slice L2 line 34 bits 1GB page index 15 bits 3 bits L1d line byte offset 6 bits 6 bits 1GB page offset 41

42 Latency Deviation from NOP 42

43 Throughput 43

44 LPM / Single Lookup Table 44

45 NAT / Unbalanced Tree 45

46 NAT / Hash Ring 46

47 NAT / Red-Black Tree 47

48 NAT / Hash Table 48

49 LPM / Hierarchical Lookup (DPDK) 49

50 LPM / Patricia Trie 50

51 LB / Unbalanced Tree 51

52 LB / Red-Black Tree 52

53 LB / Hash Ring 53

54 LB / Hash Table 54

Performance Contracts for Software Network Functions

Performance Contracts for Software Network Functions Rishabh Iyer, Luis Pedrosa, Arseniy Zaostrovnykh, Solal Pirelli, Katerina Argyraki, and George Candea, EPFL https://www.usenix.org/conference/nsdi19/presentation/iyer