The Last Mile: An Empirical Study of Timing Channels on seL4
The Last Mile: An Empirical Study of Timing Channels on seL4
David Cock, Qian Ge, Toby Murray, Gernot Heiser
4 November 2014
NICTA Funding and Supporting Members and Partners
Outline
(The Last Mile, Copyright NICTA 2014, David Cock, Qian Ge, Toby Murray, Gernot Heiser)
seL4 is a verified, high-performance microkernel. We have: a proof of functional correctness, a proof of authority confinement, a proof of explicit information-flow control, and WCET analysis.
We don't have: a comprehensive hardware model.
Covert and Side Channels
[Diagram: RSA and AES computations contending via the cache and ALU.]
Unexpected channels invalidate info-flow control. We can't prove their absence: they depend heavily on undocumented chip internals, and are probabilistic.
Channels of Interest
Channels we consider can subvert our proof (timing channels), could be fixed with OS techniques (e.g. cache contention), and can be exploited in software.
Channels we don't consider: physical attacks (e.g. DPA), and channels already excluded by the proof (e.g. storage channels).
An Experimental Corpus
We exploit 2 principal hardware channels: the L2 cache channel, and the bus contention channel (not covered today). The data also suggests 3 more (2 as-yet-unrecognised).
We evaluate 2 cache-channel countermeasures: instruction-based scheduling, and cache colouring.
We also consider countermeasures against remote, algorithmic channels: Lucky-13 against OpenSSL is our example victim; scheduled delivery is our countermeasure.
Experimental Platforms
i.MX31 (ARM1136JF-S, ARMv6)
E6550 (Conroe, x86-64)
DM3730 (Cortex-A8, ARMv7)
AM3358 (Cortex-A8, ARMv7)
i.MX6 (Cortex-A9, ARMv7)
Exynos4412 (Cortex-A9, ARMv7)
7 years and 3 (ARM) core generations; a 32-fold range of L2 cache sizes.
Methodology
Integrated with the nightly regression test, which runs each channel with each countermeasure (54 combinations): 2,000 hours of data over 12 months, 4.3 GiB. Data and analysis tools are open source: timingchannels.pml
Outline
Cache Contention
[Diagram: cache-line conflict on domain switch.]
If you share a core, you share a cache, and hit vs. miss makes a big time difference. Many published attacks steal keys through the cache. We can control cache allocation (more shortly).
Exynos4412 Cache Channel
[Plot: lines evicted (/10^3) vs. lines touched (/10^3).]
32,768 cache lines, 1000 Hz sample rate (preemption). Bandwidth: 2400 b/s. This is our baseline for comparison.
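The prime-and-probe structure behind a measurement like this can be illustrated with a toy simulation. All parameters, the LRU policy, and the address layout below are illustrative assumptions; the real attack measures access latency under preemption, while this sketch counts misses directly:

```python
class SetAssocCache:
    """Toy physically-indexed, set-associative cache with LRU replacement."""
    def __init__(self, n_sets, ways, line_size=32):
        self.n_sets, self.ways, self.line = n_sets, ways, line_size
        self.sets = [[] for _ in range(n_sets)]  # each set: tags, LRU first

    def touch(self, addr):
        """Access one address; return True on hit, False on miss."""
        line_addr = addr // self.line
        idx = line_addr % self.n_sets          # set selector
        tag = line_addr // self.n_sets
        s = self.sets[idx]
        if tag in s:
            s.remove(tag)
            s.append(tag)                      # move to MRU position
            return True
        if len(s) == self.ways:
            s.pop(0)                           # evict LRU line
        s.append(tag)
        return False

# Geometry resembling a 1 MiB, 16-way L2 with 32 B lines (assumed).
cache = SetAssocCache(n_sets=2048, ways=16)
LINES = 2048 * 16                              # 32,768 lines fill the cache
RECEIVER, SENDER = 0x0, 0x4000000              # disjoint address ranges

def prime(base):
    """Receiver fills the whole cache with its own lines."""
    for i in range(LINES):
        cache.touch(base + i * 32)

def probe(base):
    """Count the receiver's misses; probe in reverse order so the
    receiver's own refills evict already-probed lines, not pending ones."""
    return sum(0 if cache.touch(base + i * 32) else 1
               for i in reversed(range(LINES)))

prime(RECEIVER)
secret = 5000                                  # sender encodes a value
for i in range(secret):                        # ...by touching that many lines
    cache.touch(SENDER + i * 32)
evictions = probe(RECEIVER)                    # receiver reads the value back
```

Under these assumptions the receiver recovers the sender's value exactly (`evictions == secret`); on real hardware the readout is noisy, which is why the slides report a bandwidth rather than a perfect count.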
What is Instruction-Based Scheduling (IBS)?
Our example attack used preemption as a clock. If preemption is tied to progress instead, the channel should vanish. This is a form of deterministic execution.
Advantages: applies to any channel; simple to implement (18 lines in seL4).
Disadvantages: restrictive (need to remove all clocks); performance counter accuracy is critical.
Works great on older chips (i.MX31), but...
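The core idea, preempting on a thread's own instruction count rather than on wall-clock time, can be sketched with a toy round-robin scheduler. Names and structure are illustrative, not seL4's implementation; each `yield` stands in for one retired instruction:

```python
def run_instruction_based(threads, budget):
    """Round-robin scheduler that preempts after `budget` 'instructions'
    (generator yields), so each thread's preemption point depends only
    on its own progress, never on elapsed time."""
    trace = []
    ready = list(threads.items())
    while ready:
        name, t = ready.pop(0)
        for _ in range(budget):
            try:
                next(t)                    # execute one instruction
                trace.append(name)
            except StopIteration:
                break                      # thread finished: drop it
        else:
            ready.append((name, t))        # budget exhausted: preempt

    return trace

def worker(n):
    for _ in range(n):
        yield                              # one 'instruction'

trace = run_instruction_based({"A": worker(5), "B": worker(3)}, budget=2)
# A runs 2, B runs 2, A runs 2, B runs 1 (done), A runs 1 (done)
```

The schedule is a pure function of instruction counts, which is why the attack's preemption clock stops ticking; the slides' point is that real performance-counter interrupts are not this precise.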
Exynos4412 Cache Channel with IBS
[Plot: lines evicted (/10^3) vs. lines touched.]
Preempt after 10^5 instructions. Bandwidth: 400 b/s, a 6x reduction: a very poor result. Event delivery is imprecise thanks to speculation, and the attacker can modulate it!
What is Cache Colouring?
[Diagram: frame number and frame offset vs. set selector and line index bits.]
Caches are divided into sets by physical address: the set selector bits of the address choose the set. These bits (usually) overlap the frame number. If these colour bits differ, the frames cannot collide, and the frame number (physical address) is under OS control.
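The bit arithmetic above can be made concrete. A sketch, assuming power-of-two geometry throughout; the 1 MiB, 16-way, 32 B-line example is chosen to resemble a Cortex-A9-class L2, not a statement about any specific chip:

```python
def num_colours(cache_bytes, ways, line_bytes, page_bytes=4096):
    """Number of page colours: 2^(set-selector bits above the page offset).
    Assumes all sizes are powers of two."""
    sets = cache_bytes // (ways * line_bytes)
    set_index_bits = sets.bit_length() - 1        # log2(sets)
    offset_bits = line_bytes.bit_length() - 1     # log2(line size)
    page_offset_bits = page_bytes.bit_length() - 1
    # Set selector = set index + line offset bits; colour bits are the
    # part of the selector the OS controls via frame allocation.
    colour_bits = max(0, set_index_bits + offset_bits - page_offset_bits)
    return 2 ** colour_bits

# 1 MiB, 16-way, 32 B lines -> 2048 sets; the selector covers physical
# address bits [15:5], the 4 KiB page offset covers [11:0], so bits
# [15:12] are colour bits: 16 colours.
print(num_colours(1 << 20, 16, 32))
```

With 16 colours, the initial task can split RAM into 16 pools that provably never contend for the same L2 sets, which is exactly the delegation story on the next slide.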
Colouring in seL4
In seL4, resource allocation is securely delegated: the initial task splits RAM into coloured pools.
Advantages: very few kernel changes; low overhead.
Disadvantages: only applies to the cache channel; relies on internal details of cache operation; doesn't work with large pages; hashed caches break everything.
Exynos4412 Cache Channel, Partitioned
[Plot: lines evicted (/10^3) vs. lines touched (/10^3).]
Bandwidth: 15 b/s, a 160x reduction. Much better, but not perfect. Where does that 15 b/s come from?
Exynos4412 TLB Channel
[Plots: lines touched and stalls per line, each with and without a TLB flush.]
The average rate correlates with TLB misses. Flushing the TLB on switch removes the signal.
Misprediction and the Cycle Counter
[Plot: HPT ticks vs. branch mispredicts and lines evicted.]
The cycle counter is affected by invisible mispredicts: a new (and unexpected) channel. Event delivery is precise; the cycle counter is wrong.
Outline
The Lucky-13 Attack on TLS
AlFardan & Paterson, IEEE S&P 2013. Exploits non-constant-time MAC calculation: an algorithmic side channel, remotely exploitable. We reproduce the distinguishing attack.
OpenSSL 1.0.1c Response Times
[Plot: probability vs. response time (µs) for messages M0 and M1, vanilla OpenSSL.]
Nearby attacker (crossover cable); modified vs. unmodified packet. Distinguishable with 100% probability.
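The distinguishing test itself is simple: collect response times for the two message classes and ask how well a single threshold separates the two empirical distributions. A sketch on synthetic timings; the Gaussian parameters are stand-ins for measured data, not the talk's numbers:

```python
import random

random.seed(1)

def best_threshold_accuracy(a, b):
    """Best accuracy of the classifier 'time < t => class A', maximised
    over thresholds t drawn from the observed samples."""
    best = 0.5
    for t in sorted(a + b):
        acc = (sum(x < t for x in a) + sum(x >= t for x in b)) \
              / (len(a) + len(b))
        best = max(best, acc, 1 - acc)   # either polarity of the rule
    return best

# Synthetic response times (µs): a 30 µs mean shift, small jitter,
# standing in for M0 vs. M1 against the vulnerable implementation.
m0 = [random.gauss(700.0, 2.0) for _ in range(300)]
m1 = [random.gauss(730.0, 2.0) for _ in range(300)]
print(best_threshold_accuracy(m0, m1))   # ~1.0: fully distinguishable
```

An accuracy near 1.0 corresponds to the "distinguishable with 100% probability" result; the later slides report this same statistic shrinking toward the 50% coin-flip floor as countermeasures are applied.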
The Upstream Fix: Constant-Time Code
Fixed in OpenSSL version 1.0.1e: a constant-time padding/MAC check. Now secure (on x86); we tested on ARM, where a small channel remains.
We present an OS-level solution with better performance: lower latency, lower CPU overhead, and no modification of OpenSSL.
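The principle behind a constant-time check (an illustration of the idea, not OpenSSL's actual patch) is that the comparison must touch every byte regardless of where a mismatch occurs, so its running time carries no information about the data:

```python
import hmac

def naive_equal(a, b):
    """Early-exit comparison: running time leaks the position of the
    first mismatching byte."""
    if len(a) != len(b):
        return False
    for x, y in zip(a, b):
        if x != y:
            return False          # early exit: the timing leak
    return True

def ct_equal(a, b):
    """Constant-time comparison: time depends only on the length."""
    if len(a) != len(b):
        return False
    diff = 0
    for x, y in zip(a, b):
        diff |= x ^ y             # accumulate all differences
    return diff == 0

# In practice, prefer the library primitive over rolling your own:
assert ct_equal(b"mac", b"mac") == hmac.compare_digest(b"mac", b"mac")
```

As the slides note, even code written this way can retain a residual channel on some microarchitectures, which motivates the OS-level approach that follows.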
OpenSSL 1.0.1e Response Times
[Plot: probability vs. response time (µs) for M0 and M1, vanilla and constant-time.]
Constant-time implementation: better, but still 62% distinguishable, with a 60 µs (8.1%) latency penalty.
45 What is? in SSL_read handler out SSL_write server Separate OpenSSL and application. Announce packets over IPC. The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
46 What is? in 1 SSL_read handler out SSL_write server Separate OpenSSL and application. Announce packets over IPC. Record arrival time. The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
47 What is? in 1 SSL_read handler out SSL_write C 2 server Separate OpenSSL and application. Announce packets over IPC. Record arrival time. The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
48 What is? in out 1 SSL_read SSL_write 3 handler C 2 server Separate OpenSSL and application. Announce packets over IPC. Record arrival time. The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
49 What is? in out 1 SSL_read SSL_write 3 handler 4 2 server Separate OpenSSL and application. C Announce packets over IPC. Record arrival time. Block response on timer. R The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
50 What is? in out 1 SSL_read SSL_write 3 handler 4 server Separate OpenSSL and application. C Announce packets over IPC. Record arrival time. Block response on timer. 2 R The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
51 What is? in out 1 SSL_read SSL_write 3 handler 4 C 2 server Separate OpenSSL and application. Announce packets over IPC. Record arrival time. Block response on timer. 5 R The Last Mile Copyright NICTA 2014 David Cock, Qian Ge, Toby Murray, Gernot Heiser 25/34
Scheduled Delivery on seL4
We use real-time scheduling to precisely delay messages. This provides an efficient mechanism to enforce a delay policy: see Askarov et al., CCS 2010.
Advantages: uses existing IPC controls, no seL4 modifications; fast and effective (see next slide).
Disadvantages: specific to remote/network attacks; need to wrap (but not modify) the vulnerable component.
Security of Scheduled Delivery
[Plot: probability vs. response time (µs) for vanilla, constant-time, and scheduled-delivery runs.]
More secure: 57% distinguishable. Faster: a 10 µs (1.4%) latency penalty.
Performance of Scheduled Delivery
[Plot: CPU load vs. ingress rate (packets/s) for 1.0.1c, 1.0.1e, and 1.0.1c-sd.]
Lower overhead (2% vs. 11%), but earlier saturation. This is a worst-case benchmark: no server work.
Outline
These channels are real: we managed to exploit every channel we tried, and the bandwidth is high, and growing.
They're getting worse: countermeasures get less effective on newer chips, and new hardware channels appear, e.g. the branch predictor.
But there is hope: resource partitioning (e.g. colouring) is effective, hardware QoS features can be repurposed, and OS-level techniques can help.
An ongoing project at NICTA (Qian Ge) is developing a fully cache-coloured seL4. We use these tools to evaluate the result.
Questions?
Data and tools are open source: TS/timingchannels.pml
Capacity vs. Injected Noise
[Plot: capacity (b) vs. injected noise (b), for uncorrelated and anticorrelated noise.]
Noise injection doesn't scale; increasing determinism is the way to go.
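A back-of-envelope model suggests why noise injection scales so badly. Treating the channel as signal of range S plus injected noise of scale N, with an AWGN-style capacity of roughly log2(1 + S/N) bits per sample (an approximation used purely as an illustration, not the talk's measured curves), the cost of noise grows linearly while the payoff shrinks only logarithmically:

```python
import math

def capacity(signal, noise):
    """AWGN-style capacity approximation, bits per sample."""
    return math.log2(1 + signal / noise)

S = 1000.0
for N in [1, 10, 100, 1000]:
    print(N, round(capacity(S, N), 3))
# Tenfold more noise buys only a few bits each time:
# 1 -> ~10.0 b, 10 -> ~6.7 b, 100 -> ~3.5 b, 1000 -> 1.0 b.
```

Driving capacity toward zero therefore requires noise far larger than the signal itself, whereas removing the timing variation at the source (determinism) eliminates the term entirely.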
1 CIS371 Computer Organization and Design Final Exam Prof. Martin Wednesday, May 2nd, 2012 This exam is an individual-work exam. Write your answers on these pages. Additional pages may be attached (with
More informationEE 4683/5683: COMPUTER ARCHITECTURE
EE 4683/5683: COMPUTER ARCHITECTURE Lecture 6A: Cache Design Avinash Kodi, kodi@ohioedu Agenda 2 Review: Memory Hierarchy Review: Cache Organization Direct-mapped Set- Associative Fully-Associative 1 Major
More informationImproving Cache Performance
Improving Cache Performance Tuesday 27 October 15 Many slides adapted from: and Design, Patterson & Hennessy 5th Edition, 2014, MK and from Prof. Mary Jane Irwin, PSU Summary Previous Class Memory hierarchy
More informationCPS 104 Computer Organization and Programming Lecture 20: Virtual Memory
CPS 104 Computer Organization and Programming Lecture 20: Virtual Nov. 10, 1999 Dietolf (Dee) Ramm http://www.cs.duke.edu/~dr/cps104.html CPS 104 Lecture 20.1 Outline of Today s Lecture O Virtual. 6 Paged
More informationCS510 Operating System Foundations. Jonathan Walpole
CS510 Operating System Foundations Jonathan Walpole Page Replacement Page Replacement Assume a normal page table (e.g., BLITZ) User-program is executing A PageInvalidFault occurs! - The page needed is
More informationMemory hier ar hier ch ar y ch rev re i v e i w e ECE 154B Dmitri Struko Struk v o
Memory hierarchy review ECE 154B Dmitri Strukov Outline Cache motivation Cache basics Opteron example Cache performance Six basic optimizations Virtual memory Processor DRAM gap (latency) Four issue superscalar
More informationCSE 120. Translation Lookaside Buffer (TLB) Implemented in Hardware. July 18, Day 5 Memory. Instructor: Neil Rhodes. Software TLB Management
CSE 120 July 18, 2006 Day 5 Memory Instructor: Neil Rhodes Translation Lookaside Buffer (TLB) Implemented in Hardware Cache to map virtual page numbers to page frame Associative memory: HW looks up in
More informationTopic 18: Virtual Memory
Topic 18: Virtual Memory COS / ELE 375 Computer Architecture and Organization Princeton University Fall 2015 Prof. David August 1 Virtual Memory Any time you see virtual, think using a level of indirection
More informationCS 6354: Memory Hierarchy I. 29 August 2016
1 CS 6354: Memory Hierarchy I 29 August 2016 Survey results 2 Processor/Memory Gap Figure 2.2 Starting with 1980 performance as a baseline, the gap in performance, measured as the difference in the time
More informationShadow: Real Applications, Simulated Networks. Dr. Rob Jansen U.S. Naval Research Laboratory Center for High Assurance Computer Systems
Shadow: Real Applications, Simulated Networks Dr. Rob Jansen Center for High Assurance Computer Systems Cyber Modeling and Simulation Technical Working Group Mark Center, Alexandria, VA October 25 th,
More informationAccelerating Pointer Chasing in 3D-Stacked Memory: Challenges, Mechanisms, Evaluation Kevin Hsieh
Accelerating Pointer Chasing in 3D-Stacked : Challenges, Mechanisms, Evaluation Kevin Hsieh Samira Khan, Nandita Vijaykumar, Kevin K. Chang, Amirali Boroumand, Saugata Ghose, Onur Mutlu Executive Summary
More informationEfficient Memory Integrity Verification and Encryption for Secure Processors
Efficient Memory Integrity Verification and Encryption for Secure Processors G. Edward Suh, Dwaine Clarke, Blaise Gassend, Marten van Dijk, Srinivas Devadas Massachusetts Institute of Technology New Security
More informationMemory Management. Dr. Yingwu Zhu
Memory Management Dr. Yingwu Zhu Big picture Main memory is a resource A process/thread is being executing, the instructions & data must be in memory Assumption: Main memory is infinite Allocation of memory
More informationPick a time window size w. In time span w, are there, Multiple References, to nearby addresses: Spatial Locality
Pick a time window size w. In time span w, are there, Multiple References, to nearby addresses: Spatial Locality Repeated References, to a set of locations: Temporal Locality Take advantage of behavior
More informationAdapted from David Patterson s slides on graduate computer architecture
Mei Yang Adapted from David Patterson s slides on graduate computer architecture Introduction Ten Advanced Optimizations of Cache Performance Memory Technology and Optimizations Virtual Memory and Virtual
More informationAs the amount of ILP to exploit grows, control dependences rapidly become the limiting factor.
Hiroaki Kobayashi // As the amount of ILP to exploit grows, control dependences rapidly become the limiting factor. Branches will arrive up to n times faster in an n-issue processor, and providing an instruction
More informationCourse Administration
Spring 207 EE 363: Computer Organization Chapter 5: Large and Fast: Exploiting Memory Hierarchy - Avinash Kodi Department of Electrical Engineering & Computer Science Ohio University, Athens, Ohio 4570
More informationIntroduction to Operating Systems
Introduction to Operating Systems Lecture 6: Memory Management MING GAO SE@ecnu (for course related communications) mgao@sei.ecnu.edu.cn Apr. 22, 2015 Outline 1 Issues of main memory 2 Main memory management
More informationVirtual Memory #2 Feb. 21, 2018
15-410...The mysterious TLB... Virtual Memory #2 Feb. 21, 2018 Dave Eckhardt Brian Railing 1 L16_VM2 Last Time Mapping problem: logical vs. physical addresses Contiguous memory mapping (base, limit) Swapping
More informationDepartment of Computer Science Institute for System Architecture, Operating Systems Group REAL-TIME MICHAEL ROITZSCH OVERVIEW
Department of Computer Science Institute for System Architecture, Operating Systems Group REAL-TIME MICHAEL ROITZSCH OVERVIEW 2 SO FAR talked about in-kernel building blocks: threads memory IPC drivers
More informationCSE Computer Architecture I Fall 2011 Homework 07 Memory Hierarchies Assigned: November 8, 2011, Due: November 22, 2011, Total Points: 100
CSE 30321 Computer Architecture I Fall 2011 Homework 07 Memory Hierarchies Assigned: November 8, 2011, Due: November 22, 2011, Total Points: 100 Problem 1: (30 points) Background: One possible organization
More informationVirtual Memory Design and Implementation
Virtual Memory Design and Implementation To do q Page replacement algorithms q Design and implementation issues q Next: Last on virtualization VMMs Loading pages When should the OS load pages? On demand
More informationVirtual Memory: Mechanisms. CS439: Principles of Computer Systems February 28, 2018
Virtual Memory: Mechanisms CS439: Principles of Computer Systems February 28, 2018 Last Time Physical addresses in physical memory Virtual/logical addresses in process address space Relocation Algorithms
More informationInstruction Level Parallelism (Branch Prediction)
Instruction Level Parallelism (Branch Prediction) Branch Types Type Direction at fetch time Number of possible next fetch addresses? When is next fetch address resolved? Conditional Unknown 2 Execution
More informationMove back and forth between memory and disk. Memory Hierarchy. Two Classes. Don t
Memory Management Ch. 3 Memory Hierarchy Cache RAM Disk Compromise between speed and cost. Hardware manages the cache. OS has to manage disk. Memory Manager Memory Hierarchy Cache CPU Main Swap Area Memory
More informationMemory Management Ch. 3
Memory Management Ch. 3 Ë ¾¾ Ì Ï ÒÒØ Å ÔÔ ÓÐÐ 1 Memory Hierarchy Cache RAM Disk Compromise between speed and cost. Hardware manages the cache. OS has to manage disk. Memory Manager Ë ¾¾ Ì Ï ÒÒØ Å ÔÔ ÓÐÐ
More informationChapter 6 Caches. Computer System. Alpha Chip Photo. Topics. Memory Hierarchy Locality of Reference SRAM Caches Direct Mapped Associative
Chapter 6 s Topics Memory Hierarchy Locality of Reference SRAM s Direct Mapped Associative Computer System Processor interrupt On-chip cache s s Memory-I/O bus bus Net cache Row cache Disk cache Memory
More informationComplex Pipelines and Branch Prediction
Complex Pipelines and Branch Prediction Daniel Sanchez Computer Science & Artificial Intelligence Lab M.I.T. L22-1 Processor Performance Time Program Instructions Program Cycles Instruction CPI Time Cycle
More informationChapter 8: Memory-Management Strategies
Chapter 8: Memory-Management Strategies Chapter 8: Memory Management Strategies Background Swapping Contiguous Memory Allocation Segmentation Paging Structure of the Page Table Example: The Intel 32 and
More informationVirtual Memory. Motivations for VM Address translation Accelerating translation with TLBs
Virtual Memory Today Motivations for VM Address translation Accelerating translation with TLBs Fabián Chris E. Bustamante, Riesbeck, Fall Spring 2007 2007 A system with physical memory only Addresses generated
More informationRecall: Address Space Map. 13: Memory Management. Let s be reasonable. Processes Address Space. Send it to disk. Freeing up System Memory
Recall: Address Space Map 13: Memory Management Biggest Virtual Address Stack (Space for local variables etc. For each nested procedure call) Sometimes Reserved for OS Stack Pointer Last Modified: 6/21/2004
More informationReplacement Policy: Which block to replace from the set?
Replacement Policy: Which block to replace from the set? Direct mapped: no choice Associative: evict least recently used (LRU) difficult/costly with increasing associativity Alternative: random replacement
More informationReducing Hit Times. Critical Influence on cycle-time or CPI. small is always faster and can be put on chip
Reducing Hit Times Critical Influence on cycle-time or CPI Keep L1 small and simple small is always faster and can be put on chip interesting compromise is to keep the tags on chip and the block data off
More informationPage 1. Goals for Today" TLB organization" CS162 Operating Systems and Systems Programming Lecture 11. Page Allocation and Replacement"
Goals for Today" CS162 Operating Systems and Systems Programming Lecture 11 Page Allocation and Replacement" Finish discussion on TLBs! Page Replacement Policies! FIFO, LRU! Clock Algorithm!! Working Set/Thrashing!
More informationPage 1 ILP. ILP Basics & Branch Prediction. Smarter Schedule. Basic Block Problems. Parallelism independent enough
ILP ILP Basics & Branch Prediction Today s topics: Compiler hazard mitigation loop unrolling SW pipelining Branch Prediction Parallelism independent enough e.g. avoid s» control correctly predict decision
More informationChapter 5. Large and Fast: Exploiting Memory Hierarchy
Chapter 5 Large and Fast: Exploiting Memory Hierarchy Processor-Memory Performance Gap 10000 µproc 55%/year (2X/1.5yr) Performance 1000 100 10 1 1980 1983 1986 1989 Moore s Law Processor-Memory Performance
More informationCS370 Operating Systems
CS370 Operating Systems Colorado State University Yashwant K Malaiya Spring 2018 L20 Virtual Memory Slides based on Text by Silberschatz, Galvin, Gagne Various sources 1 1 Questions from last time Page
More informationOPERATING SYSTEMS CS136
OPERATING SYSTEMS CS136 Jialiang LU Jialiang.lu@sjtu.edu.cn Based on Lecture Notes of Tanenbaum, Modern Operating Systems 3 e, 1 Chapter 5 INPUT/OUTPUT 2 Overview o OS controls I/O devices => o Issue commands,
More informationTopics to be covered. EEC 581 Computer Architecture. Virtual Memory. Memory Hierarchy Design (II)
EEC 581 Computer Architecture Memory Hierarchy Design (II) Department of Electrical Engineering and Computer Science Cleveland State University Topics to be covered Cache Penalty Reduction Techniques Victim
More informationSlide 6-1. Processes. Operating Systems: A Modern Perspective, Chapter 6. Copyright 2004 Pearson Education, Inc.
Slide 6-1 6 es Announcements Slide 6-2 Extension til Friday 11 am for HW #1 Previous lectures online Program Assignment #1 online later today, due 2 weeks from today Homework Set #2 online later today,
More informationCS 153 Design of Operating Systems Winter 2016
CS 153 Design of Operating Systems Winter 2016 Lecture 18: Page Replacement Terminology in Paging A virtual page corresponds to physical page/frame Segment should not be used anywhere Page out = Page eviction
More informationChapter 5 (Part II) Large and Fast: Exploiting Memory Hierarchy. Baback Izadi Division of Engineering Programs
Chapter 5 (Part II) Baback Izadi Division of Engineering Programs bai@engr.newpaltz.edu Virtual Machines Host computer emulates guest operating system and machine resources Improved isolation of multiple
More informationMemory Hierarchy. Maurizio Palesi. Maurizio Palesi 1
Memory Hierarchy Maurizio Palesi Maurizio Palesi 1 References John L. Hennessy and David A. Patterson, Computer Architecture a Quantitative Approach, second edition, Morgan Kaufmann Chapter 5 Maurizio
More informationMo Money, No Problems: Caches #2...
Mo Money, No Problems: Caches #2... 1 Reminder: Cache Terms... Cache: A small and fast memory used to increase the performance of accessing a big and slow memory Uses temporal locality: The tendency to
More informationCOSC 6385 Computer Architecture - Memory Hierarchy Design (III)
COSC 6385 Computer Architecture - Memory Hierarchy Design (III) Fall 2006 Reducing cache miss penalty Five techniques Multilevel caches Critical word first and early restart Giving priority to read misses
More informationMemory management, part 2: outline
Memory management, part 2: outline Page replacement algorithms Modeling PR algorithms o Working-set model and algorithms Virtual memory implementation issues 1 Page Replacement Algorithms Page fault forces
More information