High System-Code Security with Low Overhead

Size: px

Start display at page:

Download "High System-Code Security with Low Overhead"

Oswald Magnus Floyd
5 years ago
Views:

1 High System-Code Security with Low Overhead Jonas Wagner, Volodymyr Kuznetsov, George Candea, and Johannes Kinder École Polytechnique Fédérale de Lausanne Royal Holloway, University of London

2 High System-Code Security? Today s so!ware is dangerous. Example: OpenSSL Overflow in ssl/t1_lib.c:3997 OpenSSL contains 53,073 memory accesses. How to protect them all?

3 Protect all dangerous operations using sanity checks Checks are automatically added at compile time No source code modification is needed if (!isvalidaddress(p)) { reporterror(p); *p = 42; abort(); AddressSanitizer } *p = 42;

4 Problem: Sanity checks cause high performance overhead Tool Avg. Overhead AddressSanitizer (memory errors) 73% SoftBound/CETS (full memory safety) 116% UndefinedBehaviorSanitizer (integer overflows, type errors ) 71% Assertions, code contracts, depends

5 Problem: Sanity checks cause high performance overhead People use checks heavily for testing, but disable them in production Goal: checks in production

6 Insight: Checks are not all equal Most of the overhead comes from a few expensive checks Checks in hot code, each executed many times Most of the protection comes from many cheap checks Checks in cold code

7 Our Approach : ASAP As Safe As Possible Lets users choose their overhead budget (e.g., 5%) Automatically identifies sanity checks in so!ware Analyzes the cost of every check Selects as many checks as fit in the user s budget Most of the overhead comes from a few expensive checks Most of the protection comes from many cheap checks

8 ASAP Insight & Results 100% 80% Fraction of critical operations Sanity level 60% 40% protected by a check 20% 0% 0% 10% 20% 30% 40% 50% 60% Overhead

9 ASAP Insight & Results 100% 80% Sanity level 60% 40% Most protection comes 20% from cheap checks 0% 0% 10% 20% 30% 40% 50% 60% Overhead

10 ASAP Insight & Results 100% 80% A few checks are very expensive Sanity level 60% 40% 20% 0% 0% 10% 20% 30% 40% 50% 60% Overhead

11 ASAP Insight & Results 100% 80% A user on a 5% budget Sanity level 60% 40% can get 87% of the protection 20% 0% 0% 10% 20% 30% 40% 50% 60% Overhead

12 Outline Introduction: What is ASAP? Design Key Algorithms Results Conclusion

13 Design ASAP is built into the compiler Easy to use (set CC and CFLAGS) Compatible & robust (parallel compilation, ) Compiler (LLVM) Identify sanity checks Profiler: measure check costs Optimizer: select maximum set of checks

14 Users use ASAP like a regular compiler that adds checks.cc.h ASAP.exe +.llvm ASAP stores intermediate compiler output

15 Users use ASAP like a regular compiler that adds checks.cc.h ASAP.exe +.llvm ASAP generates a program variant with profiling instrumentation. Users run this to measure check costs. ASAP.llvm Profiler.exe Costs Workload

16 Users use ASAP like a regular compiler that adds checks.cc.h ASAP.exe +.llvm ASAP generates a program variant with profiling instrumentation. Users run this to measure check costs. ASAP.llvm Profiler.exe Costs Workload ASAP uses costs & budget to generate an optimized program ASAP.llvm + Costs + Budget Optimizer.exe

17 Outline Introduction: What is ASAP? Design Key Algorithms Results Conclusion

18 ASAP Identify check instructions (de)activate checks Measure compiler driver Budget check cost IDE integration Quantify protection.exe profiling workload Can users trust ASAP to select checks that use the least CPU cycles?

19 Measure Check Cost... if (!isvalidaddress(p)) { reporterror(p); abort(); } *p = 42;...

20 Measure Check Cost Add profiling counters prof1++; if (!isvalidaddress(p)) { prof2++; reporterror(p); abort(); } prof3++; *p = 42;...

21 Measure Check Cost... prof1++; if (!isvalidaddress(p)) { 1. Add profiling counters 2. Identify check instructions prof2++; reporterror(p); abort(); } prof3++; *p = 42;...

22 Measure Check Cost... prof1++; if (!isvalidaddress(p)) { prof2++; reporterror(p); abort(); } 1. Add profiling counters 2. Identify check instructions 3. Use static model of cycles per instruction 4. Compute cost for check in CPU cycles: prof3++; *p = 42;... X i2check prof(i) cycles(i) Precise cost in CPU cycles

23 ASAP Identify check instructions (de)activate checks Measure compiler driver Budget check cost IDE integration Quantify protection.exe profiling workload How can ASAP detect checks robustly? How are checks switched on or off?

24 // blocksort.c line 349 (bzip2 from SPEC CPU 2006) cc1 = eclass[fmap[i]]; %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %19 = load i32* %3, align 4

25 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 %9, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

26 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 %9, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

27 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 %9, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

28 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 %9, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

29 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 %9, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

30 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 true, label %18, label %10 ; <label>:10 %11 = ptrtoint i32* %3 to i64 %12 = and i64 %11, 7 %13 = add i64 %12, 3 %14 = trunc i64 %13 to i8 %15 = icmp slt i8 %14, %8 br i1 %15, label %18, label %16 ; <label>:18 %19 = load i32* %3, align 4 ; <label>:16 %17 = ptrtoint i32* %3 to i64 call asan_report_load4(i64 %17) #3 call void asm sideeffect "", ""() #3 unreachable

31 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %4 = ptrtoint i32* %3 to i64 %5 = lshr i64 %4, 3 %6 = add i64 %5, 0x %7 = inttoptr i64 %6 to i8* %8 = load i8* %7, align 1 %9 = icmp eq i8 %8, 0 br i1 true, label %18, label %10 ; <label>:18 %19 = load i32* %3, align 4

32 ; <label>:0 %1 = load i32* %fmap_i_ptr, align 4 %2 = zext i32 %1 to i64 %3 = getelementptr inbounds i32* %eclass, i64 %2 %19 = load i32* %3, align 4

33 ASAP Identify check instructions (de)activate checks Measure compiler driver Budget check cost IDE integration Quantify protection.exe profiling workload If ASAP says you re 87% protected, what does this mean?

34 Typically, instrumentation tools try to offer guarantees (e.g., memory safety) Lower overheads from tools with weaker guarantees (e.g., control flow integrity) Practical implication of weaker guarantees?

35 ASAP quantifies protection using the sanity level We would like to know the effective protection level Methodology: measure how many bugs/vulnerabilities are effectively prevented Typically, instrumentation tools try to offer guarantees (e.g., memory safety) Lower overheads from tools with weaker guarantees (e.g., control flow integrity) Practical implication of weaker guarantees?

36 ASAP quantifies protection using the sanity level Experiment 1 We would like to know the effective protection level Methodology: measure how many bugs/vulnerabilities are effectively prevented Source code of Python 2.7 Bug: line that has received a patch between version and 2.7.8

37 ASAP quantifies protection using the sanity level We would like to know the effective protection level Methodology: measure how many bugs/vulnerabilities are effectively prevented Buggy lines covered by checks 100% 80% 60% 40% 20% 0 Effective Protection sanity level 0 20% 40% 60% 80% 100% Sanity level

38 Experiment 2 Experiment 3 Known bugs Vulnerabilities from CVE DB Project Bugs Analyze 145 vulnerabilities from 2014 Python OpenSSL 1 RIPE Benchmarks 10 Memory errors Open source Patch available Error location known All of these are in cold code 83% of these are in cold code Checks in cold code provide real value

39 Outline Introduction: What is ASAP? Design Key Algorithms Results Conclusion

40 Results 120 Overhead for: SPEC benchmarks AddressSanitizer 100% sanity level libquantum lbm soplex mcf namd astar bzip2 milc sphinx3 hmmer gobmk sjeng gcc povray Overhead in %

41 Results 120 CPU cycles saved 100 Overhead for: SPEC benchmarks AddressSanitizer 95% sanity level 0 libquantum lbm soplex mcf namd astar bzip2 milc sphinx3 hmmer gobmk sjeng gcc povray Overhead in %

42 Results 120 Overhead for: SPEC benchmarks AddressSanitizer 90% sanity level Overhead in % libquantum lbm soplex mcf namd astar bzip2 milc sphinx3 hmmer gobmk sjeng gcc povray

43 Results Residual overhead Overhead for: (not due to checks) SPEC benchmarks AddressSanitizer 80% sanity level 0 libquantum lbm soplex mcf namd astar bzip2 milc sphinx3 hmmer gobmk sjeng gcc povray Overhead in %

44 Results 120 Overhead for: SPEC benchmarks AddressSanitizer 80% sanity level Overhead in % libquantum lbm soplex mcf namd astar bzip2 milc sphinx3 hmmer gobmk sjeng gcc povray

45 Conclusion Protect your so$ware! dslab.epfl.ch/proj/asap Run-time checks deliver strong protection at high cost ASAP Identify measure select maximum Budget sanity checks check costs set of checks.exe Most of the overhead comes from a few expensive checks Most of the protection comes from many cheap checks

Lightweight Memory Tracing

Lightweight Memory Tracing Mathias Payer*, Enrico Kravina, Thomas Gross Department of Computer Science ETH Zürich, Switzerland * now at UC Berkeley Memory Tracing via Memlets Execute code (memlets) for