MemPass

by Eashan Garg and Sameer Lal November 14, 2019

While manual control over memory allocation and deallocation lets an experienced programmer significantly improve a program's performance, it also opens the door to a host of memory safety bugs: use after free, double free, out-of-bounds memory accesses, and the like. Detecting these problems is difficult on the programmer's end and leaves room for exploitation by malicious users.

Memory Safety Vulnerabilities

Many memory safety vulnerabilities occur when the user tries to access or deallocate memory that isn't available for use, either because it was freed, or never allocated in the first place. Take for example the following program:

int *ptr;
ptr = (int *)malloc(4 * sizeof(int));
free(ptr);

After allocating 16 bytes (four 4-byte ints), the program stores the block's starting address in ptr and frees the block shortly afterward.

Now, assume that the programmer attempts to access the memory stored at the address in ptr.

int *ptr, *ptr2;
ptr = (int *)malloc(4 * sizeof(int));
ptr[0] = 4;
free(ptr);
ptr2 = (int *)malloc(4 * sizeof(int));
printf("%d", ptr[0]);

Since the block that ptr points to has been freed, the second call to malloc (which requests the same size as the first) is likely to reuse that space. Now, let's assume a malicious attacker managed to store a dangerous payload in this space. Since the program tries to use ptr after it has been freed, it might actually end up accessing the attacker's payload, not the original ptr data as intended!

Double free errors occur when free() is called more than once on the same memory address without an intervening allocation. For instance, consider the following code snippet from our test cases (described in the evaluation section):

#include <stdlib.h>

int main() {
    char *ptr = (char *)malloc(sizeof(char));
    free(ptr);
    free(ptr); /* Double Free Error! */
}

Here, ptr is allocated and then freed twice. Generally, double-freeing a block of main memory will corrupt the state of the memory manager, and could allow a malicious attacker to write arbitrary dangerous code to the memory location that was freed twice.

Memory leaks occur when the programmer forgets to free memory that is no longer needed.

#include <stdlib.h>

int main() {
    char *ptr = (char *)malloc(sizeof(char)); /* never freed: memory leak */
}

The consequence of memory leaks is often degraded performance. Memory leaks reduce the amount of memory available in a system, and if not enough space is available, the program may slow down due to thrashing or stop altogether.

Introducing MemPass

Here, we present an analysis tool (creatively titled MemPass) for LLVM to detect use after free bugs, double free bugs, and memory leakage. Here's how it works.

Once any of the above vulnerabilities are detected, the program will throw a warning to the user, along with the line numbers of the original error to allow the programmer to debug the issue. Upon completion of a program's execution, MemPass will generate a report listing the detected vulnerabilities in the original program.

In practice, we can only detect memory leaks after the program has completely executed, while use-after-free and double-free errors can be detected during runtime and should result in the program halting to prevent subsequent malicious code from executing. We decided to only return warnings for this report to make testing more manageable.

Design Overview

As mentioned at the beginning, there is a large breadth of possible memory safety vulnerabilities. In order to tackle a modest subset of these bugs, our strategy of choice is a simple dynamic analysis pass over the LLVM IR.

MemPass inserts instrumentation after relevant memory allocation instructions, recording the relevant addresses. Any time the program attempts to either access or deallocate memory, MemPass will then check if those addresses are still available for use. If not, the program will throw a warning.

In essence, if the program ever tries to deallocate or access a memory address that isn't currently recorded as allocated in MemPass's hash table, we have a bug!

Instrumenting LLVM IR

To track memory allocations, deallocations, and accesses in LLVM IR, MemPass needs to insert instrumentation after relevant instructions.

While we could insert our own, carefully crafted LLVM instructions every time, we opted to write a runtime library and link it to the main program with our pass (as described here).

In this runtime library, we can write a series of functions to grab relevant data, and then perform the appropriate steps to detect any memory safety vulnerabilities. For each instruction, we log:

Instruction	Logging
`alloca`	size and (stack) pointer address
`malloc`	size and pointer address
`calloc`	size and pointer address
`free`	pointer address
`load`	address to load from
`store`	address to store to

Now, all we have to do is insert a call to one of our library functions after every relevant memory instruction, and the llvm-link tool will do all the heavy lifting!

Tracking Memory Allocation

To better illustrate how MemPass works with a real LLVM program, let's take the following buggy program segment:

int* ptr;
ptr = (int *)malloc(4 * sizeof(int));
free(ptr);
free(ptr);

We see a double free vulnerability, which we want to catch and emit as a warning to the user.

The relevant call to malloc in LLVM IR translates as follows:

%7 = call noalias i8* @malloc(i64 16) #3

To pass the relevant data to our logging functions, we need to grab both the memory address that malloc returned and the amount of memory that was allocated. Luckily, %7 holds the returned address as an i8* (a pointer to an 8-bit integer), and the size is the 64-bit integer operand of the malloc call itself.

Thus, MemPass inserts a call to our logMalloc library function:

%7 = call noalias i8* @malloc(i64 16) #3
call void @logMalloc(i8* %7, i64 16)

Armed with this data, MemPass stores the address and memory size as key-value pairs in a hashtable. This allows us to easily check if an address has already been allocated.
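As a rough illustration of the allocation table, the sketch below implements logMalloc on top of a small open-addressing hash table. Only the name logMalloc comes from the instrumentation above; lookupAlloc, TABLE_CAPACITY, the hashing scheme, and the fixed capacity are assumptions made for this example and are not taken from MemPass's actual runtime library.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define TABLE_CAPACITY 1024  /* hypothetical fixed capacity; must be a power of two */

typedef struct {
    const void *addr;  /* key: start address returned by malloc (NULL = empty slot) */
    size_t size;       /* value: number of bytes allocated */
} AllocEntry;

static AllocEntry alloc_table[TABLE_CAPACITY];
static size_t alloc_count = 0;

/* Map an address to a starting slot. The low bits are dropped because
   malloc results are aligned, so those bits rarely differ. */
static size_t slot_for(const void *addr) {
    return (size_t)(((uintptr_t)addr >> 4) & (TABLE_CAPACITY - 1));
}

/* Called right after malloc: remember the block's address and size. */
void logMalloc(const void *addr, size_t size) {
    if (addr == NULL) return;                  /* failed allocation: nothing to track */
    assert(alloc_count < TABLE_CAPACITY - 1);  /* sketch only: the table never grows */
    size_t i = slot_for(addr);
    while (alloc_table[i].addr != NULL && alloc_table[i].addr != addr)
        i = (i + 1) & (TABLE_CAPACITY - 1);    /* linear probing */
    if (alloc_table[i].addr == NULL) alloc_count++;
    alloc_table[i].addr = addr;
    alloc_table[i].size = size;
}

/* Return the recorded size for addr, or 0 if addr is not tracked.
   (A zero-byte allocation is therefore indistinguishable from "untracked".) */
size_t lookupAlloc(const void *addr) {
    if (addr == NULL) return 0;
    size_t i = slot_for(addr);
    while (alloc_table[i].addr != NULL) {
        if (alloc_table[i].addr == addr) return alloc_table[i].size;
        i = (i + 1) & (TABLE_CAPACITY - 1);
    }
    return 0;
}
```

Calling logMalloc(p, 16) right after p = malloc(16) makes lookupAlloc(p) return 16, while an address that was never logged yields 0.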

Note that this approach will work similarly with any calls to calloc.

Checking Free

Returning to the above example, the two calls to free roughly translate to (with instrumentation, after omitting a few loads and bitcasts):

call void @free(i8* %10) #3
call void @logFree(i8* %10)
call void @free(i8* %12) #3
call void @logFree(i8* %12)

In essence, MemPass sends the addresses that we want to free to our runtime library. Taking the first address in %10, MemPass checks the allocation table to see if it has been allocated. If so, this is a valid attempt to free, and the address is removed from the table.

On the other hand, our second call to free attempts to free %12, which holds the same address as %10. MemPass checks the allocation table again, and since the address is no longer stored there, we have a double free bug. MemPass prints this as a warning to the console and continues searching for vulnerabilities.

While MemPass doesn't handle calls to realloc, it would be quite tractable to do so: just remove the old memory address from the allocation table and add the new address that the function returns (along with its size).
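The free check and the realloc extension described in this section might look like the sketch below. It is an illustration under stated assumptions: a small linear array stands in for the hash table, logFree returns a status code instead of printing a warning so that the result is easy to inspect, and logRealloc is a hypothetical hook that MemPass does not implement.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_LIVE 256  /* hypothetical cap on simultaneously live blocks */

typedef struct { const void *addr; size_t size; } LiveBlock;

static LiveBlock live[MAX_LIVE];  /* linear array standing in for the hash table */
static size_t live_count = 0;

/* Index of addr among the live blocks, or -1 if it is not currently allocated. */
static long find_live(const void *addr) {
    for (size_t i = 0; i < live_count; i++)
        if (live[i].addr == addr) return (long)i;
    return -1;
}

/* Record a new allocation. */
void logMalloc(const void *addr, size_t size) {
    assert(live_count < MAX_LIVE);
    live[live_count].addr = addr;
    live[live_count].size = size;
    live_count++;
}

/* Returns 1 for a valid free, 0 if addr is not live (double or invalid free). */
int logFree(const void *addr) {
    if (addr == NULL) return 1;    /* free(NULL) is a defined no-op in C */
    long i = find_live(addr);
    if (i < 0) return 0;           /* not in the table: this is where a warning goes */
    live[i] = live[--live_count];  /* remove by moving the last entry into the gap */
    return 1;
}

/* Hypothetical realloc hook: retire the old address and record the new one.
   The implementation-defined realloc(ptr, 0) case is ignored in this sketch. */
int logRealloc(const void *old_addr, const void *new_addr, size_t new_size) {
    if (new_addr == NULL) return 1;  /* realloc failed: the old block stays live */
    int ok = (old_addr == NULL) ? 1 : logFree(old_addr);
    logMalloc(new_addr, new_size);
    return ok;
}
```

With this table, the second of two consecutive logFree calls on the same address returns 0, which corresponds to the double free warning described above.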

Use After Free: Accessing invalid memory

One of the more difficult aspects of MemPass's implementation is to find a way to handle accesses to memory after a pointer has been freed. Consider the following program:

int* ptr;
ptr = (int *)malloc(4 * sizeof(int));
free(ptr);
ptr[0] = 4;
return 0;

The naïve solution would be to add instrumentation after every load or store instruction in the LLVM IR, and compare the addresses to our allocation table. However, we run into a series of complications once we look at the actual IR.

%3 = alloca i32, align 4
%4 = alloca i32, align 4
%5 = alloca i8**, align 8
%6 = alloca i32*, align 8
store i32 0, i32* %3, align 4
store i32 %0, i32* %4, align 4
store i8** %1, i8*** %5, align 8
%7 = call noalias i8* @malloc(i64 16) #3
%8 = bitcast i8* %7 to i32*
store i32* %8, i32** %6, align 8
%9 = load i32*, i32** %6, align 8
%10 = bitcast i32* %9 to i8*
call void @free(i8* %10) #3
%11 = load i32*, i32** %6, align 8
%12 = getelementptr inbounds i32, i32* %11, i64 0
store i32 4, i32* %12, align 4
ret i32 0

First, the program accesses more memory than the programmer allocated through calls to malloc. If MemPass compared the address accessed by one of the first store instructions against the allocation table, it wouldn't find the address and would emit a false-positive warning.

In reality, programs also allocate stack frame memory with alloca instructions. To handle these extra allocations, MemPass instruments alloca as well and adds the stack frame addresses and sizes to the allocation table.

However, there's another problem. Some of the load and store instructions use pointers of arbitrary types. If MemPass doesn't know what the pointer types are, it can't pass those addresses to the runtime library for evaluation.

A solution that MemPass employs is to insert bitcast instructions after every load or store instruction, converting the address pointer from its arbitrary type to an i8* (a pointer to an 8-bit integer). Since we're only comparing addresses and not the values stored at those addresses, this works in most cases.

With i8* pointers, all MemPass needs is the size of the memory chunk that a load or store plans to interact with. While this data is not immediately accessible, LLVM provides a handy DataLayout class. After grabbing the type of the element that the original pointer points to, MemPass can extract its size and pass that to our library functions.

Finally, we need to actually check if we are accessing memory that is available as per the allocation table. MemPass takes the difference of the address in question with every pointer address in the allocation table, and compares it to the appropriate sizes. If the address is within the bounds of a chunk of memory, then we are fine. Otherwise, the program will emit a use after free warning, and continue to look for more vulnerabilities.
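The bounds comparison described above can be sketched as follows. The Chunk type and the accessIsValid name are assumptions for illustration; the sketch scans a plain array of tracked chunks and checks whether the accessed byte range fits inside one of them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef struct {
    uintptr_t start;  /* first byte of a tracked chunk */
    size_t size;      /* chunk length in bytes */
} Chunk;

/* Returns 1 if [addr, addr + access_size) lies entirely inside one tracked
   chunk, and 0 otherwise (the case in which a warning would be emitted). */
int accessIsValid(const Chunk *table, size_t n, uintptr_t addr, size_t access_size) {
    assert(table != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        if (addr < table[i].start) continue;       /* address is before this chunk */
        uintptr_t offset = addr - table[i].start;  /* distance into the chunk */
        if (offset < table[i].size && access_size <= table[i].size - offset)
            return 1;
    }
    return 0;
}
```

For a 16-byte chunk starting at address 1000, a 4-byte access at 1012 is accepted, while a 4-byte access at 1014 is rejected because it would run two bytes past the end of the chunk.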

Program Termination

On program termination, we need to check the allocation table for any remaining memory that has not been freed. In order to differentiate stack memory that was allocated with alloca, any memory allocated with malloc is written to a separate file, along with any pointer addresses passed to free. Now, MemPass just searches this file and emits any malloced addresses that were not freed. This is compiled into a final report, listing all of the memory safety vulnerabilities (among double free, use after free, and memory leak) that were detected throughout the execution of this program, along with their line numbers.
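One way to picture the termination check is to replay the recorded malloc and free events in order and report whatever is still live at the end. The sketch below keeps the events in memory rather than in a file, and the HeapEvent type and findLeaks function are hypothetical names introduced for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef enum { EV_MALLOC, EV_FREE } EventKind;

typedef struct {
    EventKind kind;
    uintptr_t addr;
} HeapEvent;

/* Replays the events in order and leaves every address that is still
   allocated at the end in leaked[]. Returns the number of leaked addresses.
   leaked must have room for n entries. */
size_t findLeaks(const HeapEvent *events, size_t n, uintptr_t *leaked) {
    assert(leaked != NULL || n == 0);
    size_t live = 0;
    for (size_t i = 0; i < n; i++) {
        if (events[i].kind == EV_MALLOC) {
            leaked[live++] = events[i].addr;       /* block becomes live */
        } else {
            for (size_t j = 0; j < live; j++) {
                if (leaked[j] == events[i].addr) { /* matching free: retire it */
                    leaked[j] = leaked[--live];
                    break;
                }
            }
        }
    }
    return live;
}
```

Replaying in order matters because allocators reuse addresses: a block that is allocated, freed, and then allocated again at the same address without a second free is still a leak, which a plain "was this address ever freed" search would miss.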

Implementation extras

Another possible implementation scheme we considered was the use of a deallocation hashmap in addition to the allocation hashmap, to store memory that has been freed. This way, MemPass does not need to add instrumentation after alloca instructions; it just needs to store memory addresses allocated with malloc. However, every time memory is allocated or deallocated, MemPass must check addresses in the other map to ensure there are no overlaps.

For both the proposed framework and our current one, a segment tree or interval tree could be useful to store memory bounds as intervals and compare them quickly. However, the overhead of building this tree might not be worth the benefits for small programs.
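To give a feel for the faster lookup, the sketch below uses a simpler stand-in for a tree: chunks kept in an array sorted by start address and searched with binary search. This is not a tree and not part of MemPass. The Interval type and findChunk name are assumptions, and the approach assumes the chunks do not overlap.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef struct {
    uintptr_t start;  /* first byte of the chunk */
    size_t size;      /* chunk length in bytes */
} Interval;

/* Binary search over non-overlapping chunks sorted by start address.
   Returns the index of the chunk containing addr, or -1 if none does. */
long findChunk(const Interval *sorted, size_t n, uintptr_t addr) {
    assert(sorted != NULL || n == 0);
    size_t lo = 0, hi = n;                /* search window is [lo, hi) */
    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;
        if (addr < sorted[mid].start) {
            hi = mid;                     /* addr lies before this chunk */
        } else if (addr - sorted[mid].start < sorted[mid].size) {
            return (long)mid;             /* addr falls inside this chunk */
        } else {
            lo = mid + 1;                 /* addr lies after this chunk */
        }
    }
    return -1;
}
```

Each lookup takes a logarithmic number of comparisons instead of a scan over every entry, but keeping the array sorted makes insertions and removals linear, which is the cost a balanced tree would avoid.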

Another implementation idea that would have been much more effective in hindsight would be to find some way to invalidate pointers after they are freed. This way, MemPass does not need to add instrumentation after every load and store (the program can exit when accessing a specific "invalid" pointer).

Evaluation

When evaluating, we aimed to catch all the use-after-free, double-free, and memory leak errors that we could. We wrote a series of small correctness tests first to verify that our algorithm worked as expected. Then, we selected a series of benchmark tests, checking both how many bugs MemPass was able to catch and the runtime overhead of our instrumentation.

Our testing procedure is as follows:

1. Translate each test file to LLVM IR with clang and the -disable-O0-optnone flag.
2. Run MemPass on the new LLVM IR with the opt tool.
3. Translate the runtime libraries to LLVM IR with clang and the -O3 flag.
4. Use llvm-link to link all files together.
5. Compile this final, linked file with clang and run it with the bash time tool. Log the user time measurements (not the sys or real).

All tests were run on an Intel Core i7-7700HQ CPU @ 2.80GHz with 16GB of RAM, and using Ubuntu on WSL.

Benchmark Testing: Runtime Overhead

In order to evaluate the overhead cost due to MemPass, we instrumented a subset of the LLVM Test Suite (specifically the Stanford Test Suite). Each benchmark was run 5 times, and the average and standard deviation were recorded in the following tables.

Program (Original)	Average (s)	SD (s)
BubbleSort	0.048	0.009
FloatMM	1.220	0.086
IntMM	0.000	0.000
Oscar	0.010	0.000
Perm	0.030	0.000
Queens	0.010	0.000
Quicksort	0.064	0.004
RealMM	0.010	0.000
Towers	0.028	0.009
Treesort	0.106	0.019

Program (MemPass)	Average (s)	SD (s)
BubbleSort	44.834	0.1995
FloatMM	166.354	2.174
IntMM	0.244	0.012
Oscar	1.270	0.016
Perm	2.322	0.186
Queens	8.198	0.252
Quicksort	19.060	0.240
RealMM	0.446	0.013
Towers	2.852	0.279
Treesort	5.948	0.305

Note: Some programs had an almost negligible runtime, which we record as 0.000 in the tables.

MemPass's overhead ranges from a roughly 45x slowdown (RealMM) to over 900x in the case of BubbleSort; IntMM's original runtime is too small to compute a meaningful ratio. This can be attributed to the fact that MemPass adds instrumentation not only after malloc and free calls, but also after every load and store. The benchmarks in question (especially FloatMM, IntMM, and RealMM) do many matrix computations, which blow up the cost of our instrumentation significantly. In hindsight, restricting instrumentation to only malloc and free would significantly reduce the overhead of MemPass. As explained before, we could find some clever way to invalidate pointers once they are freed, thus causing the program to exit when trying to access an invalid location in memory.
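For reference, the slowdown factors follow directly from the two tables by dividing each MemPass average by the corresponding original average. The helper below is a small sketch of that arithmetic (the function name is ours). For example, BubbleSort gives 44.834 / 0.048, or about 934x, and RealMM gives 0.446 / 0.010, or about 44.6x.

```c
#include <assert.h>

/* Ratio of instrumented to original user time. The original time must be
   nonzero, which rules out benchmarks recorded as 0.000 s (such as IntMM). */
double slowdown(double original_s, double instrumented_s) {
    assert(original_s > 0.0);
    return instrumented_s / original_s;
}
```

Because several original runtimes sit near the timer's resolution (0.010 s), the ratios for those benchmarks should be read as rough estimates.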

Benchmark Testing: Correctness

We evaluated the correctness of MemPass on subsets of the Toyota ITC and SARD-100 benchmark tests. The Toyota benchmark tests consist of a family of memory tests, and two test suites that we used are Double Free and Memory Leak tests. The SARD-100 benchmark tests are similar, and we used the cwe-401-memory-leak, cwe-415-double-free and cwe-416-use-after-free suites.

Note: We wanted to use the Invalid Memory Access (Use After Free) test set in Toyota as well, but we kept running into segmentation faults with the tests and struggled to debug them.

In general, the Double Free and cwe-415-double-free benchmark tests consist of normal double free errors, freeing in constant/variable if statements, freeing in a function, freeing in conditional while loops, and freeing in for loops.

The Memory Leak, cwe-401-memory-leak, and cwe-416-use-after-free benchmark tests consist of a series of tests such as allocating memory without freeing, allocating in conditional statements, freeing based on function return values, allocating memory in mutually recursive functions and various branching scenarios.

Benchmark Results

In Detection of Security Vulnerabilities in C Code using Runtime Verification, the authors provide benchmark test results for E-ACSL, Google Sanitizer, and RV-Match on both the Toyota-ITC and SARD-100 test suites.

With the Toyota dataset, all three of these related memory vulnerability detection tools were run on its dynamic memory tests, which look for errors such as Double Free, Memory Leak, and Null Memory, among many others. The numbers below give the percentage of tests in which each tool correctly detected the relevant memory vulnerability.

Defect Type	E-ACSL	Sanitizer	RV-Match
Dynamic Memory Tests	94%	78%	94%

Unfortunately, the above statistics are not that useful, as our LLVM pass is only able to target double free, memory leaks and use-after-free errors. Still, within the Dynamic Memory tests, we considered the Double Free and Memory Leak suites of tests. Double Free contains 12 cases, and Memory Leak contains 18. Over these test suites we achieved the following results:

Dynamic Memory Test	Double Free	Memory Leak
MemPass	91.6%	77.7%

After looking through the failed test cases, we found that the failed double free test and three of the four failed memory leak tests used either static or global variables. Our somewhat sketchy pointer type conversion from earlier seemed to struggle to convert pointers to these variables to i8*, and thus MemPass ended up generating many false positives. With a more involved type-checking/conversion system (or something else we failed to consider), we could likely avoid failing most of these cases.

On the other hand, the paper provided a much more granular breakdown of test results on the SARD-100 test suite. Their results, along with ours, are displayed below:

Non-Memory Defects	E-ACSL	Sanitizer	RV-Match	MemPass
CWE-416: Use After Free	100% (6/6)	100% (6/6)	100% (6/6)	100% (6/6)
CWE-415: Double Free	100% (6/6)	100% (6/6)	67% (4/6)	67% (4/6)
CWE-401: Memory Leak	100% (5/5)	80% (4/5)	60% (3/5)	100% (5/5)

Once again, the double free test cases that we fail are related to casting issues generating false positives.

Note: Our implementation attempts to achieve some sense of completeness, as we only emit a warning if a program tries to interact with memory that was not allocated to begin with. However, this dynamic analysis comes at the cost of soundness, especially since we only interact with one execution path at a time. In addition, due to casting errors we can end up with some false positives in certain cases.

Dynamic vs Static Analysis

While a dynamic analysis is certainly interesting and useful, it's difficult to analyze certain memory bugs such as memory leaks, since they cannot be detected until the program terminates. In addition, a dynamic analysis can only check individual executions of a program, and therefore might miss bugs with programs that have a large number of inputs.

A static analysis that uses some sort of use-def chain would be another interesting way to triage these vulnerabilities, and would relax much of the overhead that our method produces. In addition, it would be able to analyze all possible execution paths at once, and therefore complete a more "sound" analysis of the various bugs that may be present in the program.

The code for MemPass can be found here: https://github.com/splashofcrimson/memPass

The CS 6120 Course Blog