A Unified Theory of Garbage Collection

by Orko Sinha, Michael Maitland March 29, 2022

Introduction

Some programming languages have dynamic memory management. Tracing and reference counting have often been thought of as two seperate techniques to implement garbage collection. However, A Unified Theory of Garbage Collection by Bacon et al. presents a framework that views tracing and reference counting as duals of each other. The paper presents this framework, shows how tracing and reference counting are in fact duals, shows how optimized garbage collectors are hybrids of tracing and reference counting, and develops a cost analysis to determine the time and space tradeoffs of collectors within this design space.

Background

Garbage Collection

Languages like C and C++ take a manual memory management approach, requiring programmers to explicitly allocate and deallocate memory using malloc and free. This approach is extremely error prone and has led to many bugs. Languages such as Java or Python use dynamic memory management where the programming language runtime automatically frees memory that is no longer being used by the programmer. This approach however comes at a runtime cost since the language must be able to keep track of which objects are no longer being used. This has led to the research of high performance garbage collectors. There are two main types of collectors: reference counting collectors and tracing collectors. Until the ideas in this paper were presented, the two can seem like unrelated algorithms that accomplished the same task.

Reference Counting

A reference counting collector keeps metadata that keeps track of how many references there are to each object. Reference counting runs every time a pointer is changed. When a pointer is assigned to point to an object, the count for that object is increased, and when it no longer points to an object the count is decreased. When the count goes to 0, the object can be freed. Low pause time, determinism, predictability, and simplicity are pros of reference counting.

Tracing

Tracing does not keep track of any extra metadata. Tracing does not run every time a pointer is changed. Instead, it runs periodically as determined by the programming language. Although it does not run every time a pointer is changed and does not keep track of metadata, it must perform a graph traversal to identify which nodes are unreachable. Lower space overhead, control over when the collector should run, and cycle collection are pros of tracing.

The Paper

Motivation

Tracing and reference counting collectors each have their pros and cons. Researchers have explored the design space searching for optimizations. This paper presents a formalization of the design space, demonstrating that all garbage collection is a combination of reference counting and tracing. It also provides a way for collectors to perform analysis on the cost of such collectors.

Fix-Point Formulation

Garbage collection is first defined using a fixed point formulation. Later in the paper, it is shown how tracing, reference counting, and hybrids of tracing and reference counting find solutions that satisfy this fixed point formulation. We start out with a characterization of memory:

$V$ is the set of all objects. This includes objects that are still in use, objects that are no longer in use but not yet freed, and objects that have been freed.
$E$ is the multiset of edges in the graph. In other words, the references between one object and another. It is a multi-set because an object can have multiple pointers to another node. Consider the case where object $a$ has fields $f1$ and $f2$ which both point to the same object $b$.
$R$ is the multiset of objects that are roots in the graph.
$p(v)$ where $v \in V$ is the reference count of vertex $v$.

The object graph is the triple $G = <V, E, R>$. From here, garbage collection is given as a fixed point computation.

alt_text

Once reference counts have been assigned, vertices with counts of 0 are reclaimed. It is important to note that this fix point equation is not an algorithm. Instead, tracing, reference counting, and hybrids of the two are algorithms that satisfy this fix point equation.

Algorithmic Duals

One of the key insights of this paper was that tracing and reference counting garbage collection methods are “algorithmic duals” of one another. The paper gives the intuition that tracing opperate on live objects or “matter” while reference counting operates on dead objects or “anti-matter”.

Tracing vs Reference Counting

The tracing garbage collection and reference counting algorithms are shown below. Tracing computes the least fix point and reference counting computes the greatest fix point. The set difference between the two fix point solutions is comprised of cyclical garbage. Cyclical garbage will be discussed in more detail below.

alt_text

Tracing initially sets the reference counts of all objects to 0 and initializes the worklist to be the root set $R$. The scan-by-tracing() function scans through the worklist and increments the reference counts of the objects it encounters, and adds all of the objects referenced by it to the worklist to be recursively processed as well. When this function terminates, the reference counts for live objects will have non-zero counts and all other objects will have a count of zero. From here the sweep-for-tracing() function can free all objects of count zero so their memory can be reused.

Reference counting need not do any initialization work at the start of the algorithm because initialization is handled for each time when an object is allocated. When an object is allocated, its reference count is set to 0. When an object is assigned to a pointer, the reference count of the object is incremented by one since a pointer now has a reference to it, and the object that was previously referenced by that pointer, if any, is added to the worklist to have its count decremented by one since the pointer no longer references it. The scan-by-counting() function processes this worklist of objects who need to have their count decremented by one and recursively adds objects it references to the worklist, similar to the tracing algorithm. The sweep-for-counting() function acts exactly like the one for tracing, as it frees all objects of count zero so their memory can be reused.

It is important to note that when we want to decrement the count of an object in this formalization of reference counting, we delay the actual decrement operation by placing it in a worklist so that it is decremented in the sweep-for-counting() function. In a real world implementation this may seem silly, but it is structured in this way to show the relationship between tracing and reference counting as duals. The actual complexity of the algorithm remains the same between both versions.

Now that tracing and referencing counting algorithms have been presented, it is possible to compare the two. Revisiting the idea of duals, we again acknowledge that tracing increments the reference counts of objects on the worklist whereas RC decrements them. Now, it becomes clear that they are also duals in the manner that tracing checks if the reference count is 1 when deciding whether to add to the worklist and reference counting checks if the reference count is 0 when deciding whether to add to the worklist. Lastly, for tracing the worklist initially included roots of the graph, but for reference counting the worklist contains objects that had a reference removed since the last time the algorithm ran. Tracing begins with an underestimate of counts while reference counting starts with an overestimate and both converge towards a true value.

When comparing the two algorithms side by side, we see how similar the two are. The scan and sweep functions are almost identical except for the duality differences highlighted in the previous paragraph.

Deferred Reference Counting vs Partial Tracing

In making optimizations to the traditional methods of garbage collection such as reference counting and tracing, we have deferred reference counting and its converse, partial tracing. In deferred reference counting we maintain a Zero Count Table (ZCT) which maintains objects with reference count 0. We also save some overhead in not counting mutations to root references. At collection time, elements in the ZCT that are pointed to by roots are removed from the ZCT and the remaining elements are collected. In partial tracing, we count root references and then perform tracing from those root references. Note that partial tracing is not a fast optimization, instead it is brought up to illustrate how we can create converses of tracing or reference counting based algorithms and consider their performance within the design space. This is illustrated in the following figure.

alt_text

Generational Garbage Collection Hybrids

Generational garbage collection is another well researched method of garbage collection. These collectors operate on the assumption that most objects die young. In other words, objects that have been recently allocated have a higher chance of becoming unreachable compared to objects that have been reachable for a long time. To accomplish this the heap is split into two regions: a nursery and a mature space. Objects are allocated into the nursery and at some point objects that survive long enough are moved to the mature space.

The paper presents and compares three generational garbage collector algorithms: tracing generational collection, generational with a reference counted nursery, and generational with a reference counted heap. In a tracing generational collector the roots into the nursery use tracing, the roots into the mature space use tracing, the nursery uses tracing within itself, the mature space uses tracing within itself, and objects in the mature space that reference objects in the nursery use reference counting. In a generational collector with reference counted nursery, we use the same formulation except that the nursery uses reference counting instead of tracing. The advantage of this collector is that cyclic garbage will eventually be collected because the mature space is traced; the disadvantage is that it reference counts young objects, which most likely have a high mutation rate, leading to expensive write barrier operations being performed most frequently. In a generational collector with a reference counted heap, we also use the same formulation except that the mature space is reference counted instead of traced. This has the advantage that mutations in the nursery are not recorded by the write barrier which is less expensive, but the disadvantage that some additional cycle collection mechanism is required for the mature space. The relationship between these collectors is depicted below. It is important to recognize that modifications or different versions of collectors yield different design tradeoffs because they are employing tracing or reference counting differently.

alt_text

Cycle Collection

One of the key disadvantages of reference counting is the lack of cycle collection. The authors propose two solutions: (1) run reference counting without cycle collection, and occasionally run a tracing algorithm to detect cycles, or (2) use reference counting with trial deletion. In this algorithm, objects with suspiciouslly high references are traced to check for cycles.

Multi-Heap Collectors

All of the collectors described so far were analyzed in the scope of a single heap, but with multiple heaps there is more flexibility in how garbage collection runs on each heap. The train algorithm presented shows how we can introduce a more subtle notion of generations as in the generational garbage collectors, and run tracing on a smaller set of references to get shorter pause times. These algorithms are presented in the paper to extend the concept of duality between tracing and renference counting.

Cost Analysis

In the final section of the paper, we get a methodology to analyze garbage collectors in a real-world setting. The cost factors associated with a collector are broken down into

$\kappa (X)$ - The time it takes to run a single garbage collection
$\sigma (X)$ - The space overhead of the collector
$\phi (X)$ - The frequency of collection
$\mu (X)$ - The mutation overhead
$\tau (X)$ - The total time overhead for collection

where $X$ is the collector.

The paper then goes onto computing these factors for various collectors, paramatarized by machine or program specifications.

Conclusions

The paper concludes with some general recommendations to those looking to implement collectors. They state to consider three key decisions in implmenting collectors: how to paritition memory, how to traverse that memory and the trade-offs associated with the partition scheme.

Analysis of Contributions and Impact

The paper itself does not present any new ideas on how to perform garbage collection. Instead the paper introduces a new methodology of how to develop collectors by creating a design space and evaluation method. This methodology is inspired by the idea that reference counting and tracing are algorithmic duals of one another. This key, and very cool, insight is the underlying motivation for the methodology. The paper also seems to be best for an engineer looking to build a garabage collector than a formal method in evaluating collectors. The cost analysis they give is much more practical, using “real” values for evaluation rather than estimates with a Big-O analysis.

The CS 6120 Course Blog

A Unified Theory of Garbage Collection

Introduction

Background

Garbage Collection

Reference Counting

Tracing

The Paper

Motivation

Fix-Point Formulation

Algorithmic Duals

Tracing vs Reference Counting

Deferred Reference Counting vs Partial Tracing

Generational Garbage Collection Hybrids

Cycle Collection

Multi-Heap Collectors

Cost Analysis

Conclusions

Analysis of Contributions and Impact