Chapter 13

One-sided Matchings: House Allocation Problems

In this chapter, mirroring our treatment of voting, we revisit the matching market problem in a setting without money transfers; we refer to these type of problems as one-sided matchings as only nodes on one “side” of the bipartite graph (namely, individuals on the “left”) have preferences over the nodes on the other side (namely, the items on the “right”). These type of problems are often also called house allocation problems. (Later on in chapter 14, we turn to considering also a “two-sided” version of the problem where nodes on both sides have preferences over one another.)

13.1  (One-sided) Matching Contexts

Recall that a matching market frame (n,Y,v) specifies a set [n] of players, a set Y of objects (e.g., “houses”) such that |Y | = n, and a player valuation function v : Y . Recall that from such a matching market frame, we defined a notion of a matching market context (n,Ω,T,v):

As we saw in chapter 10, for every matching market context, we can use the VCG mechanism to DST-implement social value maximization.

In this section, following the voting paradigm discussed in chapter 11, we consider a different type of mechanism design context where players’ types simply specify preference ordering over the possible objects. Additionally, given that items now do not have prices, we restrict the state space Ω to the set of perfect matchings in the complete bipartite graph over [n] Y (i.e., matchings where all players get allocated an item).

More formally, consider a mechanism design context (n,Ω,T,v) where:

We refer to such a context as a (one-sided) matching context (as opposed to a “matching market context”); if it always holds that vi(t,x) > vi(t,x) (i.e., we have strict inequality) when x is higher ranked than x according to t, we refer to the context as a matching context with strict preferences. Furthermore, if for each player i, Ti is the full set of preferences over Y (i.e., all orderings over Y are possible), we refer to the context as a complete matching context.

As with voting, we restrict our attention to payment-free mechanisms: We refer to a payment-free mechanism 𝒱 for a matching context as a matching rule 𝒱, and refer to DST matching rules as strategy-proof matching rules.

13.2  Strategy-proof Matching Rules: Serial Dictatorship

At first one may think that the Gibbard–Satterthwaite theorem directly applies also to strategy-proof matching rules. Indeed, there is a sense in which it does: if players had strict preferences over all of Ω (i.e., the set of all possible allocations M), then by the Gibbard–Satterthwaite theorem, we would directly have that every strategy-proof matching mechanism (over at least 3 possible allocations) must be dictatorial. But, in a matching context, players cannot have strict preferences over the full set of allocations; rather, a player i can only have strict preferences over objects the player get allocated—their utility is independent of what objects are allocated to players i. Thus, a matching context can be viewed as a restricted (as opposed to a complete) social-choice context, and we may thus hope to overcome the Gibbard–Satterthwaite impossibility result.

Indeed, as we now show, a nontrivial (and actually quite useful) mechanism is possible! Given a matching context Γ = (n,Ω,T,v), and given some permutation π of the players [n] (i.e., an ordering of the players), consider the following “serial dictatorship mechanism” 𝒱. Start with player π(1) (the “first dictator”) and let that player be allocated their top choice; next, move on to player π(2) (the “second dictator”), and let that player pick their top unallocated choice, and so on. (See Figure 13.1 for an illustration.)

Figure 13.1: An example of the process of serial dictatorship in a matching between four players and four items. The ordering of dictators in this case is (3,2,4,1); each player’s preferences are shown in the illustration, and at each step the current dictator selects their most preferred remaining choice (highlighted in blue).

We refer to this mechanism as the Serial Dictator Mechanism (SDM) with respect to the ordering π and denote it by SDMπ; this mechanism is sometimes also referred to as the priority mechanism.

Theorem 13.1. For any matching context Γ = (n,Ω,T,v) and any ordering π of the players [n], we have that the SDM with respect to π is strategy-proof for Γ.

Proof. Consider the “first dictator” π(1); clearly this player gets their first choice, so can never gain anything by not truthfully reporting their preferences.

Now consider instead some later dictator π(i) that gets to pick in the ith iteration of the mechanism. By reporting their preferences truthfully this player gets their top choice of the currently unallocated items. By deviating, it can never get an item that has already been allocated (since the mechanism never reallocates items); thus, by deviating, the player can only get allocated an item that is less (or as) desirable.

When preferences are strict, this simple mechanism also produces an allocation that is “optimal” in the sense no subset of the players can improve their utility without hurting some other player:

Definition 13.1. Given a matching context Γ = (n,Ω,T,v), we say that an allocation M Ω is Pareto-optimal with respect to μ T if there does not exist some M Ω such that:

  • For all i [n], vi(μi,M(i)) vi(μi,M(i)) (i.e., every player is at least as well off);
  • There exists some i [n] such that vi(μi,M(i)) > vi(μi,M(i)) (i.e., some player is strictly better off).

Theorem 13.2. For any matching context Γ = (n,Ω,T,v) with strict preferences, any ordering π of the players [n], any μ T, we have that SDMπ(t) is Pareto-optimal with respect to μ.

Proof. Consider some type profile μ, and let a = SDMπ(μ). Consider any matching M where all players i get allocated an item M(i) that they like as much as M(i) (i.e., no player is worse off). As we now show, then it must hold that M = M, which concludes the proof.

First, note that clearly M(π(1)) = M(π(1)) since the first dictator gets their top choice (which is unique by the strict preference assumption). It inductively follows that M(π(i)) = M(π(i)) for every i [n] since dictator i gets it first choice of the currently unallocated items (which again is unique). Thus, M = M.

Note that in the above proof, we crucially rely on the fact that preferences are strict; without this restrictions the SDM may output an allocation that is not Pareto-optimal (show it!).

The SDM mechanism is commonly used in practice: for instance, for office allocations for professors, for the New York City school choice system, university housing allocations, and so on. For all these examples, we typically select the dictatorship order π at random.

13.3  Uniqueness of Serial Dictatorship [Advanced]

Despite all of the above-mentioned the nice features of the SDM, there is something unappealing about it—once the order is fixed, the mechanism is very unfair. A natural question is whether this unfairness is needed. As we shall not see, there is a sense in which it is: We present a variant of the Gibbard–Satterthwaite theorem that shows that the only strategy-proof mechanism satisfying two natural properties—neutrality and non-bossiness—is the SDM:

Clearly, SDM satisfies both of these properties.

We are now ready to state the uniqueness theorem. As for voting contexts, we restrict our attention to complete matching contexts with strict preferences.

Theorem 13.3. Let 𝒱 be a neutral and non-bossy matching rule for a complete matching context with strict preferences (n,Ω,T,v). Then there exists some permutation π over [n] such that 𝒱 = SDMπ.

13.4  Proof of the Uniqueness Theorem [Advanced]

In this section, we prove Theorem 13.3. The proof follows a similar (but actually simpler) structure as the proof of the Gibbard-Satterthwaite theorem. Again, for simplicity of notation, in the remainder of this section, we always have some fixed complete matching context Γ with strict preferences in mind (and as such, whenever we write, for instance, strategy-proof, neutral, or non-bossy, we mean with respect to this matching market context).

We start by defining the analog of monotonicity for matching mechanisms.

Definition 13.2. A matching rule 𝒱 is monotone if for every two preference profiles μ and μ, if 𝒱(μ) = M, and for every player i and every choice y such that M(i) > μiy it is also true that M(i) > μiy, it follows that 𝒱(μ) = M.

In other words, if a matching rule is monotone, then a winning allocation M will continue to win as long as for every player i, all candidates that are dominated by i’s allocation, M(i), continue to be dominated by M(i).

Essentially the same proof that shows that strategy-proof voting rules are monotone shows that strategy-proof and non-bossy matching rules are monotone—the only minor difference is that we need non-bossiness to deal with the fact that player preferences are not strict over the full set of allocations (but rather only strict over each players’ own choices). For convenience, we repeat the proof and highlight (in italics) where non-bossiness is used.

Lemma 13.1. If a strategy-proof matching rule 𝒱 is non-bossy, then it is also monotone.

Proof. Assume for the sake of contradiction that 𝒱 is not monotone; then, there exist profiles of preferences μ and μ so that 𝒱(μ) = M and for every player i and every choice y such that M(i) > μiy it is also true that M(i) > μiy, yet 𝒱(μ)M.

As in the proof of Lemma 12.1, we construct a sequence of “hybrid” preferences μk where we move from μ to μ one player at a time:

  • μik = μi if i > k, and
  • μik = μi if i k.

Thus, μ0 = μ and μn = μ. Now, let k denote the smallest index where 𝒱(μk ) = M but 𝒱(μk+1 )M. Clearly such an index must exist, since 𝒱(μn)M by assumption.

Furthermore, note that the only difference between μk and μk+1 is that player k + 1 is switching from μk+1 to μk+1. By non-bossiness, we thus must have that M(k + 1)M(k + 1); that is, the allocation of player k + 1 changes to some zM(k + 1) when he switches strategies.

Since 𝒱 is strategy-proof, we must have that M(k + 1) > μk+1z, or else, given the preference profile μk, player k + 1 would prefer to report μk+1, which would lead to the outcome z that they strictly prefer to M(k + 1) (by our assumption that preferences are strict). Since by the choice of μ, we have that for every player i and every choice y, if M(i) > μiy then M(i) > μiy, it follows that M(k + 1) > μk+1z.

But, by the same argument, we have that z > μk+1M(k + 1) (or else player k + 1 would prefer to report μk+1 given the preference profile μk+1); this is a contradiction (since preferences are strict).

We are now ready to prove Theorem 13.3.

Proof of Theorem 13.3. Consider some neutral and non-bossy matching rule 𝒱 for n players over some set of objects X = {x1,x2,xn}. We show the existence of a permutation π over [n] such that 𝒱 actually implements SDM with respect to π.

To construct π, consider the “lexicographic” preference profile ν where for every player i, νi = (x1,x2,x3,,xn); that is, they all prefer x1 > x2 > x3 > > xn. Define π as follows: let π(i) = j if 𝒱(ν)(j) = xi; that is, let the player j who receives object i (i.e., xi) in the matching output by 𝒱(ν) be the “ith dictator.” Since every player receives a different object in the matching, this uniquely defines π.

We now show that 𝒱 actually implements SDMπ. Consider any preference profile μ. We aim to show that 𝒱(μ) = SDMπ(μ). Let ν be the preference profile where for every player i, νi = (y1,y2,,yn) where

yi = SDMπ(μ)(π(i)).

That is, yi is the object received by the ith dictator if running SDMπ(μ).

By neutrality (renaming xi to yi), it directly follows that for every i,

𝒱(ν)(π(i)) = y i = SDMπ(μ)(π(i))

and thus,

𝒱(ν) = SDM π(μ).

As we now shall argue, by monotonicity (which follows from Lemma 13.1), it also holds that

𝒱(ν) = 𝒱(μ),

which will conclude the proof.

Consider the allocation M = 𝒱(ν) and for any i, the ith dictator, player π(i). As argued above, this player is currently receiving object M(π(i)) = yi. To use monotonicity (to conclude that 𝒱(ν) = 𝒱(μ)) we need to argue that every object y that is dominated by yi with respect to ν (i.e., objects yj such that j > i) remain dominated by yi with respect to μ.

Assume for contradiction that this is not the case; that is, there exists some yj where j > i that player π(i) prefers to yi with respect to μ. Then, the ith dictator (i.e., player π(i)) in the execution SDMπ(μ) would be assigned yj (since when it was their turn to choose, yj is still unassigned), which contradicts the definition of yi.

Notes

Shapley and Scarf were the first to study one-sided matching problems without payment [SS74]; the model we considered here—which typically is called the housing allocation model—is due to Hylland and Zeckhauser [HZ79]. The serial dictatorship mechanism and the notion of “non-bossiness” were introduced and studied by Satterthwaite and Sonnenschein [SS81]. The impossibility of non-serial dictatorship mechanisms is due to Svensson [Sve99]; our proof, however, is a simplified version of his proof (due to the fact that we have restricted our attention to bipartite graphs where |X| = |Y |).