Lecture 6 supplement: Proofs of relationships between inverses and 'jectivity

Here are a collection of proofs of lemmas about the relationships between function inverses and in-/sur-/bijectivity.

This document serves at least two purposes:

  1. These proofs are good examples of what we expect when we ask you to do proofs on the homework.

  2. These theorems are useful, so having a list of them is convenient.

It is a good exercise to try to prove these on your own as well, and to compare your proofs with those given here.

Summary of results

See the lecture notes for the relevant definitions.

Notes on proofs

There are two ways to come up with the proofs below:

  1. Write down the claim, then write down the assumptions, then replace words with their definitions as necessary; the result will often just fall out immediately

  2. Work through a few examples and try to find a common pattern.

Here are the key things to look for in these proofs and to ensure when you write your own proofs:

The Proofs

In the following proofs, unless stated otherwise, f will denote a function from A to B and g will denote a function from B to A. I will also assume that A and B are non-empty; some of these claims are false when either A or B is empty (for example, a function from ∅→B cannot have an inverse, because there are no functions from B→∅).

Claim: f is injective if and only if it has a left inverse.

Proof: We must ( ⇒ ) prove that if f is injective then it has a left inverse, and also ( ⇐ ) that if f has a left inverse, then it is injective.

( ⇒ ) Suppose f is injective. We wish to construct a function g: BA such that g ∘ f = idA. Choose a fixed element c ∈ A (we can do this since A is non-empty). We will define g as follows on an input y:

I claim that for any x, (g ∘ f)(x) = x. Indeed, by the definition of g, since y = f(x) is in the image of f, g(y) is defined by the first rule to be x. Thus g ∘ f = idA.

( ⇐ ) Suppose conversely that f has a left inverse, which we'll call g. We wish to show that f is injective. In other words, we wish to show that whenever f(x) = f(y), that x = y.

Well, if f(x) = f(y), then we know that g(f(x)) = g(f(y)). By definition of g, we have x = g(f(x)) and g(f(y)) = y. Putting this together, we have x = g(f(x)) = g(f(y)) = y as required.

Claim: f is surjective if and only if it has a right inverse.

Proof: As before, we must prove the implication in both directions.

( ⇒ ) Suppose f is surjective. We must define a function g such that f ∘ g = idB. We define g as follows: on a given input y, we know that there is at least one x with f(x) = y (since f is surjective). We choose one such x and define g(y) = x.

With this definition, it is clear that (f ∘ g)(y) = y, so g is a right inverse of f, as required.

( ⇐ ) Suppose that f has a right inverse, and let's call it g. We must show that f is onto, that is, for any y ∈ B, there is some x ∈ A with f(x) = y.

Indeed, if we choose x = g(y), then since g is a right inverse of f, we have f(x) = f(g(y)) = y, as required.

Claim: f is bijective if and only if it has a two-sided inverse.

Invalid Proof ( ⇒ ): Suppose f is bijective. By above, we know that f has a left inverse and a right inverse. Therefore it has a two-sided inverse.

This proof is invalid, because just because it has a left- and a right inverse does not imply that they are actually the same function.

Valid Proof ( ⇒ ): Suppose f is bijective. By above, we know that f has a left inverse and a right inverse. I will prove below that this implies that they must be the same function, and therefore that function is a two-sided inverse of f. (Note: this proof is dangerous, because we have to be very careful that we don't use the fact we're currently proving in the proof below, otherwise the logic would be circular!)

Proof ( ⇐ ): Suppose f has a two-sided inverse g. Since g is a left-inverse of f, f must be injective. Since g is also a right-inverse of f, f must also be surjective. Since it is both surjective and injective, it is bijective (by definition).

Claim: if f has a left inverse (g) and a right inverse (gʹ) then g = gʹ.

Proof: Choose an arbitrary y ∈ B. We must show that g(y) = gʹ(y).

Since gʹ is a right inverse of f, we know that y = f(gʹ(y)). Applying g to both sides of this equation, we see that g(y) = g(f(gʹ(y))). But for any x, g(f(x)) = x. In particular, if we choose x = gʹ(y), we see that


g(y) = g(f(gʹ(y))) = g(f(x)) = x = gʹ(y)

as required.

Claim: The composition of two injective functions f: BC and g: AB is injective.

Proof: We must show that for any x and y, if (f ∘ g)(x) = (f ∘ g)(y) then x = y.

If f(g(x)) = f(g(y)), then since f is injective, we conclude that g(x) = g(y). Then, since g is injective, we conclude that x = y, as required.

Claim: The composition of two surjections f: BC and g: AB is surjective.

Proof: We must show that for any c ∈ C, there exists some a in A with f(g(a)) = c.

Since f is surjective, we know there is some b ∈ B with f(b) = c. Since g is surjective, there must be some a in A with g(a) = b. Thus f(g(a)) = f(b) = c as required.

Claim: The composition of two bijections f and g is a bijection.

Proof: Since f and g are both bijections, they are both surjections. By above, this implies that f ∘ g is a surjection. Similarly, f ∘ g is an injection. Therefore f ∘ g is a bijection.

A note on the axiom of choice

We are using the axiom of choice all over the place in the above proofs. In particular, every time we say "since X is non-empty, we can choose some x ∈ X"