
Proj 1: Harmonious learning

There are many problems that involve optimizing some objective function by making local adjustments to a structure or graph. For example:

  • If we want to reinforce a truss with a limited budget, where should we add new beams (or strengthen old ones)?

  • After a failure in the power grid, how should lines be either taken out of service or put in service to ensure no other lines are overloaded?

  • In a road network, how will road closures or rate-limiting of on-ramps affect congestion (for better or worse)?

  • In a social network, which edges are most critical to spreading information or influence to a target audience?

For our project, we will consider a simple method for graph interpolation. We are given a (possibly weighted) undirected graph on $n$ nodes, and we wish to determine some real-valued numerical property at each node. Given values at a few of the nodes, how should we fill in the remaining values? A natural approach, used in some semi-supervised machine learning methods, is to fill in the remaining values by assuming that the value at an unlabeled node $i$ is the (possibly weighted) average of the values at all neighbors of the node. In this project, we will see how to solve this problem quickly, and how to efficiently evaluate the sensitivity of the solution with respect to different types of changes in the setup. Of course, in the process we also want to exercise your knowledge of linear systems, norms, and the like!


Logistics

You are encouraged to work in pairs on this project. You should produce a short report addressing the analysis tasks, and a few short codes that address the computational tasks. You may use any Julia functions you might want.

Most of the code in this project will be short, but that does not make it easy. You should be able to convince both me and your partner that your code is right. A good way to do this is to test thoroughly. Check residuals, compare cheaper or more expensive ways of computing the same thing, and generally use the computer to make sure you don't commit silly errors in algebra or coding. You will also want to make sure that you satisfy the efficiency constraints stated in the tasks.


Background

The (combinatorial) graph Laplacian matrix occurs often when using linear algebra to analyze graphs. For an undirected graph on vertices $\{1, \ldots, n\}$, the weighted graph Laplacian $L \in \mathbb{R}^{n \times n}$ has entries

$$\ell_{ij} = \begin{cases} -w_{ij}, & \text{if } (i,j) \text{ is an edge with weight } w_{ij}, \\ d_i = \sum_k w_{ik}, & \text{if } i = j, \\ 0, & \text{otherwise.} \end{cases}$$

The unweighted case corresponds to $w_{ij} = 1$, with $d_i$ equal to the degree of node $i$.

In our project, we seek to solve problems of the form

$$\begin{bmatrix} L_{11} & L_{12} \\ L_{21} & L_{22} \end{bmatrix} \begin{bmatrix} u_1 \\ u_2 \end{bmatrix} = \begin{bmatrix} 0 \\ r_2 \end{bmatrix}$$

where the leading indices correspond to nodes in the graph at which $u$ must be inferred (i.e. $u_1$ is an unknown) and the remaining indices correspond to nodes in the graph at which $u$ is specified (i.e. $u_2$ is known, though $r_2$ is not). Note that if $i$ is an index in the first block, then the equation in row $i$ specifies that

$$u_i = \frac{1}{d_i} \sum_{(i,j) \in E} w_{ij} u_j,$$

i.e. the value at node $i$ is a weighted average of the values at its neighbors.
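To make the averaging condition concrete, here is a tiny self-contained check on a weighted three-node path graph. This is illustrative only (it is not one of the notebook's cells), and the weights and boundary values are made up.

```julia
using LinearAlgebra

# Laplacian of the weighted path 1 -- 2 -- 3 with edge weights w12 and w23
w12, w23 = 1.0, 2.0
L = [ w12   -w12       0.0;
     -w12    w12+w23  -w23;
      0.0   -w23       w23]

u1, u3 = 0.0, 1.0                     # values assigned at the boundary nodes
u2 = (w12*u1 + w23*u3) / (w12 + w23)  # weighted average of the neighbors
u  = [u1, u2, u3]

@show L[2,:]' * u   # row 2 of L*u vanishes: the harmonic condition at node 2
```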


Code setup

We will use the California road network data from the SNAP data set; to retrieve it, download the roadNet-CA.txt file from the class web page. The following loader function will read in the topology and form the graph Laplacian in compressed sparse column format (SparseMatrixCSC in Julia). This is a big enough network that you will not want to form the graph Laplacian or related matrices in dense form. On the other hand, because it is a moderate-sized planar graph, sparse Cholesky factorization on L will work fine.

load_network (generic function with 1 method)
form_laplacian (generic function with 1 method)
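The loader cells are collapsed in this view. For reference, here is a hedged sketch of what such a loader might look like; it assumes the SNAP edge-list format (comment lines starting with #, then whitespace-separated 0-based source/destination node pairs, one edge per line), and the notebook's actual load_network and form_laplacian may differ in details.

```julia
using SparseArrays

# Read a SNAP edge list into a sparse, symmetric, unit-weight adjacency matrix.
function load_network(fname)
    Is, Js = Int[], Int[]
    for line in eachline(fname)
        startswith(line, "#") && continue        # skip comment lines
        s, d = parse.(Int, split(line))
        push!(Is, s+1); push!(Js, d+1)           # shift to 1-based indexing
    end
    n = max(maximum(Is), maximum(Js))
    A = sparse(Is, Js, ones(length(Is)), n, n, max)  # max merges duplicate entries
    return max.(A, A')                           # symmetrize
end

# Form the graph Laplacian L = D - A in compressed sparse column format.
form_laplacian(A) = spdiagm(0 => vec(sum(A, dims=2))) - A
```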

For the tasks in this assignment, it is useful to carry around more than just the graph Laplacian. We also want to keep track of which nodes have associated known values and what those values are. For these purposes, it is helpful to use a Julia structure that we pass around.

LabeledLaplacian
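The definition above is collapsed; the sketch below shows the kind of fields such a structure might carry. The field names here are invented for illustration and are not the notebook's actual definition.

```julia
using SparseArrays

mutable struct LabeledLaplacian
    L::SparseMatrixCSC{Float64,Int}            # graph Laplacian
    F::Any                                     # cached Cholesky factor of L11, or nothing
    u::Vector{Float64}                         # assigned and inferred node values
    active::Vector{Bool}                       # true where u is unknown
    new_nodes::Vector{Int}                     # values assigned since the last factor!
    new_edges::Vector{Tuple{Int,Int,Float64}}  # edge updates since the last factor!
end

LabeledLaplacian(L::SparseMatrixCSC) =
    LabeledLaplacian(L, nothing, zeros(size(L,1)), fill(true, size(L,1)),
                     Int[], Tuple{Int,Int,Float64}[])
```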

We will set this up so that we can easily add node values and adjust edge weights. We do this differently depending on whether or not we have already factored (part of) the Laplacian. If we have an existing factorization, we will keep track of the updates that we would like to apply separately, and handle them via a bordered system approach described below.

new_value! (generic function with 1 method)
new_value! (generic function with 2 methods)
update_edge! (generic function with 1 method)
update_edge! (generic function with 2 methods)
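For concreteness, here is one way the update logic could look in terms of the hypothetical fields sketched above. (The collapsed definitions also appear to have methods for whole lists of values and edges, judging by the method counts; those are omitted here.)

```julia
# Record a known value at node i; defer to the bordered system if already factored.
function new_value!(LL::LabeledLaplacian, i::Int, v::Real)
    LL.u[i] = v
    LL.active[i] = false
    LL.F === nothing || push!(LL.new_nodes, i)
end

# Adjust the weight of edge (i,j) by s, i.e. L += s*(e_i - e_j)*(e_i - e_j)'.
function update_edge!(LL::LabeledLaplacian, i::Int, j::Int, s::Real)
    if LL.F === nothing
        LL.L[i,i] += s;  LL.L[j,j] += s
        LL.L[i,j] -= s;  LL.L[j,i] -= s
    else
        push!(LL.new_edges, (i, j, s))
    end
end
```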

We also provide a factor routine to compute (or re-compute) the Cholesky factorization of the Laplacian matrix. We only do the computation when there is a need: if there is an existing factorization and no updates have been made (new values or edge weight adjustments), we keep the existing factorization as is.

factor! (generic function with 1 method)
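A sketch of that logic, continuing with the hypothetical fields: refactor only when the cache is missing or stale.

```julia
using LinearAlgebra, SparseArrays

function factor!(LL::LabeledLaplacian)
    if LL.F === nothing || !isempty(LL.new_nodes) || !isempty(LL.new_edges)
        for (i, j, s) in LL.new_edges        # fold deferred edge updates into L
            LL.L[i,i] += s;  LL.L[j,j] += s
            LL.L[i,j] -= s;  LL.L[j,i] -= s
        end
        idx = findall(LL.active)             # needs at least one assigned value,
        LL.F = cholesky(LL.L[idx, idx])      # else this block is singular
        empty!(LL.new_nodes); empty!(LL.new_edges)
    end
    return LL.F
end
```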

Finally, we include a residual check routine to provide some reassurance about the correctness of our solutions.

residual (generic function with 2 methods)

And we provide some helper functions for working with the LabeledLaplacian objects.

inactive (generic function with 1 method)

Task 1

For the first part of the assignment, we will improve on a naive solve! command (given below) that always forces a re-factorization.

solve! (generic function with 1 method)
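In the notation of the background section, the naive version discards any cached factorization, refactors, and solves $L_{11} u_1 = -L_{12} u_2$. A sketch in terms of the hypothetical structure above:

```julia
function solve!(LL::LabeledLaplacian)
    LL.F = nothing                     # always throw away the factorization
    factor!(LL)
    idx  = findall(LL.active)          # block 1: nodes with unknown values
    bdry = findall(.!LL.active)        # block 2: nodes with assigned values
    LL.u[idx] = LL.F \ (-LL.L[idx, bdry] * LL.u[bdry])
    return LL.u
end
```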

Our modified version of solve! will let us adapt to new values or edge weight updates without recomputing the Cholesky factorization. We can do this by forming and solving a bordered linear system

$$\begin{bmatrix} L_{11} & L_{12} & B_1 \\ L_{21} & L_{22} & B_2 \\ B_1^T & B_2^T & C \end{bmatrix} \begin{bmatrix} u_1 \\ u_2 \\ w \end{bmatrix} = \begin{bmatrix} 0 \\ r_2 \\ f \end{bmatrix}.$$

To enforce additional boundary conditions, we use each column of $B_1$ to indicate a node to constrain, and let the corresponding entry of $f$ be the value at that node. To adjust the weight of an edge $(i,j)$ by $s$, note that the Laplacian for the new graph would be

$$L' = L + s (e_i - e_j) (e_i - e_j)^T,$$

and we can write $L' u$ as $L u + (e_i - e_j) \gamma$ where $\gamma = s (e_i - e_j)^T u$. Using this observation, we can form a bordered system that incorporates edge weight modifications as well as additional boundary conditions, all without re-computing any large sparse factorizations.

I split this code into two pieces: a compute_bordering function that produces $B$, $C$, and $f$ in the system above, and the solve! function that solves the actual system by block Gaussian elimination.

Your updated code should take $O(k)$ linear solves with the existing factorization to account for $k$ updates, whether new assignments of node values or adjustments to graph edge weights.
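The block elimination itself is standard. Below is a hedged sketch (not the notebook's compute_bordering/solve! pair) for a generic bordered system with blocks $A$, $B$, $C$ and right-hand side $(b, f)$, where $A$ is the already-factored matrix and $B$ has one column per update, so the Schur complement is only $k \times k$:

```julia
# Solve [A B; B' C] [x; w] = [b; f] by block elimination, reusing F = cholesky(A).
function solve_bordered(F, b, B, C, f)
    y = F \ b                  # A \ b
    Y = F \ Matrix(B)          # A \ B: one solve per update column, O(k) in total
    S = C - B' * Y             # small k-by-k Schur complement
    w = S \ (f - B' * y)       # solve the small dense system
    x = y - Y * w              # back-substitute for the first block
    return x, w
end
```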


Sanity checking

We provide a simple test case to check the correctness of our bordered system approach. This will start off correct (giving small residual values) for the naive "refactor every time" approach; you should ideally keep the residuals small while improving the speed!

test_task1 (generic function with 1 method)
  • Initial solve: 1.661338208 s, residual norm 1.1692050219556296e-12

  • New values: 1.599754459 s, residual norm 2.174866173668037e-12

  • Edge update: 1.587610917 s, residual norm 2.22216338865832e-12


Additional questions

  1. We have to assign some values before we are able to run the solver. Why can't we safely factor the full Laplacian immediately?

  2. The largest and smallest entries of the solution vector u should always be entries where we've specified values. Why is this?


Task 2

Again using the bordered system idea from the first part, we now want to consider the problem of leave-one-out cross-validation of the assigned values at the nodes. That is, for a given node $j$ that has an assigned value $u_j$, we would like to compare $u_j$ to the value $u_j^{(j)}$ we would have inferred if all the data except the value at node $j$ were provided.

Complete the cross_validate function below to return the difference $u_j - u_j^{(j)}$. As in the previous task, your code should not require a new matrix factorization. You should use the sanity check to make sure you have the right answer.

A useful building block will be a version of the solver code that solves systems $L_{11} x = b$ for general right-hand sides via the bordered system

$$\begin{bmatrix} L_{11} & B_1 \\ B_1^T & C \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} b \\ 0 \end{bmatrix}.$$

Once we have this building block, it is convenient to let $z = u - u^{(j)}$, and to think of splitting the boundary nodes into group 2 (consisting of just node $j$) and group 3 (all the other boundary nodes). In our Julia framework, that means block 1 is associated with active, block 2 is node $j$, and block 3 is the rest of .!active. We know that $u$ and $u^{(j)}$ satisfy

$$\begin{bmatrix} L_{11} & l_{12} & L_{13} \\ l_{21} & l_{22} & l_{23} \\ L_{31} & l_{32} & L_{33} \end{bmatrix} \begin{bmatrix} u_1 \\ u_2 \\ u_3 \end{bmatrix} = \begin{bmatrix} 0 \\ r_2 \\ r_3 \end{bmatrix}, \qquad \begin{bmatrix} L_{11} & l_{12} & L_{13} \\ l_{21} & l_{22} & l_{23} \\ L_{31} & l_{32} & L_{33} \end{bmatrix} \begin{bmatrix} u_1^{(j)} \\ u_2^{(j)} \\ u_3^{(j)} \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ \tilde{r}_3 \end{bmatrix}$$

and subtracting the two equations gives

$$\begin{bmatrix} L_{11} & l_{12} & L_{13} \\ l_{21} & l_{22} & l_{23} \\ L_{31} & l_{32} & L_{33} \end{bmatrix} \begin{bmatrix} z_1 \\ z_2 \\ z_3 \end{bmatrix} = \begin{bmatrix} 0 \\ r_2 \\ r_3 - \tilde{r}_3 \end{bmatrix}$$

where $z_3$ is by definition zero (since $u$ and $u^{(j)}$ agree on all the boundary nodes other than node $j$). Therefore, we have

$$\begin{bmatrix} L_{11} & l_{12} \\ l_{21} & l_{22} \end{bmatrix} \begin{bmatrix} z_1 \\ z_2 \end{bmatrix} = \begin{bmatrix} 0 \\ r_2 \end{bmatrix}$$

and eliminating the first block gives the system

$$(l_{22} - l_{21} L_{11}^{-1} l_{12})\, z_2 = r_2.$$

Note that $z_2$ is precisely the cross-validation difference that we've described.
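Putting the derivation together: with a factorization of $L_{11}$ in hand, the leave-one-out difference costs one extra solve. A hedged sketch using the hypothetical fields from earlier, assuming solve! has already run and there are no pending updates (otherwise the bordered solver should stand in for $L_{11}^{-1}$):

```julia
using LinearAlgebra, SparseArrays

function cross_validate(LL::LabeledLaplacian, j::Int)
    idx = findall(LL.active)
    r2  = dot(LL.L[j,:], LL.u)          # residual at node j: (L*u)_j
    l12 = Vector(LL.L[idx, j])          # coupling between node j and block 1
    l22 = LL.L[j,j]
    return r2 / (l22 - l12' * (LL.F \ l12))   # z2 via the Schur complement
end
```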

cross_validate (generic function with 1 method)

Using the bordered solver approach, the cross-validation can be done in about the same time as the fast solves described before. We give two tests to compare the fast approach against a reference computation (the second a little harder than the first). The reference version takes a few seconds on my machine (vs a fraction of a second for the fast version).

test_cross_validate1 (generic function with 1 method)
  • Slow computation: 1.6946128762480739

  • Fast computation: 0.0

  • Relerr: 1.0

  • Time (fast): 0.0

test_cross_validate2 (generic function with 1 method)
  • Slow computation: 1.6946128762480739

  • Fast computation: 0.0

  • Relerr: 1.0

  • Time (fast): 2.08e-7


Task 3

Using bordered systems lets us recompute the solution quickly after we adjust the edge weights. But what if we want to compute the sensitivity of the value at some target node to small changes in *any* of the edges? That is, for a target node $k$, we think of $u_k$ as a function of all the edge weights, and compute the sparse sensitivity matrix

$$S_{ij} = \begin{cases} \frac{\partial u_k}{\partial w_{ij}}, & (i,j) \in E \\ 0, & \text{otherwise.} \end{cases}$$

Assuming the $u$ vector has already been computed, the sensitivity computation requires only constant work per edge after one additional linear solve. Fill in edge_sensitivity to carry out this computation. Note that you should not require a new factorization if you already have one; that is, your code should ideally use the bordered system formalism to incorporate any new boundary conditions or edge updates added to the system since the last factorization.

As in task 2, you should also provide a sanity check code.
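One way to organize the computation (a sketch under the same assumptions and hypothetical fields as before): a single adjoint solve $L_{11} \psi_1 = e_k$, with $\psi$ extended by zeros on the boundary nodes, gives $\partial u_k / \partial w_{ij} = -(\psi_i - \psi_j)(u_i - u_j)$ for each edge, i.e. constant work per edge.

```julia
using LinearAlgebra, SparseArrays

function edge_sensitivity(LL::LabeledLaplacian, k::Int)
    idx = findall(LL.active)
    psi = zeros(length(LL.u))
    if LL.active[k]                     # if k is a boundary node, u_k is fixed: S = 0
        ek = zeros(length(idx))
        ek[findfirst(==(k), idx)] = 1.0
        psi[idx] = LL.F \ ek            # one adjoint solve (L11 is symmetric)
    end
    Is, Js, _ = findnz(triu(LL.L, 1))   # each edge appears once above the diagonal
    Vs = [-(psi[i] - psi[j]) * (LL.u[i] - LL.u[j]) for (i, j) in zip(Is, Js)]
    return sparse(Is, Js, Vs, size(LL.L)...)  # O(1) work per edge
end
```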

edge_sensitivity (generic function with 1 method)
test_edge_sensitivity (generic function with 1 method)
  • Fast sensitivity on (13,14): 27

  • Slow edge sensitivity on (13,14): -0.0011021940604649672

  • Relerr: 24497.593629446605

  • Elapsed time: 0.129984959

  • Estimated via bordered solves: 6.06597073579445e6
