In this section, we see some issues that can arise when computers use the Gauss-Jordan Elimination algorithm, and we learn some of what computers do to compute solutions more efficiently, especially when we want to solve equations \(AX = B\) for several different column matrices \(B\text{.}\)
Computers are very fast at performing computations, but sometimes our brains are still better. For instance, we know that \(0.1 + 0.2 = 0.3\text{,}\) but execute the code below to see what Python says.
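A minimal check along these lines (a sketch of the comparison; the actual cell in the text may differ) is:

print(0.1 + 0.2)          # prints 0.30000000000000004, not 0.3
print(0.1 + 0.2 == 0.3)   # prints False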
Running the code above shows that, in Python, the computer does not get the same answer for 0.1 + 0.2 as for 0.3. Computers perform arithmetic using base 2 numbers, which means that numbers we enter in decimal form, such as \(0.1\text{,}\) must be converted to base 2. Even though 0.1 has a simple decimal (base 10) form, its representation in base 2 is the repeating binary expansion
\begin{align*}
0.1_{10} = 0.000110011001100\ldots_2 = 0.0\overline{0011}_2\text{.}
\end{align*}
To accurately represent this number inside a computer would require infinitely many digits, but a computer can only hold a finite number of digits. Thus, computers are necessarily using an approximation when internally storing most numbers.
Because computers use a finite number of digits to approximate numbers, they are also prone to round off errors. Gauss-Jordan elimination, when applied to an \(n\times n\) matrix, requires approximately \(\frac 23 n^3\) operations of multiplying and adding numbers. The examples we have seen are small enough to do by hand, but applications in computer graphics, machine learning, and many other areas can easily involve hundreds or thousands of variables. If we have a \(1000\times1000\) matrix, performing the Gauss-Jordan algorithm would take roughly a billion operations, since \(\frac 23 (1000)^3 \approx 6.7\times 10^8\text{,}\) and any error introduced in an operation early on could accumulate and compound as we use that erroneous number in the next operation, and the next, and the next.
There are a few things computers do to mitigate the issue of numerical errors. You may have noticed that when we perform Gauss-Jordan elimination by hand, we primarily use only two of the three row operations: scaling and multiply-and-add. We swap rows only during the forward steps, and only if there is a \(0\) in a position we need to use to eliminate the nonzero numbers still appearing below it in the same column.
However, when computers perform the forward steps, they swap rows so that they are using the number with the largest absolute value to eliminate the other entries in the same column. This is called partial pivoting, and it helps to reduce the number and magnitude of round off errors.
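As a rough illustration of that idea (not the exact routine a numerical library uses; the function name and structure here are just for demonstration), forward elimination with partial pivoting might be sketched in Python as:

import numpy as np

def forward_eliminate_partial_pivot(A, b):
    """Reduce the augmented matrix [A | b] to upper triangular form,
    swapping rows so the largest available entry is used as the pivot.
    An illustrative sketch, not production code."""
    M = np.hstack([A.astype(float), b.astype(float).reshape(-1, 1)])
    n = M.shape[0]
    for col in range(n):
        # Partial pivoting: among the rows at or below this one, pick the row
        # whose entry in this column has the largest absolute value, and swap it up.
        pivot_row = col + np.argmax(np.abs(M[col:, col]))
        M[[col, pivot_row]] = M[[pivot_row, col]]
        # Eliminate the entries below the pivot with multiply-and-add operations.
        for row in range(col + 1, n):
            M[row] -= (M[row, col] / M[col, col]) * M[col]
    return M

A = np.array([[0.0, 2.0], [1.0, 1.0]])   # a made-up example needing a swap
b = np.array([1.0, 3.0])
print(forward_eliminate_partial_pivot(A, b))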
Another thing computers do to mitigate operational errors is rewrite the coefficient matrix as a product of matrices which are easier to work with, not unlike in algebra when we solve a quadratic equation by factoring it. We are going to use triangular matrices in this section, but there are many other ways to factor, or decompose, a matrix, and each factorization is useful for different applications and objectives.
A matrix which is both lower triangular and upper triangular must have 0's in every entry below and above the main diagonal. It can have any number, zero or nonzero, in the remaining positions, which are the entries on the main diagonal itself.
Matrices that are both upper and lower triangular therefore have the form
\begin{align*}
\begin{bmatrix} d_1 \amp 0 \amp \cdots \amp 0 \\ 0 \amp d_2 \amp \cdots \amp 0 \\ \vdots \amp \vdots \amp \ddots \amp \vdots \\ 0 \amp 0 \amp \cdots \amp d_n \end{bmatrix}\text{.}
\end{align*}
There is a nonzero entry above the main diagonal, in the first row and third column, and also below the main diagonal in the second row and first column. This matrix is not triangular.
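If we wanted to check triangularity with a computer, one quick illustrative way is to compare a matrix with its upper or lower triangular part using NumPy; the matrix below is a hypothetical one matching the description above:

import numpy as np

M = np.array([[1, 0, 4],
              [2, 3, 0],
              [0, 0, 5]])   # hypothetical: nonzero at row 1, column 3 and at row 2, column 1

print(np.array_equal(M, np.triu(M)))   # False, so M is not upper triangular
print(np.array_equal(M, np.tril(M)))   # False, so M is not lower triangular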
Recall Example 2.5.1 and Example 2.5.4, where we used elementary matrices to row reduce a matrix. In LU-decomposition, we are going to stop after the forward steps and not continue to full reduced row echelon form.
We know we could do this by augmenting and row reducing, or by calculating the inverse of the coefficient matrix. Let's see yet another way to solve this system.
Take the coefficient matrix \(A=\begin{bmatrix} 1 \amp 2 \\ 3 \amp 5 \end{bmatrix}\) and use an elementary matrix to perform the first step in Gauss-Jordan elimination, which is \(-3R_1+R_2 \rightarrow R_2\text{.}\) Then \(E_1=\begin{bmatrix} 1 \amp 0 \\ -3 \amp 1 \end{bmatrix}\) and \(E_1A\) is
\begin{align*}
E_1A=\begin{bmatrix} 1 \amp 0 \\ -3 \amp 1 \end{bmatrix}\begin{bmatrix} 1 \amp 2 \\ 3 \amp 5 \end{bmatrix} = \begin{bmatrix} 1 \amp 2 \\ 0 \amp -1 \end{bmatrix}\text{.}
\end{align*}
Notice that \(E_1\) is lower-triangular and \(E_1A\) is upper-triangular. Define \(U=E_1A=\begin{bmatrix} 1 \amp 2 \\ 0 \amp -1 \end{bmatrix}\text{.}\)
In our work in Section 2.5 as a whole and in Activity 2.5.5 in particular, we saw that every elementary matrix is invertible. In fact, \(E_1^{-1}=\begin{bmatrix} 1 \amp 0 \\ 3 \amp 1 \end{bmatrix}\text{,}\) which is also lower-triangular. Define \(L=E_1^{-1}\text{.}\)
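As a quick sanity check (a small NumPy sketch, separate from the text's own computations), we can verify that this \(L\) and \(U\) multiply back to \(A\text{:}\)

import numpy as np

A = np.array([[1, 2], [3, 5]])
L = np.array([[1, 0], [3, 1]])   # E_1 inverse, lower triangular
U = np.array([[1, 2], [0, -1]])  # E_1 A, upper triangular

print(L @ U)                      # reproduces A
print(np.array_equal(L @ U, A))   # True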
How has this helped us? A linear system with a triangular coefficient matrix does not need to be row reduced any further. It is more efficient to solve such a system with substitution, coming full-circle to what we did back in Section 1.1 before we ever learned about elimination and row operations.
We are trying to solve the linear system \(AX=B\text{,}\) and we have seen how to rewrite \(A\) as \(LU\text{,}\) so we are now trying to solve \(LUX=B\text{.}\) We know that \(U\) is a \(2\times 2\) matrix and \(X\) is a \(2\times 1\) matrix. That means that \(UX\) is also a \(2\times 1\) matrix. For the moment, let's define \(Y=UX=\begin{bmatrix} y_1 \\ y_2 \end{bmatrix}\text{,}\) so that the system we're trying to solve is \(LY=B\text{.}\)
To see how this is easier to solve, look at the first row of \(LY=B\text{.}\) We see that \(y_1+0y_2=-1\text{,}\) which means that \(y_1=-1\text{.}\) The equation from the second row is
Why would we want to do this when we already have multiple other ways to solve a linear system? For small systems, we wouldn't. However, computers tend to use LU-decomposition when you ask them to solve matrix systems, for the following reason. Suppose we had a \(1000\times 1000\) matrix \(A\) and needed to solve two systems \(AX=B_1\) and \(AX=B_2\text{.}\) Solving the first system would take about a billion operations, whether we used Gauss-Jordan elimination or LU-decomposition. But if we used LU-decomposition and already had \(A=LU\text{,}\) then solving the second system would only take about a million more operations, instead of another billion to perform all of Gauss-Jordan elimination again on the second system. That is a significant savings of time and memory, and there are also orders of magnitude fewer opportunities for numerical errors to propagate.
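In Python, that workflow might look roughly like the following SciPy sketch; the matrix and right-hand sides here are made up for illustration:

import numpy as np
from scipy.linalg import lu_factor, lu_solve

A = np.array([[1.0, 2.0], [3.0, 5.0]])
b1 = np.array([1.0, 2.0])   # hypothetical right-hand sides
b2 = np.array([4.0, 7.0])

# Factor A once (with partial pivoting) ...
lu, piv = lu_factor(A)

# ... then solve A x = b for as many right-hand sides as we like;
# each solve costs far less than refactoring A from scratch.
x1 = lu_solve((lu, piv), b1)
x2 = lu_solve((lu, piv), b2)
print(x1, x2)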
We could perform one more row operation to scale the third row and put a 1 in the last position, but this is already an upper triangular matrix so we can just stop here and then
If a square matrix doesn't require any interchanging of rows to put it into reduced row echelon form, then it can be written as the product of a lower triangular matrix \(L\) and an upper triangular matrix \(U\text{.}\)
Use elementary matrices \(E_1, E_2, \ldots, E_k\) to perform the forward steps of Gauss-Jordan elimination. This means you'll have something of the form \(E_k\cdots E_2E_1A=U\text{,}\) where \(U\) is upper triangular.
Elementary matrices which perform the forward steps of Gauss-Jordan elimination (scale or multiply-and-add) are all lower-triangular, and so are their inverses. The product of lower triangular matrices is still lower triangular.
To solve \(AX=B\text{,}\) we replace \(A\) with \(LU\) and then \(UX\) with \(Y\text{.}\) We can then solve the simpler systems \(LY=B\) and \(UX=Y\) with substitution.
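To make the two substitution passes concrete, here is an illustrative Python sketch; the helper names and the right-hand side are hypothetical, and \(L\) and \(U\) are the ones from the \(2\times 2\) example above:

import numpy as np

def solve_lower(L, b):
    """Forward substitution for L y = b, with L lower triangular. Illustrative sketch."""
    n = len(b)
    y = np.zeros(n)
    for i in range(n):
        y[i] = (b[i] - L[i, :i] @ y[:i]) / L[i, i]
    return y

def solve_upper(U, y):
    """Back substitution for U x = y, with U upper triangular. Illustrative sketch."""
    n = len(y)
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):
        x[i] = (y[i] - U[i, i+1:] @ x[i+1:]) / U[i, i]
    return x

L = np.array([[1.0, 0.0], [3.0, 1.0]])
U = np.array([[1.0, 2.0], [0.0, -1.0]])
b = np.array([1.0, 2.0])     # a hypothetical right-hand side
y = solve_lower(L, b)        # first solve L y = b
x = solve_upper(U, y)        # then solve U x = y
print(x, L @ U @ x)          # L U x reproduces b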
The forward steps of Gauss-Jordan elimination are \(-R_1+R_3 \rightarrow R_3\text{,}\) then \(\frac{1}{2}R_2 \rightarrow R_2\text{,}\) then \(-2R_2+R_3 \rightarrow R_3\text{.}\)
---
Using elementary matrices to perform the row operations means that \(E_3E_2E_1A=U\text{,}\) where \(E_1\text{,}\) \(E_2\text{,}\) and \(E_3\) are the elementary matrices for the three row operations above, applied in that order.
\begin{align*}
L_1=\begin{bmatrix} a \amp 0 \amp 0 \\ b \amp c \amp 0 \\ d \amp e \amp f \end{bmatrix}, \amp \amp \amp \amp L_2=\begin{bmatrix} g \amp 0 \amp 0 \\ h \amp i \amp 0 \\ j \amp k \amp l \end{bmatrix} \text{.}
\end{align*}
Calculate the product \(L_1L_2\) (or enough of it) and explain why \(L_1L_2\) is lower triangular.
Recall that we defined \(L\) in terms of elementary matrices, which are invertible, but all we've said about \(U\) is that it is the result of applying some row operations.
A matrix obtained from the identity matrix by swapping rows some number of times is called a permutation matrix, and it is always invertible: its inverse performs the same swaps in the reverse order. When computers solve \(AX=B\text{,}\) they often perform a permutation \(P\) and find an LU-decomposition of \(PA\text{,}\) so that they are solving \(PAX=PB\) by solving \(LUX=PB\text{.}\)
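For comparison with the Sage computation asked for below, here is roughly how one could inspect such a factorization in Python with SciPy (a sketch with a made-up matrix; note that scipy.linalg.lu returns \(P\text{,}\) \(L\text{,}\) \(U\) with \(A = PLU\text{,}\) so the permutation applied to \(A\) in the sense above is \(P^{-1}=P^T\)):

import numpy as np
from scipy.linalg import lu

A = np.array([[0.0, 2.0, 1.0],
              [1.0, 1.0, 0.0],
              [2.0, 4.0, 3.0]])        # a made-up matrix for illustration

P, L, U = lu(A)                        # SciPy's convention: A = P @ L @ U
print(np.allclose(A, P @ L @ U))       # True

# In the notation above, the permutation applied to A is the inverse of P,
# which for a permutation matrix is just its transpose:
print(np.allclose(P.T @ A, L @ U))     # True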
Solve the system in Exercise 5 above again, this time by using Sage to calculate \(P,L\text{,}\) and \(U\text{.}\) Then by hand, calculate \(PB\) and use substitution to solve \(LY=PB\) and \(UX=Y\text{.}\) Verify that you get the same answer you got before.
Computers are fast and can store lots of information, but they are prone to numerical errors due to binary representations and the use of finitely many digits. We can mitigate the effect of numerical errors using partial pivoting and by reducing the number of operations needed where errors could spread.
Every square matrix which doesn't need any row swap operations to perform Gauss-Jordan elimination can be expressed as the product of a lower triangular matrix with an upper triangular matrix.