Detailed Notes on Numerical Computation (3): Solving Nonlinear Systems of Equations

The linear system of equations (LSEs):
$(I)\left\{ \begin{aligned} E_1: & a_{11}x_1+a_{12}x_2+...+a_{1n}x_n=b_1 \\ E_2: & a_{21}x_1+a_{22}x_2+...+a_{2n}x_n=b_2 \\ ... & \\ E_n: & a_{n1}x_1+a_{n2}x_2+...+a_{nn}x_n=b_n \end{aligned} \right.$

6.1.2 Operations of LSEs

Scalar multiplication

Equation $E_i$ can be multiplied by any nonzero constant $\lambda$ , with the result used in place of $E_i$ , denoted by
$(\lambda E_i)\rightarrow E_i$

Multiply-and-add

Equation $E_j$ can be multiplied by any nonzero constant $\lambda$ and added to equation $E_i$ , with the result used in place of $E_i$ , denoted by
$(\lambda E_j+E_i)\rightarrow E_i$

Interchange (transposition)

Equations $E_i$ and $E_j$ can be interchanged, denoted by
$E_i \leftrightarrow E_j$

6.1.3 Augmented Matrix

$\tilde{A}=[A,\textbf{b}]= \left ( \begin{array}{c:c} \begin{matrix} a_{11}&a_{12}&...&a_{1n}\\ a_{21}&a_{22}&...&a_{2n}\\ ... & ... & ... &... \\ a_{n1}&a_{n2}&...&a_{nn}\\ \end{matrix}& \begin{matrix} b_1\\ b_2\\ ...\\ b_n \end{matrix} \end{array} \right )$

6.2 Gaussian Elimination Method

6.2.1 Overall Description

The key idea of Gaussian elimination is to reduce the coefficient matrix to upper-triangular form using the row operations above, and then to compute the solution by backward substitution.

6.2.2 Algorithm

INPUT: dimension $N$ ; coefficient matrix $A(N,N)$ ; right-hand side $B(N)$ .
OUTPUT: solution $x(1),...,x(N)$ , or a message that the LSEs have no unique solution.
Step $1$ : For $k = 1,2,...,N-1$ , do Steps 2-4.
Step $2$ : Let $p$ be the smallest integer with $k\leq p\leq N$ and $A(p,k)\not= 0$ . If no such $p$ can be found, output “no unique solution exists” and stop.
Step $3$ : If $p\not=k$ , interchange the rows: $E_p\leftrightarrow E_k$ .
Step $4$ : For $i=k+1,...,N$ :
1. Set $m_{i,k}=\displaystyle\frac{A(i,k)}{A(k,k)}$ ;
2. Set $B(i)=B(i)-m_{i,k}B(k)$ ;
3. For $j=k+1,...,N$ , set $A(i,j)=A(i,j)-m_{i,k}A(k,j)$ .
Step $5$ : If $A(N,N)\not=0$ , set $x(N)=\displaystyle\frac{B(N)}{A(N,N)}$ ; otherwise output “no unique solution exists” and stop.
Step $6$ : For $i=N-1,N-2,...,1$ , set
$x(i)=[B(i)-\sum\limits_{j=i+1}^{N}A(i,j)x(j)]/A(i,i)$
Step $7$ : Output the solution $x(1),...,x(N)$ .
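
The steps above can be sketched in Python as follows. This is a minimal illustration, not a production solver: the function name `gauss_solve` and the use of `None` to signal "no unique solution" are assumptions made here, and indices are 0-based rather than 1-based.

```python
def gauss_solve(A, b):
    """Solve Ax = b by Gaussian elimination with backward substitution.

    Returns the solution as a list, or None when no unique solution exists.
    (Hypothetical helper; indices are 0-based, unlike the steps in the text.)
    """
    n = len(A)
    A = [row[:] for row in A]  # work on copies so the inputs are untouched
    b = b[:]
    for k in range(n - 1):
        # Step 2: smallest row index p >= k with a nonzero entry in column k
        p = next((r for r in range(k, n) if A[r][k] != 0), None)
        if p is None:
            return None
        # Step 3: interchange rows p and k if needed
        if p != k:
            A[p], A[k] = A[k], A[p]
            b[p], b[k] = b[k], b[p]
        # Step 4: eliminate the entries below the pivot in column k
        for i in range(k + 1, n):
            m = A[i][k] / A[k][k]
            b[i] -= m * b[k]
            for j in range(k + 1, n):
                A[i][j] -= m * A[k][j]
            A[i][k] = 0.0
    # Step 5: the last pivot must be nonzero
    if A[n - 1][n - 1] == 0:
        return None
    x = [0.0] * n
    x[n - 1] = b[n - 1] / A[n - 1][n - 1]
    # Step 6: backward substitution
    for i in range(n - 2, -1, -1):
        s = sum(A[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (b[i] - s) / A[i][i]
    return x


# Example system whose exact solution is (2, 3, -1)
x = gauss_solve([[2.0, 1.0, -1.0], [-3.0, -1.0, 2.0], [-2.0, 1.0, 2.0]],
                [8.0, -11.0, -3.0])
```

Because the inputs are copied, the same matrix can be reused with a different right-hand side.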

6.3 Pivoting Strategies

6.3.1 Background

In Gaussian elimination, the pivot $a_{kk}^{(k-1)}$ appears as a divisor both in the multipliers and in backward substitution:
$m_{i,k}=\displaystyle\frac{A(i,k)}{A(k,k)}\\ x(i)=[B(i)-\sum\limits_{j=i+1}^{N}A(i,j)x(j)]/A(i,i)$

If $|a_{kk}^{(k-1)}|$ is small relative to the other entries, these quotients become large and any roundoff error already present is magnified. Therefore, to reduce the roundoff error, we choose the pivot $a_{kk}^{(k-1)}$ to be as large in magnitude as possible.
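
A small experiment in double precision illustrates the effect. The system $10^{-20}x_1+x_2=1$ , $x_1+x_2=2$ has a solution very close to $(1,1)$ . The helper `solve_2x2` below is a hypothetical name introduced only for this sketch; it performs one elimination step and backward substitution, optionally after interchanging the two equations.

```python
def solve_2x2(a11, a12, b1, a21, a22, b2, swap_rows=False):
    """Eliminate and back-substitute a 2x2 system, optionally swapping rows first."""
    if swap_rows:
        a11, a12, b1, a21, a22, b2 = a21, a22, b2, a11, a12, b1
    m = a21 / a11              # multiplier: huge when the pivot a11 is tiny
    a22 -= m * a12
    b2 -= m * b1
    x2 = b2 / a22
    x1 = (b1 - a12 * x2) / a11
    return x1, x2


eps = 1e-20
no_pivot = solve_2x2(eps, 1.0, 1.0, 1.0, 1.0, 2.0)
with_pivot = solve_2x2(eps, 1.0, 1.0, 1.0, 1.0, 2.0, swap_rows=True)
print(no_pivot)    # (0.0, 1.0): x1 is completely wrong
print(with_pivot)  # (1.0, 1.0): accurate to machine precision
```

With the tiny pivot, the multiplier $10^{20}$ swamps the other entries of the second equation, and the information needed to recover $x_1$ is lost.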

6.3.2 Maximal Column Pivoting Technique

This method interchanges rows so that the pivot $a_{kk}^{(k-1)}$ is the entry of largest absolute value in column $k$ , among rows $k,...,n$ .

6.3.3 Maximal Row Pivoting Technique

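This method interchanges columns so that the pivot $a_{kk}^{(k-1)}$ is the entry of largest absolute value in row $k$ , among columns $k,...,n$ . A column interchange reorders the unknowns, so the permutation must be recorded.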

6.3.4 Partial Pivoting Technique

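This method interchanges rows and columns so that the pivot $a_{kk}^{(k-1)}$ is the entry of largest absolute value in the remaining submatrix (rows and columns $k,...,n$ ). A search over the whole remaining submatrix is usually called complete (or full) pivoting, while the term partial pivoting usually refers to the column search of 6.3.2.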

6.3.5 Scaled Partial Pivoting Technique

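$s_i=\max_{1\leq j\leq n}|a_{i,j}| \\ \displaystyle\frac{|a_{p,k}|}{s_p}=\max_{k\leq i\leq n}\displaystyle\frac{|a_{i,k}|}{s_i}$

Here $s_i$ is the scale factor of row $i$ . At step $k$ , this method selects the row $p$ whose entry in column $k$ is largest relative to its scale factor and interchanges $E_p\leftrightarrow E_k$ , so that $\displaystyle\frac{|a_{kk}^{(k-1)}|}{s_{k}}$ is maximal among rows $k,...,n$ .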
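
As an illustration, the pivot-row selection can be sketched as below. The function name `scaled_pivot_row` is an assumption made for this sketch, and it presumes every scale factor is nonzero (a zero scale factor would mean a zero row, so no unique solution exists).

```python
def scaled_pivot_row(A, s, k):
    """Return the row p >= k that maximizes |A[p][k]| / s[p] (0-based indices)."""
    return max(range(k, len(A)), key=lambda i: abs(A[i][k]) / s[i])


A = [[30.0, 591400.0],
     [5.291, -6.130]]
s = [max(abs(v) for v in row) for row in A]  # scale factors s_i
p = scaled_pivot_row(A, s, 0)
# p == 1: the second row is chosen, although 30.0 > 5.291 in column 0,
# because 30.0 is tiny compared with the largest entry of its own row.
```

Maximal column pivoting alone would keep the first row as the pivot row here, since it compares the raw entries without regard to row scale.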

6.4 LU Factorization

6.4.1 The advantage of LU Factorization

$Ax=b\\ A=LU\\ L= \left( \begin{matrix} 1 & 0 & 0 & ... & 0 \\ l_{21} & 1 & 0 & ... & 0 \\ l_{31} & l_{32} & 1 & ... & 0 \\ ... & ... & ... & ... & ... \\ l_{n1} & l_{n2} & l_{n3} & ... & 1 \end{matrix} \right), U= \left( \begin{matrix} u_{11} & u_{12} & u_{13} & ... & u_{1n} \\ 0 & u_{22} & u_{23} & ... & u_{2n} \\ 0 & 0 & u_{33} & ... & u_{3n} \\ ... & ... & ... & ... & ... \\ 0 & 0 & 0 & ... & u_{nn} \end{matrix} \right)$

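Once $A=LU$ is known, $LUx=b$ can be solved in two steps, each involving only a triangular system. This is the advantage of the factorization: $L$ and $U$ can be reused for every new right-hand side $b$ .

Let $y=Ux$ , so that $Ly=b$ .
Solve $Ly=b$ for $y$ by forward substitution.
Solve $Ux=y$ for $x$ by backward substitution.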
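
The two triangular solves can be sketched as follows. The function names `forward_sub` and `backward_sub` are assumptions made for this illustration, and the factors `L` and `U` are taken as already computed, with nonzero diagonal entries.

```python
def forward_sub(L, b):
    """Solve Ly = b for lower-triangular L, from the first row down."""
    n = len(b)
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][j] * y[j] for j in range(i))) / L[i][i]
    return y


def backward_sub(U, y):
    """Solve Ux = y for upper-triangular U, from the last row up."""
    n = len(y)
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (y[i] - sum(U[i][j] * x[j] for j in range(i + 1, n))) / U[i][i]
    return x


# A = LU = [[2, 1], [6, 7]]
L = [[1.0, 0.0], [3.0, 1.0]]
U = [[2.0, 1.0], [0.0, 4.0]]
x = backward_sub(U, forward_sub(L, [4.0, 20.0]))  # [1.0, 2.0]
```

Each solve costs on the order of $n^2$ operations, compared with roughly $n^3/3$ for the elimination itself, which is why reusing the factors pays off.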

6.4.2 LU Factorization through Gaussian Elimination

Theorem

If Gaussian elimination can be performed on the linear system $Ax=b$ without row interchanges, then the matrix $A$ can be factored into the product of a lower-triangular matrix $L$ and an upper-triangular matrix $U$ ,
$A=LU$
where
$L= \left( \begin{matrix} 1 & 0 & 0 & ... & 0 \\ m_{21} & 1 & 0 & ... & 0 \\ m_{31} & m_{32} & 1 & ... & 0 \\ ... & ... & ... & ... & ... \\ m_{n1} & m_{n2} & m_{n3} & ... & 1 \end{matrix} \right), U= \left( \begin{matrix} a_{11}^1 & a_{12}^1 & a_{13}^1 & ... & a_{1n}^1 \\ 0 & a_{22}^2 & a_{23}^2 & ... & a_{2n}^2 \\ 0 & 0 & a_{33}^3 & ... & a_{3n}^3 \\ ... & ... & ... & ... & ... \\ 0 & 0 & 0 & ... & a_{nn}^n \end{matrix} \right)$

Proof

For the first elimination step, define the multipliers and the first Gaussian transformation matrix
$m_{j,1}=\displaystyle\frac{a_{j,1}}{a_{1,1}}\\ M^1 = \left( \begin{matrix} 1 & 0 & 0 & ... & 0 \\ -m_{21} & 1 & 0 & ... & 0 \\ -m_{31} & 0 & 1 & ... & 0 \\ ... & ... & ... & ... & ... \\ -m_{n1} & 0 & 0 & ... & 1 \end{matrix} \right)$
so that $M^1A$ has zeros below the diagonal in its first column. Repeating this for each column gives
$A^n=M^{n-1}M^{n-2}...M^{1}A.$
Let $U=A^n$ , which is upper triangular. Each $M^k$ is inverted by flipping the signs of its multipliers, so
$[M^1]^{-1}...[M^{n-2}]^{-1}[M^{n-1}]^{-1}U=A \\ [M^1]^{-1} = \left( \begin{matrix} 1 & 0 & 0 & ... & 0 \\ m_{21} & 1 & 0 & ... & 0 \\ m_{31} & 0 & 1 & ... & 0 \\ ... & ... & ... & ... & ... \\ m_{n1} & 0 & 0 & ... & 1 \end{matrix} \right)\\ L = [M^1]^{-1}...[M^{n-2}]^{-1}[M^{n-1}]^{-1}$
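
The construction in the proof can be sketched in code: run the elimination without row interchanges and store each multiplier $m_{i,k}$ in position $(i,k)$ of $L$ . The function name `lu_by_elimination` and the choice to raise an error on a zero pivot are assumptions made for this illustration.

```python
def lu_by_elimination(A):
    """Return (L, U) with A = LU, using elimination without row interchanges."""
    n = len(A)
    U = [row[:] for row in A]  # becomes the upper-triangular factor
    L = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for k in range(n - 1):
        if U[k][k] == 0:
            raise ValueError("zero pivot: a row interchange would be needed")
        for i in range(k + 1, n):
            m = U[i][k] / U[k][k]
            L[i][k] = m            # the multiplier is exactly the entry of L
            for j in range(k, n):
                U[i][j] -= m * U[k][j]
    return L, U


L, U = lu_by_elimination([[2.0, 1.0], [6.0, 7.0]])
# L == [[1.0, 0.0], [3.0, 1.0]], U == [[2.0, 1.0], [0.0, 4.0]]
```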

6.4.3 LU Factorization through Gaussian Elimination

$LU= \left( \begin{matrix} 1 & 0 & 0 & ... & 0 \\ l_{21} & 1 & 0 & ... & 0 \\ l_{31} & l_{32} & 1 & ... & 0 \\ ... & ... & ... & ... & ... \\ l_{n1} & l_{n2} & l_{n3} & ... & 1 \end{matrix} \right) \left( \begin{matrix} u_{11} & u_{12} & u_{13} & ... & u_{1n} \\ 0 & u_{22} & u_{23} & ... & u_{2n} \\ 0 & 0 & u_{33} & ... & u_{3n} \\ ... & ... & ... & ... & ... \\ 0 & 0 & 0 & ... & u_{nn} \end{matrix} \right)\\ LU=A= \left( \begin{matrix} a_{11} & a_{12} & a_{13} & ... & a_{1n} \\ a_{21} & a_{22} & a_{23} & ... & a_{2n} \\ a_{31} & a_{32} & a_{33} & ... & a_{3n} \\ ... & ... & ... & ... & ... \\ a_{n1} & a_{n2} & a_{n3} & ... & a_{nn} \end{matrix} \right)$

Algorithm
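
One common way to turn the entrywise comparison $LU=A$ into a procedure is the Doolittle scheme: equate entries to obtain row $i$ of $U$ , then column $i$ of $L$ , for $i=1,...,n$ . The sketch below assumes this scheme and a hypothetical function name `doolittle`; it is not necessarily the exact algorithm the notes intended.

```python
def doolittle(A):
    """Factor A = LU with ones on the diagonal of L, by comparing entries."""
    n = len(A)
    L = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    U = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # Row i of U: a_ij = sum_{k<i} l_ik u_kj + u_ij   (j >= i)
        for j in range(i, n):
            U[i][j] = A[i][j] - sum(L[i][k] * U[k][j] for k in range(i))
        if U[i][i] == 0 and i < n - 1:
            raise ValueError("zero pivot: factorization without interchanges fails")
        # Column i of L: a_ji = sum_{k<i} l_jk u_ki + l_ji u_ii   (j > i)
        for j in range(i + 1, n):
            L[j][i] = (A[j][i] - sum(L[j][k] * U[k][i] for k in range(i))) / U[i][i]
    return L, U


A = [[4.0, 3.0, 0.0], [8.0, 8.0, 2.0], [0.0, 4.0, 7.0]]
L, U = doolittle(A)
# L == [[1, 0, 0], [2, 1, 0], [0, 2, 1]], U == [[4, 3, 0], [0, 2, 2], [0, 0, 3]]
```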

6.5 Strictly Diagonally Dominant Matrix

6.5.1 Definition

An $n\times n$ matrix $A$ is said to be strictly diagonally dominant when
$|a_{ii}|>\sum\limits_{j=1,j\not=i}^{n} |a_{ij}|$
holds for each $i=1,2,3,...,n$ .
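
The definition translates directly into a row-by-row check. The function name below is an assumption made for this sketch.

```python
def is_strictly_diagonally_dominant(A):
    """True when |a_ii| exceeds the sum of |a_ij|, j != i, in every row."""
    return all(
        abs(row[i]) > sum(abs(v) for j, v in enumerate(row) if j != i)
        for i, row in enumerate(A)
    )


A = [[7, 2, 0], [3, 5, -1], [0, 5, -6]]   # 7 > 2, 5 > 4, 6 > 5
B = [[6, 4, -3], [4, -2, 0], [-3, 0, 1]]  # first row fails: 6 < 4 + 3
```

Note that the property depends on the order of the rows: interchanging two rows of a strictly diagonally dominant matrix generally destroys it.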

6.5.2 Property

A strictly diagonally dominant matrix $A$ is nonsingular.
Moreover, in this case, Gaussian elimination can be performed on any linear system of the form $Ax=b$ to obtain its unique solution without row or column interchanges, and the computations are stable with respect to the growth of roundoff errors.

Proof for First Property

A matrix is singular when its determinant is zero.

The determinant is zero exactly when the $n$ column vectors of the matrix are linearly dependent.

Thus, the matrix $A$ is singular exactly when there exists a nonzero column vector $x$ such that $Ax=0$ .
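
A standard way to finish the argument is by contradiction (a sketch; the index $k$ is introduced here for illustration). Suppose $A$ is strictly diagonally dominant but singular, so that $Ax=0$ for some $x\not=0$ . Let $k$ be an index with $|x_k|=\max_{1\leq j\leq n}|x_j|>0$ . Row $k$ of $Ax=0$ gives
$a_{kk}x_k=-\sum\limits_{j=1,j\not=k}^{n}a_{kj}x_j$
and dividing by $|x_k|$ yields
$|a_{kk}|\leq\sum\limits_{j=1,j\not=k}^{n}|a_{kj}|\displaystyle\frac{|x_j|}{|x_k|}\leq\sum\limits_{j=1,j\not=k}^{n}|a_{kj}|$
which contradicts the strict diagonal dominance of row $k$ . Hence no such $x$ exists and $A$ is nonsingular.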

6.6 Positive Definite Symmetric Matrix

6.6.1 Definition

A matrix $A$ is positive definite if it is symmetric and if $x^TAx > 0$ for every $n$ -dimensional column vector $x\not=0$ .

6.6.2 Property

If $A$ is an $n\times n$ positive definite matrix, then

$A$ is nonsingular;
$a_{ii}>0$ for each $i=1,2,...,n$ ;
$\max_{1\leq k,j\leq n}|a_{k,j}|\leq \max_{1\leq i\leq n}|a_{i,i}|$ ;
$a_{ij}^2<a_{ii}a_{jj}$ for each $i\not=j$ .
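
These properties are necessary conditions only, but they give a cheap way to rule out matrices that cannot be positive definite. The sketch below checks the last three properties for a symmetric matrix; the function name is an assumption made for this illustration, and passing the check does not prove positive definiteness.

```python
def passes_necessary_conditions(A):
    """Check the entrywise necessary conditions for positive definiteness."""
    n = len(A)
    # a_ii > 0 for each i
    diag_positive = all(A[i][i] > 0 for i in range(n))
    # the largest entry in magnitude is attained on the diagonal
    max_on_diag = (max(abs(A[k][j]) for k in range(n) for j in range(n))
                   <= max(abs(A[i][i]) for i in range(n)))
    # a_ij^2 < a_ii * a_jj for each i != j
    off_diag_bound = all(A[i][j] ** 2 < A[i][i] * A[j][j]
                         for i in range(n) for j in range(n) if i != j)
    return diag_positive and max_on_diag and off_diag_bound


ok = passes_necessary_conditions([[2, -1, 0], [-1, 2, -1], [0, -1, 2]])  # True
bad = passes_necessary_conditions([[1, 2], [2, 1]])                      # False
```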

6.6.3 Theorem

6.7 $LL^T$ Factorization

6.7.1 Definition

For an $n\times n$ symmetric and positive definite matrix $A$ of the form
$A= \left( \begin{matrix} a_{11} & a_{12} & a_{13} & ... & a_{1n} \\ a_{12} & a_{22} & a_{23} & ... & a_{2n} \\ a_{13} & a_{23} & a_{33} & ... & a_{3n} \\ ... & ... & ... & ... & ... \\ a_{1n} & a_{2n} & a_{3n} & ... & a_{nn} \end{matrix} \right)$
where $A^T=A$ , we can factor the matrix as $A=LL^T$ , where $L$ is a lower-triangular matrix of the form
$\left( \begin{matrix} l_{11} & 0 & 0 & \cdots & 0 \\ l_{21} & l_{22} & 0 & \cdots & 0 \\ l_{31} & l_{32} & l_{33} & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ l_{n1} & l_{n2} & l_{n3} & \cdots & l_{nn} \end{matrix} \right)$

Thus, we need to determine the entries $l_{ij}$ , for $i\in[1,n]$ and $j\in[1,i]$ , from
$A= \left( \begin{matrix} a_{11} & a_{12} & a_{13} & ... & a_{1n} \\ a_{12} & a_{22} & a_{23} & ... & a_{2n} \\ a_{13} & a_{23} & a_{33} & ... & a_{3n} \\ ... & ... & ... & ... & ... \\ a_{1n} & a_{2n} & a_{3n} & ... & a_{nn} \end{matrix} \right)=LL^T$

6.7.2 Choleski’s Algorithm

Compute the entries of $L$ one column at a time.

To factor the positive definite $n\times n$ matrix $A$ into $LL^T$ , where $L$ is lower triangular:

INPUT: the dimension $n$ ; entries $a_{ij}$ of $A$ , for $i\in [1,n]$ and $j\in[1,i]$ .
OUTPUT: the entries $l_{ij}$ of $L$ , for $i\in [1,n]$ and $j\in[1,i]$ .
Step $1$ : Set $l_{11} = \sqrt{a_{11}}$ .
Step $2$ : For $j\in[2,n]$ , set $l_{j1}=\displaystyle\frac{a_{j1}}{l_{11}}$ .
Step $3$ : For $i\in[2,n-1]$ , do Steps 4 and 5.
Step $4$ : Set $l_{ii}=[a_{ii}-\sum_{k=1}^{i-1}l_{ik}^2]^{\frac{1}{2}}$ .
Step $5$ : For $j\in[i+1,n]$ , set $l_{ji}=\displaystyle\frac{a_{ji}-\sum_{k=1}^{i-1}l_{jk}l_{ik}}{l_{ii}}$ .
Step $6$ : Set $l_{nn}=[a_{nn}-\sum_{k=1}^{n-1}l_{nk}^2]^\frac{1}{2}$ .
Step $7$ : OUTPUT $l_{ij}$ for $j\in[1,i]$ and $i\in[1,n]$ . STOP.
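
The steps above can be sketched compactly in Python. Steps 1, 4 and 6 share one formula (with an empty sum when $i=1$ ), as do Steps 2 and 5, so a single loop covers them. The function name `cholesky` and the error raised for a non-positive value under the square root are assumptions made for this illustration; only the lower triangle of `A` is read.

```python
import math


def cholesky(A):
    """Return lower-triangular L with A = L L^T, computed column by column."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        # Diagonal entry (Steps 1, 4, 6)
        d = A[i][i] - sum(L[i][k] ** 2 for k in range(i))
        if d <= 0:
            raise ValueError("matrix is not positive definite")
        L[i][i] = math.sqrt(d)
        # Entries below the diagonal in column i (Steps 2, 5)
        for j in range(i + 1, n):
            L[j][i] = (A[j][i] - sum(L[j][k] * L[i][k] for k in range(i))) / L[i][i]
    return L


A = [[4.0, 12.0, -16.0],
     [12.0, 37.0, -43.0],
     [-16.0, -43.0, 98.0]]
L = cholesky(A)
# L == [[2, 0, 0], [6, 1, 0], [-8, 5, 3]]
```

For this matrix the hand computation runs: $l_{11}=2$ , $l_{21}=6$ , $l_{31}=-8$ , $l_{22}=\sqrt{37-36}=1$ , $l_{32}=(-43+48)/1=5$ , $l_{33}=\sqrt{98-64-25}=3$ .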

Example

6.8 $LDL^T$ Factorization

6.8.1 Definition

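If $A$ is a positive definite matrix, then it can be factored as
$A = LDL^T,$
where $L$ is lower triangular with ones on its diagonal and $D$ is a diagonal matrix with positive diagonal entries.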

The entries of $L$ and $D$ can be computed one row at a time.

6.8.2 Algorithm
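
A row-by-row computation can be sketched as follows. Comparing entries of $A=LDL^T$ gives $l_{ij}=(a_{ij}-\sum_{k<j}l_{ik}l_{jk}d_k)/d_j$ for $j<i$ and $d_i=a_{ii}-\sum_{k<i}l_{ik}^2d_k$ . The function name `ldlt` and the representation of $D$ as a list of its diagonal entries are assumptions made for this illustration.

```python
def ldlt(A):
    """Return (L, d) with A = L diag(d) L^T, L unit lower triangular."""
    n = len(A)
    L = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    d = [0.0] * n
    for i in range(n):
        # Entries of row i to the left of the diagonal
        for j in range(i):
            s = sum(L[i][k] * L[j][k] * d[k] for k in range(j))
            L[i][j] = (A[i][j] - s) / d[j]
        # Diagonal entry of D for row i
        d[i] = A[i][i] - sum(L[i][k] ** 2 * d[k] for k in range(i))
    return L, d


A = [[4.0, 12.0, -16.0],
     [12.0, 37.0, -43.0],
     [-16.0, -43.0, 98.0]]
L, d = ldlt(A)
# L == [[1, 0, 0], [3, 1, 0], [-4, 5, 1]], d == [4, 1, 9]
```

Unlike the $LL^T$ factorization, no square roots are needed; the two are related by scaling column $j$ of this $L$ by $\sqrt{d_j}$ .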

6.9 Tri-diagonal Linear System

6.9.1 Definition

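An $n\times n$ matrix $A$ is called a band matrix if there exist integers $p$ and $q$ , with $1<p, q<n$ , such that $a_{ij}=0$ whenever $i+p\leq j$ or $j+q\leq i$ . The bandwidth of a band matrix is defined as $w=p+q-1$ . A tridiagonal matrix is the special case $p=q=2$ , with bandwidth $3$ : its only nonzero entries lie on the main diagonal and the two diagonals adjacent to it.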

6.9.2 LU Factorization

$A = LU$

Solving $Ax=LUx=b$ then takes two steps:

Let $z = Ux$ and solve $Lz=b$ for $z$ .
Solve $Ux=z$ for $x$ .
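
For a tridiagonal matrix both factors are bidiagonal, so the factorization and the two solves each take a single pass. The sketch below assumes a variant in which $L$ has ones on its diagonal and no row interchanges are needed (for example, when $A$ is strictly diagonally dominant). The function name and the three-array layout `sub`, `diag`, `sup` are assumptions made for this illustration.

```python
def solve_tridiagonal(sub, diag, sup, b):
    """Solve Ax = b for tridiagonal A stored as three 1-D lists.

    sub[i-1] = A[i][i-1], diag[i] = A[i][i], sup[i] = A[i][i+1] (0-based).
    """
    n = len(diag)
    u = [0.0] * n  # diagonal of U; the superdiagonal of U equals sup
    z = [0.0] * n  # solution of Lz = b
    u[0] = diag[0]
    z[0] = b[0]
    for i in range(1, n):
        l = sub[i - 1] / u[i - 1]       # subdiagonal entry of L
        u[i] = diag[i] - l * sup[i - 1]
        z[i] = b[i] - l * z[i - 1]      # forward substitution
    x = [0.0] * n
    x[n - 1] = z[n - 1] / u[n - 1]
    for i in range(n - 2, -1, -1):      # backward substitution
        x[i] = (z[i] - sup[i] * x[i + 1]) / u[i]
    return x


# A has 2 on the diagonal and -1 next to it; the exact solution is (1, 1, 1)
x = solve_tridiagonal([-1.0, -1.0], [2.0, 2.0, 2.0], [-1.0, -1.0], [1.0, 0.0, 1.0])
```

The work grows linearly with $n$ , and the full $n\times n$ array is never formed.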

6.9.3 Remarks

Band matrices are usually sparse, so instead of a full two-dimensional array it is more economical to store only the nonzero diagonals in one-dimensional arrays.
Band matrices arise frequently in numerical methods for partial differential equations.
