Applied linear operators and spectral methods/Lecture 4

More on spectral decompositions

In the course of the previous lecture we essentially proved the following theorem:

Theorem:

1) If a $n \times n$ matrix $𝐀$ has $n$ linearly independent real or complex eigenvectors, the $𝐀$ can be diagonalized. 2) If $𝐓$ is a matrix whose columns are eigenvectors then $𝐓 𝐀 𝐓^{- 1} = Λ$ is the diagonal matrix of eigenvalues.

The factorization $𝐀 = 𝐓^{- 1} Λ 𝐓$ is called the spectral representation of $𝐀$ .

Application

We can use the spectral representation to solve a system of linear homogeneous ordinary differential equations.

For example, we could wish to solve the system

\frac{d 𝐮}{d t} = 𝐀 𝐮 = [\begin{matrix} - 2 & 1 \\ 1 & - 2 \end{matrix}] [\begin{matrix} u_{1} \\ u_{2} \end{matrix}]

(More generally $𝐀$ could be a $n \times n$ matrix.)

Comment:

Higher order ordinary differential equations can be reduced to this form. For example,

\frac{d^{2} u_{1}}{d t^{2}} + a \frac{d u_{1}}{d t} = b u_{1}

Introduce

u_{2} = \frac{d u_{1}}{d t}

Then the system of equations is

\begin{matrix} \frac{d u_{1}}{d t} & = u_{2} \\ \frac{d u_{2}}{d t} & = b u_{1} - a u_{2} \end{matrix}

or,

\frac{d 𝐮}{d t} = [\begin{matrix} 0 & 1 \\ b & - a \end{matrix}] [\begin{matrix} u_{1} \\ u_{2} \end{matrix}] = 𝐀 𝐮

Returning to the original problem, let us find the eigenvalues and eigenvectors of $𝐀$ . The characteristic equation is

d e t (𝐀 - λ 𝐈) = 0

o we can calculate the eigenvalues as

(2 + λ) (2 + λ) - 1 = 0 ⟹ λ^{2} + 4 λ + 3 = 0 ⟹ λ_{1} = - 1, λ_{2} = - 3

The eigenvectors are given by

(𝐀 - λ_{1} 𝐈) 𝐧_{1} = 𝟎; (𝐀 - λ_{2} 𝐈) 𝐧_{2} = 𝟎

or,

- n_{1}^{1} + n_{2}^{1} = 0; n_{1}^{1} - n_{2}^{1} = 0; n_{1}^{2} + n_{2}^{2} = 0; n_{1}^{2} + n_{2}^{2} = 0

Possible choices of $𝐧_{1}$ and $𝐧_{2}$ are

𝐧_{1} = [\begin{matrix} 1 \\ 1 \end{matrix}]; 𝐧_{2} = [\begin{matrix} 1 \\ - 1 \end{matrix}]

The matrix $𝐓$ is one whose columd are the eigenvectors of $𝐀$ , i.e.,

𝐓 = [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}]

and

Λ = 𝐓^{- 1} 𝐀 𝐓 = [\begin{matrix} - 1 & 0 \\ 0 & - 3 \end{matrix}]

If $𝐮 = 𝐓 𝐮^{'}$ the system of equations becomes

\frac{d 𝐮^{'}}{d t} = 𝐓^{- 1} 𝐀 𝐓 𝐮^{'} = Λ 𝐮^{'}

Expanded out

\frac{d u_{1}^{'}}{d t} = - u_{1}^{'}; \frac{d u_{2}^{'}}{d t} = - 3 u_{2}^{'}

The solutions of these equations are

u_{1}^{'} = C_{1} e^{- t}; u_{2}^{'} = C_{2} e^{- 3 t}

Therefore,

𝐮 = 𝐓 𝐮^{'} = [\begin{matrix} C_{1} e^{- t} + C_{2} e^{- 3 t} \\ C_{1} e^{- t} - C_{2} e^{- 3 t} \end{matrix}]

This is the solution of the system of ODEs that we seek.

Most "generic" matrices have linearly independent eigenvectors. Generally a matrix will have $n$ distinct eigenvalues unless there are symmetries that lead to repeated values.

Theorem

If $𝐀$ has $k$ distinct eigenvalues then it has $k$ linearly independent eigenvectors.

Proof:

We prove this by induction.

Let $𝐧_{j}$ be the eigenvector corresponding to the eigenvalue $λ_{j}$ . Suppose $𝐧_{1}, 𝐧_{2}, \dots, 𝐧_{k - 1}$ are linearly independent (note that this is true for $k$ = 2). The question then becomes: Do there exist $α_{1}, α_{2}, \dots, α_{k}$ not all zero such that the linear combination

α_{1} 𝐧_{1} + α_{2} 𝐧_{2} + \dots + α_{k} 𝐧_{k} = 0

Let us multiply the above by $(𝐀 - λ_{k} 𝐈)$ . Then, since $𝐀 𝐧_{i} = λ_{i} 𝐧_{i}$ , we have

α_{1} (λ_{1} - λ_{k}) 𝐧_{1} + α_{2} (λ_{2} - λ_{k}) 𝐧_{2} + \dots + α_{k - 1} (λ_{k - 1} - λ_{k}) 𝐧_{k - 1} + α_{k} (λ_{k} - λ_{k}) 𝐧_{k} = 𝟎

Since $λ_{k}$ is arbitrary, the above is true only when

α_{1} = α_{2} = \dots = α_{k - 1} = 0

In thast case we must have

α_{k} 𝐧_{k} = 𝟎 ⟹ α_{k} = 0

This leads to a contradiction.

Therefore $𝐧_{1}, 𝐧_{2}, \dots, 𝐧_{k}$ are linearly independent. $◻$

Another important class of matrices which are diagonalizable are those which are self-adjoint.

Theorem

If $𝑨$ is self-adjoint the following statements are true

$⟨ 𝑨 𝐱, 𝐱 ⟩$ is real for all $𝐱$ .
All eigenvalues are real.
Eigenvectors of distinct eigenvalues are orthogonal.
There is an orthonormal basis formed by the eigenvectors.
The matrix $𝑨$ can be diagonalized (this is a consequence of the previous statement.)

Proof

1) Because the matrix is self-adjoint we have

⟨ 𝑨 𝐱, 𝐱 ⟩ = ⟨ 𝐱, 𝑨 𝐱 ⟩

From the property of the inner product we have

⟨ 𝐱, 𝑨 𝐱 ⟩ = \overline{⟨ 𝑨 𝐱, 𝐱 ⟩}

Therefore,

⟨ 𝑨 𝐱, 𝐱 ⟩ = \overline{⟨ 𝑨 𝐱, 𝐱 ⟩}

which implies that $⟨ 𝑨 𝐱, 𝐱 ⟩$ is real.

2) Since $⟨ 𝑨 𝐱, 𝐱 ⟩$ is real, $⟨ 𝑰 𝐱, 𝐱 ⟩ = ⟨ 𝐱, 𝐱 ⟩$ is real. Also, from the eiegnevalue problem, we have

⟨ 𝑨 𝐱, 𝐱 ⟩ = λ ⟨ 𝐱, 𝐱 ⟩

Therefore, $λ$ is real.

3) If $(λ, 𝐱)$ and $(μ, 𝐲)$ are two eigenpairs then

λ ⟨ 𝐱, 𝐲 ⟩ = ⟨ 𝑨 𝐱, 𝐲 ⟩

Since the matrix is self-adjoint, we have

λ ⟨ 𝐱, 𝐲 ⟩ = ⟨ 𝐱, 𝑨 𝐲 ⟩ = μ ⟨ 𝐱, 𝐲 ⟩

Therefore, if $λ \neq μ \neq 0$ , we must have

⟨ 𝐱, 𝐲 ⟩ = 0

Hence the eigenvectors are orthogonal.

4) This part is a bit more involved. We need to define a manifold first.

Linear manifold

A linear manifold (or vector subspace) $ℳ \in 𝒮$ is a subset of $𝒮$ which is closed under scalar multiplication and vector addition.

Examples are a line through the origin of $n$ -dimensional space, a plane through the origin, the whole space, the zero vector, etc.

Invariant manifold

An invariant manifold $ℳ$ for the matrix $𝑨$ is the linear manifold for which $𝐱 \in ℳ$ implies $𝑨 𝐱 \in ℳ$ .

Examples are the null space and range of a matrix $𝑨$ . For the case of a rotation about an axis through the origin in $n$ -space, invaraiant manifolds are the origin, the plane perpendicular to the axis, the whole space, and the axis itself.

Therefore, if $𝐱_{1}, 𝐱_{2}, \dots, 𝐱_{m}$ are a basis for $ℳ$ and $𝐱_{m + 1}, \dots, 𝐱_{n}$ are a basis for $ℳ_{⊥}$ (the perpendicular component of $ℳ$ ) then in this basis $𝑨$ has the representation

𝑨 = [\begin{matrix} x & x & | & x & x \\ x & x & | & x & x \\ - & - & - & - & - \\ 0 & 0 & | & x & x \\ 0 & 0 & | & x & x \end{matrix}]

We need a matrix of this form for it to be in an invariant manifold for $𝑨$ .

Note that if $ℳ$ is an invariant manifold of $𝑨$ it does not follow that $ℳ_{⊥}$ is also an invariant manifold.

Now, if $𝑨$ is self adjoint then the entries in the off-diagonal spots must be zero too. In that case, $𝑨$ is block diagonal in this basis.

Getting back to part (4), we know that there exists at least one eigenpair ( $λ_{1}, 𝐱_{1}$ ) (this is true for any matrix). We now use induction. Suppose that we have found ( $n - 1$ ) mutually orthogonal eigenvectors $𝐱_{i}$ with $𝑨 𝐱_{i} = λ_{i} 𝐱_{i}$ and $λ_{i}$ are real, $i = 1, \dots, k - 1$ . Note that the $𝐱_{i}$ s are invariant manifolds of $𝑨$ as is the space spanned by the $𝐱_{i}$ s and so is the manifold perpendicular to these vectors).

We form the linear manifold

ℳ_{k} = {𝐱 | ⟨ 𝐱, 𝐱_{j} ⟩ = 0 j = 1, 2, \dots, k - 1}

This is the orthogonal component of the $k - 1$ eigenvectors $𝐱_{1}, 𝐱_{2}, \dots, 𝐱_{k - 1}$ If $𝐱 \in ℳ_{k}$ then

⟨ 𝐱, 𝐱_{j} ⟩ = 0 and ⟨ 𝑨 𝐱, 𝐱_{j} ⟩ = ⟨ 𝐱, 𝑨 𝐱_{j} ⟩ = λ_{j} ⟨ 𝐱, 𝐱_{j} ⟩ = 0

Therefore $𝑨 𝐱 \in ℳ_{k}$ which means that $ℳ_{k}$ is invariant.

Hence $ℳ_{k}$ contains at least one eigenvector $𝐱_{k}$ with real eigenvalue $λ_{k}$ . We can repeat the procedure to get a diagonal matrix in the lower block of the block diagonal representation of $𝑨$ . We then get $n$ distinct eigenvectors and so $𝑨$ can be diagonalized. This implies that the eigenvectors form an orthonormal basis.

5) This follows from the previous result because each eigenvector can be normalized so that $⟨ 𝐱_{i}, 𝐱_{j} ⟩ = δ_{i j}$ .

We will explore some more of these ideas in the next lecture. Template:Lecture

Applied linear operators and spectral methods/Lecture 4

Contents

More on spectral decompositions

Theorem:

Application

Comment:

Theorem

Theorem

Linear manifold

Invariant manifold

Navigation menu

Applied linear operators and spectral methods/Lecture 4

More on spectral decompositions

Theorem:

Application

Comment:

Theorem

Theorem

Linear manifold

Invariant manifold

Navigation menu

Search