Skip to content

Chapter 68. Jordan Canonical Form

Jordan canonical form describes the structure of a square matrix that may fail to be diagonalizable.

Diagonalization is the ideal case. If a matrix has enough independent eigenvectors, it can be written as

A=PDP1, A=PDP^{-1},

where DD is diagonal. But many matrices do not have enough eigenvectors. Jordan canonical form gives the next best representation. It replaces a defective matrix by a block diagonal matrix that is almost diagonal.

Over an algebraically closed field, such as C\mathbb{C}, every square matrix is similar to a matrix built from Jordan blocks. Each block has one eigenvalue on the diagonal and ones immediately above the diagonal. This form records eigenvalues, algebraic multiplicities, geometric multiplicities, and the failure of diagonalization.

68.1 The Problem After Diagonalization

A matrix AA is diagonalizable if there is a basis of eigenvectors. In that case, the matrix of the transformation in that basis is diagonal.

But consider

A=[2102]. A= \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

The only eigenvalue is

λ=2. \lambda=2.

Compute

A2I=[0100]. A-2I= \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}.

Solving

(A2I)v=0 (A-2I)v=0

gives

y=0. y=0.

Thus

E2=span{[10]}. E_2= \operatorname{span} \left\{ \begin{bmatrix} 1 \\ 0 \end{bmatrix} \right\}.

There is only one independent eigenvector, but the space is two-dimensional. Hence AA cannot be diagonalized.

Still, AA has a simple structure. It is already a Jordan block:

[2102]. \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

It acts like multiplication by 22, together with a nilpotent shift.

68.2 Jordan Blocks

A Jordan block of size kk with eigenvalue λ\lambda is the k×kk\times k matrix

Jk(λ)=[λ1000λ1000λ010000λ]. J_k(\lambda)= \begin{bmatrix} \lambda & 1 & 0 & \cdots & 0 \\ 0 & \lambda & 1 & \cdots & 0 \\ 0 & 0 & \lambda & \ddots & 0 \\ \vdots & \vdots & \ddots & \ddots & 1 \\ 0 & 0 & 0 & 0 & \lambda \end{bmatrix}.

It has λ\lambda on the main diagonal, 11 on the superdiagonal, and 00 elsewhere.

For example,

J3(5)=[510051005]. J_3(5)= \begin{bmatrix} 5 & 1 & 0 \\ 0 & 5 & 1 \\ 0 & 0 & 5 \end{bmatrix}.

A Jordan block of size 11 is just

J1(λ)=[λ]. J_1(\lambda)= [\lambda].

Thus diagonal matrices are special cases of Jordan forms in which every Jordan block has size 11.

68.3 Jordan Canonical Form

A Jordan canonical form is a block diagonal matrix whose diagonal blocks are Jordan blocks:

J=[Jk1(λ1)000Jk2(λ2)000Jkr(λr)]. J= \begin{bmatrix} J_{k_1}(\lambda_1) & 0 & \cdots & 0 \\ 0 & J_{k_2}(\lambda_2) & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & J_{k_r}(\lambda_r) \end{bmatrix}.

A square matrix AA has Jordan canonical form JJ if there exists an invertible matrix PP such that

A=PJP1. A=PJP^{-1}.

Equivalently,

P1AP=J. P^{-1}AP=J.

The columns of PP form a special basis called a Jordan basis.

The Jordan form is unique up to the order of its blocks. The order of blocks may be changed, but the block sizes attached to each eigenvalue are determined by the matrix.

68.4 A Jordan Block as Scalar Plus Nilpotent

A Jordan block can be written as

Jk(λ)=λI+N, J_k(\lambda)=\lambda I+N,

where

N=[010000100000100000]. N= \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ 0 & 0 & 0 & \ddots & 0 \\ \vdots & \vdots & \ddots & \ddots & 1 \\ 0 & 0 & 0 & 0 & 0 \end{bmatrix}.

The matrix NN is nilpotent. This means that some power of NN is zero.

For a k×kk\times k Jordan block,

Nk=0, N^k=0,

but

Nk10. N^{k-1}\neq 0.

Thus a Jordan block consists of two parts:

PartMeaning
λI\lambda Idiagonal scaling by the eigenvalue
NNnilpotent shift along a chain

The diagonal part describes eigenvalue scaling. The nilpotent part describes the failure of diagonalization.

68.5 Generalized Eigenvectors

To build Jordan form, ordinary eigenvectors are not enough. We need generalized eigenvectors.

A nonzero vector vv is a generalized eigenvector of AA for eigenvalue λ\lambda if

(AλI)mv=0 (A-\lambda I)^m v=0

for some positive integer mm.

Every ordinary eigenvector is a generalized eigenvector with m=1m=1, since

(AλI)v=0. (A-\lambda I)v=0.

Generalized eigenvectors allow us to complete missing eigenvector bases.

For the defective matrix

A=[2102], A= \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix},

there is only one ordinary eigenvector. But there are two generalized eigenvectors, enough to form a basis.

68.6 Jordan Chains

A Jordan chain for eigenvalue λ\lambda is a list of vectors

v1,v2,,vk v_1,v_2,\ldots,v_k

such that

(AλI)v1=0 (A-\lambda I)v_1=0

and

(AλI)v2=v1, (A-\lambda I)v_2=v_1, (AλI)v3=v2, (A-\lambda I)v_3=v_2,

and so on, until

(AλI)vk=vk1. (A-\lambda I)v_k=v_{k-1}.

The first vector v1v_1 is an ordinary eigenvector. The later vectors are generalized eigenvectors.

Equivalently,

Av1=λv1, Av_1=\lambda v_1, Av2=λv2+v1, Av_2=\lambda v_2+v_1, Av3=λv3+v2, Av_3=\lambda v_3+v_2,

and in general,

Avj=λvj+vj1. Av_j=\lambda v_j+v_{j-1}.

This chain produces one Jordan block of size kk.

68.7 Matrix of a Jordan Chain

Let

v1,v2,,vk v_1,v_2,\ldots,v_k

be a Jordan chain for eigenvalue λ\lambda. In the ordered basis

v1,v2,,vk, v_1,v_2,\ldots,v_k,

the action of AA is

Av1=λv1, Av_1=\lambda v_1, Av2=v1+λv2, Av_2=v_1+\lambda v_2, Av3=v2+λv3, Av_3=v_2+\lambda v_3,

and so on.

Thus the matrix has the form

[λ1000λ1000λ010000λ]. \begin{bmatrix} \lambda & 1 & 0 & \cdots & 0 \\ 0 & \lambda & 1 & \cdots & 0 \\ 0 & 0 & \lambda & \ddots & 0 \\ \vdots & \vdots & \ddots & \ddots & 1 \\ 0 & 0 & 0 & 0 & \lambda \end{bmatrix}.

This is exactly the Jordan block Jk(λ)J_k(\lambda).

Thus Jordan blocks are the coordinate matrices of Jordan chains.

68.8 Example: A Two by Two Jordan Block

Let

A=[2102]. A= \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

The eigenvalue is λ=2\lambda=2. Let

N=A2I=[0100]. N=A-2I= \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}.

We have

Ne1=0 Ne_1=0

and

Ne2=e1. Ne_2=e_1.

Therefore

e1,e2 e_1,e_2

is a Jordan chain.

The matrix AA in this basis is already

J2(2)=[2102]. J_2(2)= \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

The vector e1e_1 is an eigenvector. The vector e2e_2 is a generalized eigenvector.

68.9 Example: A Larger Jordan Block

Consider

J3(λ)=[λ100λ100λ]. J_3(\lambda)= \begin{bmatrix} \lambda & 1 & 0 \\ 0 & \lambda & 1 \\ 0 & 0 & \lambda \end{bmatrix}.

Let

N=J3(λ)λI=[010001000]. N=J_3(\lambda)-\lambda I = \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{bmatrix}.

Then

Ne1=0, Ne_1=0, Ne2=e1, Ne_2=e_1, Ne3=e2. Ne_3=e_2.

Also,

N2e3=e1 N^2e_3=e_1

and

N3e3=0. N^3e_3=0.

Thus e3e_3 is a generalized eigenvector of rank 33, and

e1,e2,e3 e_1,e_2,e_3

form a Jordan chain.

68.10 Algebraic and Geometric Multiplicity in Jordan Form

Jordan form makes algebraic and geometric multiplicities visible.

For a fixed eigenvalue λ\lambda:

QuantityJordan form interpretation
Algebraic multiplicityTotal size of all Jordan blocks for λ\lambda
Geometric multiplicityNumber of Jordan blocks for λ\lambda
Largest Jordan block sizeExponent of (tλ)(t-\lambda) in the minimal polynomial

For example,

J=[4100004000004100004000004] J= \begin{bmatrix} 4 & 1 & 0 & 0 & 0 \\ 0 & 4 & 0 & 0 & 0 \\ 0 & 0 & 4 & 1 & 0 \\ 0 & 0 & 0 & 4 & 0 \\ 0 & 0 & 0 & 0 & 4 \end{bmatrix}

has three Jordan blocks for eigenvalue 44: two blocks of size 22 and one block of size 11.

Therefore:

algebraic multiplicity=5, \text{algebraic multiplicity}=5, geometric multiplicity=3. \text{geometric multiplicity}=3.

68.11 Diagonalization as a Special Case

A matrix is diagonalizable exactly when every Jordan block has size 11.

If all Jordan blocks are 1×11\times 1, then the Jordan form is diagonal:

J=[λ1000λ2000λn]. J= \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix}.

If any Jordan block has size greater than 11, the matrix is not diagonalizable.

Thus Jordan canonical form extends diagonalization to defective matrices.

68.12 Kernels and Block Sizes

The dimensions of kernels

ker(AλI),ker(AλI)2,ker(AλI)3, \ker(A-\lambda I), \quad \ker(A-\lambda I)^2, \quad \ker(A-\lambda I)^3, \quad \ldots

determine the Jordan block sizes for λ\lambda.

Let

N=AλI N=A-\lambda I

on the generalized eigenspace for λ\lambda.

Then

dimkerN \dim \ker N

is the number of Jordan blocks.

The difference

dimkerN2dimkerN \dim \ker N^2-\dim \ker N

counts how many blocks have size at least 22.

More generally,

dimkerNrdimkerNr1 \dim \ker N^r-\dim \ker N^{r-1}

counts how many blocks have size at least rr.

This gives a systematic way to recover Jordan block sizes.

68.13 Example from Kernel Dimensions

Suppose for an eigenvalue λ\lambda, the generalized eigenspace has dimension 66, and

dimkerN=2, \dim \ker N=2, dimkerN2=4, \dim \ker N^2=4, dimkerN3=5, \dim \ker N^3=5, dimkerN4=6. \dim \ker N^4=6.

Then:

dimkerNdimkerN0=2 \dim \ker N-\dim \ker N^0=2

so there are 22 blocks of size at least 11.

dimkerN2dimkerN=2 \dim \ker N^2-\dim \ker N=2

so there are 22 blocks of size at least 22.

dimkerN3dimkerN2=1 \dim \ker N^3-\dim \ker N^2=1

so there is 11 block of size at least 33.

dimkerN4dimkerN3=1 \dim \ker N^4-\dim \ker N^3=1

so there is 11 block of size at least 44.

Thus the block sizes are

4and2. 4 \qquad \text{and} \qquad 2.

The Jordan form for this eigenvalue has blocks

J4(λ)J2(λ). J_4(\lambda) \oplus J_2(\lambda).

68.14 Generalized Eigenspaces

The generalized eigenspace for eigenvalue λ\lambda is

Gλ=ker(AλI)m G_\lambda = \ker(A-\lambda I)^m

for mm large enough.

If the algebraic multiplicity of λ\lambda is aa, it is enough to take

m=a. m=a.

Thus

Gλ=ker(AλI)a. G_\lambda=\ker(A-\lambda I)^a.

The generalized eigenspace contains the ordinary eigenspace:

Eλ=ker(AλI)Gλ. E_\lambda=\ker(A-\lambda I)\subseteq G_\lambda.

The space decomposes as a direct sum of generalized eigenspaces:

V=Gλ1Gλ2Gλk. V= G_{\lambda_1}\oplus G_{\lambda_2}\oplus\cdots\oplus G_{\lambda_k}.

On each generalized eigenspace, the operator has only one eigenvalue and can be studied through a nilpotent part.

68.15 Jordan Decomposition on a Generalized Eigenspace

On GλG_\lambda, define

N=AλI. N=A-\lambda I.

Then NN is nilpotent on GλG_\lambda. This means

Nm=0 N^m=0

for some mm.

Therefore, on GλG_\lambda,

A=λI+N. A=\lambda I+N.

The Jordan form of AA on GλG_\lambda is determined by the Jordan form of the nilpotent operator NN.

This reduces the study of Jordan form to the study of nilpotent operators.

68.16 Nilpotent Jordan Form

A nilpotent matrix has only one eigenvalue:

0. 0.

Its Jordan form consists of blocks

Jk(0)=[010000100000100000]. J_k(0)= \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ 0 & 0 & 0 & \ddots & 0 \\ \vdots & \vdots & \ddots & \ddots & 1 \\ 0 & 0 & 0 & 0 & 0 \end{bmatrix}.

Thus every Jordan block for a general eigenvalue is just a nilpotent Jordan block shifted by λI\lambda I:

Jk(λ)=λI+Jk(0). J_k(\lambda)=\lambda I+J_k(0).

Nilpotent behavior measures how far a matrix is from being diagonalizable.

68.17 Powers of a Jordan Block

Let

J=λI+N, J=\lambda I+N,

where Nk=0N^k=0.

Then for a positive integer mm,

Jm=(λI+N)m. J^m=(\lambda I+N)^m.

Since λI\lambda I commutes with NN, the binomial theorem applies:

Jm=r=0k1(mr)λmrNr. J^m = \sum_{r=0}^{k-1} \binom{m}{r} \lambda^{m-r}N^r.

Terms with rkr\geq k vanish because

Nk=0. N^k=0.

This formula shows that powers of a Jordan block contain both exponential behavior from λm\lambda^m and polynomial factors from the nilpotent part.

68.18 Matrix Functions

Jordan form also explains matrix functions.

If

J=λI+N J=\lambda I+N

and Nk=0N^k=0, then for a polynomial or analytic function ff,

f(J)=f(λI+N). f(J) = f(\lambda I+N).

Using Taylor expansion,

f(J)=f(λ)I+f(λ)N+f(λ)2!N2++f(k1)(λ)(k1)!Nk1. f(J)= f(\lambda)I + f'(\lambda)N + \frac{f''(\lambda)}{2!}N^2 + \cdots + \frac{f^{(k-1)}(\lambda)}{(k-1)!}N^{k-1}.

Thus a function of a Jordan block depends not only on f(λ)f(\lambda), but also on derivatives of ff at λ\lambda. This is one reason Jordan form is theoretically powerful.

68.19 Jordan Form and Differential Equations

Consider the linear differential equation

x(t)=Ax(t). x'(t)=Ax(t).

Its solution is

x(t)=etAx(0). x(t)=e^{tA}x(0).

If

A=PJP1, A=PJP^{-1},

then

etA=PetJP1. e^{tA}=Pe^{tJ}P^{-1}.

For a Jordan block

J=λI+N, J=\lambda I+N,

we have

etJ=etλetN. e^{tJ} = e^{t\lambda}e^{tN}.

Since NN is nilpotent,

etN=I+tN+t22!N2++tk1(k1)!Nk1. e^{tN} = I+tN+\frac{t^2}{2!}N^2+\cdots+\frac{t^{k-1}}{(k-1)!}N^{k-1}.

Therefore Jordan blocks produce solutions involving

eλt,teλt,t2eλt, e^{\lambda t}, \quad t e^{\lambda t}, \quad t^2 e^{\lambda t}, \quad \ldots

This explains why repeated eigenvalues in differential equations can produce polynomial factors.

68.20 Jordan Form and Minimal Polynomial

The minimal polynomial of AA is the monic polynomial of least degree mA(t)m_A(t) such that

mA(A)=0. m_A(A)=0.

Jordan form makes it easy to read.

If the largest Jordan block for eigenvalue λ\lambda has size sλs_\lambda, then the minimal polynomial contains the factor

(tλ)sλ. (t-\lambda)^{s_\lambda}.

Thus

mA(t)=λ(tλ)sλ. m_A(t) = \prod_{\lambda} (t-\lambda)^{s_\lambda}.

The characteristic polynomial records the total size of all blocks. The minimal polynomial records the largest block size for each eigenvalue.

A matrix is diagonalizable exactly when the minimal polynomial has no repeated root.

68.21 Existence of Jordan Form

Over C\mathbb{C}, every square matrix has a Jordan canonical form.

More generally, a matrix has a Jordan canonical form over a field FF if its characteristic polynomial splits completely over FF. That means all eigenvalues lie in FF.

If the characteristic polynomial does not split, the matrix may not have Jordan form over that field.

For example, the real matrix

[0110] \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix}

has characteristic polynomial

t2+1. t^2+1.

It has no real eigenvalues, so it has no real Jordan form with real eigenvalues. Over C\mathbb{C}, it has eigenvalues ii and i-i, and it is diagonalizable.

68.22 Numerical Warning

Jordan canonical form is mainly a theoretical tool.

It is highly sensitive to small perturbations. A matrix with a repeated eigenvalue and a Jordan block may become diagonalizable after an arbitrarily small perturbation. Conversely, nearly repeated eigenvalues can make numerical Jordan computations unstable.

For numerical work, one usually prefers Schur decomposition, singular value decomposition, or other stable factorizations. Jordan form classifies exact algebraic structure, but it is usually avoided as a computational normal form in floating point arithmetic.

68.23 Common Errors

The first common error is to think that every repeated eigenvalue creates a Jordan block larger than 11. A repeated eigenvalue may still have enough eigenvectors.

The second common error is to confuse algebraic multiplicity with block size. Algebraic multiplicity is the total size of all blocks for an eigenvalue, not the size of one block.

The third common error is to forget generalized eigenvectors. Jordan bases are built from chains, not only from ordinary eigenvectors.

The fourth common error is to assume Jordan form is numerically safe. It is structurally exact but computationally unstable.

The fifth common error is to ignore the field. Jordan form with scalar eigenvalues requires the characteristic polynomial to split over the field being used.

68.24 Summary

Jordan canonical form expresses a square matrix as

A=PJP1, A=PJP^{-1},

where JJ is block diagonal and each block has the form

Jk(λ)=[λ1000λ1000λ010000λ]. J_k(\lambda)= \begin{bmatrix} \lambda & 1 & 0 & \cdots & 0 \\ 0 & \lambda & 1 & \cdots & 0 \\ 0 & 0 & \lambda & \ddots & 0 \\ \vdots & \vdots & \ddots & \ddots & 1 \\ 0 & 0 & 0 & 0 & \lambda \end{bmatrix}.

A Jordan block represents one chain of generalized eigenvectors. Diagonalization is the special case in which every Jordan block has size 11.

Jordan form records the precise failure of diagonalization. It shows how eigenvalues, eigenspaces, generalized eigenvectors, nilpotent parts, minimal polynomials, and matrix functions fit into one structure.