Skip to content

Chapter 69. Minimal Polynomial

The minimal polynomial is the smallest monic polynomial that annihilates a matrix.

For a square matrix AA, a polynomial p(t)p(t) can be evaluated at AA by replacing tt with AA. For example, if

p(t)=t23t+2, p(t)=t^2-3t+2,

then

p(A)=A23A+2I. p(A)=A^2-3A+2I.

The minimal polynomial of AA is the monic polynomial mA(t)m_A(t) of least degree such that

mA(A)=0. m_A(A)=0.

It records the essential algebraic structure of AA. The characteristic polynomial records all eigenvalues with algebraic multiplicity. The minimal polynomial records only the largest Jordan block size for each eigenvalue. It also determines whether a matrix is diagonalizable. A matrix is diagonalizable exactly when its minimal polynomial splits into distinct linear factors.

69.1 Polynomial Evaluation at a Matrix

Let

p(t)=a0+a1t+a2t2++aktk. p(t)=a_0+a_1t+a_2t^2+\cdots+a_kt^k.

For a square matrix AA, define

p(A)=a0I+a1A+a2A2++akAk. p(A)=a_0I+a_1A+a_2A^2+\cdots+a_kA^k.

The identity matrix II is used for the constant term, since matrix addition requires all terms to be matrices of the same size.

For example, if

p(t)=t32t+5, p(t)=t^3-2t+5,

then

p(A)=A32A+5I. p(A)=A^3-2A+5I.

A polynomial pp is said to annihilate AA if

p(A)=0. p(A)=0.

The minimal polynomial is the simplest nonzero polynomial that annihilates AA.

69.2 Definition

Let AA be an n×nn\times n matrix over a field FF.

The minimal polynomial of AA is the unique monic polynomial mA(t)F[t]m_A(t)\in F[t] of least degree such that

mA(A)=0. m_A(A)=0.

Monic means that the leading coefficient is 11.

The same definition applies to a linear transformation

T:VV. T:V\to V.

The minimal polynomial mT(t)m_T(t) is the unique monic polynomial of least degree satisfying

mT(T)=0. m_T(T)=0.

The minimal polynomial belongs to the operator itself. If two matrices represent the same linear map in different bases, they have the same minimal polynomial.

69.3 Existence

A nonzero annihilating polynomial always exists for a matrix.

Let AA be n×nn\times n. The vector space of all n×nn\times n matrices has dimension n2n^2. Therefore the n2+1n^2+1 matrices

I,A,A2,,An2 I,A,A^2,\ldots,A^{n^2}

are linearly dependent.

Hence there exist scalars

c0,c1,,cn2, c_0,c_1,\ldots,c_{n^2},

not all zero, such that

c0I+c1A++cn2An2=0. c_0I+c_1A+\cdots+c_{n^2}A^{n^2}=0.

Thus the polynomial

p(t)=c0+c1t++cn2tn2 p(t)=c_0+c_1t+\cdots+c_{n^2}t^{n^2}

annihilates AA.

Among all nonzero annihilating polynomials, choose one of smallest degree and scale it to be monic. This is the minimal polynomial.

69.4 Uniqueness

The minimal polynomial is unique.

Suppose p(t)p(t) and q(t)q(t) are both monic annihilating polynomials of least degree. Divide pp by qq:

p(t)=s(t)q(t)+r(t), p(t)=s(t)q(t)+r(t),

where either r=0r=0 or

degr<degq. \deg r<\deg q.

Evaluate at AA:

p(A)=s(A)q(A)+r(A). p(A)=s(A)q(A)+r(A).

Since

p(A)=0 p(A)=0

and

q(A)=0, q(A)=0,

we get

r(A)=0. r(A)=0.

If r0r\neq 0, then rr is an annihilating polynomial of smaller degree than qq, which is impossible. Thus

r=0. r=0.

So qq divides pp. By symmetry, pp divides qq. Since both are monic and have the same least degree,

p=q. p=q.

69.5 Divisibility Property

The minimal polynomial divides every polynomial that annihilates AA.

If

p(A)=0, p(A)=0,

then divide pp by mAm_A:

p(t)=q(t)mA(t)+r(t), p(t)=q(t)m_A(t)+r(t),

with

degr<degmA. \deg r<\deg m_A.

Evaluate at AA:

p(A)=q(A)mA(A)+r(A). p(A)=q(A)m_A(A)+r(A).

Since

p(A)=0 p(A)=0

and

mA(A)=0, m_A(A)=0,

we obtain

r(A)=0. r(A)=0.

By minimality, the only polynomial of degree smaller than mAm_A that annihilates AA is the zero polynomial. Hence

r=0. r=0.

Therefore

mA(t)p(t). m_A(t)\mid p(t).

This divisibility property characterizes the minimal polynomial. It is not merely one annihilating polynomial; it is the generator of all annihilating polynomials.

69.6 Relation to the Characteristic Polynomial

The characteristic polynomial is

pA(t)=det(tIA) p_A(t)=\det(tI-A)

or, by another sign convention,

det(AtI). \det(A-tI).

The Cayley-Hamilton theorem states that

pA(A)=0. p_A(A)=0.

Thus the characteristic polynomial annihilates AA. Since the minimal polynomial divides every annihilating polynomial, it follows that

mA(t)pA(t). m_A(t)\mid p_A(t).

Therefore the minimal polynomial is always a divisor of the characteristic polynomial. This is one common way to express the link between the minimal polynomial and the Cayley-Hamilton theorem.

69.7 Same Roots as the Characteristic Polynomial

Over an algebraically closed field, the roots of the minimal polynomial are exactly the eigenvalues of AA.

If λ\lambda is an eigenvalue, then there is a nonzero vector vv such that

Av=λv. Av=\lambda v.

For any polynomial pp,

p(A)v=p(λ)v. p(A)v=p(\lambda)v.

If p(A)=0p(A)=0, then

p(λ)v=0. p(\lambda)v=0.

Since v0v\neq 0,

p(λ)=0. p(\lambda)=0.

Therefore every annihilating polynomial must vanish at every eigenvalue. In particular, the minimal polynomial has every eigenvalue as a root.

Conversely, since mAm_A divides the characteristic polynomial, every root of mAm_A is a root of the characteristic polynomial, hence an eigenvalue.

Thus the minimal and characteristic polynomials have the same distinct roots, though their multiplicities may differ.

69.8 Example: A Diagonal Matrix

Let

A=[200020005]. A= \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 5 \end{bmatrix}.

The characteristic polynomial is

pA(t)=(t2)2(t5). p_A(t)=(t-2)^2(t-5).

But the minimal polynomial is

mA(t)=(t2)(t5). m_A(t)=(t-2)(t-5).

Indeed,

(A2I)(A5I)=0. (A-2I)(A-5I)=0.

The factor t2t-2 appears only once in the minimal polynomial, even though 22 has algebraic multiplicity 22.

This happens because the matrix is diagonal. A diagonal matrix needs only one factor for each distinct eigenvalue.

69.9 Example: A Jordan Block

Let

A=[2102]. A= \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

Then

A2I=[0100]. A-2I= \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}.

This matrix is not zero. Hence

(A2I)0. (A-2I)\neq 0.

But

(A2I)2=0. (A-2I)^2=0.

Therefore the minimal polynomial is

mA(t)=(t2)2. m_A(t)=(t-2)^2.

The characteristic polynomial is also

pA(t)=(t2)2. p_A(t)=(t-2)^2.

This matrix is not diagonalizable, and the repeated factor in the minimal polynomial detects that failure.

69.10 Example: Two Jordan Blocks with the Same Eigenvalue

Suppose

J=J3(4)J2(4). J= J_3(4)\oplus J_2(4).

The characteristic polynomial is

pJ(t)=(t4)5. p_J(t)=(t-4)^5.

The minimal polynomial is determined by the largest Jordan block for the eigenvalue 44. Since the largest block has size 33,

mJ(t)=(t4)3. m_J(t)=(t-4)^3.

The characteristic polynomial counts the total size of all Jordan blocks. The minimal polynomial records the largest block size.

69.11 General Jordan Form Rule

Suppose AA has Jordan form consisting of Jordan blocks

Jk1(λ1),,Jkr(λr). J_{k_1}(\lambda_1),\ldots,J_{k_r}(\lambda_r).

For each eigenvalue λ\lambda, let sλs_\lambda be the size of the largest Jordan block associated with λ\lambda.

Then

mA(t)=λ(tλ)sλ. m_A(t)=\prod_{\lambda}(t-\lambda)^{s_\lambda}.

The product is taken over the distinct eigenvalues.

By contrast, the characteristic polynomial is

pA(t)=λ(tλ)aλ, p_A(t)=\prod_{\lambda}(t-\lambda)^{a_\lambda},

where aλa_\lambda is the algebraic multiplicity, the total size of all Jordan blocks for λ\lambda.

Thus the two polynomials contain related but different information.

69.12 Diagonalization Criterion

A matrix AA is diagonalizable over a field FF if and only if its minimal polynomial splits over FF into distinct linear factors.

That is,

mA(t)=(tλ1)(tλ2)(tλk), m_A(t)=(t-\lambda_1)(t-\lambda_2)\cdots(t-\lambda_k),

with all λi\lambda_i distinct.

If a repeated factor appears, such as

(tλ)2, (t-\lambda)^2,

then some Jordan block for λ\lambda has size at least 22, so the matrix is not diagonalizable.

This criterion is often more compact than checking eigenspace dimensions directly.

69.13 Diagonalizable Example

Let

A=[300030001]. A= \begin{bmatrix} 3 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & -1 \end{bmatrix}.

The characteristic polynomial is

pA(t)=(t3)2(t+1). p_A(t)=(t-3)^2(t+1).

The minimal polynomial is

mA(t)=(t3)(t+1). m_A(t)=(t-3)(t+1).

This polynomial has no repeated factors. Therefore AA is diagonalizable.

In fact, it is already diagonal.

69.14 Non-Diagonalizable Example

Let

B=[310030001]. B= \begin{bmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & -1 \end{bmatrix}.

This matrix has one Jordan block of size 22 for eigenvalue 33, and one block of size 11 for eigenvalue 1-1.

Its characteristic polynomial is

pB(t)=(t3)2(t+1). p_B(t)=(t-3)^2(t+1).

Its minimal polynomial is

mB(t)=(t3)2(t+1). m_B(t)=(t-3)^2(t+1).

The repeated factor

(t3)2 (t-3)^2

shows that BB is not diagonalizable.

69.15 Computing the Minimal Polynomial

There are several ways to compute the minimal polynomial.

One method uses powers of the matrix. Search for the lowest-degree monic relation

Ak+ck1Ak1++c1A+c0I=0. A^k+c_{k-1}A^{k-1}+\cdots+c_1A+c_0I=0.

Another method uses eigenvalues and kernels. For each eigenvalue λ\lambda, find the smallest exponent ss such that

ker(AλI)s \ker(A-\lambda I)^s

equals the full generalized eigenspace for λ\lambda. This ss is the largest Jordan block size for λ\lambda.

A third method uses known structure. If the matrix is diagonal, symmetric, Hermitian, or normal over C\mathbb{C}, then it is diagonalizable, so the minimal polynomial has only distinct linear factors.

69.16 Kernel Stabilization

Let

N=AλI. N=A-\lambda I.

The sequence of subspaces

kerNkerN2kerN3 \ker N \subseteq \ker N^2 \subseteq \ker N^3 \subseteq \cdots

is increasing.

On the generalized eigenspace for λ\lambda, this sequence eventually stabilizes. The smallest exponent ss at which it reaches the full generalized eigenspace is the largest Jordan block size for λ\lambda.

Equivalently, ss is the exponent of

(tλ) (t-\lambda)

in the minimal polynomial.

The multiplicity of a root in the minimal polynomial is therefore controlled by the growth of these kernels.

69.17 Minimal Polynomial of a Projection

A projection satisfies

P2=P. P^2=P.

Therefore

P2P=0, P^2-P=0,

so

P(PI)=0. P(P-I)=0.

Thus the minimal polynomial divides

t(t1). t(t-1).

If P0P\neq 0 and PIP\neq I, then both eigenvalues 00 and 11 occur, and

mP(t)=t(t1). m_P(t)=t(t-1).

Since the polynomial has distinct linear factors, every projection is diagonalizable over any field in which 010\neq 1.

69.18 Minimal Polynomial of an Involution

An involution satisfies

A2=I. A^2=I.

Therefore

A2I=0. A^2-I=0.

So the minimal polynomial divides

t21=(t1)(t+1). t^2-1=(t-1)(t+1).

If the field has characteristic not equal to 22, these factors are distinct. Hence every involution is diagonalizable over such a field, provided the minimal polynomial splits there.

The eigenvalues of an involution are among

1and1. 1 \qquad \text{and} \qquad -1.

69.19 Minimal Polynomial of a Nilpotent Matrix

A matrix NN is nilpotent if

Nk=0 N^k=0

for some positive integer kk.

The minimal polynomial of NN has the form

mN(t)=ts, m_N(t)=t^s,

where ss is the smallest positive integer such that

Ns=0. N^s=0.

This integer ss is called the index of nilpotency.

If NN is in Jordan form, ss is the size of the largest nilpotent Jordan block.

For example, if

N=[010001000], N= \begin{bmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{bmatrix},

then

N3=0 N^3=0

but

N20. N^2\neq 0.

Thus

mN(t)=t3. m_N(t)=t^3.

69.20 Minimal Polynomial and Matrix Inverses

The minimal polynomial can express the inverse of an invertible matrix as a polynomial in the matrix.

Suppose

mA(t)=tk+ck1tk1++c1t+c0. m_A(t)=t^k+c_{k-1}t^{k-1}+\cdots+c_1t+c_0.

If AA is invertible, then 00 is not an eigenvalue, so

c00. c_0\neq 0.

Since

mA(A)=0, m_A(A)=0,

we have

Ak+ck1Ak1++c1A+c0I=0. A^k+c_{k-1}A^{k-1}+\cdots+c_1A+c_0I=0.

Rearrange:

c0I=A(Ak1+ck1Ak2++c1I). c_0I=-A(A^{k-1}+c_{k-1}A^{k-2}+\cdots+c_1I).

Multiply by c01c_0^{-1}:

A1=1c0(Ak1+ck1Ak2++c1I). A^{-1} = -\frac{1}{c_0} \left( A^{k-1}+c_{k-1}A^{k-2}+\cdots+c_1I \right).

Thus the inverse is a polynomial in AA.

69.21 Minimal Polynomial and Cyclic Vectors

A vector vv is called cyclic for AA if

v,Av,A2v,,An1v v,Av,A^2v,\ldots,A^{n-1}v

span the whole space.

If AA has a cyclic vector, then the minimal polynomial and characteristic polynomial are equal.

This happens because the action of AA on one vector already generates the entire space, so the first polynomial relation among the powers of AA must have degree nn.

Companion matrices provide standard examples where the minimal polynomial equals the characteristic polynomial.

69.22 Minimal Polynomial of a Linear Transformation

Let

T:VV T:V\to V

be a linear transformation on a finite-dimensional vector space.

The minimal polynomial mT(t)m_T(t) is the unique monic polynomial of least degree satisfying

mT(T)=0. m_T(T)=0.

If AA is the matrix of TT in some basis, then

mT(t)=mA(t). m_T(t)=m_A(t).

Changing basis replaces AA by a similar matrix

B=P1AP. B=P^{-1}AP.

For any polynomial pp,

p(B)=P1p(A)P. p(B)=P^{-1}p(A)P.

Thus

p(B)=0 p(B)=0

if and only if

p(A)=0. p(A)=0.

Therefore similar matrices have the same minimal polynomial.

69.23 What the Minimal Polynomial Does Not Determine

The minimal polynomial does not determine the matrix completely.

For example,

A=[2002] A= \begin{bmatrix} 2 & 0 \\ 0 & 2 \end{bmatrix}

has minimal polynomial

t2. t-2.

A 3×33\times 3 scalar matrix

B=[200020002] B= \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{bmatrix}

also has minimal polynomial

t2. t-2.

The matrices have different sizes.

Even among matrices of the same size, the minimal polynomial may fail to determine all Jordan block multiplicities. It gives the largest block size for each eigenvalue, but not the number of smaller blocks.

To recover full Jordan structure, one needs more information, such as the dimensions of the kernels of powers of AλIA-\lambda I.

69.24 Summary

The minimal polynomial of a square matrix AA is the unique monic polynomial of least degree satisfying

mA(A)=0. m_A(A)=0.

It divides every polynomial that annihilates AA, including the characteristic polynomial.

Over an algebraically closed field, it has the same distinct roots as the characteristic polynomial. Its exponent at each eigenvalue equals the size of the largest Jordan block for that eigenvalue.

The minimal polynomial gives a compact test for diagonalization: AA is diagonalizable exactly when mA(t)m_A(t) splits into distinct linear factors.

It is smaller than the characteristic polynomial in many cases, but often more structurally informative.