Skip to content

Chapter 61. Characteristic Polynomial

The characteristic polynomial is the polynomial equation attached to a square matrix whose roots are the eigenvalues of that matrix.

It turns the eigenvalue problem into a polynomial problem. Instead of looking directly for nonzero vectors vv satisfying

Av=λv, Av = \lambda v,

we look for scalars λ\lambda that make a certain determinant equal to zero.

For an n×nn \times n matrix AA, the characteristic polynomial is commonly written as

pA(λ)=det(AλI). p_A(\lambda) = \det(A - \lambda I).

Some books use

pA(λ)=det(λIA). p_A(\lambda) = \det(\lambda I - A).

These two conventions differ only by the sign factor (1)n(-1)^n. They have the same roots, so they give the same eigenvalues. The characteristic polynomial has degree nn, and its roots are exactly the eigenvalues of AA.

61.1 From Eigenvectors to a Polynomial

Start with the eigenvalue equation

Av=λv. Av = \lambda v.

Move all terms to one side:

Avλv=0. Av - \lambda v = 0.

Since λv=λIv\lambda v = \lambda I v, we have

(AλI)v=0. (A - \lambda I)v = 0.

This is a homogeneous system. It has a nonzero solution exactly when the matrix AλIA-\lambda I is singular. A square matrix is singular exactly when its determinant is zero. Therefore,

det(AλI)=0. \det(A-\lambda I)=0.

This equation is called the characteristic equation. Its left side is the characteristic polynomial.

61.2 Definition

Let AA be an n×nn \times n matrix over a field FF. The characteristic polynomial of AA is

pA(λ)=det(AλI). p_A(\lambda)=\det(A-\lambda I).

The scalar λ\lambda is an indeterminate. The entries of AλIA-\lambda I are polynomials in λ\lambda. Taking the determinant produces one polynomial in λ\lambda.

The eigenvalues of AA are precisely the roots of this polynomial:

pA(λ)=0. p_A(\lambda)=0.

Thus the characteristic polynomial is the algebraic object that encodes the eigenvalues.

61.3 A Two by Two Formula

Let

A=[abcd]. A = \begin{bmatrix} a & b \\ c & d \end{bmatrix}.

Then

AλI=[aλbcdλ]. A-\lambda I = \begin{bmatrix} a-\lambda & b \\ c & d-\lambda \end{bmatrix}.

The characteristic polynomial is

pA(λ)=det[aλbcdλ]. p_A(\lambda) = \det \begin{bmatrix} a-\lambda & b \\ c & d-\lambda \end{bmatrix}.

Compute the determinant:

pA(λ)=(aλ)(dλ)bc. p_A(\lambda) = (a-\lambda)(d-\lambda)-bc.

Expanding gives

pA(λ)=λ2(a+d)λ+(adbc). p_A(\lambda) = \lambda^2-(a+d)\lambda+(ad-bc).

Since

tr(A)=a+d \operatorname{tr}(A)=a+d

and

det(A)=adbc, \det(A)=ad-bc,

we get

pA(λ)=λ2tr(A)λ+det(A). p_A(\lambda) = \lambda^2-\operatorname{tr}(A)\lambda+\det(A).

This formula is useful for quick computations with 2×22 \times 2 matrices.

61.4 Example

Let

A=[4123]. A = \begin{bmatrix} 4 & 1 \\ 2 & 3 \end{bmatrix}.

Then

AλI=[4λ123λ]. A-\lambda I = \begin{bmatrix} 4-\lambda & 1 \\ 2 & 3-\lambda \end{bmatrix}.

Hence

pA(λ)=(4λ)(3λ)2. p_A(\lambda) = (4-\lambda)(3-\lambda)-2.

Expand:

pA(λ)=127λ+λ22. p_A(\lambda) = 12-7\lambda+\lambda^2-2.

Therefore,

pA(λ)=λ27λ+10. p_A(\lambda)=\lambda^2-7\lambda+10.

Factor:

pA(λ)=(λ5)(λ2). p_A(\lambda)=(\lambda-5)(\lambda-2).

The roots are

λ=5andλ=2. \lambda=5 \qquad \text{and} \qquad \lambda=2.

Thus the eigenvalues of AA are 55 and 22.

61.5 Characteristic Equation

The equation

pA(λ)=0 p_A(\lambda)=0

is called the characteristic equation of AA.

For the previous matrix, the characteristic equation is

λ27λ+10=0. \lambda^2-7\lambda+10=0.

Solving it gives the eigenvalues.

The characteristic equation usually does not give the eigenvectors directly. After finding an eigenvalue λ\lambda, one finds the eigenvectors by solving

(AλI)v=0. (A-\lambda I)v=0.

Thus the eigenvalue computation has two stages:

StageOperationOutput
1Solve det(AλI)=0\det(A-\lambda I)=0Eigenvalues
2Solve (AλI)v=0(A-\lambda I)v=0Eigenvectors

61.6 Degree

If AA is an n×nn \times n matrix, then pA(λ)p_A(\lambda) has degree nn.

This follows from the determinant expansion. The diagonal entries of AλIA-\lambda I have the form

aiiλ. a_{ii}-\lambda.

The product of all diagonal terms contributes a term of degree nn:

(λ)n. (-\lambda)^n.

No other term can have higher degree. Hence the characteristic polynomial has degree nn.

With the convention

pA(λ)=det(AλI), p_A(\lambda)=\det(A-\lambda I),

the leading term is

(1)nλn. (-1)^n\lambda^n.

With the convention

qA(λ)=det(λIA), q_A(\lambda)=\det(\lambda I-A),

the leading term is

λn. \lambda^n.

The roots are the same under both conventions.

61.7 Constant Term

The constant term of

pA(λ)=det(AλI) p_A(\lambda)=\det(A-\lambda I)

is found by setting λ=0\lambda=0:

pA(0)=det(A). p_A(0)=\det(A).

Therefore, the constant term of the characteristic polynomial is the determinant of AA.

This gives an important relation. If the eigenvalues are

λ1,λ2,,λn, \lambda_1,\lambda_2,\ldots,\lambda_n,

counted with algebraic multiplicity, then

det(A)=λ1λ2λn. \det(A)=\lambda_1\lambda_2\cdots\lambda_n.

The determinant is the product of the eigenvalues.

61.8 Trace Term

For an n×nn \times n matrix, the coefficient of the next-highest power of λ\lambda is determined by the trace.

With the convention

qA(λ)=det(λIA), q_A(\lambda)=\det(\lambda I-A),

the characteristic polynomial has the form

qA(λ)=λntr(A)λn1++(1)ndet(A). q_A(\lambda) = \lambda^n-\operatorname{tr}(A)\lambda^{n-1}+\cdots+(-1)^n\det(A).

Thus, if the eigenvalues are

λ1,λ2,,λn, \lambda_1,\lambda_2,\ldots,\lambda_n,

then

tr(A)=λ1+λ2++λn. \operatorname{tr}(A)=\lambda_1+\lambda_2+\cdots+\lambda_n.

The trace is the sum of the eigenvalues, counted with algebraic multiplicity. The characteristic polynomial encodes determinant and trace among its coefficients.

61.9 Algebraic Multiplicity

An eigenvalue may occur more than once as a root of the characteristic polynomial.

The number of times an eigenvalue appears as a root is called its algebraic multiplicity.

For example,

pA(λ)=(λ2)3(λ+1) p_A(\lambda)=(\lambda-2)^3(\lambda+1)

has eigenvalues

2and1. 2 \qquad \text{and} \qquad -1.

The eigenvalue 22 has algebraic multiplicity 33. The eigenvalue 1-1 has algebraic multiplicity 11.

The sum of all algebraic multiplicities is the degree of the characteristic polynomial. For an n×nn \times n matrix, this sum is nn.

61.10 Geometric Multiplicity Compared

The algebraic multiplicity of an eigenvalue comes from the characteristic polynomial.

The geometric multiplicity comes from the eigenspace:

Eλ=ker(AλI). E_\lambda=\ker(A-\lambda I).

The geometric multiplicity is

dimEλ. \dim E_\lambda.

For every eigenvalue,

1geometric multiplicityalgebraic multiplicity. 1 \leq \text{geometric multiplicity} \leq \text{algebraic multiplicity}.

The characteristic polynomial tells how many times an eigenvalue appears algebraically. The eigenspace tells how many independent eigenvectors belong to it.

These two numbers need not be equal.

61.11 Repeated Root Example

Consider

A=[2102]. A = \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}.

Then

AλI=[2λ102λ]. A-\lambda I = \begin{bmatrix} 2-\lambda & 1 \\ 0 & 2-\lambda \end{bmatrix}.

The determinant is

pA(λ)=(2λ)2. p_A(\lambda)=(2-\lambda)^2.

Thus λ=2\lambda=2 is an eigenvalue with algebraic multiplicity 22.

Now compute the eigenspace:

A2I=[0100]. A-2I = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}.

Solving

(A2I)v=0 (A-2I)v=0

gives

y=0. y=0.

Hence

E2=span{[10]}. E_2= \operatorname{span} \left\{ \begin{bmatrix} 1 \\ 0 \end{bmatrix} \right\}.

The geometric multiplicity is 11.

The characteristic polynomial has a repeated root, but the matrix has only one independent eigenvector.

61.12 Similar Matrices

Two square matrices AA and BB are similar if there is an invertible matrix PP such that

B=P1AP. B=P^{-1}AP.

Similar matrices represent the same linear transformation in different bases.

They have the same characteristic polynomial.

Indeed,

BλI=P1APλI. B-\lambda I = P^{-1}AP-\lambda I.

Since

I=P1IP, I=P^{-1}IP,

we have

BλI=P1(AλI)P. B-\lambda I = P^{-1}(A-\lambda I)P.

Taking determinants,

det(BλI)=det(P1)det(AλI)det(P). \det(B-\lambda I) = \det(P^{-1})\det(A-\lambda I)\det(P).

Since

det(P1)det(P)=1, \det(P^{-1})\det(P)=1,

it follows that

det(BλI)=det(AλI). \det(B-\lambda I)=\det(A-\lambda I).

Thus similar matrices have the same characteristic polynomial and the same eigenvalues.

61.13 Characteristic Polynomial of Diagonal Matrices

Let

D=[d1000d2000dn]. D = \begin{bmatrix} d_1 & 0 & \cdots & 0 \\ 0 & d_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & d_n \end{bmatrix}.

Then

DλI=[d1λ000d2λ000dnλ]. D-\lambda I = \begin{bmatrix} d_1-\lambda & 0 & \cdots & 0 \\ 0 & d_2-\lambda & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & d_n-\lambda \end{bmatrix}.

The determinant of a diagonal matrix is the product of its diagonal entries, so

pD(λ)=(d1λ)(d2λ)(dnλ). p_D(\lambda) = (d_1-\lambda)(d_2-\lambda)\cdots(d_n-\lambda).

Therefore the eigenvalues are exactly the diagonal entries.

61.14 Characteristic Polynomial of Triangular Matrices

If AA is upper triangular or lower triangular, then AλIA-\lambda I is also triangular.

For a triangular matrix, the determinant is the product of the diagonal entries.

Thus, if

A=[a110a2200ann], A = \begin{bmatrix} a_{11} & * & \cdots & * \\ 0 & a_{22} & \cdots & * \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & a_{nn} \end{bmatrix},

then

pA(λ)=(a11λ)(a22λ)(annλ). p_A(\lambda) = (a_{11}-\lambda)(a_{22}-\lambda)\cdots(a_{nn}-\lambda).

The eigenvalues of a triangular matrix are its diagonal entries.

This fact is central in numerical methods, especially the QR algorithm and Schur decomposition.

61.15 Characteristic Polynomial and Diagonalization

If AA is diagonalizable, then

A=PDP1, A=PDP^{-1},

where DD is diagonal.

Since AA and DD are similar, they have the same characteristic polynomial.

If

D=[λ1000λ2000λn], D= \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix},

then

pA(t)=pD(t)=(λ1t)(λ2t)(λnt) p_A(t)=p_D(t) = (\lambda_1-t)(\lambda_2-t)\cdots(\lambda_n-t)

under the convention pA(t)=det(AtI)p_A(t)=\det(A-tI).

Thus diagonalization makes the characteristic polynomial transparent.

61.16 Complex Roots

Over the real numbers, a characteristic polynomial may have no real roots.

For example,

A=[0110] A = \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix}

has

pA(λ)=det[λ11λ]=λ2+1. p_A(\lambda) = \det \begin{bmatrix} -\lambda & -1 \\ 1 & -\lambda \end{bmatrix} = \lambda^2+1.

There are no real roots.

Over the complex numbers,

λ2+1=0 \lambda^2+1=0

has roots

iandi. i \qquad \text{and} \qquad -i.

Thus the matrix has complex eigenvalues.

For this reason, spectral theory is often developed over C\mathbb{C}. Over the complex numbers, every degree nn characteristic polynomial has exactly nn roots counted with multiplicity.

61.17 Characteristic Polynomial and Invertibility

A square matrix AA is invertible exactly when 00 is not an eigenvalue.

Using the characteristic polynomial,

0 is an eigenvalue 0 \text{ is an eigenvalue}

exactly when

pA(0)=0. p_A(0)=0.

But

pA(0)=det(A). p_A(0)=\det(A).

Therefore,

A is invertible A \text{ is invertible}

exactly when

det(A)0. \det(A)\neq 0.

This connects three equivalent facts:

StatementMeaning
AA is invertibleThe transformation can be undone
det(A)0\det(A)\neq 0The matrix is nonsingular
00 is not an eigenvalueNo nonzero vector is sent to zero

61.18 Characteristic Polynomial of a Linear Transformation

The characteristic polynomial can be defined for a linear transformation

T:VV T:V\to V

on a finite-dimensional vector space.

Choose a basis of VV, and let AA be the matrix of TT in that basis. Define

pT(λ)=pA(λ). p_T(\lambda)=p_A(\lambda).

This definition is well-defined because changing the basis replaces AA by a similar matrix, and similar matrices have the same characteristic polynomial.

Thus the characteristic polynomial belongs to the linear transformation itself, not merely to a particular matrix representation.

61.19 What the Characteristic Polynomial Does Not Tell Alone

The characteristic polynomial gives the eigenvalues and their algebraic multiplicities. It also encodes determinant and trace.

However, it does not by itself determine the matrix.

Different matrices can have the same characteristic polynomial.

For example,

A=[2002] A = \begin{bmatrix} 2 & 0 \\ 0 & 2 \end{bmatrix}

and

B=[2102] B = \begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}

both have characteristic polynomial

(2λ)2. (2-\lambda)^2.

But AA has two independent eigenvectors, while BB has only one.

To understand the full structure, one also studies eigenspaces, minimal polynomials, Jordan form, and invariant subspaces.

61.20 Summary

The characteristic polynomial of a square matrix AA is

pA(λ)=det(AλI). p_A(\lambda)=\det(A-\lambda I).

Its roots are the eigenvalues of AA. Its degree is the size of the matrix. Its constant term is det(A)\det(A), and its next-highest coefficient is governed by tr(A)\operatorname{tr}(A).

The characteristic polynomial translates the eigenvalue problem into a polynomial equation. It is one of the main bridges between matrices, determinants, eigenvalues, diagonalization, and spectral theory.