Skip to content

Chapter 42. Similarity Transformations

A similarity transformation changes the matrix representation of a linear operator without changing the operator itself. It is the algebraic form of a change of basis.

Let AA be an n×nn\times n matrix, and let PP be an invertible n×nn\times n matrix. The matrix

B=P1AP B=P^{-1}AP

is called similar to AA. The transformation

AP1AP A\mapsto P^{-1}AP

is called a similarity transformation, or conjugation by PP. Similar matrices represent the same linear operator written in different bases.

42.1 Linear Operators and Bases

Similarity applies to linear operators, not arbitrary maps between different spaces.

A linear operator is a linear map

T:VV T:V\to V

from a vector space to itself.

If VV is finite-dimensional and a basis BB is chosen, then TT has a matrix representation

[T]B. [T]_B.

If a different basis CC is chosen, then TT has another matrix representation

[T]C. [T]_C.

These two matrices may look different, but they describe the same operator. Similarity is the relation between them.

42.2 Change of Basis

Let

B=(v1,,vn) B=(v_1,\ldots,v_n)

and

C=(w1,,wn) C=(w_1,\ldots,w_n)

be two ordered bases of VV.

Let PP be the change-of-coordinates matrix from CC-coordinates to BB-coordinates:

[v]B=P[v]C. [v]_B=P[v]_C.

Then

[v]C=P1[v]B. [v]_C=P^{-1}[v]_B.

Suppose

A=[T]B. A=[T]_B.

This means

[T(v)]B=A[v]B. [T(v)]_B=A[v]_B.

Using the change of coordinates,

[T(v)]C=P1[T(v)]B. [T(v)]_C=P^{-1}[T(v)]_B.

Substitute the action of AA:

[T(v)]C=P1A[v]B. [T(v)]_C=P^{-1}A[v]_B.

Since

[v]B=P[v]C, [v]_B=P[v]_C,

we get

[T(v)]C=P1AP[v]C. [T(v)]_C=P^{-1}AP[v]_C.

Therefore

[T]C=P1AP. [T]_C=P^{-1}AP.

This is the similarity formula.

42.3 Meaning of the Formula

The expression

P1AP P^{-1}AP

has three steps.

Starting with coordinates in the new basis CC, first multiply by PP. This converts the vector into old basis coordinates.

Then multiply by AA. This applies the operator in the old coordinate system.

Finally multiply by P1P^{-1}. This converts the result back into the new basis.

Thus

P1AP P^{-1}AP

means:

new coordinatesold coordinatesapply operatornew coordinates. \text{new coordinates} \to \text{old coordinates} \to \text{apply operator} \to \text{new coordinates}.

The operator has not changed. Only its coordinate description has changed.

42.4 Similar Matrices

Two square matrices AA and BB are similar if there exists an invertible matrix PP such that

B=P1AP. B=P^{-1}AP.

The matrix PP is the change-of-basis matrix.

Similarity is only defined for square matrices of the same size. This is because a linear operator has the same domain and codomain, so the same vector space is being described with different bases.

The notation

AB A\sim B

is often used to mean that AA and BB are similar.

42.5 Similarity Is an Equivalence Relation

Similarity is reflexive, symmetric, and transitive.

It is reflexive because

A=I1AI. A=I^{-1}AI.

So every square matrix is similar to itself.

It is symmetric because if

B=P1AP, B=P^{-1}AP,

then

A=PBP1. A=PBP^{-1}.

Equivalently,

A=(P1)1B(P1). A=(P^{-1})^{-1}B(P^{-1}).

Thus AA is similar to BB.

It is transitive because if

B=P1AP B=P^{-1}AP

and

C=Q1BQ, C=Q^{-1}BQ,

then

C=Q1(P1AP)Q=(PQ)1A(PQ). C=Q^{-1}(P^{-1}AP)Q=(PQ)^{-1}A(PQ).

Thus CC is similar to AA.

Therefore similarity partitions square matrices into equivalence classes. Each class consists of all matrices representing the same operator in different bases.

42.6 Example

Let

A=[2103] A= \begin{bmatrix} 2 & 1\\ 0 & 3 \end{bmatrix}

and let

P=[1101]. P= \begin{bmatrix} 1 & 1\\ 0 & 1 \end{bmatrix}.

Then

P1=[1101]. P^{-1}= \begin{bmatrix} 1 & -1\\ 0 & 1 \end{bmatrix}.

Compute

B=P1AP. B=P^{-1}AP.

First,

AP=[2103][1101]=[2303]. AP= \begin{bmatrix} 2 & 1\\ 0 & 3 \end{bmatrix} \begin{bmatrix} 1 & 1\\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 2 & 3\\ 0 & 3 \end{bmatrix}.

Then

P1AP=[1101][2303]=[2003]. P^{-1}AP= \begin{bmatrix} 1 & -1\\ 0 & 1 \end{bmatrix} \begin{bmatrix} 2 & 3\\ 0 & 3 \end{bmatrix} = \begin{bmatrix} 2 & 0\\ 0 & 3 \end{bmatrix}.

So

A A

is similar to

B=[2003]. B= \begin{bmatrix} 2 & 0\\ 0 & 3 \end{bmatrix}.

The operator has not changed. In the new basis, its matrix is diagonal.

42.7 Why Similarity Matters

Similarity is useful because one matrix representation may be easier to understand than another.

A complicated matrix may become diagonal in a suitable basis. A matrix that cannot be diagonalized may still become Jordan form or rational canonical form. These simpler forms reveal structure that is hard to see in the original coordinates.

The central question is:

Given a matrix AA, can we choose a basis in which the same operator has a simpler matrix?

Similarity is the mathematical language for that question.

42.8 Diagonalization as Similarity

A matrix AA is diagonalizable if it is similar to a diagonal matrix.

That is, there exists an invertible matrix PP and a diagonal matrix DD such that

D=P1AP. D=P^{-1}AP.

Equivalently,

A=PDP1. A=PDP^{-1}.

The columns of PP are eigenvectors of AA. The diagonal entries of DD are the corresponding eigenvalues.

If

P=[v1v2vn], P= \begin{bmatrix} | & | & & |\\ v_1 & v_2 & \cdots & v_n\\ | & | & & | \end{bmatrix},

and

Avi=λivi, Av_i=\lambda_i v_i,

then

AP=PD. AP=PD.

Multiplying on the left by P1P^{-1}, we obtain

P1AP=D. P^{-1}AP=D.

Thus diagonalization is a similarity transformation into an eigenvector basis.

42.9 Invariants Under Similarity

Similar matrices share properties that belong to the underlying operator rather than to a particular basis.

If

B=P1AP, B=P^{-1}AP,

then AA and BB have the same rank, determinant, trace, characteristic polynomial, eigenvalues, and algebraic multiplicities.

These are called similarity invariants.

InvariantReason
RankMultiplication by invertible matrices preserves rank
Determinantdet(P1AP)=det(A)\det(P^{-1}AP)=\det(A)
Tracetr(P1AP)=tr(A)\operatorname{tr}(P^{-1}AP)=\operatorname{tr}(A)
Characteristic polynomialdet(λIP1AP)=det(λIA)\det(\lambda I-P^{-1}AP)=\det(\lambda I-A)
EigenvaluesRoots of the characteristic polynomial
Minimal polynomialPolynomial relations are preserved

Similarity invariants help determine whether two matrices can represent the same operator in different bases.

42.10 Determinant Is Preserved

Let

B=P1AP. B=P^{-1}AP.

Then

det(B)=det(P1AP). \det(B)=\det(P^{-1}AP).

Using multiplicativity of determinant,

det(B)=det(P1)det(A)det(P). \det(B)=\det(P^{-1})\det(A)\det(P).

Since

det(P1)=1det(P), \det(P^{-1})=\frac{1}{\det(P)},

we get

det(B)=det(A). \det(B)=\det(A).

Thus similar matrices have the same determinant.

The determinant is therefore a property of the linear operator, not merely of one coordinate representation.

42.11 Trace Is Preserved

The trace is also preserved by similarity.

Using the identity

tr(XY)=tr(YX) \operatorname{tr}(XY)=\operatorname{tr}(YX)

for square matrices of compatible size, we have

tr(P1AP)=tr(APP1)=tr(A). \operatorname{tr}(P^{-1}AP) = \operatorname{tr}(APP^{-1}) = \operatorname{tr}(A).

Thus

tr(B)=tr(A). \operatorname{tr}(B)=\operatorname{tr}(A).

The trace is the sum of diagonal entries, but its value is independent of basis for a linear operator.

42.12 Characteristic Polynomial Is Preserved

Let

B=P1AP. B=P^{-1}AP.

The characteristic polynomial of BB is

det(λIB). \det(\lambda I-B).

Substitute B=P1APB=P^{-1}AP:

det(λIP1AP). \det(\lambda I-P^{-1}AP).

Since

λI=P1(λI)P, \lambda I=P^{-1}(\lambda I)P,

we have

λIP1AP=P1(λIA)P. \lambda I-P^{-1}AP = P^{-1}(\lambda I-A)P.

Therefore

det(λIB)=det(P1(λIA)P). \det(\lambda I-B) = \det(P^{-1}(\lambda I-A)P).

Using determinant multiplicativity,

det(λIB)=det(P1)det(λIA)det(P). \det(\lambda I-B) = \det(P^{-1})\det(\lambda I-A)\det(P).

Hence

det(λIB)=det(λIA). \det(\lambda I-B)=\det(\lambda I-A).

So similar matrices have the same characteristic polynomial and the same eigenvalues.

42.13 Eigenvectors Under Similarity

Eigenvalues are preserved by similarity, but eigenvectors change coordinates.

Suppose

Av=λv. Av=\lambda v.

Let

B=P1AP. B=P^{-1}AP.

Set

w=P1v. w=P^{-1}v.

Then

Bw=P1AP(P1v)=P1Av=P1(λv)=λP1v=λw. Bw=P^{-1}AP(P^{-1}v)=P^{-1}Av=P^{-1}(\lambda v)=\lambda P^{-1}v=\lambda w.

Thus ww is an eigenvector of BB with the same eigenvalue.

The eigenvector has changed because the coordinate system has changed. The eigendirection as part of the abstract operator remains the same.

42.14 Similarity and Powers

Similarity behaves well with powers.

If

B=P1AP, B=P^{-1}AP,

then

B2=(P1AP)(P1AP)=P1A2P. B^2=(P^{-1}AP)(P^{-1}AP)=P^{-1}A^2P.

By induction,

Bk=P1AkP B^k=P^{-1}A^kP

for every integer k0k\geq 0.

If AA is invertible, the formula also holds for negative integers:

B1=P1A1P. B^{-1}=P^{-1}A^{-1}P.

Thus powers of similar matrices remain similar.

This matters in difference equations, Markov chains, iterative methods, and matrix functions.

42.15 Similarity and Polynomials in a Matrix

Let

p(t)=a0+a1t++aktk p(t)=a_0+a_1t+\cdots+a_kt^k

be a polynomial. If

B=P1AP, B=P^{-1}AP,

then

p(B)=P1p(A)P. p(B)=P^{-1}p(A)P.

This follows from the power formula and linearity:

p(B)=a0I+a1B++akBk. p(B)=a_0I+a_1B+\cdots+a_kB^k.

Substitute

Bj=P1AjP. B^j=P^{-1}A^jP.

Then

p(B)=P1(a0I+a1A++akAk)P. p(B)=P^{-1}(a_0I+a_1A+\cdots+a_kA^k)P.

So

p(B)=P1p(A)P. p(B)=P^{-1}p(A)P.

Consequently, polynomial identities are preserved under similarity. If

p(A)=0, p(A)=0,

then

p(B)=0. p(B)=0.

This explains why the minimal polynomial is a similarity invariant.

42.16 Similarity and Matrix Functions

Many functions of matrices are defined by power series or polynomial approximation. For such functions, similarity behaves naturally.

For example, the matrix exponential satisfies

eB=P1eAP e^{B}=P^{-1}e^{A}P

whenever

B=P1AP. B=P^{-1}AP.

Indeed,

eA=I+A+A22!+A33!+. e^A=I+A+\frac{A^2}{2!}+\frac{A^3}{3!}+\cdots.

Using the power formula,

eB=I+B+B22!+B33!+=P1eAP. e^B = I+B+\frac{B^2}{2!}+\frac{B^3}{3!}+\cdots = P^{-1}e^AP.

Thus changing basis before computing a matrix function gives the same result as computing the function and then changing basis.

42.17 Similarity Versus Equivalence

Similarity should be distinguished from matrix equivalence.

Two m×nm\times n matrices AA and BB are equivalent if there are invertible matrices PP and QQ such that

B=PAQ. B=PAQ.

Equivalence allows different changes of basis in the domain and codomain. It applies to linear maps

T:VW T:V\to W

between possibly different spaces.

Similarity has the special form

B=P1AP. B=P^{-1}AP.

It uses the same change of basis on both sides, because the domain and codomain are the same vector space.

Equivalence classifies linear maps by rank. Similarity classifies linear operators by deeper structure, including eigenvalues and canonical forms.

42.18 Similarity Versus Congruence

Similarity should also be distinguished from congruence.

A congruence transformation has the form

B=PTAP B=P^TAP

over the real numbers, or

B=PAP B=P^*AP

over the complex numbers.

Congruence arises naturally for bilinear forms and quadratic forms. Similarity arises naturally for linear operators.

The difference matters. Similarity preserves eigenvalues. Congruence generally does not. Congruence preserves properties such as rank and inertia for symmetric forms over the real numbers.

Thus the correct transformation law depends on the object being represented.

42.19 Orthogonal Similarity

If the change-of-basis matrix QQ is orthogonal, then

B=QTAQ B=Q^TAQ

because

Q1=QT. Q^{-1}=Q^T.

This is called orthogonal similarity.

Orthogonal similarity corresponds to changing from one orthonormal basis to another. It is especially important in numerical linear algebra because orthogonal transformations preserve lengths and are numerically stable.

For complex vector spaces, the analogous notion is unitary similarity:

B=UAU, B=U^*AU,

where UU is unitary.

Orthogonal and unitary similarities are more restrictive than general similarity, but they preserve additional metric structure.

42.20 Canonical Forms

Similarity leads to canonical forms.

A canonical form is a distinguished representative of a similarity class. It gives a standard matrix that represents the operator as simply as possible.

Important canonical forms include:

FormPurpose
Diagonal formBest case, basis of eigenvectors
Jordan formDescribes generalized eigenvectors
Rational canonical formWorks over arbitrary fields
Real canonical formHandles complex eigenvalues over R\mathbb{R}
Schur formUses unitary similarity, useful numerically

Not every matrix is diagonalizable. But every matrix over an algebraically closed field has a Jordan form. Every matrix over any field has a rational canonical form.

Canonical forms turn the classification of operators into the classification of similarity classes.

42.21 Example: Same Operator, Different Basis

Let T:R2R2T:\mathbb{R}^2\to\mathbb{R}^2 be the operator with standard matrix

A=[2103]. A= \begin{bmatrix} 2 & 1\\ 0 & 3 \end{bmatrix}.

Let the new basis be

C=([10],[11]). C= \left( \begin{bmatrix} 1\\ 0 \end{bmatrix}, \begin{bmatrix} 1\\ 1 \end{bmatrix} \right).

The change-of-basis matrix from CC-coordinates to standard coordinates is

P=[1101]. P= \begin{bmatrix} 1 & 1\\ 0 & 1 \end{bmatrix}.

As computed earlier,

[T]C=P1AP=[2003]. [T]_C=P^{-1}AP= \begin{bmatrix} 2 & 0\\ 0 & 3 \end{bmatrix}.

In the standard basis, the operator has an upper triangular matrix. In the basis CC, it is diagonal.

The diagonal form shows that the new basis vectors are eigenvectors.

42.22 Nonexample

The matrices

A=[1002] A= \begin{bmatrix} 1 & 0\\ 0 & 2 \end{bmatrix}

and

B=[1003] B= \begin{bmatrix} 1 & 0\\ 0 & 3 \end{bmatrix}

are not similar.

Their traces are different:

tr(A)=3,tr(B)=4. \operatorname{tr}(A)=3, \qquad \operatorname{tr}(B)=4.

Since trace is preserved under similarity, no invertible matrix PP can satisfy

B=P1AP. B=P^{-1}AP.

Their determinants are also different:

det(A)=2,det(B)=3. \det(A)=2, \qquad \det(B)=3.

Either invariant is enough to rule out similarity.

42.23 Similarity as Coordinate Independence

Similarity expresses coordinate independence.

A matrix often appears to be the primary object, but in many settings the primary object is the linear operator. The matrix is only the operator written in a basis.

When the basis changes, the matrix changes by

AP1AP. A\mapsto P^{-1}AP.

The quantities that survive this change are intrinsic. They belong to the operator itself.

This viewpoint explains why trace, determinant, eigenvalues, characteristic polynomial, minimal polynomial, rank, and canonical form are central. They do not depend on arbitrary coordinate choices.

42.24 Summary

Two square matrices AA and BB are similar if there exists an invertible matrix PP such that

B=P1AP. B=P^{-1}AP.

Similarity is the matrix form of change of basis for a linear operator.

If

A=[T]B, A=[T]_B,

and PP converts new coordinates to old coordinates, then

[T]C=P1AP. [T]_C=P^{-1}AP.

Similar matrices represent the same linear operator in different bases.

Similarity preserves rank, determinant, trace, characteristic polynomial, eigenvalues, minimal polynomial, and many other structural properties. It also controls diagonalization and canonical forms.

The main idea is simple: changing coordinates may change the entries of a matrix, but it does not change the operator.