A similarity transformation changes the matrix representation of a linear operator without changing the operator itself. It is the algebraic form of a change of basis.
Let $A$ be an $n \times n$ matrix, and let $P$ be an invertible $n \times n$ matrix. The matrix
$$B = P^{-1} A P$$
is called similar to $A$. The transformation
$$A \mapsto P^{-1} A P$$
is called a similarity transformation, or conjugation by $P$. Similar matrices represent the same linear operator written in different bases.
42.1 Linear Operators and Bases
Similarity applies to linear operators, not arbitrary maps between different spaces.
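A linear operator is a linear map
$$T : V \to V$$
from a vector space $V$ to itself.
If $V$ is finite-dimensional and a basis $\mathcal{B}$ is chosen, then $T$ has a matrix representation
$$A = [T]_{\mathcal{B}}.$$
If a different basis $\mathcal{C}$ is chosen, then $T$ has another matrix representation
$$B = [T]_{\mathcal{C}}.$$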
These two matrices may look different, but they describe the same operator. Similarity is the relation between them.
42.2 Change of Basis
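Let
$$\mathcal{B} = (v_1, \ldots, v_n)$$
and
$$\mathcal{C} = (w_1, \ldots, w_n)$$
be two ordered bases of $V$.
Let $P$ be the change-of-coordinates matrix from $\mathcal{C}$-coordinates to $\mathcal{B}$-coordinates:
$$[x]_{\mathcal{B}} = P [x]_{\mathcal{C}}.$$
Then
$$[x]_{\mathcal{C}} = P^{-1} [x]_{\mathcal{B}}.$$
Suppose
$$A = [T]_{\mathcal{B}}.$$
This means
$$[T(x)]_{\mathcal{B}} = A [x]_{\mathcal{B}}.$$
Using the change of coordinates,
$$[T(x)]_{\mathcal{C}} = P^{-1} [T(x)]_{\mathcal{B}}.$$
Substitute the action of $T$:
$$[T(x)]_{\mathcal{C}} = P^{-1} A [x]_{\mathcal{B}}.$$
Since
$$[x]_{\mathcal{B}} = P [x]_{\mathcal{C}},$$
we get
$$[T(x)]_{\mathcal{C}} = P^{-1} A P [x]_{\mathcal{C}}.$$
Therefore
$$[T]_{\mathcal{C}} = P^{-1} A P.$$
This is the similarity formula.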
42.3 Meaning of the Formula
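The expression
$$B = P^{-1} A P$$
has three steps, read from right to left.
Starting with coordinates in the new basis $\mathcal{C}$, first multiply by $P$. This converts the vector into old basis coordinates.
Then multiply by $A$. This applies the operator in the old coordinate system.
Finally multiply by $P^{-1}$. This converts the result back into the new basis.
Thus
$$B [x]_{\mathcal{C}} = P^{-1} A P [x]_{\mathcal{C}}$$
means:
$$\text{new coordinates} \;\xrightarrow{\;P\;}\; \text{old coordinates} \;\xrightarrow{\;A\;}\; \text{old coordinates of } T(x) \;\xrightarrow{\;P^{-1}\;}\; \text{new coordinates of } T(x).$$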
The operator has not changed. Only its coordinate description has changed.
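
The three steps can be traced on a concrete vector. The sketch below is only an illustration: the matrices `A`, `P`, `P_inv`, `B` and the helper `matvec` are assumptions chosen for the example, with `P_inv` and `B` worked out by hand.

```python
def matvec(M, x):
    """Multiply a matrix (list of rows) by a coordinate vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


# Illustrative data (assumed for this sketch).
A = [[2, 1], [0, 3]]        # matrix of T in the old basis
P = [[1, 1], [0, 1]]        # converts new coordinates to old coordinates
P_inv = [[1, -1], [0, 1]]   # converts old coordinates back to new ones
B = [[2, 0], [0, 3]]        # P_inv * A * P, computed by hand

x_new = [5, 7]                    # coordinates in the new basis
x_old = matvec(P, x_new)          # step 1: new -> old
Tx_old = matvec(A, x_old)         # step 2: apply the operator
Tx_new = matvec(P_inv, Tx_old)    # step 3: old -> new

# The three-step route agrees with multiplying by B directly.
assert Tx_new == matvec(B, x_new)
```

Both routes give the same new-basis coordinates for the image vector, which is the content of the formula.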
42.4 Similar Matrices
Two square matrices $A$ and $B$ are similar if there exists an invertible matrix $P$ such that
$$B = P^{-1} A P.$$
The matrix $P$ is the change-of-basis matrix.
Similarity is only defined for square matrices of the same size. This is because a linear operator has the same domain and codomain, so the same vector space is being described with different bases.
The notation
$$A \sim B$$
is often used to mean that $A$ and $B$ are similar.
42.5 Similarity Is an Equivalence Relation
Similarity is reflexive, symmetric, and transitive.
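It is reflexive because
$$A = I^{-1} A I.$$
So every square matrix is similar to itself.
It is symmetric because if
$$B = P^{-1} A P,$$
then
$$A = P B P^{-1}.$$
Equivalently,
$$A = (P^{-1})^{-1} B (P^{-1}).$$
Thus $A$ is similar to $B$.
It is transitive because if
$$B = P^{-1} A P$$
and
$$C = Q^{-1} B Q,$$
then
$$C = Q^{-1} P^{-1} A P Q = (PQ)^{-1} A (PQ).$$
Thus $C$ is similar to $A$.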
Therefore similarity partitions square matrices into equivalence classes. Each class consists of all matrices representing the same operator in different bases.
42.6 Example
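Let
$$A = \begin{bmatrix} 2 & 1 \\ 0 & 3 \end{bmatrix}$$
and let
$$P = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}.$$
Then
$$P^{-1} = \begin{bmatrix} 1 & -1 \\ 0 & 1 \end{bmatrix}.$$
Compute
$$P^{-1} A P.$$
First,
$$AP = \begin{bmatrix} 2 & 1 \\ 0 & 3 \end{bmatrix} \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 2 & 3 \\ 0 & 3 \end{bmatrix}.$$
Then
$$P^{-1}(AP) = \begin{bmatrix} 1 & -1 \\ 0 & 1 \end{bmatrix} \begin{bmatrix} 2 & 3 \\ 0 & 3 \end{bmatrix} = \begin{bmatrix} 2 & 0 \\ 0 & 3 \end{bmatrix}.$$
So
$$A = \begin{bmatrix} 2 & 1 \\ 0 & 3 \end{bmatrix}$$
is similar to
$$D = \begin{bmatrix} 2 & 0 \\ 0 & 3 \end{bmatrix}.$$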
The operator has not changed. In the new basis, its matrix is diagonal.
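
A computation of this kind is easy to check mechanically. The following is a minimal sketch in exact rational arithmetic; the helpers `matmul` and `inv2` are hypothetical names written for this illustration, and the matrices `A` and `P` are an illustrative upper triangular example.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def inv2(M):
    """Exact inverse of a 2x2 matrix (raises if the determinant is zero)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


A = [[2, 1], [0, 3]]      # illustrative upper triangular matrix
P = [[1, 1], [0, 1]]      # illustrative change-of-basis matrix

P_inv = inv2(P)           # first invert P
AP = matmul(A, P)         # then form A P
B = matmul(P_inv, AP)     # finally P^{-1} (A P)

# The result is diagonal: the off-diagonal entries vanish.
assert B[0][1] == 0 and B[1][0] == 0
```

Working with `Fraction` avoids rounding, so the zero off-diagonal entries are exact rather than approximately zero.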
42.7 Why Similarity Matters
Similarity is useful because one matrix representation may be easier to understand than another.
A complicated matrix may become diagonal in a suitable basis. A matrix that cannot be diagonalized may still become Jordan form or rational canonical form. These simpler forms reveal structure that is hard to see in the original coordinates.
The central question is:
Given a matrix $A$, can we choose a basis in which the same operator has a simpler matrix?
Similarity is the mathematical language for that question.
42.8 Diagonalization as Similarity
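A matrix $A$ is diagonalizable if it is similar to a diagonal matrix.
That is, there exist an invertible matrix $P$ and a diagonal matrix $D$ such that
$$D = P^{-1} A P.$$
Equivalently,
$$A = P D P^{-1}.$$
The columns of $P$ are eigenvectors of $A$. The diagonal entries of $D$ are the corresponding eigenvalues.
If
$$P = \begin{bmatrix} v_1 & v_2 & \cdots & v_n \end{bmatrix}, \qquad D = \operatorname{diag}(\lambda_1, \ldots, \lambda_n),$$
and
$$A v_i = \lambda_i v_i \quad \text{for } i = 1, \ldots, n,$$
then
$$AP = \begin{bmatrix} \lambda_1 v_1 & \lambda_2 v_2 & \cdots & \lambda_n v_n \end{bmatrix} = PD.$$
Multiplying on the left by $P^{-1}$, we obtain
$$P^{-1} A P = D.$$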
Thus diagonalization is a similarity transformation into an eigenvector basis.
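
The column-by-column identity $AP = PD$ can be checked directly once eigenpairs are known. The sketch below assumes an illustrative matrix `A` whose eigenpairs were found by hand; `matmul` and `eigenpairs` are names introduced only for this example.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


A = [[2, 1], [0, 3]]                       # illustrative matrix
eigenpairs = [(2, [1, 0]), (3, [1, 1])]    # (eigenvalue, eigenvector), found by hand
n = len(A)

# P has the eigenvectors as columns; D has the eigenvalues on its diagonal.
P = [[eigenpairs[j][1][i] for j in range(n)] for i in range(n)]
D = [[eigenpairs[i][0] if i == j else 0 for j in range(n)] for i in range(n)]

# A P = P D expresses A v_i = lambda_i v_i for every column at once.
assert matmul(A, P) == matmul(P, D)
```

The order of the eigenvalues in `D` must match the order of the eigenvector columns in `P`; permuting both together gives another valid diagonalization.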
42.9 Invariants Under Similarity
Similar matrices share properties that belong to the underlying operator rather than to a particular basis.
If
$$B = P^{-1} A P,$$
then $A$ and $B$ have the same rank, determinant, trace, characteristic polynomial, eigenvalues, and algebraic multiplicities.
These are called similarity invariants.
| Invariant | Reason |
|---|---|
| Rank | Multiplication by invertible matrices preserves rank |
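| Determinant | $\det(P^{-1}AP) = \det(P^{-1})\det(A)\det(P) = \det(A)$ |
| Trace | $\operatorname{tr}(P^{-1}AP) = \operatorname{tr}(APP^{-1}) = \operatorname{tr}(A)$ |
| Characteristic polynomial | $\det(\lambda I - P^{-1}AP) = \det(\lambda I - A)$ |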
| Eigenvalues | Roots of the characteristic polynomial |
| Minimal polynomial | Polynomial relations are preserved |
Similarity invariants help determine whether two matrices can represent the same operator in different bases.
42.10 Determinant Is Preserved
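Let
$$B = P^{-1} A P.$$
Then
$$\det(B) = \det(P^{-1} A P).$$
Using multiplicativity of determinant,
$$\det(B) = \det(P^{-1}) \det(A) \det(P).$$
Since
$$\det(P^{-1}) = \frac{1}{\det(P)},$$
we get
$$\det(B) = \det(A).$$
Thus similar matrices have the same determinant.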
The determinant is therefore a property of the linear operator, not merely of one coordinate representation.
42.11 Trace Is Preserved
The trace is also preserved by similarity.
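Using the identity
$$\operatorname{tr}(XY) = \operatorname{tr}(YX)$$
for square matrices of compatible size, we have
$$\operatorname{tr}(P^{-1} A P) = \operatorname{tr}(A P P^{-1}) = \operatorname{tr}(A).$$
Thus
$$\operatorname{tr}(B) = \operatorname{tr}(A).$$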
The trace is the sum of diagonal entries, but its value is independent of basis for a linear operator.
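
The invariance of determinant and trace can be observed on a pair of similar matrices whose entries look unrelated. This is a small sketch under assumed data: `A` and `P` are illustrative choices, and `matmul`, `inv2`, `trace`, and `det2` are hypothetical helpers for 2x2 matrices.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def inv2(M):
    """Exact inverse of a 2x2 matrix (raises if the determinant is zero)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def trace(M):
    """Sum of the diagonal entries."""
    return sum(M[i][i] for i in range(len(M)))


def det2(M):
    """Determinant of a 2x2 matrix."""
    (a, b), (c, d) = M
    return a * d - b * c


A = [[2, 1], [0, 3]]                 # illustrative matrix
P = [[1, 2], [3, 5]]                 # any invertible matrix works here
B = matmul(inv2(P), matmul(A, P))    # similar to A, with very different entries

assert trace(B) == trace(A)
assert det2(B) == det2(A)
```

Here `B` has no zero entries and a negative diagonal entry, yet its trace and determinant agree with those of `A`.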
42.12 Characteristic Polynomial Is Preserved
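Let
$$B = P^{-1} A P.$$
The characteristic polynomial of $B$ is
$$p_B(\lambda) = \det(\lambda I - B).$$
Substitute $B = P^{-1} A P$:
$$p_B(\lambda) = \det(\lambda I - P^{-1} A P).$$
Since
$$\lambda I = P^{-1} (\lambda I) P,$$
we have
$$\lambda I - P^{-1} A P = P^{-1} (\lambda I - A) P.$$
Therefore
$$p_B(\lambda) = \det\bigl(P^{-1} (\lambda I - A) P\bigr).$$
Using determinant multiplicativity,
$$p_B(\lambda) = \det(P^{-1}) \det(\lambda I - A) \det(P) = \det(\lambda I - A).$$
Hence
$$p_B(\lambda) = p_A(\lambda).$$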
So similar matrices have the same characteristic polynomial and the same eigenvalues.
42.13 Eigenvectors Under Similarity
Eigenvalues are preserved by similarity, but eigenvectors change coordinates.
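Suppose
$$B = P^{-1} A P.$$
Let
$$A v = \lambda v, \qquad v \neq 0.$$
Set
$$w = P^{-1} v.$$
Then
$$B w = P^{-1} A P P^{-1} v = P^{-1} A v = \lambda P^{-1} v = \lambda w.$$
Since $P^{-1}$ is invertible, $w \neq 0$. Thus $w$ is an eigenvector of $B$ with the same eigenvalue.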
The eigenvector has changed because the coordinate system has changed. The eigendirection as part of the abstract operator remains the same.
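
A quick numerical check makes the coordinate change of an eigenvector concrete. The sketch below uses assumed data: `A`, `P`, and the eigenpair `(lam, v)` are illustrative, `P_inv` and `B` were computed by hand from them, and `matvec` is a hypothetical helper.

```python
def matvec(M, x):
    """Multiply a matrix (list of rows) by a coordinate vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]


# Illustrative data (assumed for this sketch).
A = [[2, 1], [0, 3]]
P = [[1, 2], [3, 5]]
P_inv = [[-5, 2], [3, -1]]       # inverse of P, computed by hand
B = [[-7, -15], [6, 12]]         # P_inv * A * P, computed by hand

lam, v = 3, [1, 1]               # eigenpair of A: A v = 3 v
assert matvec(A, v) == [lam * vi for vi in v]

w = matvec(P_inv, v)             # the same eigendirection in new coordinates

# w is an eigenvector of B with the same eigenvalue.
assert matvec(B, w) == [lam * wi for wi in w]
```

The entries of `w` differ from those of `v`, but the eigenvalue is unchanged.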
42.14 Similarity and Powers
Similarity behaves well with powers.
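If
$$B = P^{-1} A P,$$
then
$$B^2 = (P^{-1} A P)(P^{-1} A P) = P^{-1} A^2 P.$$
By induction,
$$B^k = P^{-1} A^k P$$
for every integer $k \geq 0$.
If $A$ is invertible, the formula also holds for negative integers:
$$B^{-k} = P^{-1} A^{-k} P.$$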
Thus powers of similar matrices remain similar.
This matters in difference equations, Markov chains, iterative methods, and matrix functions.
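
One practical use of the power formula is computing $A^k$ through a diagonal matrix, where powers are taken entry by entry. The following is a minimal sketch, assuming an illustrative diagonalizable `A` with a hand-picked eigenvector matrix `P`; the helpers `matmul`, `inv2`, and `matpow` are hypothetical names.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def inv2(M):
    """Exact inverse of a 2x2 matrix (raises if the determinant is zero)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def matpow(M, k):
    """M**k for k >= 0 by repeated multiplication."""
    n = len(M)
    result = [[1 if i == j else 0 for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, M)
    return result


A = [[2, 1], [0, 3]]                 # illustrative matrix
P = [[1, 1], [0, 1]]                 # columns are eigenvectors of A
P_inv = inv2(P)
D = matmul(P_inv, matmul(A, P))      # diagonal, with entries 2 and 3

k = 5
D_k = [[D[0][0] ** k, 0], [0, D[1][1] ** k]]   # powers of a diagonal matrix
A_k_via_D = matmul(P, matmul(D_k, P_inv))      # A^k = P D^k P^{-1}

assert A_k_via_D == matpow(A, k)
```

The diagonal route needs only scalar powers, which is why diagonalization helps with difference equations and Markov chains.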
42.15 Similarity and Polynomials in a Matrix
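Let
$$f(t) = c_0 + c_1 t + \cdots + c_m t^m$$
be a polynomial. If
$$B = P^{-1} A P,$$
then
$$f(B) = P^{-1} f(A) P.$$
This follows from the power formula and linearity:
$$f(B) = c_0 I + c_1 B + \cdots + c_m B^m.$$
Substitute
$$B^k = P^{-1} A^k P.$$
Then
$$f(B) = c_0 P^{-1} I P + c_1 P^{-1} A P + \cdots + c_m P^{-1} A^m P.$$
So
$$f(B) = P^{-1} \left( c_0 I + c_1 A + \cdots + c_m A^m \right) P = P^{-1} f(A) P.$$
Consequently, polynomial identities are preserved under similarity. If
$$f(A) = 0,$$
then
$$f(B) = P^{-1} f(A) P = 0.$$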
This explains why the minimal polynomial is a similarity invariant.
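
The polynomial rule can be tested by evaluating a polynomial at two similar matrices. This sketch assumes illustrative matrices `A` and `P`; `poly_eval` is a hypothetical helper that uses Horner's rule, and the coefficient lists `g` and `f` are examples chosen here.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def inv2(M):
    """Exact inverse of a 2x2 matrix (raises if the determinant is zero)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def poly_eval(coeffs, M):
    """Evaluate c0*I + c1*M + ... + cm*M^m by Horner's rule.

    coeffs lists the coefficients from c0 up to cm.
    """
    n = len(M)
    result = [[0] * n for _ in range(n)]
    for c in reversed(coeffs):
        result = matmul(result, M)      # multiply the running value by M
        for i in range(n):
            result[i][i] += c           # then add c * I
    return result


A = [[2, 1], [0, 3]]                    # illustrative matrix
P = [[1, 2], [3, 5]]                    # illustrative invertible matrix
P_inv = inv2(P)
B = matmul(P_inv, matmul(A, P))

g = [1, 2, 0, 1]                        # g(t) = 1 + 2t + t^3
assert poly_eval(g, B) == matmul(P_inv, matmul(poly_eval(g, A), P))

f = [6, -5, 1]                          # f(t) = (t - 2)(t - 3)
zero = [[0, 0], [0, 0]]
assert poly_eval(f, A) == zero          # f annihilates A ...
assert poly_eval(f, B) == zero          # ... and therefore B as well
```

The polynomial `f` is the characteristic polynomial of this particular `A`, so the last two checks also illustrate that an annihilating polynomial carries over to every similar matrix.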
42.16 Similarity and Matrix Functions
Many functions of matrices are defined by power series or polynomial approximation. For such functions, similarity behaves naturally.
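For example, the matrix exponential satisfies
$$e^{B} = P^{-1} e^{A} P$$
whenever
$$B = P^{-1} A P.$$
Indeed,
$$e^{B} = \sum_{k=0}^{\infty} \frac{B^k}{k!}.$$
Using the power formula,
$$e^{B} = \sum_{k=0}^{\infty} \frac{P^{-1} A^k P}{k!} = P^{-1} \left( \sum_{k=0}^{\infty} \frac{A^k}{k!} \right) P = P^{-1} e^{A} P.$$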
Thus changing basis before computing a matrix function gives the same result as computing the function and then changing basis.
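
The exponential identity can be checked numerically with a truncated power series. This is a rough floating-point sketch, not a recommended way to compute matrix exponentials: `exp_series` is a hypothetical helper, the number of terms is an arbitrary choice that suffices for these small matrices, and `A`, `P`, `P_inv`, `B` are illustrative data with `P_inv` and `B` worked out by hand.

```python
import math


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def exp_series(M, terms=40):
    """Approximate e^M by the partial sum of M^k / k! for k < terms."""
    n = len(M)
    total = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in total]                 # current term M^k / k!
    for k in range(1, terms):
        term = [[v / k for v in row] for row in matmul(term, M)]
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total


# Illustrative data (assumed for this sketch).
A = [[2.0, 1.0], [0.0, 3.0]]
P = [[1.0, 1.0], [0.0, 1.0]]
P_inv = [[1.0, -1.0], [0.0, 1.0]]    # inverse of P, computed by hand
B = [[2.0, 0.0], [0.0, 3.0]]         # P_inv * A * P, computed by hand

lhs = exp_series(B)                                  # e^B
rhs = matmul(P_inv, matmul(exp_series(A), P))        # P^{-1} e^A P

assert all(math.isclose(lhs[i][j], rhs[i][j], rel_tol=1e-9, abs_tol=1e-9)
           for i in range(2) for j in range(2))
```

Because `B` is diagonal here, `lhs` is simply the diagonal matrix with entries $e^2$ and $e^3$, which shows how a change of basis can reduce a matrix function to scalar functions.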
42.17 Similarity Versus Equivalence
Similarity should be distinguished from matrix equivalence.
Two matrices $A$ and $B$ are equivalent if there are invertible matrices $P$ and $Q$ such that
$$B = Q^{-1} A P.$$
Equivalence allows different changes of basis in the domain and codomain. It applies to linear maps
$$T : V \to W$$
between possibly different spaces.
Similarity has the special form
$$B = P^{-1} A P.$$
It uses the same change of basis on both sides, because the domain and codomain are the same vector space.
Equivalence classifies linear maps by rank. Similarity classifies linear operators by deeper structure, including eigenvalues and canonical forms.
42.18 Similarity Versus Congruence
Similarity should also be distinguished from congruence.
A congruence transformation has the form
$$B = P^{T} A P$$
over the real numbers, or
$$B = P^{*} A P$$
over the complex numbers, where $P$ is invertible.
Congruence arises naturally for bilinear forms and quadratic forms. Similarity arises naturally for linear operators.
The difference matters. Similarity preserves eigenvalues. Congruence generally does not. Congruence preserves properties such as rank and inertia for symmetric forms over the real numbers.
Thus the correct transformation law depends on the object being represented.
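
The contrast between the two transformation laws shows up already for a diagonal scaling. In the sketch below, the symmetric matrix `A` and the scaling `P` are illustrative assumptions, and `matmul`, `inv2`, `transpose`, `trace`, and `det2` are hypothetical helpers.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def inv2(M):
    """Exact inverse of a 2x2 matrix (raises if the determinant is zero)."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[d / det, -b / det], [-c / det, a / det]]


def transpose(M):
    return [list(col) for col in zip(*M)]


def trace(M):
    return sum(M[i][i] for i in range(len(M)))


def det2(M):
    (a, b), (c, d) = M
    return a * d - b * c


A = [[2, 1], [1, 2]]      # symmetric, with eigenvalues 1 and 3
P = [[2, 0], [0, 1]]      # invertible but not orthogonal

similar = matmul(inv2(P), matmul(A, P))         # P^{-1} A P
congruent = matmul(transpose(P), matmul(A, P))  # P^T A P

# Similarity keeps trace and determinant, hence the eigenvalues of a 2x2 matrix.
assert trace(similar) == trace(A) and det2(similar) == det2(A)

# Congruence keeps symmetry but changes the trace, so the eigenvalues change.
assert congruent == transpose(congruent)
assert trace(congruent) != trace(A)
```

In this example the congruent matrix still has positive trace and positive determinant, so both of its eigenvalues are positive, in line with the preservation of inertia.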
42.19 Orthogonal Similarity
If the change-of-basis matrix $Q$ is orthogonal, then
$$Q^{-1} A Q = Q^{T} A Q$$
because
$$Q^{-1} = Q^{T}.$$
This is called orthogonal similarity.
Orthogonal similarity corresponds to changing from one orthonormal basis to another. It is especially important in numerical linear algebra because orthogonal transformations preserve lengths and are numerically stable.
For complex vector spaces, the analogous notion is unitary similarity:
$$B = U^{*} A U,$$
where $U$ is unitary.
Orthogonal and unitary similarities are more restrictive than general similarity, but they preserve additional metric structure.
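
An orthogonal matrix with rational entries allows an exact check of these claims. The sketch below assumes an illustrative rotation `Q` built from the 3-4-5 triangle and an illustrative symmetric `A`; `matmul`, `transpose`, and `squared_length` are hypothetical helpers.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]


def transpose(M):
    return [list(col) for col in zip(*M)]


def squared_length(column):
    """Squared Euclidean length of a column vector stored as an n x 1 matrix."""
    return sum(row[0] ** 2 for row in column)


# Rotation with cosine 3/5 and sine 4/5 (illustrative choice).
Q = [[Fraction(3, 5), Fraction(-4, 5)],
     [Fraction(4, 5), Fraction(3, 5)]]
Q_T = transpose(Q)

# Orthogonality: Q^T Q = I, so Q^T plays the role of Q^{-1}.
assert matmul(Q_T, Q) == [[1, 0], [0, 1]]

A = [[2, 1], [1, 2]]                  # illustrative symmetric matrix
B = matmul(Q_T, matmul(A, Q))         # orthogonal similarity Q^T A Q

assert B == transpose(B)              # symmetry is preserved

x = [[1], [2]]
assert squared_length(matmul(Q, x)) == squared_length(x)   # lengths are preserved
```

No matrix inversion is needed: the transpose does the job, which is one reason orthogonal similarities are convenient in numerical work.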
42.20 Canonical Forms
Similarity leads to canonical forms.
A canonical form is a distinguished representative of a similarity class. It gives a standard matrix that represents the operator as simply as possible.
Important canonical forms include:
| Form | Purpose |
|---|---|
| Diagonal form | Best case, basis of eigenvectors |
| Jordan form | Describes generalized eigenvectors |
| Rational canonical form | Works over arbitrary fields |
| Real canonical form | Handles complex eigenvalues over $\mathbb{R}$ |
| Schur form | Uses unitary similarity, useful numerically |
Not every matrix is diagonalizable. But every matrix over an algebraically closed field has a Jordan form. Every matrix over any field has a rational canonical form.
Canonical forms turn the classification of operators into the classification of similarity classes.
42.21 Example: Same Operator, Different Basis
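Let $T$ be the operator with standard matrix
$$A = \begin{bmatrix} 2 & 1 \\ 0 & 3 \end{bmatrix}.$$
Let the new basis be
$$\mathcal{C} = \left( \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 \\ 1 \end{bmatrix} \right).$$
The change-of-basis matrix from $\mathcal{C}$-coordinates to standard coordinates is
$$P = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}.$$
As computed earlier,
$$[T]_{\mathcal{C}} = P^{-1} A P = \begin{bmatrix} 2 & 0 \\ 0 & 3 \end{bmatrix}.$$
In the standard basis, the operator has an upper triangular matrix. In the basis $\mathcal{C}$, it is diagonal.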
The diagonal form shows that the new basis vectors are eigenvectors.
42.22 Nonexample
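The matrices
$$A = \begin{bmatrix} 1 & 0 \\ 0 & 2 \end{bmatrix}$$
and
$$B = \begin{bmatrix} 1 & 0 \\ 0 & 3 \end{bmatrix}$$
are not similar.
Their traces are different:
$$\operatorname{tr}(A) = 3, \qquad \operatorname{tr}(B) = 4.$$
Since trace is preserved under similarity, no invertible matrix $P$ can satisfy
$$B = P^{-1} A P.$$
Their determinants are also different:
$$\det(A) = 2, \qquad \det(B) = 3.$$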
Either invariant is enough to rule out similarity.
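
Comparing invariants gives a cheap test that can rule similarity out but never confirm it. The sketch below is one possible way to package that test for 2x2 matrices; the function name `ruled_out_by_invariants` and the sample matrices are assumptions made for this illustration.

```python
def trace(M):
    """Sum of the diagonal entries."""
    return sum(M[i][i] for i in range(len(M)))


def det2(M):
    """Determinant of a 2x2 matrix."""
    (a, b), (c, d) = M
    return a * d - b * c


def ruled_out_by_invariants(A, B):
    """Return True if trace or determinant already shows A and B are not similar."""
    return trace(A) != trace(B) or det2(A) != det2(B)


# Illustrative pair with different trace and determinant.
A = [[1, 0], [0, 2]]
B = [[1, 0], [0, 3]]
assert ruled_out_by_invariants(A, B)

# Matching invariants are necessary, not sufficient: the identity matrix is
# similar only to itself, yet this shear has the same trace and determinant.
I2 = [[1, 0], [0, 1]]
shear = [[1, 1], [0, 1]]
assert not ruled_out_by_invariants(I2, shear)
```

The second pair shows why finer invariants, such as the minimal polynomial or a canonical form, are needed to decide similarity in general.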
42.23 Similarity as Coordinate Independence
Similarity expresses coordinate independence.
A matrix often appears to be the primary object, but in many settings the primary object is the linear operator. The matrix is only the operator written in a basis.
When the basis changes, the matrix changes by
$$A \mapsto P^{-1} A P.$$
The quantities that survive this change are intrinsic. They belong to the operator itself.
This viewpoint explains why trace, determinant, eigenvalues, characteristic polynomial, minimal polynomial, rank, and canonical form are central. They do not depend on arbitrary coordinate choices.
42.24 Summary
Two square matrices $A$ and $B$ are similar if there exists an invertible matrix $P$ such that
$$B = P^{-1} A P.$$
Similarity is the matrix form of change of basis for a linear operator.
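If
$$A = [T]_{\mathcal{B}}$$
and $P$ converts coordinates in a new basis $\mathcal{C}$ to coordinates in the old basis $\mathcal{B}$, then
$$[T]_{\mathcal{C}} = P^{-1} A P.$$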
Similar matrices represent the same linear operator in different bases.
Similarity preserves rank, determinant, trace, characteristic polynomial, eigenvalues, minimal polynomial, and many other structural properties. It also controls diagonalization and canonical forms.
The main idea is simple: changing coordinates may change the entries of a matrix, but it does not change the operator.