Schur decomposition is a triangular form for square matrices obtained by an orthogonal or unitary change of basis. It is one of the main tools behind modern eigenvalue algorithms.
For a complex square matrix , the Schur decomposition has the form
where is unitary and is upper triangular. Equivalently,
The diagonal entries of are the eigenvalues of , because is triangular and similar to . Every complex square matrix has such a decomposition. For a real square matrix, the real Schur form uses an orthogonal matrix and a real quasi-triangular matrix with and diagonal blocks.
79.1 Unitary Similarity
Two square matrices and are similar if there is an invertible matrix such that
Similarity represents a change of basis. Similar matrices describe the same linear transformation in different coordinate systems. They have the same eigenvalues, determinant, trace, and characteristic polynomial.
Schur decomposition uses a special kind of similarity. It chooses to be unitary. A matrix is unitary if
For real matrices, the analogous condition is orthogonality:
Unitary and orthogonal changes of basis preserve lengths and angles. This makes Schur decomposition especially important in numerical computation. It transforms a matrix without introducing the instability that may come from an arbitrary ill-conditioned basis.
79.2 The Complex Schur Form
Let
The complex Schur theorem states that there exists a unitary matrix and an upper triangular matrix such that
Equivalently,
The matrix has the form
The diagonal entries
are the eigenvalues of , counted with algebraic multiplicity.
The entries above the diagonal describe additional coupling between Schur vectors. If is diagonal, the matrix is unitarily diagonalizable. If has nonzero entries above the diagonal, the matrix has a triangular representation but not necessarily an orthonormal eigenbasis.
79.3 Schur Vectors
The columns of are called Schur vectors.
Write
Because is unitary, these vectors form an orthonormal basis of . The equation
shows how acts on this basis.
The first column gives
Thus is an eigenvector corresponding to .
The second column gives
The third column gives
In general, lies in the span of
Thus the Schur basis exposes a nested sequence of invariant subspaces.
79.4 Invariant Subspaces
Let
Since is upper triangular, the action of maps each into itself:
Thus Schur decomposition produces a chain
Each subspace in this chain is invariant under .
This is an important structural fact. It says that even when a matrix lacks a full set of orthogonal eigenvectors, it still admits an orthonormal basis in which its action is triangular.
79.5 Proof Idea
The proof of Schur decomposition uses induction on the dimension.
Every complex square matrix has at least one eigenvalue. Let be an eigenvalue of , and choose a corresponding unit eigenvector . Extend to an orthonormal basis of . Let
be the unitary matrix whose columns are this basis.
Then
has the block form
The lower-left block is zero because . Now apply the same argument to the smaller matrix . Repeating this process produces an upper triangular matrix.
The key reason the argument works over is that every complex polynomial of positive degree has a root. Therefore every complex square matrix has an eigenvalue.
79.6 A Two by Two Example
Let
This matrix has characteristic polynomial
Thus
Expanding,
The only eigenvalue is
An eigenvector satisfies
Since
we may take
Choose an orthonormal vector perpendicular to :
Then
Compute
First,
Then
Thus
where
This is a Schur decomposition. The matrix is triangular but not diagonal in this orthonormal basis.
79.7 Normal Matrices
A complex matrix is normal if
Normal matrices include Hermitian, real symmetric, skew-Hermitian, unitary, and orthogonal matrices.
Schur decomposition gives a simple route to the spectral theorem for normal matrices. Suppose
If is normal, then is also normal, because unitary similarity preserves normality. But an upper triangular normal matrix must be diagonal. Therefore
Hence
Thus every normal matrix is unitarily diagonalizable. Schur decomposition therefore extends the spectral theorem: all complex square matrices can be unitarily triangularized, while normal matrices can be unitarily diagonalized.
79.8 Relation to Diagonalization
Diagonalization seeks
where is diagonal.
Schur decomposition seeks
where is upper triangular.
The differences are important.
| Feature | Diagonalization | Schur decomposition |
|---|---|---|
| Form | ||
| Middle factor | Diagonal | Upper triangular |
| Basis | Eigenbasis | Orthonormal Schur basis |
| Exists for every complex square matrix | No | Yes |
| Change of basis | Invertible | Unitary |
| Numerical behavior | May be ill-conditioned | Usually stable |
Diagonalization gives a simpler middle factor, but it may not exist. Schur decomposition always exists over , and it uses a well-conditioned change of basis.
79.9 Defective Matrices
A matrix is defective if it does not have enough linearly independent eigenvectors to be diagonalized.
For example,
has the only eigenvalue . Its eigenspace is one-dimensional, so it cannot be diagonalized.
However, it is already in Schur form:
Taking
we have
Thus Schur decomposition still applies to defective matrices. It replaces the impossible goal of diagonalization by the always possible goal of triangularization.
79.10 Real Schur Decomposition
For a real matrix , complex eigenvalues may occur. Since complex eigenvalues of a real matrix occur in conjugate pairs, a fully triangular real Schur form may require complex entries.
To stay within real arithmetic, one uses the real Schur decomposition:
where is real orthogonal and is real quasi-upper triangular.
The matrix has and blocks on its diagonal:
Each diagonal block is either or . A block corresponds to a real eigenvalue. A block corresponds to a pair of complex conjugate eigenvalues.
79.11 A Real Two by Two Rotation
Consider the real rotation matrix
Its characteristic polynomial is
so its eigenvalues are
Over , the matrix has a triangular Schur form with diagonal entries and . Over , it cannot be triangularized into a real upper triangular matrix, because a real triangular matrix has real diagonal entries.
The real Schur form keeps the block:
Here the whole matrix is one block representing the conjugate eigenvalue pair.
79.12 Ordering Eigenvalues
The eigenvalues on the diagonal of a Schur form can often be ordered. By applying additional unitary transformations, one can arrange selected eigenvalues in chosen positions along the diagonal.
This is useful when separating stable and unstable modes, slow and fast modes, or eigenvalues inside and outside a region of the complex plane.
For example, in numerical control theory, one may want eigenvalues with negative real part to appear in the leading block. Then the corresponding leading Schur vectors span an invariant subspace associated with stable dynamics.
The ordered Schur form is usually more useful than raw eigenvectors when eigenvalues are clustered or eigenvectors are poorly conditioned.
79.13 Schur Form and Eigenvalues
Since and are similar, they have the same characteristic polynomial:
Because is upper triangular,
Therefore the eigenvalues of are exactly the diagonal entries of :
This is one of the main reasons Schur decomposition is used in eigenvalue algorithms. It reduces eigenvalue extraction to reading the diagonal of a triangular or quasi-triangular matrix.
79.14 Schur Decomposition and the QR Algorithm
The QR algorithm computes eigenvalues by producing a sequence of orthogonally similar matrices.
Starting with
one computes
and then forms
Since
each step preserves the eigenvalues.
With shifts and practical refinements, the QR algorithm tends toward Schur form. For real matrices, it tends toward real Schur form. For complex matrices, it tends toward complex triangular Schur form. The Schur decomposition is therefore both a theoretical target and a practical endpoint for eigenvalue computation.
79.15 Matrix Functions from Schur Form
Schur decomposition helps define and compute functions of matrices.
Suppose
For a suitable function , define
Since is triangular, computing is often easier than computing directly. The diagonal entries of are
The entries above the diagonal are determined by recurrence relations involving divided differences or by specialized algorithms.
This approach is used for matrix exponentials, logarithms, square roots, and other matrix functions.
79.16 Schur Form and Stability
Schur decomposition is valuable because the factor is unitary. Unitary transformations preserve the Euclidean norm:
They also preserve the Frobenius norm:
This matters in floating point arithmetic. When a matrix is transformed by poorly conditioned eigenvector matrices, small perturbations can be amplified. Schur decomposition avoids this by using orthonormal bases.
Thus many numerical algorithms prefer Schur vectors over eigenvectors, especially when eigenvectors are nearly linearly dependent.
79.17 Comparison with Jordan Form
Jordan canonical form writes a complex matrix as
where is block diagonal with Jordan blocks.
Jordan form gives detailed algebraic structure, but it is numerically unstable. Small perturbations can change the sizes of Jordan blocks. The eigenvector matrix may also be ill-conditioned.
Schur decomposition writes
It gives less explicit algebraic information than Jordan form, but it is much more stable numerically.
| Feature | Jordan form | Schur form |
|---|---|---|
| Middle matrix | Jordan block matrix | Upper triangular matrix |
| Change of basis | General invertible matrix | Unitary matrix |
| Exists over | Yes | Yes |
| Numerically stable | No | Yes |
| Used in computation | Rarely | Commonly |
| Shows eigenvalues | Diagonal | Diagonal |
Jordan form is mainly theoretical. Schur form is both theoretical and computational.
79.18 Generalized Schur Decomposition
There is also a generalized Schur decomposition for a pair of square matrices and .
It has the form
where and are unitary, while and are upper triangular.
This decomposition is also called the QZ decomposition. It is used for generalized eigenvalue problems of the form
The generalized eigenvalues are obtained from diagonal ratios:
when .
79.19 Algorithmic Form
A full production algorithm for Schur decomposition is more involved than the elementary QR iteration. A typical dense eigenvalue pipeline has the following shape:
schur_decomposition(A):
reduce A to Hessenberg form H by orthogonal or unitary transformations
repeat until H is in Schur or quasi-Schur form:
choose shifts
perform implicit shifted QR steps
deflate converged eigenvalues or blocks
accumulate the orthogonal or unitary transformations into Q
return Q, TThe Hessenberg reduction is used because it preserves eigenvalues and greatly reduces the cost of subsequent QR iterations. A Hessenberg matrix is nearly triangular, having zeros below the first subdiagonal.
For symmetric or Hermitian matrices, this process simplifies further, because the Schur form becomes diagonal and the Hessenberg form becomes tridiagonal.
79.20 Summary
Schur decomposition expresses a square matrix by unitary or orthogonal similarity.
For a complex matrix,
where is unitary and is upper triangular.
For a real matrix,
where is orthogonal and is real quasi-upper triangular.
The diagonal entries of the complex Schur form are the eigenvalues of . In the real Schur form, real eigenvalues appear as blocks, while complex conjugate pairs appear as blocks.
Schur decomposition is less explicit than Jordan form but far more useful in computation. It is stable, always exists over the complex numbers, supports eigenvalue algorithms, exposes invariant subspaces, and provides a reliable basis for computing matrix functions.