Skip to content

Chapter 46. Norms and Metrics

A norm is a rule that assigns a length to each vector. A metric is a rule that assigns a distance to each pair of points. In linear algebra, these two ideas are closely related: a norm gives a distance by measuring the length of a difference,

d(u,v)=uv. d(u,v) = \|u-v\|.

Thus a normed vector space is both an algebraic object and a geometric object. The vector operations give addition and scalar multiplication. The norm and metric give length, distance, convergence, continuity, and approximation. A normed vector space is a vector space equipped with a norm, and the norm-induced distance makes it a metric space.

46.1 Norms

Let VV be a vector space over R\mathbb{R} or C\mathbb{C}. A norm on VV is a function

:VR \|\cdot\| : V \to \mathbb{R}

satisfying the following properties for all u,vVu,v \in V and all scalars cc.

First, nonnegativity:

v0. \|v\| \ge 0.

Second, positive definiteness:

v=0if and only ifv=0. \|v\| = 0 \quad \text{if and only if} \quad v = 0.

Third, absolute homogeneity:

cv=cv. \|cv\| = |c|\|v\|.

Fourth, the triangle inequality:

u+vu+v. \|u+v\| \le \|u\| + \|v\|.

These axioms abstract the basic behavior of length. Length is never negative. Only the zero vector has zero length. Scaling a vector scales its length by the absolute value of the scalar. The length of a sum is at most the sum of the lengths.

46.2 The Euclidean Norm

The standard norm on Rn\mathbb{R}^n is the Euclidean norm:

x2=x12+x22++xn2. \|x\|_2 = \sqrt{x_1^2+x_2^2+\cdots+x_n^2}.

For example, if

x=[34], x = \begin{bmatrix} 3 \\ 4 \end{bmatrix},

then

x2=32+42=5. \|x\|_2 = \sqrt{3^2+4^2}=5.

The Euclidean norm is the norm induced by the standard inner product:

x2=x,x. \|x\|_2 = \sqrt{\langle x,x\rangle}.

In R2\mathbb{R}^2 and R3\mathbb{R}^3, this is the usual geometric length. In higher dimensions, the same formula remains valid, although direct geometric visualization becomes harder.

46.3 Norms Induced by Inner Products

Every inner product defines a norm by

v=v,v. \|v\| = \sqrt{\langle v,v\rangle}.

This construction is fundamental. It shows that inner product spaces are normed vector spaces.

The induced norm satisfies the norm axioms. Positive definiteness follows from the positive definiteness of the inner product. Absolute homogeneity follows from

cv=cv,cv=c2v,v=cv. \|cv\| = \sqrt{\langle cv,cv\rangle} = \sqrt{|c|^2\langle v,v\rangle} = |c|\|v\|.

The triangle inequality follows from the Cauchy-Schwarz inequality.

However, not every norm comes from an inner product. A norm comes from an inner product precisely when it satisfies the parallelogram law:

u+v2+uv2=2u2+2v2. \|u+v\|^2+\|u-v\|^2 = 2\|u\|^2+2\|v\|^2.

This distinction matters. Inner product geometry has angles and orthogonality. General normed geometry has length and distance, but it may lack a compatible notion of angle.

46.4 The pp-Norms

For 1p<1 \le p < \infty, the pp-norm on Rn\mathbb{R}^n is

xp=(x1p+x2p++xnp)1/p. \|x\|_p = \left( |x_1|^p + |x_2|^p + \cdots + |x_n|^p \right)^{1/p}.

Important special cases include:

NormFormulaCommon name
p=1p=1(|x|_1=\sum_ix_i
p=2p=2x2=(ixi2)1/2\|x\|_2=(\sum_i x_i^2)^{1/2}Euclidean norm
p=p=\infty(|x|_\infty=\max_ix_i

The infinity norm is defined by

x=max1inxi. \|x\|_\infty = \max_{1 \le i \le n} |x_i|.

For example, if

x=[253], x = \begin{bmatrix} -2 \\ 5 \\ 3 \end{bmatrix},

then

x1=10,x2=38,x=5. \|x\|_1 = 10, \qquad \|x\|_2 = \sqrt{38}, \qquad \|x\|_\infty = 5.

Each norm measures size differently. The 11-norm measures total absolute magnitude. The 22-norm measures Euclidean length. The infinity norm measures the largest coordinate magnitude.

46.5 Unit Balls

The unit ball of a normed vector space is the set

B={xV:x1}. B = \{x \in V : \|x\| \le 1\}.

Different norms produce different unit balls.

In R2\mathbb{R}^2, the Euclidean unit ball is a disk. The 11-norm unit ball is a diamond:

x1+x21. |x_1| + |x_2| \le 1.

The infinity norm unit ball is a square:

max(x1,x2)1. \max(|x_1|,|x_2|) \le 1.

The shape of the unit ball determines the geometry of the norm. Smooth unit balls behave differently from polygonal unit balls. This affects optimization, approximation, sparsity, and numerical algorithms.

46.6 Metrics

A metric on a set XX is a function

d:X×XR d : X \times X \to \mathbb{R}

satisfying the following properties for all x,y,zXx,y,z \in X.

First, nonnegativity:

d(x,y)0. d(x,y) \ge 0.

Second, identity of indiscernibles:

d(x,y)=0if and only ifx=y. d(x,y)=0 \quad \text{if and only if} \quad x=y.

Third, symmetry:

d(x,y)=d(y,x). d(x,y)=d(y,x).

Fourth, the triangle inequality:

d(x,z)d(x,y)+d(y,z). d(x,z) \le d(x,y)+d(y,z).

A set equipped with a metric is called a metric space.

A metric measures distance. Unlike a norm, it does not require vector addition or scalar multiplication. Therefore metrics apply to more general objects: points, functions, sequences, graphs, probability distributions, and many other sets.

46.7 Metrics Induced by Norms

Every norm on a vector space induces a metric by

d(u,v)=uv. d(u,v)=\|u-v\|.

This metric is translation invariant:

d(u+a,v+a)=d(u,v). d(u+a,v+a)=d(u,v).

It is also homogeneous:

d(cu,cv)=cd(u,v). d(cu,cv)=|c|d(u,v).

These properties reflect the linear structure of the space. The distance between two vectors depends only on their difference.

For example, in Rn\mathbb{R}^n, the Euclidean metric is

d2(u,v)=uv2=(u1v1)2++(unvn)2. d_2(u,v) = \|u-v\|_2 = \sqrt{(u_1-v_1)^2+\cdots+(u_n-v_n)^2}.

The 11-metric is

d1(u,v)=uv1=i=1nuivi. d_1(u,v) = \|u-v\|_1 = \sum_{i=1}^n |u_i-v_i|.

The infinity metric is

d(u,v)=uv=maxiuivi. d_\infty(u,v) = \|u-v\|_\infty = \max_i |u_i-v_i|.

Norm-induced metrics are the main metrics used in finite-dimensional linear algebra.

46.8 Distance and Approximation

Norms are used to compare an exact object with an approximation.

If xx is the exact vector and x^\hat{x} is an approximation, the error vector is

e=xx^. e = x-\hat{x}.

The size of the error is measured by

e. \|e\|.

Different norms emphasize different error behavior.

The 22-norm measures root-sum-square error. The 11-norm measures total absolute error. The infinity norm measures worst-coordinate error.

For example, if

e=[0.010.030.02], e = \begin{bmatrix} 0.01 \\ -0.03 \\ 0.02 \end{bmatrix},

then

e1=0.06,e2=0.0014,e=0.03. \|e\|_1 = 0.06, \qquad \|e\|_2 = \sqrt{0.0014}, \qquad \|e\|_\infty = 0.03.

The choice of norm depends on the problem. If total error matters, use a sum-type norm. If worst-case error matters, use the infinity norm. If Euclidean geometry matters, use the 22-norm.

46.9 Convergence

A sequence of vectors (xk)(x_k) in a normed vector space converges to xx if

limkxkx=0. \lim_{k\to\infty}\|x_k-x\| = 0.

This means the distance from xkx_k to xx becomes arbitrarily small.

Convergence depends on the norm. In finite-dimensional spaces, all norms define the same convergent sequences. In infinite-dimensional spaces, different norms may produce different notions of convergence.

For example, in Rn\mathbb{R}^n, convergence in the Euclidean norm is equivalent to coordinatewise convergence. If

xk=[xk1xkn], x_k = \begin{bmatrix} x_{k1} \\ \vdots \\ x_{kn} \end{bmatrix},

then

xkx x_k \to x

if and only if

xkixi x_{ki} \to x_i

for every coordinate ii.

46.10 Cauchy Sequences and Completeness

A sequence (xk)(x_k) is Cauchy if for every ε>0\varepsilon>0, there exists an integer NN such that

xmxn<ε \|x_m-x_n\| < \varepsilon

whenever m,nNm,n \ge N.

A normed vector space is complete if every Cauchy sequence converges to a vector in the space.

A complete normed vector space is called a Banach space. This concept becomes central in functional analysis, numerical analysis, and differential equations. A Banach space is, by definition, a complete normed space with respect to the metric induced by its norm.

Finite-dimensional normed spaces over R\mathbb{R} or C\mathbb{C} are complete. Infinite-dimensional spaces require more care.

46.11 Equivalence of Norms in Finite Dimensions

On a finite-dimensional vector space, any two norms are equivalent.

This means that if a\|\cdot\|_a and b\|\cdot\|_b are two norms on the same finite-dimensional space, then there exist constants C1,C2>0C_1,C_2>0 such that

C1xaxbC2xa C_1\|x\|_a \le \|x\|_b \le C_2\|x\|_a

for all vectors xx.

Equivalence of norms means that all norms give the same topology in finite dimensions. They have the same convergent sequences, open sets, closed sets, and continuous functions.

This fact justifies flexible norm choice in many finite-dimensional arguments. One may use the norm that makes the proof or computation simplest.

46.12 Open and Closed Balls

Let (X,d)(X,d) be a metric space. The open ball centered at xx with radius r>0r>0 is

Br(x)={yX:d(x,y)<r}. B_r(x)=\{y \in X : d(x,y)<r\}.

The closed ball is

Br(x)={yX:d(x,y)r}. \overline{B}_r(x)=\{y \in X : d(x,y)\le r\}.

In a normed vector space, these become

Br(x)={yV:yx<r}, B_r(x)=\{y \in V : \|y-x\|<r\},

and

Br(x)={yV:yxr}. \overline{B}_r(x)=\{y \in V : \|y-x\|\le r\}.

Balls describe local neighborhoods. They are the basic objects used to define limits, continuity, compactness, and approximation.

46.13 Continuity

A function f:VWf : V \to W between normed vector spaces is continuous at xx if for every ε>0\varepsilon>0, there exists δ>0\delta>0 such that

yxV<δf(y)f(x)W<ε. \|y-x\|_V < \delta \quad \Longrightarrow \quad \|f(y)-f(x)\|_W < \varepsilon.

Continuity means small changes in the input produce small changes in the output.

Linear maps have a particularly simple continuity theory. A linear map T:VWT : V \to W is continuous if and only if there exists a constant C0C\ge 0 such that

T(v)WCvV \|T(v)\|_W \le C\|v\|_V

for all vVv \in V.

In finite-dimensional spaces, every linear map is continuous. In infinite-dimensional spaces, some linear maps may fail to be continuous.

46.14 Operator Norms

Let T:VWT : V \to W be a linear map between normed vector spaces. The operator norm of TT is

T=supv0T(v)WvV. \|T\| = \sup_{v\ne 0} \frac{\|T(v)\|_W}{\|v\|_V}.

Equivalently,

T=supvV=1T(v)W. \|T\| = \sup_{\|v\|_V=1}\|T(v)\|_W.

The operator norm measures the maximum amount by which TT can stretch a vector.

For a matrix AA, the operator norm depends on the vector norm chosen. With the Euclidean norm, the induced matrix norm is

A2=supx0Ax2x2. \|A\|_2 = \sup_{x\ne 0} \frac{\|Ax\|_2}{\|x\|_2}.

This is the largest singular value of AA.

Operator norms are essential in perturbation theory, numerical stability, convergence estimates, and matrix analysis.

46.15 Matrix Norms

A matrix norm is a norm on a vector space of matrices. Some matrix norms are induced by vector norms. Others are defined directly from entries.

The Frobenius norm is

AF=i=1mj=1naij2. \|A\|_F = \sqrt{ \sum_{i=1}^m\sum_{j=1}^n |a_{ij}|^2 }.

The induced 11-norm is the maximum absolute column sum:

A1=maxjiaij. \|A\|_1 = \max_j \sum_i |a_{ij}|.

The induced infinity norm is the maximum absolute row sum:

A=maxijaij. \|A\|_\infty = \max_i \sum_j |a_{ij}|.

These norms measure matrix size in different ways.

The Frobenius norm treats the matrix as a vector of entries. The induced norms measure worst-case amplification of vectors.

46.16 Lipschitz Bounds

A function f:VWf : V \to W is Lipschitz continuous if there exists a constant L0L\ge 0 such that

f(u)f(v)WLuvV \|f(u)-f(v)\|_W \le L\|u-v\|_V

for all u,vVu,v \in V.

For a linear map TT,

T(u)T(v)=T(uv)Tuv. \|T(u)-T(v)\| = \|T(u-v)\| \le \|T\|\|u-v\|.

Thus every bounded linear map is Lipschitz continuous, with Lipschitz constant T\|T\|.

This gives a direct interpretation of the operator norm: it is the best global Lipschitz constant of the linear map.

46.17 Relative Error

In numerical work, absolute error is often measured by

xx^. \|x-\hat{x}\|.

Relative error is measured by

xx^x, \frac{\|x-\hat{x}\|}{\|x\|},

assuming x0x \ne 0.

Relative error compares the error to the size of the exact object. It is more informative when values have different scales.

For example, an error of 10310^{-3} is small if the exact value has norm 10310^3, but large if the exact value has norm 10610^{-6}.

Norms therefore provide the language for quantitative accuracy.

46.18 Norms and Conditioning

The condition number of an invertible matrix AA, with respect to a chosen norm, is

κ(A)=AA1. \kappa(A)=\|A\|\|A^{-1}\|.

It measures how sensitive the solution of

Ax=b Ax=b

is to perturbations in bb and AA.

If κ(A)\kappa(A) is small, the system is well-conditioned. If κ(A)\kappa(A) is large, small input errors may cause large output errors.

The value of κ(A)\kappa(A) depends on the norm. With the Euclidean norm,

κ2(A)=σmax(A)σmin(A), \kappa_2(A) = \frac{\sigma_{\max}(A)}{\sigma_{\min}(A)},

where σmax(A)\sigma_{\max}(A) and σmin(A)\sigma_{\min}(A) are the largest and smallest singular values of AA.

Conditioning is one of the main reasons norms matter in numerical linear algebra.

46.19 Examples

For

x=[122], x = \begin{bmatrix} 1 \\ -2 \\ 2 \end{bmatrix},

the common vector norms are

x1=5, \|x\|_1 = 5, x2=3, \|x\|_2 = 3,

and

x=2. \|x\|_\infty = 2.

For

A=[1234], A = \begin{bmatrix} 1 & -2 \\ 3 & 4 \end{bmatrix},

the Frobenius norm is

AF=12+(2)2+32+42=30. \|A\|_F = \sqrt{1^2+(-2)^2+3^2+4^2} = \sqrt{30}.

The induced 11-norm is

A1=max{1+3, 2+4}=max{4,6}=6. \|A\|_1 = \max\{ |1|+|3|,\ |-2|+|4| \} = \max\{4,6\} = 6.

The induced infinity norm is

A=max{1+2, 3+4}=max{3,7}=7. \|A\|_\infty = \max\{ |1|+|-2|,\ |3|+|4| \} = \max\{3,7\} = 7.

These computations show that matrix size depends on the chosen norm.

46.20 Summary

A norm measures the length of a vector. A metric measures the distance between two points. Every norm induces a metric by

d(u,v)=uv. d(u,v)=\|u-v\|.

Norms support the study of convergence, continuity, approximation, error, conditioning, and stability. Inner products produce important norms, especially the Euclidean norm, but many useful norms do not come from inner products.

The choice of norm should match the structure of the problem. Euclidean norms fit geometric and spectral arguments. The 11-norm fits total magnitude and sparsity. The infinity norm fits worst-case control. Matrix and operator norms describe the size and amplification behavior of linear transformations.