Skip to content

Chapter 132. Convex Geometry

132.1 Introduction

Convex geometry studies convex sets, convex combinations, supporting hyperplanes, and geometric structures defined by linear inequalities.

A subset CC of a vector space is convex if every line segment joining two points of CC lies entirely inside CC.

Convex geometry is closely connected with linear algebra because convex sets are built from vectors, affine combinations, linear maps, and systems of inequalities. Many optimization problems, numerical methods, and machine learning algorithms rely fundamentally on convex structure.

Convexity appears naturally in:

AreaRole of convexity
Linear programmingFeasible regions
OptimizationConvex objective functions
Functional analysisConvex subsets of normed spaces
ProbabilityConvex combinations of distributions
Machine learningLoss minimization
Computational geometryConvex hull algorithms
EconomicsFeasible production and utility sets

Linear algebra provides the coordinate and transformation framework in which convex geometry operates.

132.2 Convex Sets

Let VV be a real vector space.

A subset

CV C\subseteq V

is convex if for every

x,yC x,y\in C

and every scalar

t[0,1], t\in[0,1],

the vector

tx+(1t)y tx+(1-t)y

also belongs to CC.

The expression

tx+(1t)y tx+(1-t)y

is called a convex combination of xx and yy.

Geometrically, convexity means that the entire line segment between any two points remains inside the set.

Examples of convex sets include:

SetConvex?
LineYes
PlaneYes
Sphere surfaceNo
DiskYes
CubeYes
TriangleYes
AnnulusNo

The difference between a disk and a circle is important. The disk contains all segments between its points. The circle boundary alone does not.

132.3 Convex Combinations

A convex combination of vectors

v1,,vn v_1,\ldots,v_n

is an expression

a1v1++anvn, a_1v_1+\cdots+a_nv_n,

where

ai0 a_i\ge0

and

a1++an=1. a_1+\cdots+a_n=1.

These coefficients are nonnegative and sum to one.

Convex combinations differ from ordinary linear combinations because the coefficients are restricted.

The condition

a1++an=1 a_1+\cdots+a_n=1

keeps the result inside the affine span of the vectors.

The condition

ai0 a_i\ge0

forces the point to lie “between” the vectors rather than extending arbitrarily outward.

For example, in R2\mathbb{R}^2, the convex combinations of three noncollinear points form a triangle.

132.4 Convex Hulls

The convex hull of a subset

SV S\subseteq V

is the smallest convex set containing SS.

It is written

conv(S). \operatorname{conv}(S).

Equivalently, it is the set of all finite convex combinations of points of SS:

conv(S)={i=1naivi:viS, ai0, ai=1}. \operatorname{conv}(S) = \left\{ \sum_{i=1}^n a_iv_i : v_i\in S,\ a_i\ge0,\ \sum a_i=1 \right\}.

For example:

Set SSConvex hull
Two pointsLine segment
Three noncollinear pointsTriangle
Four noncoplanar pointsTetrahedron
Circle boundaryDisk

The convex hull operation “fills in” missing interior regions.

Convex hulls are fundamental in computational geometry and optimization.

132.5 Affine Sets

An affine combination of vectors is an expression

a1v1++anvn, a_1v_1+\cdots+a_nv_n,

where

a1++an=1, a_1+\cdots+a_n=1,

but the coefficients may be negative.

An affine set is a subset closed under affine combinations.

Affine geometry generalizes linear geometry by allowing translations.

Examples include:

| Object | Linear? | Affine? | |—|—| | Subspace through origin | Yes | Yes | | Shifted line | No | Yes | | Shifted plane | No | Yes |

Affine sets are translated subspaces.

If WW is a subspace and vVv\in V, then

v+W={v+w:wW} v+W = \{v+w:w\in W\}

is an affine set.

Convex geometry naturally lives inside affine geometry.

132.6 Cones

A subset

KV K\subseteq V

is a cone if

xK,λ0 x\in K,\qquad \lambda\ge0

imply

λxK. \lambda x\in K.

A convex cone is both convex and a cone.

Thus:

x,yK,a,b0 x,y\in K, \qquad a,b\ge0

imply

ax+byK. ax+by\in K.

Examples include:

SetConvex cone?
Nonnegative orthantYes
Linear subspaceYes
Ray from originYes
SphereNo

The nonnegative orthant in Rn\mathbb{R}^n,

R0n, \mathbb{R}_{\ge0}^n,

is one of the most important convex cones.

Convex cones appear in optimization, duality theory, and matrix analysis.

132.7 Hyperplanes

A hyperplane in VRnV\cong\mathbb{R}^n is a set of the form

H={xRn:aTx=b}, H = \{x\in\mathbb{R}^n : a^Tx=b\},

where

a0. a\neq0.

The vector aa is normal to the hyperplane.

A hyperplane divides space into two half-spaces:

aTxb, a^Tx\le b,

and

aTxb. a^Tx\ge b.

Half-spaces are convex.

Hyperplanes are fundamental because many convex sets can be described as intersections of half-spaces.

132.8 Polyhedra

A polyhedron is a set defined by finitely many linear inequalities:

P={xRn:Axb}. P = \{x\in\mathbb{R}^n : Ax\le b\}.

The matrix AA and vector bb determine the bounding half-spaces.

Examples include:

ObjectPolyhedron?
CubeYes
SimplexYes
PolygonYes
BallNo

A bounded polyhedron is called a polytope.

Polytopes generalize polygons and polyhedra to arbitrary dimensions.

Convex optimization often studies feasible regions that are polyhedra.

132.9 Extreme Points

Let CC be a convex set.

A point xCx\in C is an extreme point if it cannot be written as a nontrivial convex combination of distinct points in CC.

That is, if

x=ty+(1t)z,0<t<1, x=ty+(1-t)z, \qquad 0<t<1,

with

y,zC, y,z\in C,

then

x=y=z. x=y=z.

For a polygon, the extreme points are the vertices.

For a disk, every boundary point is extreme.

Extreme points encode the “corners” of convex sets.

132.10 Supporting Hyperplanes

A hyperplane

H={x:aTx=b} H=\{x:a^Tx=b\}

supports a convex set CC if:

  1. CC lies entirely on one side of HH,
  2. HCH\cap C\neq\emptyset.

Geometrically, the hyperplane touches the convex set without cutting through it.

Supporting hyperplanes are central in optimization because optimal points often occur where a supporting hyperplane meets the feasible region.

132.11 Separation Theorems

One of the most important results in convex geometry is the hyperplane separation theorem.

Roughly stated:

If two convex sets are disjoint, then there exists a hyperplane separating them.

That means there exists a vector aa and scalar bb such that

aTxb a^Tx\le b

for all points in one set and

aTyb a^Ty\ge b

for all points in the other.

Separation theorems connect geometry with optimization and duality.

They explain why linear functionals can distinguish convex regions.

132.12 Carathéodory’s Theorem

Carathéodory’s theorem states:

If a point xRnx\in\mathbb{R}^n lies in the convex hull of a set SS, then xx can be written as a convex combination of at most n+1n+1 points of SS.

Thus in R2\mathbb{R}^2, every point in a convex hull can be represented using at most three points.

In R3\mathbb{R}^3, at most four points suffice.

This theorem gives a finite description of convex combinations in finite-dimensional spaces.

132.13 Helly’s Theorem

Helly’s theorem concerns intersections of convex sets.

If a finite collection of convex subsets of Rn\mathbb{R}^n has the property that every n+1n+1 of them intersect, then the entire collection intersects.

This theorem is remarkable because it reduces a global intersection condition to a finite local condition.

Helly’s theorem is fundamental in combinatorial convexity.

132.14 Radon’s Theorem

Radon’s theorem states:

Every set of n+2n+2 points in Rn\mathbb{R}^n can be partitioned into two disjoint subsets whose convex hulls intersect.

This theorem reveals an unavoidable overlap structure in sufficiently large point sets.

Radon’s theorem is closely related to Carathéodory’s theorem and Helly’s theorem.

Together they form the core of finite-dimensional convexity theory.

132.15 Convex Functions

A function

f:CR f:C\to\mathbb{R}

defined on a convex set CC is convex if

f(tx+(1t)y)tf(x)+(1t)f(y) f(tx+(1-t)y) \le tf(x)+(1-t)f(y)

for all

x,yC,t[0,1]. x,y\in C, \qquad t\in[0,1].

Geometrically, the graph of a convex function lies below the secant lines connecting its points.

Examples include:

FunctionConvex?
x2x^2Yes
exe^xYes
(x
x2-x^2No

Convex functions are important because local minima are automatically global minima.

132.16 Jensen’s Inequality

If ff is convex, then

f(i=1naixi)i=1naif(xi), f\left( \sum_{i=1}^n a_ix_i \right) \le \sum_{i=1}^n a_if(x_i),

whenever

ai0,ai=1. a_i\ge0, \qquad \sum a_i=1.

This inequality generalizes the defining property of convexity.

Jensen’s inequality appears throughout probability, statistics, information theory, and optimization.

132.17 Convex Optimization

A convex optimization problem minimizes a convex function over a convex feasible region.

The standard form is:

minimize f(x) \text{minimize } f(x)

subject to

xC, x\in C,

where CC is convex and ff is convex.

Convex optimization is important because:

PropertyConsequence
Local minimumGlobal minimum
Convex feasible regionStable geometry
Supporting hyperplanesDuality theory
Linear constraintsEfficient algorithms

Linear programming is a special case in which:

f(x)=cTx f(x)=c^Tx

and the feasible region is a polyhedron.

132.18 Duality

Convex geometry naturally leads to duality.

Given a convex set CC, its dual cone is

C={y:y,x0 for all xC}. C^* = \{y:\langle y,x\rangle\ge0 \text{ for all }x\in C\}.

Duality transforms geometric problems into algebraic inequalities.

In optimization, every primal problem has a dual problem. Under suitable conditions, solving one determines the other.

Duality theory is built from hyperplanes, convex cones, and linear functionals.

132.19 Norms and Convexity

Every norm defines a convex unit ball:

B={x:x1}. B = \{x:\|x\|\le1\}.

Conversely, many norms may be reconstructed from convex geometry.

Examples:

NormUnit ball
Euclidean normSphere
1\ell^1-normDiamond
\ell^\infty-normCube

Different norms correspond to different convex geometries.

This connection is fundamental in functional analysis and optimization.

132.20 Convex Geometry in Linear Algebra

Convex geometry and linear algebra interact continuously.

Linear algebra conceptConvex geometry interpretation
Linear combinationsAffine structure
Positive coefficientsConvex combinations
Systems of inequalitiesPolyhedra
Linear functionalsSupporting hyperplanes
Null spacesFeasible directions
EigenvaluesSpectral convexity
Positive semidefinite matricesConvex cones

Much of modern optimization is the study of convex geometry using linear algebraic tools.

132.21 Summary

Convex geometry studies sets and functions preserved under convex combinations.

The central ideas are:

ConceptMeaning
Convex setContains all line segments between its points
Convex combinationWeighted average with nonnegative coefficients
Convex hullSmallest convex set containing a given set
Affine setClosed under affine combinations
ConeClosed under nonnegative scaling
HyperplaneLinear boundary surface
PolyhedronIntersection of finitely many half-spaces
Extreme pointNondecomposable boundary point
Supporting hyperplaneBoundary hyperplane touching a convex set
Convex functionFunction below its secant lines
DualityCorrespondence between primal and dual structures

Convex geometry supplies the geometric foundation for optimization, numerical analysis, machine learning, and many parts of applied linear algebra.