11.15 Network Design

Many real-world optimization problems can be modeled as graphs.

11.15 Network Design

Problem

Many real-world optimization problems can be modeled as graphs. Cities become vertices. Roads become edges. Data centers become vertices. Network links become edges. Water treatment facilities, electrical substations, pipelines, rail systems, and communication towers can all be represented in the same way.

The challenge is rarely to connect everything. The challenge is to connect everything at an acceptable cost.

Consider a telecommunications company deploying fiber across a region. Every possible connection has a construction cost. Building every connection would be prohibitively expensive. Building too few connections leaves parts of the network disconnected.

The question becomes:

What is the cheapest network that keeps every required location connected?

This is precisely the problem solved by minimum spanning trees.

Solution

Model the infrastructure as a weighted graph:

  • Vertices represent locations.
  • Edges represent possible connections.
  • Edge weights represent costs.

Then compute a minimum spanning tree.

The resulting tree:

  • Connects all locations.
  • Uses exactly (V - 1) links.
  • Has minimum total construction cost.

The MST provides the cheapest connected backbone.

Example: Fiber Network

Suppose a provider must connect five cities.

Connection Cost
A-B 4
A-C 2
B-C 1
B-D 7
C-D 5
C-E 8
D-E 3

Possible graph:

      B
     / \\n    4   1
   /     \\n  A---2---C
           \\n            5
             \\n              D
               \\n                3
                 \\n                  E

Running Kruskal's algorithm selects:

B-C = 1
A-C = 2
D-E = 3
C-D = 5

Total cost:

1 + 2 + 3 + 5 = 11

Every city becomes connected.

No cheaper connected solution exists.

Why Trees Matter

Suppose the graph contains:

A-B-C-A

a cycle.

Removing the most expensive edge from that cycle preserves connectivity.

This observation is fundamental.

A network containing cycles often contains redundant cost.

The MST removes all unnecessary redundancy while preserving reachability.

This is why spanning trees appear naturally in infrastructure planning.

Backbone Networks

Many real systems separate infrastructure into two layers:

  1. Backbone network
  2. Redundant expansion network

The backbone is often designed using MST principles.

Example:

National backbone

Connect:

New York
Chicago
Dallas
Denver
Los Angeles

using minimum cost.

After the backbone exists, additional links can be added for fault tolerance.

The MST provides the starting point.

Cost Models

The edge weight depends on the domain.

Road Networks

Weight:

Construction cost

Fiber Networks

Weight:

Installation cost

or:

Distance

Electrical Grids

Weight:

Material + labor cost

Pipelines

Weight:

Excavation cost

Wireless Systems

Weight:

Signal degradation

The algorithm remains unchanged.

Only the interpretation of edge weights differs.

Geographic Networks

Consider locations with coordinates.

Site x y
A 0 0
B 2 1
C 5 2
D 7 4

A common approach:

  1. Compute pairwise distances.
  2. Build a weighted graph.
  3. Run an MST algorithm.

The resulting network minimizes total cable length.

This technique is widely used in geographic information systems.

Data Center Connectivity

Suppose a company operates:

Virginia
Ohio
Oregon
Frankfurt
Singapore
Tokyo

Every potential connection has a deployment cost.

Building all links:

O(V²)

connections.

Very expensive.

An MST identifies the minimum-cost set of links that still allows communication between all locations.

The result often becomes the initial deployment plan.

Utility Distribution

Electrical utilities frequently face a similar problem.

Substations:

S1
S2
S3
...
Sn

must be connected through transmission lines.

Costs depend on:

  • Distance
  • Terrain
  • Regulatory constraints
  • Environmental restrictions

The weighted graph captures these factors.

An MST minimizes the overall infrastructure cost.

When MST Is Not Enough

Real systems rarely stop at an MST.

Consider:

A --- B --- C

If:

B

fails, connectivity is lost.

An MST has no redundancy.

This leads to an important distinction:

Goal Solution
Cheapest connected network MST
Fault-tolerant network Additional links required
High-capacity network Flow optimization
Shortest routes Shortest path algorithms

An MST optimizes cost, not resilience.

Reliability Considerations

Suppose the MST contains:

A-B
B-C
C-D

A failure of:

B-C

splits the network.

Many production systems therefore:

  1. Build an MST.
  2. Identify critical links.
  3. Add redundancy.

The final network costs more than the MST but provides higher availability.

This pattern appears throughout engineering.

Network Expansion

Infrastructure often grows incrementally.

Initial deployment:

A
B
C
D

Later expansion:

E
F
G

A common strategy:

  1. Connect the original sites with an MST.
  2. Attach new sites using the cheapest available connection.
  3. Periodically recompute the global MST.

This balances optimality and operational simplicity.

Clustering Before Network Design

Large deployments frequently use clustering.

Example:

1000 sites

Instead of one massive graph:

  1. Cluster geographically.
  2. Build local MSTs.
  3. Connect clusters using a higher-level MST.

This hierarchical approach improves scalability.

Minimum Bottleneck Design

Sometimes the largest connection cost matters more than total cost.

Example:

Mountain crossing = $50M

Every other connection:

$1M to $5M

Even if the total cost remains low, the single expensive link may dominate budgeting.

In such cases:

  • Minimum bottleneck spanning trees
  • Maximum-spacing clustering
  • Specialized design constraints

may be more appropriate than a standard MST.

Sparse Versus Dense Networks

Network planning often begins with a dense graph:

Every site can connect to every other site

This produces:

$$ O(V^2) $$

edges.

After MST construction:

$$ V - 1 $$

edges remain.

The reduction can be dramatic.

Example:

1000 locations

Possible links:

499,500

MST links:

999

This illustrates the power of spanning tree optimization.

Multi-Criteria Design

Real deployments rarely optimize a single metric.

An edge may have:

Cost
Latency
Reliability
Bandwidth

Several approaches exist:

Weighted Combination

score =
    α × cost +
    β × latency

Constraint-Based

Minimize cost
subject to latency ≤ threshold

Multi-Objective Optimization

Generate several candidate networks.

Although MST algorithms remain useful, the problem moves beyond classical spanning trees.

Implementation Pattern

A typical network-design workflow:

Step 1

Build graph.

vertices = sites
edges = possible connections

Step 2

Assign weights.

distance
cost
latency

Step 3

Compute MST.

Kruskal
Prim
Borůvka

Step 4

Analyze critical links.

Step 5

Add redundancy if required.

Step 6

Deploy network.

This pattern appears in many industries.

Complexity

For a graph with:

$$ V $$

vertices and:

$$ E $$

possible connections:

Kruskal

$$ O(E \log E) $$

Prim

$$ O(E \log V) $$

Borůvka

$$ O(E \log V) $$

The MST phase is usually not the computational bottleneck. Building the graph and estimating edge weights often requires more engineering effort.

Common Mistakes

Treating MST as a Final Production Network

MST minimizes cost but provides no redundancy.

Ignoring Real Constraints

Construction restrictions may eliminate certain edges entirely.

Using Geographic Distance Alone

Real costs often depend on terrain, regulation, and labor.

Assuming MST Optimizes Everything

It optimizes total connection cost only.

Overlooking Expansion

Networks frequently grow after deployment.

A design should anticipate future additions.

Recipe

When designing a cost-efficient network:

  1. Model locations as vertices.
  2. Model candidate links as weighted edges.
  3. Choose weights that reflect real deployment costs.
  4. Compute an MST.
  5. Treat the MST as the minimum-cost backbone.
  6. Analyze critical links and failure points.
  7. Add redundancy where reliability requirements justify additional cost.

Minimum spanning trees are among the most practical graph algorithms because they directly model real infrastructure decisions. Whether the network carries electricity, water, vehicles, data, or communication signals, the same underlying question appears repeatedly: how can we connect everything while spending as little as possible? The MST provides the first answer, and often the most important one.