Skip to content

13.5 Proof Length and Complexity

Quantitative study of proofs, including proof size, efficiency, and connections to computational complexity.

Proof theory can ask not only whether a formula is provable, but also how hard it is to prove. This leads to the study of proof length and proof complexity.

A formal proof is a finite object, so it has a size. We may count the number of lines, the number of inference steps, the total number of symbols, or the size of the proof tree.

For a formula AA, we may ask:

What is the length of the shortest proof of A? \text{What is the length of the shortest proof of } A?

This question turns proof theory into a quantitative subject.

Measuring Proof Length

Different proof systems measure proof size in slightly different ways.

MeasureMeaning
Line countnumber of formulas in a sequence proof
Step countnumber of inference rule applications
Symbol counttotal number of symbols in the proof
Tree sizenumber of nodes in a proof tree
DAG sizesize after sharing repeated subproofs

For example, a proof written as a tree may duplicate the same subproof several times. If repeated subproofs are shared, the same proof may be represented as a directed acyclic graph, or DAG, and become much smaller.

So proof length depends partly on representation.

Shortest Proofs

Let SS be a proof system. The proof length of AA in SS may be written:

S(A) \ell_S(A)

This means the length of the shortest proof of AA in system SS.

If AA has no proof in SS, then S(A)\ell_S(A) is undefined or treated as infinite.

The main question is how S(A)\ell_S(A) grows as the formulas become larger.

Some formulas have short proofs. Others may be true but require very long proofs in a given system.

Comparing Proof Systems

Two proof systems may prove the same formulas but differ greatly in proof length.

Suppose systems S1S_1 and S2S_2 prove the same theorems. It may still happen that some formulas have short proofs in S1S_1 and only very long proofs in S2S_2.

We say that S1S_1 polynomially simulates S2S_2 if every proof in S2S_2 can be translated into a proof in S1S_1 with only polynomial growth in size.

Informally:

S1(A)p(S2(A)) \ell_{S_1}(A) \leq p(\ell_{S_2}(A))

for some polynomial pp.

If no such efficient translation exists, then one proof system may be genuinely stronger in proof efficiency.

Proof Complexity

Proof complexity studies the size of proofs, especially for propositional tautologies.

A propositional formula AA is a tautology if:

A \models A

A sound and complete proof system can prove every tautology:

AA \models A \quad \Longrightarrow \quad \vdash A

But completeness only says that a proof exists. It does not say the proof is short.

Proof complexity asks whether certain tautologies require large proofs in a given system.

Resolution

One important proof system in proof complexity is resolution.

Resolution works with clauses. A clause is a disjunction of literals, such as:

p¬qr p \lor ¬q \lor r

The resolution rule is:

ApB¬pAB \frac{A \lor p \qquad B \lor ¬p}{A \lor B}

The idea is that one clause says AA is true unless pp fails, and the other says BB is true unless pp holds. Resolving eliminates pp.

Resolution is central because it is closely related to SAT solving.

To prove that a CNF formula is unsatisfiable, resolution derives the empty clause:

\Box

The empty clause represents contradiction.

Example of Resolution

Suppose we have the clauses:

pq p \lor q

and:

¬pr ¬p \lor r

Resolving on pp gives:

qr q \lor r

because the complementary literals pp and ¬p¬p are removed.

Another example:

p p

and:

¬p ¬p

resolve to:

\Box

This means the clauses are jointly inconsistent.

Lower Bounds

A lower bound says that every proof of a certain kind must be large.

For example, one may show that a family of tautologies requires resolution proofs of exponential size.

The general form is:

S(An)2cn \ell_S(A_n) \geq 2^{cn}

for some constant c>0c>0.

This means that as nn grows, proofs in system SS become exponentially long.

Lower bounds are difficult but important. They show that some proof systems cannot efficiently prove certain valid formulas.

Connection with Complexity Theory

Proof complexity is closely related to computational complexity.

A proof system for propositional tautologies can be viewed as a certificate system. If a tautology has a short proof, then the proof can be checked efficiently.

This connects to the class NP\mathrm{NP}, where solutions can be verified efficiently.

The complement problem, propositional tautology, is related to coNP\mathrm{coNP}. A major open question is whether every tautology has a short proof in some reasonable proof system.

This is closely connected to:

NP=?coNP \mathrm{NP} \stackrel{?}{=} \mathrm{coNP}

If every tautology had polynomial-size proofs in a suitable proof system, that would imply a major collapse in complexity theory.

Automated Reasoning

Proof length matters in practice.

Automated theorem provers and SAT solvers do not merely need proofs to exist. They need proofs that can be found and checked within reasonable time and memory.

A formula may be valid, but if every proof is enormous, automated reasoning may fail in practice.

This is why proof systems are compared not only by correctness but also by proof efficiency.

Compression and Sharing

Large proofs often contain repeated subarguments. Sharing can compress them.

A tree proof may repeat a lemma many times. A DAG proof can store the lemma once and reuse it.

This reflects ordinary mathematics. We prove a lemma once, then cite it repeatedly.

Formal proof systems can model this using derived rules, lemmas, abbreviations, and proof terms.

Compression changes proof size, so any serious study of proof length must specify the representation carefully.

Why This Matters

Proof length measures the cost of reasoning.

Soundness tells us that proofs are reliable. Completeness tells us that all valid formulas are provable. Proof complexity asks whether those proofs are feasible.

This adds a third dimension to formal logic:

truthprovabilityproof size \text{truth} \qquad \text{provability} \qquad \text{proof size}

A mature theory of proof must account for all three.