Proof compression

In proof theory, an area of mathematical logic, proof compression is the problem of algorithmically compressing formal proofs. The developed algorithms can be used to improve the proofs generated by automated theorem proving tools such as SAT solvers, SMT-solvers, first-order theorem provers and proof assistants.

Problem Representation
In propositional logic a resolution proof of a clause $$\kappa$$ from a set of clauses C is a directed acyclic graph (DAG): the input nodes are axiom inferences (without premises) whose conclusions are elements of C, the resolvent nodes are resolution inferences, and the proof has a node with conclusion $$\kappa$$.

The DAG contains an edge from a node $$\eta_{1}$$ to a node $$\eta_{2}$$ if and only if a premise of $$\eta_{1}$$ is the conclusion of $$\eta_{2}$$. In this case, $$\eta_{1}$$ is a child of $$\eta_{2}$$, and $$\eta_{2}$$ is a parent of $$\eta_{1}$$. A node with no children is a root.

A proof compression algorithm will try to create a new DAG with fewer nodes that represents a valid proof of $$\kappa$$ or, in some cases, a valid proof of a subset of $$\kappa$$.

A simple example
Let's take a resolution proof for the clause $$\left\{ a,b,c\right\}$$ from the set of clauses


 * $$\left\{ \eta_{1}:\left\{ a,b,p\right\} ,\eta_{2}:\left\{ c,\neg p\right\} \right\} \quad

\frac{\eta_{1}: a,b,p \quad\quad \eta_{2}: c,\neg p} {\eta_{3}: a,b,c} p $$

Here we can see:
 * $$\eta_{1}$$ and $$\eta_{2}$$ are input nodes.
 * The node $$\eta_{3}$$ has a pivot $$p$$,
 * left resolved literal $$p$$
 * right resolved literal $$\neg p$$
 * $$\eta_{3}$$ conclusion is the clause $$\left\{ a,b,c\right\} $$
 * $$\eta_{3}$$ premises are the conclusion of nodes $$\eta_{1}$$ and $$\eta_{2}$$ (its parents)
 * The DAG would be

\begin{array}{ccc} \eta_{1} & & \eta_{2}\\ & \nwarrow\nearrow\\ & \eta_{3}\end{array} $$
 * $$\eta_{1}$$ and $$\eta_{2}$$ are parents of $$\eta_{3}$$
 * $$\eta_{3}$$ is a child of $$\eta_{1}$$ and $$\eta_{2}$$
 * $$\eta_{3}$$ is a root of the proof

A (resolution) refutation of C is a resolution proof of $$\bot$$ from C. It is a common given a node $$\eta$$, to refer to the clause $$\eta$$ or $$\eta$$’s clause meaning the conclusion clause of $$\eta$$, and (sub)proof $$\eta$$ meaning the (sub)proof having $$\eta$$ as its only root.

In some works can be found an algebraic representation of resolution inferences. The resolvent of $$\kappa_{1}$$ and $$\kappa_{2}$$ with pivot $$p$$ can be denoted as $$\kappa_{1}\odot_{p}\kappa_{2}$$. When the pivot is uniquely defined or irrelevant, we omit it and write simply $$\kappa_{1}\odot\kappa_{2}$$. In this way, the set of clauses can be seen as an algebra with a commutative operator; and terms in the corresponding term algebra denote resolution proofs in a notation style that is more compact and more convenient for describing resolution proofs than the usual graph notation.

In our last example the notation of the DAG would be $$\left\{ a,b,p\right\} \odot_{p}\left\{ c,\neg p\right\}$$ or simply $$\left\{ a,b,p\right\} \odot\left\{ c,\neg p\right\}. $$

We can identify $$\underbrace{\overbrace{\left\{ a,b,p\right\} }^{\eta_{1}}\odot\overbrace{\left\{ c,\neg p\right\} }^{\eta_{2}}}_{\eta_{3}} $$.

Compression algorithms
Algorithms for compression of sequent calculus proofs include cut introduction and cut elimination.

Algorithms for compression of propositional resolution proofs include RecycleUnits, RecyclePivots, RecyclePivotsWithIntersection, LowerUnits, LowerUnivalents, Split, Reduce&Reconstruct, and Subsumption.