Matroid partitioning

Matroid partitioning is a problem arising in the mathematical study of matroids and in the design and analysis of algorithms. Its goal is to partition the elements of a matroid into as few independent sets as possible. An example is the problem of computing the arboricity of an undirected graph, the minimum number of forests needed to cover all of its edges. Matroid partitioning may be solved in polynomial time, given an independence oracle for the matroid. It may be generalized to show that a matroid sum is itself a matroid, to provide an algorithm for computing ranks and independent sets in matroid sums, and to compute the largest common independent set in the intersection of two given matroids.

Example
The arboricity of an undirected graph is the minimum number of forests into which its edges can be partitioned, or equivalently (by adding overlapping edges to each forest as necessary) the minimum number of spanning forests whose union is the whole graph. A formula proved by Crispin Nash-Williams characterizes the arboricity exactly: it is the maximum, over all subgraphs $$H$$ of the given graph $$G$$ with at least two vertices, of the quantity $$\left\lceil\frac{|E(H)|}{|V(H)|-1}\right\rceil$$.

The forests of a graph form the independent sets of the associated graphic matroid, and the quantity $$|V(H)|-1$$ appearing in Nash-Williams' formula is the rank of the graphic matroid of $$H$$, the maximum size of one of its independent sets. Thus, the problem of determining the arboricity of a graph is exactly the matroid partitioning problem for the graphic matroid. The fact that the $$|E(H)|$$ elements of this matroid cannot be partitioned into fewer than $$\frac{|E(H)|}{|V(H)|-1}$$ independent subsets is then just an application of the pigeonhole principle saying that, if $$x$$ items are partitioned into sets of size at most $$y$$, then at least $$x/y$$ sets are needed. The harder direction of Nash-Williams' formula, which can be generalized to all matroids, is the proof that a partition of this size always exists.
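For very small graphs, Nash-Williams' formula can be evaluated directly by brute force. The following is a minimal sketch, not an efficient method: the function name `nash_williams_arboricity` and the edge-list representation are assumptions made for illustration. It only examines induced subgraphs, which suffices because, for a fixed vertex set, the induced subgraph has the most edges.

```python
from itertools import combinations

def nash_williams_arboricity(vertices, edges):
    """Evaluate max over subgraphs H of ceil(|E(H)| / (|V(H)| - 1)).

    Exponential time: intended only for tiny illustrative graphs.
    `edges` is a list of (u, v) pairs without self-loops.
    """
    vertices = list(vertices)
    best = 0
    for size in range(2, len(vertices) + 1):
        for subset in combinations(vertices, size):
            chosen = set(subset)
            # Count the edges with both endpoints inside the chosen vertex set.
            induced = sum(1 for u, v in edges if u in chosen and v in chosen)
            # Integer ceiling of induced / (size - 1).
            best = max(best, -(-induced // (size - 1)))
    return best

# The complete graph K4 has 6 edges on 4 vertices, so ceil(6 / 3) = 2.
k4_edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
k4_arboricity = nash_williams_arboricity(range(4), k4_edges)
```

For the complete graph on four vertices the maximum is attained by the whole graph, giving the value 2.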

Formula for partition size
To generalize Nash-Williams' formula, one may replace the graph $$G$$ by a matroid $$M$$, and the subgraph $$H$$ of $$G$$ by a restriction $$M|S$$ of $$M$$ to a subset $$S$$ of its elements. The number of edges of the subgraph $$H$$ becomes, in this generalization, the cardinality $$|S|$$ of the selected subset, and the formula $$|V(H)|-1$$ for the maximum size of a forest in $$H$$ becomes the rank $$r(S)$$. Thus, for a matroid $$M$$ without loops (so that every nonempty subset has positive rank), the minimum number of independent sets in a partition of $$M$$ should be given by the formula
 * $$k(M)=\max_{S\ne\emptyset} \left\lceil\frac{|S|}{r(S)}\right\rceil$$.

This formula is indeed valid, and it has an algorithmic proof. In other words, a matroid $$M$$ can be partitioned into at most $$k$$ independent subsets if and only if, for every subset $$S$$ of the elements of $$M$$, the cardinality of $$S$$ is at most $$k\cdot r(S)$$.
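The formula can be checked on small examples with nothing more than an independence oracle. The sketch below is illustrative only: the names `rank` and `partition_number`, and the convention that the oracle is a Python callable taking a list of elements, are assumptions rather than part of any standard library. The rank is computed greedily, which is valid for matroids because every maximal independent subset of a set has the same size.

```python
from itertools import combinations

def rank(subset, is_independent):
    """Size of a maximal independent subset of `subset`, found greedily."""
    basis = []
    for element in subset:
        if is_independent(basis + [element]):
            basis.append(element)
    return len(basis)

def partition_number(ground, is_independent):
    """Brute-force max over nonempty S of ceil(|S| / r(S)).

    Exponential time; raises ValueError if the matroid has a loop,
    since a loop belongs to no independent set.
    """
    ground = list(ground)
    best = 0
    for size in range(1, len(ground) + 1):
        for subset in combinations(ground, size):
            r = rank(subset, is_independent)
            if r == 0:
                raise ValueError("matroid contains a loop")
            best = max(best, -(-size // r))  # integer ceiling
    return best

# Uniform matroid U(2, 5): a set is independent when it has at most 2 elements.
uniform_value = partition_number(range(5), lambda s: len(s) <= 2)
```

For this uniform matroid the maximum is attained by the whole ground set, giving $$\lceil 5/2\rceil = 3$$.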

Algorithms
The first algorithm for matroid partitioning is an incremental augmenting-path algorithm. It considers the elements of the matroid one by one, in an arbitrary order, maintaining at each step an optimal partition for the elements that have been considered so far. At each step, when considering an element $$x$$ that has not yet been placed into a partition set, the algorithm takes as its nodes the elements that have already been partitioned, the new element $$x$$, and a special element $$\bot_i$$ for each of the $$k$$ independent sets in the current partition. It then forms a directed graph $$G_x$$ on this node set, with a directed arc $$\bot_i\rightarrow y$$ for each matroid element $$y$$ that can be added to partition set $$i$$ without causing it to become dependent, and with a directed arc $$z\rightarrow y$$ for each pair of matroid elements $$(y,z)$$ such that removing $$z$$ from its partition set and replacing it with $$y$$ forms another independent set.

Now there are two cases:


 * If this graph contains a directed path from an element $$\bot_i$$ to the newly considered element $$x$$, then the shortest such path (or more generally any path that does not have any shortcutting edges) describes a sequence of changes that can be made simultaneously to the partition sets in order to form a new partition, with the same number of sets, that also includes $$x$$. In this case, the algorithm performs these changes and continues.
 * If, on the other hand, no such path exists, then let $$S$$ consist of the matroid elements from which $$x$$ is reachable in $$G_x$$ (including $$x$$ itself). The part of each current partition set that lies in $$S$$ must be a maximal independent set in the restriction $$M|S$$, for if some element $$y$$ of $$S$$ could be added to partition set $$i$$ in the restriction, then either there would exist an arc $$\bot_i\rightarrow y$$ (if partition set $$i$$ is non-maximal in the full matroid $$M$$) or an arc $$z\rightarrow y$$ where $$z\notin S$$ (if the partition set is non-maximal in $$S$$ but maximal in the full matroid). In either case the existence of this arc contradicts the construction of the set $$S$$, and the contradiction proves that each partition set is maximal. Thus $$S$$ consists of $$x$$ together with $$r(S)$$ elements from each of the $$k$$ partition sets, and by the easy direction of the matroid partitioning formula, the number of sets needed to partition $$S$$ is at least


 * $$\left\lceil\frac{|S|}{r(S)}\right\rceil=\left\lceil\frac{kr(S)+1}{r(S)}\right\rceil=k+1$$,

so in this case the algorithm may find an optimal partition by placing $$x$$ into its own new independent set and leaving the other independent sets unchanged.

The overall algorithm, then, considers each element $$x$$ of the given matroid in turn, constructs the graph $$G_x$$, tests which nodes can reach $$x$$, and uses this information to update the current partition so that it includes $$x$$. At each step, the partition of the elements considered so far is optimal, so when the algorithm terminates it will have found an optimal partition for the whole matroid. Proving that this algorithm is correct requires showing that a shortcut-free path in the auxiliary graph always describes a sequence of operations that, when performed simultaneously, correctly preserves the independence of the sets in the partition; a proof of this fact was given by Edmonds. Because the algorithm only increases the number of sets in the partition when the matroid partitioning formula shows that a larger number is needed, the correctness of this algorithm also shows the correctness of the formula.
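The incremental procedure described above can be sketched in a few dozen lines. This is a minimal illustration under stated assumptions, not a tuned implementation: the oracle is assumed to be a Python callable taking a list of elements, elements are assumed hashable and distinct, and the names `matroid_partition`, `is_forest`, `entry` and `parent` are hypothetical. A breadth-first search started from all nodes $$\bot_i$$ at once finds a shortest path to $$x$$, which is in particular shortcut-free.

```python
from collections import deque

def matroid_partition(elements, is_independent):
    """Partition `elements` into independent sets using augmenting paths."""
    parts = []  # current partition: a list of disjoint independent sets
    for x in elements:
        if not is_independent([x]):
            raise ValueError(f"{x!r} is a loop and fits in no independent set")
        where = {y: j for j, part in enumerate(parts) for y in part}
        nodes = list(where) + [x]
        # Arcs bot_i -> y: y is not in set i and can be added to it directly.
        entry = {}
        for y in nodes:
            for i, part in enumerate(parts):
                if y not in part and is_independent(list(part) + [y]):
                    entry[y] = i
                    break
        # Breadth-first search from every bot_i simultaneously.
        parent = {y: None for y in entry}
        queue = deque(entry)
        while queue and x not in parent:
            z = queue.popleft()
            j = where[z]
            rest = [e for e in parts[j] if e != z]
            for y in nodes:
                # Arc z -> y: y can take the place of z in the set holding z.
                if y not in parent and y not in parts[j] and is_independent(rest + [y]):
                    parent[y] = z
                    queue.append(y)
        if x not in parent:
            parts.append({x})  # no path reaches x, so a new set is needed
            continue
        # Recover the path ending at x, then apply all of its changes.
        path = [x]
        while parent[path[-1]] is not None:
            path.append(parent[path[-1]])
        path.reverse()
        swaps = [(where[z], z, y) for z, y in zip(path, path[1:])]
        for j, z, y in swaps:
            parts[j].remove(z)
            parts[j].add(y)
        parts[entry[path[0]]].add(path[0])
    return parts

def is_forest(edges):
    """Independence oracle of a graphic matroid: True if `edges` has no cycle."""
    root = {}
    def find(v):
        root.setdefault(v, v)
        while root[v] != v:
            root[v] = root[root[v]]  # path halving
            v = root[v]
        return v
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        root[ru] = rv
    return True

# Partition the edges of the complete graph K4 into forests.
k4_edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
forests = matroid_partition(k4_edges, is_forest)
```

Each step of this sketch makes a number of oracle calls that is quadratic in the number of elements considered so far, which keeps the code short at the cost of speed. Passing a different oracle, such as `lambda s: len(s) <= 2` for a uniform matroid, reuses the same routine unchanged.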

Although this algorithm depends only on the existence of an independence oracle for its correctness, faster algorithms can be found in many cases by taking advantage of the more specialized structure of specific types of matroids (such as graphic matroids) from which a particular partitioning problem has been defined.

Related problems
A matroid sum $$\sum_i M_i$$ (where each $$M_i$$ is a matroid) is itself a matroid, having as its elements the union of the elements of the summands. A set is independent in the sum if it can be partitioned into sets, one for each summand, each of which is independent in its summand. The matroid partitioning algorithm generalizes to the problem of testing whether a set is independent in a matroid sum, and its correctness can be used to prove that a matroid sum is necessarily a matroid. An extended problem, also sometimes called matroid partition, is to find a largest set that is independent in the matroid sum, that is, a largest set that can be partitioned into sets that are independent in the respective input matroids. Cunningham presents an algorithm for solving this problem on $$O(n)$$ $$n$$-element matroids using $$O(n^{2.5})$$ calls to an independence oracle.
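The definition of independence in a matroid sum can be tested directly on small inputs by exhaustive search. The sketch below is a backtracking check of the definition only, with exponential running time; it is not the polynomial-time partitioning algorithm, and the name `independent_in_sum` and the list-of-callables interface are assumptions made for illustration. Pruning on partial assignments is safe because every subset of an independent set is independent.

```python
def independent_in_sum(subset, oracles):
    """Return True if `subset` splits into one independent set per oracle.

    `oracles` is a list of callables, one independence oracle per summand,
    each taking a list of elements.
    """
    subset = list(subset)
    parts = [[] for _ in oracles]

    def place(index):
        if index == len(subset):
            return True
        element = subset[index]
        for part, oracle in zip(parts, oracles):
            if oracle(part + [element]):
                part.append(element)
                if place(index + 1):
                    return True
                part.pop()  # undo and try the next summand
        return False

    return place(0)

# Two copies of the uniform matroid U(1, 3): each summand accepts at most
# one element, so the sum accepts exactly the sets of size at most two.
at_most_one = lambda s: len(s) <= 1
two_ok = independent_in_sum(["a", "b"], [at_most_one, at_most_one])
three_ok = independent_in_sum(["a", "b", "c"], [at_most_one, at_most_one])
```

In this example the pair is independent in the sum and the triple is not.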

The matroid intersection problem is that of finding a largest set that is independent in both of two matroids $$M_1$$ and $$M_2$$. It may be solved by turning it into an equivalent matroid sum problem: if $$B$$ is a basis of the sum $$M_1+M_2^*$$, where $$M_2^*$$ is the dual of $$M_2$$, then $$B$$ must have full rank in $$M_2^*$$, and removing a maximal independent set of $$M_2^*$$ from $$B$$ leaves a maximum common independent set of $$M_1$$ and $$M_2$$.

Matroid partitioning is a form of set cover problem, and the corresponding set packing problem (find a maximum number of disjoint spanning sets within a given matroid) is also of interest. It can be solved by algorithms similar to those for matroid partitioning. The fractional set packing and set covering problems associated with a matroid (that is, assign a weight to each independent set in such a way that for every element the total weight of the sets containing it is at most one or at least one, maximizing or minimizing the total weight of all the sets, respectively) can also be solved in polynomial time using matroid partitioning methods.

As well as its use in calculating the arboricity of a graph, matroid partitioning can be used with other matroids to find a subgraph of a given graph whose average degree is maximum, and to find the edge toughness of a graph (a variant of graph toughness involving the deletion of edges in place of vertices).

Matroid-constrained number partitioning is a different problem in which $$k$$ (the number of subsets in the partition) is fixed. There are $$k$$ different matroids over the same ground set, and the goal is to partition the ground set into $$k$$ subsets, such that each subset $$i$$ is an independent set in matroid $$i$$. Subject to this constraint, some objective function should be minimized. In a generalization of this variant, each of the $$k$$ matroids has a weight, and the objective function depends on the weights (maximum weight, minimum weight or sum of weights).