Butcher group

In mathematics, the Butcher group, named after the New Zealand mathematician John C. Butcher by, is an infinite-dimensional  Lie group first introduced in numerical analysis to study solutions of non-linear ordinary differential equations by the Runge–Kutta method. It arose from an algebraic formalism involving rooted trees that provides formal power series solutions of the differential equation modeling the flow of a vector field. It was, prompted by the work of Sylvester on change of variables in differential calculus, who first noted that the derivatives of a composition of functions can be conveniently expressed in terms of rooted trees and their combinatorics.

pointed out that the Butcher group is the group of characters of the Hopf algebra of rooted trees that had arisen independently in their own work on renormalization in quantum field theory and Connes' work with Moscovici on local index theorems. This Hopf algebra, often called the Connes–Kreimer algebra, is essentially equivalent to the Butcher group, since its dual can be identified with the universal enveloping algebra of the Lie algebra of the Butcher group. As they commented: We regard Butcher’s work on the classification of numerical integration methods as an impressive example that concrete problem-oriented work can lead to far-reaching conceptual results.

Differentials and rooted trees
A rooted tree is a graph with a distinguished node, called the root, in which every other node is connected to the root by a unique path. If the root of a tree t is removed and the nodes connected to the original node by a single bond are taken as new roots, the tree t breaks up into rooted trees t1, t2, ... Reversing this process a new tree t = [t1, t2, ...] can be constructed by joining the roots of the trees to a new common root. The number of nodes in a tree is denoted by |t|. A heap-ordering of a rooted tree t is an allocation of the numbers 1 through |t| to the nodes so that the numbers increase on any path going away from the root. Two heap orderings are equivalent, if there is an automorphism of rooted trees mapping one of them on the other. The number of equivalence classes of heap-orderings on a particular tree is denoted by α(t) and can be computed using the Butcher's formula:


 * $$\displaystyle \alpha(t)= {|t|!\over t! |S_t|},$$

where St denotes the symmetry group of t and the tree factorial is defined recursively by
 * $$[t_1,\dots,t_n]! = |[t_1,\dots,t_n]| \cdot t_1! \cdots t_n!$$

with the tree factorial of an isolated root defined to be 1


 * $$\bullet ! =1.$$

The ordinary differential equation for the flow of a vector field on an open subset U of RN can be written


 * $$\displaystyle {dx(s)\over ds} = f(x(s)),\,\, x(0)=x_0, $$

where x(s) takes values in U, f is a smooth function from U to RN and x0 is the starting point of the flow at time s = 0.

gave a method to compute the higher order derivatives x(m)(s) in terms of rooted trees. His formula can be conveniently expressed using the elementary differentials introduced by Butcher. These are defined inductively by


 * $$ \delta_\bullet^i= f^i, \,\,\, \delta^i_{[t_1,\dots,t_n]} = \sum_{j_1,\dots,j_n=1}^N (\delta^{j_1}_{t_1} \cdots \delta^{j_n}_{t_n})\partial_{j_1} \cdots \partial_{j_n} f^i.$$

With this notation


 * $$ {d^m x\over ds^m} = \sum_{|t|=m} \alpha(t) \delta_t,$$

giving the power series expansion


 * $$\displaystyle x(s) = x_0 + \sum_{t} {s^{|t|}\over |t|!} \alpha(t) \delta_t(0).$$

As an example when N = 1, so that x and f are real-valued functions of a single real variable, the formula yields


 * $$ x^{(4)} = f^{\prime\prime\prime}f^3 + 3 f^{\prime\prime}f^{\prime} f^2 + f^{\prime}f^{\prime\prime} f^2 +(f^\prime)^3 f,$$

where the four terms correspond to the four rooted trees from left to right in Figure 3 above.

In a single variable this formula is the same as Faà di Bruno's formula of 1855; however in several variables it has to be written more carefully in the form


 * $$ x^{(4)} = f^{\prime\prime\prime}(f,f,f) + 3f^{\prime\prime}(f,f^\prime(f))   +   f^\prime(f^{\prime\prime}(f,f))            +f^\prime(f^\prime(f^\prime(f))),$$

where the tree structure is crucial.

Definition using Hopf algebra of rooted trees
The Hopf algebra H of rooted trees was defined by in connection with Kreimer's previous work on renormalization in quantum field theory. It was later discovered that the Hopf algebra was the dual of a Hopf algebra defined earlier by in a different context. The characters of H, i.e. the homomorphisms of the underlying commutative algebra into R, form a group, called the Butcher group. It corresponds to the formal group structure discovered in numerical analysis by.

The Hopf algebra of rooted trees H is defined to be the polynomial ring in the variables t, where t runs through rooted trees.


 * Its comultiplication $$ \Delta:H\rightarrow H \otimes H$$ is defined by


 * $$\Delta(t) = t\otimes I + I \otimes t +\sum_{s\subset t} s\otimes [t\backslash s],$$

where the sum is over all proper rooted subtrees s of t; $$[t\backslash s]$$ is the monomial given by the product the variables ti formed by the rooted trees that arise on erasing all the nodes of s and connected links from t. The number of such trees is denoted by n(t\s).


 * Its counit is the homomorphism ε of H into R sending each variable t to zero.
 * Its antipode S can be defined recursively by the formula


 * $$ S(t) = -t - \sum_{s \subset t}(-1)^{n(t\backslash s)}S([t\backslash s])s, \,\,\, S(\bullet)= -\bullet.$$

The Butcher group is defined to be the set of algebra homomorphisms φ of H into R with group structure


 * $$\varphi_1 \star \varphi_2 (t)= (\varphi_1\otimes \varphi_2)\Delta(t).$$

The inverse in the Butcher group is given by


 * $$\varphi^{-1}(t)=\varphi(St)$$

and the identity by the counit ε.

Using complex coefficients in the construction of the Hopf algebra of rooted trees one obtains the complex Hopf algebra of rooted trees. Its C-valued characters form a group, called the complex Butcher group GC. The complex Butcher group GC is an infinite-dimensional complex Lie group which appears as a toy model in the of quantum field theories.

Butcher series and Runge–Kutta method
The non-linear ordinary differential equation


 * $$ {dx(s)\over ds} = f(x(s)),\,\,\, x(0)=x_0,$$

can be solved approximately by the Runge–Kutta method. This iterative scheme requires an m x m matrix


 * $$A=(a_{ij})$$

and a vector


 * $$b=(b_i)$$

with m components.

The scheme defines vectors xn by first finding a solution X1, ... , Xm of


 * $$ X_i= x_{n-1} + h \sum_{j=1}^m a_{ij} f(X_j)$$

and then setting


 * $$x_n=x_{n-1} +h \sum_{j=1}^m b_j f(x_j).$$

showed that the solution of the corresponding ordinary differential equations


 * $$ X_i(s)=x_0 + s\sum_{j=1}^m a_{ij} f(X_j(s)),\,\,\, x(s)=x_0 + s \sum_{j=1}^m b_jf(X_j(s))$$

has the power series expansion


 * $$ X_i(s) = x_0 +\sum_t {s^{|t|}\over |t|!} \alpha(t) t! \sum_{j=1}^m a_{ij} \varphi_j(t)\delta_t(0),\,\,\,\,x(s) = x_0 +

\sum_t {s^{|t|}\over |t|!} \alpha(t) t! \varphi(t)\delta_t(0), $$

where φj and φ are determined recursively by


 * $$\varphi_j(\bullet)=1.\,\,\, \varphi_i([t_1,\cdots,t_k])=\sum_{j_1,\dots,j_k} a_{ij_1}\dots a_{ij_k} \varphi_{j_1}(t_1)\dots \varphi_{j_k}(t_k)$$

and


 * $$\varphi(t) = \sum_{j=1}^m b_j \varphi_j(t).$$

The power series above are called B-series or Butcher series. The corresponding assignment φ is an element of the Butcher group. The homomorphism corresponding to the actual flow has


 * $$ \Phi(t)={1\over t!}.$$

Butcher showed that the Runge–Kutta method gives an nth order approximation of the actual flow provided that φ and Φ agree on all trees with n nodes or less. Moreover, showed that the homomorphisms defined by the Runge–Kutta method form a dense subgroup of the Butcher group: in fact he showed that, given a homomorphism  φ', there is a Runge–Kutta homomorphism φ agreeing with φ' to order n; and that if given homomorphims φ and φ' corresponding to Runge–Kutta data (A, b) and (A' , b' ), the product homomorphism $$\varphi\star \varphi^\prime$$ corresponds to the data


 * $$ \begin{pmatrix} A & 0\\ 0 & A^\prime\\ \end{pmatrix},\,\, (b,b^\prime).$$

proved that the Butcher group acts naturally on the functions f. Indeed, setting


 * $$\varphi\circ f= 1 +\sum_t {s^{|t|}\over |t|!} \alpha(t) t! \varphi(t)\delta_t(0),$$

they proved that


 * $$ \varphi_1\circ (\varphi_2\circ f) = (\varphi_1\star \varphi_2)\circ f.$$

Lie algebra
showed that associated with the Butcher group G is an infinite-dimensional Lie algebra. The existence of this Lie algebra is predicted by a theorem of : the commutativity and natural grading on H implies that the graded dual H* can be identified with the universal enveloping algebra of a Lie algebra $$\mathfrak{g}$$. Connes and Kreimer explicitly identify $$\mathfrak{g}$$ with a space of derivations θ of H into R, i.e. linear maps such that


 * $$\theta(ab)=\varepsilon(a)\theta(b) + \theta(a)\varepsilon(b),$$

the formal tangent space of G at the identity ε. This forms a Lie algebra with Lie bracket


 * $$[\theta_1,\theta_2](t)=(\theta_1 \otimes \theta_2 -\theta_2\otimes\theta_1)\Delta(t).$$

$$\mathfrak{g}$$ is generated by the derivations θt defined by


 * $$\theta_t(t^\prime)=\delta_{tt^\prime}, $$

for each rooted tree t.

The infinite-dimensional Lie algebra $$\mathfrak{g}$$ from and the Lie algebra L(G) of the Butcher group as an infinite-dimensional Lie group are not the same. The Lie algebra L(G) can be identified with the Lie algebra of all derivations in the dual of H (i.e. the space of all linear maps from H to R), whereas $$\mathfrak{g}$$ is obtained from the graded dual. Hence $$\mathfrak{g}$$ turns out to be a (strictly smaller) Lie subalgebra of L(G).

Renormalization
provided a general context for using Hopf algebraic methods to give a simple mathematical formulation of renormalization in quantum field theory. Renormalization was interpreted as Birkhoff factorization of loops in the character group of the associated Hopf algebra. The models considered by had Hopf algebra H and character group G, the Butcher group. has given an account of this renormalization process in terms of Runge–Kutta data.

In this simplified setting, a renormalizable model has two pieces of input data:
 * a set of Feynman rules given by an algebra homomorphism Φ of H into the algebra V of Laurent series in z with poles of finite order;
 * a renormalization scheme given by a linear operator R on V such that R satisfies the Rota–Baxter identity
 * $$R(fg) + R(f)R(g) = R(fR(g)) + R(R(f)g)$$
 * and the image of R – id lies in the algebra V+ of power series in z.

Note that R satisfies the Rota–Baxter identity if and only if id – R does. An important example is the minimal subtraction scheme


 * $$\displaystyle R(\sum_{n} a_n z^n )= \sum_{n< 0} a_n z^n.$$

In addition there is a projection P of H onto the augmentation ideal ker ε given by


 * $$\displaystyle P(x) = x -\varepsilon(x)1.$$

To define the renormalized Feynman rules, note that the antipode S satisfies


 * $$ m\circ (S\otimes {\rm id}) \Delta (x) =\varepsilon(x)1$$

so that


 * $$S = - m\circ (S\otimes P)\Delta,$$

The renormalized Feynman rules are given by a homomorphism $$\Phi_S^R$$ of H into V obtained by twisting the homomorphism Φ • S. The homomorphism $$\Phi_S^R$$ is uniquely specified by


 * $$\Phi_S^R = -m(S\otimes \Phi_S^R\circ P)\Delta.$$

Because of the precise form of Δ, this gives a recursive formula for $$\Phi_S^R$$.

For the minimal subtraction scheme, this process can be interpreted in terms of Birkhoff factorization in the complex Butcher group. Φ can be regarded as a map γ of the unit circle into the complexification GC of G (maps into C instead of R). As such it has a Birkhoff factorization


 * $$ \displaystyle \gamma(z)=\gamma_-(z)^{-1} \gamma_+(z),$$

where γ+ is holomorphic on the interior of the closed unit disk and γ– is holomorphic on its complement in the Riemann sphere C $$\cup\{\infty\}$$ with γ–(∞) = 1. The loop γ+ corresponds to the renormalized homomorphism. The evaluation at z = 0 of γ+ or the renormalized homomorphism gives the dimensionally regularized values for each rooted tree.

In example, the Feynman rules depend on additional parameter μ, a "unit of mass". showed that


 * $$\partial_\mu \gamma_{\mu-} =0,$$

so that γμ– is independent of μ.

The complex Butcher group comes with a natural one-parameter group λw of automorphisms, dual to that on H


 * $$\lambda_{w}(t)= w^{|t|}t$$

for w ≠ 0 in C.

The loops γμ and λw · γμ have the same negative part and, for t real,


 * $$\displaystyle F_t=\lim_{z=0} \gamma_-(z) \lambda_{tz}(\gamma_-(z)^{-1})$$

defines a one-parameter subgroup of the complex Butcher group GC called the renormalization group flow (RG).

Its infinitesimal generator β is an element of the Lie algebra of GC and is defined by


 * $$\beta=\partial_t F_t|_{t=0}.$$

It is called the beta-function of the model.

In any given model, there is usually a finite-dimensional space of complex coupling constants. The complex Butcher group acts by diffeomorphisms on this space. In particular the renormalization group defines a flow on the space of coupling constants, with the beta function giving the corresponding vector field.

More general models in quantum field theory require rooted trees to be replaced by Feynman diagrams with vertices decorated by symbols from a finite index set. Connes and Kreimer have also defined Hopf algebras in this setting and have shown how they can be used to systematize standard computations in renormalization theory.

Example
has given a "toy model" involving dimensional regularization for H and the algebra V. If c is a positive integer and qμ = q / μ is a dimensionless constant, Feynman rules can be defined recursively by


 * $$\displaystyle \Phi([t_1,\dots, t_n])=\int {\Phi(t_1)\cdots \Phi(t_n) \over |y|^2 + q_\mu^2} (|y|^2)^{-z({c\over 2} -1)} \, d^D y,$$

where z = 1 – D/2 is the regularization parameter. These integrals can be computed explicitly in terms of the Gamma function using the formula


 * $$\displaystyle \int {(|y|^2)^{-u}\over |y|^2 +q_\mu^2} \, d^Dy =  \pi^{D/2} (q_\mu^2)^{-z-u} {\Gamma(-u +D/2)\Gamma(1+u-D/2)\over \Gamma(D/2)}.$$

In particular


 * $$\displaystyle \Phi(\bullet)=\pi^{D/2}(q_\mu^2)^{-zc/2}{\Gamma(1+cz)\over cz}.$$

Taking the renormalization scheme R of minimal subtraction, the renormalized quantities $$\Phi_S^R(t)$$ are polynomials in $$\log q_\mu^2$$ when evaluated at z = 0.