Gauge theory (mathematics)

In mathematics, and especially differential geometry and mathematical physics, gauge theory is the general study of connections on vector bundles, principal bundles, and fibre bundles. Gauge theory in mathematics should not be confused with the closely related concept of a gauge theory in physics, which is a field theory which admits gauge symmetry. In mathematics theory means a mathematical theory, encapsulating the general study of a collection of concepts or phenomena, whereas in the physical sense a gauge theory is a mathematical model of some natural phenomenon.

Gauge theory in mathematics is typically concerned with the study of gauge-theoretic equations. These are differential equations involving connections on vector bundles or principal bundles, or involving sections of vector bundles, and so there are strong links between gauge theory and geometric analysis. These equations are often physically meaningful, corresponding to important concepts in quantum field theory or string theory, but also have important mathematical significance. For example, the Yang–Mills equations are a system of partial differential equations for a connection on a principal bundle, and in physics solutions to these equations correspond to vacuum solutions to the equations of motion for a classical field theory, particles known as instantons.

Gauge theory has found uses in constructing new invariants of smooth manifolds, the construction of exotic geometric structures such as hyperkähler manifolds, as well as giving alternative descriptions of important structures in algebraic geometry such as moduli spaces of vector bundles and coherent sheaves.

History
Gauge theory has its origins as far back as the formulation of Maxwell's equations describing classical electromagnetism, which may be phrased as a gauge theory with structure group the circle group. Work of Paul Dirac on magnetic monopoles and relativistic quantum mechanics encouraged the idea that bundles and connections were the correct way of phrasing many problems in quantum mechanics. Gauge theory in mathematical physics arose as a significant field of study with the seminal work of Robert Mills and Chen-Ning Yang on so-called Yang–Mills gauge theory, which is now the fundamental model that underpins the standard model of particle physics.

The mathematical investigation of gauge theory has its origins in the work of Michael Atiyah, Isadore Singer, and Nigel Hitchin on the self-duality equations on a Riemannian manifold in four dimensions. In this work the moduli space of self-dual connections (instantons) on Euclidean space was studied, and shown to be of dimension $$8k-3$$ where $$k$$ is a positive integer parameter. This linked up with the discovery by physicists of BPST instantons, vacuum solutions to the Yang–Mills equations in four dimensions with $$k=1$$. Such instantons are defined by a choice of 5 parameters, the center $$z\in \mathbb{R}^4$$ and scale $$\rho \in \mathbb{R}_{>0}$$, corresponding to the $$8-3=5$$-dimensional moduli space. A BPST instanton is depicted to the right.

Around the same time Atiyah and Richard Ward discovered links between solutions to the self-duality equations and algebraic bundles over the complex projective space $$\mathbb{CP}^3$$. Another significant early discovery was the development of the ADHM construction by Atiyah, Vladimir Drinfeld, Hitchin, and Yuri Manin. This construction allowed for the solution to the anti-self-duality equations on Euclidean space $$\mathbb{R}^4$$ from purely linear algebraic data.

Significant breakthroughs encouraging the development of mathematical gauge theory occurred in the early 1980s. At this time the important work of Atiyah and Raoul Bott about the Yang–Mills equations over Riemann surfaces showed that gauge theoretic problems could give rise to interesting geometric structures, spurring the development of infinite-dimensional moment maps, equivariant Morse theory, and relations between gauge theory and algebraic geometry. Important analytical tools in geometric analysis were developed at this time by Karen Uhlenbeck, who studied the analytical properties of connections and curvature proving important compactness results. The most significant advancements in the field occurred due to the work of Simon Donaldson and Edward Witten.

Donaldson used a combination of algebraic geometry and geometric analysis techniques to construct new invariants of four manifolds, now known as Donaldson invariants. With these invariants, novel results such as the existence of topological manifolds admitting no smooth structures, or the existence of many distinct smooth structures on the Euclidean space $$\mathbb{R}^4$$ could be proved. For this work Donaldson was awarded the Fields Medal in 1986.

Witten similarly observed the power of gauge theory to describe topological invariants, by relating quantities arising from Chern–Simons theory in three dimensions to the Jones polynomial, an invariant of knots. This work and the discovery of Donaldson invariants, as well as novel work of Andreas Floer on Floer homology, inspired the study of topological quantum field theory.

After the discovery of the power of gauge theory to define invariants of manifolds, the field of mathematical gauge theory expanded in popularity. Further invariants were discovered, such as Seiberg–Witten invariants and Vafa–Witten invariants. Strong links to algebraic geometry were realised by the work of Donaldson, Uhlenbeck, and Shing-Tung Yau on the Kobayashi–Hitchin correspondence relating Yang–Mills connections to stable vector bundles. Work of Nigel Hitchin and Carlos Simpson on Higgs bundles demonstrated that moduli spaces arising out of gauge theory could have exotic geometric structures such as that of hyperkähler manifolds, as well as links to integrable systems through the Hitchin system. Links to string theory and Mirror symmetry were realised, where gauge theory is essential to phrasing the homological mirror symmetry conjecture and the AdS/CFT correspondence.

Fundamental objects of interest
The fundamental objects of interest in gauge theory are connections on vector bundles and principal bundles. In this section we briefly recall these constructions, and refer to the main articles on them for details. The structures described here are standard within the differential geometry literature, and an introduction to the topic from a gauge-theoretic perspective can be found in the book of Donaldson and Peter Kronheimer.

Principal bundles
The central objects of study in gauge theory are principal bundles and vector bundles. The choice of which to study is essentially arbitrary, as one may pass between them, but principal bundles are the natural objects from the physical perspective to describe gauge fields, and mathematically they more elegantly encode the corresponding theory of connections and curvature for vector bundles associated to them.

A principal bundle with structure group $$G$$, or a principal $$G$$-bundle, consists of a quintuple $$(P,X,\pi, G, \rho)$$ where $$\pi: P \to X$$ is a smooth fibre bundle with fibre space isomorphic to a Lie group $$G$$, and $$\rho$$ represents a free and transitive right group action of $$G$$ on $$P$$ which preserves the fibres, in the sense that for all $$p\in P$$, $$\pi(pg) = \pi(p)$$ for all $$g\in G$$. Here $$P$$ is the total space, and $$X$$ the base space. Using the right group action for each $$x\in X$$ and any choice of $p\in P_x$, the map $$g \mapsto pg$$ defines a diffeomorphism $$P_x \cong G$$ between the fibre over $$x$$ and the Lie group $$G$$ as smooth manifolds. Note however there is no natural way of equipping the fibres of $$P$$ with the structure of Lie groups, as there is no natural choice of element $$p\in P_x$$ for every $$x\in X$$.

The simplest examples of principal bundles are given when $$G=\operatorname{U}(1)$$ is the circle group. In this case the principal bundle has dimension $$\dim P = n + 1$$ where $$\dim X = n$$. Another natural example occurs when $$P=\mathcal{F}(TX)$$ is the frame bundle of the tangent bundle of the manifold $$X$$, or more generally the frame bundle of a vector bundle over $$X$$. In this case the fibre of $$P$$ is given by the general linear group $$\operatorname{GL}(n, \mathbb{R})$$.

Since a principal bundle is a fibre bundle, it locally has the structure of a product. That is, there exists an open covering $$\{U_{\alpha}\}$$ of $$X$$ and diffeomorphisms $$\varphi_\alpha: P_{U_\alpha} \to U_\alpha \times G$$ commuting with the projections $$\pi$$ and $$\operatorname{pr}_1$$, such that the transition functions $$g_{\alpha\beta} :U_{\alpha}\cap U_{\beta} \to G$$ defined by $$\varphi_\alpha \circ \varphi_{\beta}^{-1} (x,g) = (x, g_{\alpha\beta}(x) g)$$ satisfy the cocycle condition
 * $$g_{\alpha\beta}(x) g_{\beta\gamma}(x) = g_{\alpha\gamma}(x)$$

on any triple overlap $$U_{\alpha}\cap U_{\beta}\cap U_\gamma$$. In order to define a principal bundle it is enough to specify such a choice of transition functions, The bundle is then defined by gluing trivial bundles $$U_\alpha\times G$$ along the intersections $$U_\alpha\cap U_\beta$$ using the transition functions. The cocycle condition ensures precisely that this defines an equivalence relation on the disjoint union $$\bigsqcup_\alpha U_\alpha \times G$$ and therefore that the quotient space $$P=\bigsqcup_\alpha U_\alpha \times G/{\sim}$$ is well-defined. This is known as the fibre bundle construction theorem and the same process works for any fibre bundle described by transition functions, not just principal bundles or vector bundles.

Notice that a choice of local section $$s_\alpha: U_\alpha \to P_{U_\alpha}$$ satisfying $$\pi \circ s_\alpha = \operatorname{Id}$$ is an equivalent method of specifying a local trivialisation map. Namely, one can define $$\varphi_\alpha(p) = (\pi(p), \tilde s_\alpha(p))$$ where $$\tilde s_\alpha(p)\in G$$ is the unique group element such that $$p\tilde s_\alpha(p)^{-1} =s_\alpha(\pi(p))$$.

Vector bundles


A vector bundle is a triple $$(E, X, \pi)$$ where $$\pi: E\to X$$ is a fibre bundle with fibre given by a vector space $$\mathbb{K}^r$$ where $$\mathbb{K}=\mathbb{R}, \mathbb{C}$$ is a field. The number $$r$$ is the rank of the vector bundle. Again one has a local description of a vector bundle in terms of a trivialising open cover. If $$\{U_{\alpha}\}$$ is such a cover, then under the isomorphism
 * $$\varphi_{\alpha}: E_{U_{\alpha}} \to U_{\alpha} \times \mathbb{K}^r$$

one obtains $$r$$ distinguished local sections of $$E$$ corresponding to the $$r$$ coordinate basis vectors $$e_1,\dots,e_r$$ of $$\mathbb{K}^r$$, denoted $$\boldsymbol{e}_1,\dots,\boldsymbol{e}_r$$. These are defined by the equation
 * $$\varphi_{\alpha} (\boldsymbol{e}_i (x)) = (x, e_i).$$

To specify a trivialisation it is therefore equivalent to give a collection of $$r$$ local sections which are everywhere linearly independent, and use this expression to define the corresponding isomorphism. Such a collection of local sections is called a frame.

Similarly to principal bundles, one obtains transition functions $$g_{\alpha\beta}: U_{\alpha}\cap U_{\beta} \to \operatorname{GL}(r, \mathbb{K})$$ for a vector bundle, defined by
 * $$\varphi_{\alpha} \circ \varphi_{\beta}^{-1} (x, v) = (x, g_{\alpha\beta}(x) v).$$

If one takes these transition functions and uses them to construct the local trivialisation for a principal bundle with fibre equal to the structure group $$\operatorname{GL}(r, \mathbb{K})$$, one obtains exactly the frame bundle of $$E$$, a principal $$\operatorname{GL}(r, \mathbb{K})$$-bundle.

Associated bundles
Given a principal $$G$$-bundle $$P$$ and a representation $$\rho$$ of $$G$$ on a vector space $$V$$, one can construct an associated vector bundle $$E=P\times_{\rho} V$$ with fibre the vector space $$V$$. To define this vector bundle, one considers the right action on the product $$P\times V$$ defined by $$(p,v)g = (pg, \rho(g^{-1})v)$$ and defines $$P\times_{\rho} V = (P\times V)/G$$ as the quotient space with respect to this action.

In terms of transition functions the associated bundle can be understood more simply. If the principal bundle $$P$$ has transition functions $$g_{\alpha\beta}$$ with respect to a local trivialisation $$\{U_{\alpha}\}$$, then one constructs the associated vector bundle using the transition functions $$\rho \circ g_{\alpha\beta}: U_{\alpha}\cap U_{\beta} \to \operatorname{GL}(V)$$.

The associated bundle construction can be performed for any fibre space $$F$$, not just a vector space, provided $$\rho: G\to \operatorname{Aut}(F)$$ is a group homomorphism. One key example is the capital A adjoint bundle $$\operatorname{Ad}(P)$$ with fibre $$G$$, constructed using the group homomorphism $$\rho: G \to \operatorname{Aut}(G)$$ defined by conjugation $$g \mapsto (h \mapsto g h g^{-1})$$. Note that despite having fibre $$G$$, the Adjoint bundle is neither a principal bundle, or isomorphic as a fibre bundle to $$P$$ itself. For example, if $$G$$ is Abelian, then the conjugation action is trivial and $$\operatorname{Ad}(P)$$ will be the trivial $$G$$-fibre bundle over $$X$$ regardless of whether or not $$P$$ is trivial as a fibre bundle. Another key example is the lowercase a adjoint bundle $$\operatorname{ad}(P)$$ constructed using the adjoint representation $$\rho: G \to \operatorname{Aut}(\mathfrak{g})$$ where $$\mathfrak{g}$$ is the Lie algebra of $$G$$.

Gauge transformations
A gauge transformation of a vector bundle or principal bundle is an automorphism of this object. For a principal bundle, a gauge transformation consists of a diffeomorphism $$\varphi: P \to P$$ commuting with the projection operator $$\pi$$ and the right action $$\rho$$. For a vector bundle a gauge transformation is similarly defined by a diffeomorphism $$\varphi: E \to E$$ commuting with the projection operator $$\pi$$ which is a linear isomorphism of vector spaces on each fibre.

The gauge transformations (of $$P$$ or $$E$$) form a group under composition, called the gauge group, typically denoted $$\mathcal{G}$$. This group can be characterised as the space of global sections $$\mathcal{G} = \Gamma(\operatorname{Ad}(P))$$ of the adjoint bundle, or $$\mathcal{G} = \Gamma(\operatorname{Ad}(\mathcal{F} (E)))$$ in the case of a vector bundle, where $$\mathcal{F}(E)$$ denotes the frame bundle.

One can also define a local gauge transformation as a local bundle isomorphism over a trivialising open subset $$U_{\alpha}$$. This can be uniquely specified as a map $$g_{\alpha} : U_{\alpha} \to G$$ (taking $$G=\operatorname{GL}(r, \mathbb{K})$$ in the case of vector bundles), where the induced bundle isomorphism is defined by
 * $$\varphi_{\alpha}(p) = pg_{\alpha}(\pi(p))$$

and similarly for vector bundles.

Notice that given two local trivialisations of a principal bundle over the same open subset $$U_{\alpha}$$, the transition function is precisely a local gauge transformation $$g_{\alpha\alpha}: U_{\alpha} \to G$$. That is, local gauge transformations are changes of local trivialisation for principal bundles or vector bundles.

Connections on principal bundles
A connection on a principal bundle is a method of connecting nearby fibres so as to capture the notion of a section $$s: X\to P$$ being constant or horizontal. Since the fibres of an abstract principal bundle are not naturally identified with each other, or indeed with the fibre space $$G$$ itself, there is no canonical way of specifying which sections are constant. A choice of local trivialisation leads to one possible choice, where if $$P$$ is trivial over a set $$U_{\alpha}$$, then a local section could be said to be horizontal if it is constant with respect to this trivialisation, in the sense that $$\varphi_{\alpha} (s(x)) = (x, g)$$ for all $$x\in U_{\alpha}$$ and one $$g\in G$$. In particular a trivial principal bundle $$P=X\times G$$ comes equipped with a trivial connection.

In general a connection is given by a choice of horizontal subspaces $$H_p \subset T_p P$$ of the tangent spaces at every point $$p\in P$$, such that at every point one has $$T_p P = H_p \oplus V_p$$ where $$V$$ is the vertical bundle defined by $$V=\ker d\pi$$. These horizontal subspaces must be compatible with the principal bundle structure by requiring that the horizontal distribution $$H$$ is invariant under the right group action: $$H_{pg} = d(R_g) (H_p)$$ where $$R_g: P \to P$$ denotes right multiplication by $$g$$. A section $$s$$ is said to be horizontal if $$T_p s \subset H_p$$ where $$s$$ is identified with its image inside $$P$$, which is a submanifold of $$P$$ with tangent bundle $$Ts$$. Given a vector field $$v\in \Gamma(TX)$$, there is a unique horizontal lift $$v^{\#}\in \Gamma(H)$$. The curvature of the connection $$H$$ is given by the two-form with values in the adjoint bundle $$F\in \Omega^2(X, \operatorname{ad}(P))$$ defined by
 * $$F(v_1, v_2) = [v_1^{\#}, v_2^{\#}] - [v_1, v_2]^{\#}$$

where $$[\cdot, \cdot]$$ is the Lie bracket of vector fields. Since the vertical bundle consists of the tangent spaces to the fibres of $$P$$ and these fibres are isomorphic to the Lie group $$G$$ whose tangent bundle is canonically identified with $$TG = G\times \mathfrak{g}$$, there is a unique Lie algebra-valued two-form $$F\in \Omega^2(P, \mathfrak{g})$$ corresponding to the curvature. From the perspective of the Frobenius integrability theorem, the curvature measures precisely the extent to which the horizontal distribution fails to be integrable, and therefore the extent to which $$H$$ fails to embed inside $$P$$ as a horizontal submanifold locally.

The choice of horizontal subspaces may be equivalently expressed by a projection operator $$\nu: TP \to V$$ which is equivariant in the correct sense, called the connection one-form. For a horizontal distribution $$H$$, this is defined by $$\nu_H (h+v) = v$$ where $$h+v$$ denotes the decomposition of a tangent vector with respect to the direct sum decomposition $$TP = H\oplus V$$. Due to the equivariance, this projection one-form may be taken to be Lie algebra-valued, giving some $$\nu \in \Omega^1(P, \mathfrak{g})$$.

A local trivialisation for $$P$$ is equivalently given by a local section $$s_{\alpha}: U_{\alpha} \to P_{U_{\alpha}}$$ and the connection one-form and curvature can be pulled back along this smooth map. This gives the local connection one-form $$A_{\alpha} = s_{\alpha}^* \nu\in \Omega^1(U_{\alpha}, \operatorname{ad}(P))$$ which takes values in the adjoint bundle of $$P$$. Cartan's structure equation says that the curvature may be expressed in terms of the local one-form $$A_{\alpha}$$ by the expression
 * $$F = dA_\alpha + \frac{1}{2} [A_\alpha, A_\alpha]$$

where we use the Lie bracket on the Lie algebra bundle $$\operatorname{ad}(P)$$ which is identified with $$U_\alpha \times \mathfrak{g}$$ on the local trivialisation $$U_{\alpha}$$.

Under a local gauge transformation $$g: U_\alpha \to G$$ so that $$\tilde A_{\alpha} = (g\circ s)^* \nu$$, the local connection one-form transforms by the expression
 * $$ \tilde A_\alpha = \operatorname{ad}(g)\circ A_\alpha + (g^{-1})^* \theta$$

where $$\theta$$ denotes the Maurer–Cartan form of the Lie group $$G$$. In the case where $$G$$ is a matrix Lie group, one has the simpler expression $$\tilde A_{\alpha} = g A_\alpha g^{-1} - (dg)g^{-1}.$$

Connections on vector bundles
A connection on a vector bundle may be specified similarly to the case for principal bundles above, known as an Ehresmann connection. However vector bundle connections admit a more powerful description in terms of a differential operator. A connection on a vector bundle is a choice of $$\mathbb{K}$$-linear differential operator
 * $$\nabla: \Gamma(E) \to \Gamma(T^*X \otimes E) = \Omega^1(E)$$

such that
 * $$\nabla(fs) = df\otimes s + f \nabla s$$

for all $$f\in C^{\infty}(X)$$ and sections $$s\in \Gamma(E)$$. The covariant derivative of a section $$s$$ in the direction of a vector field $$v$$ is defined by
 * $$\nabla_v (s) = \nabla s (v)$$

where on the right we use the natural pairing between $$\Omega^1(X)$$ and $$TX$$. This is a new section of the vector bundle $$E$$, thought of as the derivative of $$s$$ in the direction of $$v$$. The operator $$\nabla_v$$ is the covariant derivative operator in the direction of $$v$$. The curvature of $$\nabla$$ is given by the operator $$F_{\nabla}\in \Omega^2(\operatorname{End}(E))$$ with values in the endomorphism bundle, defined by
 * $$F_{\nabla}(v_1,v_2) = \nabla_{v_1} \nabla_{v_2} - \nabla_{v_2} \nabla_{v_1} - \nabla_{[v_1, v_2]}.$$

In a local trivialisation the exterior derivative $$d$$ acts as a trivial connection (corresponding in the principal bundle picture to the trivial connection discussed above). Namely for a local frame $$\boldsymbol{e}_1, \dots, \boldsymbol{e}_r$$ one defines
 * $$d (s^i \boldsymbol{e}_i) = ds^i \otimes \boldsymbol{e}_i$$

where here we have used Einstein notation for a local section $$s=s^i \boldsymbol{e}_i$$.

Any two connections $$\nabla_1, \nabla_2$$ differ by an $$\operatorname{End}(E)$$-valued one-form $$A$$. To see this, observe that the difference of two connections is $$C^{\infty}(X)$$-linear:
 * $$(\nabla_1 - \nabla_2)(fs) = f(\nabla_1-\nabla_2)(s).$$

In particular since every vector bundle admits a connection (using partitions of unity and the local trivial connections), the set of connections on a vector bundle has the structure of an infinite-dimensional affine space modelled on the vector space $$\Omega^1(\operatorname{End}(E))$$. This space is commonly denoted $$\mathcal{A}$$.

Applying this observation locally, every connection over a trivialising subset $$U_{\alpha}$$ differs from the trivial connection $$d$$ by some local connection one-form $$A_{\alpha}\in \Omega^1(U_{\alpha}, \operatorname{End}(E))$$, with the property that $$\nabla = d + A_{\alpha}$$ on $$U_{\alpha}$$. In terms of this local connection form, the curvature may be written as
 * $$F_A = dA_{\alpha} + A_{\alpha} \wedge A_{\alpha}$$

where the wedge product occurs on the one-form component, and one composes endomorphisms on the endomorphism component. To link back to the theory of principal bundles, notice that $$A\wedge A = \frac{1}{2}[A, A]$$ where on the right we now perform wedge of one-forms and commutator of endomorphisms.

Under a gauge transformation $$u$$ of the vector bundle $$E$$, a connection $$\nabla$$ transforms into a connection $$u\cdot \nabla$$ by the conjugation $$(u\cdot \nabla)_v(s) = u(\nabla_v(u^{-1}(s))$$. The difference $$u\cdot \nabla - \nabla = -(\nabla u)u^{-1}$$ where here $$\nabla$$ is acting on the endomorphisms of $$E$$. Under a local gauge transformation $$g$$ one obtains the same expression
 * $$\tilde A_{\alpha} = g A_{\alpha} g^{-1} - (dg)g^{-1}$$

as in the case of principal bundles.

Induced connections
A connection on a principal bundle induces connections on associated vector bundles. One way to see this is in terms of the local connection forms described above. Namely, if a principal bundle connection $$H$$ has local connection forms $$A_{\alpha}\in \Omega^1(U_{\alpha}, \operatorname{ad}(P))$$, and $$\rho: G \to \operatorname{Aut}(V)$$ is a representation of $$G$$ defining an associated vector bundle $$E=P\times_{\rho} V$$, then the induced local connection one-forms are defined by
 * $$\rho_* A_{\alpha} \in \Omega^1(U_{\alpha}, \operatorname{End}(E)).$$

Here $$\rho_*$$ is the induced Lie algebra homomorphism from $$\mathfrak{g} \to \operatorname{End}(V)$$, and we use the fact that this map induces a homomorphism of vector bundles $$\operatorname{ad}(P) \to \operatorname{End}(E)$$.

The induced curvature can be simply defined by
 * $$\rho_* F_A \in \Omega^2(U_\alpha, \operatorname{End}(E)).$$

Here one sees how the local expressions for curvature are related for principal bundles and vector bundles, as the Lie bracket on the Lie algebra $$\mathfrak{g}$$ is sent to the commutator of endomorphisms of $$\operatorname{End}(V)$$ under the Lie algebra homomorphism $$\rho_*$$.

Space of connections
The central object of study in mathematical gauge theory is the space of connections on a vector bundle or principal bundle. This is an infinite-dimensional affine space $$\mathcal{A}$$ modelled on the vector space $$\Omega^1(X, \operatorname{ad}(P))$$ (or $$\Omega^1(X, \operatorname{End}(E))$$ in the case of vector bundles). Two connections $$A, A'\in \mathcal{A}$$ are said to be gauge equivalent if there exists a gauge transformation $$u$$ such that $$A' = u\cdot A$$. Gauge theory is concerned with gauge equivalence classes of connections. In some sense gauge theory is therefore concerned with the properties of the quotient space $$\mathcal{A}/\mathcal{G}$$, which is in general neither a Hausdorff space or a smooth manifold.

Many interesting properties of the base manifold $$X$$ can be encoded in the geometry and topology of moduli spaces of connections on principal bundles and vector bundles over $$X$$. Invariants of $$X$$, such as Donaldson invariants or Seiberg–Witten invariants can be obtained by computing numeral quantities derived from moduli spaces of connections over $$X$$. The most famous application of this idea is Donaldson's theorem, which uses the moduli space of Yang–Mills connections on a principal $$\operatorname{SU}(2)$$-bundle over a simply connected four-manifold $$X$$ to study its intersection form. For this work Donaldson was awarded a Fields Medal.

Notational conventions
There are various notational conventions used for connections on vector bundles and principal bundles which will be summarised here.
 * The letter $$A$$ is the most common symbol used to represent a connection on a vector bundle or principal bundle. It comes from the fact that if one chooses a fixed connection $$\nabla_0\in \mathcal{A}$$ of all connections, then any other connection may be written $$\nabla = \nabla_0 + A$$ for some unique one-form $$A\in \Omega^1(X, \operatorname{ad}(P))$$. It also comes from the use of $$A_{\alpha}$$ to denote the local form of the connection on a vector bundle, which subsequently comes from the electromagnetic potential $$A$$ in physics. Sometimes the symbol $$\omega$$ is also used to refer to the connection form, usually on a principal bundle, and usually in this case $$\omega$$ refers to the global connection one-form $$\omega \in \Omega^1(P, \mathfrak{g})$$ on the total space of the principal bundle, rather than the corresponding local connections forms. This convention is usually avoided in the mathematical literature as it often clashes with the use of $$\omega$$ for a Kähler form when the underlying manifold $$X$$ is a Kähler manifold.
 * The symbol $$\nabla$$ is most commonly used to represent a connection on a vector bundle as a differential operator, and in that sense is used interchangeably with the letter $$A$$. It is also used to refer to the covariant derivative operators $$\nabla_X$$. Alternative notation for the connection operator and covariant derivative operators is $$\nabla_A$$ to emphasize the dependence on the choice of $$A\in \mathcal{A}$$, or $$D_A$$ or $$d_A$$.
 * The operator $$d_A$$ most commonly refers to the exterior covariant derivative of a connection $$A$$ (and so is sometimes written $$d_{\nabla}$$ for a connection $$\nabla$$). Since the exterior covariant derivative in degree 0 is the same as the regular covariant derivative, the connection or covariant derivative itself is often denoted $$d_A$$ instead of $$\nabla$$.
 * The symbol $$F_A$$ or $$F_{\nabla}$$ is most commonly used to refer to the curvature of a connection. When the connection is referred to by $$\omega$$, the curvature is referred to by $$\Omega$$ rather than $$F_{\omega}$$. Other conventions involve $$R$$ or $$R_A$$ or $$R_{\nabla}$$, by analogy with the Riemannian curvature tensor in Riemannian geometry which is denoted by $$R$$.
 * The letter $$H$$ is often used to denote a principal bundle connection or Ehresmann connection when emphasis is to be placed on the horizontal distribution $$H\subset TP$$. In this case the vertical projection operator corresponding to $$H$$ (the connection one-form on $$P$$) is usually denoted $$\omega$$, or $$v$$, or $$\nu$$. Using this convention the curvature is sometimes denoted $$F_H$$ to emphasize the dependence, and $$F_H$$ may refer to either the curvature operator on the total space $$F_H \in \Omega^2(P, \mathfrak{g})$$, or the curvature on the base $$F_H \in \Omega^2(X, \operatorname{ad}(P))$$.
 * The Lie algebra adjoint bundle is usually denoted $$\operatorname{ad}(P)$$, and the Lie group adjoint bundle by $$\operatorname{Ad}(P)$$. This disagrees with the convention in the theory of Lie groups, where $$\operatorname{Ad}$$ refers to the representation of $$G$$ on $$\mathfrak{g}$$, and $$\operatorname{ad}$$ refers to the Lie algebra representation of $$\mathfrak{g}$$ on itself by the Lie bracket. In the Lie group theory the conjugation action (which defines the bundle $$\operatorname{Ad}(P)$$) is often denoted by $$\Psi_g$$.

Dictionary of mathematical and physical terminology
The mathematical and physical fields of gauge theory involve the study of the same objects, but use different terminology to describe them. Below is a summary of how these terms relate to each other.

As a demonstration of this dictionary, consider an interacting term of an electron-positron particle field and the electromagnetic field in the Lagrangian of quantum electrodynamics:
 * $$\mathcal{L} = \bar\psi(i\gamma^\mu D_\mu - m)\psi - \frac{1}{4}F_{\mu\nu}F^{\mu\nu},$$

Mathematically this might be rewritten
 * $$ \mathcal{L}= \langle \psi, ({D\!\!\!\!/}_A - m) \psi \rangle_{L^2} + \|F_A\|_{L^2}^2 $$

where $$A$$ is a connection on a principal $$\operatorname{U}(1)$$ bundle $$P$$, $$\psi$$ is a section of an associated spinor bundle and $${D\!\!\!\!/}_A$$ is the induced Dirac operator of the induced covariant derivative $$\nabla_A$$ on this associated bundle. The first term is an interacting term in the Lagrangian between the spinor field (the field representing the electron-positron) and the gauge field (representing the electromagnetic field). The second term is the regular Yang–Mills functional which describes the basic non-interacting properties of the electromagnetic field (the connection $$A$$). The term of the form $$\nabla_A \psi$$ is an example of what in physics is called minimal coupling, that is, the simplest possible interaction between a matter field $$\psi$$ and a gauge field $$A$$.

Yang–Mills theory
The predominant theory that occurs in mathematical gauge theory is Yang–Mills theory. This theory involves the study of connections which are critical points of the Yang–Mills functional defined by
 * $$\operatorname{YM}(A) = \int_X \|F_A\|^2 \, d\mathrm{vol}_g$$

where $$(X,g)$$ is an oriented Riemannian manifold with $$d\mathrm{vol}_g$$ the Riemannian volume form and $$\| \cdot \|^2$$ an $$L^2$$-norm on the adjoint bundle $$\operatorname{ad}(P)$$. This functional is the square of the $$L^2$$-norm of the curvature of the connection $$A$$, so connections which are critical points of this function are those with curvature as small as possible (or higher local minima of $$\operatorname{YM}$$).

These critical points are characterised as solutions of the associated Euler–Lagrange equations, the Yang–Mills equations
 * $$d_A \star F_A = 0$$

where $$d_A$$ is the induced exterior covariant derivative of $$\nabla_A$$ on $$\operatorname{ad}(P)$$ and $$\star$$ is the Hodge star operator. Such solutions are called Yang–Mills connections and are of significant geometric interest.

The Bianchi identity asserts that for any connection, $$d_A F_A = 0$$. By analogy for differential forms a harmonic form $$\omega$$ is characterised by the condition
 * $$d \star \omega = d \omega = 0.$$

If one defined a harmonic connection by the condition that
 * $$d_A \star F_A = d_A F_A = 0$$

the then study of Yang–Mills connections is similar in nature to that of harmonic forms. Hodge theory provides a unique harmonic representative of every de Rham cohomology class $$[\omega]$$. Replacing a cohomology class by a gauge orbit $$\{u\cdot A \mid u\in \mathcal{G}\}$$, the study of Yang–Mills connections can be seen as trying to find unique representatives for each orbit in the quotient space $$\mathcal{A}/\mathcal{G}$$ of connections modulo gauge transformations.

Self-duality and anti-self-duality equations
In dimension four the Hodge star operator sends two-forms to two-forms, $$\star: \Omega^2(X) \to \Omega^2(X)$$, and squares to the identity operator, $$\star^2 = \operatorname{Id}$$. Thus the Hodge star operating on two-forms has eigenvalues $$\pm 1$$, and the two-forms on an oriented Riemannian four-manifold split as a direct sum
 * $$\Omega^2(X) = \Omega_+ (X) \oplus \Omega_-(X)$$

into the self-dual and anti-self-dual two-forms, given by the $$+1$$ and $$-1$$ eigenspaces of the Hodge star operator respectively. That is, $$\alpha \in \Omega^2(X)$$ is self-dual if $$\star \alpha = \alpha$$, and anti-self dual if $$\star \alpha = -\alpha$$, and every differential two-form admits a splitting $$\alpha = \alpha_+ + \alpha_-$$ into self-dual and anti-self-dual parts.

If the curvature of a connection $$A$$ on a principal bundle over a four-manifold is self-dual or anti-self-dual then by the Bianchi identity $$d_A \star F_A = \pm d_A F_A = 0$$, so the connection is automatically a Yang–Mills connection. The equation
 * $$\star F_A = \pm F_A$$

is a first order partial differential equation for the connection $$A$$, and therefore is simpler to study than the full second order Yang–Mills equation. The equation $$\star F_A = F_A$$ is called the self-duality equation, and the equation $$\star F_A = -F_A$$ is called the anti-self-duality equation, and solutions to these equations are self-dual connections or anti-self-dual connections respectively.

Dimensional reduction
One way to derive new and interesting gauge-theoretic equations is to apply the process of dimensional reduction to the Yang–Mills equations. This process involves taking the Yang–Mills equations over a manifold $$X$$ (usually taken to be the Euclidean space $$X=\mathbb{R}^4$$), and imposing that the solutions of the equations be invariant under a group of translational or other symmetries. Through this process the Yang–Mills equations lead to the Bogomolny equations describing monopoles on $$\mathbb{R}^3$$, Hitchin's equations describing Higgs bundles on Riemann surfaces, and the Nahm equations on real intervals, by imposing symmetry under translations in one, two, and three directions respectively.

Gauge theory in one and two dimensions
Here the Yang–Mills equations when the base manifold $$X$$ is of low dimension is discussed. In this setting the equations simplify dramatically due to the fact that in dimension one there are no two-forms, and in dimension two the Hodge star operator on two-forms acts as $$\star: \Omega^2(X) \to C^{\infty}(X)$$.

Yang–Mills theory
One may study the Yang–Mills equations directly on a manifold of dimension two. The theory of Yang–Mills equations when the base manifold is a compact Riemann surface was carried about by Michael Atiyah and Raoul Bott. In this case the moduli space of Yang–Mills connections over a complex vector bundle $$E$$ admits various rich interpretations, and the theory serves as the simplest case to understand the equations in higher dimensions. The Yang–Mills equations in this case become
 * $$\star F_A = \lambda(E) \operatorname{Id}_E$$

for some topological constant $$\lambda(E)\in \mathbb{C}$$ depending on $$E$$. Such connections are called projectively flat, and in the case where the vector bundle is topologically trivial (so $$\lambda(E) = 0$$) they are precisely the flat connections.

When the rank and degree of the vector bundle are coprime, the moduli space $$\mathcal{M}$$ of Yang–Mills connections is smooth and has a natural structure of a symplectic manifold. Atiyah and Bott observed that since the Yang–Mills connections are projectively flat, their holonomy gives projective unitary representations of the fundamental group of the surface, so that this space has an equivalent description as a moduli space of projective unitary representations of the fundamental group of the Riemann surface, a character variety. The theorem of Narasimhan and Seshadri gives an alternative description of this space of representations as the moduli space of stable holomorphic vector bundles which are smoothly isomorphic to the $$E$$. Through this isomorphism the moduli space of Yang–Mills connections gains a complex structure, which interacts with the symplectic structure of Atiyah and Bott to make it a compact Kähler manifold.

Simon Donaldson gave an alternative proof of the theorem of Narasimhan and Seshadri that directly passed from Yang–Mills connections to stable holomorphic structures. Atiyah and Bott used this rephrasing of the problem to illuminate the intimate relationship between the extremal Yang–Mills connections and the stability of the vector bundles, as an infinite-dimensional moment map for the action of the gauge group $$\mathcal{G}$$, given by the curvature map $$A\mapsto F_A$$ itself. This observation phrases the Narasimhan–Seshadri theorem as a kind of infinite-dimensional version of the Kempf–Ness theorem from geometric invariant theory, relating critical points of the norm squared of the moment map (in this case Yang–Mills connections) to stable points on the corresponding algebraic quotient (in this case stable holomorphic vector bundles). This idea has been subsequently very influential in gauge theory and complex geometry since its introduction.

Nahm equations
The Nahm equations, introduced by Werner Nahm, are obtained as the dimensional reduction of the anti-self-duality in four dimensions to one dimension, by imposing translational invariance in three directions. Concretely, one requires that the connection form $$A=A_0 \,dx^0 + A_1 \,dx^1 + A_2 \,dx^2 + A_3 \,dx^3$$ does not depend on the coordinates $$x^1, x^2, x^3$$. In this setting the Nahm equations between a system of equations on an interval $$I\subset \mathbb{R}$$ for four matrices $$T_0, T_1, T_2, T_3 \in C^{\infty}(I, \mathfrak{g})$$ satisfying the triple of equations
 * $$\begin{cases}

\frac{dT_1}{dt} + [T_0, T_1] + [T_2, T_3] = 0\\ \frac{dT_2}{dt} + [T_0, T_2] + [T_3, T_1] = 0\\ \frac{dT_3}{dt} + [T_0, T_3] + [T_1, T_2] = 0. \end{cases}$$ It was shown by Nahm that the solutions to these equations (which can be obtained fairly easily as they are a system of ordinary differential equations) can be used to construct solutions to the Bogomolny equations, which describe monopoles on $$\mathbb{R}^3$$. Nigel Hitchin showed that solutions to the Bogomolny equations could be used to construct solutions to the Nahm equations, showing solutions to the two problems were equivalent. Donaldson further showed that solutions to the Nahm equations are equivalent to rational maps of degree $$k$$ from the complex projective line $$\mathbb{CP}^1$$ to itself, where $$k$$ is the charge of the corresponding magnetic monopole.

The moduli space of solutions to the Nahm equations has the structure of a hyperkähler manifold.

Hitchin's equations and Higgs bundles
Hitchin's equations, introduced by Nigel Hitchin, are obtained as the dimensional reduction of the self-duality equations in four dimensions to two dimensions by imposing translation invariance in two directions. In this setting the two extra connection form components $$A_3 \,dx^3 + A_4 \,dx^4$$ can be combined into a single complex-valued endomorphism $$\Phi = A_3 + i A_4$$, and when phrased in this way the equations become conformally invariant and therefore are natural to study on a compact Riemann surface rather than $$\mathbb{R}^2$$. Hitchin's equations state that for a pair $$(A,\Phi)$$ on a complex vector bundle $$E\to \Sigma$$ where $$\Phi\in \Omega^{1,0}(\Sigma, \operatorname{End}(E))$$, that
 * $$\begin{cases}

F_A + [\Phi, \Phi^*] = 0\\ \bar\partial_A \Phi = 0\end{cases}$$ where $$\bar\partial_A $$ is the $$(0,1)$$-component of $$d_A$$. Solutions of Hitchin's equations are called Hitchin pairs.

Whereas solutions to the Yang–Mills equations on a compact Riemann surface correspond to projective unitary representations of the surface group, Hitchin showed that solutions to Hitchin's equations correspond to projective complex representations of the surface group. The moduli space of Hitchin pairs naturally has (when the rank and degree of the bundle are coprime) the structure of a Kähler manifold. Through an analogue of Atiyah and Bott's observation about the Yang–Mills equations, Hitchin showed that Hitchin pairs correspond to so-called stable Higgs bundles, where a Higgs bundle is a pair $$(E, \Phi)$$ where $$E\to \Sigma$$ is a holomorphic vector bundle and $$\Phi :E \to E\otimes K$$ is a holomorphic endomorphism of $$E$$ with values in the canonical bundle of the Riemann surface $$\Sigma$$. This is shown through an infinite-dimensional moment map construction, and this moduli space of Higgs bundles also has a complex structure, which is different to that coming from the Hitchin pairs, leading to two complex structures on the moduli space $$\mathcal{M}$$ of Higgs bundles. These combine to give a third making this moduli space a hyperkähler manifold.

Hitchin's work was subsequently vastly generalised by Carlos Simpson, and the correspondence between solutions to Hitchin's equations and Higgs bundles over an arbitrary Kähler manifold is known as the nonabelian Hodge theorem.

Monopoles
The dimensional reduction of the Yang–Mills equations to three dimensions by imposing translational invariance in one direction gives rise to the Bogomolny equations for a pair $$(A, \Phi)$$ where $$\Phi: \mathbb{R}^3 \to \mathfrak{g}$$ is a family of matrices. The equations are
 * $$F_A = \star d_A \Phi.$$

When the principal bundle $$P\to \mathbb{R}^3$$ has structure group $$\operatorname{U}(1)$$ the circle group, solutions to the Bogomolny equations model the Dirac monopole describing a magnetic monopole in classical electromagnetism. The work of Nahm and Hitchin shows that when the structure group is the special unitary group $$\operatorname{SU}(2)$$ solutions to the monopole equations correspond to solutions to the Nahm equations, and by work of Donaldson these further correspond to rational maps from $$\mathbb{CP}^1$$ to itself of degree $$k$$ where $$k$$ is the charge of the monopole. This charge is defined as the limit
 * $$ \lim_{R\to \infty} \int_{S_R} (\Phi, F_A) = 4\pi k$$

of the integral of the pairing $$(\Phi, F_A)\in \Omega^2(\mathbb{R}^3)$$ over spheres $$S_R$$ in $$\mathbb{R}^3$$ of increasing radius $$R$$.

Chern–Simons theory
Chern–Simons theory in 3 dimensions is a topological quantum field theory with an action functional proportional to the integral of the Chern–Simons form, a three-form defined by
 * $$\operatorname{Tr}(F_A \wedge A -\frac{1}{3} A\wedge A \wedge A).$$

Classical solutions to the Euler–Lagrange equations of the Chern–Simons functional on a closed 3-manifold $$X$$ correspond to flat connections on the principal $$G$$-bundle $$P\to X$$. However, when $$X$$ has a boundary the situation becomes more complicated. Chern–Simons theory was used by Edward Witten to express the Jones polynomial, a knot invariant, in terms of the vacuum expectation value of a Wilson loop in $$\operatorname{SU}(2)$$ Chern–Simons theory on the three-sphere $$S^3$$. This was a stark demonstration of the power of gauge theoretic problems to provide new insight in topology, and was one of the first instances of a topological quantum field theory.

In the quantization of the classical Chern–Simons theory, one studies the induced flat or projectively flat connections on the principal bundle restricted to surfaces $$\Sigma \subset X$$ inside the 3-manifold. The classical state spaces corresponding to each surface are precisely the moduli spaces of Yang–Mills equations studied by Atiyah and Bott. The geometric quantization of these spaces was achieved by Nigel Hitchin and Axelrod–Della Pietra–Witten independently, and in the case where the structure group is complex, the configuration space is the moduli space of Higgs bundles and its quantization was achieved by Witten.

Floer homology
Andreas Floer introduced a type of homology on a 3-manifolds defined in analogy with Morse homology in finite dimensions. In this homology theory, the Morse function is the Chern–Simons functional on the space of connections on an $$\operatorname{SU}(2)$$ principal bundle over the 3-manifold $$X$$. The critical points are the flat connections, and the flow lines are defined to be the Yang–Mills instantons on $$M\times I$$ that restrict to the critical flat connections on the two boundary components. This leads to instanton Floer homology. The Atiyah–Floer conjecture asserts that instanton Floer homology agrees with the Lagrangian intersection Floer homology of the moduli space of flat connections on the surface $$\Sigma\subset X$$ defining a Heegaard splitting of $$X$$, which is symplectic due to the observations of Atiyah and Bott.

In analogy with instanton Floer homology one may define Seiberg–Witten Floer homology where instantons are replaced with solutions of the Seiberg–Witten equations. By work of Clifford Taubes this is known to be isomorphic to embedded contact homology and subsequently Heegaard Floer homology.

Gauge theory in four dimensions
Gauge theory has been most intensively studied in four dimensions. Here the mathematical study of gauge theory overlaps significantly with its physical origins, as the standard model of particle physics can be thought of as a quantum field theory on a four-dimensional spacetime. The study of gauge theory problems in four dimensions naturally leads to the study of topological quantum field theory. Such theories are physical gauge theories that are insensitive to changes in the Riemannian metric of the underlying four-manifold, and therefore can be used to define topological (or smooth structure) invariants of the manifold.

Anti-self-duality equations
In four dimensions the Yang–Mills equations admit a simplification to the first order anti-self-duality equations $$\star F_A = -F_A$$ for a connection $$A$$ on a principal bundle $$P\to X$$ over an oriented Riemannian four-manifold $$X$$. These solutions to the Yang–Mills equations represent the absolute minima of the Yang–Mills functional, and the higher critical points correspond to the solutions $$d_A \star F_A = 0$$ that do not arise from anti-self-dual connections. The moduli space of solutions to the anti-self-duality equations, $$\mathcal{M}_P$$, allows one to derive useful invariants about the underlying four-manifold. This theory is most effective in the case where $$X$$ is simply connected. For example, in this case Donaldson's theorem asserts that if the four-manifold has negative-definite intersection form (4-manifold), and if the principal bundle has structure group the special unitary group $$\operatorname{SU}(2)$$ and second Chern class $$c_2(P) = 1$$, then the moduli space $$\mathcal{M}_P$$ is five-dimensional and gives a cobordism between $$X$$ itself and a disjoint union of $$b_2(X)$$ copies of $$\mathbb{CP}^2$$ with its orientation reversed. This implies that the intersection form of such a four-manifold is diagonalisable. There are examples of simply connected topological four-manifolds with non-diagonalisable intersection form, such as the E8 manifold, so Donaldson's theorem implies the existence of topological four-manifolds with no smooth structure. This is in stark contrast with two or three dimensions, in which topological structures and smooth structures are equivalent: any topological manifold of dimension less than or equal to 3 has a unique smooth structure on it.

Similar techniques were used by Clifford Taubes and Donaldson to show that Euclidean space $$\mathbb{R}^4$$ admits uncountably infinitely many distinct smooth structures. This is in stark contrast to any dimension other than four, where Euclidean space has a unique smooth structure.

An extension of these ideas leads to Donaldson theory, which constructs further invariants of smooth four-manifolds out of the moduli spaces of connections over them. These invariants are obtained by evaluating cohomology classes on the moduli space against a fundamental class, which exists due to analytical work showing the orientability and compactness of the moduli space by Karen Uhlenbeck, Taubes, and Donaldson.

When the four-manifold is a Kähler manifold or algebraic surface and the principal bundle has vanishing first Chern class, the anti-self-duality equations are equivalent to the Hermitian Yang–Mills equations on the complex manifold $$X$$. The Kobayashi–Hitchin correspondence proven for algebraic surfaces by Donaldson, and in general by Uhlenbeck and Yau, asserts that solutions to the HYM equations correspond to stable holomorphic vector bundles. This work gave an alternate algebraic description of the moduli space and its compactification, because the moduli space of semistable holomorphic vector bundles over a complex manifold is a projective variety, and therefore compact. This indicates one way of compactifying the moduli space of connections is to add in connections corresponding to semi-stable vector bundles, so-called almost Hermitian Yang–Mills connections.

Seiberg–Witten equations
During their investigation of supersymmetry in four dimensions, Edward Witten and Nathan Seiberg uncovered a system of equations now called the Seiberg–Witten equations, for a connection $$A$$ and spinor field $$\psi$$. In this case the four-manifold must admit a SpinC structure, which defines a principal SpinC bundle $$P$$ with determinant line bundle $$L$$, and an associated spinor bundle $$S^+$$. The connection $$A$$ is on $$L$$, and the spinor field $$\psi \in \Gamma(S^+)$$. The Seiberg–Witten equations are given by
 * $$\begin{cases}

F_A^+ = \psi \otimes \psi^* - \frac{1}{2} |\psi|^2\\ d_A \psi = 0.\end{cases}$$ Solutions to the Seiberg–Witten equations are called monopoles. The moduli space of solutions to the Seiberg–Witten equations, $$\mathcal{M}_{\sigma}$$ where $$\sigma$$ denotes the choice of Spin structure, is used to derive the Seiberg–Witten invariants. The Seiberg–Witten equations have an advantage over the anti-self-duality equations, in that the equations themselves may be perturbed slightly to give the moduli space of solutions better properties. To do this, an arbitrary self-dual two-form is added on to the first equation. For generic choices of metric $$g$$ on the underlying four-manifold, and choice of perturbing two-form, the moduli space of solutions is a compact smooth manifold. In good circumstances (when the manifold $$X$$ is of simple type), this moduli space is zero-dimensional: a finite collection of points. The Seiberg–Witten invariant in this case is simply the number of points in the moduli space. The Seiberg–Witten invariants can be used to prove many of the same results as Donaldson invariants, but often with easier proofs which apply in more generality.

Hermitian Yang–Mills equations
A particular class of Yang–Mills connections are possible to study over Kähler manifolds or Hermitian manifolds. The Hermitian Yang–Mills equations generalise the anti-self-duality equations occurring in four-dimensional Yang–Mills theory to holomorphic vector bundles over Hermitian complex manifolds in any dimension. If $$E\to X$$ is a holomorphic vector bundle over a compact Kähler manifold $$(X,\omega)$$, and $$A$$ is a Hermitian connection on $$E$$ with respect to some Hermitian metric $$h$$. The Hermitian Yang–Mills equations are
 * $$\begin{cases}

F_A^{0,2} = 0\\ \Lambda_{\omega} F_A = \lambda(E) \operatorname{Id}_E, \end{cases}$$ where $$\lambda(E)\in \mathbb{C}$$ is a topological constant depending on $$E$$. These may be viewed either as an equation for the Hermitian connection $$A$$ or for the corresponding Hermitian metric $$h$$ with associated Chern connection $$A$$. In four dimensions the HYM equations are equivalent to the ASD equations. In two dimensions the HYM equations correspond to the Yang–Mills equations considered by Atiyah and Bott. The Kobayashi–Hitchin correspondence asserts that solutions of the HYM equations are in correspondence with polystable holomorphic vector bundles. In the case of compact Riemann surfaces this is the theorem of Narasimhan and Seshadri as proven by Donaldson. For algebraic surfaces it was proven by Donaldson, and in general it was proven by Karen Uhlenbeck and Shing-Tung Yau. This theorem is generalised in the nonabelian Hodge theorem by Simpson, and is in fact a special case of it where the Higgs field of a Higgs bundle $$(E, \Phi)$$ is set to zero.

Exceptional holonomy instantons
The effectiveness of solutions of the Yang–Mills equations in defining invariants of four-manifolds has led to interest that they may help distinguish between exceptional holonomy manifolds such as G2 manifolds in dimension 7 and Spin(7) manifolds in dimension 8, as well as related structures such as Calabi–Yau 6-manifolds and nearly Kähler manifolds.

String theory
New gauge-theoretic problems arise out of superstring theory models. In such models the universe is 10 dimensional consisting of four dimensions of regular spacetime and a 6-dimensional Calabi–Yau manifold. In such theories the fields which act on strings live on bundles over these higher dimensional spaces, and one is interested in gauge-theoretic problems relating to them. For example, the limit of the natural field theories in superstring theory as the string radius approaches zero (the so-called large volume limit) on a Calabi–Yau 6-fold is given by Hermitian Yang–Mills equations on this manifold. Moving away from the large volume limit one obtains the deformed Hermitian Yang–Mills equation, which describes the equations of motion for a D-brane in the B-model of superstring theory. Mirror symmetry predicts that solutions to these equations should correspond to special Lagrangian submanifolds of the mirror dual Calabi–Yau.