Dirac bracket

The Dirac bracket is a generalization of the Poisson bracket developed by Paul Dirac to treat classical systems with second class constraints in Hamiltonian mechanics, and to thus allow them to undergo canonical quantization. It is an important part of Dirac's development of Hamiltonian mechanics to elegantly handle more general Lagrangians; specifically, when constraints are at hand, so that the number of apparent variables exceeds that of dynamical ones. More abstractly, the two-form implied from the Dirac bracket is the restriction of the symplectic form to the constraint surface in phase space.

This article assumes familiarity with the standard Lagrangian and Hamiltonian formalisms, and their connection to canonical quantization. Details of Dirac's modified Hamiltonian formalism are also summarized to put the Dirac bracket in context.

Inadequacy of the standard Hamiltonian procedure
The standard development of Hamiltonian mechanics is inadequate in several specific situations:
 * 1) When the Lagrangian is at most linear in the velocity of at least one coordinate; in which case, the definition of the canonical momentum leads to a constraint. This is the most frequent reason to resort to Dirac brackets. For instance, the Lagrangian (density) for any fermion is of this form.
 * 2) When there are gauge (or other unphysical) degrees of freedom which need to be fixed.
 * 3) When there are any other constraints that one wishes to impose in phase space.

Example of a Lagrangian linear in velocity
An example in classical mechanics is a particle with charge $q$ and mass $m$  confined to the $x$ - $y$ plane with a strong constant, homogeneous perpendicular magnetic field, so then pointing in the $z$-direction with strength $B$.

The Lagrangian for this system with an appropriate choice of parameters is


 * $$ L = \tfrac{1}{2}m\vec{v}^2 + \frac{q}{c}\vec{A}\cdot\vec{v} - V(\vec{r}),$$

where $→ A$  is the vector potential for the magnetic field, $→ B$; $c$  is the speed of light in vacuum; and $V(→ r)$ is an arbitrary external scalar potential; one could easily take it to be quadratic in $x$ and $y$, without loss of generality. We use


 * $$ \vec{A} = \frac{B}{2}(x\hat{y} - y\hat{x})$$

as our vector potential; this corresponds to a uniform and constant magnetic field B in the z direction. Here, the hats indicate unit vectors. Later in the article, however, they are used to distinguish quantum mechanical operators from their classical analogs. The usage should be clear from the context.

Explicitly, the Lagrangian amounts to just



L = \frac{m}{2}(\dot{x}^2 + \dot{y}^2) + \frac{qB}{2c}(x\dot{y} - y\dot{x}) - V(x, y) ~, $$

which leads to the equations of motion



m\ddot{x} = - \frac{\partial V}{\partial x} + \frac{q B}{c}\dot{y} $$



m\ddot{y} = - \frac{\partial V}{\partial y} - \frac{q B}{c}\dot{x}. $$

For a harmonic potential, the gradient of $V$ amounts to just the coordinates, $−(x,y)$.

Now, in the limit of a very large magnetic field, $qB/mc ≫ 1$. One may then drop the kinetic term to produce a simple approximate Lagrangian,



L = \frac{qB}{2c}(x\dot{y} - y\dot{x}) - V(x, y)~, $$

with first-order equations of motion



\dot{y} = \frac{c}{q B}\frac{\partial V}{\partial x} $$

\dot{x} = -\frac{c}{q B}\frac{\partial V}{\partial y}~. $$

Note that this approximate Lagrangian is linear in the velocities, which is one of the conditions under which the standard Hamiltonian procedure breaks down. While this example has been motivated as an approximation, the Lagrangian under consideration is legitimate and leads to consistent equations of motion in the Lagrangian formalism.

Following the Hamiltonian procedure, however, the canonical momenta associated with the coordinates are now



p_x = \frac{\partial L}{\partial \dot{x}} = -\frac{q B}{2c}y $$

p_y = \frac{\partial L}{\partial \dot{y}} = \frac{q B}{2c}x ~, $$

which are unusual in that they are not invertible to the velocities; instead, they are constrained to be functions of the coordinates: the four phase-space variables are linearly dependent, so the variable basis is overcomplete.

A Legendre transformation then produces the Hamiltonian



H(x,y, p_x, p_y) = \dot{x}p_x + \dot{y} p_y - L = V(x, y). $$

Note that this "naive" Hamiltonian has no dependence on the momenta, which means that equations of motion (Hamilton's equations) are inconsistent.

The Hamiltonian procedure has broken down. One might try to fix the problem by eliminating two of the components of the $4$-dimensional phase space, say $y$ and $p_{y}$, down to a reduced phase space of $2$ dimensions, that is sometimes expressing the coordinates as momenta and sometimes as coordinates. However, this is neither a general nor rigorous solution. This gets to the heart of the matter: that the definition of the canonical momenta implies a constraint on phase space (between momenta and coordinates) that was never taken into account.

Generalized Hamiltonian procedure
In Lagrangian mechanics, if the system has holonomic constraints, then one generally adds Lagrange multipliers to the Lagrangian to account for them. The extra terms vanish when the constraints are satisfied, thereby forcing the path of stationary action to be on the constraint surface. In this case, going to the Hamiltonian formalism introduces a constraint on phase space in Hamiltonian mechanics, but the solution is similar.

Before proceeding, it is useful to understand the notions of weak equality and strong equality. Two functions on phase space, $f$ and  $g$, are weakly equal if they are equal when the constraints are satisfied, but not throughout the phase space, denoted $f ≈ g$. If $f$ and  $g$  are equal independently of the constraints being satisfied, they are called strongly equal, written $f = g$. It is important to note that, in order to get the right answer, no weak equations may be used before evaluating derivatives or Poisson brackets.

The new procedure works as follows, start with a Lagrangian and define the canonical momenta in the usual way. Some of those definitions may not be invertible and instead give a constraint in phase space (as above). Constraints derived in this way or imposed from the beginning of the problem are called primary constraints. The constraints, labeled $φ_{j}$, must weakly vanish, $φ_{j }(p,q) ≈ 0$.

Next, one finds the naive Hamiltonian, $H$, in the usual way via a Legendre transformation, exactly as in the above example. Note that the Hamiltonian can always be written as a function of $q$s and $p$s only, even if the velocities cannot be inverted into functions of the momenta.

Generalizing the Hamiltonian
Dirac argues that we should generalize the Hamiltonian (somewhat analogously to the method of Lagrange multipliers) to



H^* = H + \sum_j c_j\phi_j \approx H, $$

where the $c_{j}$ are not constants but functions of the coordinates and momenta. Since this new Hamiltonian is the most general function of coordinates and momenta weakly equal to the naive Hamiltonian, $H^{*}$ is the broadest generalization of the Hamiltonian possible so that $δH * ≈ δH$ when  $δφ_{j} ≈ 0$.

To further illuminate the $c_{j}$, consider how one gets the equations of motion from the naive Hamiltonian in the standard procedure. One expands the variation of the Hamiltonian out in two ways and sets them equal (using a somewhat abbreviated notation with suppressed indices and sums):



\delta H = \frac{\partial H}{\partial q}\delta q + \frac{\partial H}{\partial p}\delta p        \approx \dot{q}\delta p - \dot{p}\delta q  ~, $$

where the second equality holds after simplifying with the Euler-Lagrange equations of motion and the definition of canonical momentum. From this equality, one deduces the equations of motion in the Hamiltonian formalism from



\left(\frac{\partial H}{\partial q} + \dot{p}\right)\delta q + \left(\frac{\partial H}{\partial p} - \dot{q}\right)\delta p = 0 ~, $$

where the weak equality symbol is no longer displayed explicitly, since by definition the equations of motion only hold weakly. In the present context, one cannot simply set the coefficients of  $δq$ and   $δp$ separately to zero, since the variations are somewhat restricted by the constraints. In particular, the variations must be tangent to the constraint surface.

One can demonstrate that the solution to



\sum_n A_n\delta q_n + \sum_n B_n\delta p_n = 0, $$

for the variations $δq_{n}$ and $δp_{n}$ restricted by the constraints $Φ_{j} ≈ 0$ (assuming the constraints satisfy some regularity conditions) is generally



A_n = \sum_m u_m \frac{\partial \phi_m}{\partial q_n} $$

B_n = \sum_m u_m \frac{\partial \phi_m}{\partial p_n}, $$

where the $u_{m}$ are arbitrary functions.

Using this result, the equations of motion become



\dot{p}_j = -\frac{\partial H}{\partial q_j} - \sum_k u_k \frac{\partial \phi_k}{\partial q_j} $$

\dot{q}_j = \frac{\partial H}{\partial p_j} + \sum_k u_k \frac{\partial \phi_k}{\partial p_j} $$

\phi_j(q, p) = 0, $$

where the $u_{k}$ are functions of coordinates and velocities that can be determined, in principle, from the second equation of motion above.

The Legendre transform between the Lagrangian formalism and the Hamiltonian formalism has been saved at the cost of adding new variables.

Consistency conditions
The equations of motion become more compact when using the Poisson bracket, since if $f$ is some function of the coordinates and momenta then



\dot{f} \approx \{f, H^*\}_{PB} \approx \{f, H\}_{PB} + \sum_k u_k\{f, \phi_k\}_{PB}, $$

if one assumes that the Poisson bracket with the $u_{k}$ (functions of the velocity) exist; this causes no problems since the contribution weakly vanishes. Now, there are some consistency conditions which must be satisfied in order for this formalism to make sense. If the constraints are going to be satisfied, then their equations of motion must weakly vanish, that is, we require



\dot{\phi_j} \approx \{\phi_j, H\}_{PB} + \sum_k u_k\{\phi_j,\phi_k\}_{PB} \approx 0. $$

There are four different types of conditions that can result from the above:
 * 1) An equation that is inherently false, such as $1=0$.
 * 2) An equation that is identically true, possibly after using one of our primary constraints.
 * 3) An equation that places new constraints on our coordinates and momenta, but is independent of the  $u_{k}$.
 * 4) An equation that serves to specify the $u_{k}$.

The first case indicates that the starting Lagrangian gives inconsistent equations of motion, such as $L = q$. The second case does not contribute anything new.

The third case gives new constraints in phase space. A constraint derived in this manner is called a secondary constraint. Upon finding the secondary constraint one should add it to the extended Hamiltonian and check the new consistency conditions, which may result in still more constraints. Iterate this process until there are no more constraints. The distinction between primary and secondary constraints is largely an artificial one (i.e. a constraint for the same system can be primary or secondary depending on the Lagrangian), so this article does not distinguish between them from here on. Assuming the consistency condition has been iterated until all of the constraints have been found, then $φ_{j}$ will index all of them. Note this article uses secondary constraint to mean any constraint that was not initially in the problem or derived from the definition of canonical momenta; some authors distinguish between secondary constraints, tertiary constraints, et cetera.

Finally, the last case helps fix the $u_{k}$. If, at the end of this process, the $u_{k}$ are not completely determined, then that means there are unphysical (gauge) degrees of freedom in the system. Once all of the constraints (primary and secondary) are added to the naive Hamiltonian and the solutions to the consistency conditions for the $u_{k}$ are plugged in, the result is called the total Hamiltonian.

Determination of the $u_{k}$
The uk must solve a set of inhomogeneous linear equations of the form



\{\phi_j, H\}_{PB} + \sum_k u_k\{\phi_j,\phi_k\}_{PB} \approx 0. $$

The above equation must possess at least one solution, since otherwise the initial Lagrangian is inconsistent; however, in systems with gauge degrees of freedom, the solution will not be unique. The most general solution is of the form



u_k = U_k + V_k, $$

where $U_{k}$ is a particular solution and $V_{k}$ is the most general solution to the homogeneous equation



\sum_k V_k\{\phi_j,\phi_k\}_{PB}\approx 0. $$

The most general solution will be a linear combination of linearly independent solutions to the above homogeneous equation. The number of linearly independent solutions equals the number of $u_{k}$  (which is the same as the number of constraints) minus the number of consistency conditions of the fourth type (in previous subsection). This is the number of unphysical degrees of freedom in the system. Labeling the linear independent solutions $V_{k}^{a}$ where the index $a$ runs from $1$ to the number of unphysical degrees of freedom, the general solution to the consistency conditions is of the form



u_k \approx U_k + \sum_a v_a V^a_k, $$

where the $v_{a}$ are completely arbitrary functions of time. A different choice of the  $v_{a}$ corresponds to a gauge transformation, and should leave the physical state of the system unchanged.

The total Hamiltonian
At this point, it is natural to introduce the total Hamiltonian



H_T = H + \sum_k U_k\phi_k + \sum_{a, k} v_a V^a_k \phi_k $$

and what is denoted

H' = H + \sum_k U_k \phi_k. $$

The time evolution of a function on the phase space, $f$ is governed by



\dot{f} \approx \{f, H_T\}_{PB}. $$

Later, the extended Hamiltonian is introduced. For gauge-invariant (physically measurable quantities) quantities, all of the Hamiltonians should give the same time evolution, since they are all weakly equivalent. It is only for nongauge-invariant quantities that the distinction becomes important.

The Dirac bracket
Above is everything needed to find the equations of motion in Dirac's modified Hamiltonian procedure. Having the equations of motion, however, is not the endpoint for theoretical considerations. If one wants to canonically quantize a general system, then one needs the Dirac brackets. Before defining Dirac brackets, first-class and second-class constraints need to be introduced.

We call a function $f(q, p)$ of coordinates and momenta first class if its Poisson bracket with all of the constraints weakly vanishes, that is,



\{f, \phi_j\}_{PB} \approx 0, $$

for all $j$. Note that the only quantities that weakly vanish are the constraints $φ_{j}$, and therefore anything that weakly vanishes must be strongly equal to a linear combination of the constraints. One can demonstrate that the Poisson bracket of two first-class quantities must also be first class. The first-class constraints are intimately connected with the unphysical degrees of freedom mentioned earlier. Namely, the number of independent first-class constraints is equal to the number of unphysical degrees of freedom, and furthermore, the primary first-class constraints generate gauge transformations. Dirac further postulated that all secondary first-class constraints are generators of gauge transformations, which turns out to be false; however, typically one operates under the assumption that all first-class constraints generate gauge transformations when using this treatment.

When the first-class secondary constraints are added into the Hamiltonian with arbitrary $v_{a}$ as the first-class primary constraints are added to arrive at the total Hamiltonian, then one obtains the extended Hamiltonian. The extended Hamiltonian gives the most general possible time evolution for any gauge-dependent quantities, and may actually generalize the equations of motion from those of the Lagrangian formalism.

For the purposes of introducing the Dirac bracket, of more immediate interest are the second class constraints. Second class constraints are constraints that have a nonvanishing Poisson bracket with at least one other constraint.

For instance, consider second-class constraints  $φ_{1}$ and  $φ_{2}$ whose Poisson bracket is simply a constant, $c$,



\{\phi_1,\phi_2\}_{PB} = c ~. $$

Now, suppose one wishes to employ canonical quantization, then the phase-space coordinates become operators whose commutators become $iħ$ times their classical Poisson bracket. Assuming there are no ordering issues that give rise to new quantum corrections, this implies that



[\hat{\phi}_1, \hat{\phi}_2] = i\hbar ~c, $$

where the hats emphasize the fact that the constraints are on operators.

On one hand, canonical quantization gives the above commutation relation, but on the other hand $φ$1  and $φ_{2}$ are constraints that must vanish on physical states, whereas the right-hand side cannot vanish. This example illustrates the need for some generalization of the Poisson bracket which respects the system's constraints, and which leads to a consistent quantization procedure. This new bracket should be bilinear, antisymmetric, satisfy the Jacobi identity as does the Poisson bracket, reduce to the Poisson bracket for unconstrained systems, and, additionally, the bracket of any second-class constraint with any other quantity must vanish.

At this point, the second class constraints will be labeled $$ \tilde{\phi}_a $$. Define a matrix with entries

M_{ab} = \{\tilde{\phi}_a,\tilde{\phi}_b\}_{PB}. $$

In this case, the Dirac bracket of two functions on phase space, $f$ and  $g$, is defined as

where $M^{−1}_{ab}$ denotes the $ab$ entry of $M$ 's inverse matrix. Dirac proved that $M$ will always be invertible.

It is straightforward to check that the above definition of the Dirac bracket satisfies all of the desired properties, and especially the last one, of vanishing for an argument which is a second-class constraint.

When applying canonical quantization on a constrained Hamiltonian system, the commutator of the operators is supplanted by $iħ$ times their classical Dirac bracket. Since the Dirac bracket respects the constraints, one need not be careful about evaluating all brackets before using any weak equations, as is the case with the Poisson bracket.

Note that while the Poisson bracket of bosonic (Grassmann even) variables with itself must vanish, the Poisson bracket of fermions represented as a Grassmann variables with itself need not vanish. This means that in the fermionic case it is possible for there to be an odd number of second class constraints.

Illustration on the example provided
Returning to the above example, the naive Hamiltonian and the two primary constraints are



H = V(x, y) $$

\phi_1 = p_x + \tfrac{q B}{2c} y,\qquad \phi_2 = p_y - \tfrac{q B}{2 c} x. $$

Therefore, the extended Hamiltonian can be written



H^* = V(x, y) + u_1 \left(p_x + \tfrac{q B}{2c}y\right) + u_2 \left(p_y - \tfrac{q B}{2c}x\right). $$

The next step is to apply the consistency conditions ${ Φ_{j}, H^{*} } _{PB} ≈ 0$, which in this case become



\{\phi_1, H\}_{PB}+\sum_j u_j\{\phi_1, \phi_j\}_{PB} = -\frac{\partial V}{\partial x} + u_2 \frac{q B}{c} \approx 0 $$

\{\phi_2, H\}_{PB}+\sum_j u_j\{\phi_2, \phi_j\}_{PB} = -\frac{\partial V}{\partial y} - u_1 \frac{q B}{c} \approx 0. $$

These are not secondary constraints, but conditions that fix $u_{1}$   and  $u_{2}$. Therefore, there are no secondary constraints and the arbitrary coefficients are completely determined, indicating that there are no unphysical degrees of freedom.

If one plugs in with the values of $u_{1}$   and  $u_{2}$, then one can see that the equations of motion are



\dot{x} = \{x, H\}_{PB} + u_1\{x, \phi_1\}_{PB} + u_2 \{x, \phi_2\} = -\frac{c}{q B} \frac{\partial V}{\partial y} $$

\dot{y} = \frac{c}{q B} \frac{\partial V}{\partial x} $$

\dot{p}_x = -\frac{1}{2}\frac{\partial V}{\partial x} $$

\dot{p}_y = -\frac{1}{2}\frac{\partial V}{\partial y}, $$

which are self-consistent and coincide with the Lagrangian equations of motion.

A simple calculation confirms that $φ_{1}$ and  $φ_{2}$ are second class constraints since



\{\phi_1, \phi_2\}_{PB} = - \{\phi_2, \phi_1\}_{PB} = \frac{q B}{c}, $$

hence the matrix looks like



M = \frac{q B}{c} \left(\begin{matrix} 0 & 1\\ -1 & 0 \end{matrix}\right), $$

which is easily inverted to



M^{-1} = \frac{c}{q B} \left(\begin{matrix} 0 & -1\\ 1 & 0 \end{matrix}\right) \quad\Rightarrow\quad M^{-1}_{ab} = -\frac{c}{q B_0} \varepsilon_{ab}, $$

where $ε_{ab}$ is the Levi-Civita symbol. Thus, the Dirac brackets are defined to be



\{f, g\}_{DB} = \{f, g\}_{PB} + \frac{c\varepsilon_{ab}}{q B} \{f, \phi_a\}_{PB}\{\phi_b, g\}_{PB}. $$

If one always uses the Dirac bracket instead of the Poisson bracket, then there is no issue about the order of applying constraints and evaluating expressions, since the Dirac bracket of anything weakly zero is strongly equal to zero. This means that one can just use the naive Hamiltonian with Dirac brackets, instead, to thus get the correct equations of motion, which one can easily confirm on the above ones.

To quantize the system, the Dirac brackets between all of the phase space variables are needed. The nonvanishing Dirac brackets for this system are



\{x, y\}_{DB} = -\frac{c}{q B} $$

\{x, p_x\}_{DB} = \{y, p_y\}_{DB} = \tfrac{1}{2} $$

while the cross-terms vanish, and



\{p_x, p_y\}_{DB} = - \frac{q B}{4c}. $$

Therefore, the correct implementation of canonical quantization dictates the commutation relations,



[\hat{x}, \hat{y}] = -i\frac{\hbar c}{q B} $$

[\hat{x}, \hat{p}_x] = [\hat{y}, \hat{p}_y] = i\frac{\hbar}{2} $$

with the cross terms vanishing, and



[\hat{p}_x, \hat{p}_y] = -i\frac{\hbar q B}{4c}~. $$

This example has a nonvanishing commutator between $&and; x$ and  $&and; y$, which means this structure specifies a noncommutative geometry. (Since the two coordinates do not commute, there will be an uncertainty principle for the $x$ and $y$ positions.)

Further Illustration for a hypersphere
Similarly, for free motion on a hypersphere $S^{n}$, the $n + 1$ coordinates are constrained, $x_{i} x^{i} = 1$. From a plain kinetic Lagrangian, it is evident that their momenta are perpendicular to them, $x_{i} p^{i} = 0$. Thus the corresponding Dirac Brackets are likewise simple to work out,



\{x_i, x_j\}_{DB} = 0, $$

\{x_i, p_j\}_{DB} = \delta_{ij} -x_i x_j ,$$

\{p_i, p_j\}_{DB} = x_j p_i - x_i p_j ~. $$

The ($2n + 1)$ constrained phase-space variables $(x_{i}, p_{i})$ obey much simpler Dirac brackets than the $2n$ unconstrained variables, had one eliminated one of the $x$s and one of the $p$s through the two constraints ab initio, which would obey plain Poisson brackets. The Dirac brackets add simplicity and elegance, at the cost of excessive (constrained) phase-space variables.

For example, for free motion on a circle, $n = 1$, for $x_{1} ≡ z$ and eliminating $x_{2}$ from the circle constraint yields the unconstrained


 * $$L=\frac{1}{2} \frac {{\dot z}^2}{1-z^2} ~,$$

with equations of motion


 * $${\ddot z} =-z \frac {{\dot z}^2}{1-z^2} =-z 2E ~,$$

an oscillation; whereas the equivalent constrained system with $H = p^{2}/2 = E$ yields


 * $${\dot x}^i =\{x^i,H\}_{DB} = p^i~, $$
 * $${\dot p}^i  =\{p^i,H\}_{DB} = - x^i ~  p^2~, $$

whence, instantly, virtually by inspection, oscillation for both variables,


 * $${\ddot x}^i = - x^i 2E ~. $$