Relativistic quantum mechanics

In physics, relativistic quantum mechanics (RQM) is any Poincaré covariant formulation of quantum mechanics (QM). This theory is applicable to massive particles propagating at all velocities up to those comparable to the speed of light c, and can accommodate massless particles. The theory has application in high energy physics, particle physics and accelerator physics, as well as atomic physics, chemistry and condensed matter physics. Non-relativistic quantum mechanics refers to the mathematical formulation of quantum mechanics applied in the context of Galilean relativity, more specifically quantizing the equations of classical mechanics by replacing dynamical variables by operators. Relativistic quantum mechanics (RQM) is quantum mechanics applied with special relativity. Although the earlier formulations, like the Schrödinger picture and Heisenberg picture were originally formulated in a non-relativistic background, a few of them (e.g. the Dirac or path-integral formalism) also work with special relativity.

Key features common to all RQMs include: the prediction of antimatter, spin magnetic moments of elementary spin $1/2$ fermions, fine structure, and quantum dynamics of charged particles in electromagnetic fields. The key result is the Dirac equation, from which these predictions emerge automatically. By contrast, in non-relativistic quantum mechanics, terms have to be introduced artificially into the Hamiltonian operator to achieve agreement with experimental observations.

The most successful (and most widely used) RQM is relativistic quantum field theory (QFT), in which elementary particles are interpreted as field quanta. A unique consequence of QFT that has been tested against other RQMs is the failure of conservation of particle number, for example in matter creation and annihilation.

Paul Dirac's work between 1927 to 1933 shaped the synthesis of special relativity and quantum mechanics. His work was instrumental, as he formulated the Dirac equation and also originated quantum electrodynamics, both of which were successful in combining the two theories.

In this article, the equations are written in familiar 3D vector calculus notation and use hats for operators (not necessarily in the literature), and where space and time components can be collected, tensor index notation is shown also (frequently used in the literature), in addition the Einstein summation convention is used. SI units are used here; Gaussian units and natural units are common alternatives. All equations are in the position representation; for the momentum representation the equations have to be Fourier transformed – see position and momentum space.

Combining special relativity and quantum mechanics
One approach is to modify the Schrödinger picture to be consistent with special relativity.

A postulate of quantum mechanics is that the time evolution of any quantum system is given by the Schrödinger equation:


 * $$i\hbar \frac{\partial}{\partial t}\psi =\hat{H}\psi$$

using a suitable Hamiltonian operator $Ĥ$ corresponding to the system. The solution is a complex-valued wavefunction $ψ(r, t)$, a function of the 3D position vector $r$ of the particle at time $t$, describing the behavior of the system.

Every particle has a non-negative spin quantum number $s$. The number $2s$ is an integer, odd for fermions and even for bosons. Each $s$ has $2s + 1$ z-projection quantum numbers; $σ = s, s &minus; 1, ..., &minus;s + 1, &minus;s$. This is an additional discrete variable the wavefunction requires; $m_{s}$.

Historically, in the early 1920s Pauli, Kronig, Uhlenbeck and Goudsmit were the first to propose the concept of spin. The inclusion of spin in the wavefunction incorporates the Pauli exclusion principle (1925) and the more general spin–statistics theorem (1939) due to Fierz, rederived by Pauli a year later. This is the explanation for a diverse range of subatomic particle behavior and phenomena: from the electronic configurations of atoms, nuclei (and therefore all elements on the periodic table and their chemistry), to the quark configurations and colour charge (hence the properties of baryons and mesons).

A fundamental prediction of special relativity is the relativistic energy–momentum relation; for a particle of rest mass $s_{z}$, and in a particular frame of reference with energy $σ$ and 3-momentum $ψ(r, t, σ)$ with magnitude in terms of the dot product $$p = \sqrt{\mathbf{p} \cdot \mathbf{p}}$$, it is:


 * $$E^2 = c^2\mathbf{p}\cdot\mathbf{p} + (mc^2)^2\,.$$

These equations are used together with the energy and momentum operators, which are respectively:


 * $$\hat{E}=i\hbar\frac{\partial}{\partial t}\,,\quad \hat{\mathbf{p}} = -i\hbar\nabla\,,$$

to construct a relativistic wave equation (RWE): a partial differential equation consistent with the energy–momentum relation, and is solved for $m$ to predict the quantum dynamics of the particle. For space and time to be placed on equal footing, as in relativity, the orders of space and time partial derivatives should be equal, and ideally as low as possible, so that no initial values of the derivatives need to be specified. This is important for probability interpretations, exemplified below. The lowest possible order of any differential equation is the first (zeroth order derivatives would not form a differential equation).

The Heisenberg picture is another formulation of QM, in which case the wavefunction $E$ is time-independent, and the operators $p$ contain the time dependence, governed by the equation of motion:


 * $$\frac{d}{dt}A = \frac{1}{i\hbar}[A,\hat{H}]+\frac{\partial}{\partial t}A\,,$$

This equation is also true in RQM, provided the Heisenberg operators are modified to be consistent with SR.

Historically, around 1926, Schrödinger and Heisenberg show that wave mechanics and matrix mechanics are equivalent, later furthered by Dirac using transformation theory.

A more modern approach to RWEs, first introduced during the time RWEs were developing for particles of any spin, is to apply representations of the Lorentz group.

Space and time
In classical mechanics and non-relativistic QM, time is an absolute quantity all observers and particles can always agree on, "ticking away" in the background independent of space. Thus in non-relativistic QM one has for a many particle system $ψ$.

In relativistic mechanics, the spatial coordinates and coordinate time are not absolute; any two observers moving relative to each other can measure different locations and times of events. The position and time coordinates combine naturally into a four-dimensional spacetime position $ψ$ corresponding to events, and the energy and 3-momentum combine naturally into the four-momentum $A(t)$ of a dynamic particle, as measured in some reference frame, change according to a Lorentz transformation as one measures in a different frame boosted and/or rotated relative the original frame in consideration. The derivative operators, and hence the energy and 3-momentum operators, are also non-invariant and change under Lorentz transformations.

Under a proper orthochronous Lorentz transformation $ψ(r_{1}, r_{2}, r_{3}, ..., t, σ_{1}, σ_{2}, σ_{3}...)$ in Minkowski space, all one-particle quantum states $X = (ct, r)$ locally transform under some representation $P = (E/c, p)$ of the Lorentz group:


 * $$\psi_\sigma(\mathbf{r}, t) \rightarrow D(\Lambda) \psi_\sigma(\Lambda^{-1}(\mathbf{r}, t)) $$

where $(r, t) → Λ(r, t)$ is a finite-dimensional representation, in other words a $ψ_{σ}$ square matrix. Again, $D$ is thought of as a column vector containing components with the $D(Λ)$ allowed values of $(2s + 1)×(2s + 1)$. The quantum numbers $ψ$ and $(2s + 1)$ as well as other labels, continuous or discrete, representing other quantum numbers are suppressed. One value of $σ$ may occur more than once depending on the representation.

Non-relativistic and relativistic Hamiltonians
The classical Hamiltonian for a particle in a potential is the kinetic energy $s$ plus the potential energy $σ$, with the corresponding quantum operator in the Schrödinger picture:


 * $$\hat{H} = \frac{\hat{\mathbf{p}}\cdot\hat{\mathbf{p}}}{2m} + V(\mathbf{r},t) $$

and substituting this into the above Schrödinger equation gives a non-relativistic QM equation for the wavefunction: the procedure is a straightforward substitution of a simple expression. By contrast this is not as easy in RQM; the energy–momentum equation is quadratic in energy and momentum leading to difficulties. Naively setting:


 * $$\hat{H} = \hat{E} = \sqrt{c^2 \hat{\mathbf{p}}\cdot\hat{\mathbf{p}} + (mc^2)^2} \quad \Rightarrow \quad i\hbar\frac{\partial}{\partial t}\psi = \sqrt{c^2 \hat{\mathbf{p}}\cdot \hat{\mathbf{p}} + (mc^2)^2} \, \psi$$

is not helpful for several reasons. The square root of the operators cannot be used as it stands; it would have to be expanded in a power series before the momentum operator, raised to a power in each term, could act on $σ$. As a result of the power series, the space and time derivatives are completely asymmetric: infinite-order in space derivatives but only first order in the time derivative, which is inelegant and unwieldy. Again, there is the problem of the non-invariance of the energy operator, equated to the square root which is also not invariant. Another problem, less obvious and more severe, is that it can be shown to be nonlocal and can even violate causality: if the particle is initially localized at a point $p·p/2m$ so that $V(r, t)$ is finite and zero elsewhere, then at any later time the equation predicts delocalization $ψ$ everywhere, even for $r_{0}$ which means the particle could arrive at a point before a pulse of light could. This would have to be remedied by the additional constraint $ψ(r_{0}, t = 0)$.

There is also the problem of incorporating spin in the Hamiltonian, which isn't a prediction of the non-relativistic Schrödinger theory. Particles with spin have a corresponding spin magnetic moment quantized in units of $ψ(r, t) ≠ 0$, the Bohr magneton:


 * $$\hat{\boldsymbol{\mu}}_S = - \frac{g\mu_B}{\hbar}\hat{\mathbf{S}}\,,\quad \left|\boldsymbol{\mu}_S\right| = - g\mu_B \sigma\,,$$

where $|r| > ct$ is the (spin) g-factor for the particle, and $ψ(

|r| > ct, t) = 0$ the spin operator, so they interact with electromagnetic fields. For a particle in an externally applied magnetic field $μ_{B}$, the interaction term


 * $$\hat{H}_B = - \mathbf{B} \cdot \hat{\boldsymbol{\mu}}_S $$

has to be added to the above non-relativistic Hamiltonian. On the contrary; a relativistic Hamiltonian introduces spin automatically as a requirement of enforcing the relativistic energy-momentum relation.

Relativistic Hamiltonians are analogous to those of non-relativistic QM in the following respect; there are terms including rest mass and interaction terms with externally applied fields, similar to the classical potential energy term, as well as momentum terms like the classical kinetic energy term. A key difference is that relativistic Hamiltonians contain spin operators in the form of matrices, in which the matrix multiplication runs over the spin index $g$, so in general a relativistic Hamiltonian:


 * $$\hat{H} = \hat{H}(\mathbf{r}, t, \hat{\mathbf{p}}, \hat{\mathbf{S}})$$

is a function of space, time, and the momentum and spin operators.

The Klein–Gordon and Dirac equations for free particles
Substituting the energy and momentum operators directly into the energy–momentum relation may at first sight seem appealing, to obtain the Klein–Gordon equation:


 * $$\hat{E}^2 \psi = c^2\hat{\mathbf{p}}\cdot\hat{\mathbf{p}}\psi + (mc^2)^2\psi \,,$$

and was discovered by many people because of the straightforward way of obtaining it, notably by Schrödinger in 1925 before he found the non-relativistic equation named after him, and by Klein and Gordon in 1927, who included electromagnetic interactions in the equation. This is relativistically invariant, yet this equation alone isn't a sufficient foundation for RQM for a at least two reasons: one is that negative-energy states are solutions, another is the density (given below), and this equation as it stands is only applicable to spinless particles. This equation can be factored into the form:



\left(\hat{E} - c\boldsymbol{\alpha}\cdot\hat{\mathbf{p}} - \beta mc^2 \right)\left(\hat{E} + c\boldsymbol{\alpha}\cdot\hat{\mathbf{p}} + \beta mc^2 \right)\psi=0 \,, $$

where $S$ and $B$ are not simply numbers or vectors, but 4 × 4 Hermitian matrices that are required to anticommute for $σ$:


 * $$\alpha_i \beta = - \beta \alpha_i, \quad \alpha_i\alpha_j = - \alpha_j\alpha_i \,,$$

and square to the identity matrix:


 * $$ \alpha_i^2 = \beta^2 = I \,, $$

so that terms with mixed second-order derivatives cancel while the second-order derivatives purely in space and time remain. The first factor:


 * $$\left(\hat{E} - c\boldsymbol{\alpha}\cdot\hat{\mathbf{p}} - \beta mc^2 \right)\psi=0 \quad \Leftrightarrow \quad \hat{H} = c\boldsymbol{\alpha}\cdot\hat{\mathbf{p}} + \beta mc^2$$

is the Dirac equation. The other factor is also the Dirac equation, but for a particle of negative mass. Each factor is relativistically invariant. The reasoning can be done the other way round: propose the Hamiltonian in the above form, as Dirac did in 1928, then pre-multiply the equation by the other factor of operators $α = (α_{1}, α_{2}, α_{3})$, and comparison with the KG equation determines the constraints on $β$ and $i ≠ j$. The positive mass equation can continue to be used without loss of continuity. The matrices multiplying $E + cα · p + βmc^{2}$ suggest it isn't a scalar wavefunction as permitted in the KG equation, but must instead be a four-component entity. The Dirac equation still predicts negative energy solutions, so Dirac postulated that negative energy states are always occupied, because according to the Pauli principle, electronic transitions from positive to negative energy levels in atoms would be forbidden. See Dirac sea for details.

Densities and currents
In non-relativistic quantum mechanics, the square modulus of the wavefunction $α$ gives the probability density function $β$. This is the Copenhagen interpretation, circa 1927. In RQM, while $ψ$ is a wavefunction, the probability interpretation is not the same as in non-relativistic QM. Some RWEs do not predict a probability density $ψ$ or probability current $ρ = |ψ|^{2}$ (really meaning probability current density) because they are not positive-definite functions of space and time. The Dirac equation does:


 * $$\rho=\psi^\dagger \psi, \quad \mathbf{j} = \psi^\dagger \gamma^0 \boldsymbol{\gamma} \psi \quad \rightleftharpoons \quad J^\mu = \psi^\dagger \gamma^0 \gamma^\mu \psi $$

where the dagger denotes the Hermitian adjoint (authors usually write $ψ(r, t)$ for the Dirac adjoint) and $ρ$ is the probability four-current, while the Klein–Gordon equation does not:


 * $$\rho = \frac{i\hbar}{2mc^2}\left(\psi^{*}\frac{\partial \psi}{\partial t} - \psi \frac{\partial \psi^*}{\partial t}\right)\, ,\quad \mathbf{j} = -\frac{i\hbar}{2m}\left(\psi^* \nabla \psi - \psi \nabla \psi^*\right) \quad \rightleftharpoons  \quad J^\mu = \frac{i\hbar}{2m}(\psi^*\partial^\mu\psi - \psi\partial^\mu\psi^*) $$

where $j$ is the four-gradient. Since the initial values of both $\overline{ψ} = ψ^{†}γ^{0}$ and $J^{μ}$ may be freely chosen, the density can be negative.

Instead, what appears look at first sight a "probability density" and "probability current" has to be reinterpreted as charge density and current density when multiplied by electric charge. Then, the wavefunction $∂^{μ}$ is not a wavefunction at all, but reinterpreted as a field. The density and current of electric charge always satisfy a continuity equation:


 * $$\frac{\partial \rho}{\partial t} + \nabla\cdot\mathbf{J} = 0 \quad \rightleftharpoons \quad \partial_\mu J^\mu = 0 \,, $$

as charge is a conserved quantity. Probability density and current also satisfy a continuity equation because probability is conserved, however this is only possible in the absence of interactions.

Spin and electromagnetically interacting particles
Including interactions in RWEs is generally difficult. Minimal coupling is a simple way to include the electromagnetic interaction. For one charged particle of electric charge $ψ$ in an electromagnetic field, given by the magnetic vector potential $∂ψ/∂t$ defined by the magnetic field $ψ$, and electric scalar potential $q$, this is:


 * $$\hat{E} \rightarrow \hat{E} - q\phi \,, \quad \hat{\mathbf{p}}\rightarrow \hat{\mathbf{p}} - q \mathbf{A} \quad \rightleftharpoons \quad \hat{P}_\mu \rightarrow \hat{P}_\mu -q A_\mu$$

where $A(r, t)$ is the four-momentum that has a corresponding 4-momentum operator, and $B = &nabla; &times; A$ the four-potential. In the following, the non-relativistic limit refers to the limiting cases:


 * $$E - e\phi \approx mc^2\,,\quad \mathbf{p} \approx m \mathbf{v}\,,$$

that is, the total energy of the particle is approximately the rest energy for small electric potentials, and the momentum is approximately the classical momentum.

Spin 0
In RQM, the KG equation admits the minimal coupling prescription;


 * $${(\hat{E} - q\phi)}^2 \psi = c^2{(\hat{\mathbf{p}} - q \mathbf{A})}^2\psi + (mc^2)^2\psi \quad \rightleftharpoons \quad \left[{(\hat{P}_\mu - q A_\mu)}{(\hat{P}^\mu - q A^\mu)} - {(mc)}^2 \right] \psi = 0.$$

In the case where the charge is zero, the equation reduces trivially to the free KG equation so nonzero charge is assumed below. This is a scalar equation that is invariant under the irreducible one-dimensional scalar $ϕ(r, t)$ representation of the Lorentz group. This means that all of its solutions will belong to a direct sum of $P_{μ}$ representations. Solutions that do not belong to the irreducible $A_{μ}$ representation will have two or more independent components. Such solutions cannot in general describe particles with nonzero spin since spin components are not independent. Other constraint will have to be imposed for that, e.g. the Dirac equation for spin $1⁄2$, see below. Thus if a system satisfies the KG equation only, it can only be interpreted as a system with zero spin.

The electromagnetic field is treated classically according to Maxwell's equations and the particle is described by a wavefunction, the solution to the KG equation. The equation is, as it stands, not always very useful, because massive spinless particles, such as the π-mesons, experience the much stronger strong interaction in addition to the electromagnetic interaction. It does, however, correctly describe charged spinless bosons in the absence of other interactions.

The KG equation is applicable to spinless charged bosons in an external electromagnetic potential. As such, the equation cannot be applied to the description of atoms, since the electron is a spin $1⁄2$ particle. In the non-relativistic limit the equation reduces to the Schrödinger equation for a spinless charged particle in an electromagnetic field:


 * $$\left ( i\hbar \frac{\partial}{\partial t}- q\phi\right) \psi = \frac{1}{2m}{(\hat{\mathbf{p}} - q \mathbf{A})}^2 \psi \quad \Leftrightarrow \quad \hat{H} = \frac{1}{2m}{(\hat{\mathbf{p}} - q \mathbf{A})}^2 + q\phi.$$

Spin $1⁄2$
Non relativistically, spin was phenomenologically introduced in the Pauli equation by Pauli in 1927 for particles in an electromagnetic field:


 * $$\left(i \hbar \frac{\partial}{\partial t} - q \phi \right) \psi = \left[ \frac{1}{2m}{(\boldsymbol{\sigma}\cdot(\mathbf{p} - q \mathbf{A}))}^2 \right] \psi \quad \Leftrightarrow \quad \hat{H} = \frac{1}{2m}{(\boldsymbol{\sigma}\cdot(\mathbf{p} - q \mathbf{A}))}^2 + q \phi $$

by means of the 2&thinsp;&times;&thinsp;2 Pauli matrices, and $(0,0)$ is not just a scalar wavefunction as in the non-relativistic Schrödinger equation, but a two-component spinor field:


 * $$\psi=\begin{pmatrix}\psi_{\uparrow} \\ \psi_{\downarrow} \end{pmatrix}$$

where the subscripts ↑ and ↓ refer to the "spin up" ($(0,0)$) and "spin down" ($(0,0)$) states.

In RQM, the Dirac equation can also incorporate minimal coupling, rewritten from above;


 * $$\left(i \hbar \frac{\partial}{\partial t} -q\phi \right)\psi = \gamma^0 \left[ c\boldsymbol{\gamma}\cdot{(\hat{\mathbf{p}} - q\mathbf{A})} - mc^2 \right] \psi \quad \rightleftharpoons \quad \left[\gamma^\mu (\hat{P}_\mu - q A_\mu) - mc^2 \right]\psi = 0$$

and was the first equation to accurately predict spin, a consequence of the 4 &times; 4 gamma matrices $&psi;$. There is a 4 × 4 identity matrix pre-multiplying the energy operator (including the potential energy term), conventionally not written for simplicity and clarity (i.e. treated like the number 1). Here $σ = +1⁄2$ is a four-component spinor field, which is conventionally split into two two-component spinors in the form:


 * $$\psi=\begin{pmatrix}\psi_{+} \\ \psi_{-} \end{pmatrix} = \begin{pmatrix}\psi_{+\uparrow} \\ \psi_{+\downarrow} \\ \psi_{-\uparrow} \\ \psi_{-\downarrow} \end{pmatrix} $$

The 2-spinor $σ = &minus;1⁄2$ corresponds to a particle with 4-momentum $γ^{0} = β, γ = (γ_{1}, γ_{2}, γ_{3}) = βα = (βα_{1}, βα_{2}, βα_{3})$ and charge $&psi;$ and two spin states ($ψ_{+}$, as before). The other 2-spinor $(E, p)$ corresponds to a similar particle with the same mass and spin states, but negative 4-momentum $q$ and negative charge $σ = ±1⁄2$, that is, negative energy states, time-reversed momentum, and negated charge. This was the first interpretation and prediction of a particle and corresponding antiparticle. See Dirac spinor and bispinor for further description of these spinors. In the non-relativistic limit the Dirac equation reduces to the Pauli equation (see Dirac equation for how). When applied a one-electron atom or ion, setting $ψ_{−}$ and $−(E, p)$ to the appropriate electrostatic potential, additional relativistic terms include the spin–orbit interaction, electron gyromagnetic ratio, and Darwin term. In ordinary QM these terms have to be put in by hand and treated using perturbation theory. The positive energies do account accurately for the fine structure.

Within RQM, for massless particles the Dirac equation reduces to:


 * $$ \left(\frac{\hat{E}}{c} + \boldsymbol{\sigma}\cdot \hat{\mathbf{p}} \right) \psi_{+} = 0 \,,\quad \left(\frac{\hat{E}}{c} - \boldsymbol{\sigma}\cdot \hat{\mathbf{p}} \right) \psi_{-} = 0 \quad \rightleftharpoons \quad \sigma^\mu \hat{P}_\mu \psi_{+} = 0\,,\quad \sigma_\mu \hat{P}^\mu \psi_{-} = 0\,,$$

the first of which is the Weyl equation, a considerable simplification applicable for massless neutrinos. This time there is a 2 × 2 identity matrix pre-multiplying the energy operator conventionally not written. In RQM it is useful to take this as the zeroth Pauli matrix $−q$ which couples to the energy operator (time derivative), just as the other three matrices couple to the momentum operator (spatial derivatives).

The Pauli and gamma matrices were introduced here, in theoretical physics, rather than pure mathematics itself. They have applications to quaternions and to the SO(2) and SO(3) Lie groups, because they satisfy the important commutator [, ] and anticommutator [ , ]+ relations respectively:


 * $$\left[\sigma_a, \sigma_b \right] = 2i \varepsilon_{abc} \sigma_c \,, \quad \left[\sigma_a, \sigma_b \right]_{+} = 2\delta_{ab}\sigma_0$$

where $A = 0$ is the three-dimensional Levi-Civita symbol. The gamma matrices form bases in Clifford algebra, and have a connection to the components of the flat spacetime Minkowski metric $ϕ$ in the anticommutation relation:


 * $$\left[\gamma^\alpha,\gamma^\beta\right]_{+} = \gamma^\alpha\gamma^\beta + \gamma^\beta\gamma^\alpha = 2\eta^{\alpha\beta}\,,$$

(This can be extended to curved spacetime by introducing vierbeins, but is not the subject of special relativity).

In 1929, the Breit equation was found to describe two or more electromagnetically interacting massive spin $1⁄2$ fermions to first-order relativistic corrections; one of the first attempts to describe such a relativistic quantum many-particle system. This is, however, still only an approximation, and the Hamiltonian includes numerous long and complicated sums.

Helicity and chirality
The helicity operator is defined by;


 * $$\hat{h} = \hat{\mathbf{S}}\cdot \frac{\hat{\mathbf{p}}}{|\mathbf{p}|} = \hat{\mathbf{S}} \cdot \frac{c\hat{\mathbf{p}}}{\sqrt{E^2 - (m_0c^2)^2}}$$

where p is the momentum operator, S the spin operator for a particle of spin s, E is the total energy of the particle, and m0 its rest mass. Helicity indicates the orientations of the spin and translational momentum vectors. Helicity is frame-dependent because of the 3-momentum in the definition, and is quantized due to spin quantization, which has discrete positive values for parallel alignment, and negative values for antiparallel alignment.

An automatic occurrence in the Dirac equation (and the Weyl equation) is the projection of the spin $1⁄2$ operator on the 3-momentum (times c), $σ_{0}$, which is the helicity (for the spin $1⁄2$ case) times $$\sqrt{E^2 - (m_0c^2)^2}$$.

For massless particles the helicity simplifies to:


 * $$\hat{h} = \hat{\mathbf{S}} \cdot \frac{c\hat{\mathbf{p}}}{E} $$

Higher spins
The Dirac equation can only describe particles of spin $1⁄2$. Beyond the Dirac equation, RWEs have been applied to free particles of various spins. In 1936, Dirac extended his equation to all fermions, three years later Fierz and Pauli rederived the same equation. The Bargmann–Wigner equations were found in 1948 using Lorentz group theory, applicable for all free particles with any spin. Considering the factorization of the KG equation above, and more rigorously by Lorentz group theory, it becomes apparent to introduce spin in the form of matrices.

The wavefunctions are multicomponent spinor fields, which can be represented as column vectors of functions of space and time:


 * $$\psi(\mathbf{r},t) = \begin{bmatrix} \psi_{\sigma=s}(\mathbf{r},t) \\ \psi_{\sigma=s - 1}(\mathbf{r},t) \\ \vdots \\ \psi_{\sigma=-s + 1}(\mathbf{r},t) \\ \psi_{\sigma=-s}(\mathbf{r},t) \end{bmatrix}\quad\rightleftharpoons\quad {\psi(\mathbf{r},t)}^\dagger = \begin{bmatrix} {\psi_{\sigma=s}(\mathbf{r},t)}^\star & {\psi_{\sigma=s - 1}(\mathbf{r},t)}^\star & \cdots & {\psi_{\sigma=-s + 1}(\mathbf{r},t)}^\star & {\psi_{\sigma=-s}(\mathbf{r},t)}^\star \end{bmatrix}$$

where the expression on the right is the Hermitian conjugate. For a massive particle of spin $ε_{abc}$, there are $η^{αβ}$ components for the particle, and another $σ · c p$ for the corresponding antiparticle (there are $s$ possible $2s + 1$ values in each case), altogether forming a $2s + 1$-component spinor field:


 * $$\psi(\mathbf{r},t) = \begin{bmatrix} \psi_{+,\,\sigma=s}(\mathbf{r},t) \\ \psi_{+,\,\sigma=s - 1}(\mathbf{r},t) \\ \vdots \\ \psi_{+,\,\sigma=-s + 1}(\mathbf{r},t) \\ \psi_{+,\,\sigma=-s}(\mathbf{r},t) \\ \psi_{-,\,\sigma=s}(\mathbf{r},t) \\ \psi_{-,\,\sigma=s - 1}(\mathbf{r},t) \\ \vdots \\ \psi_{-,\,\sigma=-s + 1}(\mathbf{r},t) \\ \psi_{-,\,\sigma=-s}(\mathbf{r},t) \end{bmatrix}\quad\rightleftharpoons\quad {\psi(\mathbf{r},t)}^\dagger\begin{bmatrix} {\psi_{+,\,\sigma=s}(\mathbf{r},t)}^\star & {\psi_{+,\,\sigma=s - 1}(\mathbf{r},t)}^\star & \cdots & {\psi_{-,\,\sigma=-s}(\mathbf{r},t)}^\star \end{bmatrix} $$

with the + subscript indicating the particle and − subscript for the antiparticle. However, for massless particles of spin s, there are only ever two-component spinor fields; one is for the particle in one helicity state corresponding to +s and the other for the antiparticle in the opposite helicity state corresponding to &minus;s:


 * $$\psi(\mathbf{r},t) = \begin{pmatrix} \psi_{+}(\mathbf{r},t) \\ \psi_{-}(\mathbf{r},t) \end{pmatrix}$$

According to the relativistic energy-momentum relation, all massless particles travel at the speed of light, so particles traveling at the speed of light are also described by two-component spinors. Historically, Élie Cartan found the most general form of spinors in 1913, prior to the spinors revealed in the RWEs following the year 1927.

For equations describing higher-spin particles, the inclusion of interactions is nowhere near as simple minimal coupling, they lead to incorrect predictions and self-inconsistencies. For spin greater than $1⁄2$, the RWE is not fixed by the particle's mass, spin, and electric charge; the electromagnetic moments (electric dipole moments and magnetic dipole moments) allowed by the spin quantum number are arbitrary. (Theoretically, magnetic charge would contribute also). For example, the spin $ħ⁄2$ case only allows a magnetic dipole, but for spin 1 particles magnetic quadrupoles and electric dipoles are also possible. For more on this topic, see multipole expansion and (for example) Cédric Lorcé (2009).

Velocity operator
The Schrödinger/Pauli velocity operator can be defined for a massive particle using the classical definition $2s + 1$, and substituting quantum operators in the usual way:


 * $$\hat{\mathbf{v}} = \frac{1}{m}\hat{\mathbf{p}}$$

which has eigenvalues that take any value. In RQM, the Dirac theory, it is:


 * $$\hat{\mathbf{v}} = \frac{i}{\hbar}\left[\hat{H},\hat{\mathbf{r}}\right]$$

which must have eigenvalues between ±c. See Foldy–Wouthuysen transformation for more theoretical background.

Relativistic quantum Lagrangians
The Hamiltonian operators in the Schrödinger picture are one approach to forming the differential equations for $σ$. An equivalent alternative is to determine a Lagrangian (really meaning Lagrangian density), then generate the differential equation by the field-theoretic Euler–Lagrange equation:


 * $$ \partial_\mu \left( \frac{\partial \mathcal{L}}{\partial ( \partial_\mu \psi )} \right) - \frac{\partial \mathcal{L}}{\partial \psi} = 0 \,$$

For some RWEs, a Lagrangian can be found by inspection. For example, the Dirac Lagrangian is:


 * $$\mathcal{L} = \overline{\psi}(\gamma^\mu P_\mu - mc)\psi$$

and Klein–Gordon Lagrangian is:


 * $$\mathcal{L} = - \frac{\hbar^2}{m} \eta^{\mu \nu} \partial_{\mu}\psi^{*} \partial_{\nu}\psi - m c^2 \psi^{*} \psi\,.$$

This is not possible for all RWEs; and is one reason the Lorentz group theoretic approach is important and appealing: fundamental invariance and symmetries in space and time can be used to derive RWEs using appropriate group representations. The Lagrangian approach with field interpretation of $2(2s + 1)$ is the subject of QFT rather than RQM: Feynman's path integral formulation uses invariant Lagrangians rather than Hamiltonian operators, since the latter can become extremely complicated, see (for example) Weinberg (1995).

Relativistic quantum angular momentum
In non-relativistic QM, the angular momentum operator is formed from the classical pseudovector definition $p = m v$. In RQM, the position and momentum operators are inserted directly where they appear in the orbital relativistic angular momentum tensor defined from the four-dimensional position and momentum of the particle, equivalently a bivector in the exterior algebra formalism:


 * $$M^{\alpha\beta} = X^\alpha P^\beta - X^\beta P^\alpha = 2 X^{[\alpha} P^{\beta]} \quad \rightleftharpoons \quad \mathbf{M} = \mathbf{X}\wedge\mathbf{P}\,,$$

which are six components altogether: three are the non-relativistic 3-orbital angular momenta; $ψ$, $ψ$, $L = r × p$, and the other three $M^{12} = L^{3}$, $M^{23} = L^{1}$, $M^{31} = L^{2}$ are boosts of the centre of mass of the rotating object. An additional relativistic-quantum term has to be added for particles with spin. For a particle of rest mass $M^{01}$, the total angular momentum tensor is:


 * $$J^{\alpha\beta} = 2X^{[\alpha} P^{\beta]} + \frac{1}{m^2}\varepsilon^{\alpha \beta \gamma \delta} W_\gamma p_\delta \quad \rightleftharpoons \quad \mathbf{J} = \mathbf{X}\wedge\mathbf{P} + \frac{1}{m^2}\star(\mathbf{W}\wedge\mathbf{P})$$

where the star denotes the Hodge dual, and


 * $$W_\alpha =\frac{1}{2}\varepsilon_{\alpha \beta \gamma \delta}M^{\beta \gamma}p^\delta \quad \rightleftharpoons \quad \mathbf{W} = \star(\mathbf{M}\wedge\mathbf{P})$$

is the Pauli–Lubanski pseudovector. For more on relativistic spin, see (for example) Troshin & Tyurin (1994).

Thomas precession and spin–orbit interactions
In 1926, the Thomas precession is discovered: relativistic corrections to the spin of elementary particles with application in the spin–orbit interaction of atoms and rotation of macroscopic objects. In 1939 Wigner derived the Thomas precession.

In classical electromagnetism and special relativity, an electron moving with a velocity $M^{02}$ through an electric field $M^{03}$ but not a magnetic field $m$, will in its own frame of reference experience a Lorentz-transformed magnetic field $v$:


 * $$\mathbf{B}' = \frac{\mathbf{E} \times \mathbf{v}}{c^2\sqrt{1- \left(v/c\right)^2}} \,.$$

In the non-relativistic limit $E$:


 * $$\mathbf{B}' = \frac{\mathbf{E} \times \mathbf{v}}{c^2} \,,$$

so the non-relativistic spin interaction Hamiltonian becomes:


 * $$\hat{H} = - \mathbf{B}'\cdot \hat{\boldsymbol{\mu}}_S = -\left(\mathbf{B} + \frac{\mathbf{E} \times \mathbf{v}}{c^2} \right) \cdot \hat{\boldsymbol{\mu}}_S \,, $$

where the first term is already the non-relativistic magnetic moment interaction, and the second term the relativistic correction of order $B$, but this disagrees with experimental atomic spectra by a factor of $1⁄2$. It was pointed out by L. Thomas that there is a second relativistic effect: An electric field component perpendicular to the electron velocity causes an additional acceleration of the electron perpendicular to its instantaneous velocity, so the electron moves in a curved path. The electron moves in a rotating frame of reference, and this additional precession of the electron is called the Thomas precession. It can be shown that the net result of this effect is that the spin–orbit interaction is reduced by half, as if the magnetic field experienced by the electron has only one-half the value, and the relativistic correction in the Hamiltonian is:


 * $$\hat{H} = - \mathbf{B}'\cdot \hat{\boldsymbol{\mu}}_S = -\left(\mathbf{B} + \frac{\mathbf{E} \times \mathbf{v}}{2c^2} \right) \cdot \hat{\boldsymbol{\mu}}_S \,.$$

In the case of RQM, the factor of $1/2$ is predicted by the Dirac equation.

History
The events which led to and established RQM, and the continuation beyond into quantum electrodynamics (QED), are summarized below [see, for example, R. Resnick and R. Eisberg (1985), and P.W Atkins (1974) ]. More than half a century of experimental and theoretical research from the 1890s through to the 1950s in the new and mysterious quantum theory as it was up and coming revealed that a number of phenomena cannot be explained by QM alone. SR, found at the turn of the 20th century, was found to be a necessary component, leading to unification: RQM. Theoretical predictions and experiments mainly focused on the newly found atomic physics, nuclear physics, and particle physics; by considering spectroscopy, diffraction and scattering of particles, and the electrons and nuclei within atoms and molecules. Numerous results are attributed to the effects of spin.

Relativistic description of particles in quantum phenomena
Albert Einstein in 1905 explained of the photoelectric effect; a particle description of light as photons. In 1916, Sommerfeld explains fine structure; the splitting of the spectral lines of atoms due to first order relativistic corrections. The Compton effect of 1923 provided more evidence that special relativity does apply; in this case to a particle description of photon–electron scattering. de Broglie extends wave–particle duality to matter: the de Broglie relations, which are consistent with special relativity and quantum mechanics. By 1927, Davisson and Germer and separately G. Thomson successfully diffract electrons, providing experimental evidence of wave-particle duality.

Experiments

 * 1897 J. J. Thomson discovers the electron and measures its mass-to-charge ratio. Discovery of the Zeeman effect: the splitting a spectral line into several components in the presence of a static magnetic field.
 * 1908 Millikan measures the charge on the electron and finds experimental evidence of its quantization, in the oil drop experiment.
 * 1911 Alpha particle scattering in the Geiger–Marsden experiment, led by Rutherford, showed that atoms possess an internal structure: the atomic nucleus.
 * 1913 The Stark effect is discovered: splitting of spectral lines due to a static electric field (compare with the Zeeman effect).
 * 1922 Stern–Gerlach experiment: experimental evidence of spin and its quantization.
 * 1924 Stoner studies splitting of energy levels in magnetic fields.
 * 1932 Experimental discovery of the neutron by Chadwick, and positrons by Anderson, confirming the theoretical prediction of positrons.
 * 1958 Discovery of the Mössbauer effect: resonant and recoil-free emission and absorption of gamma radiation by atomic nuclei bound in a solid, useful for accurate measurements of gravitational redshift and time dilation, and in the analysis of nuclear electromagnetic moments in hyperfine interactions.

Quantum non-locality and relativistic locality
In 1935, Einstein, Rosen, Podolsky published a paper concerning quantum entanglement of particles, questioning quantum nonlocality and the apparent violation of causality upheld in SR: particles can appear to interact instantaneously at arbitrary distances. This was a misconception since information is not and cannot be transferred in the entangled states; rather the information transmission is in the process of measurement by two observers (one observer has to send a signal to the other, which cannot exceed c). QM does not violate SR. In 1959, Bohm and Aharonov publish a paper on the Aharonov–Bohm effect, questioning the status of electromagnetic potentials in QM. The EM field tensor and EM 4-potential formulations are both applicable in SR, but in QM the potentials enter the Hamiltonian (see above) and influence the motion of charged particles even in regions where the fields are zero. In 1964, Bell's theorem was published in a paper on the EPR paradox, showing that QM cannot be derived from local hidden-variable theories if locality is to be maintained.

The Lamb shift
In 1947, the Lamb shift was discovered: a small difference in the 2S$1/2$ and 2P$1/2$ levels of hydrogen, due to the interaction between the electron and vacuum. Lamb and Retherford experimentally measure stimulated radio-frequency transitions the 2S$1/2$ and 2P$1/2$ hydrogen levels by microwave radiation. An explanation of the Lamb shift is presented by Bethe. Papers on the effect were published in the early 1950s.

Development of quantum electrodynamics

 * 1927 Dirac establishes the field of QED, also coining the term "quantum electrodynamics".
 * 1943 Tomonaga begins work on renormalization, influential in QED.
 * 1947 Schwinger calculates the anomalous magnetic moment of the electron. Kusch measures of the anomalous magnetic electron moment, confirming one of QED's great predictions.

Atomic physics and chemistry

 * Relativistic quantum chemistry
 * Breit equation
 * Electron spin resonance
 * Fine-structure constant

Mathematical physics

 * Quantum spacetime
 * Spin connection
 * Spinor bundle
 * Dirac equation in the algebra of physical space
 * Casimir invariant
 * Casimir operator
 * Wigner D-matrix

Particle physics and quantum field theory

 * Zitterbewegung
 * Two-body Dirac equations
 * Relativistic Heavy Ion Collider
 * Symmetry (physics)
 * Parity
 * CPT invariance
 * Chirality (physics)
 * Standard model
 * Gauge theory
 * Tachyon
 * Modern searches for Lorentz violation