Schrödinger equation

The Schrödinger equation is a partial differential equation that governs the wave function of a quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after Erwin Schrödinger, who postulated the equation in 1925 and published it in 1926, forming the basis for the work that resulted in his Nobel Prize in Physics in 1933.

Conceptually, the Schrödinger equation is the quantum counterpart of Newton's second law in classical mechanics. Given a set of known initial conditions, Newton's second law makes a mathematical prediction as to what path a given physical system will take over time. The Schrödinger equation gives the evolution over time of the wave function, the quantum-mechanical characterization of an isolated physical system. The equation was postulated by Schrödinger based on a postulate of Louis de Broglie that all matter has an associated matter wave. The equation predicted bound states of the atom in agreement with experimental observations.

The Schrödinger equation is not the only way to study quantum mechanical systems and make predictions. Other formulations of quantum mechanics include matrix mechanics, introduced by Werner Heisenberg, and the path integral formulation, developed chiefly by Richard Feynman. When these approaches are compared, the use of the Schrödinger equation is sometimes called "wave mechanics". The Klein-Gordon equation is a wave equation which is the relativistic version of the Schrödinger equation. The Schrödinger equation is nonrelativistic because it contains a first derivative in time and a second derivative in space, and therefore space & time are not on equal footing.

Paul Dirac incorporated special relativity and quantum mechanics into a single formulation that simplifies to the Schrödinger equation in the non-relativistic limit. This is the Dirac equation, which contains a single derivative in both space and time. The second-derivative PDE of the Klein-Gordon equation led to a problem with probability density even though it was a relativistic wave equation. The probability density could be negative, which is physically unviable. This was fixed by Dirac by taking the so-called square-root of the Klein-Gordon operator and in turn introducing Dirac matrices. In a modern context, the Klein-Gordon equation describes spin-less particles, while the Dirac equation describes spin-1/2 particles.

Preliminaries


Introductory courses on physics or chemistry typically introduce the Schrödinger equation in a way that can be appreciated knowing only the concepts and notations of basic calculus, particularly derivatives with respect to space and time. A special case of the Schrödinger equation that admits a statement in those terms is the position-space Schrödinger equation for a single nonrelativistic particle in one dimension: $$i\hbar\frac{\partial}{\partial t} \Psi(x,t) = \left [ - \frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2} + V(x,t)\right ] \Psi(x,t).$$ Here, $$\Psi(x,t)$$ is a wave function, a function that assigns a complex number to each point $$x$$ at each time $$t$$. The parameter $$m$$ is the mass of the particle, and $$V(x,t)$$ is the potential that represents the environment in which the particle exists. The constant $$i$$ is the imaginary unit, and $$\hbar$$ is the reduced Planck constant, which has units of action (energy multiplied by time).

Broadening beyond this simple case, the mathematical formulation of quantum mechanics developed by Paul Dirac, David Hilbert, John von Neumann, and Hermann Weyl defines the state of a quantum mechanical system to be a vector $$|\psi\rangle$$ belonging to a separable complex Hilbert space $$\mathcal H$$. This vector is postulated to be normalized under the Hilbert space's inner product, that is, in Dirac notation it obeys $$\langle \psi | \psi \rangle = 1$$. The exact nature of this Hilbert space is dependent on the system – for example, for describing position and momentum the Hilbert space is the space of square-integrable functions $$L^2$$, while the Hilbert space for the spin of a single proton is the two-dimensional complex vector space $$\Complex^2$$ with the usual inner product.

Physical quantities of interest – position, momentum, energy, spin – are represented by observables, which are self-adjoint operators acting on the Hilbert space. A wave function can be an eigenvector of an observable, in which case it is called an eigenstate, and the associated eigenvalue corresponds to the value of the observable in that eigenstate. More generally, a quantum state will be a linear combination of the eigenstates, known as a quantum superposition. When an observable is measured, the result will be one of its eigenvalues with probability given by the Born rule: in the simplest case the eigenvalue $$\lambda$$ is non-degenerate and the probability is given by $$|\langle \lambda | \psi\rangle|^2$$, where $$ |\lambda\rangle$$ is its associated eigenvector. More generally, the eigenvalue is degenerate and the probability is given by $$\langle \psi | P_\lambda |\psi\rangle$$, where $$P_\lambda$$ is the projector onto its associated eigenspace.

A momentum eigenstate would be a perfectly monochromatic wave of infinite extent, which is not square-integrable. Likewise a position eigenstate would be a Dirac delta distribution, not square-integrable and technically not a function at all. Consequently, neither can belong to the particle's Hilbert space. Physicists sometimes regard these eigenstates as "generalized eigenvectors" for a Hilbert space composed of elements outside that space. These are used for calculational convenience and do not represent physical states. Thus, a position-space wave function $$\Psi(x,t)$$ as used above can be written as the inner product of a time-dependent state vector $$|\Psi(t)\rangle$$ with unphysical but convenient "position eigenstates" $$|x\rangle$$: $$\Psi(x,t) = \langle x | \Psi(t) \rangle.$$

Time-dependent equation
The form of the Schrödinger equation depends on the physical situation. The most general form is the time-dependent Schrödinger equation, which gives a description of a system evolving with time:

where $$t$$ is time, $$\vert\Psi(t)\rangle$$ is the state vector of the quantum system ($$\Psi$$ being the Greek letter psi), and $$\hat{H}$$ is an observable, the Hamiltonian operator.



The term "Schrödinger equation" can refer to both the general equation, or the specific nonrelativistic version. The general equation is indeed quite general, used throughout quantum mechanics, for everything from the Dirac equation to quantum field theory, by plugging in diverse expressions for the Hamiltonian. The specific nonrelativistic version is an approximation that yields accurate results in many situations, but only to a certain extent (see relativistic quantum mechanics and relativistic quantum field theory).

To apply the Schrödinger equation, write down the Hamiltonian for the system, accounting for the kinetic and potential energies of the particles constituting the system, then insert it into the Schrödinger equation. The resulting partial differential equation is solved for the wave function, which contains information about the system. In practice, the square of the absolute value of the wave function at each point is taken to define a probability density function. For example, given a wave function in position space $$\Psi(x,t)$$ as above, we have $$\Pr(x,t) = |\Psi(x,t)|^2.$$

Time-independent equation
The time-dependent Schrödinger equation described above predicts that wave functions can form standing waves, called stationary states. These states are particularly important as their individual study later simplifies the task of solving the time-dependent Schrödinger equation for any state. Stationary states can also be described by a simpler form of the Schrödinger equation, the time-independent Schrödinger equation.

where $$E$$ is the energy of the system. This is only used when the Hamiltonian itself is not dependent on time explicitly. However, even in this case the total wave function is dependent on time as explained in the section on linearity below. In the language of linear algebra, this equation is an eigenvalue equation. Therefore, the wave function is an eigenfunction of the Hamiltonian operator with corresponding eigenvalue(s) $$E$$.

Linearity
The Schrödinger equation is a linear differential equation, meaning that if two state vectors $$|\psi_1\rangle$$ and $$|\psi_2\rangle$$ are solutions, then so is any linear combination $$ |\psi\rangle = a|\psi_1\rangle + b |\psi_2\rangle $$ of the two state vectors where $a$ and $b$ are any complex numbers. Moreover, the sum can be extended for any number of state vectors. This property allows superpositions of quantum states to be solutions of the Schrödinger equation. Even more generally, it holds that a general solution to the Schrödinger equation can be found by taking a weighted sum over a basis of states. A choice often employed is the basis of energy eigenstates, which are solutions of the time-independent Schrödinger equation. In this basis, a time-dependent state vector $$|\Psi(t)\rangle$$ can be written as the linear combination $$|\Psi(t)\rangle = \sum_{n} A_n e^{ {-iE_n t}/\hbar} |\psi_{E_n}\rangle, $$ where $$A_n$$ are complex numbers and the vectors $$|\psi_{E_n}\rangle$$ are solutions of the time-independent equation $$\hat H |\psi_{E_n}\rangle = E_n |\psi_{E_n}\rangle$$.

Unitarity
Holding the Hamiltonian $$\hat{H}$$ constant, the Schrödinger equation has the solution $$ |\Psi(t)\rangle = e^{-i\hat{H}t/\hbar }|\Psi(0)\rangle.$$ The operator $$\hat{U}(t) = e^{-i\hat{H}t/\hbar}$$ is known as the time-evolution operator, and it is unitary: it preserves the inner product between vectors in the Hilbert space. Unitarity is a general feature of time evolution under the Schrödinger equation. If the initial state is $$|\Psi(0)\rangle$$, then the state at a later time $$t$$ will be given by $$ |\Psi(t)\rangle = \hat{U}(t) |\Psi(0)\rangle $$ for some unitary operator $$\hat{U}(t)$$. Conversely, suppose that $$\hat{U}(t)$$ is a continuous family of unitary operators parameterized by $$t$$. Without loss of generality, the parameterization can be chosen so that $$\hat{U}(0)$$ is the identity operator and that $$\hat{U}(t/N)^N = \hat{U}(t)$$ for any $$N > 0$$. Then $$\hat{U}(t)$$ depends upon the parameter $$t$$ in such a way that

for some self-adjoint operator $$\hat{G}$$, called the generator of the family $$\hat{U}(t)$$. A Hamiltonian is just such a generator (up to the factor of the Planck constant that would be set to 1 in natural units). To see that the generator is Hermitian, note that with $$\hat{U}(\delta t) \approx \hat{U}(0)-i\hat{G} \delta t$$, we have $$\hat{U}(\delta t)^\dagger \hat{U}(\delta t)\approx(\hat{U}(0)^\dagger+i\hat{G}^\dagger \delta t)(\hat{U}(0)-i\hat{G}\delta t)=I+i\delta t(\hat{G}^\dagger-\hat{G})+O(\delta t^2),$$ so $$\hat{U}(t)$$ is unitary only if, to first order, its derivative is Hermitian.

Changes of basis
The Schrödinger equation is often presented using quantities varying as functions of position, but as a vector-operator equation it has a valid representation in any arbitrary complete basis of kets in Hilbert space. As mentioned above, "bases" that lie outside the physical Hilbert space are also employed for calculational purposes. This is illustrated by the position-space and momentum-space Schrödinger equations for a nonrelativistic, spinless particle. The Hilbert space for such a particle is the space of complex square-integrable functions on three-dimensional Euclidean space, and its Hamiltonian is the sum of a kinetic-energy term that is quadratic in the momentum operator and a potential-energy term: $$i\hbar \frac{d}{dt}|\Psi(t)\rangle = \left(\frac{1}{2m}\hat{p}^2 + \hat{V}\right)|\Psi(t)\rangle.$$ Writing $$\mathbf{r}$$ for a three-dimensional position vector and $$\mathbf{p}$$ for a three-dimensional momentum vector, the position-space Schrödinger equation is $$i\hbar\frac{\partial}{\partial t} \Psi(\mathbf{r},t) = - \frac{\hbar^2}{2m} \nabla^2 \Psi(\mathbf{r},t) + V(\mathbf{r}) \Psi(\mathbf{r},t).$$ The momentum-space counterpart involves the Fourier transforms of the wave function and the potential: $$ i\hbar \frac{\partial}{\partial t} \tilde{\Psi}(\mathbf{p}, t) = \frac{\mathbf{p}^2}{2m} \tilde{\Psi}(\mathbf{p},t) + (2\pi\hbar)^{-3/2} \int d^3 \mathbf{p}' \, \tilde{V}(\mathbf{p} - \mathbf{p}') \tilde{\Psi}(\mathbf{p}',t).$$ The functions $$\Psi(\mathbf{r},t)$$ and $$\tilde{\Psi}(\mathbf{p},t)$$ are derived from $$|\Psi(t)\rangle$$ by $$\Psi(\mathbf{r},t) = \langle \mathbf{r} | \Psi(t)\rangle,$$ $$\tilde{\Psi}(\mathbf{p},t) = \langle \mathbf{p} | \Psi(t)\rangle,$$ where $$|\mathbf{r}\rangle$$ and $$|\mathbf{p}\rangle$$ do not belong to the Hilbert space itself, but have well-defined inner products with all elements of that space.

When restricted from three dimensions to one, the position-space equation is just the first form of the Schrödinger equation given above. The relation between position and momentum in quantum mechanics can be appreciated in a single dimension. In canonical quantization, the classical variables $$x$$ and $$p$$ are promoted to self-adjoint operators $$\hat{x}$$ and $$\hat{p}$$ that satisfy the canonical commutation relation $$[\hat{x}, \hat{p}] = i\hbar.$$ This implies that $$\langle x | \hat{p} | \Psi \rangle = -i\hbar \frac{d}{dx} \Psi(x),$$ so the action of the momentum operator $$\hat{p}$$ in the position-space representation is $-i\hbar \frac{d}{dx}$. Thus, $$\hat{p}^2$$ becomes a second derivative, and in three dimensions, the second derivative becomes the Laplacian $$\nabla^2$$.

The canonical commutation relation also implies that the position and momentum operators are Fourier conjugates of each other. Consequently, functions originally defined in terms of their position dependence can be converted to functions of momentum using the Fourier transform. In solid-state physics, the Schrödinger equation is often written for functions of momentum, as Bloch's theorem ensures the periodic crystal lattice potential couples $$\tilde{\Psi}(p) $$ with $$\tilde{\Psi}(p+K) $$ for only discrete reciprocal lattice vectors $$K $$. This makes it convenient to solve the momentum-space Schrödinger equation at each point in the Brillouin zone independently of the other points in the Brillouin zone.

Probability current
The Schrödinger equation is consistent with local probability conservation. It also ensures that a normalized wavefunction remains normalized after time evolution. In matrix mechanics, this means that the time evolution operator is a unitary operator. In contrast to, for example, the Klein Gordon equation, although a redefined inner product of a wavefunction can be time independent, the total volume integral of modulus square of the wavefunction need not be time independent.

The continuity equation for probability in non relativistic quantum mechanics is stated as: $$\frac{\partial}{\partial t} \rho\left(\mathbf{r},t\right) + \nabla \cdot \mathbf{j} = 0, $$where $$ \mathbf{j} = \frac{1}{2m} \left( \Psi^*\hat{\mathbf{p}}\Psi - \Psi\hat{\mathbf{p}}\Psi^* \right) = -\frac{i\hbar}{2m}(\psi^*\nabla\psi-\psi\nabla\psi^*) = \frac \hbar m \operatorname{Im} (\psi^*\nabla \psi)    $$ is the probability current or probability flux (flow per unit area).

If the wavefunction is represented as $\psi( {\bf x},t)=\sqrt{\rho({\bf x},t)}\exp\left(\frac{i S({\bf x},t)}{\hbar}\right), $ where $$S(\mathbf x,t) $$ is a real function which represents the complex phase of the wavefunction, then the probability flux is calculated as:$$ \mathbf{j} = \frac{\rho \nabla S}  {m} $$Hence, the spatial variation of the phase of a wavefunction is said to characterize the probability flux of the wavefunction. Although the $ \frac{ \nabla S} {m} $  term appears to play the role of velocity, it does not represent velocity at a point since simultaneous measurement of position and velocity violates uncertainty principle.

Separation of variables
If the Hamiltonian is not an explicit function of time, Schrödinger's equation reads: $$i\hbar\frac{\partial}{\partial t} \Psi(\mathbf{r},t) = \left [ - \frac{\hbar^2}{2m}\nabla^2 + V(\mathbf{r})\right ] \Psi(\mathbf{r},t).$$ The operator on the left side depends only on time; the one on the right side depends only on space. Solving the equation by separation of variables means seeking a solution of the form of a product of spatial and temporal parts $$\Psi(\mathbf{r},t)=\psi(\mathbf{r})\tau(t),$$ where $$\psi(\mathbf{r})$$ is a function of all the spatial coordinate(s) of the particle(s) constituting the system only, and $$\tau(t)$$ is a function of time only. Substituting this expression for $$\Psi$$ into the time dependent left hand side shows that $$\tau(t)$$ is a phase factor: $$ \Psi(\mathbf{r},t) = \psi(\mathbf{r}) e^{-i{E t/\hbar}}.$$ A solution of this type is called stationary, since the only time dependence is a phase factor that cancels when the probability density is calculated via the Born rule.

The spatial part of the full wave function solves: $$ \nabla^2\psi(\mathbf{r}) + \frac{2m}{\hbar^2} \left [E - V(\mathbf{r})\right ] \psi(\mathbf{r}) = 0.$$ where the energy $$E$$ appears in the phase factor.

This generalizes to any number of particles in any number of dimensions (in a time-independent potential): the standing wave solutions of the time-independent equation are the states with definite energy, instead of a probability distribution of different energies. In physics, these standing waves are called "stationary states" or "energy eigenstates"; in chemistry they are called "atomic orbitals" or "molecular orbitals". Superpositions of energy eigenstates change their properties according to the relative phases between the energy levels. The energy eigenstates form a basis: any wave function may be written as a sum over the discrete energy states or an integral over continuous energy states, or more generally as an integral over a measure. This is the spectral theorem in mathematics, and in a finite-dimensional state space it is just a statement of the completeness of the eigenvectors of a Hermitian matrix.

Separation of variables can also be a useful method for the time-independent Schrödinger equation. For example, depending on the symmetry of the problem, the Cartesian axes might be separated, $$\psi(\mathbf{r}) = \psi_x(x)\psi_y(y)\psi_z(z),$$ or radial and angular coordinates might be separated: $$\psi(\mathbf{r}) = \psi_r(r)\psi_\theta(\theta)\psi_\phi(\phi).$$

Particle in a box


The particle in a one-dimensional potential energy box is the most mathematically simple example where restraints lead to the quantization of energy levels. The box is defined as having zero potential energy inside a certain region and infinite potential energy outside. For the one-dimensional case in the $$x$$ direction, the time-independent Schrödinger equation may be written $$ - \frac {\hbar ^2}{2m} \frac {d ^2 \psi}{dx^2} = E \psi.$$

With the differential operator defined by $$ \hat{p}_x = -i\hbar\frac{d}{dx} $$ the previous equation is evocative of the classic kinetic energy analogue, $$ \frac{1}{2m} \hat{p}_x^2 = E,$$ with state $$\psi$$ in this case having energy $$E$$ coincident with the kinetic energy of the particle.

The general solutions of the Schrödinger equation for the particle in a box are $$ \psi(x) = A e^{ikx} + B e ^{-ikx} \qquad\qquad E = \frac{\hbar^2 k^2}{2m}$$ or, from Euler's formula, $$ \psi(x) = C \sin(kx) + D \cos(kx).$$

The infinite potential walls of the box determine the values of $$C, D, $$ and $$k$$ at $$x=0$$ and $$x=L$$ where $$\psi$$ must be zero. Thus, at $$x=0$$, $$\psi(0) = 0 = C\sin(0) + D\cos(0) = D$$ and $$D=0$$. At $$x=L$$, $$ \psi(L) = 0 = C\sin(kL),$$ in which $$C$$ cannot be zero as this would conflict with the postulate that $$\psi$$ has norm 1. Therefore, since $$\sin(kL)=0$$, $$kL$$ must be an integer multiple of $$\pi$$, $$k = \frac{n\pi}{L}\qquad\qquad n=1,2,3,\ldots.$$

This constraint on $$k$$ implies a constraint on the energy levels, yielding $$E_n = \frac{\hbar^2 \pi^2 n^2}{2mL^2} = \frac{n^2h^2}{8mL^2}.$$

A finite potential well is the generalization of the infinite potential well problem to potential wells having finite depth. The finite potential well problem is mathematically more complicated than the infinite particle-in-a-box problem as the wave function is not pinned to zero at the walls of the well. Instead, the wave function must satisfy more complicated mathematical boundary conditions as it is nonzero in regions outside the well. Another related problem is that of the rectangular potential barrier, which furnishes a model for the quantum tunneling effect that plays an important role in the performance of modern technologies such as flash memory and scanning tunneling microscopy.

Harmonic oscillator


The Schrödinger equation for this situation is $$ E\psi = -\frac{\hbar^2}{2m}\frac{d^2}{d x^2}\psi + \frac{1}{2} m\omega^2 x^2\psi, $$ where $$ x $$ is the displacement and $$ \omega $$ the angular frequency. Furthermore, it can be used to describe approximately a wide variety of other systems, including vibrating atoms, molecules, and atoms or ions in lattices, and approximating other potentials near equilibrium points. It is also the basis of perturbation methods in quantum mechanics.

The solutions in position space are $$ \psi_n(x) = \sqrt{\frac{1}{2^n\,n!}} \ \left(\frac{m\omega}{\pi \hbar}\right)^{1/4} \ e^{ - \frac{m\omega x^2}{2 \hbar}} \ \mathcal{H}_n\left(\sqrt{\frac{m\omega}{\hbar}} x \right), $$ where $$n \in \{0, 1, 2, \ldots \}$$, and the functions $$ \mathcal{H}_n $$ are the Hermite polynomials of order $$ n $$. The solution set may be generated by $$\psi_n(x) = \frac{1}{\sqrt{n!}} \left( \sqrt{\frac{m \omega}{2 \hbar}} \right)^{n} \left( x - \frac{\hbar}{m \omega} \frac{d}{dx}\right)^n \left( \frac{m \omega}{\pi \hbar} \right)^{\frac{1}{4}} e^{\frac{-m \omega x^2}{2\hbar}}.$$

The eigenvalues are $$ E_n = \left(n + \frac{1}{2} \right) \hbar \omega. $$

The case $$ n = 0 $$ is called the ground state, its energy is called the zero-point energy, and the wave function is a Gaussian.

The harmonic oscillator, like the particle in a box, illustrates the generic feature of the Schrödinger equation that the energies of bound eigenstates are discretized.

Hydrogen atom
The Schrödinger equation for the electron in a hydrogen atom (or a hydrogen-like atom) is $$ E \psi = -\frac{\hbar^2}{2\mu}\nabla^2\psi - \frac{q^2}{4\pi\varepsilon_0 r}\psi $$ where $$ q $$ is the electron charge, $$ \mathbf{r} $$ is the position of the electron relative to the nucleus, $$ r = |\mathbf{r}| $$ is the magnitude of the relative position, the potential term is due to the Coulomb interaction, wherein $$ \varepsilon_0 $$ is the permittivity of free space and $$ \mu = \frac{m_q m_p}{m_q+m_p} $$ is the 2-body reduced mass of the hydrogen nucleus (just a proton) of mass $$ m_p $$ and the electron of mass $$ m_q $$. The negative sign arises in the potential term since the proton and electron are oppositely charged. The reduced mass in place of the electron mass is used since the electron and proton together orbit each other about a common center of mass, and constitute a two-body problem to solve. The motion of the electron is of principal interest here, so the equivalent one-body problem is the motion of the electron using the reduced mass.

The Schrödinger equation for a hydrogen atom can be solved by separation of variables. In this case, spherical polar coordinates are the most convenient. Thus, $$ \psi(r,\theta,\varphi) = R(r)Y_\ell^m(\theta, \varphi) = R(r)\Theta(\theta)\Phi(\varphi),$$ where $V = 0$ are radial functions and $$ Y^m_l (\theta, \varphi) $$ are spherical harmonics of degree $$ \ell $$ and order $$ m $$. This is the only atom for which the Schrödinger equation has been solved for exactly. Multi-electron atoms require approximate methods. The family of solutions are: $$ \psi_{n\ell m}(r,\theta,\varphi) = \sqrt {\left ( \frac{2}{n a_0} \right )^3\frac{(n-\ell-1)!}{2n[(n+\ell)!]} } e^{- r/na_0} \left(\frac{2r}{na_0}\right)^\ell L_{n-\ell-1}^{2\ell+1}\left(\frac{2r}{na_0}\right) \cdot Y_{\ell}^m(\theta, \varphi ) $$ where
 * $$ a_0 = \frac{4 \pi \varepsilon_0 \hbar^2}{m_q q^2} $$ is the Bohr radius,
 * $$ L_{n-\ell-1}^{2\ell+1}(\cdots) $$ are the generalized Laguerre polynomials of degree $$ n - \ell - 1 $$,
 * $$ n, \ell, m $$ are the principal, azimuthal, and magnetic quantum numbers respectively, which take the values $$n = 1, 2, 3, \dots,$$ $$\ell = 0, 1, 2, \dots, n - 1,$$ $$m = -\ell, \dots, \ell.$$

Approximate solutions
It is typically not possible to solve the Schrödinger equation exactly for situations of physical interest. Accordingly, approximate solutions are obtained using techniques like variational methods and WKB approximation. It is also common to treat a problem of interest as a small modification to a problem that can be solved exactly, a method known as perturbation theory.

Semiclassical limit
One simple way to compare classical to quantum mechanics is to consider the time-evolution of the expected position and expected momentum, which can then be compared to the time-evolution of the ordinary position and momentum in classical mechanics. The quantum expectation values satisfy the Ehrenfest theorem. For a one-dimensional quantum particle moving in a potential $$V$$, the Ehrenfest theorem says $$m\frac{d}{dt}\langle x\rangle = \langle p\rangle;\quad \frac{d}{dt}\langle p\rangle = -\left\langle V'(X)\right\rangle.$$ Although the first of these equations is consistent with the classical behavior, the second is not: If the pair $$(\langle X\rangle, \langle P\rangle)$$ were to satisfy Newton's second law, the right-hand side of the second equation would have to be $$-V'\left(\left\langle X\right\rangle\right)$$ which is typically not the same as $$-\left\langle V'(X)\right\rangle$$. For a general $$V'$$, therefore, quantum mechanics can lead to predictions where expectation values do not mimic the classical behavior. In the case of the quantum harmonic oscillator, however, $$V'$$ is linear and this distinction disappears, so that in this very special case, the expected position and expected momentum do exactly follow the classical trajectories.

For general systems, the best we can hope for is that the expected position and momentum will approximately follow the classical trajectories. If the wave function is highly concentrated around a point $$x_0$$, then $$V'\left(\left\langle X\right\rangle\right)$$ and $$\left\langle V'(X)\right\rangle$$ will be almost the same, since both will be approximately equal to $$V'(x_0)$$. In that case, the expected position and expected momentum will remain very close to the classical trajectories, at least for as long as the wave function remains highly localized in position.

The Schrödinger equation in its general form $$ i\hbar \frac{\partial}{\partial t} \Psi\left(\mathbf{r},t\right) = \hat{H} \Psi\left(\mathbf{r},t\right)$$ is closely related to the Hamilton–Jacobi equation (HJE) $$ -\frac{\partial}{\partial t} S(q_i,t) = H\left(q_i,\frac{\partial S}{\partial q_i},t \right) $$ where $$S$$ is the classical action and $$H$$ is the Hamiltonian function (not operator). Here the generalized coordinates $$q_i$$ for $$i = 1, 2, 3$$ (used in the context of the HJE) can be set to the position in Cartesian coordinates as $$\mathbf{r} = (q_1, q_2, q_3) = (x, y, z)$$.

Substituting $$ \Psi = \sqrt{\rho(\mathbf{r},t)} e^{iS(\mathbf{r},t)/\hbar}$$ where $$\rho$$ is the probability density, into the Schrödinger equation and then taking the limit $$\hbar \to 0$$ in the resulting equation yield the Hamilton–Jacobi equation.

Density matrices
Wave functions are not always the most convenient way to describe quantum systems and their behavior. When the preparation of a system is only imperfectly known, or when the system under investigation is a part of a larger whole, density matrices may be used instead. A density matrix is a positive semi-definite operator whose trace is equal to 1. (The term "density operator" is also used, particularly when the underlying Hilbert space is infinite-dimensional.) The set of all density matrices is convex, and the extreme points are the operators that project onto vectors in the Hilbert space. These are the density-matrix representations of wave functions; in Dirac notation, they are written $$ \hat{\rho} = |\Psi\rangle\langle \Psi|.$$

The density-matrix analogue of the Schrödinger equation for wave functions is $$ i \hbar \frac{\partial \hat{\rho}}{\partial t} = [\hat{H}, \hat{\rho}],$$ where the brackets denote a commutator. This is variously known as the von Neumann equation, the Liouville–von Neumann equation, or just the Schrödinger equation for density matrices. If the Hamiltonian is time-independent, this equation can be easily solved to yield $$\hat{\rho}(t) = e^{-i \hat{H} t/\hbar} \hat{\rho}(0) e^{i \hat{H} t/\hbar}.$$

More generally, if the unitary operator $$\hat{U}(t)$$ describes wave function evolution over some time interval, then the time evolution of a density matrix over that same interval is given by $$ \hat{\rho}(t) = \hat{U}(t) \hat{\rho}(0) \hat{U}(t)^\dagger.$$

Unitary evolution of a density matrix conserves its von Neumann entropy.

Relativistic quantum physics and quantum field theory
The one-particle Schrödinger equation described above is valid essentially in the nonrelativistic domain. For one reason, it is essentially invariant under Galilean transformations, which form the symmetry group of Newtonian dynamics. Moreover, processes that change particle number are natural in relativity, and so an equation for one particle (or any fixed number thereof) can only be of limited use. A more general form of the Schrödinger equation that also applies in relativistic situations can be formulated within quantum field theory (QFT), a framework that allows the combination of quantum mechanics with special relativity. The region in which both simultaneously apply may be described by relativistic quantum mechanics. Such descriptions may use time evolution generated by a Hamiltonian operator, as in the Schrödinger functional method.

Klein–Gordon and Dirac equations
Attempts to combine quantum physics with special relativity began with building relativistic wave equations from the relativistic energy–momentum relation $$E^2 = (pc)^2 + \left(m_0 c^2\right)^2,$$ instead of nonrelativistic energy equations. The Klein–Gordon equation and the Dirac equation are two such equations. The Klein–Gordon equation, $$ -\frac {1}{c^2} \frac{\partial^2}{\partial t^2} \psi + \nabla^2 \psi = \frac {m^2 c^2}{\hbar^2} \psi,$$ was the first such equation to be obtained, even before the nonrelativistic one-particle Schrödinger equation, and applies to massive spinless particles. Historically, Dirac obtained the Dirac equation by seeking a differential equation that would be first-order in both time and space, a desirable property for a relativistic theory. Taking the "square root" of the left-hand side of the Klein–Gordon equation in this way required factorizing it into a product of two operators, which Dirac wrote using 4 × 4 matrices $$\alpha_1,\alpha_2,\alpha_3,\beta$$. Consequently, the wave function also became a four-component function, governed by the Dirac equation that, in free space, read $$\left(\beta mc^2 + c\left(\sum_{n \mathop = 1}^{3}\alpha_n p_n\right)\right) \psi = i \hbar \frac{\partial\psi }{\partial t}. $$

This has again the form of the Schrödinger equation, with the time derivative of the wave function being given by a Hamiltonian operator acting upon the wave function. Including influences upon the particle requires modifying the Hamiltonian operator. For example, the Dirac Hamiltonian for a particle of mass $R$ and electric charge $m$ in an electromagnetic field (described by the electromagnetic potentials $q$ and $φ$) is: $$\hat{H}_{\text{Dirac}}= \gamma^0 \left[c \boldsymbol{\gamma}\cdot\left(\hat{\mathbf{p}} - q \mathbf{A}\right) + mc^2 + \gamma^0q \varphi \right],$$ in which the $A$ and $γ = (γ^{1}, γ^{2}, γ^{3})$ are the Dirac gamma matrices related to the spin of the particle. The Dirac equation is true for all spin-$1/undefined$ particles, and the solutions to the equation are 4-component spinor fields with two components corresponding to the particle and the other two for the antiparticle.

For the Klein–Gordon equation, the general form of the Schrödinger equation is inconvenient to use, and in practice the Hamiltonian is not expressed in an analogous way to the Dirac Hamiltonian. The equations for relativistic quantum fields, of which the Klein–Gordon and Dirac equations are two examples, can be obtained in other ways, such as starting from a Lagrangian density and using the Euler–Lagrange equations for fields, or using the representation theory of the Lorentz group in which certain representations can be used to fix the equation for a free particle of given spin (and mass).

In general, the Hamiltonian to be substituted in the general Schrödinger equation is not just a function of the position and momentum operators (and possibly time), but also of spin matrices. Also, the solutions to a relativistic wave equation, for a massive particle of spin $γ^{0}$, are complex-valued $s$-component spinor fields.

Fock space
As originally formulated, the Dirac equation is an equation for a single quantum particle, just like the single-particle Schrödinger equation with wave function $\Psi(x,t)$. This is of limited use in relativistic quantum mechanics, where particle number is not fixed. Heuristically, this complication can be motivated by noting that mass–energy equivalence implies material particles can be created from energy. A common way to address this in QFT is to introduce a Hilbert space where the basis states are labeled by particle number, a so-called Fock space. The Schrödinger equation can then be formulated for quantum states on this Hilbert space. However, because the Schrödinger equation picks out a preferred time axis, the Lorentz invariance of the theory is no longer manifest, and accordingly, the theory is often formulated in other ways.

History
Following Max Planck's quantization of light (see black-body radiation), Albert Einstein interpreted Planck's quanta to be photons, particles of light, and proposed that the energy of a photon is proportional to its frequency, one of the first signs of wave–particle duality. Since energy and momentum are related in the same way as frequency and wave number in special relativity, it followed that the momentum $$p$$ of a photon is inversely proportional to its wavelength $$\lambda$$, or proportional to its wave number $$k$$: $$p = \frac{h}{\lambda} = \hbar k,$$ where $$h$$ is the Planck constant and $$\hbar = {h}/{2\pi}$$ is the reduced Planck constant. Louis de Broglie hypothesized that this is true for all particles, even particles which have mass such as electrons. He showed that, assuming that the matter waves propagate along with their particle counterparts, electrons form standing waves, meaning that only certain discrete rotational frequencies about the nucleus of an atom are allowed. These quantized orbits correspond to discrete energy levels, and de Broglie reproduced the Bohr model formula for the energy levels. The Bohr model was based on the assumed quantization of angular momentum $$L$$ according to $$ L = n \frac{h}{2\pi} = n\hbar.$$ According to de Broglie, the electron is described by a wave, and a whole number of wavelengths must fit along the circumference of the electron's orbit: $$n \lambda = 2 \pi r.$$

This approach essentially confined the electron wave in one dimension, along a circular orbit of radius $$r$$.

In 1921, prior to de Broglie, Arthur C. Lunn at the University of Chicago had used the same argument based on the completion of the relativistic energy–momentum 4-vector to derive what we now call the de Broglie relation. Unlike de Broglie, Lunn went on to formulate the differential equation now known as the Schrödinger equation and solve for its energy eigenvalues for the hydrogen atom; the paper was rejected by the Physical Review, according to Kamen.

Following up on de Broglie's ideas, physicist Peter Debye made an offhand comment that if particles behaved as waves, they should satisfy some sort of wave equation. Inspired by Debye's remark, Schrödinger decided to find a proper 3-dimensional wave equation for the electron. He was guided by William Rowan Hamilton's analogy between mechanics and optics, encoded in the observation that the zero-wavelength limit of optics resembles a mechanical system—the trajectories of light rays become sharp tracks that obey Fermat's principle, an analog of the principle of least action.

The equation he found is $$i\hbar \frac{\partial}{\partial t} \Psi(\mathbf{r}, t) = -\frac{\hbar^2}{2m} \nabla^2 \Psi(\mathbf{r}, t) + V(\mathbf{r})\Psi(\mathbf{r}, t).$$

By that time Arnold Sommerfeld had refined the Bohr model with relativistic corrections. Schrödinger used the relativistic energy–momentum relation to find what is now known as the Klein–Gordon equation in a Coulomb potential (in natural units): $$\left(E + \frac{e^2}{r}\right)^2 \psi(x) = - \nabla^2 \psi(x) + m^2 \psi(x).$$

He found the standing waves of this relativistic equation, but the relativistic corrections disagreed with Sommerfeld's formula. Discouraged, he put away his calculations and secluded himself with a mistress in a mountain cabin in December 1925.

While at the cabin, Schrödinger decided that his earlier nonrelativistic calculations were novel enough to publish and decided to leave off the problem of relativistic corrections for the future. Despite the difficulties in solving the differential equation for hydrogen (he had sought help from his friend the mathematician Hermann Weyl ) Schrödinger showed that his nonrelativistic version of the wave equation produced the correct spectral energies of hydrogen in a paper published in 1926. Schrödinger computed the hydrogen spectral series by treating a hydrogen atom's electron as a wave $$\Psi(\mathbf{x}, t)$$, moving in a potential well $$V$$, created by the proton. This computation accurately reproduced the energy levels of the Bohr model.

The Schrödinger equation details the behavior of $$\Psi$$ but says nothing of its nature. Schrödinger tried to interpret the real part of $$\Psi \frac{\partial \Psi^*}{\partial t}$$ as a charge density, and then revised this proposal, saying in his next paper that the modulus squared of $$\Psi$$ is a charge density. This approach was, however, unsuccessful. In 1926, just a few days after this paper was published, Max Born successfully interpreted $$\Psi$$ as the probability amplitude, whose modulus squared is equal to probability density. Later, Schrödinger himself explained this interpretation as follows: The already ... mentioned psi-function.... is now the means for predicting probability of measurement results. In it is embodied the momentarily attained sum of theoretically based future expectation, somewhat as laid down in a catalog.

Interpretation
The Schrödinger equation provides a way to calculate the wave function of a system and how it changes dynamically in time. However, the Schrödinger equation does not directly say what, exactly, the wave function is. The meaning of the Schrödinger equation and how the mathematical entities in it relate to physical reality depends upon the interpretation of quantum mechanics that one adopts.

In the views often grouped together as the Copenhagen interpretation, a system's wave function is a collection of statistical information about that system. The Schrödinger equation relates information about the system at one time to information about it at another. While the time-evolution process represented by the Schrödinger equation is continuous and deterministic, in that knowing the wave function at one instant is in principle sufficient to calculate it for all future times, wave functions can also change discontinuously and stochastically during a measurement. The wave function changes, according to this school of thought, because new information is available. The post-measurement wave function generally cannot be known prior to the measurement, but the probabilities for the different possibilities can be calculated using the Born rule. Other, more recent interpretations of quantum mechanics, such as relational quantum mechanics and QBism also give the Schrödinger equation a status of this sort.

Schrödinger himself suggested in 1952 that the different terms of a superposition evolving under the Schrödinger equation are "not alternatives but all really happen simultaneously". This has been interpreted as an early version of Everett's many-worlds interpretation. This interpretation, formulated independently in 1956, holds that all the possibilities described by quantum theory simultaneously occur in a multiverse composed of mostly independent parallel universes. This interpretation removes the axiom of wave function collapse, leaving only continuous evolution under the Schrödinger equation, and so all possible states of the measured system and the measuring apparatus, together with the observer, are present in a real physical quantum superposition. While the multiverse is deterministic, we perceive non-deterministic behavior governed by probabilities, because we do not observe the multiverse as a whole, but only one parallel universe at a time. Exactly how this is supposed to work has been the subject of much debate. Why we should assign probabilities at all to outcomes that are certain to occur in some worlds, and why should the probabilities be given by the Born rule? Several ways to answer these questions in the many-worlds framework have been proposed, but there is no consensus on whether they are successful.

Bohmian mechanics reformulates quantum mechanics to make it deterministic, at the price of adding a force due to a "quantum potential". It attributes to each physical system not only a wave function but in addition a real position that evolves deterministically under a nonlocal guiding equation. The evolution of a physical system is given at all times by the Schrödinger equation together with the guiding equation.