Wave packet



In physics, a wave packet (also known as a wave train or wave group) is a short burst of localized wave action that travels as a unit, outlined by an envelope. A wave packet can be analyzed into, or can be synthesized from, a potentially-infinite set of component sinusoidal waves of different wavenumbers, with phases and amplitudes such that they interfere constructively only over a small region of space, and destructively elsewhere. Any signal of a limited width in time or space requires many frequency components around a center frequency within a bandwidth inversely proportional to that width; even a gaussian function is considered a wave packet because its Fourier transform is a "packet" of waves of frequencies clustered around a central frequency. Each component wave function, and hence the wave packet, are solutions of a wave equation. Depending on the wave equation, the wave packet's profile may remain constant (no dispersion) or it may change (dispersion) while propagating.

Historical background
Ideas related to wave packets – modulation, carrier waves, phase velocity, and group velocity – date from the mid-1800s. The idea of a group velocity distinct from a wave's phase velocity was first proposed by W.R. Hamilton in 1839, and the first full treatment was by Rayleigh in his "Theory of Sound" in 1877.

Erwin Schrödinger introduced the idea of wave packets just after publishing his famous wave equation. He solved his wave equation for a quantum harmonic oscillator, introduced the superposition principle, and used it to show that a compact state could persist. While this work did result in the important concept of coherent states, the wave packet concept did not endure. The year after Schrödinger's paper, Werner Heisenberg published his paper on the uncertainty principle, showing in the process, that Schrödinger's results only applied to quantum harmonic oscillators, not for example to Coulomb potential characteristic of atoms.

The following year, 1927, Charles Galton Darwin explored Schrödinger's equation for an unbound electron in free space, assuming an initial Gaussian wave packet. Darwin showed that at time $$t$$ later the position $$x$$ of the packet traveling at velocity $$v$$ would be

$$x_0 + vt \pm \sqrt{\sigma^2 + (ht/2\pi\sigma m)^2}$$

where $$\sigma$$ is the uncertainty in the initial position.

Later in 1927 Paul Ehrenfest showed that the time, $$T$$ for a matter wave packet of width $$\Delta x$$ and mass $$m$$ to spread by a factor of 2 was $T\approx \Delta x \sqrt{m/\hbar}$. Since $$\hbar$$ is so small, wave packets on the scale of macroscopic objects, with large width and mass, double only at cosmic time scales.

Significance in quantum mechanics
Quantum mechanics describes the nature of atomic and subatomic systems using Schrödinger's wave equation. The classical limit of quantum mechanics and many formulations of quantum scattering use wave packets formed from various solutions to this equation. Quantum wave packet profiles change while propagating; they show dispersion. Physicists have concluded that "wave packets would not do as representations of subatomic particles".

Wave packets and the classical limit
Schrodinger developed wave packets in hopes of interpreting quantum wave solutions as locally compact wave groups. Such packets tradeoff position localization for spreading momentum. In the coordinate representation of the wave (such as the Cartesian coordinate system), the position of the particle's localized probability is specified by the position of the packet solution. The narrower the spatial wave packet, and therefore the better localized the position of the wave packet, the larger the spread in the momentum of the wave. This trade-off between spread in position and spread in momentum is a characteristic feature of the Heisenberg uncertainty principle.

One kind of optimal tradeoff minimizes the product of position uncertainty $$\Delta x$$ and momentum uncertainty $$\Delta p_x$$. If we place such a packet at rest it stays at rest: the average value of the position and momentum match a classical particle. However it spreads out in all directions with a velocity given by the optimal momentum uncertainty $$\Delta p_x$$. The spread is so fast that in the distance of once around an atom the wave packet is unrecognizable.

Wave packets and quantum scattering
Particle interactions are called scattering in physics; wave packet mathematics play an important role in quantum scattering approaches. A monochromatic (single momentum) source produces convergence difficulties in the scattering models. Scattering problems also have classical limits. Whenever the scattering target (for example an atom) has a size much smaller than wave packet, the center of the wave packet follows scattering classical trajectories. In other cases, the wave packet distorts and scatters as it interacts with the target.

Non-dispersive
Without dispersion the wave packet maintains its shape as it propagates. As an example of propagation without dispersion, consider wave solutions to the following wave equation from classical physics $${ \partial^2 u \over \partial t^2 } = c^2 \, \nabla^2 u,$$

where $c$ is the speed of the wave's propagation in a given medium.

Using the physics time convention, $e^{−iωt}$, the wave equation has plane-wave solutions $$ u(\mathbf{x},t) = e^{i{(\mathbf{k\cdot x}}-\omega t)},$$

where $$ \omega^2 =|\mathbf{k}|^2 c^2,$$ and $$ |\mathbf{k}|^2 = k_x^2 + k_y^2+ k_z^2.$$

This relation between $ω$ and $k$ should be valid so that the plane wave is a solution to the wave equation. It is called a dispersion relation.

To simplify, consider only waves propagating in one dimension (extension to three dimensions is straightforward). Then the general solution is $$ u(x,t)= A e^{i(kx-\omega t)} + B e^{-i(kx+\omega t)},$$ in which we may take $ω = kc$. The first term represents a wave propagating in the positive $x$-direction since it is a function of $x − ct$ only; the second term, being a function of $x + ct$, represents a wave propagating in the negative $x$-direction.

A wave packet is a localized disturbance that results from the sum of many different wave forms. If the packet is strongly localized, more frequencies are needed to allow the constructive superposition in the region of localization and destructive superposition outside the region. From the basic solutions in one dimension, a general form of a wave packet can be expressed as $$ u(x,t) = \frac{1}{\sqrt{2\pi}} \int^{\,\infty}_{-\infty} A(k) ~ e^{i(kx-\omega(k)t)} \, dk.$$

As in the plane-wave case the wave packet travels to the right for $ω(k) = kc$, since $u(x, t) = F(x − ct)$, and to the left for $ω(k) = −kc$, since $u(x, t) = F(x + ct)$.

The factor $$1 / \sqrt{2 \pi}$$ comes from Fourier transform conventions. The amplitude $A(k)$ contains the coefficients of the linear superposition of the plane-wave solutions. These coefficients can in turn be expressed as a function of $u(x, t)$ evaluated at $t = 0$ by inverting the Fourier transform relation above: $$ A(k) = \frac{1}{\sqrt{2\pi}} \int^{\,\infty}_{-\infty} u(x,0) ~ e^{-ikx} \, dx.$$

For instance, choosing $$ u(x,0) = e^{-x^2 +ik_0x},$$

we obtain $$ A(k) = \frac{1}{\sqrt{2}} e^{-\frac{(k-k_0)^2}{4}},$$

and finally $$\begin{align} u(x,t) &= e^{-(x-ct)^2 +ik_0(x-ct)}\\ &= e^{-(x-ct)^2} \left[\cos\left(2\pi \frac{x-ct}{\lambda}\right)+ i\sin\left(2\pi\frac{x-ct}{\lambda}\right)\right]. \end{align} $$

The nondispersive propagation of the real or imaginary part of this wave packet is presented in the above animation.

Dispersive


By contrast, as an example of dispersion where a wave changes shape during propagation, consider instead solutions to the Schrödinger equation (nondimensionalized with $2Δx$, $m$, and ħ set equal to one), $$i{ \partial \psi \over \partial t } = -\frac{1}{2} { \nabla^2 \psi },$$ yielding the dispersion relation $$ \omega = \frac{1}{2}|\mathbf{k}|^2. $$

Once again, restricting attention to one dimension, the solution to the Schrödinger equation satisfying the initial condition $ \psi(x,0)= \sqrt[4]{2/\pi} \exp\left({-x^2 + ik_0 x}\right)$, representing a wave packet localized in space at the origin, is seen to be $$\begin{align} \psi(x,t) &= \frac{ \sqrt[4]{2/\pi}}{\sqrt{1 + 2it}} e^{-\frac{1}{4}k_0^2} ~ e^{-\frac{1}{1 + 2it}\left(x - \frac{ik_0}{2}\right)^2}\\ &= \frac{ \sqrt[4]{2/\pi}}{\sqrt{1 + 2it}} e^{-\frac{1}{1 + 4t^2}(x - k_0t)^2}~ e^{i \frac{1}{1 + 4t^2}\left((k_0 + 2tx)x - \frac{1}{2}tk_0^2\right)} ~. \end{align} $$

An impression of the dispersive behavior of this wave packet is obtained by looking at the probability density: $$|\psi(x,t)|^2 = \frac{ \sqrt{2/\pi}}{\sqrt{1+4t^2}}~e^{-\frac{2(x-k_0t)^2}{1+4t^2}}~.$$ It is evident that this dispersive wave packet, while moving with constant group velocity $k_{o}$, is delocalizing rapidly: it has a width increasing with time as $\sqrt{ 1 + 4t^{2}} → 2t|undefined$, so eventually it diffuses to an unlimited region of space.

The momentum profile $A(k)$ remains invariant. The probability current is $$j=\rho v = \frac{1}{2i} (\psi^* \nabla \psi - \psi \nabla \psi^*)= \rho \left (k_0+\frac{4t(x-k_0 t)}{1+4t^2}\right ). $$

Gaussian wave packets in quantum mechanics




The above dispersive Gaussian wave packet, unnormalized and just centered at the origin, instead, at $a$=0, can now be written in 3D, now in standard units: $$ \psi(\mathbf{r},0) = e^{-\mathbf{r}\cdot\mathbf{r}/ 2a},$$ where $k$ is a positive real number, the square of the width of the wave packet, $$a = 2\langle \mathbf r \cdot \mathbf r\rangle/3\langle 1\rangle = 2 (\Delta x)^2.$$

The Fourier transform is also a Gaussian in terms of the wavenumber, the k-vector, (with inverse width, $$1/a = 2\langle\mathbf k\cdot \mathbf k\rangle/3\langle 1\rangle = 2 (\Delta p_x/\hbar)^2,$$ so that $$\Delta x \Delta p_x = \hbar/2,$$ i.e., it satisfies the uncertainty relation), $$ \psi(\mathbf{k},0) = (2\pi a)^{3/2} e^{- a \mathbf{k}\cdot\mathbf{k}/2}.$$

Each separate wave only phase-rotates in time, so that the time dependent Fourier-transformed solution is

The inverse Fourier transform is still a Gaussian, but now the parameter $t$ has become complex, and there is an overall normalization factor.

The integral of $Ψ$ over all space is invariant, because it is the inner product of $Ψ$ with the state of zero energy, which is a wave with infinite wavelength, a constant function of space. For any energy eigenstate $η(x)$, the inner product, $$\langle \eta | \psi \rangle = \int \eta(\mathbf{r}) \psi(\mathbf{r})d^3\mathbf{r},$$ only changes in time in a simple way: its phase rotates with a frequency determined by the energy of $η$. When $η$ has zero energy, like the infinite wavelength wave, it doesn't change at all.

The integral $∫&thinsp;|Ψ|^{2}d^{3}r$ is also invariant, which is a statement of the conservation of probability. Explicitly,

in which $\sqrt{a}$ is the width of $P(r)$ at $t = 0$; $r$ is the distance from the origin; the speed of the particle is zero; and the time origin $t = 0$ can be chosen arbitrarily.

The width of the Gaussian is the interesting quantity which can be read off from the probability density, $|Ψ|^{2}$,

$$ \sqrt{a^2 + (\hbar t/m)^2 \over a}.$$ This width eventually grows linearly in time, as $ħt/(m\sqrt{a})$, indicating wave-packet spreading.

For example, if an electron wave packet is initially localized in a region of atomic dimensions (i.e., $10^{−10}$ m) then the width of the packet doubles in about $10^{−16}$ s. Clearly, particle wave packets spread out very rapidly indeed (in free space): For instance, after $1$ ms, the width will have grown to about a kilometer.

This linear growth is a reflection of the (time-invariant) momentum uncertainty: the wave packet is confined to a narrow $Δx = \sqrt{a/2}$, and so has a momentum which is uncertain (according to the uncertainty principle) by the amount $ħ/\sqrt{2a}$, a spread in velocity of $ħ/m\sqrt{2a}$, and thus in the future position by $ħt /m\sqrt{2a}$. The uncertainty relation is then a strict inequality, very far from saturation, indeed! The initial uncertainty $ΔxΔp = ħ/2$ has now increased by a factor of $ħt/ma$ (for large $t$).

2D


A gaussian 2D quantum wave function:

$$\psi(x,y,t)=\psi(x,t) \psi(y,t)$$ $$ \psi (x,t) = (\frac{2a ^2}{\pi}) ^{1/4} \frac{e ^{i \phi}}{(a ^4 + \frac{4 \hbar ^2 t ^2}{m ^2}) ^{1/4}} e ^{i k_0 x} \exp [ - \frac{(x - \frac{\hbar k_0}{m} t ) ^2 }{a ^2 + \frac{2 i \hbar t}{m} } ]$$

where

$$ \phi = - \theta - \frac{\hbar k_0 ^2}{2 m} t$$

$$tg (2 \theta) = \frac{2 \hbar t}{m a ^2}$$

The Airy wave train
In contrast to the above Gaussian wave packet, it has been observed that a particular wave function based on Airy functions, propagates freely without envelope dispersion, maintaining its shape. It accelerates undistorted in the absence of a force field: $$\psi = \operatorname{Ai}(B(x-B^3t^2)) \, e^{iB^3t\left (x-\tfrac{2}{3}B^3t^2\right )} \, .$$ (For simplicity, $ħ = 1$, $m = 1/2$, and B is a constant, cf. nondimensionalization.)

Nevertheless, there is no dissonance with Ehrenfest's theorem in this force-free situation, because the state is both non-normalizable and has an undefined (infinite) $⟨x⟩$ for all times. (To the extent that it could be defined, $⟨p⟩ = 0$ for all times, despite the apparent acceleration of the front.)

In phase space, this is evident in the pure state Wigner quasiprobability distribution of this wavetrain, whose shape in x and p is invariant as time progresses, but whose features accelerate to the right, in accelerating parabolas $B(x − B^{3}t^{2}) + (p/B − tB^{2})^{2} = 0$, $$W(x,p;t) = W(x-B^3 t^2, p-B^3 t ;0) = {1\over 2^{1/3} \pi B} ~ \mathrm{Ai} \left(2^{2/3} \left(Bx + {p^2\over B^2}- 2Bpt\right)\right). $$

Note the momentum distribution obtained by integrating over all $a$ is constant. Since this is the probability density in momentum space, it is evident that the wave function itself is not normalizable.

Free propagator
The narrow-width limit of the Gaussian wave packet solution discussed is the free propagator kernel $a$. For other differential equations, this is usually called the Green's function, but in quantum mechanics it is traditional to reserve the name Green's function for the time Fourier transform of $x$.

Returning to one dimension for simplicity, with m and ħ set equal to one, when $K$ is the infinitesimal quantity $K$, the Gaussian initial condition, rescaled so that its integral is one, $$ \psi_0(x) = {1\over \sqrt{2\pi \varepsilon} } e^{-{x^2\over 2\varepsilon}} \,$$ becomes a delta function, $δ(x)$, so that its time evolution, $$ K_t(x) = {1\over \sqrt{2\pi (i t + \varepsilon)}} e^{ - x^2 \over 2it+\varepsilon }\,$$ yields the propagator.

Note that a very narrow initial wave packet instantly becomes infinitely wide, but with a phase which is more rapidly oscillatory at large values of x. This might seem strange—the solution goes from being localized at one point to being "everywhere" at all later times, but it is a reflection of the enormous momentum uncertainty of a localized particle, as explained above.

Further note that the norm of the wave function is infinite, which is also correct, since the square of a delta function is divergent in the same way.

The factor involving $a$ is an infinitesimal quantity which is there to make sure that integrals over $ε$ are well defined. In the limit that $ε → 0$, $ε$ becomes purely oscillatory, and integrals of $K$ are not absolutely convergent. In the remainder of this section, it will be set to zero, but in order for all the integrations over intermediate states to be well defined, the limit ε→0 is to be only taken after the final state is calculated.

The propagator is the amplitude for reaching point x at time t, when starting at the origin, x=0. By translation invariance, the amplitude for reaching a point x when starting at point y is the same function, only now translated, $$ K_t(x,y) = K_t(x-y) = {1\over \sqrt{2\pi it}} e^{i(x-y)^2 \over 2t} \, .$$

In the limit when t is small, the propagator goes to a delta function $$ \lim_{t \to 0} K_t(x-y) = \delta(x-y) ~,$$ but only in the sense of distributions: The integral of this quantity multiplied by an arbitrary differentiable test function gives the value of the test function at zero.

To see this, note that the integral over all space of $K$ equals 1 at all times, $$ \int K_t(x) dx = 1 \, ,$$ since this integral is the inner-product of K with the uniform wave function. But the phase factor in the exponent has a nonzero spatial derivative everywhere except at the origin, and so when the time is small there are fast phase cancellations at all but one point. This is rigorously true when the limit ε→0 is taken at the very end.

So the propagation kernel is the (future) time evolution of a delta function, and it is continuous, in a sense: it goes to the initial delta function at small times. If the initial wave function is an infinitely narrow spike at position $K$, $$ \psi_0(x) = \delta(x - y) \, ,$$ it becomes the oscillatory wave, $$ \psi_t(x) = {1\over \sqrt{2\pi i t}} e^{ i (x-y) ^2 /2t} \, .$$

Now, since every function can be written as a weighted sum of such narrow spikes, $$ \psi_0(x) = \int \psi_0(y) \delta(x-y) dy \, ,$$ the time evolution of every function $K$0 is determined by this propagation kernel $y$,

Thus, this is a formal way to express the fundamental solution or general solution. The interpretation of this expression is that the amplitude for a particle to be found at point $ψ$ at time $K$ is the amplitude that it started at $x$, times the amplitude that it went from $t$ to $y$, summed over all the possible starting points. In other words, it is a convolution of the kernel $y$ with the arbitrary initial condition $ψ_{0}$, $$ \psi_t = K * \psi_0 \, .$$

Since the amplitude to travel from $x$ to $K$ after a time $x$+$y$' can be considered in two steps, the propagator obeys the composition identity, $$\int K(x-y;t)K(y-z;t')dy = K(x-z;t+t')~ ,$$ which can be interpreted as follows: the amplitude to travel from $t$ to $t$ in time $x$+$z$' is the sum of the amplitude to travel from $t$ to $t$ in time $x$, multiplied by the amplitude to travel from $y$ to $t$ in time $y$', summed over all possible intermediate states y. This is a property of an arbitrary quantum system, and by subdividing the time into many segments, it allows the time evolution to be expressed as a path integral.

Analytic continuation to diffusion
The spreading of wave packets in quantum mechanics is directly related to the spreading of probability densities in diffusion. For a particle which is randomly walking, the probability density function at any point satisfies the diffusion equation (also see the heat equation), $$ {\partial \over \partial t} \rho = {1\over 2} {\partial^2 \over \partial x^2 } \rho ~,$$ where the factor of 2, which can be removed by rescaling either time or space, is only for convenience.

A solution of this equation is the spreading Gaussian, $$ \rho_t(x) = {1\over \sqrt{2\pi t}} e^{-x^2 \over 2t} ~,$$ and, since the integral of ρt is constant while the width is becoming narrow at small times, this function approaches a delta function at t=0, $$ \lim_{t \to 0} \rho_t(x) = \delta(x) $$ again only in the sense of distributions, so that $$ \lim_{t \to 0} \int_x f(x) \rho_t(x) = f(0) $$ for any smooth test function $z$.

The spreading Gaussian is the propagation kernel for the diffusion equation and it obeys the convolution identity, $$ K_{t+t'} = K_{t}*K_{t'} \, ,$$ which allows diffusion to be expressed as a path integral. The propagator is the exponential of an operator $t$, $$ K_t(x) = e^{-tH} \, ,$$ which is the infinitesimal diffusion operator, $$ H= -{\nabla^2\over 2} \, .$$

A matrix has two indices, which in continuous space makes it a function of $f$ and $H$'. In this case, because of translation invariance, the matrix element $x$ only depend on the difference of the position, and a convenient abuse of notation is to refer to the operator, the matrix elements, and the function of the difference by the same name: $$ K_t(x,x') = K_t(x-x') \, .$$

Translation invariance means that continuous matrix multiplication, $$ C(x,x) = \int_{x'} A(x,x')B(x',x) \, ,$$ is essentially convolution, $$ C(\Delta) = C(x-x) = \int_{x'} A(x-x') B(x'-x) = \int_{y} A(\Delta-y)B(y) \, .$$

The exponential can be defined over a range of ts which include complex values, so long as integrals over the propagation kernel stay convergent, $$ K_z(x) = e^{-zH} \, .$$ As long as the real part of $x$ is positive, for large values of $K$, $z$ is exponentially decreasing, and integrals over $x$ are indeed absolutely convergent.

The limit of this expression for $K$ approaching the pure imaginary axis is the above Schrödinger propagator encountered, $$ K_t^{\rm Schr} = K_{it+\varepsilon} = e^{-(it+\varepsilon)H} \, ,$$ which illustrates the above time evolution of Gaussians.

From the fundamental identity of exponentiation, or path integration, $$ K_z * K_{z'} = K_{z+z'} \,$$ holds for all complex z values, where the integrals are absolutely convergent so that the operators are well defined.

Thus, quantum evolution of a Gaussian, which is the complex diffusion kernel K, $$ \psi_0(x) = K_a(x) = K_a * \delta(x) \,$$ amounts to the time-evolved state, $$ \psi_t = K_{it} * K_a = K_{a+it} \, .$$

This illustrates the above diffusive form of the complex Gaussian solutions, $$ \psi_t(x) = {1\over \sqrt{2\pi (a+it)} } e^{- {x^2\over 2(a+it)} } \, .$$