Ostrogradsky instability

In applied mathematics, the Ostrogradsky instability is a feature of some solutions of theories having equations of motion with more than two time derivatives (higher-derivative theories). It is suggested by a theorem of Mikhail Ostrogradsky in classical mechanics according to which a non-degenerate Lagrangian dependent on time derivatives higher than the first corresponds to a Hamiltonian unbounded from below. As usual, the Hamiltonian is associated with the Lagrangian via a Legendre transform. The Ostrogradsky instability has been proposed as an explanation as to why no differential equations of higher order than two appear to describe physical phenomena. However, Ostrogradsky's theorem does not imply that all solutions of higher-derivative theories are unstable as many counterexamples are known.

==Outline of proof ==

The main points of the proof can be made clearer by considering a one-dimensional system with a Lagrangian $$L(q,{\dot q}, {\ddot q})$$. The Euler–Lagrange equation is


 * $$ \frac{\partial L}{\partial q} - \frac{d}{dt} \frac{\partial L}{\partial{\dot q}}+ \frac{d^2}{dt^2}\frac{\partial L}{\partial{\ddot q}} = 0.$$

Non-degeneracy of $$L$$ means that the canonical coordinates can be expressed in terms of the derivatives of $${q}$$ and vice versa. Thus, $$\partial L/\partial{\ddot q}$$ is a function of $${\ddot q}$$ (if it were not, the Jacobian $$\det[\partial^2 L/(\partial{\ddot q_i}\, \partial{\ddot q}_j)]$$ would vanish, which would mean that $$L$$ is degenerate), meaning that we can write $$q^{(4)} = F(q,{\dot q}, {\ddot q}, q^{(3)})$$ or, inverting, $$q = G(t, q_0, {\dot q}_0, {\ddot q}_0, q^{(3)}_0)$$. Since the evolution of $$q$$ depends upon four initial parameters, this means that there are four canonical coordinates. We can write those as


 * $$Q_1 : = q$$
 * $$Q_2 : = {\dot q}$$

and by using the definition of the conjugate momentum,


 * $$P_1 : = \frac{\partial L}{\partial{\dot q}} - \frac{d}{dt} \frac{\partial L}{\partial{\ddot q}}$$
 * $$P_2 : = \frac{\partial L}{\partial{\ddot q}} $$

The above results can be obtained as follows. First, we rewrite the Lagrangian into "ordinary" form by introducing a Lagrangian multiplier as a new dynamic variable $$\lambda$$
 * $$L(q,\dot{q},\ddot{q})\to \tilde{L}=L(Q_1,\dot{Q_1},\dot{Q_2})-\lambda(Q_2-\dot{Q_1})$$,

from which, the Euler-Lagrangian equations for $$Q_1,Q_2,\lambda$$ read
 * $$Q_1:\frac{d}{dt}\frac{\partial L}{\partial \dot{Q_1}}+\dot{\lambda}-\frac{\partial L}{\partial Q_1}=0$$,
 * $$Q_2:\frac{d}{dt}\frac{\partial L}{\partial \dot{Q_2}}+{\lambda}=0$$,
 * $$\lambda:Q_2-\dot{Q_1}=0$$,

Now, the canonical momentum $$P_1,P_2$$ with respect to $$\tilde{L}$$ are readily shown to be
 * $$P_1=\frac{\partial\tilde{L}}{\partial\dot{Q_1}}=\frac{\partial L}{\partial\dot{Q_1}}+\lambda=\frac{\partial L}{\partial\dot{Q_1}}-\frac{d}{dt}\frac{\partial L}{\partial\dot{Q_2}}$$
 * $$P_2=\frac{\partial\tilde{L}}{\partial\dot{Q_2}}=\frac{\partial {L}}{\partial\dot{Q_2}}$$

while
 * $$P_\lambda=0$$

These are precisely the definitions given above by Ostrogradski. One may proceed further to evaluate the Hamiltonian
 * $$\tilde{H}=P_1\dot{Q_1}+P_2\dot{Q_2}+p_\lambda\dot{\lambda}-\tilde{L}=P_1Q_2+P_2\dot{Q_2}-{L}$$,

where one makes use of the above Euler-Lagrangian equations for the second equality. We note that due to non-degeneracy, we can write $$ {\ddot q}=\dot{Q_2}$$ as $$a(Q_1, Q_2, P_2)$$. Here, only three arguments are needed since the Lagrangian itself only has three free parameters. Therefore, the last expression only depends on $$P_1,P_2,Q_1,Q_2$$, it effectively serves as the Hamiltonian of the original theory, namely,


 * $$H = P_1 Q_2 + P_2 a(Q_1, Q_2, P_2) - L(Q_1,Q_2,P_2) $$.

We now notice that the Hamiltonian is linear in $$P_1$$. This is a source of the Ostrogradsky instability, and it stems from the fact that the Lagrangian depends on fewer coordinates than there are canonical coordinates (which correspond to the initial parameters needed to specify the problem). The extension to higher dimensional systems is analogous, and the extension to higher derivatives simply means that the phase space is of even higher dimension than the configuration space.