Inverted pendulum



An inverted pendulum is a pendulum that has its center of mass above its pivot point. It is unstable and falls over without additional help. It can be suspended stably in this inverted position by using a control system to monitor the angle of the pole and move the pivot point horizontally back under the center of mass when it starts to fall over, keeping it balanced. The inverted pendulum is a classic problem in dynamics and control theory and is used as a benchmark for testing control strategies. It is often implemented with the pivot point mounted on a cart that can move horizontally under control of an electronic servo system as shown in the photo; this is called a cart and pole apparatus. Most applications limit the pendulum to 1 degree of freedom by affixing the pole to an axis of rotation. Whereas a normal pendulum is stable when hanging downward, an inverted pendulum is inherently unstable, and must be actively balanced in order to remain upright; this can be done either by applying a torque at the pivot point, by moving the pivot point horizontally as part of a feedback system, changing the rate of rotation of a mass mounted on the pendulum on an axis parallel to the pivot axis and thereby generating a net torque on the pendulum, or by oscillating the pivot point vertically. A simple demonstration of moving the pivot point in a feedback system is achieved by balancing an upturned broomstick on the end of one's finger.

A second type of inverted pendulum is a tiltmeter for tall structures, which consists of a wire anchored to the bottom of the foundation and attached to a float in a pool of oil at the top of the structure that has devices for measuring movement of the neutral position of the float away from its original position.

Overview
A pendulum with its bob hanging directly below the support pivot is at a stable equilibrium point, where it remains motionless because there is no torque on the pendulum. If displaced from this position, it experiences a restoring torque that returns it toward the equilibrium position. A pendulum with its bob in an inverted position, supported on a rigid rod directly above the pivot, 180° from its stable equilibrium position, is at an unstable equilibrium point. At this point again there is no torque on the pendulum, but the slightest displacement away from this position causes a gravitation torque on the pendulum that accelerates it away from equilibrium, causing it to fall over. In order to stabilize a pendulum in this inverted position, a feedback control system can be used, which monitors the pendulum's angle and moves the position of the pivot point sideways when the pendulum starts to fall over, to keep it balanced. The inverted pendulum is a classic problem in dynamics and control theory and is widely used as a benchmark for testing control algorithms (PID controllers, state-space representation, neural networks, fuzzy control, genetic algorithms, etc.). Variations on this problem include multiple links, allowing the motion of the cart to be commanded while maintaining the pendulum, and balancing the cart-pendulum system on a see-saw. The inverted pendulum is related to rocket or missile guidance, where the center of gravity is located behind the center of drag causing aerodynamic instability. The understanding of a similar problem can be shown by simple robotics in the form of a balancing cart. Balancing an upturned broomstick on the end of one's finger is a simple demonstration, and the problem is solved by self-balancing personal transporters such as the Segway PT, the self-balancing hoverboard and the self-balancing unicycle.

Another way that an inverted pendulum may be stabilized, without any feedback or control mechanism, is by oscillating the pivot rapidly up and down. This is called Kapitza's pendulum. If the oscillation is sufficiently strong (in terms of its acceleration and amplitude) then the inverted pendulum can recover from perturbations in a strikingly counterintuitive manner. If the driving point moves in simple harmonic motion, the pendulum's motion is described by the Mathieu equation.

Equations of motion
The equations of motion of inverted pendulums are dependent on what constraints are placed on the motion of the pendulum. Inverted pendulums can be created in various configurations resulting in a number of Equations of Motion describing the behavior of the pendulum.

Stationary pivot point
In a configuration where the pivot point of the pendulum is fixed in space, the equation of motion is similar to that for an uninverted pendulum. The equation of motion below assumes no friction or any other resistance to movement, a rigid massless rod, and the restriction to 2-dimensional movement.


 * $$\ddot \theta - {g \over \ell} \sin \theta = 0$$

Where $$\ddot \theta $$ is the angular acceleration of the pendulum, $$g $$ is the standard gravity on the surface of the Earth, $$\ell$$ is the length of the pendulum, and $$\theta$$ is the angular displacement measured from the equilibrium position.

When $$\ddot \theta $$ added to both sides, it has the same sign as the angular acceleration term:


 * $$\ddot \theta = {g \over \ell} \sin \theta$$

Thus, the inverted pendulum accelerates away from the vertical unstable equilibrium in the direction initially displaced, and the acceleration is inversely proportional to the length. Tall pendulums fall more slowly than short ones.

Derivation using torque and moment of inertia:

The pendulum is assumed to consist of a point mass, of mass $$m $$, affixed to the end of a massless rigid rod, of length $$\ell$$, attached to a pivot point at the end opposite the point mass.

The net torque of the system must equal the moment of inertia times the angular acceleration:
 * $$\boldsymbol{\tau}_{\mathrm{net}}=I \ddot \theta$$

The torque due to gravity providing the net torque:
 * $$\boldsymbol{\tau}_{\mathrm{net}}= m g \ell \sin \theta\,\!$$

Where $$ \theta\ $$ is the angle measured from the inverted equilibrium position.

The resulting equation:
 * $$ I \ddot \theta= m g \ell \sin \theta\,\!$$

The moment of inertia for a point mass:
 * $$I = m R^2$$

In the case of the inverted pendulum the radius is the length of the rod, $$ \ell $$.

Substituting in $$I = m \ell ^2$$
 * $$ m \ell ^2 \ddot \theta= m g \ell \sin \theta\,\!$$

Mass and $$\ell^2$$ is divided from each side resulting in:
 * $$\ddot \theta = {g \over \ell} \sin \theta$$

Inverted pendulum on a cart
An inverted pendulum on a cart consists of a mass $$ m $$ at the top of a pole of length $$ \ell $$ pivoted on a horizontally moving base as shown in the adjacent image. The cart is restricted to linear motion and is subject to forces resulting in or hindering motion.

Essentials of stabilization
The essentials of stabilizing the inverted pendulum can be summarized qualitatively in three steps.

1.	If the tilt angle $$ \theta $$ is to the right, the cart must accelerate to the right and vice versa.

2.	The position of the cart $$ x $$ relative to track center is stabilized by slightly modulating the null angle (the angle error that the control system tries to null) by the position of the cart, that is, null angle $$ = \theta + k x $$ where $$ k $$ is small. This makes the pole want to lean slightly toward track center and stabilize at track center where the tilt angle is exactly vertical. Any offset in the tilt sensor or track slope that would otherwise cause instability translates into a stable position offset. A further added offset gives position control.

3.	A normal pendulum subject to a moving pivot point such as a load lifted by a crane, has a peaked response at the pendulum radian frequency of $$ \omega_p = \sqrt {g/\ell} $$. To prevent uncontrolled swinging, the frequency spectrum of the pivot motion should be suppressed near $$ \omega_p $$. The inverted pendulum requires the same suppression filter to achieve stability.

As a consequence of the null angle modulation strategy, the position feedback is positive, that is, a sudden command to move right produces an initial cart motion to the left followed by a move right to rebalance the pendulum. The interaction of the pendulum instability and the positive position feedback instability to produce a stable system is a feature that makes the mathematical analysis an interesting and challenging problem.

From Lagrange's equations
The equations of motion can be derived using Lagrange's equations. We refer to the drawing to the right where $$\theta(t)$$ is the angle of the pendulum of length $$l$$ with respect to the vertical direction and the acting forces are gravity and an external force F in the x-direction. Define $$x(t)$$ to be the position of the cart.

The kinetic energy $$T$$ of the system is:



T = \frac{1}{2} M v_1^2 + \frac{1}{2} m v_2^2, $$

where $$v_1$$ is the velocity of the cart and $$v_2$$ is the velocity of the point mass $$m$$. $$v_1$$ and $$v_2$$ can be expressed in terms of x and $$\theta$$ by writing the velocity as the first derivative of the position;

v_1^2=\dot x^2, $$

v_2^2=\left({\frac{\rm d}{{\rm d}t}}{\left(x- \ell\sin\theta\right)}\right)^2 + \left({\frac{\rm d}{{\rm d}t}}{\left( \ell\cos\theta \right)}\right)^2. $$ Simplifying the expression for $$v_2$$ leads to:

v_2^2= \dot x^2 -2 \ell \dot x \dot \theta\cos \theta + \ell^2\dot \theta^2. $$

The kinetic energy is now given by:

T = \frac{1}{2} \left(M+m \right ) \dot x^2 -m \ell \dot x \dot\theta\cos\theta + \frac{1}{2} m \ell^2 \dot \theta^2. $$

The generalized coordinates of the system are $$\theta$$ and $$x$$, each has a generalized force. On the $$x$$ axis, the generalized force $$Q_x$$ can be calculated through its virtual work



Q_x\delta x=F \delta x,\quad Q_x=F, $$

on the $$\theta$$ axis, the generalized force $$Q_\theta$$ can be also calculated through its virtual work



Q_\theta\delta\theta=mgl\sin\theta \delta \theta,\quad Q_\theta=mgl\sin\theta. $$

According to the Lagrange's equations, the equations of motion are:

\frac{\mathrm{d}}{\mathrm{d}t}{\partial{T}\over \partial{\dot x}} - {\partial{T}\over \partial x} = F, $$

\frac{\mathrm{d}}{\mathrm{d}t}{\partial{T}\over \partial{\dot \theta}} - {\partial{T}\over \partial \theta} = mgl\sin\theta, $$ substituting $$T$$ in these equations and simplifying leads to the equations that describe the motion of the inverted pendulum:



\left ( M + m \right ) \ddot x - m \ell \ddot \theta \cos \theta + m \ell \dot \theta^2 \sin \theta = F, $$



\ell \ddot \theta - g \sin \theta = \ddot x \cos \theta. $$

These equations are nonlinear, but since the goal of a control system would be to keep the pendulum upright the equations can be linearized around $$\theta \approx 0$$.

From Euler-Lagrange equations
The generalized forces can be both written as potential energy $$V_x$$ and $$V_\theta$$,

According to the D'Alembert's principle, generalized forces and potential energy are connected:


 * $$Q_j = \frac{\mathrm{d}}{\mathrm{d}t}\frac{\partial V}{\partial \dot{q}_j} - \frac{\partial V}{\partial q_j} \,,$$

However, under certain circumstances, the potential energy is not accessible, only generalized forces are available.

After getting the Lagrangian $$L=T-V$$, we can also use Euler–Lagrange equation to solve for equations of motion:


 * $$\frac {\partial L}{\partial x} - \frac{\mathrm{d}}{\mathrm{d}t} \left ( \frac {\partial L}{\partial \dot{x}} \right ) = 0$$,
 * $$\frac {\partial L}{\partial \theta} - \frac{\mathrm{d}}{\mathrm{d}t} \left ( \frac {\partial L}{\partial \dot{\theta}} \right ) = 0$$.

The only difference is whether to incorporate the generalized forces into the potential energy $$V_j$$ or write them explicitly as $$Q_j$$ on the right side, they all lead to the same equations in the final.

From Newton's second law
Oftentimes it is beneficial to use Newton's second law instead of Lagrange's equations because Newton's equations give the reaction forces at the joint between the pendulum and the cart. These equations give rise to two equations for each body; one in the x-direction and the other in the y-direction. The equations of motion of the cart are shown below where the LHS is the sum of the forces on the body and the RHS is the acceleration.



F-R_x = M \ddot x $$

F_N - R_y - M g = 0 $$

In the equations above $$R_x$$ and $$R_y$$ are reaction forces at the joint. $$F_N$$ is the normal force applied to the cart. This second equation depends only on the vertical reaction force, thus the equation can be used to solve for the normal force. The first equation can be used to solve for the horizontal reaction force. In order to complete the equations of motion, the acceleration of the point mass attached to the pendulum must be computed. The position of the point mass can be given in inertial coordinates as



\vec r_P = (x-\ell \sin \theta) \hat x_I + \ell \cos \theta \hat y_I $$

Taking two derivatives yields the acceleration vector in the inertial reference frame.



\vec a_{P/I} = (\ddot x + \ell \dot \theta^2 \sin \theta - \ell \ddot \theta \cos \theta ) \hat x_I + (-\ell \dot \theta^2 \cos \theta  - \ell \ddot \theta \sin \theta) \hat y_I $$

Then, using Newton's second law, two equations can be written in the x-direction and the y-direction. Note that the reaction forces are positive as applied to the pendulum and negative when applied to the cart. This is due to Newton's Third Law.



R_x = m(\ddot x + \ell \dot \theta^2 \sin \theta - \ell \ddot \theta \cos \theta ) $$

R_y - m g = m (-\ell \dot \theta^2 \cos \theta - \ell \ddot \theta \sin \theta) $$

The first equation allows yet another way to compute the horizontal reaction force in the event the applied force $$F$$ is not known. The second equation can be used to solve for the vertical reaction force. The first equation of motion is derived by substituting $$F-R_x = M \ddot x$$ into $$R_x = m(\ddot x + \ell \dot \theta^2 \sin \theta - \ell \ddot \theta \cos \theta )$$, which yields



\left (M+m \right) \ddot x - m \ell \ddot \theta \cos \theta  + m \ell \dot \theta^2 \sin \theta  = F $$

By inspection this equation is identical to the result from Lagrange's Method. In order to obtain the second equation, the pendulum equation of motion must be dotted with a unit vector that runs perpendicular to the pendulum at all times and is typically noted as the x-coordinate of the body frame. In inertial coordinates this vector can be written using a simple 2-D coordinate transformation



\hat x_B = \cos \theta \hat x_I + \sin \theta \hat y_I $$

The pendulum equation of motion written in vector form is $$\sum \vec F = m \vec a_{P/I}$$. Dotting $$\hat x_B$$ with both sides yields the following on the LHS (note that a transpose is the same as a dot product)



(\hat x_B)^T\sum \vec F = (\hat x_B)^T (R_x \hat x_I + R_y \hat y_I - m g \hat y_I) = (\hat x_B)^T(R_p \hat y_B - m g \hat y_I) = -m g \sin \theta $$

In the above equation the relationship between body frame components of the reaction forces and inertial frame components of reaction forces is used. The assumption that the bar connecting the point mass to the cart is massless implies that this bar cannot transfer any load perpendicular to the bar. Thus, the inertial frame components of the reaction forces can be written simply as $$R_p \hat y_B$$, which signifies that the bar can transfer loads only along the axis of the bar itself. This gives rise to another equation that can be used to solve for the tension in the rod itself:



R_p = \sqrt{ R_x^2 + R_y^2} $$

The RHS of the equation is computed similarly by dotting $$\hat x_B$$ with the acceleration of the pendulum. The result (after some simplification) is shown below.



m(\hat x_B)^T(\vec a_{P/I}) = m(\ddot x \cos \theta - \ell \ddot \theta) $$

Combining the LHS with the RHS and dividing through by m yields



\ell \ddot \theta - g \sin \theta = \ddot x \cos \theta $$

which again is identical to the result of Lagrange's method. The benefit of using Newton's method is that all reaction forces are revealed to ensure that nothing is damaged.

Variants
Achieving stability of an inverted pendulum has become a common engineering challenge for researchers. There are different variations of the inverted pendulum on a cart ranging from a rod on a cart to a multiple segmented inverted pendulum on a cart. Another variation places the inverted pendulum's rod or segmented rod on the end of a rotating assembly. In both, (the cart and rotating system) the inverted pendulum can fall only in a plane. The inverted pendulums in these projects can either be required to maintain balance only after an equilibrium position is achieved, or can achieve equilibrium by itself. Another platform is a two-wheeled balancing inverted pendulum. The two wheeled platform has the ability to spin on the spot offering a great deal of maneuverability. Yet another variation balances on a single point. A spinning top, a unicycle, or an inverted pendulum atop a spherical ball all balance on a single point.

Kapitza's pendulum
An inverted pendulum in which the pivot is oscillated rapidly up and down can be stable in the inverted position. This is called Kapitza's pendulum, after Russian physicist Pyotr Kapitza who first analysed it. The equation of motion for a pendulum connected to a massless, oscillating base is derived the same way as with the pendulum on the cart. The position of the point mass is now given by:
 * $$\left( -\ell \sin \theta, y + \ell \cos \theta   \right)$$

and the velocity is found by taking the first derivative of the position:
 * $$v^2=\dot y^2-2 \ell \dot y \dot \theta \sin \theta + \ell^2\dot \theta ^2.$$[[Image:Inverted pendulum oscillatory base.svg|thumb|right|500px|Plots for the inverted pendulum on an oscillatory base. The first plot shows the response of the pendulum on a slow oscillation, the second the response on a fast oscillation]]

The Lagrangian for this system can be written as:

L = \frac{1 }{2} m \left ( \dot y^2-2 \ell \dot y \dot \theta \sin \theta + \ell^2\dot \theta ^2   \right) - m g \left( y + \ell \cos \theta  \right ) $$ and the equation of motion follows from:

{\mathrm{d} \over \mathrm{d}t}{\partial{L}\over \partial{\dot \theta}} - {\partial{L}\over \partial \theta} = 0 $$ resulting in:

\ell \ddot \theta - \ddot y \sin \theta = g \sin \theta. $$ If y represents a simple harmonic motion, $$y = A \sin \omega t$$, the following differential equation is:

\ddot \theta - {g \over \ell} \sin \theta = -{A \over \ell} \omega^2 \sin \omega t \sin \theta. $$

This equation does not have elementary closed-form solutions, but can be explored in a variety of ways. It is closely approximated by the Mathieu equation, for instance, when the amplitude of oscillations are small. Analyses show that the pendulum stays upright for fast oscillations. The first plot shows that when $$y$$ is a slow oscillation, the pendulum quickly falls over when disturbed from the upright position. The angle $$\theta$$ exceeds 90° after a short time, which means the pendulum has fallen on the ground. If $$y$$ is a fast oscillation the pendulum can be kept stable around the vertical position. The second plot shows that when disturbed from the vertical position, the pendulum now starts an oscillation around the vertical position ($$\theta = 0$$). The deviation from the vertical position stays small, and the pendulum doesn't fall over.

Examples
Arguably the most prevalent example of a stabilized inverted pendulum is a human being. A person standing upright acts as an inverted pendulum with their feet as the pivot, and without constant small muscular adjustments would fall over. The human nervous system contains an unconscious feedback control system, the sense of balance or righting reflex, that uses proprioceptive input from the eyes, muscles and joints, and orientation input from the vestibular system consisting of the three semicircular canals in the inner ear, and two otolith organs, to make continual small adjustments to the skeletal muscles to keep us standing upright. Walking, running, or balancing on one leg puts additional demands on this system. Certain diseases and alcohol or drug intoxication can interfere with this reflex, causing dizziness and disequilibration, an inability to stand upright. A field sobriety test used by police to test drivers for the influence of alcohol or drugs, tests this reflex for impairment.

Some simple examples include balancing brooms or meter sticks by hand. The inverted pendulum has been employed in various devices and trying to balance an inverted pendulum presents a unique engineering problem for researchers. The inverted pendulum was a central component in the design of several early seismometers due to its inherent instability resulting in a measurable response to any disturbance.

The inverted pendulum model has been used in some recent personal transporters, such as the two-wheeled self-balancing scooters and single-wheeled electric unicycles. These devices are kinematically unstable and use an electronic feedback servo system to keep them upright.

Swinging a pendulum on a cart into its inverted pendulum state is considered a traditional optimal control toy problem/benchmark.