$$\require{cancel}$$

# 15.5: Action-angle Variables

## Canonical transformation

Systems possessing periodic solutions are a ubiquitous feature in physics. The periodic motion can be either an oscillation, for which the trajectory in phase space is a closed loop (libration), or rolling (rotational) motion as discussed in chapter $$3.4$$. For many problems involving periodic motion, the interest often lies in the frequencies of motion rather than the detailed shape of the trajectories in phase space. The action-angle variable approach uses a canonical transformation to action and angle variables which provide a powerful, and elegant method to exploit Hamiltonian mechanics. In particular, it can determine the frequencies of periodic motion without having to calculate the exact trajectories for the motion. This method was introduced by the French astronomer Ch. E. Delaunay(1816 − 1872) for applications to orbits in celestial mechanics, but it has equally important applications beyond celestial mechanics such as to bound solutions of the atom in quantum mechanics.

The action-angle method replaces the momenta in the Hamilton-Jacobi procedure by the action phase integral for the closed loop (libration) trajectory in phase space defined by

$J_i \equiv \oint p_idq_i \label{15.116}$

where for each cyclic variable the integral is taken over one complete period of oscillation. The cyclic variable $$I_i$$ is called the action variable where

$I_i \equiv \frac{1}{ 2\pi} J_i = \frac{1}{ 2\pi} \oint p_idq_i \label{15.117}$

The canonical variable to the action variable $$\mathbf{I}$$ is the angle variable $$\boldsymbol{\phi}$$. Note that the name “action variable” is used to differentiate $$\mathbf{I}$$ from the action functional $$S = \int Ldt$$ which has the same units; i.e. angular momentum.

The general principle underlying the use of action-angle variables is illustrated by considering one body, of mass $$m$$, subject to a one-dimensional bound conservative potential energy $$U(q)$$. The Hamiltonian is given by

$H(p,q) = \frac{p^2}{ 2m} + U(q) \label{15.118}$

This bound system has a $$(q,p)$$ phase space contour for each energy $$H = E$$.

$p(q,E) = \pm \sqrt{2m(E − U(q))} \label{15.119}$

For an oscillatory system the two-valued momentum of Equation \ref{15.119} is non-trivial to handle. By contrast, the area $$J \equiv \oint pdq$$ of the closed loop in phase space is a single-valued scalar quantity that depends on $$E$$ and $$U(q)$$. Moreover, Liouville’s theorem states that the area of the closed contour in phase space $$J \equiv \oint pdq$$ is invariant to canonical transformations. These facts suggest the use of a new pair of conjugate variables, $$(\phi , I)$$, where $$I(E)$$ uniquely labels the trajectory, and corresponding area, of a closed loop in phase space for each value of $$E$$, and the single-valued function $$\phi$$ is a corresponding angle that specifies the exact point along the phase-space contour as illustrated in Fig $$\PageIndex{1}$$.

For simplicity consider the linear harmonic oscillator where

$U(q) = \frac{1}{ 2} m\omega^2q^2 \label{15.120}$

Then the Hamiltonian, \ref{15.118} equals

$H(p,q) = \frac{p^2 }{2m} + \frac{1}{ 2} m\omega^2q^2 \label{15.121}$

Hamilton’s equations of motion give that

$\dot{p} = −\frac{\partial H}{ \partial q} = −m\omega^2q \label{15.122}$

$\dot{q} = \frac{\partial H}{ \partial p} = \frac{p}{ m} \label{15.123}$

The solution of equations \ref{15.122} and \ref{15.123} is of the form

$q = C \cos(\omega (t − t_0)) \label{15.124}$

$p = −m\omega C \sin \omega (t − t_0) \label{15.125}$

where $$C$$, and $$t_0$$ are integration constants. For the harmonic oscillator, equations \ref{15.124} and \ref{15.125} correspond to the usual elliptical contours in phase space, as illustrated in Figure $$\PageIndex{1}$$.

The action-angle canonical transformation involves making the transform

$(q,p) \rightarrow (\phi , I) \label{15.126}$

where $$I$$ is defined by Equation \ref{15.117} and the angle $$\phi$$ being the corresponding canonical angle. The logical approach to this canonical transformation for the harmonic oscillator is to define $$q$$ and $$p$$ in terms of $$\phi$$ and $$I$$

$q = \sqrt{\frac{ 2I}{ m\omega}} \cos \phi \label{15.127}$

$p = \sqrt{2mI\omega } \sin \phi \label{15.128}$

Note that the Poisson bracket is unity

$[q, p]_{(\phi , I)} = 1 \nonumber$

which implies that the above transformation is canonical, and thus the phase space area $$I(E) \equiv \frac{1}{ 2\pi} \oint pdq$$ is conserved.

For this canonical transformation the transformed Hamiltonian $$\mathcal{H} (\phi , I)$$ is

$\mathcal{H} (\phi , I) = \frac{1}{ 2m } (2m\omega I) \sin^2 \phi + \frac{1}{ 2 }m\omega^2 \frac{2I}{ m\omega } \cos^2 \phi = \omega I \label{15.129}$

Note that this Hamiltonian is a constant that is independent of the angle $$\phi$$, and thus Hamilton’s equations of motion give

$\dot{ I} = −\frac{\partial \mathcal{H} (\phi , I)}{ \partial \phi} = 0 \label{15.130}$

$\dot{\phi} = \frac{\partial \mathcal{H} (\phi , I) }{\partial I} = \omega \label{15.131}$

Thus we have mapped the harmonic oscillator to new coordinates $$(\phi , I)$$ where

$I = \frac{\mathcal{H} (\phi , I)}{ \omega} = \frac{E }{\omega} \label{15.132}$

$\phi = \omega (t − t_0) \label{15.133}$

That is, the phase space has been mapped from ellipses, with area proportional to $$E$$ in the $$(q,p)$$ phase space, to a cylindrical $$(\phi , I)$$ phase space where $$I = \frac{E}{\omega}$$ are constant values that are independent of the angle, while $$\phi$$ increases linearly with time. Thus the variables $$(q,p)$$ are periodic with modulus $$\Delta\phi = 2\pi$$.

$q(\phi + 2\pi, I) = q (\phi , I) \label{15.134}$

$p(\phi + 2\pi, I) = p (\phi , I) \label{15.135}$

The period $$\tau$$ of the periodic oscillatory motion is given simply by $$\Delta\phi = 2\pi = \omega \tau$$ which is the well known result for the harmonic oscillator. Note that the action-angle variable canonical transformation has determined the frequency of the periodic motion without solving the detailed trajectory of the motion.

The above example of the harmonic oscillator has shown that, for integrable periodic systems, it is possible to identify a canonical transformation to $$(\phi , I)$$ such that the Hamiltonian is independent of the angle $$\phi$$ which specifies the instantaneous location on the constant energy contour $$I$$. If the phase space contour is a separatrix, then it divides phase space into invariant regions containing phase-space contours with differing behavior. The action-angle variables are not useful for separatrix contours. For rolling motion, the system rotates with continuously increasing, or decreasing angle, and there is no natural boundary for the action angle variable since the phase space trajectory is continuous and not closed. However, the action-angle approach still is valid if the motion involves periodic as well as rolling motion.

The example of the one-dimensional, one-body, harmonic oscillator can be expanded to the more general case for many bodies in three dimensions. This is illustrated by considering multiple periodic systems for which the Hamiltonian is conservative and where the equations of the canonical transformation are separable. The generalized momenta then can be written as

$p_i = \frac{\partial W_i (q_i; \alpha_1, \alpha_2, ..\alpha_n)}{ \partial q_i} \label{15.136}$

for which each $$p_i$$ is a function of $$q_i$$ and the $$n$$ integration constants $$\alpha_j$$

$p_i = p_i (q_i, \alpha_1, \alpha_2, ..\alpha_n) \label{15.137}$

The momentum $$p_i (q_i, \alpha_1, \alpha_2, ..\alpha_n)$$ represents the trajectory of the system in the $$(q_i, p_i)$$ phase space that is characterized by Hamilton’s characteristic function $$W(q,J)$$. Combining equations \ref{15.116}, \ref{15.136} gives

$J_i \equiv \oint \frac{\partial W_i (q_i; \alpha_1, \alpha_2, ..\alpha_n)}{ \partial q_i} dq_i \label{15.138}$

Since $$q_i$$ is merely a variable of integration, each active action variable $$J_i$$ is a function of the $$n$$ constants of integration in the Hamilton-Jacobi equation. Because of the independence of the separable-variable pairs $$(q_i, p_i)$$, the $$J_i$$ form $$n$$ independent functions of the $$\alpha_i$$, and hence are suitable for use as a new set of constant momenta. Thus the characteristic function $$W$$ can be written as

$W (q_1, ...q_n; J_1, ...J_n) = \sum_j W_j (q_j ; J_1, ...J_n) \label{15.139}$

while the Hamiltonian is only a function of the momenta $$H (J_1, .... J_n)$$

The generalized coordinate, conjugate to $$J$$, is known as the angle variable $$\phi_i$$ which is defined by the transformation equation

$\phi_i = \frac{\partial W }{\partial J_i} = \sum^n_{j=1} \frac{\partial W_j (q_j ; J_1, ...J_n)}{ \partial J_i} \label{15.140}$

The corresponding equation of motion for $$\phi$$ is given by

$\dot{\phi}_i = \frac{\partial H(J) }{\partial J_i} = 2\pi\omega_i(J_1, ...J_n) \label{15.141}$

where $$\omega_i(J)$$ are constant functions of the action variables $$J_j$$ with a solution

$\phi_i = 2\pi\omega_it + \beta_i \label{15.142}$

that is, they are linear functions of time. The constants $$\omega_i$$ can be identified with the frequencies of the multiple periodic motions.

The action-angle variables appear to be no different than a particular set of transformed coordinates. Their merit appears when the physical interpretation is assigned to $$\omega_i$$. Consider the change $$\delta \phi_i$$ as the $$q_j$$ are changed infinitesimally

$\delta \phi_i = \sum_j \frac{\partial \phi_i}{ \partial q_j} \partial q_j = \sum_j \frac{\partial^2 W }{\partial J_i\partial q_j } \partial q_j \label{15.143}$

The derivative with respect to $$q_i$$ vanishes except for the $$W_j$$ component of $$W$$. Thus Equation \ref{15.143} reduces to

$\delta \phi_i = \frac{\partial}{ \partial J_i} \sum_j p_j (q_j , J) dq_j \label{15.144}$

Therefore, the total change in $$\phi$$, as the system goes through one complete cycle is

$\Delta\phi_i = \sum_j \frac{\partial}{ \partial J_i} \oint p_j (q_j , J) dq_j = 2\pi\delta_{ij} \label{15.145}$

where $$\frac{\partial }{ \partial J_i}$$ is outside the integral since the $$J_i$$ are constants for cyclic motion. Thus $$\Delta\phi_i = 2\pi = \omega_i\tau_i$$ where $$\tau_i$$ is the period for one cycle of oscillation, where the angular frequency $$\omega_i$$ is given by

$\frac{\omega_i}{ 2\pi} = \nu_i = \frac{1}{ \tau_i} \label{15.146}$

Thus the frequency $$\nu$$ associated with the periodic motion is the reciprocal of the period $$\tau$$. The secret here is that the derivative of $$H$$ with respect to the action variable $$J$$ given by Equation \ref{15.141} directly determines the frequency of the periodic motion without the need to solve the complete equations of motion. Note that multiple periodic motion can be represented by a Fourier expansion of the form

$q_k = \sum^{\infty}_{j_1=−\infty} \sum^{\infty}_{j_2=−\infty} ... \sum^{\infty}_{j_n=−\infty} a^k_{j_1,..,j_n} e^{2\pi i(j_1\omega_1+ j_2\omega_2+ j_3\omega_3+..+ j_n\omega_n)} \label{15.147}$

Although the action-angle approach to Hamilton-Jacobi theory does not produce complete equations of motion, it does provide the frequency decomposition that often is the physics of interest. The reason that the powerful action-angle variable approach has been introduced here is that it is used extensively in celestial mechanics. The action-angle concept also played a key role in the development of quantum mechanics, in that Sommerfeld recognized that Bohr’s ad hoc assumption that angular momentum is quantized, could be expressed in terms of quantization of the angle variable as is mentioned in chapter $$18$$.

## Adiabatic invariance of the action variables

When the Hamiltonian depends on time it can be quite difficult to solve for the motion because it is difficult to find constants of motion for time-dependent systems. However, if the time dependence is sufficiently slow, that is, if the motion is adiabatic, then there exist dynamical variables that are almost constant which can be used to solve for the motion. In particular, such approximate constants are the familiar action-angle integrals. The adiabatic invariance of the action variables played an important role in the development of quantum mechanics during the 1911 Solvay Conference. This was a time when physicists were grappling with the concepts of quantum mechanics. Einstein used the following classical mechanics example of adiabatic invariance, applied to the simple pendulum, in order to illustrate the concept of adiabatic invariance of the action. This example demonstrates the power of using action-angle variables.

Example $$\PageIndex{1}$$: Adiabatic invariance for the simple pendulum

Consider that the pendulum is made up of a point mass $$M$$ suspended from a pivot by a light string of length $$L$$ that is swinging freely in a vertical plane. Derive the dependence of the amplitude of the oscillations $$\theta$$, assuming $$\theta$$ is small, if the string is very slowly shortened by a factor of 2, that is, assume that the change in length during one period of the oscillation is very small. The tension in the string $$T$$ is given by

$T = Mg \langle \cos \theta \rangle + \left\langle \frac{ML^2 \dot{\theta}^2 }{L}\right\rangle \nonumber$

Let the pendulum angle be oscillatory

$\theta = \theta_0 \cos(\omega t + \varphi_0) \nonumber$

Then the average mean square amplitude and velocity over one period are

$\langle \theta^2 \rangle = ­ \langle [\theta_0 \cos (\omega t + \varphi_0)]^2 \rangle = \frac{\theta^2_0}{ 2} \nonumber$

$\left\langle \dot{\theta}^2 \right\rangle = ­ \langle [−\theta_0\omega \sin(\omega t + \varphi_0)]^2\rangle = \frac{\omega^2\theta^2_0}{ 2} \nonumber$

Since, for the simple pendulum, $$\omega^2 = \frac{g}{L}$$, then the tension in the string

$T = Mg(1 − ­\frac{\langle \theta^2\rangle}{ 2 }) + ML \langle \dot{\theta}^2 \rangle = Mg(1 + \frac{\theta^2_0}{ 4} ) \nonumber$

Assuming that $$\theta_0$$ is a small angle, and that the change in length $$−\Delta L$$ is very small during one period $$\tau$$, then the work done is

$\Delta W = T\Delta L = −Mg\Delta L − Mg \frac{\theta^2_0}{4} \Delta L \label{a}\tag{a}$

while the change in internal oscillator energy is

$\Delta(−Mg L \cos \theta_0) = \Delta \left[ −Mg L(1 − \frac{\theta^2_0}{2} ) \right] = −Mg\Delta L + \frac{1}{ 2} Mg\Delta( L\theta^2_0) = −Mg\Delta L + \frac{1}{ 2} Mg\theta^2_0\Delta L + Mg L\theta_0 \Delta\theta_0 \label{b}\tag{b}$

The work done must balance the increment in internal energy therefore

$L\theta_0\Delta\theta_0 + \frac{3\theta^2_0\Delta L}{ 4} = 0 \nonumber$

or

$L\theta^2_0\Delta \ln (\theta_0 L^{\frac{3}{ 4}} )=0 \nonumber$

Therefore it follows that

$(\theta_0 L^{\frac{3}{ 4}} ) = \text{ constant} \label{c}\tag{c}$

or

$\theta_0 \propto L^{−\frac{ 3}{ 4}} \nonumber$

Thus shortening the length of the pendulum string from $$L$$ to $$\frac{L}{2}$$ adiabatically corresponds to the amplitude increasing by a factor 1.68.

Consider the action-angle integral for one closed period $$\tau = \frac{2\pi}{ \omega}$$ for this problem

$J = \oint P_{\theta} d\theta \\ = \oint ML^2 \dot{\theta} \cdot \dot{\theta} dt \\ = ML^2 \langle \dot{\theta}^2 \rangle \frac{2\pi}{ \omega} \\ = \pi ML^2\theta^2_0\omega \\= \pi Mg^{\frac{1}{ 2}} \theta^2_0 L^{\frac{3}{ 2}} = \text{ constant} \nonumber$

where that last step is due to Equation \ref{c}.

The above example shows that the action integral $$J = constant$$, that is, it is invariant to an adiabatic change. In retrospect this result is as expected in that the action integral should be minimized.