Skip to main content
Library homepage
 

Text Color

Text Size

 

Margin Size

 

Font Type

Enable Dyslexic Font
Physics LibreTexts

15.3: Canonical Transformations in Hamiltonian Mechanics

( \newcommand{\kernel}{\mathrm{null}\,}\)

Hamiltonian mechanics is an especially elegant and powerful way to derive the equations of motion for complicated systems. Unfortunately, integrating the equations of motion to derive a solution can be a challenge. Hamilton recognized this difficulty, so he proposed using generating functions to make canonical transformations which transform the equations into a known soluble form. Jacobi, a contemporary mathematician, recognized the importance of Hamilton’s pioneering developments in Hamiltonian mechanics, and therefore he developed a sophisticated mathematical framework for exploiting the generating function formalism in order to make the canonical transformations required to solve Hamilton’s equations of motion.

In the Lagrange formulation, transforming coordinates (qi,˙qi) to cyclic generalized coordinates (Qi,˙Qi), simplifies finding the Euler-Lagrange equations of motion. For the Hamiltonian formulation, the concept of coordinate transformations is extended to include simultaneous canonical transformation of both the spatial coordinates qi and the conjugate momenta pi from (qi,pi) to (Qi,Pi), where both of the canonical variables are treated equally in the transformation. Compared to Lagrangian mechanics, Hamiltonian mechanics has twice as many variables which is an asset, rather than a liability, since it widens the realm of possible canonical transformations.

Hamiltonian mechanics has the advantage that generating functions can be exploited to make canonical transformations to find solutions, which avoids having to use direct integration. Canonical transformations are the foundation of Hamiltonian mechanics; they underlie Hamilton-Jacobi theory and action-angle variable theory, both of which are powerful means for exploiting Hamiltonian mechanics to solve problems in physics and engineering. The concept underlying canonical transformations is that, if the equations of motion are simplified by using a new set of generalized variables (Q,P), compared to using the original set of variables (q,p), then an advantage has been gained. The solution, expressed in terms of the generalized variables (Q,P), can be transformed back to express the solution in terms of the original coordinates, (q,p).

Only a specialized subset of transformations will be considered, namely canonical transformations that preserve the canonical form of Hamilton’s equations of motion. That is, given that the original set of variables (qi,pi) satisfy Hamilton’s equations

˙q=H(q,p,t)p˙p=H(q,p,t)q

for some Hamiltonian H(q,p,t), then the transformation to coordinates Qi(qk,pk,t),Pi(qk,pk,t) is canonical if, and only if, there exists a function H(Q,P,t) such that the P and Q are still governed by Hamilton’s equations. That is,

˙Q=H(Q,P,t)P˙P=H(Q,P,t)Q

where H(Q,P,t) plays the role of the Hamiltonian for the new variables. Note that H(Q,P,t) may be very different from the old Hamiltonian H(q,p,t). The invariance of the Poisson bracket to canonical transformations, chapter 15.2, provides a powerful test that the transformation is canonical.

Hamilton’s Principle of least action, discussed in chapter 9, states that

δS=δt2t1L(q,˙q,t)dt=δt2t1[p˙qH(q,p,t)]dt=0

Similarly, applying Hamilton’s Principle of least action to the new Lagrangian L(Q,˙Q,t) gives

δS=δt2t1L(Q,˙Q,t)dt=δt2t1[P˙QH(Q,P,t)]dt=0

The discussion of gauge-invariant Lagrangians, chapter 9.3, showed that L and L can be related by the total time derivative of a generating function F where

dFdt=LL

The generating function F can be any well-behaved function with continuous second derivatives of both the old and new canonical variables p, q, P, Q and t. Thus the integrands of ??? and ??? are related by

p˙qH(q,p,t)=λ[P˙QH(Q,P,t)]+dFdt

where λ is a possible scale transformation. A scale transformation, such as changing units, is trivial, and will be assumed to be absorbed into the coordinates, making λ=1. Assuming that λ1 is called an extended canonical transformation.

Generating functions

The generating function F has to be chosen such that the transformation from the initial variables (q,p) to the final variables (Q,P) is a canonical transformation. The chosen generating function contributes to ??? only if it is a function of the old plus new variables. The four possible types of generating functions of the first kind, are F1(q,Q,t), F2(q,P,t), F3(p,Q,t), and F4(p,P,t). These four generating functions lead to relatively simple canonical transformations, are shown below.

Type 1: F=F1(q,Q,t):

The total time derivative of the generating function F=F1(q,Q,t) is given by

dF(q,Q,t)dt=[F1(q,Q,t)q˙q+F1(q,Q,t)Q˙Q]+F1(q,Q,t)t

Insert Equation ??? into Equation ???, and assume that the trivial scale factor λ=1, then

[pF1(q,Q,t)q]˙qH(q,p,t)=[P+F1(q,Q,t)Q]˙QH(Q,P,t)+F1(q,Q,t)t

Assume that the generating function F1 determines the canonical variables p and P to be

p=F1(q,Q,t)qP=F1(q,Q,t)Q

then the terms in each square bracket cancel, leading to the required canonical transformation

H(Q,P,t)=H(q,p,t)+F1(q,Q,t)t

Type 2: F=F2(q,P,t)QP:

The total time derivative of the generating function F=F2(q,P,t)QP is given by

dFdt=[F2(q,P,t)q˙q+F2(q,P,t)P˙pP˙Q˙PQ]+F2(q,P,t)t

Insert this into Equation ???, and assume that the trivial scale factor λ=1, then

(pF2(q,P,t)q)˙qH(q,p,t)=P˙QP˙Q+[F2(q,P,t)PQ]˙PH(Q,P,t)+F2(q,P,t)t

Assume that the generating function F2 determines the canonical variables p and Q to be

p=F2(q,P,t)qQ=F2(q,P,t)P

then the terms in brackets cancel, leading to the required transformation

H(Q,P,t)=H(q,p,t)+F2(q,P,t)t

Type 3: F=F3(p,Q,t)+qp:

The total time derivative of the generating function F=F3(p,Q,t)+qp is given by

dFdt=[F3(p,Q,t)p˙p+F3(p,Q,t)Q˙Q+˙qp+q˙p]+F3(p,Q,t)t

Insert this into Equation ???, and assume that the trivial scale factor λ=1, then

[q+F3(p,Q,t)p]˙pH(q,p,t)=[P+F3(p,Q,t)Q]˙QH(Q,P,t)+F3(p,Q,t)t

Assume that the generating function F3 determines the canonical variables q and P to be

q=F3(p,Q,t)pP=F3(p,Q,t)Q

then the terms in brackets cancel, leading to the required transformation

H(Q,P,t)=H(q,p,t)+F3(p,Q,t)t

Type 4: F=F4(p,P,t)+qpQP:

The total time derivative of the generating function F=F4(p,P,t)+qpQP is given by

dFdt=[F4(p,P,t)p˙p+F4(p,P,t)P˙p+˙qp+q˙p˙QPQ˙P]+F4(p,P,t)t

Insert this into Equation ???, and assume that the trivial scale factor λ=1, then

[q+F4(p,P,t)p]˙pH(q,p,t)=[F4(p,P,t)PQ]˙PH(Q,P,t)+F4(p,P,t)t

Assume that the generating function F4 determines the canonical variables q and Q to be

q=F4(p,P,t)pQ=F4(p,P,t)P

then the terms in brackets cancel, leading to the required transformation

H(Q,P,t)=H(q,p,t)+F4(p,P,t)t

Note that the last three generating functions require the inclusion of additional bilinear products of q, p, Q, P in order for the terms to cancel to give the required result. The addition of the bilinear terms, ensures that the resultant generating function F is the same using any of the four generating functions F1, F2, F3, F4. Frequently the F2(q,P,t) generating function is the most convenient. The four possible generating functions of the first kind, given above, are related by Legendre transformations. A canonical transformation does not have to conform to only one of the four generating functions Fk for all the degrees of freedom, they can be a mixture of different flavors for the different degrees of freedom. The properties of the generating functions are summarized in table 15.3.1.

Table 15.3.1: Canonical transformation generating functions
Generating function Generating function derivatives Trivial special examples
F=F1(q,Q,t) pi=F1qiPi=F1Qi F1=qiQiQi=piPi=qi
F=F2(q,P,t)QP pi=F2qiQi=F2Pi F2=qiPiQi=qiPi=pi
F=F3(p,Q,t)+qp qi=F3piPi=F3Qi F3=piQiQi=qiPi=pi
F=F4(p,P,t)+qpQP qi=F4piQi=F4Pi F4=piPiQi=piPi=qi

The partial derivatives of the generating functions Fi determine the corresponding conjugate variables not explicitly included in the generating function Fi. Note that, for the first trivial example F1=qiQi, the old momenta become the new coordinates, Qi=pi, and vice versa, Pi=qi. This illustrates that it is better to name them “conjugate variables” rather than “momenta” and “coordinates”.

In summary, Jacobi has developed a mathematical framework for finding the generating function F required to make a canonical transformation to a new Hamiltonian H(Q,P,t), that has a known solution. That is,

H(Q,P,t)=H(q,p,t)+Ft

When H(Q,P,t) is a constant, then a solution has been obtained. The inverse transformation for this solution Q(t),P(t)q(t),p(t) now can be used to express the final solution in terms of the original variables of the system.

Note the special case when H(Q,P,t)=0, then Equation ??? has been reduced to the Hamilton-Jacobi relation ???

H(q,p,t)+St=0

In this case, the generating function F determines the action functional S required to solve the Hamilton-Jacobi equation (15.4.23)). Since Equation ??? has transformed the Hamiltonian H(q,p,t)H(Q,P,t), for which H(Q,P,t)=0, then the solution Q(t),P(t) for the Hamiltonian H(Q,P,t)=0 is obtained easily. This approach underlies Hamilton-Jacobi theory presented in chapter 15.4.

Applications of Canonical Transformations

The canonical transformation procedure may appear unnecessarily complicated for solving the examples given in this book, but it is essential for solving the complicated systems that occur in nature. For example, canonical transformations can be used to transform time-dependent, (non-autonomous) Hamiltonians to time-independent, (autonomous) Hamiltonians for which the solutions are known. Example 15.6.2 describes such a system. Canonical transformations provide a remarkably powerful approach for solving the equations of motion in Hamiltonian mechanics, especially when using the Hamilton-Jacobi approach discussed in chapter 15.4.

Example 15.3.1: The identity canonical transformation

The identity transformation F2(q,P)=qP satisfies ??? if the following relations are satisfied pi=F2qi=Pi, Qi=F2Pi=qi, H=H. Note that the new and old coordinates are identical, hence F2=qiPi generates the identity transformation qi=Qi,pi=Pi.

Example 15.3.2: The point canonical transformation

Consider the point transformation F2(qP)=f(q,t)P where f(q,t) is some function of q. This transformation satisfies ??? if the following relations are satisfied Qi=F2Pi=fi(qi), pi=F2qi=fi(qi,t)qi, H=H. Point transformations correspond to point-to-point transformations of coordinates.

Example 15.3.3: The exchange canonical transformation

The identity transformation F1(q,Q)=qQ satisfies ??? if the following relations are satisfied pi=F1qi=Qi, Pi=F1Qi=qi, H=H That is, the coordinates and momenta have been interchanged.

Example 15.3.4: Infinitessimal point canonical transformation

Consider an infinitessimal point canonical transformation, that is infinitesimally close to a point identity.

F2(qP,t)=qP+ϵG(q,P,t)

satisfies ??? if the following relations are satisfied

Qi=F2Pi=qi+ϵG(q,P,t)Pi

pi=F2qi=Pi+ϵG(q,P,t)qi

Thus the infinitessimal changes in qi and pi are given by

δqi(q,p,t)=Qiqi=ϵG(q,P,t)Pi=ϵG(q,P,t)pi+O(ϵ2)

δpi(q,p,t)=Pipi=ϵG(q,P,t)qi=ϵG(q,P,t)pi+O(ϵ2)

Thus G(q,P,t) is the generator of the infinitessimal canonical transformation.

Example 15.3.5: 1-D harmonic oscillator via a cononical transformation

The classic one-dimensional harmonic oscillator provides an example of the use of canonical transformations. Consider the Hamiltonian where ω2=km then

H=p22m+kq22=12m(p2+m2ω2q2)

This form of the Hamiltonian is a sum of two squares suggesting a canonical transformation for which H is cyclic in a new coordinate. A guess for a canonical transformation is of the form p=mωqcotQ which is of the F1(q,Q) type where F1 equals F1(q,Q)=mωq22cotQ. Using ??? gives

p=F1(q,Q)qi=mωqcotQ

P=F1(q,Q)Q=m2ωq2sin2Q

Solving for the coordinates (p,q) yields

q=2PmωsinQ

p=2mωPcosQ

Inserting these into H gives

H=ωP(cos2Q+sin2Q)=ωP

which implies that Q is a cyclic coordinate.

The Hamiltonian is conservative, since it does not explicitly depend on time, and it equals the total energy since the transformation to generalized coordinates is time independent. Thus

H=E=ωP

Since

˙Q=HP=ω

then

Q=ωt+ϕ

Substituting Q into a gives the well known solution of the one-dimensional harmonic oscillator

q=2Emω2sin(ωt+ϕ)


This page titled 15.3: Canonical Transformations in Hamiltonian Mechanics is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Douglas Cline via source content that was edited to the style and standards of the LibreTexts platform.

Support Center

How can we help?