
9.S: Hamilton's Action Principle (Summary)


    Hamilton’s 1834 publication, which introduced both Hamilton’s Principle of Stationary Action and Hamiltonian mechanics, marked the crowning achievement in the development of variational principles in classical mechanics. Hamiltonian mechanics is formulated in terms of the conjugate coordinates \(\mathbf{q}\), \(\mathbf{p}\), plus time \(t\), which is a considerable advantage in most branches of physics and engineering. Compared to Lagrangian mechanics, Hamiltonian mechanics has a significantly broader arsenal of powerful techniques that can be exploited to obtain an analytical solution of the integrals of the motion for complicated systems, as described in chapter \(15\). In addition, Hamiltonian dynamics provides a means of determining the unknown variables for which the solution assumes a soluble form, and is ideal for the study of the fundamental underlying physics in applications to fields such as quantum or statistical physics. As a consequence, Hamiltonian mechanics has become the preeminent variational approach used in modern physics.

    This chapter has introduced and discussed Hamilton’s Principle of Stationary Action, which underlies the elegant and remarkably powerful Lagrangian and Hamiltonian representations of algebraic mechanics. The basic concepts employed in algebraic mechanics are summarized below.

    Hamilton’s Action Principle

    As discussed in chapter \(9.2\), Hamiltonian mechanics is built upon Hamilton’s action functional

    \[S(\mathbf{q},\mathbf{p},t) = \int^{t_f}_{t_i} L(\mathbf{q}, \mathbf{\dot{q}},t)dt \label{9.1}\]

    Hamilton’s Principle of Stationary Action (often called the principle of least action) states that

    \[\delta S(\mathbf{q},\mathbf{p},t) = \delta \int^{t_f}_{t_i} L(\mathbf{q}, \mathbf{\dot{q}},t)dt = 0 \label{9.2}\]
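    The stationarity condition in Equation \ref{9.2} can be checked numerically. The following is a minimal sketch, not taken from the chapter, that assumes a one-dimensional harmonic oscillator with \(m = k = 1\), fixed end points \(q(0) = 0\) and \(q(1) = 1\), and a simple midpoint discretization of the action. The function names and the perturbation \(\epsilon \sin(\pi t/t_f)\) are illustrative choices.

```python
import math

def action(path, dt, m=1.0, k=1.0):
    """Discretized action: sum of L*dt with L = T - U for a 1D oscillator."""
    total = 0.0
    for q0, q1 in zip(path[:-1], path[1:]):
        velocity = (q1 - q0) / dt        # finite-difference velocity on the segment
        midpoint = 0.5 * (q0 + q1)       # midpoint position on the segment
        total += (0.5 * m * velocity**2 - 0.5 * k * midpoint**2) * dt
    return total

def trial_path(eps, n=1000, t_f=1.0, q_f=1.0):
    """True path for m = k = 1, q(t) = q_f sin(t)/sin(t_f), plus a
    perturbation eps*sin(pi t/t_f) that vanishes at both end points."""
    dt = t_f / n
    path = []
    for i in range(n + 1):
        t = i * dt
        true_q = q_f * math.sin(t) / math.sin(t_f)
        path.append(true_q + eps * math.sin(math.pi * t / t_f))
    return path, dt

def action_of(eps):
    path, dt = trial_path(eps)
    return action(path, dt)

eps = 0.1
s0 = action_of(0.0)
s_plus = action_of(eps)
s_minus = action_of(-eps)

# Central-difference estimate of dS/d(eps) at eps = 0 (the first variation).
first_order = (s_plus - s_minus) / (2 * eps)

print("S on true path:        ", s0)
print("S(+eps) - S(0):        ", s_plus - s0)
print("S(-eps) - S(0):        ", s_minus - s0)
print("first-order variation: ", first_order)
```

    Under these assumptions the first-order variation should vanish to within the discretization error, while the action changes only at second order in \(\epsilon\). For this short time interval the true path is a minimum of the action, but in general Hamilton’s Principle requires only that the action be stationary.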

    Generalized momentum \(p\)

    In chapter \(7.2\), the generalized (canonical) momentum was defined in terms of the Lagrangian \(L\) to be

    \[p_i \equiv \frac{\partial L(\mathbf{q}, \mathbf{\dot{q}},t)}{\partial \dot{q}_i} \label{7.3}\]

    Chapter \(9.2.2\) defined the generalized momentum in terms of the action functional \(S\) to be

    \[p_j \equiv \frac{\partial S(\mathbf{q}, \mathbf{p},t)}{\partial q_j} \label{9.12}\]

    Generalized energy \(h (\mathbf{q},\mathbf{\dot{q}},t)\)

    Jacobi’s Generalized Energy \(h (\mathbf{q},\mathbf{\dot{q}},t)\) was defined in Equation \ref{7.37} as

    \[h (\mathbf{q},\mathbf{\dot{q}},t) \equiv \sum_j \left( \dot{q}_j \frac{\partial L(\mathbf{q}, \mathbf{\dot{q}},t)}{\partial \dot{q}_j} \right) - L(\mathbf{q}, \mathbf{\dot{q}},t) \label{7.37}\]

    Hamiltonian function \(H(\mathbf{q}, \mathbf{p},t)\)

    The Hamiltonian \(H(\mathbf{q}, \mathbf{p},t)\) was defined in terms of the generalized energy \(h (\mathbf{q},\mathbf{\dot{q}},t)\) plus the generalized momentum. That is,

    \[H(\mathbf{q}, \mathbf{p},t) \equiv h (\mathbf{q},\mathbf{\dot{q}},t) = \sum_j p_j\dot{q}_j - L(\mathbf{q}, \mathbf{\dot{q}},t) = \mathbf{p} \cdot \mathbf{\dot{q}} - L(\mathbf{q}, \mathbf{\dot{q}},t) \label{10.S.1}\]

    where \(\mathbf{p}\), \(\mathbf{q}\) correspond to \(n\)-dimensional vectors, e.g. \(\mathbf{q} \equiv (q_1, q_2, \dots , q_n)\), and the scalar product is \(\mathbf{p} \cdot \mathbf{\dot{q}} = \sum_i p_i \dot{q}_i\). Chapter \(8.2\) used a Legendre transformation to derive this relation between the Hamiltonian and Lagrangian functions. Note that whereas the Lagrangian \(L(\mathbf{q},\mathbf{\dot{q}},t)\) is expressed in terms of the coordinates \(\mathbf{q}\) plus the corresponding velocities \(\mathbf{\dot{q}}\), the Hamiltonian \(H (\mathbf{q}, \mathbf{p}, t)\) is expressed in terms of the coordinates \(\mathbf{q}\) plus their conjugate momenta \(\mathbf{p}\). For scleronomic systems described by the standard Lagrangian, equations \((7.9.4)\) and \((7.6.14)\) show that the Hamiltonian equals the total mechanical energy, that is, \(H = T + U\).
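    As a short worked illustration of these definitions, consider a one-dimensional harmonic oscillator. This example and its symbols (mass \(m\), spring constant \(k\)) are assumptions introduced here and are not taken from the chapter. The standard Lagrangian and the generalized momentum are

    \[L = \frac{1}{2} m \dot{q}^2 - \frac{1}{2} k q^2 \qquad p = \frac{\partial L}{\partial \dot{q}} = m\dot{q}\]

    The generalized energy is then

    \[h = \dot{q} \frac{\partial L}{\partial \dot{q}} - L = \frac{1}{2} m \dot{q}^2 + \frac{1}{2} k q^2\]

    and eliminating \(\dot{q} = p/m\) in favor of the momentum gives the Hamiltonian

    \[H(q,p) = p\dot{q} - L = \frac{p^2}{2m} + \frac{1}{2} k q^2 = T + U\]

    which equals the total mechanical energy, as expected for a scleronomic system with a standard Lagrangian.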

    Generalized energy theorem

    The equations of motion lead to the generalized energy theorem which states that the time dependence of the Hamiltonian is related to the time dependence of the Lagrangian.

    \[\frac{dH (\mathbf{q},\mathbf{p},t)}{dt} = \sum_j \dot{q}_j \left[ Q^{EXC}_j + \sum^{m}_{k=1} \lambda_k \frac{\partial g_k}{\partial q_j} (\mathbf{q},t) \right] - \frac{\partial L(\mathbf{q},\mathbf{\dot{q}},t)}{\partial t} \label{7.38}\]

    Note that if all the generalized non-potential forces and Lagrange multiplier terms are zero, and if the Lagrangian is not an explicit function of time, then the Hamiltonian is a constant of motion.
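    As a brief illustration, consider a sketch under assumed conditions that are not taken from the chapter: a one-dimensional system with no constraints (\(\lambda_k = 0\)), a Lagrangian that is not an explicit function of time, and a linear drag force \(Q^{EXC} = -b\dot{q}\), where the drag coefficient \(b > 0\) is a hypothetical parameter. Equation \ref{7.38} then reduces to

    \[\frac{dH}{dt} = \dot{q}\, Q^{EXC} = -b\dot{q}^2 \leq 0\]

    so the Hamiltonian decreases whenever the system is moving, and it becomes a constant of motion only in the limit \(b = 0\).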

    Lagrange equations of motion

    Equation \ref{6.60} gives that the \(N\) Lagrange equations of motion are

    \[\left\{ \frac{d}{dt} \left(\frac{\partial L}{\partial \dot{q}_j}\right) - \frac{\partial L}{\partial q_j} \right\} = \sum^m_{k=1} \lambda_k \frac{\partial g_k}{\partial q_j} (\mathbf{q}, t) + Q^{EXC}_j \label{6.60}\]

    where \(j = 1, 2, 3, \dots , N\).

    Hamilton’s equations of motion

    Chapter \(8.3\) showed that a Legendre transform, plus the Lagrange-Euler equations, leads to Hamilton’s equations of motion, equations \((8.3.11)\), \((8.3.12)\), and \((8.3.13)\). Hamilton derived these equations of motion directly from the action functional, as shown in chapter \(9.2\).

    \[\dot{q}_j = \frac{\partial H (\mathbf{q},\mathbf{p},t)}{ \partial p_j} \label{8.25} \]

    \[\dot{p}_j = -\frac{\partial H (\mathbf{q},\mathbf{p},t)}{\partial q_j} + \left[ \sum^m_{k=1} \lambda_k \frac{\partial g_k}{\partial q_j} (\mathbf{q}, t) + Q^{EXC}_j \right] \label{8.26}\]

    \[ \frac{\partial H (\mathbf{q},\mathbf{p},t)}{\partial t } = -\frac{\partial L(\mathbf{q},\mathbf{\dot{q}},t)}{\partial t} \label{8.24} \]

    Note the symmetry of Hamilton’s two canonical equations, in which \(p_k\) and \(q_k\) are treated as independent canonical variables. Lagrange was the first to derive the canonical equations but he did not recognize them as a basic set of equations of motion. Hamilton derived the canonical equations of motion from his fundamental variational principle and made them the basis for a far-reaching theory of dynamics. For a system with \(s\) degrees of freedom, Hamilton’s equations give \(2s\) first-order differential equations for the variables \(p_k\), \(q_k\), whereas Lagrange’s equations give \(s\) second-order differential equations in terms of the variables \(q_k\), \(\dot{q}_k\).
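    The first-order form of Hamilton’s equations lends itself to direct numerical integration. The following is a minimal sketch, not part of the chapter, that assumes the one-dimensional harmonic-oscillator Hamiltonian \(H = p^2/2m + kq^2/2\) with no constraint or non-potential forces, and that integrates Equations \ref{8.25} and \ref{8.26} over one period using the leapfrog (Störmer-Verlet) scheme. The function names, step count, and initial conditions are illustrative choices.

```python
import math

def hamiltonian(q, p, m=1.0, k=1.0):
    """H(q, p) = p^2/(2m) + k q^2/2 for a 1D harmonic oscillator."""
    return p * p / (2.0 * m) + 0.5 * k * q * q

def dH_dq(q, p, m=1.0, k=1.0):
    """Partial derivative of H with respect to q."""
    return k * q

def dH_dp(q, p, m=1.0, k=1.0):
    """Partial derivative of H with respect to p."""
    return p / m

def leapfrog(q, p, dt, steps):
    """Integrate q_dot = dH/dp, p_dot = -dH/dq with a kick-drift-kick step."""
    for _ in range(steps):
        p -= 0.5 * dt * dH_dq(q, p)   # half step in momentum
        q += dt * dH_dp(q, p)         # full step in coordinate
        p -= 0.5 * dt * dH_dq(q, p)   # half step in momentum
    return q, p

q0, p0 = 1.0, 0.0
steps = 10000
dt = 2.0 * math.pi / steps            # one period for m = k = 1

q1, p1 = leapfrog(q0, p0, dt, steps)
energy_drift = abs(hamiltonian(q1, p1) - hamiltonian(q0, p0))

print("q, p after one period:", q1, p1)
print("change in H:          ", energy_drift)
```

    Because this Hamiltonian has no explicit time dependence and no non-potential forces act, the generalized energy theorem implies that \(H\) is a constant of motion, so the printed change in \(H\) should be small and the phase-space point should return close to its starting value after one period.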

    Hamilton-Jacobi equation

    Hamilton used Hamilton’s Principle to derive the Hamilton-Jacobi equation, Equation \ref{9.19}.

    \[\frac{\partial S}{\partial t} + H(\mathbf{q},\mathbf{p},t)=0 \label{9.19}\]

    The solution of Hamilton’s equations is trivial if the Hamiltonian is a constant of motion, or when a set of generalized coordinates can be identified for which all the coordinates \(q_i\) are constant, or are cyclic (also called ignorable coordinates). Jacobi developed the mathematical framework of canonical transformations required to exploit the Hamilton-Jacobi equation.
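    As a simple check of Equation \ref{9.19}, consider a free particle in one dimension. This is an illustrative example, not taken from the chapter, and the initial point \((q_0, t_0)\) is an assumed label. With \(H = p^2/2m\) and the momentum expressed as \(p = \partial S / \partial q\), the Hamilton-Jacobi equation becomes

    \[\frac{\partial S}{\partial t} + \frac{1}{2m} \left( \frac{\partial S}{\partial q} \right)^2 = 0\]

    The action evaluated along the straight-line path from \((q_0, t_0)\) to \((q, t)\) is

    \[S(q,t) = \frac{m (q - q_0)^2}{2 (t - t_0)}\]

    for which

    \[\frac{\partial S}{\partial q} = \frac{m (q - q_0)}{t - t_0} = p \qquad \frac{\partial S}{\partial t} = -\frac{m (q - q_0)^2}{2 (t - t_0)^2} = -\frac{p^2}{2m}\]

    so the two terms cancel and the Hamilton-Jacobi equation is satisfied.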

    Hamilton’s Principle applied using initial boundary conditions

    The definition of Hamilton’s Principle assumes integration between the initial time \(t_i\) and the final time \(t_f\). A recent development has extended Hamilton’s Principle to systems that are defined in terms of only the initial boundary conditions. This method doubles the number of degrees of freedom and uses a coupling Lagrangian \(K (\mathbf{q}_2, \mathbf{\dot{q}}_2, \mathbf{q}_1, \mathbf{\dot{q}}_1, t)\) between the corresponding \(\mathbf{q}_1\) and \(\mathbf{q}_2\) doubled degrees of freedom. The resulting equations of motion are

    \[\frac{d}{dt} \frac{ \partial L}{ \partial \dot{q}^I_-} - \frac{\partial L}{\partial q^{I}_-} = \left[ \frac{\partial K}{\partial q^I_-} - \frac{d}{dt} \frac{\partial K}{\partial \dot{q}^I_-} \right]_{PL} \equiv Q^I (\mathbf{q}_1, \mathbf{\dot{q}}_1, t) \label{9.50} \]

    where \(Q^I\) is a generalized nonconservative force derived from \(K\).

    Standard Lagrangians

    Derivation of Lagrangian mechanics, using d’Alembert’s principle of virtual work, assumed that the Lagrangian is defined by Equation \ref{9.52}

    \[L(\mathbf{q}, \mathbf{\dot{q}},t) = T(\mathbf{\dot{q}},t) - U(\mathbf{q}, t) \label{9.52}\]

    This was used in equation \((9.2.1)\) to derive the action in terms of the fundamental Lagrangian defined by Equation \ref{9.52}. The assumption that the action \(S\) is the fundamental property inverts this procedure, and equation \((9.2.1)\) is now used to derive the Lagrangian. That is, the assumption that Hamilton’s Principle is the foundation of algebraic mechanics defines the Lagrangian in terms of the fundamental action \(S\).

    Non-standard Lagrangians

    The flexibility and power of Lagrangian mechanics can be extended to a broader range of dynamical systems by employing an extended definition of the Lagrangian that assumes that the action is the fundamental property, and then the Lagrangian is defined in terms of Hamilton’s variational action principle using Equation \ref{9.2}. It was illustrated that the inverse variational calculus formalism can be used to identify non-standard Lagrangians that generate the required equations of motion. These non-standard Lagrangians can be very different from the standard Lagrangian and do not separate into kinetic and potential energy components. They can be used to handle dissipative systems, which lie beyond the range of validity of standard Lagrangians. That is, it was shown that several very different Lagrangians and Hamiltonians can be equivalent for generating useful equations of motion of a system. Currently the use of non-standard Lagrangians is a narrow, but active, frontier of classical mechanics with important applications to relativistic mechanics.
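    One commonly quoted example of a Lagrangian that is not of the form \(T - U\) is given here for illustration only, and is not necessarily the example used in the chapter. For a linearly-damped one-dimensional oscillator, with an assumed damping parameter \(\Gamma\), consider

    \[L(q,\dot{q},t) = e^{\Gamma t} \left( \frac{1}{2} m \dot{q}^2 - \frac{1}{2} k q^2 \right)\]

    Applying the Euler-Lagrange equation with no additional generalized forces gives

    \[\frac{d}{dt} \left( e^{\Gamma t} m \dot{q} \right) + e^{\Gamma t} k q = e^{\Gamma t} \left( m\ddot{q} + m\Gamma \dot{q} + kq \right) = 0\]

    Since \(e^{\Gamma t}\) never vanishes, this reproduces the equation of motion of the damped oscillator, \(m\ddot{q} + m\Gamma \dot{q} + kq = 0\), without introducing an explicit dissipative force.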

    Gauge invariance of the standard Lagrangian

    It was shown that there is a continuum of equivalent standard Lagrangians that lead to the same set of equations of motion for a system. This feature is related to gauge invariance in mechanics. The following transformations change the standard Lagrangian, but leave the equations of motion unchanged.

    1. The Lagrangian is indefinite with respect to addition of a constant to the scalar potential which cancels out when the derivatives in the Euler-Lagrange differential equations are applied.
    2. Similarly the Lagrangian is indefinite with respect to addition of a constant kinetic energy.
    3. The Lagrangian is indefinite with respect to addition of a total time derivative of the form \(L + \frac{d}{dt} \left[ \Lambda \left( q_i , t \right) \right]\) for any differentiable function \(\Lambda \left( q_i , t \right)\) of the generalized coordinates, plus time, that has continuous second derivatives.
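    The third transformation can be verified directly. The following short derivation is a sketch that uses only the chain rule and the assumed continuity of the second derivatives of \(\Lambda\). The total time derivative is

    \[\frac{d\Lambda}{dt} = \sum_i \frac{\partial \Lambda}{\partial q_i} \dot{q}_i + \frac{\partial \Lambda}{\partial t}\]

    so that \(\frac{\partial}{\partial \dot{q}_j} \frac{d\Lambda}{dt} = \frac{\partial \Lambda}{\partial q_j}\). The contribution of this added term to the \(j\)-th Euler-Lagrange equation is therefore

    \[\frac{d}{dt} \left( \frac{\partial \Lambda}{\partial q_j} \right) - \frac{\partial}{\partial q_j} \left( \frac{d\Lambda}{dt} \right) = \sum_i \left( \frac{\partial^2 \Lambda}{\partial q_i \partial q_j} - \frac{\partial^2 \Lambda}{\partial q_j \partial q_i} \right) \dot{q}_i + \frac{\partial^2 \Lambda}{\partial t \partial q_j} - \frac{\partial^2 \Lambda}{\partial q_j \partial t} = 0\]

    which vanishes because the mixed second derivatives are equal. The equations of motion derived from \(L\) and from \(L + \frac{d\Lambda}{dt}\) are therefore identical.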

    Application of Hamilton’s Action Principle to mechanics

    The derivation of the equations of motion for any system can be separated into a hierarchical set of three stages that differ in both sophistication and understanding. Variational principles are employed during the primary “action” stage and the secondary “Hamiltonian/Lagrangian” stage to derive the required equations of motion, which then are solved during the third “equations-of-motion” stage. Hamilton’s action is a scalar functional that is the basis for deriving the Lagrangian and Hamiltonian functions. The primary “action” stage uses Hamilton’s action functional, \(S = \int^{t_f}_{t_i} L (\mathbf{q},\mathbf{\dot{q}},t) dt\), to derive the Lagrangian and Hamiltonian functionals, and it provides the most fundamental and sophisticated level of understanding. The secondary “Hamiltonian/Lagrangian” stage involves using the Lagrangian and Hamiltonian functionals to derive the equations of motion. The third “equations-of-motion” stage uses the derived equations of motion to solve for the motion subject to a given set of initial boundary conditions.

    The Newtonian mechanics approach bypasses both the primary “action” stage and the secondary “Hamiltonian/Lagrangian” stage. That is, Newtonian mechanics starts at the third “equations-of-motion” stage, which does not allow exploiting the considerable advantages provided by use of the action, the Lagrangian, and the Hamiltonian. Newtonian mechanics requires that all the active forces be included when deriving the equations of motion, which involves dealing with vector quantities. This is in contrast to the action, Lagrangian, and Hamiltonian, which are scalar functionals. Both the primary “action” stage and the secondary “Hamiltonian/Lagrangian” stage exploit the powerful arsenal of mathematical techniques that have been developed for exploiting variational principles.


    This page titled 9.S: Hamilton's Action Principle (Summary) is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Douglas Cline via source content that was edited to the style and standards of the LibreTexts platform.