Skip to main content
Library homepage
Physics LibreTexts

17.7: Lorentz-invariant formulations of Hamiltonian Mechanics

  • Page ID
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    Extended Canonical Formalism

    A Lorentz-invariant formulation of Hamiltonian mechanics can be developed that is built upon the extended Lagrangian formalism assuming that the Hamiltonian and Lagrangian are related by a Legendre transformation. That is,

    \[H(\mathbf{q}, \mathbf{p}, t) = \sum^n_{\mu =1} p_{\mu} \frac{\partial q^{\mu}}{\partial t} − L(\mathbf{q}, \frac{\partial \mathbf{q}}{\partial t} , t) \label{17.73}\]

    where the generalized momentum is defined by

    \[p_{\mu} = \frac{\partial L}{ \partial \left( \frac{\partial q^{\mu}}{\partial t} \right)} \label{17.74}\]

    Struckmeier[Str08] assumes that the definitions of the extended Lagrangian \(\mathbb{L}\), and the extended Hamiltonian \(\mathbb{H}\), are related by a Legendre transformation, and are based on variational principles, analogous to the relation that exists between the conventional Lagrangian \(L\) and Hamiltonian \(H\). The Legendre transformation requires defining the extended generalized (canonical) momentum-energy four vector \(\mathbb{P}(s)= ( \frac{\mathbb{E} (s)}{c} , \mathbf{p}(s))\). The momentum components of the momentum-energy four vector \(\mathbb{P}(s)= (\frac{\mathbb{E} (s)}{c} , \mathbf{p}(s))\) are given by the \(1 \leq \mu \leq n\) components using either the conventional or the extended Lagrangians as given in Equation \ref{17.68}

    \[p_{\mu} (s) = \frac{\partial \mathbb{L}}{ \partial \left(\frac{dq^{\mu}}{ds} \right)} = \frac{\partial L}{ \partial \left(\frac{dq^{\mu}}{dt} \right)} \label{17.68}\]

    The \(\mu = 0\) component of the momentum-energy four vector is given by equation \((17.6.15)\)

    \[p_0 = \frac{1}{c} \left( \frac{\partial \mathbb{L}}{ \partial \left( \frac{dt}{ds} \right)} \right) = −\frac{H(p_{\mu} , q^{\mu} , t)}{ c} = −\frac{\mathcal{E} (s)}{ c} \label{17.75}\]

    where \(\mathcal{E} (s)\) represents the instantaneous generalized energy of the conventional Hamiltonian at the point \(s\), but not the functional form of \(H(\mathbf{q}(s), \mathbf{p}(s), t(s))\). That is

    \[\mathcal{E} (s) \underset{=}{\not\equiv} H(\mathbf{q}(s), \mathbf{p}(s), t(s)) \label{17.76}\]

    Note that \(\mathcal{E} (s)\) does not give the function \(H(\mathbf{q}, \mathbf{p}, t)\). Equations \ref{17.68} and \((17.6.15)\) give that

    \[p_0(s) = −\frac{\mathcal{E} (s)}{ c} \label{17.77}\]

    The extended Hamiltonian \(\mathbb{H}(\mathbf{q}, \mathbf{p}, t, \mathcal{E} (s))\), in an extended phase space, can be defined by the Legendre transformation and the four-vector \(\mathbb{P}\) to be

    \[\begin{align} \mathbb{H}(\mathbf{q}, \mathbf{p}, t, \mathcal{E} (s)) &= (\mathbb{P} \cdot \mathbf{q}) − \mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds} ) \label{17.78} \\[4pt] &= \sum^n_{\mu =0} p_{\mu} \left(\frac{dq^{\mu}}{ds} \right) − \mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds} ) \nonumber \\[4pt] &= \sum^n_{\mu =1} p_{\mu} \left(\frac{dq^{\mu}}{ds} \right) − \mathcal{E} \frac{dt}{ds} − \mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds}) \label{17.79} \end{align}\]

    where the \(p_0\) term has been written explicitly as \(−\mathcal{E} \frac{dt}{ds}\) in Equation \ref{17.79}. The extended Hamiltonian \(\mathbb{H}((\mathbf{q}, \mathbf{p}, t, \mathcal{E} (s))\) can carry all the information on the dynamical system that is carried by the extended Lagrangian \(\mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds} )\), if the Hesse matrix is non-singular. That is, if

    \[\text{det } \left( \frac{\partial^2 \mathbb{L}}{ \partial \left(\frac{dq^{\mu}}{ds} \right)\partial \left(\frac{dq_{\nu}}{ ds} \right)} \right) \neq 0 \label{17.80}\]

    If the extended Lagrangian \(\mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds} )\) is not homogeneous in the \(n+1\) velocities \(\frac{dq^{\mu}}{ds}\), then the extended set of Euler-Lagrange equations \((17.6.18)\) is not redundant. Thus equation \((17.6.12)\) is not an identity but it can be regarded as an implicit equation that is always satisfied by the extended set of Euler-Lagrange equations. As a result, the Legendre transformation to an extended Hamiltonian exists. That is, equation \((17.6.12)\) is identical to the Legendre transform for \(\mathbb{H}((\mathbf{q},\mathbf{p}, t, \mathcal{E} (s))\) which was shown to equal zero. Therefore

    \[\mathbf{H}(\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s)) = 0 \label{17.81}\]

    which means that the extended Hamiltonian \(\mathbb{H}((\mathbf{q}, \mathbf{p}, t, \mathcal{E} (s))\) directly defines the restricted hypersurface on which the particle motion is confined.

    The extended canonical equations of motion, derived using the extended Hamiltonian \(\mathbb{H}(\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\) with the usual Hamiltonian mechanics relations, are:

    \[\begin{align} \frac{\partial \mathbb{H}}{ \partial p_{\mu}} &= \frac{dq^{\mu}}{ds} \label{17.82} \\[4pt] \frac{\partial \mathbb{H}}{ \partial q^{\mu}} &= −\frac{dp_{\mu}}{ ds} \label{17.83} \\ \frac{\partial \mathbb{H}}{ \partial t} &= \frac{d\mathcal{E}}{ds} \label{17.84} \\ \frac{\partial \mathbb{H}}{ \partial \mathcal{E}} &= − \frac{dt}{ds} \label{17.85} \end{align}\]

    These canonical equations give that the total derivative of \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\) with respect to \(s\), is

    \[\begin{align} \frac{d\mathbb{H}}{ ds} &= \frac{\partial \mathbb{H}}{ \partial p_{\mu}} \frac{dp_{\mu}}{ ds} + \frac{\partial \mathbb{H}}{ \partial q^{\mu}} \frac{dq^{\mu}}{ds} + \frac{\partial \mathbb{H}}{ \partial t} \frac{dt}{ds} + \frac{\partial \mathbb{H}}{ \partial \mathcal{E}} \frac{d\mathcal{E}}{ds} \nonumber \\[4pt] &= \frac{dq^{\mu}}{ds} \frac{dp_{\mu} }{ds} − \frac{dp_{\mu}}{ ds} \frac{dq^{\mu}}{ds } + \frac{d\mathcal{E}}{ds} \frac{dt}{ds} − \frac{dt}{ds} \frac{d\mathcal{E}}{ds} = 0 \label{17.86} \end{align}\]

    That is, in contrast to the total time derivative of \(H(\mathbf{q}, \mathbf{p}, t)\), the total \(s\) derivative of the extended Hamiltonian \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\) always vanishes, that is, \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\) is autonomous which is ideal for use with Hamilton’s equations of motion. The constraints give that \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s)) = 0\), (Equation \ref{17.81}) and \(\frac{d\mathbb{H}}{ ds} = 0\), (Equation \ref{17.86}) implying that the correlation between the extended and conventional Hamiltonians is given by

    \[\begin{align}\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s)) &= \sum^n_{\mu =1} p_{\mu} \left(\frac{dq^{\mu}}{ds} \right) − \mathcal{E} \frac{dt}{ds} − \mathbb{L}(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t, \frac{dt}{ds} ) \label{17.87} \\[4pt] &= \sum^n_{\mu =1} p_{\mu} \left(\frac{dq^{\mu}}{ds} \right) − \mathcal{E} \frac{dt}{ds} − L(\mathbf{q}, \frac{d\mathbf{q}}{ds} ,t,) \frac{dt}{ds} \label{17.88} \\[4pt] &= \sum^n_{\mu =1} p_{\mu} \left(\frac{dq^{\mu}}{ds} \right) − \mathcal{E} \frac{dt}{ds} + \left[ H(\mathbf{q},\mathbf{p}, t) −\sum^n_{\mu =1} p_{\mu} \left(\frac{dq^{\mu}}{dt} \right) \right] \frac{dt}{ds} \label{17.89} \\[4pt] &= (H(\mathbf{q}, \mathbf{p}, t) − \mathcal{E} ) \frac{dt}{ds} = 0 \label{17.90} \end{align}\]

    since only the term with \(\mu = 0\) does not cancel in Equation \ref{17.79}. Equations \ref{17.81} and \ref{17.90} give that both the left and right-hand sides of Equation \ref{17.90} are zero while Equation \ref{17.86} implies that \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\) is a constant of motion, that is, \(s\) is a cyclic variable for \(\mathbb{H}((\mathbf{q}(s), \mathbf{p}(s), t(s), \mathcal{E} (s))\). Formally one can consider the extended Hamiltonian is a constant which equals zero

    \[\mathbb{H}(\mathbf{q}, \mathbf{p}, t, \mathcal{E} (s)) = \mathbb{E} (s)=0 \label{17.91}\]

    Equations \ref{17.84}, \ref{17.85} imply that \((\mathcal{E}, t)\) form a pair of canonically conjugate variables in addition to the newly-introduced canonically-conjugate variables \((\mathbb{E} (s), s)\). Equation \ref{17.90} shows that the motion in the \(2n + 2\) extended phase space is constrained to the surface reflecting the fact that the observed system has one less degree of freedom than used by the extended Hamiltonian.

    In summary, the Lorentz-invariant extended canonical formalism leads to Hamilton’s first-order equations of motion in terms of derivatives with respect to \(s\), where \(s\) is related to the proper time \(\tau\) for a relativistic system.

    Extended Poisson Bracket representation

    Struckmeier[Str08] investigated the usefulness of the extended formalism when applied to the Poisson bracket representation of Hamiltonian mechanics. The extended Poisson bracket for two differentiable functions \(F\) and \(G\) is defined as

    \[ \left\{\left\{ F,G \right\}\right\} = \sum^n_{j=1} \left( \frac{\partial F}{ \partial q^j} \frac{\partial G}{ \partial p_j} − \frac{\partial F}{ \partial p_j} \frac{\partial G}{ \partial q^j} \right) − \frac{\partial F}{ \partial t} \frac{\partial G}{ \partial H} + \frac{\partial F}{ \partial H} \frac{\partial G}{ \partial t} \label{17.92}\]

    As for the conventional Poisson bracket discussed in chapter \(15\), the extended Poisson also leads to the fundamental Poisson bracket relations

    \[ \left\{\left\{ q^i,q^j \right\}\right\} = 0 \quad \left\{\left\{ p_i,p_j \right\}\right\} = 0 \quad \left\{\left\{ q^i,p_j \right\}\right\} = \delta^i_j \label{17.93}\]

    where \(i, j = 0, 1, \dots , n\). These are identical to the non-extended fundamental Poisson brackets.

    The discussion of observables in Hamiltonian mechanics in chapter \(15.2.5\) can be trivially expanded to the extended Poisson bracket representation. In particular, the total \(s\) derivative of the function \(G\) is given by

    \[\frac{dG}{ ds} = \frac{\partial G}{ \partial s} + \left\{\left\{ G, \mathbb{H} \right\}\right\} \label{17.94}\]

    If \(G\) commutes with the extended Hamiltonian, that is, the Poisson bracket equals zero, and if \(\frac{\partial G}{ \partial s} = 0\), then \(\frac{dG}{ ds} = 0\). That is, the observable \(G\) is a constant of motion.

    Substitute the fundamental variables for \(G\) gives

    \[\frac{dp_{\mu} }{ds} = \left\{\left\{ p_{\mu}, \mathbb{H} \right\}\right\} = − \frac{\partial \mathbb{H}}{ \partial q^{\mu}} \quad \frac{dq^{\mu}}{ds} = \left\{\left\{ q^{\mu}, \mathbb{H} \right\}\right\} = \frac{\partial \mathbb{H}}{ \partial p_{\mu}} \label{17.95}\]

    where \(i, j = 0, 1, \dots , n\). These are Hamilton’s extended canonical equations of motion expressed in terms of the system evolution parameter \(s\). The extended Poisson bracket representation is a trivial extension of the conventional canonical equations presented in chapter \(15.3\).

    Extended canonical transformation and Hamilton-Jacobi theory

    Struckmeier[Str08] presented plausible extended versions of canonical transformation and Hamilton-Jacobi theories that can be used to provide a Lorentz-invariant formulation of Hamiltonian mechanics for relativistic one-body systems. A detailed description can be found in Struckmeier[Str08].1

    Validity of the extended Hamilton-Lagrange formalism

    It has been shown that the extended Lagrangian and Hamiltonian formalism, based on the parametric model of Lanczos[La49], leads to a plausible manifestly-covariant approach for the one-body system. The general features developed for handling Lagrangian and Hamiltonian mechanics carry over to the Special Theory of Relativity assuming the use of a non-standard, extended Lagrangian or Hamiltonian. This expansion of the range of validity of the well-known Hamiltonian and Lagrangian mechanics into the relativistic domain is important, and reduces any Lorentz transformation to a canonical transformation. The validity of this extended Hamilton-Lagrange formalism has been criticized, and problems exist extending this approach to the \(N\)-body system for \(N > 1\). For example, as discussed by Goldstein[Go50] and Johns[Jo05], each of the \(N\) moving bodies have their own world lines and momenta. Defining the total momentum \(\mathbf{P}\) requires knowing simultaneously the momenta of the individual bodies, but simultaneity is body dependent and thus even the total momentum is not a simple four vector. A general method is required that will allow using a manifestly-covariant Lagrangian or Hamiltonian for the \(N\)-body system. For the one-body system, the extended Hamilton-Lagrange formalism provides a powerful and logical approach to exploit analytical mechanics in the relativistic domain that retains the form of the conventional Lagrangian/Hamiltonian formalisms. Note that Noether’s theorem relating energy and time is readily apparent using the extended formalism.

    Example \(\PageIndex{1}\): The Bohr-Sommerfeld hydrogen atom

    The classical relativistic hydrogen atom was first solved by Sommerfeld in 1916. Sommerfeld used Bohr’s “old quantum theory” plus Hamiltonian mechanics to make an important step in the development of quantum mechanics by obtaining the first-order expressions for the fine structure of the hydrogen atom. As in the non-relativistic case, the motion is confined to a plane allowing use of planar polar coordinates. Thus the relativistic Lagrangian is given by

    \[\begin{align*} L &= −\frac{mc^2}{ \gamma } − U \\[4pt] &= −mc^2 \sqrt{ 1 − \frac{\dot{r}^2 + r^2 \dot{\theta}^2}{ c^2}} + \frac{ke^2}{ r} \end{align*}\]

    The canonical momenta are given by

    \[\begin{align*} p_{\theta} &= \frac{\partial L}{ \partial \dot{\theta}} = m\gamma r^2 \dot{\theta} \\[4pt] p_r &= \frac{\partial L}{ \partial \dot{r}} = m\gamma \dot{r} \\[4pt] \dot{p}_{\theta} &= \frac{\partial L}{ \partial \theta} = 0 \\[4pt] \dot{p}_r &= \frac{\partial L}{ \partial r} = m\gamma r \dot{\theta}^2 + k \frac{e^2}{ r^2} \end{align*}\]

    As for the non-relativistic case, \(\theta\) is a cyclic variable and thus the angular momentum \(p_{\theta} = m\gamma r^2 \dot{\theta}\) is conserved.

    Figure \(\PageIndex{1}\): The advance of the perihelion of bound orbits due to the dependence of the relativistic mass on velocity.

    The relativistic Hamiltonian for the Coulomb potential between an electron and the proton, assuming that the motion is confined to a plane, which allows use of planar polar coordinates, leads to

    \[H = \sqrt{ p^2_rc^2 + \frac{p^2_{\theta} c^2}{ r^2} + m^2c^4} − \frac{ke^2}{r} \nonumber\]

    The same equations of motion are obtained using Hamiltonian mechanics, that is:

    \[\begin{align*} \dot{\theta} &= \frac{\partial H}{ \partial p_{\theta}} = \frac{p_{\theta}}{ m\gamma r^2} \\[4pt] \dot{r} &= \frac{\partial H}{ \partial p_r} = \frac{p_r}{ m\gamma} \\[4pt] \dot{p}_{\theta} &= −\frac{\partial H}{ \partial \theta} = 0 \\[4pt] \dot{p}_r &= −\frac{\partial H}{ \partial r} = m \gamma r \dot{\theta}^2 + k \frac{e^2 }{r^2} \end{align*}\]

    The radial dependence can be solved using either Lagrangian or Hamiltonian mechanics, but the solution is non-trivial. Using the same techniques applied to solve Kepler’s problem, leads to the radial solution

    \[r = \frac{q}{ 1 + \epsilon \cos [\Gamma (\theta − \theta_0]} \quad \Gamma = \sqrt{ 1 − \frac{e^4}{ c^2 p^2_{\theta}}} \quad q = \frac{c^2\Gamma^2 p^2_{\theta}}{e^2 E} \quad \epsilon = \sqrt{1 + \frac{\Gamma^2 (1 − \frac{m^2c^4}{E^2} )}{ 1 − \Gamma^2}} \nonumber\]

    The apses are \(r_{\text{min}} = \frac{q}{ (1+\epsilon)}\) for \(\Gamma (\theta − \theta_0) = 0\), \(2\pi , 4\pi \), and \(r_{\text{max}} = \frac{q}{ (1−\epsilon)}\) for \(\Gamma (\theta − \theta_0) = \pi , 3\pi ,\). The perihelion advances between cycles due to the change in relativistic mass during the trajectory as shown in (Figure \(\PageIndex{1}\)). This precession leads to the fine structure observed in the optical spectra of the hydrogen atom. The same precession of the perihelion occurs for planetary motion, however, there is a comparable size effect due to gravity that requires use of general relativity to compute the trajectories.

    1Note that Greiner[Gr10] includes a reproduction of the Struckmeier paper[Str08].

    This page titled 17.7: Lorentz-invariant formulations of Hamiltonian Mechanics is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Douglas Cline via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.