Skip to main content
Physics LibreTexts

6.5: Constrained Systems

  • Page ID
    9598
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    The motion for systems subject to constraints is difficult to calculate using Newtonian mechanics because all the unknown constraint forces must be included explicitly with the active forces in order to determine the equations of motion. Lagrangian mechanics avoids these difficulties by allowing selection of independent generalized coordinates that incorporate the correlated motion induced by the constraint forces. This allows the constraint forces acting on the system to be ignored by reducing the system to a minimal set of generalized coordinates. The holonomic constraint forces can be determined using the Lagrange multiplier approach, or all constraint forces can be determined by including them as generalized forces, as described below.

    Choice of generalized coordinates

    As discussed in chapter \(5.8\), the flexibility and freedom for selection of generalized coordinates is a considerable advantage of Lagrangian mechanics when handling constrained systems. The generalized coordinates can be any set of independent variables that completely specify the scalar action functional, equation \((6.4.1)\). The generalized coordinates are not required to be orthogonal as is required when using the vectorial Newtonian approach. The secret to using generalized coordinates is to select coordinates that are perpendicular to the constraint forces so that the constraint forces do no work. Moreover, if the constraints are rigid, then the constraint forces do no work in the direction of the constraint force. As a consequence, the constraint forces do not contribute to the action integral and thus the \(\sum_{i}^{n}\mathbf{f}_{i}^{C}\cdot \delta \mathbf{r}_{i}\) term in equation \((6.3.2)\) can be omitted from the action integral. Generalized coordinates allow reducing the number of unknowns from \(n\) to \(s=n-m\) when the system has \(m\) holonomic constraints. In addition, generalized coordinates facilitate using both the Lagrange multipliers, and the generalized forces, approaches for determining the constraint forces.

    Minimal set of generalized coordinates

    The set of \(n\) generalized coordinates \(q_{i}\) are used to describe the motion of the system. No restrictions have been placed on the nature of the constraints other than they are workless for a virtual displacement. If the \(m\) constraints are holonomic, then it is possible to find sets of \(s=n-m\) independent generalized coordinates \(q_{j}\) that contain the \(m\) constraint conditions implicitly in the transformation equations \[\label{6.49} \mathbf{r}_{i}=\mathbf{r}_{i}(q_{1},q_{2},q_{3}\dots ,q_{s},t)\]

    For the case of \(s=n-m\) unknowns, any virtual displacement \(\delta q_{j}\) is independent of \(\delta q_{k}\), therefore the only way for \((6.3.27)\) to hold is for the term in brackets to vanish for each value of \(j\), that is

    \[\label{6.50} \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) -\frac{ \partial L}{\partial q_{j}}\right\} =Q_{j}^{EX}\] where \(j=1,2,3,..\) \(s.\) These are the Lagrange equations for the minimal set of \(s\) independent generalized coordinates.

    If all the generalized forces are conservative plus velocity independent, and are included in the potential \(U,\) and \(Q_{j}^{EX}=0\), then \ref{6.50} simplifies to

    \[\label{6.51} \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) - \frac{\partial L}{\partial q_{j}}\right\} =0\]

    This is Euler’s differential equation, derived earlier using the calculus of variations. Thus d’Alembert’s Principle leads to a solution that minimizes the action integral \(\delta \int_{t_{1}}^{t_{2}}Ldt=0\) as stated by Hamilton’s Principle.

    Lagrange multipliers approach

    Equation \((6.3.27)\) sums over all \(n\) coordinates for \(N\) particles, providing \(n\) equations of motion. If the \(m\) constraints are holonomic they can be expressed by \(m\) algebraic equations of constraint

    \[\label{6.52} g_{k}(q_{1},q_{2},..q_{n},t)=0\]

    where \(k=1,2,3,\dots m.\) Kinematic constraints can be expressed in terms of the infinitessimal displacements of the form

    \[\label{6.53} \sum_{j=1}^{n} \frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q},t)dq_{j}+\frac{\partial g_{k}}{\partial t}dt=0\]

    where \(k=1,2,3,\dots m\), \(j=1,2,3,\dots n\), and where the \(\frac{\partial g_{k}}{ \partial q_{j}}\), and \(\frac{\partial g_{k}}{\partial t}\) are functions of the generalized coordinates \(q_{j}\), described by the vector \(\mathbf{q,}\) that are derived from the equations of constraint. As discussed in chapter \(5.7\), if \ref{6.53} represents the total differential of a function, then it can be integrated to give a holonomic relation of the form of Equation \ref{6.52}. However, if \ref{6.53} is not the total differential, then it can be integrated only after having solved the full problem. If \(\frac{\partial g_{k}}{\partial t}=0\) then the \(k^{th}\) constraint is scleronomic.

    The discussion of Lagrange multipliers in chapter \(5.9.1\), showed that, for virtual displacements \(\delta q_{j},\) the correlation of the generalized coordinates, due to the constraint forces, can be taken into account by multiplying \ref{6.53} by unknown Lagrange multipliers \(\lambda _{k}\) and summing over all \(m\) constraints. Generalized forces can be partitioned into a Lagrange multiplier term plus a remainder force. That is

    \[\label{6.54} Q_{j}^{EX}=\sum_{k=1}^{m}\lambda _{k} \frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q},t)+Q_{j}^{EXC}\]

    since by definition \(\delta t=0\) for virtual displacements.

    Chapter \(5.9.1\) showed that holonomic forces of constraint can be taken into account by introducing the Lagrange undetermined multipliers approach, which is equivalent to defining an extended Lagrangian \(L^{\prime }(\mathbf{q,\dot{ q},\lambda ,}t)\) where

    \[\label{6.55} L^{\prime }(\mathbf{q,\dot{q},\lambda ,}t)=L(\mathbf{q,\dot{q},} t)+\sum_{k=1}^{m}\sum_{j=1}^{n}\lambda _{k}\frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q},t)\]

    Finding the extremum for the extended Lagrangian \(L^{\prime }(\mathbf{q,\dot{ q},\lambda ,}t)\) using \((6.4.2)\) gives

    \[\label{6.56} \sum_{j}^{n}\left[ \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) -\frac{\partial L}{\partial q_{j}}\right\} -\sum_{k=1}^{m}\lambda _{k}\frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q} ,t)-Q_{j}^{EXC}\right] \delta q_{j}=0\]

    where \(Q_{j}^{EXC}\) is the remaining part of the generalized force \(Q_{j}\) after subtracting both the part of the force absorbed in the potential energy \(U\), which is buried in the Lagrangian \(L\), as well as the holonomic constraint forces which are included in the Lagrange multiplier terms \(\sum_{k=1}^{m}\lambda _{k}\frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q} ,t)\). The \(m\) Lagrange multipliers \(\lambda _{k}\) can be chosen arbitrarily in \ref{6.56}. Utilizing the free choice of the \(m\) Lagrange multipliers \(\lambda _{k}\) allows them to be determined in such a way that the coefficients of the first \(m\) infinitessimals, i.e. the square brackets vanish. Therefore the expression in the square bracket must vanish for each value of \(\ 1\leq j\leq m\). Thus it follows that

    \[\label{6.57} \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) - \frac{\partial L}{\partial q_{j}}\right\} -\sum_{k=1}^{m}\lambda _{k}\frac{ \partial g_{k}}{\partial q_{j}}(\mathbf{q},t)-Q_{j}^{EXC}=0\]

    when \(j=1,2,..m.\) Thus \ref{6.56} reduces to a sum over the remaining coordinates between \(m+1\leq j\leq n\)

    \[\label{6.58} \sum_{j=m+1}^{n}\left[ \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) -\frac{\partial L}{\partial q_{j}}\right\} -\sum_{k=1}^{m}\lambda _{k}\frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q} ,t)-Q_{j}^{EXC}\right] \delta q_{j}=0\]

    In Equation \ref{6.58} the \(s=n-m\) infinitessimals \(\delta q_{j}\) can be chosen freely since the \(s=n-m\) degrees of freedom are independent. Therefore the expression in the square bracket must vanish for each value of \(m+1\leq j\leq n\). Thus it follows that

    \[\label{6.59} \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) - \frac{\partial L}{\partial q_{j}}\right\} -\sum_{k=1}^{m}\lambda _{k}\frac{ \partial g_{k}}{\partial q_{j}}(\mathbf{q},t)-Q_{j}^{EXC}=0\]

    where \(j=m+1,m+2,..n.\) Combining equations \ref{6.57} and \ref{6.59} then gives the important general relation that for \(1\leq j\leq n\) \[\label{6.60} \left\{ \frac{d}{dt}\left( \frac{\partial L}{\partial \dot{q}_{j}}\right) - \frac{\partial L}{\partial q_{j}}\right\} =\sum_{k=1}^{m}\lambda _{k}\frac{ \partial g_{k}}{\partial q_{j}}(\mathbf{q},t)+Q_{j}^{EXC}\]

    To summarize, the Lagrange multiplier approach \ref{6.60} automatically solves the \(n\) equations plus the \(m\) holonomic equations of constraint, which determines the \(n+m\) unknowns, that is, the \(n\) coordinates plus the \(m\) forces of constraint. The beauty of the Lagrange multipliers is that all \(n\) variables, plus the \(m\) constraint forces, are found simultaneously by using the calculus of variations to determine the extremum for the expanded Lagrangian \(L^{\prime }(\mathbf{q, \dot{q},\lambda ,}t)\).

    Generalized forces approach

    The two right-hand terms in \ref{6.60} can be understood to be those forces acting on the system that are not absorbed into the scalar potential \(U\) component of the Lagrangian \(L\). The Lagrange multiplier terms \(\sum_{k=1}^{m} \lambda _{k} \frac{\partial g_{k}}{\partial q_{j}}(\mathbf{q},t)\) account for the holonomic forces of constraint that are not included in the conservative potential or in the generalized forces \(Q_{j}^{EXC}\). The generalized force

    \[\label{6.61}Q_{j}^{EXC}=\sum_{i}^{n}\mathbf{F}_{i}^{A}\cdot \frac{\partial \mathbf{r}_{i} }{\partial q_{j}} \]

    is the sum of the components in the \(q_{j}\) direction for all external forces that have not been taken into account by the scalar potential or the Lagrange multipliers. Thus the non-conservative generalized force \(Q_{j}^{EXC}\) contains non-holonomic constraint forces, including dissipative forces such as drag or friction, that are not included in \(U,\) or used in the Lagrange multiplier terms to account for the holonomic constraint forces.

    The concept of generalized forces is illustrated by the case of spherical coordinate systems. The attached table gives the displacement elements \(\delta q_{i}\), (taken from table \(C4\)) and the generalized force for the three coordinates. Note that \(Q_{i}\) has the dimensions of force and \(Q_{i}.\delta q_{i}\) has the units of energy. By contrast equation \((6.3.13)\) gives that \(Q_{\theta }=F_{\theta }r\) and \(Q_{\phi }=F_{\phi }r\) which have the dimensions of torque. However, \(Q_{\theta }\delta \theta\) and \(Q_{\phi }\delta \phi\) both have the dimensions of energy as is required in equation \((6.3.13)\). This illustrates that the units used for generalized forces depend on the units of the corresponding generalized coordinate.

    Unit vectors \(\delta q_{i}\) \(Q_{i}\) \(Q_{i}\cdot \delta q_{i}\)
    \(\hat{r}\) \(\mathbf{\hat{r}}dr\) \(\mathbf{\hat{r}}F_{r}\) \(F_{r}dr\)
    \(\mathbf{\hat{\theta}}\) \(\mathbf{\hat{\theta}}rd\theta\) \(\mathbf{\hat{ \theta}}F_{\theta }r\) \(F_{\theta }rd\theta\)
    \(\mathbf{\hat{\phi}}\) \(\mathbf{\hat{\phi}}r\sin \theta d\phi\) \(\mathbf{ \hat{\phi}}F_{\phi }r\sin \theta\) \(F_{\phi }r\sin \theta d\phi\)

    This page titled 6.5: Constrained Systems is shared under a CC BY-NC-SA 4.0 license and was authored, remixed, and/or curated by Douglas Cline via source content that was edited to the style and standards of the LibreTexts platform; a detailed edit history is available upon request.