Skip to main content
\(\require{cancel}\)
Physics LibreTexts

3.13: The Virial Theorem

  • Page ID
    8827
  • [ "article:topic", "second moment of inertia", "Virial Theorem", "authorname:tatumj" ]

    First, let me say that I am not sure how this theorem got its name, other than that my Latin dictionary tells me that vis, viris means force, and its plural form, vires, virium is generally translated as strength. The term was apparently introduced by Rudolph Clausius of thermodynamics fame. We do not use the word strength in any particular technical sense in classical mechanics, although we do talk about the tensile strength of a wire, which is the force that it can summon up before it snaps. We use the word energy to mean the ability to do work; perhaps we could use the word strength to mean the ability to exert a force. But enough of these idle speculations.

    Before proceeding, I define the quantity

    \[ \boldsymbol\iota = \sum_i m_{i}r_{i}^{2} \label{eq:3.13.1}\]

    as the second moment of mass of a system of particles with respect to the origin. As discussed in Chapter 2, mass is (apart from some niceties in general relativity) synonymous with inertia, and the second moment of mass is used so often that it is nearly always called simply “the” moment of inertia, as though there were only one moment, the second, worth considering. Note carefully, however, that you are probably much more used to thinking about the moment of inertia with respect to an axis rather than with respect to a point. This distinction is discussed in Section 2.19. Note also that, since the symbol \( I\) tends to be heavily used in any discussion of moments of inertia, for moment of inertia with respect to a point I am using the symbol \( \boldsymbol\iota\)

    I can also write Equation \( \ref{eq:3.13.1}\) as

    \[\boldsymbol\iota = \sum_i m_{i}({\bf r}_{i} \times {\bf r}_{i}) \label{eq:3.13.2} \]

    Differentiate twice with respect to time:

    \[\dot{\boldsymbol\iota} = 2\sum_i m_{i}({\bf r}_{i}\dot{\bf r}_{i}), \label{eq:3.13.3} \]

    and

    \[\ddot{\boldsymbol\iota} = 2\sum_i m_{i}(\dot{\bf r}_{i}^{2}+ {\bf r}_{i}\ddot{\bf r}_{i}), \label{eq:3.13.4} \]

    or

    \[\ddot{\boldsymbol\iota} = 4T + 2 \sum_i{\bf r}_{i}m_{i}\ddot{\bf r}_{i}, \label{eq:3.13.5} \]

    where \( T\) is the kinetic energy of the system of particles. The sums are understood to be over all particles - i.e. \( i\) from 1 to \( n\).

    \(m_{i} \ddot{\bf r}_{i}\) is the force on the \( i\)th particle. I am now going to suppose that there are no external forces on any of the particles in the system, but the particles interact with each other with conservative forces, \( {\bf F}_{ij}\) being the force exerted on particle \( i\) by particle \( j\). I am also going to introduce the notation \( {\bf r} _{ij} = {\bf r} _{i} -{\bf r} _{i}\), which is a vector directed from particle \( i\) to particle \( j\). The relation between these three vectors in shown in Figure III.8.

    I have not drawn the force \( {\bf F}_{ij}\), but it will be in the opposite direction to \( {\bf r}_{ji}\) if it is a repulsive force and in the same direction as \( {\bf r}_{ji}\) if it is an attractive force.

    The total force on particle i is \(\sum_{j\neq i}{\bf F}_{ij}\) and this is equal to \( m_{i} \ddot{\bf r}_{i} \) Therefore, Equation \( \ref{eq:3.13.5}\) becomes

    \[\ddot {\boldsymbol\iota} = 4T + 2\sum_i {\bf r}_{i} \sum_{j\neq i}{\bf F}_{ij} \label{eq:3.13.6} \]

    Now it is clear that

    \[\sum_{i}{\bf r}_{i} \sum_{j\neq i} {\bf F}_{ij} = \sum_i \sum_{j<i}{\bf r}_{ij}{\bf F}_{ij} \label{eq:3.13.7} \]

    However, in case, like me, you find double subscripts and summations confusing and you have really no idea what Equation \( \ref{eq:3.13.7}\) means, and it is by no means at all clear, I write it out in full in the case where there are five particles. Thus:

    \(\sum_i {\bf r}_i \sum_{j\neq i } {\bf F}_{ij} = {\bf r}_{1}({\bf F}_{12}+ {\bf F}_{13}+{\bf F}_{14}+{\bf F}_{15})\)

    \(+ {\bf r}_{2}({\bf F}_{21}+ {\bf F}_{23}+{\bf F}_{24}+{\bf F}_{25})\)

    \(+ {\bf r}_{3}({\bf F}_{31}+ {\bf F}_{33}+{\bf F}_{34}+{\bf F}_{35})\)

    \(+ {\bf r}_{4}({\bf F}_{41}+ {\bf F}_{43}+{\bf F}_{44}+{\bf F}_{45})\)

    \(+ {\bf r}_{5}({\bf F}_{51}+ {\bf F}_{53}+{\bf F}_{54}+{\bf F}_{55})\)

    Now apply Newton’s third law of motion:

    \(\sum_i r_i \sum_{j\neq i } {\bf F}_{ij} = r_{1}(-{\bf F}_{21}- {\bf F}_{31}-{\bf F}_{41}+{\bf F}_{51})\)

    \(+ {\bf r}_{2}({\bf F}_{21}- {\bf F}_{32}-{\bf F}_{42}-{\bf F}_{52})\)

    \(+ {\bf r}_{3}({\bf F}_{31}- {\bf F}_{32}-{\bf F}_{43}-{\bf F}_{53})\)

    \(+ {\bf r}_{4}({\bf F}_{41}- {\bf F}_{42}-{\bf F}_{43}-{\bf F}_{54})\)

    \(+ {\bf r}_{5}({\bf F}_{51}- {\bf F}_{52}-{\bf F}_{53}+{\bf F}_{54})\)

    Now bear in mind that \({\bf r}_{2} - {\bf r}_{1} = {\bf r}_{21}\), and we see that this becomes

    \(\sum_{i} r_{i} \sum {j \neq i} {\bf F}_{ij} = {\bf F}_{21}*r_{21}+{\bf F}_{31}*r_{31} + {\bf F}_{41}*r_{41}+ {\bf F}_{51}*r_{51}\)

    \({\bf F}_{32}*{\bf r}_{32} + {\bf F}_{42}*{\bf r}_{42} + {\bf F}_{52}*{\bf r}_{52}\)

    \(+ {\bf F}_{43}*{\bf r}_{43} + {\bf F}_{53}*{\bf r}_{53}\)

    \(+ {\bf F}_{54}*{\bf r}_{54}\)

    and we have arrived at Equation \( \ref{eq:3.13.7}\). Equation \( \ref{eq:3.13.6}\) then becomes

    \[\ddot{\boldsymbol\iota} = 4T + 2\sum_{i}\sum_{j<i}{\bf r}_{ij}{\bf F}_{ij} \label{eq:3.13.8} \]

    This is the most general form of the virial Equation. It tells us whether the cluster is going to disperse (\(\ddot{\boldsymbol\iota}\)positive) or collapse ( \(\ddot{\boldsymbol\iota}\)negative) – though this will evidently depend on the nature of the force law \( {\bf F}_{ij}\).

    Now suppose that the particles attract each other with a force that is inversely proportional to the \( n\)th power of their distance apart. For gravitating particles, of course, \( n\) = 2. The force between two particles can then be written in various forms, such as

    \[ {\bf F}_{ij} = -{\bf F}_{ij} \hat{\bf r}_{ij} = -\frac{k}{\bf r_{ij}^{n}}\hat{\bf r}_{ij} = - \frac{k}{r_{ij}^{n+1}}{\bf r}_{ij} \label{eq:3.13.9} \]

    and the mutual potential energy between two particles is minus the integral of \({\bf F}_{ij}dr\), or

    \[ U_{ij} = -\frac{k}{(n-1){\bf r}^{n-1}} \label{eq:3.13.10} \]

    I now suppose that the forces between the particles are gravitational forces, such that

    \[ {\bf F}_{ij} = -\frac{Gm_{i}m_{j}}{\bf r^{3}_{ij}}{\bf r}_{ij} \label{eq:3.13.11} \]

    Now return to the term \({\bf r}_{ij} {\bf F}_{ij}\) which occurs in Equation \( \ref{eq:3.13.8}\):

    \[ {\bf r}_{ij} {\bf F}_{ij} = -\frac{k}{\bf r_{ij}^{n+1}}{\bf r}_{ij}{\bf r}_{ij} = -\frac{k}{r_{ij}^{n-1}}= (n-1)U_{ij} \label{eq:3.13.12} \]

    Thus Equation \( \ref{eq:3.13.8}\) becomes

    \[\ddot{\boldsymbol\iota} = 4T + 2(n-1)U, \label{eq:3.13.13} \]

    where \( T\) and \( U\) are the kinetic and potential energies of the system. Note that for gravitational interaction (or any attractive) forces, the quantity \( U\) is negative. Equation \( \ref{eq:3.13.13}\) is the virial theorem for a system of particles with an \(r^{-2}\) attractive force between them. The system will disperse or collapse according the sign of \(\ddot{\boldsymbol\iota}\) For a system of gravitationally-interacting particles, \( n = 2\), and so the virial theorem takes the form

    \[\ddot{\boldsymbol\iota} = 4T + 2U \label{eq:3.13.14} \]

    changing from moment to moment, but always in such a manner that Equation \( \ref{eq:3.13.13}\) is satisfied.

    In a stable, bound system, by which I mean that, over a long period of time, there is no long-term change in the moment of inertia of the system, and the system is neither irreversibly dispersing or contracting, that is to say in a system in which the average value of \(\ddot{\boldsymbol\iota}\) over a long period of time is zero (I’ll define “long” soon), the virial theorem for a stable, bound system of r-n particles takes the form

    \[ 2 \langle t \rangle + (n-1)<u> = 0, \label{eq:3.13.15} \]

    and for a stable system of gravitationally-interacting particles,

    \[ 2 \langle t \rangle + <u> = 0, \label{eq:3.13.16} \]

    Here the angular brackets are understood to mean the average values of the kinetic and potential energies over a long period of time. By a “long” period we mean, for example, long compared with the time that a particle takes to cross from one side of the system to the other, or long compared with the time that a particle takes to move in an orbit around the centre of mass of the system. (In the absence of external forces, of course, the centre of mass does not move, or it moves with a constant velocity.)

    For example, if a bound cluster of stars occupies a spherical volume of uniform density, the potential energy is \(\frac{3GM^{2}}{5a}\) (Equation 5.9.1 of Celestial Mechanics), so the virial theorem (Equation \( \ref{eq:3.13.16}\)) will enable you to work out the mean kinetic energy and hence speed of the stars. A globular cluster has roughly spherical symmetry, but it is not of uniform density, being centrally condensed. If you assume some functional form for the density distribution, this will give a slightly different formula for the potential energy, and you can then still use the virial theorem to calculate the mean kinetic energy.

    Example \(\PageIndex{1}\)

    Consider a planet of mass \( m\) moving in a circular orbit of radius \( a\) around a Sun of mass \( M\), such that \( m\)<<\( M\) and the Sun does not move.

    The potential energy of the system is \( U=-GM\frac{m}{a}\).
    The speed of the planet is given by equating \( \frac{mv^{2}}{a}\) to \( \frac{GMm}{a^{2}}\), from which \( T=GM\frac{m}{(2a)}\),

    so we easily see in this case that \( 2T+U=0\).

    Contributor