4.2: Inertia Tensor
( \newcommand{\kernel}{\mathrm{null}\,}\)
Since the dynamics of each point of a rigid body is strongly constrained by the conditions rkk ’ = const, this is one of the most important fields of application of the Lagrangian formalism discussed in Chapter 2. For using this approach, the first thing we need to calculate is the kinetic energy of the body in an inertial reference frame. Since it is just the sum of the kinetic energies (1.19) of all its points, we can use Eq. (10) to write: 4 T≡∑m2v2=∑m2(v0+ω×r)2=∑m2v20+∑mv0⋅(ω×r)+∑m2(ω×r)2. Let us apply to the right-hand side of Eq. (11) two general vector analysis formulas listed in the Math Appendix: the so-called operand rotation rule MA Eq. (7.6) to the second term, and MA Eq. (7.7b) to the third term. The result is T=∑m2v20+∑mr⋅(v0×ω)+∑m2[ω2r2−(ω⋅r)2]. This expression may be further simplified by making a specific choice of the point 0 (from that the radius vectors r of all particles are measured), namely by using for this point the center of mass of the body. As was already mentioned in Sec. 3.4 for the 2-point case, the radius vector R of this point is defined as MR≡∑mr,M≡∑m, where M is the total mass of the body. In the reference frame centered as this point, R=0, so that the second sum in Eq. (12) vanishes, and the kinetic energy is a sum of just two terms: T=Ttran +Trot ,Ttran ≡M2V2,Trot ≡∑m2[ω2r2−(ω⋅r)2] where V≡dR/dt is the center-of-mass velocity in our inertial reference frame, and all particle positions r are measured in the center-of-mass frame. Since the angular velocity vector ω is common for all points of a rigid body, it is more convenient to rewrite the rotational energy in a form in that the summation over the components of this vector is clearly separated from the summation over the points of the body: Trot=123∑j,j′=1Ijj′ωjωj′, where the 3×3 matrix with elements Ijj′≡∑m(r2δjj′−rjrj′) is called the inertia tensor of the body. 5
Actually, the term "tensor" for this matrix has to be justified, because in physics this term implies a certain reference-frame-independent notion, whose elements have to obey certain rules at the transfer between reference frames. To show that the matrix (16) indeed deserves its title, let us calculate another key quantity, the total angular momentum L of the same body. 6 Summing up the angular momenta of each particle, defined by Eq. (1.31), and then using Eq. (10) again, in our inertial reference frame we get L≡∑r×p=∑mr×v=∑mr×(v0+ω×r)≡∑mr×v0+∑mr×(ω×r). We see that the momentum may be represented as a sum of two terms. The first one, L0≡∑mr×v0=MR×v0, describes the possible rotation of the center of mass around the inertial frame’s origin. This term evidently vanishes if the moving reference frame’s origin 0 is positioned at the center of mass (where R =0). In this case, we are left with only the second term, which describes a pure rotation of the body about its center of mass: L=Lrot≡∑mr×(ω×r) Using one more vector algebra formula, the "bac minis cab" rule, 7 we may rewrite this expression as L=∑m[ωr2−r(r⋅ω)]. Let us spell out an arbitrary Cartesian component of this vector: Lj=∑m[ωjr2−rj3∑j′=1rj′ωj′]≡∑m3∑j′=1ωj′(r2δij′−rjrj′). By changing the summation order and comparing the result with Eq. (16), the angular momentum may be conveniently expressed via the same matrix elements Iij ’ as the rotational kinetic energy: Lj=3∑j′=1Ijj′ωj′. Since L and ω are both legitimate vectors (meaning that they describe physical vectors independent of the reference frame choice), their connection, the matrix of elements Iij, is a legitimate tensor. This fact, and the symmetry of the tensor (Iij′=Ij′j), which is evident from its definition (16), allow the tensor to be further simplified. In particular, mathematics tells us that by a certain choice of the axes’ orientations, any symmetric tensor may be reduced to a diagonal form Ijj′=Ijδjj′, where in our case Ij=∑m(r2−r2j)=∑m(r2j′+r2j′)≡∑mρ2j, being the distance of the particle from the jth axis, i.e. the length of the perpendicular dropped from the point to that axis. The axes of such a special coordinate system are called the principal axes, while the diagonal elements Ij given by Eq. (24), the principal moments of inertia of the body. In such a special reference frame, Eqs. (15) and (22) are reduced to very simple forms: Trot=3∑j=1Ij2ω2j,Lj=Ijωj. Both these results remind the corresponding relations for the translational motion, Ttran =MV2/2 and P= MV, with the angular velocity ω replacing the linear velocity V, and the tensor of inertia playing the role of scalar mass M. However, let me emphasize that even in the specially selected reference frame, with axes pointing in principal directions, the analogy is incomplete, and rotation is generally more complex than translation, because the measures of inertia, Ij, are generally different for each principal axis.
Let me illustrate this fact on a simple but instructive system of three similar massive particles fixed in the vertices of an equilateral triangle (Figure 3).

Due to the symmetry of the configuration, one of the principal axes has to pass through the center of mass 0 and be normal to the plane of the triangle. For the corresponding principal moment of inertia, Eq. (24) readily yields I3=3mρ2. If we want to express the result in terms of the triangle side a, we may notice that due to the system’s symmetry, the angle marked in Figure 3 equals π/6, and from the shaded right triangle, a/2=ρcos(π/6)≡ρ√3/2, giving ρ=a/√3, so that, finally, I3=ma2.
Let me use this simple case to illustrate the following general axis shift theorem, which may be rather useful - especially for more complex systems. For that, let us relate the inertia tensor components Ijj′ and I′jj′, calculated in two reference frames - one with the origin in the center of mass 0 , and another one (0′) displaced by a certain vector d (Figure 4a), so that for an arbitrary point, r′=r+d. Plugging this relation into Eq. (16), we get I′′jj′=∑m[(r+d)2δjj′−(rj+dj)(rj′+dj′)]=∑m[(r2+2r⋅d+d2)δjj′−(rjrj′+rjdj′+rj′dj+djdj′)]. Since in the center-of-mass frame, all sums ∑mrj equal zero, we may use Eq. (16) to finally obtain I′jj′=Ijj′+M(δjj′d2−djdj′). In particular, this equation shows that if the shift vector d is perpendicular to one (say, jth ) of the principal axes (Figure 4 b ), i.e. dj=0, then Eq. (28) is reduced to a very simple formula:
Principal axis’

Now returning to the system shown in Figure 3, let us perform such a shift to the new ("primed") axis passing through the location of one of the particles, still perpendicular to the particles’ plane. Then the contribution of that particular mass to the primed moment of inertia vanishes, and I′3=2ma2. Now, returning to the center of mass and applying Eq. (29), we get I3=I3−Mρ2=2ma2−(3m)(a/√3)2=ma2, i.e. the same result as above.
The symmetry situation inside the triangle’s plane is somewhat less evident, so let us start with calculating the moments of inertia for the axes shown vertical and horizontal in Figure 3. From Eq. (24) we readily get: I1=2mh2+mρ2=m[2(a2√3)2+(a√3)2]=ma22,I2=2m(a2)2=ma22, where h is the distance from the center of mass and any side of the triangle: h=ρsin(π/6)=ρ/2= a/2√3. We see that I1=I2, and mathematics tells us that in this case any in-plane axis (passing through the center-of-mass 0 ) may be considered as principal, and has the same moment of inertia. A rigid body Symmetric
top: with this property, I1=I2≠I3, is called the symmetric top. (The last direction is called the main principal with this property, I1=I2≠I3, is called the symmetric top. (The last direction is called the main principal axis of the system.)
Despite the symmetric top’s name, the situation may be even more symmetric in the so-called spherical tops, i.e. highly symmetric systems whose principal moments of inertia are all equal, I1=I2=I3≡I, Mathematics says that in this case, the moment of inertia for rotation about any axis (but still passing through the center of mass) is equal to the same I. Hence Eqs. (25) and (26) are further simplified for any direction of the vector ω : Trot =I2ω2,L=Iω thus making the analogy of rotation and translation complete. (As will be discussed in the next section, this analogy is also complete if the rotation axis is fixed by external constraints.)
Evident examples of a spherical top are a uniform sphere and a uniform spherical shell; a less obvious example is a uniform cube - with masses either concentrated in vertices, or uniformly spread over the faces, or uniformly distributed over the volume. Again, in this case any axis passing through the center of mass is principal and has the same principal moment of inertia. For a sphere, this is natural; for a cube, rather surprising - but may be confirmed by a direct calculation.
4 Actually, all symbols for particle masses, coordinates, and velocities should carry the particle’s index, over which the summation is carried out. However, in this section, for the notation simplicity, this index is just implied.
5 While the ABCs of the rotational dynamics were developed by Leonhard Euler in 1765 , an introduction of the inertia tensor’s formalism had to wait very long - until the invention of the tensor analysis by Tullio Levi-Civita and Gregorio Ricci-Curbastro in 1900 - soon popularized by its use in Einstein’s theory of general relativity.
6 Hopefully, there is very little chance of confusing the angular momentum L (a vector) and its Cartesian components Lj (scalars with an index) on one hand, and the Lagrangian function L (a scalar without an index) on the other hand.
7 See, e.g., MA Eq. (7.5).