24.8: Diagonalizing the Inertia Tensor

Last updated
Save as PDF

Page ID: 30519

\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\)

The inertial tensor has the form of a real symmetric matrix. By an appropriate choice of axes \(\left(x_{1}, x_{2}, x_{3}\right)\) any such tensor can be put in diagonal form, so that

\begin{equation}
T_{\text {rot }}=\frac{1}{2}\left(I_{1} \Omega_{1}^{2}+I_{2} \Omega_{2}^{2}+I_{3} \Omega_{3}^{2}\right)
\end{equation}

These axes, with respect to which the inertia tensor is diagonal, are called the principal axes of inertia, the moments about them \(I_{1}, I_{2}, I_{3}\) the principal moments of inertia.

If you’re already familiar with the routine for diagonalizing a real symmetric matrix, you can skip this review.

The diagonalization of the tensor/matrix proceeds as follows.

First, find the eigenvalues \(\lambda_{i}\) and corresponding eigenvectors \(\mathbf{e}_{i}\) of the inertial tensor \(I\) :

\begin{equation}
\mathbf{I e}_{\mathbf{i}}=\lambda_{i} \mathbf{e}_{\mathbf{i}}(i=1,2,3, \text { not summed })
\end{equation}

(The \(\lambda_{i} \text { turn out to be the principal moments } I_{i}, \text { but we'll leave them as } \lambda_{i}\) for now, we need first to establish that they’re real.)

Now since \(I\) is real and symmetric, \(\mathbf{I}^{\mathrm{T}}=\mathbf{I}\) the eigenvalues are real. To prove this, take the equation for \(\mathbf{e}_{1}\) above and premultiply by the row vector \(\mathbf{e}_{1}^{* \mathrm{T}}\), the complex conjugate transpose:

\begin{equation}
\mathbf{e}_{\mathbf{1}}^{* \mathrm{T}} \mathbf{I} \mathbf{e}_{\mathbf{1}}=\lambda_{1} \mathbf{e}_{\mathbf{1}}^{\mathbf{*} \mathbf{T}} \mathbf{e}_{\mathbf{1}}
\end{equation}

The left hand side is a real number: this can be established by taking its complex conjugate. The fact that the tensor is real and symmetric is crucial!

\begin{equation}
\left(e_{1 i}^{*} I_{i j} e_{1 j}\right)^{*}=e_{1 i} I_{i j}^{*} e_{1 j}^{*}=e_{1 i} I_{j i} e_{1 j}^{*}=e_{1 j}^{*} I_{j i} e_{1 i}
\end{equation}

And since these are dummy suffixes, we can swap the i ’s and j ’s to establish that this number is identical to its complex conjugate, hence it’s real. Clearly, \(\mathrm{e}_{1}^{* \mathrm{T}} \mathrm{e}_{1}\) is real and positive, so the eigenvalues are real.

(Note: a real symmetric matrix does not necessarily have positive roots: for example \(\left(\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right)\)

Taking the eigenvalues to be distinct (the degenerate case is easy to deal with) the eigenvectors are orthogonal, by the standard proof, for this matrix left eigenvectors (rows) have the same eigenvalues as their transpose, so

\begin{equation}
\mathbf{e}_{2}^{\mathrm{T}} \mathbf{I} \mathbf{e}_{\mathbf{1}}=\lambda_{\mathbf{2}} \mathbf{e}_{\mathbf{2}}^{\mathrm{T}} \mathbf{e}_{\mathbf{1}}=\lambda_{\mathbf{1}} \mathbf{e}_{\mathbf{2}}^{\mathrm{T}} \mathbf{e}_{\mathbf{1}}
\end{equation}

and \(\mathbf{e}_{2}^{\mathrm{T}} \mathbf{e}_{1}=0\).

The diagonalizing matrix is made up of these eigenvectors (assumed normalized):

\begin{equation}
\mathbf{R}=\left(\begin{array}{c}
\mathbf{e}_{\mathbf{1}}^{\mathbf{T}} \\
\mathbf{e}_{\mathbf{2}}^{\mathbf{T}} \\
\mathbf{e}_{\mathbf{3}}^{\mathbf{T}}
\end{array}\right)
\end{equation}

a column of row vectors.

To check that this is indeed a rotation vector, from one orthogonal set of axes to another, notice first that its transpose \(\mathbf{R}^{\mathrm{T}}=\left(\begin{array}{lll}
\mathbf{e}_{1} & \mathbf{e}_{2} & \mathbf{e}_{3}
\end{array}\right)\) is its inverse (as required for a rotation), since the eigenvectors form an orthonormal set.

Now apply this \(R\) to an arbitrary vector:

\begin{equation}
\mathbf{x}^{\prime}=\mathbf{R} \mathbf{x}=\left(\begin{array}{c}
\mathbf{e}_{\mathbf{1}}^{\mathbf{T}} \\
\mathbf{e}_{\mathbf{2}}^{\mathbf{T}} \\
\mathbf{e}_{\mathbf{3}}^{\mathbf{T}}
\end{array}\right) \mathbf{x}=\left(\begin{array}{c}
\mathbf{e}_{\mathbf{1}}^{\mathbf{T}} \mathbf{x} \\
\mathbf{e}_{\mathbf{2}}^{\mathbf{T}} \mathbf{x} \\
\mathbf{e}_{\mathbf{3}}^{\mathbf{T}} \mathbf{x}
\end{array}\right)
\end{equation}

In vector language, these elements are just \(\begin{equation}
\vec{e}_{1} \cdot \vec{x}, \text { etc., so } x_{1}^{\prime}=\vec{e}_{1} \cdot \vec{x}
\end{equation}\), the primed components are just the components of \(\vec{x}\) along the eigenvector axes, so the operator \(R\) gives the vector components relative to these axes, meaning it has rotated the coordinate system to one with the principal axes of the body are now the \(x_{1}, x_{2}, x_{3}\) axes.

We can confirm this by applying the rotation to the inertia tensor itself:

\begin{equation}
\mathbf{I}^{\prime}=\mathbf{R} \mathbf{T} \mathbf{R}^{\mathbf{T}}=\left(\begin{array}{c}
\mathbf{e}_{1}^{\mathrm{T}} \\
\mathbf{e}_{2}^{\mathrm{T}} \\
\mathbf{e}_{3}^{\mathrm{T}}
\end{array}\right) \mathbf{I}\left(\begin{array}{lll}
\mathbf{e}_{1} & \mathbf{e}_{2} & \mathbf{e}_{3}
\end{array}\right)=\left(\begin{array}{c}
\mathbf{e}_{1}^{\mathrm{T}} \\
\mathbf{e}_{2}^{\mathrm{T}} \\
\mathbf{e}_{3}^{\mathrm{T}}
\end{array}\right)\left(\begin{array}{lll}
\lambda_{1} \mathbf{e}_{1} & \lambda_{2} \mathbf{e}_{2} & \lambda_{3} \mathbf{e}_{3}
\end{array}\right)=\left(\begin{array}{ccc}
\lambda_{1} & 0 & 0 \\
0 & \lambda_{2} & 0 \\
0 & 0 & \lambda_{3}
\end{array}\right)
\end{equation}

Let’s examine the contribution of one particle to the inertia tensor:

\begin{equation}
\mathbf{I}_{\mathbf{1}}=m\left[\left(\mathbf{x}^{\mathbf{T}} \mathbf{x}\right) \mathbf{1}-\mathbf{x} \mathbf{x}^{\mathbf{T}}\right]
\end{equation}

Note that \(x\) here represents the column vector of the particle coordinates, in other words, it’s just \(\vec{r} !\) And, watch out for the inertia tensor I and the unit tensor 1.

They transform as \(\mathbf{x}^{\prime}=\mathbf{R} \mathbf{x}\), note that this agrees with \(\mathbf{I}^{\prime}=\mathbf{R} \mathbf{I} \mathbf{R}^{\mathbf{T}}\). Since under rotation the length of a vector is invariant \(\mathbf{x}^{\prime \mathbf{T}} \mathbf{x}^{\prime}=\mathbf{x}^{\mathbf{T}} \mathbf{x}, \text { and } \mathbf{R} \mathbf{x} \mathbf{x}^{\mathbf{T}} \mathbf{R}^{\mathbf{T}}=\mathbf{x}^{\prime} \mathbf{x}^{\mathbf{\prime}} \mathbf{T}\) it is evident that in the rotated frame (the eigenvector frame) the single particle contributes to the diagonal elements

\begin{equation}
m\left[\left(x_{2}^{2}+x_{3}^{2}\right),\left(x_{3}^{2}+x_{1}^{2}\right),\left(x_{1}^{2}+x_{2}^{2}\right)\right]
\end{equation}

. We’ve dropped the primes, since we’ll be working in this natural frame from now on.