Moment of inertia
Flywheels have large moments of inertia to smooth out rotational motion.
Common symbols
I
SI unitkg m2
Other units
lbf·ft·s2
Derivations from
other quantities
${\displaystyle I={\frac {L}{\omega ))}$
DimensionM L2
Tightrope walkers use the moment of inertia of a long rod for balance as they walk the rope. Samuel Dixon crossing the Niagara River in 1890.
To improve their maneuverability, war planes are designed to have smaller moments of inertia compared to commercial planes.

The moment of inertia, otherwise known as the mass moment of inertia, angular mass, second moment of mass, or most accurately, rotational inertia, of a rigid body is a quantity that determines the torque needed for a desired angular acceleration about a rotational axis, akin to how mass determines the force needed for a desired acceleration. It depends on the body's mass distribution and the axis chosen, with larger moments requiring more torque to change the body's rate of rotation.

It is an extensive (additive) property: for a point mass the moment of inertia is simply the mass times the square of the perpendicular distance to the axis of rotation. The moment of inertia of a rigid composite system is the sum of the moments of inertia of its component subsystems (all taken about the same axis). Its simplest definition is the second moment of mass with respect to distance from an axis.

For bodies constrained to rotate in a plane, only their moment of inertia about an axis perpendicular to the plane, a scalar value, matters. For bodies free to rotate in three dimensions, their moments can be described by a symmetric 3 × 3 matrix, with a set of mutually perpendicular principal axes for which this matrix is diagonal and torques around the axes act independently of each other.

Introduction

When a body is free to rotate around an axis, torque must be applied to change its angular momentum. The amount of torque needed to cause any given angular acceleration (the rate of change in angular velocity) is proportional to the moment of inertia of the body. Moments of inertia may be expressed in units of kilogram metre squared (kg·m2) in SI units and pound-foot-second squared (lbf·ft·s2) in imperial or US units.

The moment of inertia plays the role in rotational kinetics that mass (inertia) plays in linear kinetics—both characterize the resistance of a body to changes in its motion. The moment of inertia depends on how mass is distributed around an axis of rotation, and will vary depending on the chosen axis. For a point-like mass, the moment of inertia about some axis is given by ${\displaystyle mr^{2))$, where ${\displaystyle r}$ is the distance of the point from the axis, and ${\displaystyle m}$ is the mass. For an extended rigid body, the moment of inertia is just the sum of all the small pieces of mass multiplied by the square of their distances from the axis in rotation. For an extended body of a regular shape and uniform density, this summation sometimes produces a simple expression that depends on the dimensions, shape and total mass of the object.

In 1673 Christiaan Huygens introduced this parameter in his study of the oscillation of a body hanging from a pivot, known as a compound pendulum.[1] The term moment of inertia was introduced by Leonhard Euler in his book Theoria motus corporum solidorum seu rigidorum in 1765,[1][2] and it is incorporated into Euler's second law.

The natural frequency of oscillation of a compound pendulum is obtained from the ratio of the torque imposed by gravity on the mass of the pendulum to the resistance to acceleration defined by the moment of inertia. Comparison of this natural frequency to that of a simple pendulum consisting of a single point of mass provides a mathematical formulation for moment of inertia of an extended body.[3][4]

The moment of inertia also appears in momentum, kinetic energy, and in Newton's laws of motion for a rigid body as a physical parameter that combines its shape and mass. There is an interesting difference in the way moment of inertia appears in planar and spatial movement. Planar movement has a single scalar that defines the moment of inertia, while for spatial movement the same calculations yield a 3 × 3 matrix of moments of inertia, called the inertia matrix or inertia tensor.[5][6]

The moment of inertia of a rotating flywheel is used in a machine to resist variations in applied torque to smooth its rotational output. The moment of inertia of an airplane about its longitudinal, horizontal and vertical axes determine how steering forces on the control surfaces of its wings, elevators and rudder(s) affect the plane's motions in roll, pitch and yaw.

Definition

The moment of inertia is defined as the product of mass of section and the square of the distance between the reference axis and the centroid of the section.

Spinning figure skaters can reduce their moment of inertia by pulling in their arms, allowing them to spin faster due to conservation of angular momentum.
Video of rotating chair experiment, illustrating moment of inertia. When the spinning professor pulls his arms, his moment of inertia decreases; to conserve angular momentum, his angular velocity increases.

The moment of inertia I is also defined as the ratio of the net angular momentum L of a system to its angular velocity ω around a principal axis,[7][8] that is

${\displaystyle I={\frac {L}{\omega )).}$

If the angular momentum of a system is constant, then as the moment of inertia gets smaller, the angular velocity must increase. This occurs when spinning figure skaters pull in their outstretched arms or divers curl their bodies into a tuck position during a dive, to spin faster.[7][8][9][10][11][12][13]

If the shape of the body does not change, then its moment of inertia appears in Newton's law of motion as the ratio of an applied torque τ on a body to the angular acceleration α around a principal axis, that is

${\displaystyle \tau =I\alpha .}$

For a simple pendulum, this definition yields a formula for the moment of inertia I in terms of the mass m of the pendulum and its distance r from the pivot point as,

${\displaystyle I=mr^{2}.}$

Thus, the moment of inertia of the pendulum depends on both the mass m of a body and its geometry, or shape, as defined by the distance r to the axis of rotation.

This simple formula generalizes to define moment of inertia for an arbitrarily shaped body as the sum of all the elemental point masses dm each multiplied by the square of its perpendicular distance r to an axis k. An arbitrary object's moment of inertia thus depends on the spatial distribution of its mass.

In general, given an object of mass m, an effective radius k can be defined, dependent on a particular axis of rotation, with such a value that its moment of inertia around the axis is

${\displaystyle I=mk^{2},}$
where k is known as the radius of gyration around the axis.

Examples

Simple pendulum

Mathematically, the moment of inertia of a simple pendulum is the ratio of the torque due to gravity about the pivot of a pendulum to its angular acceleration about that pivot point. For a simple pendulum this is found to be the product of the mass of the particle ${\displaystyle m}$ with the square of its distance ${\displaystyle r}$ to the pivot, that is

${\displaystyle I=mr^{2}.}$

This can be shown as follows: The force of gravity on the mass of a simple pendulum generates a torque ${\displaystyle {\boldsymbol {\tau ))=\mathbf {r} \times \mathbf {F} }$ around the axis perpendicular to the plane of the pendulum movement. Here ${\displaystyle \mathbf {r} }$ is the distance vector from the torque axis to the pendulum center of mass, and ${\displaystyle \mathbf {F} }$ is the net force on the mass. Associated with this torque is an angular acceleration, ${\displaystyle {\boldsymbol {\alpha ))}$, of the string and mass around this axis. Since the mass is constrained to a circle the tangential acceleration of the mass is ${\displaystyle \mathbf {a} ={\boldsymbol {\alpha ))\times \mathbf {r} }$. Since ${\displaystyle \mathbf {F} =m\mathbf {a} }$ the torque equation becomes:

{\displaystyle {\begin{aligned}{\boldsymbol {\tau ))&=\mathbf {r} \times \mathbf {F} =\mathbf {r} \times (m{\boldsymbol {\alpha ))\times \mathbf {r} )\\&=m\left(\left(\mathbf {r} \cdot \mathbf {r} \right){\boldsymbol {\alpha ))-\left(\mathbf {r} \cdot {\boldsymbol {\alpha ))\right)\mathbf {r} \right)\\&=mr^{2}{\boldsymbol {\alpha ))=I\alpha \mathbf {\hat {k)) ,\end{aligned))}

where ${\displaystyle \mathbf {\hat {k)) }$ is a unit vector perpendicular to the plane of the pendulum. (The second to last step uses the vector triple product expansion with the perpendicularity of ${\displaystyle {\boldsymbol {\alpha ))}$ and ${\displaystyle \mathbf {r} }$.) The quantity ${\displaystyle I=mr^{2))$ is the moment of inertia of this single mass around the pivot point.

The quantity ${\displaystyle I=mr^{2))$ also appears in the angular momentum of a simple pendulum, which is calculated from the velocity ${\displaystyle \mathbf {v} ={\boldsymbol {\omega ))\times \mathbf {r} }$ of the pendulum mass around the pivot, where ${\displaystyle {\boldsymbol {\omega ))}$ is the angular velocity of the mass about the pivot point. This angular momentum is given by

{\displaystyle {\begin{aligned}\mathbf {L} &=\mathbf {r} \times \mathbf {p} =\mathbf {r} \times \left(m{\boldsymbol {\omega ))\times \mathbf {r} \right)\\&=m\left(\left(\mathbf {r} \cdot \mathbf {r} \right){\boldsymbol {\omega ))-\left(\mathbf {r} \cdot {\boldsymbol {\omega ))\right)\mathbf {r} \right)\\&=mr^{2}{\boldsymbol {\omega ))=I\omega \mathbf {\hat {k)) ,\end{aligned))}
using a similar derivation to the previous equation.

Similarly, the kinetic energy of the pendulum mass is defined by the velocity of the pendulum around the pivot to yield

${\displaystyle E_{\text{K))={\frac {1}{2))m\mathbf {v} \cdot \mathbf {v} ={\frac {1}{2))\left(mr^{2}\right)\omega ^{2}={\frac {1}{2))I\omega ^{2}.}$

This shows that the quantity ${\displaystyle I=mr^{2))$ is how mass combines with the shape of a body to define rotational inertia. The moment of inertia of an arbitrarily shaped body is the sum of the values ${\displaystyle mr^{2))$ for all of the elements of mass in the body.

Compound pendulums

Pendulums used in Mendenhall gravimeter apparatus, from 1897 scientific journal. The portable gravimeter developed in 1890 by Thomas C. Mendenhall provided the most accurate relative measurements of the local gravitational field of the Earth.

A compound pendulum is a body formed from an assembly of particles of continuous shape that rotates rigidly around a pivot. Its moment of inertia is the sum of the moments of inertia of each of the particles that it is composed of.[14][15]: 395–396 [16]: 51–53  The natural frequency (${\displaystyle \omega _{\text{n))}$) of a compound pendulum depends on its moment of inertia, ${\displaystyle I_{P))$,

${\displaystyle \omega _{\text{n))={\sqrt {\frac {mgr}{I_{P)))),}$
where ${\displaystyle m}$ is the mass of the object, ${\displaystyle g}$ is local acceleration of gravity, and ${\displaystyle r}$ is the distance from the pivot point to the center of mass of the object. Measuring this frequency of oscillation over small angular displacements provides an effective way of measuring moment of inertia of a body.[17]: 516–517

Thus, to determine the moment of inertia of the body, simply suspend it from a convenient pivot point ${\displaystyle P}$ so that it swings freely in a plane perpendicular to the direction of the desired moment of inertia, then measure its natural frequency or period of oscillation (${\displaystyle t}$), to obtain

${\displaystyle I_{P}={\frac {mgr}{\omega _{\text{n))^{2))}={\frac {mgrt^{2)){4\pi ^{2))},}$
where ${\displaystyle t}$ is the period (duration) of oscillation (usually averaged over multiple periods).

Center of oscillation

A simple pendulum that has the same natural frequency as a compound pendulum defines the length ${\displaystyle L}$ from the pivot to a point called the center of oscillation of the compound pendulum. This point also corresponds to the center of percussion. The length ${\displaystyle L}$ is determined from the formula,

${\displaystyle \omega _{\text{n))={\sqrt {\frac {g}{L))}={\sqrt {\frac {mgr}{I_{P)))),}$
or
${\displaystyle L={\frac {g}{\omega _{\text{n))^{2))}={\frac {I_{P)){mr)).}$

The seconds pendulum, which provides the "tick" and "tock" of a grandfather clock, takes one second to swing from side-to-side. This is a period of two seconds, or a natural frequency of ${\displaystyle \pi \ \mathrm {rad/s} }$ for the pendulum. In this case, the distance to the center of oscillation, ${\displaystyle L}$, can be computed to be

${\displaystyle L={\frac {g}{\omega _{\text{n))^{2))}\approx {\frac {9.81\ \mathrm {m/s^{2)) }{(3.14\ \mathrm {rad/s} )^{2))}\approx 0.99\ \mathrm {m} .}$

Notice that the distance to the center of oscillation of the seconds pendulum must be adjusted to accommodate different values for the local acceleration of gravity. Kater's pendulum is a compound pendulum that uses this property to measure the local acceleration of gravity, and is called a gravimeter.

Measuring moment of inertia

The moment of inertia of a complex system such as a vehicle or airplane around its vertical axis can be measured by suspending the system from three points to form a trifilar pendulum. A trifilar pendulum is a platform supported by three wires designed to oscillate in torsion around its vertical centroidal axis.[18] The period of oscillation of the trifilar pendulum yields the moment of inertia of the system.[19]

Moment of Inertia of Areas

Moment of Inertia of Areas is also known as Second moment of area. These calculations are commonly used in civil engineering for structural design of beams and columns. Cross-sectional areas calculated for vertical moment of the X axis ${\displaystyle I_{xx))$ and horizontal moment of the Y axis ${\displaystyle I_{yy))$.
Height (h) and breadth (b) are the linear measures, except for circles, which are effectively half-breadth derived, ${\displaystyle r}$

Sectional areas moment calculated thus

1. Square: ${\displaystyle I_{xx}=I_{yy}={\frac {b^{4)){12))}$
2. Rectangular: ${\displaystyle I_{xx}={\frac {bh^{3)){12))}$ and; ${\displaystyle I_{yy}={\frac {hb^{3)){12))}$
3. Triangular: ${\displaystyle I_{xx}={\frac {bh^{3)){36))}$
4. Circular: ${\displaystyle I_{xx}=I_{yy}={\frac {1}{4)){\pi }r^{4))$

Motion in a fixed plane

Point mass

Four objects with identical masses and radii racing down a plane while rolling without slipping.
From back to front:
•   spherical shell,
•   solid sphere,
•   cylindrical ring, and
•   solid cylinder.
The time for each object to reach the finishing line depends on their moment of inertia. (OGV version)

The moment of inertia about an axis of a body is calculated by summing ${\displaystyle mr^{2))$ for every particle in the body, where ${\displaystyle r}$ is the perpendicular distance to the specified axis. To see how moment of inertia arises in the study of the movement of an extended body, it is convenient to consider a rigid assembly of point masses. (This equation can be used for axes that are not principal axes provided that it is understood that this does not fully describe the moment of inertia.[21])

Consider the kinetic energy of an assembly of ${\displaystyle N}$ masses ${\displaystyle m_{i))$ that lie at the distances ${\displaystyle r_{i))$ from the pivot point ${\displaystyle P}$, which is the nearest point on the axis of rotation. It is the sum of the kinetic energy of the individual masses,[17]: 516–517 [22]: 1084–1085 [22]: 1296–1300

${\displaystyle E_{\text{K))=\sum _{i=1}^{N}{\frac {1}{2))\,m_{i}\mathbf {v} _{i}\cdot \mathbf {v} _{i}=\sum _{i=1}^{N}{\frac {1}{2))\,m_{i}\left(\omega r_{i}\right)^{2}={\frac {1}{2))\,\omega ^{2}\sum _{i=1}^{N}m_{i}r_{i}^{2}.}$

This shows that the moment of inertia of the body is the sum of each of the ${\displaystyle mr^{2))$ terms, that is

${\displaystyle I_{P}=\sum _{i=1}^{N}m_{i}r_{i}^{2}.}$

Thus, moment of inertia is a physical property that combines the mass and distribution of the particles around the rotation axis. Notice that rotation about different axes of the same body yield different moments of inertia.

The moment of inertia of a continuous body rotating about a specified axis is calculated in the same way, except with infinitely many point particles. Thus the limits of summation are removed, and the sum is written as follows:

${\displaystyle I_{P}=\sum _{i}m_{i}r_{i}^{2))$

Another expression replaces the summation with an integral,

${\displaystyle I_{P}=\iiint _{Q}\rho (x,y,z)\left\|\mathbf {r} \right\|^{2}dV}$

Here, the function ${\displaystyle \rho }$ gives the mass density at each point ${\displaystyle (x,y,z)}$, ${\displaystyle \mathbf {r} }$ is a vector perpendicular to the axis of rotation and extending from a point on the rotation axis to a point ${\displaystyle (x,y,z)}$ in the solid, and the integration is evaluated over the volume ${\displaystyle V}$ of the body ${\displaystyle Q}$. The moment of inertia of a flat surface is similar with the mass density being replaced by its areal mass density with the integral evaluated over its area.

Note on second moment of area: The moment of inertia of a body moving in a plane and the second moment of area of a beam's cross-section are often confused. The moment of inertia of a body with the shape of the cross-section is the second moment of this area about the ${\displaystyle z}$-axis perpendicular to the cross-section, weighted by its density. This is also called the polar moment of the area, and is the sum of the second moments about the ${\displaystyle x}$- and ${\displaystyle y}$-axes.[23] The stresses in a beam are calculated using the second moment of the cross-sectional area around either the ${\displaystyle x}$-axis or ${\displaystyle y}$-axis depending on the load.

Examples

 Main article: List of moments of inertia

The moment of inertia of a compound pendulum constructed from a thin disc mounted at the end of a thin rod that oscillates around a pivot at the other end of the rod, begins with the calculation of the moment of inertia of the thin rod and thin disc about their respective centers of mass.[22]

• The moment of inertia of a thin rod with constant cross-section ${\displaystyle s}$ and density ${\displaystyle \rho }$ and with length ${\displaystyle \ell }$ about a perpendicular axis through its center of mass is determined by integration.[22]: 1301  Align the ${\displaystyle x}$-axis with the rod and locate the origin its center of mass at the center of the rod, then
${\displaystyle I_{C,{\text{rod))}=\iiint _{Q}\rho \,x^{2}\,dV=\int _{-{\frac {\ell }{2))}^{\frac {\ell }{2))\rho \,x^{2}s\,dx=\left.\rho s{\frac {x^{3)){3))\right|_{-{\frac {\ell }{2))}^{\frac {\ell }{2))={\frac {\rho s}{3))\left({\frac {\ell ^{3)){8))+{\frac {\ell ^{3)){8))\right)={\frac {m\ell ^{2)){12)),}$
where ${\displaystyle m=\rho s\ell }$ is the mass of the rod.
• The moment of inertia of a thin disc of constant thickness ${\displaystyle s}$, radius ${\displaystyle R}$, and density ${\displaystyle \rho }$ about an axis through its center and perpendicular to its face (parallel to its axis of rotational symmetry) is determined by integration.[22]: 1301 [failed verification] Align the ${\displaystyle z}$-axis with the axis of the disc and define a volume element as ${\displaystyle dV=sr\,dr\,d\theta }$, then
${\displaystyle I_{C,{\text{disc))}=\iiint _{Q}\rho \,r^{2}\,dV=\int _{0}^{2\pi }\int _{0}^{R}\rho r^{2}sr\,dr\,d\theta =2\pi \rho s{\frac {R^{4)){4))={\frac {1}{2))mR^{2},}$
where ${\displaystyle m=\pi R^{2}\rho s}$ is its mass.
• The moment of inertia of the compound pendulum is now obtained by adding the moment of inertia of the rod and the disc around the pivot point ${\displaystyle P}$ as,
${\displaystyle I_{P}=I_{C,{\text{rod))}+M_{\text{rod))\left({\frac {L}{2))\right)^{2}+I_{C,{\text{disc))}+M_{\text{disc))(L+R)^{2},}$
where ${\displaystyle L}$ is the length of the pendulum. Notice that the parallel axis theorem is used to shift the moment of inertia from the center of mass to the pivot point of the pendulum.

A list of moments of inertia formulas for standard body shapes provides a way to obtain the moment of inertia of a complex body as an assembly of simpler shaped bodies. The parallel axis theorem is used to shift the reference point of the individual bodies to the reference point of the assembly.

As one more example, consider the moment of inertia of a solid sphere of constant density about an axis through its center of mass. This is determined by summing the moments of inertia of the thin discs that can form the sphere whose centers are along the axis chosen for consideration. If the surface of the ball is defined by the equation[22]: 1301

${\displaystyle x^{2}+y^{2}+z^{2}=R^{2},}$

then the square of the radius ${\displaystyle r}$ of the disc at the cross-section ${\displaystyle z}$ along the ${\displaystyle z}$-axis is

${\displaystyle r(z)^{2}=x^{2}+y^{2}=R^{2}-z^{2}.}$

Therefore, the moment of inertia of the ball is the sum of the moments of inertia of the discs along the ${\displaystyle z}$-axis,

{\displaystyle {\begin{aligned}I_{C,{\text{ball))}&=\int _{-R}^{R}{\frac {\pi \rho }{2))r(z)^{4}\,dz=\int _{-R}^{R}{\frac {\pi \rho }{2))\left(R^{2}-z^{2}\right)^{2}\,dz\\&={\frac {\pi \rho }{2))\left[R^{4}z-{\frac {2}{3))R^{2}z^{3}+{\frac {1}{5))z^{5}\right]_{-R}^{R}\\&=\pi \rho \left(1-{\frac {2}{3))+{\frac {1}{5))\right)R^{5}\\&={\frac {2}{5))mR^{2},\end{aligned))}
where ${\textstyle m={\frac {4}{3))\pi R^{3}\rho }$ is the mass of the sphere.

Rigid body

The cylinders with higher moment of inertia roll down a slope with a smaller acceleration, as more of their potential energy needs to be converted into the rotational kinetic energy.

If a mechanical system is constrained to move parallel to a fixed plane, then the rotation of a body in the system occurs around an axis ${\displaystyle \mathbf {\hat {k)) }$ perpendicular to this plane. In this case, the moment of inertia of the mass in this system is a scalar known as the polar moment of inertia. The definition of the polar moment of inertia can be obtained by considering momentum, kinetic energy and Newton's laws for the planar movement of a rigid system of particles.[14][17][24][25]

If a system of ${\displaystyle n}$ particles, ${\displaystyle P_{i},i=1,\dots ,n}$, are assembled into a rigid body, then the momentum of the system can be written in terms of positions relative to a reference point ${\displaystyle \mathbf {R} }$, and absolute velocities ${\displaystyle \mathbf {v} _{i))$:

{\displaystyle {\begin{aligned}\Delta \mathbf {r} _{i}&=\mathbf {r} _{i}-\mathbf {R} ,\\\mathbf {v} _{i}&={\boldsymbol {\omega ))\times \left(\mathbf {r} _{i}-\mathbf {R} \right)+\mathbf {V} ={\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} ,\end{aligned))}
where ${\displaystyle {\boldsymbol {\omega ))}$ is the angular velocity of the system and ${\displaystyle \mathbf {V} }$ is the velocity of ${\displaystyle \mathbf {R} }$.

For planar movement the angular velocity vector is directed along the unit vector ${\displaystyle \mathbf {k} }$ which is perpendicular to the plane of movement. Introduce the unit vectors ${\displaystyle \mathbf {e} _{i))$ from the reference point ${\displaystyle \mathbf {R} }$ to a point ${\displaystyle \mathbf {r} _{i))$, and the unit vector ${\displaystyle \mathbf {\hat {t)) _{i}=\mathbf {\hat {k)) \times \mathbf {\hat {e)) _{i))$, so

{\displaystyle {\begin{aligned}\mathbf {\hat {e)) _{i}&={\frac {\Delta \mathbf {r} _{i)){\Delta r_{i))},\quad \mathbf {\hat {k)) ={\frac {\boldsymbol {\omega )){\omega )),\quad \mathbf {\hat {t)) _{i}=\mathbf {\hat {k)) \times \mathbf {\hat {e)) _{i},\\\mathbf {v} _{i}&={\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} =\omega \mathbf {\hat {k)) \times \Delta r_{i}\mathbf {\hat {e)) _{i}+\mathbf {V} =\omega \,\Delta r_{i}\mathbf {\hat {t)) _{i}+\mathbf {V} \end{aligned))}

This defines the relative position vector and the velocity vector for the rigid system of the particles moving in a plane.

Note on the cross product: When a body moves parallel to a ground plane, the trajectories of all the points in the body lie in planes parallel to this ground plane. This means that any rotation that the body undergoes must be around an axis perpendicular to this plane. Planar movement is often presented as projected onto this ground plane so that the axis of rotation appears as a point. In this case, the angular velocity and angular acceleration of the body are scalars and the fact that they are vectors along the rotation axis is ignored. This is usually preferred for introductions to the topic. But in the case of moment of inertia, the combination of mass and geometry benefits from the geometric properties of the cross product. For this reason, in this section on planar movement the angular velocity and accelerations of the body are vectors perpendicular to the ground plane, and the cross product operations are the same as used for the study of spatial rigid body movement.

Angular momentum

The angular momentum vector for the planar movement of a rigid system of particles is given by[14][17]

{\displaystyle {\begin{aligned}\mathbf {L} &=\sum _{i=1}^{n}m_{i}\Delta \mathbf {r} _{i}\times \mathbf {v} _{i}\\&=\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {e)) _{i}\times \left(\omega \,\Delta r_{i}\mathbf {\hat {t)) _{i}+\mathbf {V} \right)\\&=\left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}^{2}\right)\omega \mathbf {\hat {k)) +\left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {e)) _{i}\right)\times \mathbf {V} .\end{aligned))}

Use the center of mass ${\displaystyle \mathbf {C} }$ as the reference point so

{\displaystyle {\begin{aligned}\Delta r_{i}\mathbf {\hat {e)) _{i}&=\mathbf {r} _{i}-\mathbf {C} ,\\\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {e)) _{i}&=0,\end{aligned))}

and define the moment of inertia relative to the center of mass ${\displaystyle I_{\mathbf {C} ))$ as

${\displaystyle I_{\mathbf {C} }=\sum _{i}m_{i}\,\Delta r_{i}^{2},}$

then the equation for angular momentum simplifies to[22]: 1028

${\displaystyle \mathbf {L} =I_{\mathbf {C} }\omega \mathbf {\hat {k)) .}$

The moment of inertia ${\displaystyle I_{\mathbf {C} ))$ about an axis perpendicular to the movement of the rigid system and through the center of mass is known as the polar moment of inertia. Specifically, it is the second moment of mass with respect to the orthogonal distance from an axis (or pole).

For a given amount of angular momentum, a decrease in the moment of inertia results in an increase in the angular velocity. Figure skaters can change their moment of inertia by pulling in their arms. Thus, the angular velocity achieved by a skater with outstretched arms results in a greater angular velocity when the arms are pulled in, because of the reduced moment of inertia. A figure skater is not, however, a rigid body.

Kinetic energy

This 1906 rotary shear uses the moment of inertia of two flywheels to store kinetic energy which when released is used to cut metal stock (International Library of Technology, 1906).

The kinetic energy of a rigid system of particles moving in the plane is given by[14][17]

{\displaystyle {\begin{aligned}E_{\text{K))&={\frac {1}{2))\sum _{i=1}^{n}m_{i}\mathbf {v} _{i}\cdot \mathbf {v} _{i},\\&={\frac {1}{2))\sum _{i=1}^{n}m_{i}\left(\omega \,\Delta r_{i}\mathbf {\hat {t)) _{i}+\mathbf {V} \right)\cdot \left(\omega \,\Delta r_{i}\mathbf {\hat {t)) _{i}+\mathbf {V} \right),\\&={\frac {1}{2))\omega ^{2}\left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}^{2}\mathbf {\hat {t)) _{i}\cdot \mathbf {\hat {t)) _{i}\right)+\omega \mathbf {V} \cdot \left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {t)) _{i}\right)+{\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\right)\mathbf {V} \cdot \mathbf {V} .\end{aligned))}

Let the reference point be the center of mass ${\displaystyle \mathbf {C} }$ of the system so the second term becomes zero, and introduce the moment of inertia ${\displaystyle I_{\mathbf {C} ))$ so the kinetic energy is given by[22]: 1084

${\displaystyle E_{\text{K))={\frac {1}{2))I_{\mathbf {C} }\omega ^{2}+{\frac {1}{2))M\mathbf {V} \cdot \mathbf {V} .}$

The moment of inertia ${\displaystyle I_{\mathbf {C} ))$ is the polar moment of inertia of the body.

Newton's laws

A 1920s John Deere tractor with the spoked flywheel on the engine. The large moment of inertia of the flywheel smooths the operation of the tractor.

Newton's laws for a rigid system of ${\displaystyle n}$ particles, ${\displaystyle P_{i},i=1,\dots ,n}$, can be written in terms of a resultant force and torque at a reference point ${\displaystyle \mathbf {R} }$, to yield[14][17]

{\displaystyle {\begin{aligned}\mathbf {F} &=\sum _{i=1}^{n}m_{i}\mathbf {A} _{i},\\{\boldsymbol {\tau ))&=\sum _{i=1}^{n}\Delta \mathbf {r} _{i}\times m_{i}\mathbf {A} _{i},\end{aligned))}
where ${\displaystyle \mathbf {r} _{i))$ denotes the trajectory of each particle.

The kinematics of a rigid body yields the formula for the acceleration of the particle ${\displaystyle P_{i))$ in terms of the position ${\displaystyle \mathbf {R} }$ and acceleration ${\displaystyle \mathbf {A} }$ of the reference particle as well as the angular velocity vector ${\displaystyle {\boldsymbol {\omega ))}$ and angular acceleration vector ${\displaystyle {\boldsymbol {\alpha ))}$ of the rigid system of particles as,

${\displaystyle \mathbf {A} _{i}={\boldsymbol {\alpha ))\times \Delta \mathbf {r} _{i}+{\boldsymbol {\omega ))\times {\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {A} .}$

For systems that are constrained to planar movement, the angular velocity and angular acceleration vectors are directed along ${\displaystyle \mathbf {\hat {k)) }$ perpendicular to the plane of movement, which simplifies this acceleration equation. In this case, the acceleration vectors can be simplified by introducing the unit vectors ${\displaystyle \mathbf {\hat {e)) _{i))$ from the reference point ${\displaystyle \mathbf {R} }$ to a point ${\displaystyle \mathbf {r} _{i))$ and the unit vectors ${\displaystyle \mathbf {\hat {t)) _{i}=\mathbf {\hat {k)) \times \mathbf {\hat {e)) _{i))$, so

{\displaystyle {\begin{aligned}\mathbf {A} _{i}&=\alpha \mathbf {\hat {k)) \times \Delta r_{i}\mathbf {\hat {e)) _{i}-\omega \mathbf {\hat {k)) \times \omega \mathbf {\hat {k)) \times \Delta r_{i}\mathbf {\hat {e)) _{i}+\mathbf {A} \\&=\alpha \Delta r_{i}\mathbf {\hat {t)) _{i}-\omega ^{2}\Delta r_{i}\mathbf {\hat {e)) _{i}+\mathbf {A} .\end{aligned))}

This yields the resultant torque on the system as

{\displaystyle {\begin{aligned}{\boldsymbol {\tau ))&=\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {e)) _{i}\times \left(\alpha \Delta r_{i}\mathbf {\hat {t)) _{i}-\omega ^{2}\Delta r_{i}\mathbf {\hat {e)) _{i}+\mathbf {A} \right)\\&=\left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}^{2}\right)\alpha \mathbf {\hat {k)) +\left(\sum _{i=1}^{n}m_{i}\,\Delta r_{i}\mathbf {\hat {e)) _{i}\right)\times \mathbf {A} ,\end{aligned))}

where ${\displaystyle \mathbf {\hat {e)) _{i}\times \mathbf {\hat {e)) _{i}=\mathbf {0} }$, and ${\displaystyle \mathbf {\hat {e)) _{i}\times \mathbf {\hat {t)) _{i}=\mathbf {\hat {k)) }$ is the unit vector perpendicular to the plane for all of the particles ${\displaystyle P_{i))$.

Use the center of mass ${\displaystyle \mathbf {C} }$ as the reference point and define the moment of inertia relative to the center of mass ${\displaystyle I_{\mathbf {C} ))$, then the equation for the resultant torque simplifies to[22]: 1029

${\displaystyle {\boldsymbol {\tau ))=I_{\mathbf {C} }\alpha \mathbf {\hat {k)) .}$

Motion in space of a rigid body, and the inertia matrix

The scalar moments of inertia appear as elements in a matrix when a system of particles is assembled into a rigid body that moves in three-dimensional space. This inertia matrix appears in the calculation of the angular momentum, kinetic energy and resultant torque of the rigid system of particles.[3][4][5][6][26]

 For analysis of a spinning top, see Precession § Classical (Newtonian), and Euler's equations (rigid body dynamics).

Let the system of ${\displaystyle n}$ particles, ${\displaystyle P_{i},i=1,\dots ,n}$ be located at the coordinates ${\displaystyle \mathbf {r} _{i))$ with velocities ${\displaystyle \mathbf {v} _{i))$ relative to a fixed reference frame. For a (possibly moving) reference point ${\displaystyle \mathbf {R} }$, the relative positions are

${\displaystyle \Delta \mathbf {r} _{i}=\mathbf {r} _{i}-\mathbf {R} }$
and the (absolute) velocities are
${\displaystyle \mathbf {v} _{i}={\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} _{\mathbf {R} ))$
where ${\displaystyle {\boldsymbol {\omega ))}$ is the angular velocity of the system, and ${\displaystyle \mathbf {V_{R)) }$ is the velocity of ${\displaystyle \mathbf {R} }$.

Angular momentum

Note that the cross product can be equivalently written as matrix multiplication by combining the first operand and the operator into a skew-symmetric matrix, ${\displaystyle \left[\mathbf {b} \right]}$, constructed from the components of ${\displaystyle \mathbf {b} =(b_{x},b_{y},b_{z})}$:

{\displaystyle {\begin{aligned}\mathbf {b} \times \mathbf {y} &\equiv \left[\mathbf {b} \right]\mathbf {y} \\\left[\mathbf {b} \right]&\equiv {\begin{bmatrix}0&-b_{z}&b_{y}\\b_{z}&0&-b_{x}\\-b_{y}&b_{x}&0\end{bmatrix)).\end{aligned))}

The inertia matrix is constructed by considering the angular momentum, with the reference point ${\displaystyle \mathbf {R} }$ of the body chosen to be the center of mass ${\displaystyle \mathbf {C} }$:[3][6]

{\displaystyle {\begin{aligned}\mathbf {L} &=\sum _{i=1}^{n}m_{i}\,\Delta \mathbf {r} _{i}\times \mathbf {v} _{i}\\&=\sum _{i=1}^{n}m_{i}\,\Delta \mathbf {r} _{i}\times \left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} _{\mathbf {R} }\right)\\&=\left(-\sum _{i=1}^{n}m_{i}\,\Delta \mathbf {r} _{i}\times \left(\Delta \mathbf {r} _{i}\times {\boldsymbol {\omega ))\right)\right)+\left(\sum _{i=1}^{n}m_{i}\,\Delta \mathbf {r} _{i}\times \mathbf {V} _{\mathbf {R} }\right),\end{aligned))}
where the terms containing ${\displaystyle \mathbf {V_{R)) }$ (${\displaystyle =\mathbf {C} }$) sum to zero by the definition of center of mass.

Then, the skew-symmetric matrix ${\displaystyle [\Delta \mathbf {r} _{i}]}$ obtained from the relative position vector ${\displaystyle \Delta \mathbf {r} _{i}=\mathbf {r} _{i}-\mathbf {C} }$, can be used to define,

${\displaystyle \mathbf {L} =\left(-\sum _{i=1}^{n}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right){\boldsymbol {\omega ))=\mathbf {I} _{\mathbf {C} }{\boldsymbol {\omega )),}$
where ${\displaystyle \mathbf {I_{C)) }$ defined by
${\displaystyle \mathbf {I} _{\mathbf {C} }=-\sum _{i=1}^{n}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2},}$
is the symmetric inertia matrix of the rigid system of particles measured relative to the center of mass ${\displaystyle \mathbf {C} }$.

Kinetic energy

The kinetic energy of a rigid system of particles can be formulated in terms of the center of mass and a matrix of mass moments of inertia of the system. Let the system of ${\displaystyle n}$ particles ${\displaystyle P_{i},i=1,\dots ,n}$ be located at the coordinates ${\displaystyle \mathbf {r} _{i))$ with velocities ${\displaystyle \mathbf {v} _{i))$, then the kinetic energy is[3][6]

${\displaystyle E_{\text{K))={\frac {1}{2))\sum _{i=1}^{n}m_{i}\mathbf {v} _{i}\cdot \mathbf {v} _{i}={\frac {1}{2))\sum _{i=1}^{n}m_{i}\left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} _{\mathbf {C} }\right)\cdot \left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}+\mathbf {V} _{\mathbf {C} }\right),}$
where ${\displaystyle \Delta \mathbf {r} _{i}=\mathbf {r} _{i}-\mathbf {C} }$ is the position vector of a particle relative to the center of mass.

This equation expands to yield three terms

${\displaystyle E_{\text{K))={\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}\right)\cdot \left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}\right)\right)+\left(\sum _{i=1}^{n}m_{i}\mathbf {V} _{\mathbf {C} }\cdot \left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}\right)\right)+{\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\mathbf {V} _{\mathbf {C} }\cdot \mathbf {V} _{\mathbf {C} }\right).}$

The second term in this equation is zero because ${\displaystyle \mathbf {C} }$ is the center of mass. Introduce the skew-symmetric matrix ${\displaystyle [\Delta \mathbf {r} _{i}]}$ so the kinetic energy becomes

{\displaystyle {\begin{aligned}E_{\text{K))&={\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\left(\left[\Delta \mathbf {r} _{i}\right]{\boldsymbol {\omega ))\right)\cdot \left(\left[\Delta \mathbf {r} _{i}\right]{\boldsymbol {\omega ))\right)\right)+{\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\right)\mathbf {V} _{\mathbf {C} }\cdot \mathbf {V} _{\mathbf {C} }\\&={\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\left({\boldsymbol {\omega ))^{\mathsf {T))\left[\Delta \mathbf {r} _{i}\right]^{\mathsf {T))\left[\Delta \mathbf {r} _{i}\right]{\boldsymbol {\omega ))\right)\right)+{\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\right)\mathbf {V} _{\mathbf {C} }\cdot \mathbf {V} _{\mathbf {C} }\\&={\frac {1}{2)){\boldsymbol {\omega ))\cdot \left(-\sum _{i=1}^{n}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right){\boldsymbol {\omega ))+{\frac {1}{2))\left(\sum _{i=1}^{n}m_{i}\right)\mathbf {V} _{\mathbf {C} }\cdot \mathbf {V} _{\mathbf {C} }.\end{aligned))}

Thus, the kinetic energy of the rigid system of particles is given by

${\displaystyle E_{\text{K))={\frac {1}{2)){\boldsymbol {\omega ))\cdot \mathbf {I} _{\mathbf {C} }{\boldsymbol {\omega ))+{\frac {1}{2))M\mathbf {V} _{\mathbf {C} }^{2}.}$
where ${\displaystyle \mathbf {I_{C)) }$ is the inertia matrix relative to the center of mass and ${\displaystyle M}$ is the total mass.

Resultant torque

The inertia matrix appears in the application of Newton's second law to a rigid assembly of particles. The resultant torque on this system is,[3][6]

${\displaystyle {\boldsymbol {\tau ))=\sum _{i=1}^{n}\left(\mathbf {r_{i)) -\mathbf {R} \right)\times m_{i}\mathbf {a} _{i},}$
where ${\displaystyle \mathbf {a} _{i))$ is the acceleration of the particle ${\displaystyle P_{i))$. The kinematics of a rigid body yields the formula for the acceleration of the particle ${\displaystyle P_{i))$ in terms of the position ${\displaystyle \mathbf {R} }$ and acceleration ${\displaystyle \mathbf {A} _{\mathbf {R} ))$ of the reference point, as well as the angular velocity vector ${\displaystyle {\boldsymbol {\omega ))}$ and angular acceleration vector ${\displaystyle {\boldsymbol {\alpha ))}$ of the rigid system as,
${\displaystyle \mathbf {a} _{i}={\boldsymbol {\alpha ))\times \left(\mathbf {r} _{i}-\mathbf {R} \right)+{\boldsymbol {\omega ))\times \left({\boldsymbol {\omega ))\times \left(\mathbf {r} _{i}-\mathbf {R} \right)\right)+\mathbf {A} _{\mathbf {R} }.}$

Use the center of mass ${\displaystyle \mathbf {C} }$ as the reference point, and introduce the skew-symmetric matrix ${\displaystyle \left[\Delta \mathbf {r} _{i}\right]=\left[\mathbf {r} _{i}-\mathbf {C} \right]}$ to represent the cross product ${\displaystyle (\mathbf {r} _{i}-\mathbf {C} )\times }$, to obtain

${\displaystyle {\boldsymbol {\tau ))=\left(-\sum _{i=1}^{n}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right){\boldsymbol {\alpha ))+{\boldsymbol {\omega ))\times \left(-\sum _{i=1}^{n}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right){\boldsymbol {\omega ))}$

The calculation uses the identity

${\displaystyle \Delta \mathbf {r} _{i}\times \left({\boldsymbol {\omega ))\times \left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}\right)\right)+{\boldsymbol {\omega ))\times \left(\left({\boldsymbol {\omega ))\times \Delta \mathbf {r} _{i}\right)\times \Delta \mathbf {r} _{i}\right)=0,}$
obtained from the Jacobi identity for the triple cross product as shown in the proof below:

Proof

{\displaystyle {\begin{aligned}{\boldsymbol {\tau ))&=\sum _{i=1}^{n}(\mathbf {r_{i)) -\mathbf {R} )\times (m_{i}\mathbf {a} _{i})\\&=\sum _{i=1}^{n}{\boldsymbol {\Delta ))\mathbf {r} _{i}\times (m_{i}\mathbf {a} _{i})\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times \mathbf {a} _{i}]\;\ldots {\text{ cross-product scalar multiplication))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times (\mathbf {a} _((\text{tangential)),i}+\mathbf {a} _((\text{centripetal)),i}+\mathbf {A} _{\mathbf {R} })]\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times (\mathbf {a} _((\text{tangential)),i}+\mathbf {a} _((\text{centripetal)),i}+0)]\\&\;\;\;\;\;\ldots \;\mathbf {R} {\text{ is either at rest or moving at a constant velocity but not accelerated, or ))\\&\;\;\;\;\;\;\;\;\;\;\;{\text{the origin of the fixed (world) coordinate reference system is placed at the center of mass ))\mathbf {C} \\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times \mathbf {a} _((\text{tangential)),i}+{\boldsymbol {\Delta ))\mathbf {r} _{i}\times \mathbf {a} _((\text{centripetal)),i}]\;\ldots {\text{ cross-product distributivity over addition))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times \mathbf {v} _((\text{tangential)),i})]\\{\boldsymbol {\tau ))&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))]\\\end{aligned))}

Then, the following Jacobi identity is used on the last term:

{\displaystyle {\begin{aligned}0&={\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))+{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))\\&={\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))+{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times -({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\;\ldots {\text{ cross-product anticommutativity))\\&={\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))+{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+-[({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})]\;\ldots {\text{ cross-product scalar multiplication))\\&={\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))+{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+-[0]\;\ldots {\text{ self cross-product))\\0&={\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))+{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\end{aligned))}

The result of applying Jacobi identity can then be continued as follows:

{\displaystyle {\begin{aligned}{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))&=-[{\boldsymbol {\omega ))\times (({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\times {\boldsymbol {\Delta ))\mathbf {r} _{i})]\\&=-[({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\omega ))\cdot ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))]\;\ldots {\text{ vector triple product))\\&=-[({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot ({\boldsymbol {\omega ))\times {\boldsymbol {\omega ))))]\;\ldots {\text{ scalar triple product))\\&=-[({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot (0))]\;\ldots {\text{ self cross-product))\\&=-[({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})]\\&=-[{\boldsymbol {\omega ))\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i}))]\;\ldots {\text{ cross-product scalar multiplication))\\&={\boldsymbol {\omega ))\times -({\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\omega ))\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i}))\;\ldots {\text{ cross-product scalar multiplication))\\{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))&={\boldsymbol {\omega ))\times -({\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega ))))\;\ldots {\text{ dot-product commutativity))\\\end{aligned))}

The final result can then be substituted to the main proof as follows:

{\displaystyle {\begin{aligned}{\boldsymbol {\tau ))&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i}))]\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times -({\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega ))))]\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \{0-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}]\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \{[{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})]-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}]\;\ldots \;{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})=0\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \{[{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))]-{\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})\}]\;\ldots {\text{ addition associativity))\\\end{aligned))}
{\displaystyle {\begin{aligned}&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \((\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}-{\boldsymbol {\omega ))\times {\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})]\;\ldots {\text{ cross-product distributivity over addition))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \((\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}-({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})({\boldsymbol {\omega ))\times {\boldsymbol {\omega )))]\;\ldots {\text{ cross-product scalar multiplication))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \((\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}-({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})(0)]\;\ldots {\text{ self cross-product))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \((\boldsymbol {\omega ))({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\Delta ))\mathbf {r} _{i})-{\boldsymbol {\Delta ))\mathbf {r} _{i}({\boldsymbol {\Delta ))\mathbf {r} _{i}\cdot {\boldsymbol {\omega )))\}]\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\alpha ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})+{\boldsymbol {\omega ))\times \((\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\omega ))\times {\boldsymbol {\Delta ))\mathbf {r} _{i})\}]\;\ldots {\text{ vector triple product))\\&=\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times -({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\alpha )))+{\boldsymbol {\omega ))\times \((\boldsymbol {\Delta ))\mathbf {r} _{i}\times -({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))\}]\;\ldots {\text{ cross-product anticommutativity))\\&=-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\alpha )))+{\boldsymbol {\omega ))\times \((\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))\}]\;\ldots {\text{ cross-product scalar multiplication))\\&=-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\alpha )))]+-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\omega ))\times \((\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))\}]\;\ldots {\text{ summation distributivity))\\{\boldsymbol {\tau ))&=-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\alpha )))]+{\boldsymbol {\omega ))\times -\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))]\;\ldots \;{\boldsymbol {\omega )){\text{ is not characteristic of particle ))P_{i}\end{aligned))}

Notice that for any vector ${\displaystyle \mathbf {u} }$, the following holds:

{\displaystyle {\begin{aligned}-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times \mathbf {u} )]&=-\sum _{i=1}^{n}m_{i}\left({\begin{bmatrix}0&-\Delta r_{3,i}&\Delta r_{2,i}\\\Delta r_{3,i}&0&-\Delta r_{1,i}\\-\Delta r_{2,i}&\Delta r_{1,i}&0\end{bmatrix))\left({\begin{bmatrix}0&-\Delta r_{3,i}&\Delta r_{2,i}\\\Delta r_{3,i}&0&-\Delta r_{1,i}\\-\Delta r_{2,i}&\Delta r_{1,i}&0\end{bmatrix)){\begin{bmatrix}u_{1}\\u_{2}\\u_{3}\end{bmatrix))\right)\right)\;\ldots {\text{ cross-product as matrix multiplication))\\[6pt]&=-\sum _{i=1}^{n}m_{i}\left({\begin{bmatrix}0&-\Delta r_{3,i}&\Delta r_{2,i}\\\Delta r_{3,i}&0&-\Delta r_{1,i}\\-\Delta r_{2,i}&\Delta r_{1,i}&0\end{bmatrix)){\begin{bmatrix}-\Delta r_{3,i}\,u_{2}+\Delta r_{2,i}\,u_{3}\\+\Delta r_{3,i}\,u_{1}-\Delta r_{1,i}\,u_{3}\\-\Delta r_{2,i}\,u_{1}+\Delta r_{1,i}\,u_{2}\end{bmatrix))\right)\\[6pt]&=-\sum _{i=1}^{n}m_{i}{\begin{bmatrix}-\Delta r_{3,i}(+\Delta r_{3,i}\,u_{1}-\Delta r_{1,i}\,u_{3})+\Delta r_{2,i}(-\Delta r_{2,i}\,u_{1}+\Delta r_{1,i}\,u_{2})\\+\Delta r_{3,i}(-\Delta r_{3,i}\,u_{2}+\Delta r_{2,i}\,u_{3})-\Delta r_{1,i}(-\Delta r_{2,i}\,u_{1}+\Delta r_{1,i}\,u_{2})\\-\Delta r_{2,i}(-\Delta r_{3,i}\,u_{2}+\Delta r_{2,i}\,u_{3})+\Delta r_{1,i}(+\Delta r_{3,i}\,u_{1}-\Delta r_{1,i}\,u_{3})\end{bmatrix))\\[6pt]&=-\sum _{i=1}^{n}m_{i}{\begin{bmatrix}-\Delta r_{3,i}^{2}\,u_{1}+\Delta r_{1,i}\Delta r_{3,i}\,u_{3}-\Delta r_{2,i}^{2}\,u_{1}+\Delta r_{1,i}\Delta r_{2,i}\,u_{2}\\-\Delta r_{3,i}^{2}\,u_{2}+\Delta r_{2,i}\Delta r_{3,i}\,u_{3}+\Delta r_{2,i}\Delta r_{1,i}\,u_{1}-\Delta r_{1,i}^{2}\,u_{2}\\+\Delta r_{3,i}\Delta r_{2,i}\,u_{2}-\Delta r_{2,i}^{2}\,u_{3}+\Delta r_{3,i}\Delta r_{1,i}\,u_{1}-\Delta r_{1,i}^{2}\,u_{3}\end{bmatrix))\\[6pt]&=-\sum _{i=1}^{n}m_{i}{\begin{bmatrix}-(\Delta r_{2,i}^{2}+\Delta r_{3,i}^{2})\,u_{1}+\Delta r_{1,i}\Delta r_{2,i}\,u_{2}+\Delta r_{1,i}\Delta r_{3,i}\,u_{3}\\+\Delta r_{2,i}\Delta r_{1,i}\,u_{1}-(\Delta r_{1,i}^{2}+\Delta r_{3,i}^{2})\,u_{2}+\Delta r_{2,i}\Delta r_{3,i}\,u_{3}\\+\Delta r_{3,i}\Delta r_{1,i}\,u_{1}+\Delta r_{3,i}\Delta r_{2,i}\,u_{2}-(\Delta r_{1,i}^{2}+\Delta r_{2,i}^{2})\,u_{3}\end{bmatrix))\\[6pt]&=-\sum _{i=1}^{n}m_{i}{\begin{bmatrix}-(\Delta r_{2,i}^{2}+\Delta r_{3,i}^{2})&\Delta r_{1,i}\Delta r_{2,i}&\Delta r_{1,i}\Delta r_{3,i}\\\Delta r_{2,i}\Delta r_{1,i}&-(\Delta r_{1,i}^{2}+\Delta r_{3,i}^{2})&\Delta r_{2,i}\Delta r_{3,i}\\\Delta r_{3,i}\Delta r_{1,i}&\Delta r_{3,i}\Delta r_{2,i}&-(\Delta r_{1,i}^{2}+\Delta r_{2,i}^{2})\end{bmatrix)){\begin{bmatrix}u_{1}\\u_{2}\\u_{3}\end{bmatrix))\\&=-\sum _{i=1}^{n}m_{i}[\Delta r_{i}]^{2}\mathbf {u} \\[6pt]-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times \mathbf {u} )]&=\left(-\sum _{i=1}^{n}m_{i}[\Delta r_{i}]^{2}\right)\mathbf {u} \;\ldots \;\mathbf {u} {\text{ is not characteristic of ))P_{i}\end{aligned))}

Finally, the result is used to complete the main proof as follows:

{\displaystyle {\begin{aligned}{\boldsymbol {\tau ))&=-\sum _{i=1}^{n}m_{i}[{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\alpha )))]+{\boldsymbol {\omega ))\times -\sum _{i=1}^{n}m_{i}{\boldsymbol {\Delta ))\mathbf {r} _{i}\times ({\boldsymbol {\Delta ))\mathbf {r} _{i}\times {\boldsymbol {\omega )))]\\&=\left(-\sum _{i=1}^{n}m_{i}[\Delta r_{i}]^{2}\right){\boldsymbol {\alpha ))+{\boldsymbol {\omega ))\times \left(-\sum _{i=1}^{n}m_{i}[\Delta r_{i}]^{2}\right){\boldsymbol {\omega ))\end{aligned))}

Thus, the resultant torque on the rigid system of particles is given by

${\displaystyle {\boldsymbol {\tau ))=\mathbf {I} _{\mathbf {C} }{\boldsymbol {\alpha ))+{\boldsymbol {\omega ))\times \mathbf {I} _{\mathbf {C} }{\boldsymbol {\omega )),}$
where ${\displaystyle \mathbf {I_{C)) }$ is the inertia matrix relative to the center of mass.

Parallel axis theorem

 Main article: Parallel axis theorem

The inertia matrix of a body depends on the choice of the reference point. There is a useful relationship between the inertia matrix relative to the center of mass ${\displaystyle \mathbf {C} }$ and the inertia matrix relative to another point ${\displaystyle \mathbf {R} }$. This relationship is called the parallel axis theorem.[3][6]

Consider the inertia matrix ${\displaystyle \mathbf {I_{R)) }$ obtained for a rigid system of particles measured relative to a reference point ${\displaystyle \mathbf {R} }$, given by

${\displaystyle \mathbf {I} _{\mathbf {R} }=-\sum _{i=1}^{n}m_{i}\left[\mathbf {r} _{i}-\mathbf {R} \right]^{2}.}$

Let ${\displaystyle \mathbf {C} }$ be the center of mass of the rigid system, then

${\displaystyle \mathbf {R} =(\mathbf {R} -\mathbf {C} )+\mathbf {C} =\mathbf {d} +\mathbf {C} ,}$
where ${\displaystyle \mathbf {d} }$ is the vector from the center of mass ${\displaystyle \mathbf {C} }$ to the reference point ${\displaystyle \mathbf {R} }$. Use this equation to compute the inertia matrix,
${\displaystyle \mathbf {I} _{\mathbf {R} }=-\sum _{i=1}^{n}m_{i}[\mathbf {r} _{i}-\left(\mathbf {C} +\mathbf {d} \right)]^{2}=-\sum _{i=1}^{n}m_{i}[\left(\mathbf {r} _{i}-\mathbf {C} \right)-\mathbf {d} ]^{2}.}$

Distribute over the cross product to obtain

${\displaystyle \mathbf {I} _{\mathbf {R} }=-\left(\sum _{i=1}^{n}m_{i}[\mathbf {r} _{i}-\mathbf {C} ]^{2}\right)+\left(\sum _{i=1}^{n}m_{i}[\mathbf {r} _{i}-\mathbf {C} ]\right)[\mathbf {d} ]+[\mathbf {d} ]\left(\sum _{i=1}^{n}m_{i}[\mathbf {r} _{i}-\mathbf {C} ]\right)-\left(\sum _{i=1}^{n}m_{i}\right)[\mathbf {d} ]^{2}.}$

The first term is the inertia matrix ${\displaystyle \mathbf {I_{C)) }$ relative to the center of mass. The second and third terms are zero by definition of the center of mass ${\displaystyle \mathbf {C} }$. And the last term is the total mass of the system multiplied by the square of the skew-symmetric matrix ${\displaystyle [\mathbf {d} ]}$ constructed from ${\displaystyle \mathbf {d} }$.

The result is the parallel axis theorem,

${\displaystyle \mathbf {I} _{\mathbf {R} }=\mathbf {I} _{\mathbf {C} }-M[\mathbf {d} ]^{2},}$
where ${\displaystyle \mathbf {d} }$ is the vector from the center of mass ${\displaystyle \mathbf {C} }$ to the reference point ${\displaystyle \mathbf {R} }$.

Note on the minus sign: By using the skew symmetric matrix of position vectors relative to the reference point, the inertia matrix of each particle has the form ${\displaystyle -m\left[\mathbf {r} \right]^{2))$, which is similar to the ${\displaystyle mr^{2))$ that appears in planar movement. However, to make this to work out correctly a minus sign is needed. This minus sign can be absorbed into the term ${\displaystyle m\left[\mathbf {r} \right]^{\mathsf {T))\left[\mathbf {r} \right]}$, if desired, by using the skew-symmetry property of ${\displaystyle [\mathbf {r} ]}$.

Scalar moment of inertia in a plane

The scalar moment of inertia, ${\displaystyle I_{L))$, of a body about a specified axis whose direction is specified by the unit vector ${\displaystyle \mathbf {\hat {k)) }$ and passes through the body at a point ${\displaystyle \mathbf {R} }$ is as follows:[6]

${\displaystyle I_{L}=\mathbf {\hat {k)) \cdot \left(-\sum _{i=1}^{N}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right)\mathbf {\hat {k)) =\mathbf {\hat {k)) \cdot \mathbf {I} _{\mathbf {R} }\mathbf {\hat {k)) =\mathbf {\hat {k)) ^{\mathsf {T))\mathbf {I} _{\mathbf {R} }\mathbf {\hat {k)) ,}$
where ${\displaystyle \mathbf {I_{R)) }$ is the moment of inertia matrix of the system relative to the reference point ${\displaystyle \mathbf {R} }$, and ${\displaystyle [\Delta \mathbf {r} _{i}]}$ is the skew symmetric matrix obtained from the vector ${\displaystyle \Delta \mathbf {r} _{i}=\mathbf {r} _{i}-\mathbf {R} }$.

This is derived as follows. Let a rigid assembly of ${\displaystyle n}$ particles, ${\displaystyle P_{i},i=1,\dots ,n}$, have coordinates ${\displaystyle \mathbf {r} _{i))$. Choose ${\displaystyle \mathbf {R} }$ as a reference point and compute the moment of inertia around a line L defined by the unit vector ${\displaystyle \mathbf {\hat {k)) }$ through the reference point ${\displaystyle \mathbf {R} }$, ${\displaystyle \mathbf {L} (t)=\mathbf {R} +t\mathbf {\hat {k)) }$. The perpendicular vector from this line to the particle ${\displaystyle P_{i))$ is obtained from ${\displaystyle \Delta \mathbf {r} _{i))$ by removing the component that projects onto ${\displaystyle \mathbf {\hat {k)) }$.

${\displaystyle \Delta \mathbf {r} _{i}^{\perp }=\Delta \mathbf {r} _{i}-\left(\mathbf {\hat {k)) \cdot \Delta \mathbf {r} _{i}\right)\mathbf {\hat {k)) =\left(\mathbf {E} -\mathbf {\hat {k)) \mathbf {\hat {k)) ^{\mathsf {T))\right)\Delta \mathbf {r} _{i},}$
where ${\displaystyle \mathbf {E} }$ is the identity matrix, so as to avoid confusion with the inertia matrix, and ${\displaystyle \mathbf {\hat {k)) \mathbf {\hat {k)) ^{\mathsf {T))}$ is the outer product matrix formed from the unit vector ${\displaystyle \mathbf {\hat {k)) }$ along the line ${\displaystyle L}$.

To relate this scalar moment of inertia to the inertia matrix of the body, introduce the skew-symmetric matrix ${\displaystyle \left[\mathbf {\hat {k)) \right]}$ such that ${\displaystyle \left[\mathbf {\hat {k)) \right]\mathbf {y} =\mathbf {\hat {k)) \times \mathbf {y} }$, then we have the identity

${\displaystyle -\left[\mathbf {\hat {k)) \right]^{2}\equiv \left|\mathbf {\hat {k)) \right|^{2}\left(\mathbf {E} -\mathbf {\hat {k)) \mathbf {\hat {k)) ^{\mathsf {T))\right)=\mathbf {E} -\mathbf {\hat {k)) \mathbf {\hat {k)) ^{\mathsf {T)),}$
noting that ${\displaystyle \mathbf {\hat {k)) }$ is a unit vector.

The magnitude squared of the perpendicular vector is

{\displaystyle {\begin{aligned}\left|\Delta \mathbf {r} _{i}^{\perp }\right|^{2}&=\left(-\left[\mathbf {\hat {k)) \right]^{2}\Delta \mathbf {r} _{i}\right)\cdot \left(-\left[\mathbf {\hat {k)) \right]^{2}\Delta \mathbf {r} _{i}\right)\\&=\left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\cdot \left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\end{aligned))}

The simplification of this equation uses the triple scalar product identity

${\displaystyle \left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\cdot \left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\equiv \left(\left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\times \mathbf {\hat {k)) \right)\cdot \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right),}$
where the dot and the cross products have been interchanged. Exchanging products, and simplifying by noting that ${\displaystyle \Delta \mathbf {r} _{i))$ and ${\displaystyle \mathbf {\hat {k)) }$ are orthogonal:
{\displaystyle {\begin{aligned}&\left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\cdot \left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\\={}&\left(\left(\mathbf {\hat {k)) \times \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\right)\times \mathbf {\hat {k)) \right)\cdot \left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\\={}&\left(\mathbf {\hat {k)) \times \Delta \mathbf {r} _{i}\right)\cdot \left(-\Delta \mathbf {r} _{i}\times \mathbf {\hat {k)) \right)\\={}&-\mathbf {\hat {k)) \cdot \left(\Delta \mathbf {r} _{i}\times \Delta \mathbf {r} _{i}\times \mathbf {\hat {k)) \right)\\={}&-\mathbf {\hat {k)) \cdot \left[\Delta \mathbf {r} _{i}\right]^{2}\mathbf {\hat {k)) .\end{aligned))}

Thus, the moment of inertia around the line ${\displaystyle L}$ through ${\displaystyle \mathbf {R} }$ in the direction ${\displaystyle \mathbf {\hat {k)) }$ is obtained from the calculation

{\displaystyle {\begin{aligned}I_{L}&=\sum _{i=1}^{N}m_{i}\left|\Delta \mathbf {r} _{i}^{\perp }\right|^{2}\\&=-\sum _{i=1}^{N}m_{i}\mathbf {\hat {k)) \cdot \left[\Delta \mathbf {r} _{i}\right]^{2}\mathbf {\hat {k)) =\mathbf {\hat {k)) \cdot \left(-\sum _{i=1}^{N}m_{i}\left[\Delta \mathbf {r} _{i}\right]^{2}\right)\mathbf {\hat {k)) \\&=\mathbf {\hat {k)) \cdot \mathbf {I} _{\mathbf {R} }\mathbf {\hat {k)) =\mathbf {\hat {k)) ^{\mathsf {T))\mathbf {I} _{\mathbf {R} }\mathbf {\hat {k)) ,\end{aligned))}
where ${\displaystyle \mathbf {I_{R)) }$ is the moment of inertia matrix of the system relative to the reference point ${\displaystyle \mathbf {R} }$.

This shows that the inertia matrix can be used to calculate the moment of inertia of a body around any specified rotation axis in the body.

Inertia tensor

For the same object, different axes of rotation will have different moments of inertia about those axes. In general, the moments of inertia are not equal unless the object is symmetric about all axes. The moment of inertia tensor is a convenient way to summarize all moments of inertia of an object with one quantity. It may be calculated with respect to any point in space, although for practical purposes the center of mass is most commonly used.

Definition

For a rigid object of ${\displaystyle N}$ point masses ${\displaystyle m_{k))$, the moment of inertia tensor is given by

${\displaystyle \mathbf {I} ={\begin{bmatrix}I_{11}&I_{12}&I_{13}\\I_{21}&I_{22}&I_{23}\\I_{31}&I_{32}&I_{33}\end{bmatrix)).}$

Its components are defined as

${\displaystyle I_{ij}\ {\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}\left(\left\|\mathbf {r} _{k}\right\|^{2}\delta _{ij}-x_{i}^{(k)}x_{j}^{(k)}\right)}$

where

• ${\displaystyle i}$, ${\displaystyle j}$ is equal to 1, 2 or 3 for ${\displaystyle x}$, ${\displaystyle y}$, and ${\displaystyle z}$, respectively,
• ${\displaystyle \mathbf {r} _{k}=\left(x_{1}^{(k)},x_{2}^{(k)},x_{3}^{(k)}\right)}$ is the vector to the point mass ${\displaystyle m_{k))$ from the point about which the tensor is calculated and
• ${\displaystyle \delta _{ij))$ is the Kronecker delta.

Note that, by the definition, ${\displaystyle \mathbf {I} }$ is a symmetric tensor.

The diagonal elements are more succinctly written as

{\displaystyle {\begin{aligned}I_{xx}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}\left(y_{k}^{2}+z_{k}^{2}\right),\\I_{yy}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+z_{k}^{2}\right),\\I_{zz}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+y_{k}^{2}\right),\end{aligned))}

while the off-diagonal elements, also called the products of inertia, are

{\displaystyle {\begin{aligned}I_{xy}=I_{yx}\ &{\stackrel {\mathrm {def} }{=))\ -\sum _{k=1}^{N}m_{k}x_{k}y_{k},\\I_{xz}=I_{zx}\ &{\stackrel {\mathrm {def} }{=))\ -\sum _{k=1}^{N}m_{k}x_{k}z_{k},\\I_{yz}=I_{zy}\ &{\stackrel {\mathrm {def} }{=))\ -\sum _{k=1}^{N}m_{k}y_{k}z_{k}.\end{aligned))}

Here ${\displaystyle I_{xx))$ denotes the moment of inertia around the ${\displaystyle x}$-axis when the objects are rotated around the x-axis, ${\displaystyle I_{xy))$ denotes the moment of inertia around the ${\displaystyle y}$-axis when the objects are rotated around the ${\displaystyle x}$-axis, and so on.

These quantities can be generalized to an object with distributed mass, described by a mass density function, in a similar fashion to the scalar moment of inertia. One then has

${\displaystyle \mathbf {I} =\iiint _{V}\rho (x,y,z)\left(\|\mathbf {r} \|^{2}\mathbf {E} _{3}-\mathbf {r} \otimes \mathbf {r} \right)\,dx\,dy\,dz,}$

where ${\displaystyle \mathbf {r} \otimes \mathbf {r} }$ is their outer product, E3 is the 3×3 identity matrix, and V is a region of space completely containing the object.

Alternatively it can also be written in terms of the angular momentum operator ${\displaystyle [\mathbf {r} ]\mathbf {x} =\mathbf {r} \times \mathbf {x} }$:

${\displaystyle \mathbf {I} =\iiint _{V}\rho (\mathbf {r} )[\mathbf {r} ]^{\textsf {T))[\mathbf {r} ]\,dV=-\iiint _{Q}\rho (\mathbf {r} )[\mathbf {r} ]^{2}\,dV}$

The inertia tensor can be used in the same way as the inertia matrix to compute the scalar moment of inertia about an arbitrary axis in the direction ${\displaystyle \mathbf {n} }$,

${\displaystyle I_{n}=\mathbf {n} \cdot \mathbf {I} \cdot \mathbf {n} ,}$

where the dot product is taken with the corresponding elements in the component tensors. A product of inertia term such as ${\displaystyle I_{12))$ is obtained by the computation

${\displaystyle I_{12}=\mathbf {e} _{1}\cdot \mathbf {I} \cdot \mathbf {e} _{2},}$
and can be interpreted as the moment of inertia around the ${\displaystyle x}$-axis when the object rotates around the ${\displaystyle y}$-axis.

The components of tensors of degree two can be assembled into a matrix. For the inertia tensor this matrix is given by,

${\displaystyle \mathbf {I} ={\begin{bmatrix}I_{11}&I_{12}&I_{13}\\I_{21}&I_{22}&I_{23}\\I_{31}&I_{32}&I_{33}\end{bmatrix))={\begin{bmatrix}I_{xx}&I_{xy}&I_{xz}\\I_{yx}&I_{yy}&I_{yz}\\I_{zx}&I_{zy}&I_{zz}\end{bmatrix))={\begin{bmatrix}\sum _{k=1}^{N}m_{k}\left(y_{k}^{2}+z_{k}^{2}\right)&-\sum _{k=1}^{N}m_{k}x_{k}y_{k}&-\sum _{k=1}^{N}m_{k}x_{k}z_{k}\\-\sum _{k=1}^{N}m_{k}x_{k}y_{k}&\sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+z_{k}^{2}\right)&-\sum _{k=1}^{N}m_{k}y_{k}z_{k}\\-\sum _{k=1}^{N}m_{k}x_{k}z_{k}&-\sum _{k=1}^{N}m_{k}y_{k}z_{k}&\sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+y_{k}^{2}\right)\end{bmatrix)).}$

It is common in rigid body mechanics to use notation that explicitly identifies the ${\displaystyle x}$, ${\displaystyle y}$, and ${\displaystyle z}$-axes, such as ${\displaystyle I_{xx))$ and ${\displaystyle I_{xy))$, for the components of the inertia tensor.

Alternate inertia convention

There are some CAD and CAE applications such as SolidWorks, Unigraphics NX/Siemens NX and MSC Adams that use an alternate convention for the products of inertia. According to this convention, the minus sign is removed from the product of inertia formulas and instead inserted in the inertia matrix:

{\displaystyle {\begin{aligned}I_{xy}=I_{yx}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}x_{k}y_{k},\\I_{xz}=I_{zx}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}x_{k}z_{k},\\I_{yz}=I_{zy}\ &{\stackrel {\mathrm {def} }{=))\ \sum _{k=1}^{N}m_{k}y_{k}z_{k},\\[3pt]\mathbf {I} ={\begin{bmatrix}I_{11}&I_{12}&I_{13}\\I_{21}&I_{22}&I_{23}\\I_{31}&I_{32}&I_{33}\end{bmatrix))&={\begin{bmatrix}I_{xx}&-I_{xy}&-I_{xz}\\-I_{yx}&I_{yy}&-I_{yz}\\-I_{zx}&-I_{zy}&I_{zz}\end{bmatrix))={\begin{bmatrix}\sum _{k=1}^{N}m_{k}\left(y_{k}^{2}+z_{k}^{2}\right)&-\sum _{k=1}^{N}m_{k}x_{k}y_{k}&-\sum _{k=1}^{N}m_{k}x_{k}z_{k}\\-\sum _{k=1}^{N}m_{k}x_{k}y_{k}&\sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+z_{k}^{2}\right)&-\sum _{k=1}^{N}m_{k}y_{k}z_{k}\\-\sum _{k=1}^{N}m_{k}x_{k}z_{k}&-\sum _{k=1}^{N}m_{k}y_{k}z_{k}&\sum _{k=1}^{N}m_{k}\left(x_{k}^{2}+y_{k}^{2}\right)\end{bmatrix)).\end{aligned))}

Determine inertia convention (Principal axes method)

If one has the inertia data ${\displaystyle (I_{xx},I_{yy},I_{zz},I_{xy},I_{xz},I_{yz})}$ without knowing which inertia convention that has been used, it can be determined if one also has the principal axes. With the principal axes method, one makes inertia matrices from the following two assumptions:

1. The standard inertia convention has been used ${\displaystyle (I_{12}=I_{xy},I_{13}=I_{xz},I_{23}=I_{yz})}$.
2. The alternate inertia convention has been used ${\displaystyle (I_{12}=-I_{xy},I_{13}=-I_{xz},I_{23}=-I_{yz})}$.

Next, one calculates the eigenvectors for the two matrices. The matrix whose eigenvectors are parallel to the principal axes corresponds to the inertia convention that has been used.

Derivation of the tensor components

The distance ${\displaystyle r}$ of a particle at ${\displaystyle \mathbf {x} }$ from the axis of rotation passing through the origin in the ${\displaystyle \mathbf {\hat {n)) }$ direction is ${\displaystyle \left|\mathbf {x} -\left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)\mathbf {\hat {n)) \right|}$, where ${\displaystyle \mathbf {\hat {n)) }$ is unit vector. The moment of inertia on the axis is

${\displaystyle I=mr^{2}=m\left(\mathbf {x} -\left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)\mathbf {\hat {n)) \right)\cdot \left(\mathbf {x} -\left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)\mathbf {\hat {n)) \right)=m\left(\mathbf {x} ^{2}-2\mathbf {x} \left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)\mathbf {\hat {n)) +\left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)^{2}\mathbf {\hat {n)) ^{2}\right)=m\left(\mathbf {x} ^{2}-\left(\mathbf {x} \cdot \mathbf {\hat {n)) \right)^{2}\right).}$

Rewrite the equation using matrix transpose:

${\displaystyle I=m\left(\mathbf {x} ^{\textsf {T))\mathbf {x} -\mathbf {\hat {n)) ^{\textsf {T))\mathbf {x} \mathbf {x} ^{\textsf {T))\mathbf {\hat {n)) \right)=m\cdot \mathbf {\hat {n)) ^{\textsf {T))\left(\mathbf {x} ^{\textsf {T))\mathbf {x} \cdot \mathbf {E_{3)) -\mathbf {x} \mathbf {x} ^{\textsf {T))\right)\mathbf {\hat {n)) ,}$

where E3 is the 3×3 identity matrix.

This leads to a tensor formula for the moment of inertia

${\displaystyle I=m{\begin{bmatrix}n_{1}&n_{2}&n_{3}\end{bmatrix)){\begin{bmatrix}y^{2}+z^{2}&-xy&-xz\\-yx&x^{2}+z^{2}&-yz\\-zx&-zy&x^{2}+y^{2}\end{bmatrix)){\begin{bmatrix}n_{1}\\n_{2}\\n_{3}\end{bmatrix)).}$

For multiple particles, we need only recall that the moment of inertia is additive in order to see that this formula is correct.

Inertia tensor of translation

 Main article: Parallel axis theorem § Tensor generalization

Let ${\displaystyle \mathbf {I} _{0))$ be the inertia tensor of a body calculated at its center of mass, and ${\displaystyle \mathbf {R} }$ be the displacement vector of the body. The inertia tensor of the translated body respect to its original center of mass is given by: