Shell theorem

Last updated

In classical mechanics, the shell theorem gives gravitational simplifications that can be applied to objects inside or outside a spherically symmetrical body. This theorem has particular application to astronomy.

Contents

Isaac Newton proved the shell theorem [1] and stated that:

  1. A spherically symmetric body affects external objects gravitationally as though all of its mass were concentrated at a point at its center.
  2. If the body is a spherically symmetric shell (i.e., a hollow ball), no net gravitational force is exerted by the shell on any object inside, regardless of the object's location within the shell.

A corollary is that inside a solid sphere of constant density, the gravitational force within the object varies linearly with distance from the center, becoming zero by symmetry at the center of mass. This can be seen as follows: take a point within such a sphere, at a distance from the center of the sphere. Then you can ignore all of the shells of greater radius, according to the shell theorem (2). But the point can be considered to be external to the remaining sphere of radius r, and according to (1) all of the mass of this sphere can be considered to be concentrated at its centre. The remaining mass is proportional to (because it is based on volume). The gravitational force exerted on a body at radius r will be proportional to (the inverse square law), so the overall gravitational effect is proportional to , so is linear in .

These results were important to Newton's analysis of planetary motion; they are not immediately obvious, but they can be proven with calculus. (Gauss's law for gravity offers an alternative way to state the theorem.)

In addition to gravity, the shell theorem can also be used to describe the electric field generated by a static spherically symmetric charge density, or similarly for any other phenomenon that follows an inverse square law. The derivations below focus on gravity, but the results can easily be generalized to the electrostatic force.

Derivation of gravitational field outside of a solid sphere

There are three steps to proving Newton's shell theorem (1). First, the equation for a gravitational field due to a ring of mass will be derived. Arranging an infinite number of infinitely thin rings to make a disc, this equation involving a ring will be used to find the gravitational field due to a disk. Finally, arranging an infinite number of infinitely thin discs to make a sphere, this equation involving a disc will be used to find the gravitational field due to a sphere.

The gravitational field at a position called at on the x-axis due to a point of mass at the origin is

Point2.png Suppose that this mass is moved upwards along the y-axis to the point . The distance between and the point mass is now longer than before; It becomes the hypotenuse of the right triangle with legs and which is . Hence, the gravitational field of the elevated point is:

Pointy2.png

The magnitude of the gravitational field that would pull a particle at point in the x-direction is the gravitational field multiplied by where is the angle adjacent to the x-axis. In this case, . Hence, the magnitude of the gravitational field in the x-direction, is:

Substituting in gives

Suppose that this mass is evenly distributed in a ring centered at the origin and facing point with the same radius . Because all of the mass is located at the same angle with respect to the x-axis, and the distance between the points on the ring is the same distance as before, the gravitational field in the x-direction at point due to the ring is the same as a point mass located at a point units above the y-axis:

Wider ring2.png

To find the gravitational field at point due to a disc, an infinite number of infinitely thin rings facing , each with a radius , width of , and mass of may be placed inside one another to form a disc. The mass of any one of the rings is the mass of the disc multiplied by the ratio of the area of the ring to the total area of the disc . So, . Hence, a small change in the gravitational field, is:

Wider ring with inside ring2.png

Substituting in and integrating both sides gives the gravitational field of the disk:

Adding up the contribution to the gravitational field from each of these rings will yield the expression for the gravitational field due to a disc. This is equivalent to integrating this above expression from to , resulting in:

To find the gravitational field at point due to a sphere centered at the origin, an infinite amount of infinitely thin discs facing , each with a radius , width of , and mass of may be placed together.

These discs' radii follow the height of the cross section of a sphere (with constant radius ) which is an equation of a semi-circle: . varies from to .

The mass of any of the discs is the mass of the sphere multiplied by the ratio of the volume of an infinitely thin disc divided by the volume of a sphere (with constant radius ). The volume of an infinitely thin disc is , or . So, . Simplifying gives .

Each discs' position away from will vary with its position within the 'sphere' made of the discs, so must be replaced with .

Series of discs2.png

Replacing with , with , and with in the 'disc' equation yields:

Simplifying,

Integrating the gravitational field of each thin disc from to with respect to , and doing some careful algebra, yields Newton's shell theorem:

where is the distance between the center of the spherical mass and an arbitrary point . The gravitational field of a spherical mass may be calculated by treating all the mass as a point particle at the center of the sphere.

Outside a shell

A solid, spherically symmetric body can be modeled as an infinite number of concentric, infinitesimally thin spherical shells. If one of these shells can be treated as a point mass, then a system of shells (i.e. the sphere) can also be treated as a point mass. Consider one such shell (the diagram shows a cross-section):

Shell-diag-1.svg

(Note: the in the diagram refers to the small angle, not the arc length. The arc length is .)

Applying Newton's Universal Law of Gravitation, the sum of the forces due to the mass elements in the shaded band is

However, since there is partial cancellation due to the vector nature of the force in conjunction with the circular band's symmetry, the leftover component (in the direction pointing towards ) is given by

The total force on , then, is simply the sum of the force exerted by all the bands. By shrinking the width of each band, and increasing the number of bands, the sum becomes an integral expression:

Since and are constants, they may be taken out of the integral:

To evaluate this integral, one must first express as a function of

The total surface of a spherical shell is

while the surface area of the thin slice between and is

If the mass of the shell is , one therefore has that

and

By the law of cosines,

and

These two relations link the three parameters , and that appear in the integral together. As increases from to radians, varies from the initial value 0 to a maximal value before finally returning to zero at . At the same time, increases from the initial value to the final value as increases from 0 to radians. This is illustrated in the following animation:

Shell-diag-1-anim.gif

(Note: As viewed from , the shaded blue band appears as a thin annulus whose inner and outer radii converge to as vanishes.)

To find a primitive function to the integrand, one has to make the independent integration variable instead of .

Performing an implicit differentiation of the second of the "cosine law" expressions above yields

and thus

It follows that

where the new integration variable increases from to .

Inserting the expression for using the first of the "cosine law" expressions above, one finally gets that

A primitive function to the integrand is

and inserting the bounds and for the integration variable in this primitive function, one gets that

saying that the gravitational force is the same as that of a point mass in the center of the shell with the same mass.

Spherical Shell to Solid Sphere

It is possible to use this spherical shell result to re-derive the solid sphere result from earlier. This is done by integrating an infinitesimally thin spherical shell with mass of , and we can obtain the total gravity contribution of a solid ball to the object outside the ball

Uniform density means between the radius of to , can be expressed as a function of , i.e.,

Therefore, the total gravity is

As found earlier, this suggests that the gravity of a solid spherical ball to an exterior object can be simplified as that of a point mass in the center of the ball with the same mass.

Inside a shell

For a point inside the shell, the difference is that when θ is equal to zero, ϕ takes the value π radians and s the value Rr. When θ increases from 0 to π radians, ϕ decreases from the initial value π radians to zero and s increases from the initial value Rr to the value R + r.

This can all be seen in the following figure

Shell-diag-2.svg

Inserting these bounds into the primitive function

one gets that, in this case

saying that the net gravitational forces acting on the point mass from the mass elements of the shell, outside the measurement point, cancel out.

Generalization: If , the resultant force inside the shell is:

The above results into being identically zero if and only if

Outside the shell (i.e. or ):

Derivation using Gauss's law

The shell theorem is an immediate consequence of Gauss's law for gravity saying that

where M is the mass of the part of the spherically symmetric mass distribution that is inside the sphere with radius r and

is the surface integral of the gravitational field over any closed surface inside which the total mass is M, the unit vector being the outward normal to the surface.

The gravitational field of a spherically symmetric mass distribution like a mass point, a spherical shell or a homogeneous sphere must also be spherically symmetric. If is a unit vector in the direction from the point of symmetry to another point the gravitational field at this other point must therefore be

where g(r) only depends on the distance r to the point of symmetry

Selecting the closed surface as a sphere with radius r with center at the point of symmetry the outward normal to a point on the surface, , is precisely the direction pointing away from the point of symmetry of the mass distribution.

One, therefore, has that

and

as the area of the sphere is 4πr2.

From Gauss's law it then follows that

or,

Converses and generalizations

It is natural to ask whether the converse of the shell theorem is true, namely whether the result of the theorem implies the law of universal gravitation, or if there is some more general force law for which the theorem holds. More specifically, one may ask the question:

Suppose there is a force between masses M and m, separated by a distance r of the form such that any spherically symmetric body affects external bodies as if its mass were concentrated at its center. Then what form can the function take?

In fact, this allows exactly one more class of force than the (Newtonian) inverse square. [2] [3] The most general force as derived by Vahe Gurzadyan in [2] Gurzadyan theorem is:

where and can be constants taking any value. The first term is the familiar law of universal gravitation; the second is an additional force, analogous to the cosmological constant term in general relativity.

If we further constrain the force by requiring that the second part of the theorem also holds, namely that there is no force inside a hollow ball, we exclude the possibility of the additional term, and the inverse square law is indeed the unique force law satisfying the theorem.

On the other hand, if we relax the conditions, and require only that the field everywhere outside a spherically symmetric body is the same as the field from some point mass at the center (of any mass), we allow a new class of solutions given by the Yukawa potential, of which the inverse square law is a special case.

Another generalization can be made for a disc by observing that

so:

where , and is the density of the body.

Doing all the intermediate calculations we get:

Newton's proofs

Introduction

Propositions 70 and 71 consider the force acting on a particle from a hollow sphere with an infinitesimally thin surface, whose mass density is constant over the surface. The force on the particle from a small area of the surface of the sphere is proportional to the mass of the area and inversely as the square of its distance from the particle. The first proposition considers the case when the particle is inside the sphere, the second when it is outside. The use of infinitesimals and limiting processes in geometrical constructions are simple and elegant and avoid the need for any integrations. They well illustrate Newton's method of proving many of the propositions in the Principia.

His proof of Propositions 70 is trivial. In the following, it is considered in slightly greater detail than Newton provides.

The proof of Proposition 71 is more historically significant. It forms the first part of his proof that the gravitational force of a solid sphere acting on a particle outside it is inversely proportional to the square of its distance from the center of the sphere, provided the density at any point inside the sphere is a function only of its distance from the center of the sphere.

Although the following are completely faithful to Newton's proofs, very minor changes have been made to attempt to make them clearer.

Force on a point inside a hollow sphere

Attraction Interior Sphere.png

Fig. 2 is a cross-section of the hollow sphere through the center, S and an arbitrary point, P, inside the sphere. Through P draw two lines IL and HK such that the angle KPL is very small. JM is the line through P that bisects that angle. From the inscribed angle theorem, the triangles IPH and KPL are similar. The lines KH and IL are rotated about the axis JM to form two cones that intersect the sphere in two closed curves. In Fig. 1 the sphere is seen from a distance along the line PE and is assumed transparent so both curves can be seen.

The surface of the sphere that the cones intersect can be considered to be flat, and .

Since the intersection of a cone with a plane is an ellipse, in this case the intersections form two ellipses with major axes IH and KL, where .

By a similar argument, the minor axes are in the same ratio. This is clear if the sphere is viewed from above. Therefore the two ellipses are similar, so their areas are as the squares of their major axes. As the mass of any section of the surface is proportional to the area of that section, for the two elliptical areas the ratios of their masses .

Since the force of attraction on P in the direction JM from either of the elliptic areas, is direct as the mass of the area and inversely as the square of its distance from P, it is independent of the distance of P from the sphere. Hence, the forces on P from the two infinitesimal elliptical areas are equal and opposite and there is no net force in the direction JM.

As the position of P and the direction of JM are both arbitrary, it follows that any particle inside a hollow sphere experiences no net force from the mass of the sphere.

Note: Newton simply describes the arcs IH and KL as 'minimally small' and the areas traced out by the lines IL and HK can be any shape, not necessarily elliptic, but they will always be similar.

Force on a point outside a hollow sphere

Attraction Exterior Sphere.png

Fig. 1 is a cross-section of the hollow sphere through the center, S with an arbitrary point, P, outside the sphere. PT is the tangent to the circle at T which passes through P. HI is a small arc on the surface such that PH is less than PT. Extend PI to intersect the sphere at L and draw SF to the point F that bisects IL. Extend PH to intersect the sphere at K and draw SE to the point E that bisects HK, and extend SF to intersect HK at D. Drop a perpendicular IQ on to the line PS joining P to the center S. Let the radius of the sphere be a and the distance PS be D.

Let arc IH be extended perpendicularly out of the plane of the diagram, by a small distance ζ. The area of the figure generated is , and its mass is proportional to this product.

The force due to this mass on the particle at P and is along the line PI.

The component of this force towards the center .

If now the arc HI is rotated completely about the line PS to form a ring of width HI and radius IQ, the length of the ring is 2π·IQ and its area is 2π·IQ·IH. The component of the force due to this ring on the particle at P in the direction PS becomes .

The perpendicular components of the force directed towards PS cancel out since the mass in the ring is distributed symmetrically about PS. Therefore, the component in the direction PS is the total force on P due to the ring formed by rotating arc HI about PS.

From similar triangles: ;, and .

If HI is sufficiently small that it can be taken as a straight line, is a right angle, and , so that .

Hence the force on P due to the ring .

Assume now in Fig. 2 that another particle is outside the sphere at a point p, a different distance d from the center of the sphere, with corresponding points lettered in lower case. For easy comparison, the construction of P in Fig. 1 is also shown in Fig. 2. As before, ph is less than pt.

Generate a ring with width ih and radius iq by making angle and the slightly larger angle , so that the distance PS is subtended by the same angle at I as is pS at i. The same holds for H and h, respectively.

The total force on p due to this ring is

Clearly ,, and .

Newton claims that DF and df can be taken as equal in the limit as the angles DPF and dpf 'vanish together'. Note that angles DPF and dpf are not equal. Although DS and dS become equal in the limit, this does not imply that the ratio of DF to df becomes equal to unity, when DF and df both approach zero. In the finite case DF depends on D, and df on d, so they are not equal.

Since the ratio of DF to df in the limit is crucial, more detailed analysis is required. From the similar right triangles, and , giving . Solving the quadratic for DF, in the limit as ES approaches FS, the smaller root, . More simply, as DF approaches zero, in the limit the term can be ignored: leading to the same result. Clearly df has the same limit, justifying Newton’s claim.

Comparing the force from the ring HI rotated about PS to the ring hi about pS, the ratio of these 2 forces equals .

By dividing up the arcs AT and Bt into corresponding infinitesimal rings, it follows that the ratio of the force due to the arc AT rotated about PS to that of Bt rotated about pS is in the same ratio, and similarly, the ratio of the forces due to arc TB to that of tA both rotated are in the same ratio.

Therefore, the force on a particle any distance D from the center of the hollow sphere is inversely proportional to , which proves the proposition.

Shell theorem in general relativity

An analogue for shell theorem exists in general relativity (GR).

Spherical symmetry implies that the metric has time-independent Schwarzschild geometry, even if a central mass is undergoing gravitational collapse (Misner et al. 1973; see Birkhoff's theorem). The metric thus has form

(using geometrized units, where ). For (where is the radius of some mass shell), mass acts as a delta function at the origin. For , shells of mass may exist externally, but for the metric to be non-singular at the origin, must be zero in the metric. This reduces the metric to flat Minkowski space; thus external shells have no gravitational effect.

This result illuminates the gravitational collapse leading to a black hole and its effect on the motion of light-rays and particles outside and inside the event horizon (Hartle 2003, chapter 12).

See also

Related Research Articles

<span class="mw-page-title-main">Spherical coordinate system</span> 3-dimensional coordinate system

In mathematics, a spherical coordinate system is a coordinate system for three-dimensional space where the position of a given point in space is specified by three numbers, : the radial distance of the radial liner connecting the point to the fixed point of origin ; the polar angle θ of the radial line r; and the azimuthal angle φ of the radial line r.

<span class="mw-page-title-main">Laplace's equation</span> Second-order partial differential equation

In mathematics and physics, Laplace's equation is a second-order partial differential equation named after Pierre-Simon Laplace, who first studied its properties. This is often written as

In geometry, a solid angle is a measure of the amount of the field of view from some particular point that a given object covers. That is, it is a measure of how large the object appears to an observer looking from that point. The point from which the object is viewed is called the apex of the solid angle, and the object is said to subtend its solid angle at that point.

<span class="mw-page-title-main">Ellipsoid</span> Quadric surface that looks like a deformed sphere

An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional scalings, or more generally, of an affine transformation.

In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space under the operation of composition.

<span class="mw-page-title-main">Spherical harmonics</span> Special mathematical functions defined on the surface of a sphere

In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. A list of the spherical harmonics is available in Table of spherical harmonics.

In fluid dynamics, Stokes' law is an empirical law for the frictional force – also called drag force – exerted on spherical objects with very small Reynolds numbers in a viscous fluid. It was derived by George Gabriel Stokes in 1851 by solving the Stokes flow limit for small Reynolds numbers of the Navier–Stokes equations.

<span class="mw-page-title-main">Scalar potential</span> When potential energy difference depends only on displacement

In mathematical physics, scalar potential, simply stated, describes the situation where the difference in the potential energies of an object in two different positions depends only on the positions, not upon the path taken by the object in traveling from one position to the other. It is a scalar field in three-space: a directionless value (scalar) that depends only on its location. A familiar example is potential energy due to gravity.

<span class="mw-page-title-main">Gravitational potential</span> Fundamental study of potential theory

In classical mechanics, the gravitational potential is a scalar field associating with each point in space the work per unit mass that would be needed to move an object to that point from a fixed reference point. It is analogous to the electric potential with mass playing the role of charge. The reference point, where the potential is zero, is by convention infinitely far away from any mass, resulting in a negative potential at any finite distance.

<span class="mw-page-title-main">Projectile motion</span> Motion of launched objects due to gravity

Projectile motion is a form of motion experienced by an object or particle that is projected in a gravitational field, such as from Earth's surface, and moves along a curved path under the action of gravity only. In the particular case of projectile motion on Earth, most calculations assume the effects of air resistance are passive and negligible. The curved path of objects in projectile motion was shown by Galileo to be a parabola, but may also be a straight line in the special case when it is thrown directly upward or downward. The study of such motions is called ballistics, and such a trajectory is a ballistic trajectory. The only force of mathematical significance that is actively exerted on the object is gravity, which acts downward, thus imparting to the object a downward acceleration towards the Earth’s center of mass. Because of the object's inertia, no external force is needed to maintain the horizontal velocity component of the object's motion. Taking other forces into account, such as aerodynamic drag or internal propulsion, requires additional analysis. A ballistic missile is a missile only guided during the relatively brief initial powered phase of flight, and whose remaining course is governed by the laws of classical mechanics.

In geometry, the area enclosed by a circle of radius r is πr2. Here the Greek letter π represents the constant ratio of the circumference of any circle to its diameter, approximately equal to 3.14159.

In calculus, the Leibniz integral rule for differentiation under the integral sign states that for an integral of the form

In mathematical physics and differential geometry, a gravitational instanton is a four-dimensional complete Riemannian manifold satisfying the vacuum Einstein equations. They are so named because they are analogues in quantum theories of gravity of instantons in Yang–Mills theory. In accordance with this analogy with self-dual Yang–Mills instantons, gravitational instantons are usually assumed to look like four dimensional Euclidean space at large distances, and to have a self-dual Riemann tensor. Mathematically, this means that they are asymptotically locally Euclidean hyperkähler 4-manifolds, and in this sense, they are special examples of Einstein manifolds. From a physical point of view, a gravitational instanton is a non-singular solution of the vacuum Einstein equations with positive-definite, as opposed to Lorentzian, metric.

<span class="mw-page-title-main">Multiple integral</span> Generalization of definite integrals to functions of multiple variables

In mathematics (specifically multivariable calculus), a multiple integral is a definite integral of a function of several real variables, for instance, f(x, y) or f(x, y, z). Physical (natural philosophy) interpretation: S any surface, V any volume, etc.. Incl. variable to time, position, etc.

The method of image charges is a basic problem-solving tool in electrostatics. The name originates from the replacement of certain elements in the original layout with imaginary charges, which replicates the boundary conditions of the problem.

<span class="mw-page-title-main">Pendulum (mechanics)</span> Free swinging suspended body

A pendulum is a body suspended from a fixed support so that it swings freely back and forth under the influence of gravity. When a pendulum is displaced sideways from its resting, equilibrium position, it is subject to a restoring force due to gravity that will accelerate it back towards the equilibrium position. When released, the restoring force acting on the pendulum's mass causes it to oscillate about the equilibrium position, swinging it back and forth. The mathematics of pendulums are in general quite complicated. Simplifying assumptions can be made, which in the case of a simple pendulum allow the equations of motion to be solved analytically for small-angle oscillations.

In special functions, a topic in mathematics, spin-weighted spherical harmonics are generalizations of the standard spherical harmonics and—like the usual spherical harmonics—are functions on the sphere. Unlike ordinary spherical harmonics, the spin-weighted harmonics are U(1) gauge fields rather than scalar fields: mathematically, they take values in a complex line bundle. The spin-weighted harmonics are organized by degree l, just like ordinary spherical harmonics, but have an additional spin weights that reflects the additional U(1) symmetry. A special basis of harmonics can be derived from the Laplace spherical harmonics Ylm, and are typically denoted by sYlm, where l and m are the usual parameters familiar from the standard Laplace spherical harmonics. In this special basis, the spin-weighted spherical harmonics appear as actual functions, because the choice of a polar axis fixes the U(1) gauge ambiguity. The spin-weighted spherical harmonics can be obtained from the standard spherical harmonics by application of spin raising and lowering operators. In particular, the spin-weighted spherical harmonics of spin weight s = 0 are simply the standard spherical harmonics:

In mathematics, vector spherical harmonics (VSH) are an extension of the scalar spherical harmonics for use with vector fields. The components of the VSH are complex-valued functions expressed in the spherical coordinate basis vectors.

<span class="mw-page-title-main">Kepler orbit</span> Celestial orbit whose trajectory is a conic section in the orbital plane

In celestial mechanics, a Kepler orbit is the motion of one body relative to another, as an ellipse, parabola, or hyperbola, which forms a two-dimensional orbital plane in three-dimensional space. A Kepler orbit can also form a straight line. It considers only the point-like gravitational attraction of two bodies, neglecting perturbations due to gravitational interactions with other objects, atmospheric drag, solar radiation pressure, a non-spherical central body, and so on. It is thus said to be a solution of a special case of the two-body problem, known as the Kepler problem. As a theory in classical mechanics, it also does not take into account the effects of general relativity. Keplerian orbits can be parametrized into six orbital elements in various ways.

In Einstein's theory of general relativity, the interior Schwarzschild metric is an exact solution for the gravitational field in the interior of a non-rotating spherical body which consists of an incompressible fluid and has zero pressure at the surface. This is a static solution, meaning that it does not change over time. It was discovered by Karl Schwarzschild in 1916, who earlier had found the exterior Schwarzschild metric.

References

  1. Newton, Isaac (1687). Philosophiae Naturalis Principia Mathematica. London. pp. 193, Theorem XXXI.
  2. 1 2 Gurzadyan, Vahe (1985). "The cosmological constant in McCrea-Milne cosmological scheme". The Observatory. 105: 42–43. Bibcode:1985Obs...105...42G. http://adsabs.harvard.edu/full/1985Obs...105...42G
  3. Arens, Richard (January 1, 1990). "Newton's observations about the field of a uniform thin spherical shell". Note di Matematica. X (Suppl. n. 1): 39–45.