Aberration (astronomy)

Last updated
A diagram showing how the apparent position of a star viewed from the Earth can change depending on the Earth's velocity. The effect is typically much smaller than illustrated. Simple stellar aberration diagram.svg
A diagram showing how the apparent position of a star viewed from the Earth can change depending on the Earth's velocity. The effect is typically much smaller than illustrated.

In astronomy, aberration (also referred to as astronomical aberration, stellar aberration, or velocity aberration) is a phenomenon where celestial objects exhibit an apparent motion about their true positions based on the velocity of the observer: It causes objects to appear to be displaced towards the observer's direction of motion. The change in angle is of the order of where is the speed of light and the velocity of the observer. In the case of "stellar" or "annual" aberration, the apparent position of a star to an observer on Earth varies periodically over the course of a year as the Earth's velocity changes as it revolves around the Sun, by a maximum angle of approximately 20  arcseconds in right ascension or declination.

Contents

The term aberration has historically been used to refer to a number of related phenomena concerning the propagation of light in moving bodies. [1] Aberration is distinct from parallax, which is a change in the apparent position of a relatively nearby object, as measured by a moving observer, relative to more distant objects that define a reference frame. The amount of parallax depends on the distance of the object from the observer, whereas aberration does not. Aberration is also related to light-time correction and relativistic beaming, although it is often considered separately from these effects.

Aberration is historically significant because of its role in the development of the theories of light, electromagnetism and, ultimately, the theory of special relativity. It was first observed in the late 1600s by astronomers searching for stellar parallax in order to confirm the heliocentric model of the Solar System. However, it was not understood at the time to be a different phenomenon. [2] In 1727, James Bradley provided a classical explanation for it in terms of the finite speed of light relative to the motion of the Earth in its orbit around the Sun, [3] [4] which he used to make one of the earliest measurements of the speed of light. However, Bradley's theory was incompatible with 19th-century theories of light, and aberration became a major motivation for the aether drag theories of Augustin Fresnel (in 1818) and G. G. Stokes (in 1845), and for Hendrik Lorentz's aether theory of electromagnetism in 1892. The aberration of light, together with Lorentz's elaboration of Maxwell's electrodynamics, the moving magnet and conductor problem, the negative aether drift experiments, as well as the Fizeau experiment, led Albert Einstein to develop the theory of special relativity in 1905, which presents a general form of the equation for aberration in terms of such theory. [5]

Explanation

Light rays striking the earth in the Sun's rest frame compared to the same rays in the Earth's rest frame according to special relativity. The effect is exaggerated for illustrative purposes. Sun earth relativistic aberration.svg
Light rays striking the earth in the Sun's rest frame compared to the same rays in the Earth's rest frame according to special relativity. The effect is exaggerated for illustrative purposes.

Aberration may be explained as the difference in angle of a beam of light in different inertial frames of reference. A common analogy is to consider the apparent direction of falling rain. If rain is falling vertically in the frame of reference of a person standing still, then to a person moving forwards the rain will appear to arrive at an angle, requiring the moving observer to tilt their umbrella forwards. The faster the observer moves, the more tilt is needed.

The net effect is that light rays striking the moving observer from the sides in a stationary frame will come angled from ahead in the moving observer's frame. This effect is sometimes called the "searchlight" or "headlight" effect.

In the case of annual aberration of starlight, the direction of incoming starlight as seen in the Earth's moving frame is tilted relative to the angle observed in the Sun's frame. Since the direction of motion of the Earth changes during its orbit, the direction of this tilting changes during the course of the year, and causes the apparent position of the star to differ from its true position as measured in the inertial frame of the Sun.

While classical reasoning gives intuition for aberration, it leads to a number of physical paradoxes observable even at the classical level (see history). The theory of special relativity is required to correctly account for aberration. The relativistic explanation is very similar to the classical one however, and in both theories aberration may be understood as a case of addition of velocities.

Classical explanation

In the Sun's frame, consider a beam of light with velocity equal to the speed of light , with x and y velocity components and , and thus at an angle such that . If the Earth is moving at velocity in the x direction relative to the Sun, then by velocity addition the x component of the beam's velocity in the Earth's frame of reference is , and the y velocity is unchanged, . Thus the angle of the light in the Earth's frame in terms of the angle in the Sun's frame is

In the case of , this result reduces to , which in the limit may be approximated by .

Relativistic explanation

The reasoning in the relativistic case is the same except that the relativistic velocity addition formulas must be used, which can be derived from Lorentz transformations between different frames of reference. These formulas are

where , giving the components of the light beam in the Earth's frame in terms of the components in the Sun's frame. The angle of the beam in the Earth's frame is thus [6]

In the case of , this result reduces to , and in the limit this may be approximated by . This relativistic derivation keeps the speed of light constant in all frames of reference, unlike the classical derivation above.

Relationship to light-time correction and relativistic beaming

Aberration, light-time correction, and relativistic beaming can be considered the same phenomenon depending on the frame of reference. Aberrationlighttimebeaming.gif
Aberration, light-time correction, and relativistic beaming can be considered the same phenomenon depending on the frame of reference.

Aberration is related to two other phenomena, light-time correction, which is due to the motion of an observed object during the time taken by its light to reach an observer, and relativistic beaming, which is an angling of the light emitted by a moving light source. It can be considered equivalent to them but in a different inertial frame of reference. In aberration, the observer is considered to be moving relative to a (for the sake of simplicity [7] ) stationary light source, while in light-time correction and relativistic beaming the light source is considered to be moving relative to a stationary observer.

Consider the case of an observer and a light source moving relative to each other at constant velocity, with a light beam moving from the source to the observer. At the moment of emission, the beam in the observer's rest frame is tilted compared to the one in the source's rest frame, as understood through relativistic beaming. During the time it takes the light beam to reach the observer the light source moves in the observer's frame, and the 'true position' of the light source is displaced relative to the apparent position the observer sees, as explained by light-time correction. Finally, the beam in the observer's frame at the moment of observation is tilted compared to the beam in source's frame, which can be understood as an aberrational effect. Thus, a person in the light source's frame would describe the apparent tilting of the beam in terms of aberration, while a person in the observer's frame would describe it as a light-time effect.

The relationship between these phenomena is only valid if the observer and source's frames are inertial frames. In practice, because the Earth is not an inertial rest frame but experiences centripetal acceleration towards the Sun, many aberrational effects such as annual aberration on Earth cannot be considered light-time corrections. However, if the time between emission and detection of the light is short compared to the orbital period of the Earth, the Earth may be approximated as an inertial frame and aberrational effects are equivalent to light-time corrections.

Types

The Astronomical Almanac describes several different types of aberration, arising from differing components of the Earth's and observed object's motion:

Annual aberration

Stars at the ecliptic poles appear to move in circles, stars exactly in the ecliptic plane move in lines, and stars at intermediate angles move in ellipses. Shown here are the apparent motions of stars with the ecliptic latitudes corresponding to these cases, and with ecliptic longitude of 270deg. Aberration3.svg
Stars at the ecliptic poles appear to move in circles, stars exactly in the ecliptic plane move in lines, and stars at intermediate angles move in ellipses. Shown here are the apparent motions of stars with the ecliptic latitudes corresponding to these cases, and with ecliptic longitude of 270°.
The direction of aberration of a star at the northern ecliptic pole differs at different times of the year Aberrationseasons.svg
The direction of aberration of a star at the northern ecliptic pole differs at different times of the year

Annual aberration is caused by the motion of an observer on Earth as the planet revolves around the Sun. Due to orbital eccentricity, the orbital velocity of Earth (in the Sun's rest frame) varies periodically during the year as the planet traverses its elliptic orbit and consequently the aberration also varies periodically, typically causing stars to appear to move in small ellipses.

Approximating Earth's orbit as circular, the maximum displacement of a star due to annual aberration is known as the constant of aberration, conventionally represented by . It may be calculated using the relation substituting the Earth's average speed in the Sun's frame for and the speed of light . Its accepted value is 20.49552  arcseconds (sec) or 0.000099365  radians (rad) (at J2000). [9]

Assuming a circular orbit, annual aberration causes stars exactly on the ecliptic (the plane of Earth's orbit) to appear to move back and forth along a straight line, varying by on either side of their position in the Sun's frame. A star that is precisely at one of the ecliptic poles (at 90° from the ecliptic plane) will appear to move in a circle of radius about its true position, and stars at intermediate ecliptic latitudes will appear to move along a small ellipse.

For illustration, consider a star at the northern ecliptic pole viewed by an observer at a point on the Arctic Circle. Such an observer will see the star transit at the zenith, once every day (strictly speaking sidereal day). At the time of the March equinox, Earth's orbit carries the observer in a southwards direction, and the star's apparent declination is therefore displaced to the south by an angle of . On the September equinox, the star's position is displaced to the north by an equal and opposite amount. On either solstice, the displacement in declination is 0. Conversely, the amount of displacement in right ascension is 0 on either equinox and at maximum on either solstice.

In actuality, Earth's orbit is slightly elliptic rather than circular, and its speed varies somewhat over the course of its orbit, which means the description above is only approximate. Aberration is more accurately calculated using Earth's instantaneous velocity relative to the barycenter of the Solar System. [9]

Note that the displacement due to aberration is orthogonal to any displacement due to parallax. If parallax is detectable, the maximum displacement to the south would occur in December, and the maximum displacement to the north in June. It is this apparently anomalous motion that so mystified early astronomers.

Solar annual aberration

A special case of annual aberration is the nearly constant deflection of the Sun from its position in the Sun's rest frame by towards the west (as viewed from Earth), opposite to the apparent motion of the Sun along the ecliptic (which is from west to east, as seen from Earth). The deflection thus makes the Sun appear to be behind (or retarded) from its rest-frame position on the ecliptic by a position or angle .

This deflection may equivalently be described as a light-time effect due to motion of the Earth during the 8.3 minutes that it takes light to travel from the Sun to Earth. The relation with is : [0.000099365 rad / 2 π rad] x [365.25 d x 24 h/d x 60 min/h] = 8.3167 min ≈ 8 min 19 sec = 499 sec. This is possible since the transit time of sunlight is short relative to the orbital period of the Earth, so the Earth's frame may be approximated as inertial. In the Earth's frame, the Sun moves, at a mean velocity v = 29.789 km/s, by a distance ≈ 14,864.7 km in the time it takes light to reach Earth, ≈ 499 sec for the orbit of mean radius = 1 AU = 149,597,870.7 km. This gives an angular correction ≈ 0.000099364 rad = 20.49539 sec, which can be solved to give ≈ 0.000099365 rad = 20.49559 sec, very nearly the same as the aberrational correction (here is in radian and not in arcsecond).

Diurnal aberration

Diurnal aberration is caused by the velocity of the observer on the surface of the rotating Earth. It is therefore dependent not only on the time of the observation, but also the latitude and longitude of the observer. Its effect is much smaller than that of annual aberration, and is only 0.32 arcseconds in the case of an observer at the Equator, where the rotational velocity is greatest. [10]

Secular aberration

The secular component of aberration, caused by the motion of the Solar System in space, has been further subdivided into several components: aberration resulting from the motion of the solar system barycenter around the center of our Galaxy, aberration resulting from the motion of the Galaxy relative to the Local Group, and aberration resulting from the motion of the Local Group relative to the cosmic microwave background. [11] :6 Secular aberration affects the apparent positions of stars and extragalactic objects. The large, constant part of secular aberration cannot be directly observed and "It has been standard practice to absorb this large, nearly constant effect into the reported" [12] :1 positions of stars. [13]

In about 200 million years, the Sun circles the galactic center, whose measured location is near right ascension (α = 266.4°) and declination (δ = −29.0°). [12] :2 The constant, unobservable, effect of the solar system's motion around the galactic center has been computed variously as 150 [14] :743 or 165 [12] :1 arcseconds. The other, observable, part is an acceleration toward the galactic center of approximately 2.5 × 10−10 m/s2, which yields a change of aberration of about 5 μas/yr. [15] Highly precise measurements extending over several years can observe this change in secular aberration, often called the secular aberration drift or the acceleration of the Solar System, as a small apparent proper motion. [16] :1 [12] :1

Recently, highly precise astrometry of extragalactic objects using both Very Long Baseline Interferometry and the Gaia space observatory have successfully measured this small effect. [16] The first VLBI measurement of the apparent motion, over a period of 20 years, of 555 extragalactic objects towards the center of our galaxy at equatorial coordinates of α = 263° and δ = −20° indicated a secular aberration drift 6.4 ±1.5 μas/yr. [16] :1 Later determinations using a series of VLBI measurements extending over almost 40 years determined the secular aberration drift to be 5.83 ± 0.23 μas/yr in the direction α = 270.2 ± 2.3° and δ = −20.2° ± 3.6°. [11] :7 Optical observations using only 33 months of Gaia satellite data of 1.6 million extragalactic sources indicated an acceleration of the solar system of 2.32 ± 0.16 × 10−10 m/s2 and a corresponding secular aberration drift of 5.05 ± 0.35 μas/yr in the direction of α = 269.1° ± 5.4°, δ = −31.6° ± 4.1°. It is expected that later Gaia data releases, incorporating about 66 and 120 months of data, will reduce the random errors of these results by factors of 0.35 and 0.15. [17] [18] :1,14 The latest edition of the International Celestial Reference Frame (ICRF3) adopted a recommended galactocentric aberration constant of 5.8 μas/yr [12] :5,7 and recommended a correction for secular aberration to obtain the highest positional accuracy for times other than the reference epoch 2015.0. [11] :17–19

Planetary aberration

Planetary aberration is the combination of the aberration of light (due to Earth's velocity) and light-time correction (due to the object's motion and distance), as calculated in the rest frame of the Solar System. Both are determined at the instant when the moving object's light reaches the moving observer on Earth. It is so called because it is usually applied to planets and other objects in the Solar System whose motion and distance are accurately known.

Discovery and first observations

The discovery of the aberration of light was totally unexpected, and it was only by considerable perseverance and perspicacity that Bradley was able to explain it in 1727. It originated from attempts to discover whether stars possessed appreciable parallaxes.

Search for stellar parallax

The Copernican heliocentric theory of the Solar System had received confirmation by the observations of Galileo and Tycho Brahe and the mathematical investigations of Kepler and Newton. [19] As early as 1573, Thomas Digges had suggested that parallactic shifting of the stars should occur according to the heliocentric model, and consequently if stellar parallax could be observed it would help confirm this theory. Many observers claimed to have determined such parallaxes, but Tycho Brahe and Giovanni Battista Riccioli concluded that they existed only in the minds of the observers, and were due to instrumental and personal errors. However, in 1680 Jean Picard, in his Voyage d'Uranibourg, stated, as a result of ten years' observations, that Polaris, the Pole Star, exhibited variations in its position amounting to 40 annually. Some astronomers endeavoured to explain this by parallax, but these attempts failed because the motion differed from that which parallax would produce. John Flamsteed, from measurements made in 1689 and succeeding years with his mural quadrant, similarly concluded that the declination of Polaris was 40 less in July than in September. Robert Hooke, in 1674, published his observations of γ Draconis, a star of magnitude 2m which passes practically overhead at the latitude of London (hence its observations are largely free from the complex corrections due to atmospheric refraction), and concluded that this star was 23 more northerly in July than in October. [19]

James Bradley's observations

Bradley's observations of g Draconis and 35 Camelopardalis as reduced by Busch to the year 1730. Bradley's observations of g Draconis and 35 Camelopardalis as reduced by Busch.jpg
Bradley's observations of γ Draconis and 35 Camelopardalis as reduced by Busch to the year 1730.

Consequently, when Bradley and Samuel Molyneux entered this sphere of research in 1725, there was still considerable uncertainty as to whether stellar parallaxes had been observed or not, and it was with the intention of definitely answering this question that they erected a large telescope at Molyneux's house at Kew. [4] They decided to reinvestigate the motion of γ Draconis with a telescope constructed by George Graham (1675–1751), a celebrated instrument-maker. This was fixed to a vertical chimney stack in such manner as to permit a small oscillation of the eyepiece, the amount of which (i.e. the deviation from the vertical) was regulated and measured by the introduction of a screw and a plumb line. [19]

The instrument was set up in November 1725, and observations on γ Draconis were made starting in December. The star was observed to move 40 southwards between September and March, and then reversed its course from March to September. [19] At the same time, 35 Camelopardalis, a star with a right ascension nearly exactly opposite to that of γ Draconis, was 19" more northerly at the beginning of March than in September. [20] The asymmetry of these results, which were expected to be mirror images of each other, were completely unexpected and inexplicable by existing theories.

Early hypotheses

Hypothetical observation of g Draconis if its movement was caused by parallax. Hypothetical movement of g Draconis caused by parallax.jpg
Hypothetical observation of γ Draconis if its movement was caused by parallax.
Hypothetical observation of g Draconis and 35 Camelopardalis if their movements were caused by nutation. Hypothetical movement of g Draconis and 35 Camelopardalis caused by nutation.jpg
Hypothetical observation of γ Draconis and 35 Camelopardalis if their movements were caused by nutation.

Bradley and Molyneux discussed several hypotheses in the hope of finding the solution. Since the apparent motion was evidently caused neither by parallax nor observational errors, Bradley first hypothesized that it could be due to oscillations in the orientation of the Earth's axis relative to the celestial sphere – a phenomenon known as nutation. 35 Camelopardalis was seen to possess an apparent motion which could be consistent with nutation, but since its declination varied only one half as much as that of γ Draconis, it was obvious that nutation did not supply the answer [21] (however, Bradley later went on to discover that the Earth does indeed nutate). [22] He also investigated the possibility that the motion was due to an irregular distribution of the Earth's atmosphere, thus involving abnormal variations in the refractive index, but again obtained negative results. [21]

On August 19, 1727, Bradley embarked upon a further series of observations using a telescope of his own erected at the Rectory, Wanstead. This instrument had the advantage of a larger field of view and he was able to obtain precise positions of a large number of stars over the course of about twenty years. During his first two years at Wanstead, he established the existence of the phenomenon of aberration beyond all doubt, and this also enabled him to formulate a set of rules that would allow the calculation of the effect on any given star at a specified date.

Development of the theory of aberration

Bradley eventually developed his explanation of aberration in about September 1728 and this theory was presented to the Royal Society in mid January the following year. One well-known story was that he saw the change of direction of a wind vane on a boat on the Thames, caused not by an alteration of the wind itself, but by a change of course of the boat relative to the wind direction. [22] However, there is no record of this incident in Bradley's own account of the discovery, and it may therefore be apocryphal.

The following table shows the magnitude of deviation from true declination for γ Draconis and the direction, on the planes of the solstitial colure and ecliptic prime meridian, of the tangent of the velocity of the Earth in its orbit for each of the four months where the extremes are found, as well as expected deviation from true ecliptic longitude if Bradley had measured its deviation from right ascension:

MonthDirection of tangential velocity of Earth on the plane of the solstitial colureDeviation from true declination of γ DraconisDirection of tangential velocity of Earth on the plane of the ecliptic prime meridianExpected deviation from true ecliptic longitude of γ Draconis
Decemberzeronone← (moving toward perihelion at fast velocity)decrease of more than 20.2"
March← (moving toward aphelion)19.5" southwardzeronone
Junezeronone→ (moving toward aphelion at slow velocity)increase of less than 20.2"
September→ (moving toward perihelion)19.5" northwardzeronone

Bradley proposed that the aberration of light not only affected declination, but right ascension as well, so that a star in the pole of the ecliptic would describe a little ellipse with a diameter of about 40", but for simplicity, he assumed it to be a circle. Since he only observed the deviation in declination, and not in right ascension, his calculations for the maximum deviation of a star in the pole of the ecliptic are for its declination only, which will coincide with the diameter of the little circle described by such star. For eight different stars, his calculations are as follows:

StarAnnual Variation (")Maximum deviation in declination of a star in the pole of the ecliptic (")
γ Draconis3940.4
β Draconis3940.2
η Ursa Maj.3640.4
α Cass.3440.8
τ Persei2541.0
α Persei2340.2
35 Camel.1940.2
Capella1640.0
MEAN40.4

Based on these calculations, Bradley was able to estimate the constant of aberration at 20.2", which is equal to 0.00009793 radians, and with this was able to estimate the speed of light at 183,300 miles (295,000 km) per second. [23] By projecting the little circle for a star in the pole of the ecliptic, he could simplify the calculation of the relationship between the speed of light and the speed of the Earth's annual motion in its orbit as follows:

Thus, the speed of light to the speed of the Earth's annual motion in its orbit is 10,210 to one, from whence it would follow, that light moves, or is propagated as far as from the Sun to the Earth in 8 minutes 12 seconds. [24]

The original motivation of the search for stellar parallax was to test the Copernican theory that the Earth revolves around the Sun. The change of aberration in the course of the year demonstrates the relative motion of the Earth and the stars.

Retrodiction on Descartes' lightspeed argument

In the prior century, René Descartes argued that if light were not instantaneous, then shadows of moving objects would lag; and if propagation times over terrestrial distances were appreciable, then during a lunar eclipse the Sun, Earth, and Moon would be out of alignment by hours' motion, contrary to observation. Huygens commented that, on Rømer's lightspeed data (yielding an earth-moon round-trip time of only seconds), the lag angle would be imperceptible. What they both overlooked [25] is that aberration (as understood only later) would exactly counteract the lag even if large, leaving this eclipse method completely insensitive to light speed. (Otherwise, shadow-lag methods could be made to sense absolute translational motion, contrary to a basic principle of relativity.)

Historical theories of aberration

The phenomenon of aberration became a driving force for many physical theories during the 200 years between its observation and the explanation by Albert Einstein.

The first classical explanation was provided in 1729, by James Bradley as described above, who attributed it to the finite speed of light and the motion of Earth in its orbit around the Sun. [3] [4] However, this explanation proved inaccurate once the wave nature of light was better understood, and correcting it became a major goal of the 19th century theories of luminiferous aether. Augustin-Jean Fresnel proposed a correction due to the motion of a medium (the aether) through which light propagated, known as "partial aether drag". He proposed that objects partially drag the aether along with them as they move, and this became the accepted explanation for aberration for some time. George Stokes proposed a similar theory, explaining that aberration occurs due to the flow of aether induced by the motion of the Earth. Accumulated evidence against these explanations, combined with new understanding of the electromagnetic nature of light, led Hendrik Lorentz to develop an electron theory which featured an immobile aether, and he explained that objects contract in length as they move through the aether. Motivated by these previous theories, Albert Einstein then developed the theory of special relativity in 1905, which provides the modern account of aberration.

Bradley's classical explanation

Figure 2: As light propagates down the telescope, the telescope moves requiring a tilt to the telescope that depends on the speed of light. The apparent angle of the star ph differs from its true angle th. Stellar aberration.JPG
Figure 2: As light propagates down the telescope, the telescope moves requiring a tilt to the telescope that depends on the speed of light. The apparent angle of the star φ differs from its true angle θ.

Bradley conceived of an explanation in terms of a corpuscular theory of light in which light is made of particles. [1] His classical explanation appeals to the motion of the earth relative to a beam of light-particles moving at a finite velocity, and is developed in the Sun's frame of reference, unlike the classical derivation given above.

Consider the case where a distant star is motionless relative to the Sun, and the star is extremely far away, so that parallax may be ignored. In the rest frame of the Sun, this means light from the star travels in parallel paths to the Earth observer, and arrives at the same angle regardless of where the Earth is in its orbit. Suppose the star is observed on Earth with a telescope, idealized as a narrow tube. The light enters the tube from the star at angle and travels at speed taking a time to reach the bottom of the tube, where it is detected. Suppose observations are made from Earth, which is moving with a speed . During the transit of the light, the tube moves a distance . Consequently, for the particles of light to reach the bottom of the tube, the tube must be inclined at an angle different from , resulting in an apparent position of the star at angle . As the Earth proceeds in its orbit it changes direction, so changes with the time of year the observation is made. The apparent angle and true angle are related using trigonometry as:

.

In the case of , this gives . While this is different from the more accurate relativistic result described above, in the limit of small angle and low velocity they are approximately the same, within the error of the measurements of Bradley's day. These results allowed Bradley to make one of the earliest measurements of the speed of light. [24] [26]

Luminiferous aether

Young reasoned that aberration could only be explained if the aether were immobile in the frame of the Sun. On the left, stellar aberration occurs if an immobile aether is assumed, showing that the telescope must be tilted. On the right, the aberration disappears if the aether moves with the telescope, and the telescope does not need to be tilted. Stellar aberration versus the dragged aether.gif
Young reasoned that aberration could only be explained if the aether were immobile in the frame of the Sun. On the left, stellar aberration occurs if an immobile aether is assumed, showing that the telescope must be tilted. On the right, the aberration disappears if the aether moves with the telescope, and the telescope does not need to be tilted.

In the early nineteenth century the wave theory of light was being rediscovered, and in 1804 Thomas Young adapted Bradley's explanation for corpuscular light to wavelike light traveling through a medium known as the luminiferous aether. His reasoning was the same as Bradley's, but it required that this medium be immobile in the Sun's reference frame and must pass through the earth unaffected, otherwise the medium (and therefore the light) would move along with the earth and no aberration would be observed. [27] He wrote:

Upon consideration of the phenomena of the aberration of the stars I am disposed to believe that the luminiferous aether pervades the substance of all material bodies with little or no resistance, as freely perhaps as the wind passes through a grove of trees.

Thomas Young, 1804 [1]

However, it soon became clear Young's theory could not account for aberration when materials with a non-vacuum refractive index were present. An important example is of a telescope filled with water. The speed of light in such a telescope will be slower than in vacuum, and is given by rather than where is the refractive index of the water. Thus, by Bradley and Young's reasoning the aberration angle is given by

.

which predicts a medium-dependent angle of aberration. When refraction at the telescope's objective is taken into account this result deviates even more from the vacuum result. In 1810 François Arago performed a similar experiment and found that the aberration was unaffected by the medium in the telescope, providing solid evidence against Young's theory. This experiment was subsequently verified by many others in the following decades, most accurately by Airy in 1871, with the same result. [27]

Aether drag models

Fresnel's aether drag

In 1818, Augustin Fresnel developed a modified explanation to account for the water telescope and for other aberration phenomena. He explained that the aether is generally at rest in the Sun's frame of reference, but objects partially drag the aether along with them as they move. That is, the aether in an object of index of refraction moving at velocity is partially dragged with a velocity bringing the light along with it. This factor is known as "Fresnel's dragging coefficient". This dragging effect, along with refraction at the telescope's objective, compensates for the slower speed of light in the water telescope in Bradley's explanation. [lower-alpha 1] With this modification Fresnel obtained Bradley's vacuum result even for non-vacuum telescopes, and was also able to predict many other phenomena related to the propagation of light in moving bodies. Fresnel's dragging coefficient became the dominant explanation of aberration for the next decades.

Conceptual illustration of Stokes' aether drag theory. In the rest frame of the Sun the Earth moves to the right through the aether, in which it induces a local current. A ray of light (in red) coming from the vertical becomes dragged and tilted due to the flow of aether. Stokes aether drag.svg
Conceptual illustration of Stokes' aether drag theory. In the rest frame of the Sun the Earth moves to the right through the aether, in which it induces a local current. A ray of light (in red) coming from the vertical becomes dragged and tilted due to the flow of aether.

Stokes' aether drag

However, the fact that light is polarized (discovered by Fresnel himself) led scientists such as Cauchy and Green to believe that the aether was a totally immobile elastic solid as opposed to Fresnel's fluid aether. There was thus renewed need for an explanation of aberration consistent both with Fresnel's predictions (and Arago's observations) as well as polarization.

In 1845, Stokes proposed a 'putty-like' aether which acts as a liquid on large scales but as a solid on small scales, thus supporting both the transverse vibrations required for polarized light and the aether flow required to explain aberration. Making only the assumptions that the fluid is irrotational and that the boundary conditions of the flow are such that the aether has zero velocity far from the Earth, but moves at the Earth's velocity at its surface and within it, he was able to completely account for aberration. [lower-alpha 2] The velocity of the aether outside of the Earth would decrease as a function of distance from the Earth so light rays from stars would be progressively dragged as they approached the surface of the Earth. The Earth's motion would be unaffected by the aether due to D'Alembert's paradox.

Both Fresnel and Stokes' theories were popular. However, the question of aberration was put aside during much of the second half of the 19th century as focus of inquiry turned to the electromagnetic properties of aether.

Lorentz' length contraction

In the 1880s once electromagnetism was better understood, interest turned again to the problem of aberration. By this time flaws were known to both Fresnel's and Stokes' theories. Fresnel's theory required that the relative velocity of aether and matter to be different for light of different colors, and it was shown that the boundary conditions Stokes had assumed in his theory were inconsistent with his assumption of irrotational flow. [1] [27] [28] At the same time, the modern theories of electromagnetic aether could not account for aberration at all. Many scientists such as Maxwell, Heaviside and Hertz unsuccessfully attempted to solve these problems by incorporating either Fresnel or Stokes' theories into Maxwell's new electromagnetic laws.

Hendrik Lorentz spent considerable effort along these lines. After working on this problem for a decade, the issues with Stokes' theory caused him to abandon it and to follow Fresnel's suggestion of a (mostly) stationary aether (1892, 1895). However, in Lorentz's model the aether was completely immobile, like the electromagnetic aethers of Cauchy, Green and Maxwell and unlike Fresnel's aether. He obtained Fresnel's dragging coefficient from modifications of Maxwell's electromagnetic theory, including a modification of the time coordinates in moving frames ("local time"). In order to explain the Michelson–Morley experiment (1887), which apparently contradicted both Fresnel's and Lorentz's immobile aether theories, and apparently confirmed Stokes' complete aether drag, Lorentz theorized (1892) that objects undergo "length contraction" by a factor of in the direction of their motion through the aether. In this way, aberration (and all related optical phenomena) can be accounted for in the context of an immobile aether. Lorentz' theory became the basis for much research in the next decade, and beyond. Its predictions for aberration are identical to those of the relativistic theory. [27] [29]

Special relativity

Lorentz' theory matched experiment well, but it was complicated and made many unsubstantiated physical assumptions about the microscopic nature of electromagnetic media. In his 1905 theory of special relativity, Albert Einstein reinterpreted the results of Lorentz' theory in a much simpler and more natural conceptual framework which disposed of the idea of an aether. His derivation is given above, and is now the accepted explanation. Robert S. Shankland reported some conversations with Einstein, in which Einstein emphasized the importance of aberration: [30]

He continued to say the experimental results which had influenced him most were the observations of stellar aberration and Fizeau's measurements on the speed of light in moving water. "They were enough," he said.

Other important motivations for Einstein's development of relativity were the moving magnet and conductor problem and (indirectly) the negative aether drift experiments, already mentioned by him in the introduction of his first relativity paper. Einstein wrote in a note in 1952: [5]

My own thought was more indirectly influenced by the famous Michelson-Morley experiment. I learned of it through Lorentz' path breaking investigation on the electrodynamics of moving bodies (1895), of which I knew before the establishment of the special theory of relativity. Lorentz' basic assumption of a resting ether did not seem directly convincing to me, since it led to an [struck out: to me artificial appearing] interpretation of the Michelson-Morley experiment, which [struck out: did not convince me] seemed unnatural to me. My direct path to the sp. th. rel. was mainly determined by the conviction that the electromotive force induced in a conductor moving in a magnetic field is nothing other than an electric field. But the result of Fizeau's experiment and the phenomenon of aberration also guided me.

While Einstein's result is the same as Bradley's original equation except for an extra factor of , Bradley's result does not merely give the classical limit of the relativistic case, in the sense that it gives incorrect predictions even at low relative velocities. Bradley's explanation cannot account for situations such as the water telescope, nor for many other optical effects (such as interference) that might occur within the telescope. This is because in the Earth's frame it predicts that the direction of propagation of the light beam in the telescope is not normal to the wavefronts of the beam, in contradiction with Maxwell's theory of electromagnetism. It also does not preserve the speed of light c between frames. However, Bradley did correctly infer that the effect was due to relative velocities.

See also

Notes

  1. More in detail, Fresnel explains that the incoming light of angle is first refracted at the end of the telescope, to a new angle within the telescope. This may be accounted for by Snell's law, giving . Then drag must be accounted for. Without drag, the x and y components of the light in the telescope are and , but drag modifies the x component to if the Earth moves with velocity . If is angle and is the velocity of the light with these velocity components, then by Bradley's reasoning where is the modified path length through the water and t is the time it takes the light to travel the distance h, . Upon solving these equations for in terms of one obtains Bradley's vacuum result.
  2. The propagating wavefront moving through the aether. Stokes aether drag proof.svg
    The propagating wavefront moving through the aether.
    Stokes' derivation may be summarized as follows: Consider a wavefront moving in the downwards z direction. Say the aether has velocity field as a function of . Now, motion of the aether in the x and y directions does not affect the wavefront, but the motion in the z direction advances it (in addition to the amount it advances at speed c). If the z velocity of the aether varies over space, for example if it is slower for higher x as shown in the figure, then the wavefront becomes angled, by an angle . Now, say in time t the wavefront has moved by a span (assuming the speed of the aether is negligible compared to the speed of light). Then for each distance the ray descends, it is bent by an angle , and so the total angle by which it has changed after travelling through the entire fluid is
    If the fluid is irrotational it will satisfy the Cauchy–Riemann equations, one of which is
    .
    Inserting this into the previous result gives an aberration angle where the s represent the x component of the aether's velocity at the start and end of the ray. Far from the earth the aether has zero velocity, so and at the surface of the earth it has the earth's velocity . Thus we finally get
    which is the known aberration result.

Related Research Articles

<span class="mw-page-title-main">Diffraction</span> Phenomenon of the motion of waves

Diffraction is the interference or bending of waves around the corners of an obstacle or through an aperture into the region of geometrical shadow of the obstacle/aperture. The diffracting object or aperture effectively becomes a secondary source of the propagating wave. Italian scientist Francesco Maria Grimaldi coined the word diffraction and was the first to record accurate observations of the phenomenon in 1660.

<span class="mw-page-title-main">Nutation</span> Wobble of the axis of rotation

Nutation is a rocking, swaying, or nodding motion in the axis of rotation of a largely axially symmetric object, such as a gyroscope, planet, or bullet in flight, or as an intended behaviour of a mechanism. In an appropriate reference frame it can be defined as a change in the second Euler angle. If it is not caused by forces external to the body, it is called free nutation or Euler nutation. A pure nutation is a movement of a rotational axis such that the first Euler angle is constant. Therefore it can be seen that the circular red arrow in the diagram indicates the combined effects of precession and nutation, while nutation in the absence of precession would only change the tilt from vertical. However, in spacecraft dynamics, precession is sometimes referred to as nutation.

<span class="mw-page-title-main">Polar coordinate system</span> Coordinates comprising a distance and an angle

In mathematics, the polar coordinate system is a two-dimensional coordinate system in which each point on a plane is determined by a distance from a reference point and an angle from a reference direction. The reference point is called the pole, and the ray from the pole in the reference direction is the polar axis. The distance from the pole is called the radial coordinate, radial distance or simply radius, and the angle is called the angular coordinate, polar angle, or azimuth. Angles in polar notation are generally expressed in either degrees or radians.

<span class="mw-page-title-main">Special relativity</span> Theory of interwoven space and time by Albert Einstein

In physics, the special theory of relativity, or special relativity for short, is a scientific theory of the relationship between space and time. In Albert Einstein's 1905 treatment, the theory is presented as being based on just two postulates:

  1. The laws of physics are invariant (identical) in all inertial frames of reference.
  2. The speed of light in vacuum is the same for all observers, regardless of the motion of light source or observer.
<span class="mw-page-title-main">Spherical coordinate system</span> Coordinates comprising a distance and two angles

In mathematics, a spherical coordinate system is a coordinate system for three-dimensional space where the position of a given point in space is specified by three numbers, : the radial distance of the radial liner connecting the point to the fixed point of origin ; the polar angle θ of the radial line r; and the azimuthal angle φ of the radial line r.

<span class="mw-page-title-main">Snell's law</span> Formula for refraction angles

Snell's law is a formula used to describe the relationship between the angles of incidence and refraction, when referring to light or other waves passing through a boundary between two different isotropic media, such as water, glass, or air. In optics, the law is used in ray tracing to compute the angles of incidence or refraction, and in experimental optics to find the refractive index of a material. The law is also satisfied in meta-materials, which allow light to be bent "backward" at a negative angle of refraction with a negative refractive index.

<span class="mw-page-title-main">Angular velocity</span> Direction and rate of rotation

In physics, angular velocity, also known as angular frequency vector, is a pseudovector representation of how the angular position or orientation of an object changes with time, i.e. how quickly an object rotates around an axis of rotation and how fast the axis itself changes direction.

<span class="mw-page-title-main">Trajectory</span> Path of a moving object

A trajectory or flight path is the path that an object with mass in motion follows through space as a function of time. In classical mechanics, a trajectory is defined by Hamiltonian mechanics via canonical coordinates; hence, a complete trajectory is defined by position and momentum, simultaneously.

<span class="mw-page-title-main">Superluminal motion</span> Apparent faster-than-light motion of distant astronomical objects

In astronomy, superluminal motion is the apparently faster-than-light motion seen in some radio galaxies, BL Lac objects, quasars, blazars and recently also in some galactic sources called microquasars. Bursts of energy moving out along the relativistic jets emitted from these objects can have a proper motion that appears greater than the speed of light. All of these sources are thought to contain a black hole, responsible for the ejection of mass at high velocities. Light echoes can also produce apparent superluminal motion.

<span class="mw-page-title-main">Relativistic Doppler effect</span> Scientific phenomenon

The relativistic Doppler effect is the change in frequency, wavelength and amplitude of light, caused by the relative motion of the source and the observer, when taking into account effects described by the special theory of relativity.

A nonholonomic system in physics and mathematics is a physical system whose state depends on the path taken in order to achieve it. Such a system is described by a set of parameters subject to differential constraints and non-linear constraints, such that when the system evolves along a path in its parameter space but finally returns to the original set of parameter values at the start of the path, the system itself may not have returned to its original state. Nonholonomic mechanics is autonomous division of Newtonian mechanics.

<span class="mw-page-title-main">Projectile motion</span> Motion of launched objects due to gravity

Projectile motion is a form of motion experienced by an object or particle that is projected in a gravitational field, such as from Earth's surface, and moves along a curved path under the action of gravity only. In the particular case of projectile motion on Earth, most calculations assume the effects of air resistance are passive and negligible. The curved path of objects in projectile motion was shown by Galileo to be a parabola, but may also be a straight line in the special case when it is thrown directly upward or downward. The study of such motions is called ballistics, and such a trajectory is a ballistic trajectory. The only force of mathematical significance that is actively exerted on the object is gravity, which acts downward, thus imparting to the object a downward acceleration towards the Earth’s center of mass. Because of the object's inertia, no external force is needed to maintain the horizontal velocity component of the object's motion. Taking other forces into account, such as aerodynamic drag or internal propulsion, requires additional analysis. A ballistic missile is a missile only guided during the relatively brief initial powered phase of flight, and whose remaining course is governed by the laws of classical mechanics.

<span class="mw-page-title-main">Velocity-addition formula</span> Equation used in relativistic physics

In relativistic physics, a velocity-addition formula is an equation that specifies how to combine the velocities of objects in a way that is consistent with the requirement that no object's speed can exceed the speed of light. Such formulas apply to successive Lorentz transformations, so they also relate different frames. Accompanying velocity addition is a kinematic effect known as Thomas precession, whereby successive non-collinear Lorentz boosts become equivalent to the composition of a rotation of the coordinate system and a boost.

In the 19th century, the theory of the luminiferous aether as the hypothetical medium for the propagation of light waves was widely discussed. The aether hypothesis arose because physicists of that era could not conceive of light waves propagating without a physical medium in which to do so. When experiments failed to detect the hypothesized luminiferous aether, physicists conceived explanations for the experiments' failure which preserved the hypothetical aether's existence.

In physics, relativistic aberration is the relativistic version of aberration of light, including relativistic corrections that become significant for observers who move with velocities close to the speed of light. It is described by Einstein's special theory of relativity.

A theoretical motivation for general relativity, including the motivation for the geodesic equation and the Einstein field equation, can be obtained from special relativity by examining the dynamics of particles in circular orbits about the Earth. A key advantage in examining circular orbits is that it is possible to know the solution of the Einstein Field Equation a priori. This provides a means to inform and verify the formalism.

The history of Lorentz transformations comprises the development of linear transformations forming the Lorentz group or Poincaré group preserving the Lorentz interval and the Minkowski inner product .

<span class="mw-page-title-main">Proper acceleration</span> Physical acceleration experienced by an object

In relativity theory, proper acceleration is the physical acceleration experienced by an object. It is thus acceleration relative to a free-fall, or inertial, observer who is momentarily at rest relative to the object being measured. Gravitation therefore does not cause proper acceleration, because the same gravity acts equally on the inertial observer. As a consequence, all inertial observers always have a proper acceleration of zero.

<span class="mw-page-title-main">Kepler orbit</span> Celestial orbit whose trajectory is a conic section in the orbital plane

In celestial mechanics, a Kepler orbit is the motion of one body relative to another, as an ellipse, parabola, or hyperbola, which forms a two-dimensional orbital plane in three-dimensional space. A Kepler orbit can also form a straight line. It considers only the point-like gravitational attraction of two bodies, neglecting perturbations due to gravitational interactions with other objects, atmospheric drag, solar radiation pressure, a non-spherical central body, and so on. It is thus said to be a solution of a special case of the two-body problem, known as the Kepler problem. As a theory in classical mechanics, it also does not take into account the effects of general relativity. Keplerian orbits can be parametrized into six orbital elements in various ways.

In fluid dynamics, the Oseen equations describe the flow of a viscous and incompressible fluid at small Reynolds numbers, as formulated by Carl Wilhelm Oseen in 1910. Oseen flow is an improved description of these flows, as compared to Stokes flow, with the (partial) inclusion of convective acceleration.

References

  1. 1 2 3 4 Schaffner, Kenneth F. (1972). Nineteenth-century aether theories. Oxford: Pergamon Press. pp. 99–117 und 255–273. ISBN   0-08-015674-6.
  2. Williams, M. E. W. (1979). "Flamsteed's Alleged Measurement of Annual Parallax for the Pole Star". Journal for the History of Astronomy. 10 (2): 102–116. Bibcode:1979JHA....10..102W. doi:10.1177/002182867901000203. S2CID   118565124.
  3. 1 2 Bradley, James (1727–1728). "A Letter from the Reverend Mr. James Bradley Savilian Professor of Astronomy at Oxford, and F.R.S. to Dr.Edmond Halley Astronom. Reg. &c. Giving an Account of a New Discovered Motion of the Fix'd Stars". Phil. Trans. R. Soc. 35 (406): 637–661. Bibcode:1727RSPT...35..637B. doi: 10.1098/rstl.1727.0064 .
  4. 1 2 3 Hirschfeld, Alan (2001). Parallax:The Race to Measure the Cosmos. New York, New York: Henry Holt. ISBN   0-8050-7133-4.
  5. 1 2 Norton, John D. (2004). "Einstein's Investigations of Galilean Covariant Electrodynamics prior to 1905". Archive for History of Exact Sciences. 59 (1): 45–105. Bibcode:2004AHES...59...45N. doi:10.1007/s00407-004-0085-6. S2CID   17459755. Archived from the original on 2009-01-11.
  6. Richard A. Mould (2001). Basic Relativity (2nd ed.). Springer. p. 8. ISBN   0-387-95210-1.
  7. In fact, the light source doesn't need to be stationary, consider for example eclipsing binary stars: they are rotating with high speed —and ever changing and different velocity vectors— around each other, but they appear as one spot all the time.
  8. U.S. Nautical Almanac Office (21 March 2014). "Glossary". Astronomical Almanac for the Year 2015 and Its Companion, The Astronomical Almanac Online. Washington, DC: U.S. Government Printing Office (published 2014). p. M1. ISBN   9780707741499.
  9. 1 2 Kovalevsky, Jean & Seidelmann, P. Kenneth (2004). Fundamentals of Astrometry. Cambridge: Cambridge University Press. ISBN   0-521-64216-7.
  10. Newcomb, Simon (1960). A Compendium of Spherical Astronomy. Macmillan, 1906 – republished by Dover.
  11. 1 2 3 Charlot, P.; Jacobs, C. S.; Gordon, D.; Lambert, S.; et al. (2020). "The third realization of the International Celestial Reference Frame by very long baseline interferometry". Astronomy and Astrophysics. 644: A159. arXiv: 2010.13625 . Bibcode:2020A&A...644A.159C. doi:10.1051/0004-6361/202038368. S2CID   225068756.
  12. 1 2 3 4 5 MacMillan, D. S.; Fey, A.; Gipson, J. M.; et al. (2019). "Galactocentric acceleration in VLBI analysis". Astronomy and Astrophysics. 630: A93. Bibcode:2019A&A...630A..93M. doi:10.1051/0004-6361/201935379. S2CID   198471325.
  13. Hagihara, Yusuke (1933). "On the Theory of Secular Aberration". Proceedings of the Physico-Mathematical Society of Japan . 3rd Series. 15 (3–6): 175. doi:10.11429/ppmsj1919.15.3-6_155. the correction of star places with secular aberration is not at all necessary and is even inconvenient, so long as the solar motion remains uniform and rectilinear.
  14. Kovalevsky, J. (2003). "Aberration in proper motions". Astronomy and Astrophysics. 404 (2): 743–747. Bibcode:2003A&A...404..743K. doi: 10.1051/0004-6361:20030560 .
  15. Kopeikin, S.; Makarov, V. (2006). "Astrometric effects of secular aberration". The Astronomical Journal (USA). 131 (3): 1471–1478. arXiv: astro-ph/0508505 . Bibcode:2006AJ....131.1471K. doi: 10.1086/500170 .
  16. 1 2 3 Titov, O.; Lambert, S. B.; Gontier, A.-M. (2011). "VLBI measurement of the secular aberration drift". Astronomy and Astrophysics. 529: A91. arXiv: 1009.3698 . Bibcode:2011A&A...529A..91T. doi:10.1051/0004-6361/201015718. S2CID   119305429.
  17. "Gaia's measurement of the solar system acceleration with respect to the distant universe". esa.int. European Space Agency. 3 December 2020. Retrieved 14 September 2022.
  18. Gaia Collaboration; Klioner, S. A.; et al. (2021). "Gaia Early Data Release 3: Acceleration of the Solar System from Gaia astrometry". Astronomy & Astrophysics. 649: A9. arXiv: 2012.02036 . Bibcode:2021A&A...649A...9G. doi:10.1051/0004-6361/202039734.
  19. 1 2 3 4 Eppenstein (1911), p. 54.
  20. Bradley, James; Rigaud, Stephen Peter (1832). Miscellaneous works and correspondence of the Rev. James Bradley, D.D., F.R.S. Oxford: University Press. p. 11.
  21. 1 2 Eppenstein (1911), p. 55.
  22. 1 2 Berry, Arthur (1961) [1898]. A Short History of Astronomy . Dover. ISBN   9780486202105.
  23. Hoiberg, Dale H., ed. (2010). "aberration, constant of" . Encyclopædia Britannica. Vol. I: A-ak Bayes (15th ed.). Chicago, IL: Encyclopædia Britannica Inc. pp.  30. ISBN   978-1-59339-837-8.
  24. 1 2 James Bradley (1729). "An account of a new discovered motion of the fixed stars". Philosophical Transactions of the Royal Society. 35: 637–661. doi: 10.1098/rstl.1727.0064 .
  25. Sakellariadis, Spyros (1982). "Descartes' Experimental Proof of the Infinite Velocity of Light and Huygens' Rejoinder". Archive for History of Exact Sciences . 26 (1): 1–12. doi:10.1007/BF00348308. ISSN   0003-9519. JSTOR   41133639. S2CID   118187860.
  26. Encyclopædia Britannica Archived 2013-11-11 at the Wayback Machine
  27. 1 2 3 4 Whittaker, Edmund Taylor (1910). A History of the theories of aether and electricity (1. ed.). Dublin: Longman, Green and Co. Archived from the original on 2016-02-15.
    Whittaker, Edmund Taylor (1953). A History of the Theories of Aether and Electricity (2. ed.). T. Nelson.
  28. Janssen, Michel & Stachel, John (2010). "The Optics and Electrodynamics of Moving Bodies" (PDF). In John Stachel (ed.). Going Critical. Springer. ISBN   978-1-4020-1308-9. Archived (PDF) from the original on 2022-10-09.
  29. Darrigol, Olivier (2000). Electrodynamics from Ampére to Einstein . Oxford: Clarendon Press. ISBN   0-19-850594-9.
  30. Shankland, R. S. (1963). "Conversations with Albert Einstein". American Journal of Physics. 31 (1): 47–57. Bibcode:1963AmJPh..31...47S. doi:10.1119/1.1969236.

Further reading