Part of a series of articles about |
Gravitational lensing |
---|
|
Einstein ring Formalism Strong lensing Microlensing Weak lensing |
In general relativity, a point mass deflects a light ray with impact parameter by an angle approximately equal to
where G is the gravitational constant, M the mass of the deflecting object and c the speed of light. A naive application of Newtonian gravity can yield exactly half this value, where the light ray is assumed as a massed particle and scattered by the gravitational potential well. This approximation is good when is small.
In situations where general relativity can be approximated by linearized gravity, the deflection due to a spatially extended mass can be written simply as a vector sum over point masses. In the continuum limit, this becomes an integral over the density , and if the deflection is small we can approximate the gravitational potential along the deflected trajectory by the potential along the undeflected trajectory, as in the Born approximation in quantum mechanics. The deflection is then
where is the line-of-sight coordinate, and is the vector impact parameter of the actual ray path from the infinitesimal mass located at the coordinates . [1]
In the limit of a "thin lens", where the distances between the source, lens, and observer are much larger than the size of the lens (this is almost always true for astronomical objects), we can define the projected mass density
where is a vector in the plane of the sky. The deflection angle is then
As shown in the diagram on the right, the difference between the unlensed angular position and the observed position is this deflection angle, reduced by a ratio of distances, described as the lens equation
where is the distance from the lens to the source, is the distance from the observer to the source, and is the distance from the observer to the lens. For extragalactic lenses, these must be angular diameter distances.
In strong gravitational lensing, this equation can have multiple solutions, because a single source at can be lensed into multiple images.
The reduced deflection angle can be written as
where we define the convergence
and the critical surface density (not to be confused with the critical density of the universe)
We can also define the deflection potential
such that the scaled deflection angle is just the gradient of the potential and the convergence is half the Laplacian of the potential:
The deflection potential can also be written as a scaled projection of the Newtonian gravitational potential of the lens [2]
The Jacobian between the unlensed and lensed coordinate systems is
where is the Kronecker delta. Because the matrix of second derivatives must be symmetric, the Jacobian can be decomposed into a diagonal term involving the convergence and a trace-free term involving the shear
where is the angle between and the x-axis. The term involving the convergence magnifies the image by increasing its size while conserving surface brightness. The term involving the shear stretches the image tangentially around the lens, as discussed in weak lensing observables.
The shear defined here is not equivalent to the shear traditionally defined in mathematics, though both stretch an image non-uniformly.
There is an alternative way of deriving the lens equation, starting from the photon arrival time (Fermat surface)
where is the time to travel an infinitesimal line element along the source-observer straight line in vacuum, which is then corrected by the factor
to get the line element along the bended path with a varying small pitch angle and the refraction index n for the "aether", i.e., the gravitational field. The last can be obtained from the fact that a photon travels on a null geodesic of a weakly perturbed static Minkowski universe
where the uneven gravitational potential drives a changing the speed of light
So the refraction index
The refraction index greater than unity because of the negative gravitational potential .
Put these together and keep the leading terms we have the time arrival surface
The first term is the straight path travel time, the second term is the extra geometric path, and the third is the gravitational delay. Make the triangle approximation that for the path between the observer and the lens, and for the path between the lens and the source. The geometric delay term becomes
(How? There is no on the left. Angular diameter distances don't add in a simple way, in general.) So the Fermat surface becomes
where is so-called dimensionless time delay, and the 2D lensing potential
The images lie at the extrema of this surface, so the variation of with is zero,
which is the lens equation. Take the Poisson's equation for 3D potential
and we find the 2D lensing potential
Here we assumed the lens is a collection of point masses at angular coordinates and distances Use for very small x we find
One can compute the convergence by applying the 2D Laplacian of the 2D lensing potential
in agreement with earlier definition as the ratio of projected density with the critical density. Here we used and
We can also confirm the previously defined reduced deflection angle
where is the so-called Einstein angular radius of a point lens . For a single point lens at the origin we recover the standard result that there will be two images at the two solutions of the essentially quadratic equation
The amplification matrix can be obtained by double derivatives of the dimensionless time delay
where we have define the derivatives
which takes the meaning of convergence and shear. The amplification is the inverse of the Jacobian
where a positive means either a maxima or a minima, and a negative means a saddle point in the arrival surface.
For a single point lens, one can show (albeit a lengthy calculation) that
So the amplification of a point lens is given by
Note A diverges for images at the Einstein radius
In cases there are multiple point lenses plus a smooth background of (dark) particles of surface density the time arrival surface is
To compute the amplification, e.g., at the origin (0,0), due to identical point masses distributed at we have to add up the total shear, and include a convergence of the smooth background,
This generally creates a network of critical curves, lines connecting image points of infinite amplification.
In weak lensing by large-scale structure, the thin-lens approximation may break down, and low-density extended structures may not be well approximated by multiple thin-lens planes. In this case, the deflection can be derived by instead assuming that the gravitational potential is slowly varying everywhere (for this reason, this approximation is not valid for strong lensing). This approach assumes the universe is well described by a Newtonian-perturbed FRW metric, but it makes no other assumptions about the distribution of the lensing mass.
As in the thin-lens case, the effect can be written as a mapping from the unlensed angular position to the lensed position . The Jacobian of the transform can be written as an integral over the gravitational potential along the line of sight [3]
where is the comoving distance, are the transverse distances, and
is the lensing kernel, which defines the efficiency of lensing for a distribution of sources .
The Jacobian can be decomposed into convergence and shear terms just as with the thin-lens case, and in the limit of a lens that is both thin and weak, their physical interpretations are the same.
In weak gravitational lensing, the Jacobian is mapped out by observing the effect of the shear on the ellipticities of background galaxies. This effect is purely statistical; the shape of any galaxy will be dominated by its random, unlensed shape, but lensing will produce a spatially coherent distortion of these shapes.
In most fields of astronomy, the ellipticity is defined as , where is the axis ratio of the ellipse. In weak gravitational lensing, two different definitions are commonly used, and both are complex quantities which specify both the axis ratio and the position angle :
Like the traditional ellipticity, the magnitudes of both of these quantities range from 0 (circular) to 1 (a line segment). The position angle is encoded in the complex phase, but because of the factor of 2 in the trigonometric arguments, ellipticity is invariant under a rotation of 180 degrees. This is to be expected; an ellipse is unchanged by a 180° rotation. Taken as imaginary and real parts, the real part of the complex ellipticity describes the elongation along the coordinate axes, while the imaginary part describes the elongation at 45° from the axes.
The ellipticity is often written as a two-component vector instead of a complex number, though it is not a true vector with regard to transforms:
Real astronomical background sources are not perfect ellipses. Their ellipticities can be measured by finding a best-fit elliptical model to the data, or by measuring the second moments of the image about some centroid
The complex ellipticities are then
This can be used to relate the second moments to traditional ellipse parameters:
and in reverse:
The unweighted second moments above are problematic in the presence of noise, neighboring objects, or extended galaxy profiles, so it is typical to use apodized moments instead:
Here is a weight function that typically goes to zero or quickly approaches zero at some finite radius.
Image moments cannot generally be used to measure the ellipticity of galaxies without correcting for observational effects, particularly the point spread function. [4]
Recall that the lensing Jacobian can be decomposed into shear and convergence . Acting on a circular background source with radius , lensing generates an ellipse with major and minor axes
as long as the shear and convergence do not change appreciably over the size of the source (in that case, the lensed image is not an ellipse). Galaxies are not intrinsically circular, however, so it is necessary to quantify the effect of lensing on a non-zero ellipticity.
We can define the complex shear in analogy to the complex ellipticities defined above
as well as the reduced shear
The lensing Jacobian can now be written as
For a reduced shear and unlensed complex ellipticities and , the lensed ellipticities are
In the weak lensing limit, and , so
If we can assume that the sources are randomly oriented, their complex ellipticities average to zero, so
This is the principal equation of weak lensing: the average ellipticity of background galaxies is a direct measure of the shear induced by foreground mass.
While gravitational lensing preserves surface brightness, as dictated by Liouville's theorem, lensing does change the apparent solid angle of a source. The amount of magnification is given by the ratio of the image area to the source area. For a circularly symmetric lens, the magnification factor μ is given by
In terms of convergence and shear
For this reason, the Jacobian is also known as the "inverse magnification matrix".
The reduced shear is invariant with the scaling of the Jacobian by a scalar , which is equivalent to the transformations
and
Thus, can only be determined up to a transformation , which is known as the "mass sheet degeneracy." In principle, this degeneracy can be broken if an independent measurement of the magnification is available because the magnification is not invariant under the aforementioned degeneracy transformation. Specifically, scales with as .
In fluid dynamics, potential flow or irrotational flow refers to a description of a fluid flow with no vorticity in it. Such a description typically arises in the limit of vanishing viscosity, i.e., for an inviscid fluid and with no vorticity present in the flow.
In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space under the operation of composition.
In physics, the Hamilton–Jacobi equation, named after William Rowan Hamilton and Carl Gustav Jacob Jacobi, is an alternative formulation of classical mechanics, equivalent to other formulations such as Newton's laws of motion, Lagrangian mechanics and Hamiltonian mechanics.
In mathematics and physics, the Christoffel symbols are an array of numbers describing a metric connection. The metric connection is a specialization of the affine connection to surfaces or other manifolds endowed with a metric, allowing distances to be measured on that surface. In differential geometry, an affine connection can be defined without reference to a metric, and many additional concepts follow: parallel transport, covariant derivatives, geodesics, etc. also do not require the concept of a metric. However, when a metric is available, these concepts can be directly tied to the "shape" of the manifold itself; that shape is determined by how the tangent space is attached to the cotangent space by the metric tensor. Abstractly, one would say that the manifold has an associated (orthonormal) frame bundle, with each "frame" being a possible choice of a coordinate frame. An invariant metric implies that the structure group of the frame bundle is the orthogonal group O(p, q). As a result, such a manifold is necessarily a (pseudo-)Riemannian manifold. The Christoffel symbols provide a concrete representation of the connection of (pseudo-)Riemannian geometry in terms of coordinates on the manifold. Additional concepts, such as parallel transport, geodesics, etc. can then be expressed in terms of Christoffel symbols.
In probability and statistics, a circular distribution or polar distribution is a probability distribution of a random variable whose values are angles, usually taken to be in the range [0, 2π). A circular distribution is often a continuous probability distribution, and hence has a probability density, but such distributions can also be discrete, in which case they are called circular lattice distributions. Circular distributions can be used even when the variables concerned are not explicitly angles: the main consideration is that there is not usually any real distinction between events occurring at the opposite ends of the range, and the division of the range could notionally be made at any point.
In probability theory and directional statistics, the von Mises distribution is a continuous probability distribution on the circle. It is a close approximation to the wrapped normal distribution, which is the circular analogue of the normal distribution. A freely diffusing angle on a circle is a wrapped normally distributed random variable with an unwrapped variance that grows linearly in time. On the other hand, the von Mises distribution is the stationary distribution of a drift and diffusion process on the circle in a harmonic potential, i.e. with a preferred orientation. The von Mises distribution is the maximum entropy distribution for circular data when the real and imaginary parts of the first circular moment are specified. The von Mises distribution is a special case of the von Mises–Fisher distribution on the N-dimensional sphere.
In directional statistics, the von Mises–Fisher distribution, is a probability distribution on the -sphere in . If the distribution reduces to the von Mises distribution on the circle.
The Newman–Penrose (NP) formalism is a set of notation developed by Ezra T. Newman and Roger Penrose for general relativity (GR). Their notation is an effort to treat general relativity in terms of spinor notation, which introduces complex forms of the usual variables used in GR. The NP formalism is itself a special case of the tetrad formalism, where the tensors of the theory are projected onto a complete vector basis at each point in spacetime. Usually this vector basis is chosen to reflect some symmetry of the spacetime, leading to simplified expressions for physical observables. In the case of the NP formalism, the vector basis chosen is a null tetrad: a set of four null vectors—two real, and a complex-conjugate pair. The two real members often asymptotically point radially inward and radially outward, and the formalism is well adapted to treatment of the propagation of radiation in curved spacetime. The Weyl scalars, derived from the Weyl tensor, are often used. In particular, it can be shown that one of these scalars— in the appropriate frame—encodes the outgoing gravitational radiation of an asymptotically flat system.
The Debye–Hückel theory was proposed by Peter Debye and Erich Hückel as a theoretical explanation for departures from ideality in solutions of electrolytes and plasmas. It is a linearized Poisson–Boltzmann model, which assumes an extremely simplified model of electrolyte solution but nevertheless gave accurate predictions of mean activity coefficients for ions in dilute solution. The Debye–Hückel equation provides a starting point for modern treatments of non-ideality of electrolyte solutions.
In classical mechanics, a Liouville dynamical system is an exactly solvable dynamical system in which the kinetic energy T and potential energy V can be expressed in terms of the s generalized coordinates q as follows:
In the Standard Model, using quantum field theory it is conventional to use the helicity basis to simplify calculations. In this basis, the spin is quantized along the axis in the direction of motion of the particle.
Bilinear time–frequency distributions, or quadratic time–frequency distributions, arise in a sub-field of signal analysis and signal processing called time–frequency signal processing, and, in the statistical analysis of time series data. Such methods are used where one needs to deal with a situation where the frequency composition of a signal may be changing over time; this sub-field used to be called time–frequency signal analysis, and is now more often called time–frequency signal processing due to the progress in using these methods to a wide range of signal-processing problems.
While the presence of any mass bends the path of light passing near it, this effect rarely produces the giant arcs and multiple images associated with strong gravitational lensing. Most lines of sight in the universe are thoroughly in the weak lensing regime, in which the deflection is impossible to detect in a single background source. However, even in these cases, the presence of the foreground mass can be detected, by way of a systematic alignment of background sources around the lensing mass. Weak gravitational lensing is thus an intrinsically statistical measurement, but it provides a way to measure the masses of astronomical objects without requiring assumptions about their composition or dynamical state.
The derivatives of scalars, vectors, and second-order tensors with respect to second-order tensors are of considerable use in continuum mechanics. These derivatives are used in the theories of nonlinear elasticity and plasticity, particularly in the design of algorithms for numerical simulations.
In fluid dynamics, the mild-slope equation describes the combined effects of diffraction and refraction for water waves propagating over bathymetry and due to lateral boundaries—like breakwaters and coastlines. It is an approximate model, deriving its name from being originally developed for wave propagation over mild slopes of the sea floor. The mild-slope equation is often used in coastal engineering to compute the wave-field changes near harbours and coasts.
Calculations in the Newman–Penrose (NP) formalism of general relativity normally begin with the construction of a complex null tetrad, where is a pair of real null vectors and is a pair of complex null vectors. These tetrad vectors respect the following normalization and metric conditions assuming the spacetime signature
Vasiliev equations are formally consistent gauge invariant nonlinear equations whose linearization over a specific vacuum solution describes free massless higher-spin fields on anti-de Sitter space. The Vasiliev equations are classical equations and no Lagrangian is known that starts from canonical two-derivative Frønsdal Lagrangian and is completed by interactions terms. There is a number of variations of Vasiliev equations that work in three, four and arbitrary number of space-time dimensions. Vasiliev's equations admit supersymmetric extensions with any number of super-symmetries and allow for Yang–Mills gaugings. Vasiliev's equations are background independent, the simplest exact solution being anti-de Sitter space. It is important to note that locality is not properly implemented and the equations give a solution of certain formal deformation procedure, which is difficult to map to field theory language. The higher-spin AdS/CFT correspondence is reviewed in Higher-spin theory article.
A proper reference frame in the theory of relativity is a particular form of accelerated reference frame, that is, a reference frame in which an accelerated observer can be considered as being at rest. It can describe phenomena in curved spacetime, as well as in "flat" Minkowski spacetime in which the spacetime curvature caused by the energy–momentum tensor can be disregarded. Since this article considers only flat spacetime—and uses the definition that special relativity is the theory of flat spacetime while general relativity is a theory of gravitation in terms of curved spacetime—it is consequently concerned with accelerated frames in special relativity.
The Pomeranchuk instability is an instability in the shape of the Fermi surface of a material with interacting fermions, causing Landau’s Fermi liquid theory to break down. It occurs when a Landau parameter in Fermi liquid theory has a sufficiently negative value, causing deformations of the Fermi surface to be energetically favourable. It is named after the Soviet physicist Isaak Pomeranchuk.
Taylor–Maccoll flow refers to the steady flow behind a conical shock wave that is attached to a solid cone. The flow is named after G. I. Taylor and J. W. Maccoll, whom described the flow in 1933, guided by an earlier work of Theodore von Kármán.