Christoffel symbols

Last updated

In mathematics and physics, the Christoffel symbols are an array of numbers describing a metric connection. [1] The metric connection is a specialization of the affine connection to surfaces or other manifolds endowed with a metric, allowing distances to be measured on that surface. In differential geometry, an affine connection can be defined without reference to a metric, and many additional concepts follow: parallel transport, covariant derivatives, geodesics, etc. also do not require the concept of a metric. [2] [3] However, when a metric is available, these concepts can be directly tied to the "shape" of the manifold itself; that shape is determined by how the tangent space is attached to the cotangent space by the metric tensor. [4] Abstractly, one would say that the manifold has an associated (orthonormal) frame bundle, with each "frame" being a possible choice of a coordinate frame. An invariant metric implies that the structure group of the frame bundle is the orthogonal group O(p, q). As a result, such a manifold is necessarily a (pseudo-)Riemannian manifold. [5] [6] The Christoffel symbols provide a concrete representation of the connection of (pseudo-)Riemannian geometry in terms of coordinates on the manifold. Additional concepts, such as parallel transport, geodesics, etc. can then be expressed in terms of Christoffel symbols.

Contents

In general, there are an infinite number of metric connections for a given metric tensor; however, there is a unique connection that is free of torsion, the Levi-Civita connection. It is common in physics and general relativity to work almost exclusively with the Levi-Civita connection, by working in coordinate frames (called holonomic coordinates) where the torsion vanishes. For example, in Euclidean spaces, the Christoffel symbols describe how the local coordinate bases change from point to point.

At each point of the underlying n-dimensional manifold, for any local coordinate system around that point, the Christoffel symbols are denoted Γijk for i, j, k = 1, 2, ..., n. Each entry of this n × n × n array is a real number. Under linear coordinate transformations on the manifold, the Christoffel symbols transform like the components of a tensor, but under general coordinate transformations (diffeomorphisms) they do not. Most of the algebraic properties of the Christoffel symbols follow from their relationship to the affine connection; only a few follow from the fact that the structure group is the orthogonal group O(m, n) (or the Lorentz group O(3, 1) for general relativity).

Christoffel symbols are used for performing practical calculations. For example, the Riemann curvature tensor can be expressed entirely in terms of the Christoffel symbols and their first partial derivatives. In general relativity, the connection plays the role of the gravitational force field with the corresponding gravitational potential being the metric tensor. When the coordinate system and the metric tensor share some symmetry, many of the Γijk are zero.

The Christoffel symbols are named for Elwin Bruno Christoffel (1829–1900). [7]

Note

The definitions given below are valid for both Riemannian manifolds and pseudo-Riemannian manifolds, such as those of general relativity, with careful distinction being made between upper and lower indices (contra-variant and co-variant indices). The formulas hold for either sign convention, unless otherwise noted.

Einstein summation convention is used in this article, with vectors indicated by bold font. The connection coefficients of the Levi-Civita connection (or pseudo-Riemannian connection) expressed in a coordinate basis are called Christoffel symbols.

Preliminary definitions

Given a manifold , an atlas consists of a collection of charts for each open cover . Such charts allow the standard vector basis on to be pulled back to a vector basis on the tangent space of . This is done as follows. Given some arbitrary real function , the chart allows a gradient to be defined:

This gradient is commonly called a pullback because it "pulls back" the gradient on to a gradient on . The pullback is independent of the chart . In this way, the standard vector basis on pulls back to a standard ("coordinate") vector basis on . This is called the "coordinate basis", because it explicitly depends on the coordinates on . It is sometimes called the "local basis".

This definition allows a common abuse of notation. The were defined to be in one-to-one correspondence with the basis vectors on . The notation serves as a reminder that the basis vectors on the tangent space came from a gradient construction. Despite this, it is common to "forget" this construction, and just write (or rather, define) vectors on such that . The full range of commonly used notation includes the use of arrows and boldface to denote vectors:

where is used as a reminder that these are defined to be equivalent notation for the same concept. The choice of notation is according to style and taste, and varies from text to text.

The coordinate basis provides a vector basis for vector fields on . Commonly used notation for vector fields on include

The upper-case , without the vector-arrow, is particularly popular for index-free notation, because it both minimizes clutter and reminds that results are independent of the chosen basis, and, in this case, independent of the atlas.

The same abuse of notation is used to push forward one-forms from to . This is done by writing or or . The one-form is then . This is soldered to the basis vectors as . Note the careful use of upper and lower indexes, to distinguish contravarient and covariant vectors.

The pullback induces (defines) a metric tensor on . Several styles of notation are commonly used:

where both the centerdot and the angle-bracket denote the scalar product. The last form uses the tensor , which is understood to be the "flat-space" metric tensor. For Riemannian manifolds, it is the Kronecker delta . For pseudo-Riemannian manifolds, it is the diagonal matrix having signature . The notation serves as a reminder that pullback really is a linear transform, given as the gradient, above. The index letters live in while the index letters live in the tangent manifold.

The matrix inverse of the metric tensor is given by

This is used to define the dual basis:

Some texts write for , so that the metric tensor takes the particularly beguiling form . This is commonly done so that the symbol can be used unambiguously for the vierbein.

Definition in Euclidean space

In Euclidean space, the general definition given below for the Christoffel symbols of the second kind can be proven to be equivalent to:

Christoffel symbols of the first kind can then be found via index lowering:

Rearranging, we see that (assuming the partial derivative belongs to the tangent space, which cannot occur on a non-Euclidean curved space):

In words, the arrays represented by the Christoffel symbols track how the basis changes from point to point. If the derivative does not lie on the tangent space, the right expression is the projection of the derivative over the tangent space (see covariant derivative below). Symbols of the second kind decompose the change with respect to the basis, while symbols of the first kind decompose it with respect to the dual basis. In this form, it is easy to see the symmetry of the lower or last two indices:

and

from the definition of and the fact that partial derivatives commute (as long as the manifold and coordinate system are well behaved).

The same numerical values for Christoffel symbols of the second kind also relate to derivatives of the dual basis, as seen in the expression:

which we can rearrange as:

General definition

The Christoffel symbols come in two forms: the first kind, and the second kind. The definition of the second kind is more basic, and thus is presented first.

Christoffel symbols of the second kind (symmetric definition)

The Christoffel symbols of the second kind are the connection coefficients—in a coordinate basis—of the Levi-Civita connection. In other words, the Christoffel symbols of the second kind [8] [9] Γkij (sometimes Γk
ij
or {k
ij
}
) [7] [8] are defined as the unique coefficients such that

where i is the Levi-Civita connection on M taken in the coordinate direction ei (i.e., i ≡ ∇ei) and where ei = ∂i is a local coordinate (holonomic) basis. Since this connection has zero torsion, and holonomic vector fields commute (i.e. ) we have

Hence in this basis the connection coefficients are symmetric: [8]

For this reason, a torsion-free connection is often called symmetric.

The Christoffel symbols can be derived from the vanishing of the covariant derivative of the metric tensor gik:

As a shorthand notation, the nabla symbol and the partial derivative symbols are frequently dropped, and instead a semicolon and a comma are used to set off the index that is being used for the derivative. Thus, the above is sometimes written as

Using that the symbols are symmetric in the lower two indices, one can solve explicitly for the Christoffel symbols as a function of the metric tensor by permuting the indices and resumming: [10]

where (gjk) is the inverse of the matrix (gjk), defined as (using the Kronecker delta, and Einstein notation for summation) gjigik = δjk. Although the Christoffel symbols are written in the same notation as tensors with index notation, they do not transform like tensors under a change of coordinates.

Contraction of indices

Contracting the upper index with either of the lower indices (those being symmetric) leads to

where is the determinant of the metric tensor. This identity can be used to evaluate divergence of vectors.

Christoffel symbols of the first kind

The Christoffel symbols of the first kind can be derived either from the Christoffel symbols of the second kind and the metric, [11]

or from the metric alone, [11]

As an alternative notation one also finds [7] [12] [13]

It is worth noting that [ab, c] = [ba, c]. [10]

Connection coefficients in a nonholonomic basis

The Christoffel symbols are most typically defined in a coordinate basis, which is the convention followed here. In other words, the name Christoffel symbols is reserved only for coordinate (i.e., holonomic) frames. However, the connection coefficients can also be defined in an arbitrary (i.e., nonholonomic) basis of tangent vectors ui by

Explicitly, in terms of the metric tensor, this is [9]

where cklm = gmpcklp are the commutation coefficients of the basis; that is,

where uk are the basis vectors and [ , ] is the Lie bracket. The standard unit vectors in spherical and cylindrical coordinates furnish an example of a basis with non-vanishing commutation coefficients. The difference between the connection in such a frame, and the Levi-Civita connection is known as the contorsion tensor.

Ricci rotation coefficients (asymmetric definition)

When we choose the basis Xiui orthonormal: gabηab = ⟨Xa, Xb then gmk,lηmk,l = 0. This implies that

and the connection coefficients become antisymmetric in the first two indices:

where

In this case, the connection coefficients ωabc are called the Ricci rotation coefficients. [14] [15]

Equivalently, one can define Ricci rotation coefficients as follows: [9]

where ui is an orthonormal nonholonomic basis and uk = ηklul its co-basis.

Transformation law under change of variable

Under a change of variable from to , Christoffel symbols transform as

where the overline denotes the Christoffel symbols in the coordinate system. The Christoffel symbol does not transform as a tensor, but rather as an object in the jet bundle. More precisely, the Christoffel symbols can be considered as functions on the jet bundle of the frame bundle of M, independent of any local coordinate system. Choosing a local coordinate system determines a local section of this bundle, which can then be used to pull back the Christoffel symbols to functions on M, though of course these functions then depend on the choice of local coordinate system.

For each point, there exist coordinate systems in which the Christoffel symbols vanish at the point. [16] These are called (geodesic) normal coordinates, and are often used in Riemannian geometry.

There are some interesting properties which can be derived directly from the transformation law.

Relationship to parallel transport and derivation of Christoffel symbols in Riemannian space

If a vector is transported parallel on a curve parametrized by some parameter on a Riemannian manifold, the rate of change of the components of the vector is given by

Now just by using the condition that the scalar product formed by two arbitrary vectors and is unchanged is enough to derive the Christoffel symbols. The condition is

which by the product rule expands to

Applying the parallel transport rule for the two arbitrary vectors and relabelling dummy indices and collecting the coefficients of (arbitrary), we obtain

This is same as the equation obtained by requiring the covariant derivative of the metric tensor to vanish in the General definition section. The derivation from here is simple. By cyclically permuting the indices in above equation, we can obtain two more equations and then linearly combining these three equations, we can express in terms of metric tensor.

Relationship to index-free notation

Let X and Y be vector fields with components Xi and Yk. Then the kth component of the covariant derivative of Y with respect to X is given by

Here, the Einstein notation is used, so repeated indices indicate summation over indices and contraction with the metric tensor serves to raise and lower indices:

Keep in mind that gikgik and that gik = δik, the Kronecker delta. The convention is that the metric tensor is the one with the lower indices; the correct way to obtain gik from gik is to solve the linear equations gijgjk = δik.

The statement that the connection is torsion-free, namely that

is equivalent to the statement that—in a coordinate basis—the Christoffel symbol is symmetric in the lower two indices:

The index-less transformation properties of a tensor are given by pullbacks for covariant indices, and pushforwards for contravariant indices. The article on covariant derivatives provides additional discussion of the correspondence between index-free notation and indexed notation.

Covariant derivatives of tensors

The covariant derivative of a vector field with components Vm is

By corollary, divergence of a vector can be obtained as

The covariant derivative of a covector field ωm is

The symmetry of the Christoffel symbol now implies

for any scalar field, but in general the covariant derivatives of higher order tensor fields do not commute (see curvature tensor).

The covariant derivative of a type (2, 0) tensor field Aik is

that is,

If the tensor field is mixed then its covariant derivative is

and if the tensor field is of type (0, 2) then its covariant derivative is

Contravariant derivatives of tensors

To find the contravariant derivative of a vector field, we must first transform it into a covariant derivative using the metric tensor

Applications

In general relativity

The Christoffel symbols find frequent use in Einstein's theory of general relativity, where spacetime is represented by a curved 4-dimensional Lorentz manifold with a Levi-Civita connection. The Einstein field equations—which determine the geometry of spacetime in the presence of matter—contain the Ricci tensor, and so calculating the Christoffel symbols is essential. Once the geometry is determined, the paths of particles and light beams are calculated by solving the geodesic equations in which the Christoffel symbols explicitly appear.

In classical (non-relativistic) mechanics

Let be the generalized coordinates and be the generalized velocities, then the kinetic energy for a unit mass is given by , where is the metric tensor. If , the potential function, exists then the contravariant components of the generalized force per unit mass are . The metric (here in a purely spatial domain) can be obtained from the line element . Substituting the Lagrangian into the Euler-Lagrange equation, we get [19]

Now multiplying by , we get

When Cartesian coordinates can be adopted (as in inertial frames of reference), we have an Euclidean metrics, the Christoffel symbol vanishes, and the equation reduces to Newton's second law of motion. In curvilinear coordinates [20] (forcedly in non-inertial frames, where the metrics is non-Euclidean and not flat), fictitious forces like the Centrifugal force and Coriolis force originate from the Christoffel symbols, so from the purely spatial curvilinear coordinates.

In Earth surface coordinates

Given a spherical coordinate system, which describes points on the Earth surface (approximated as an ideal sphere).

For a point x, R is the distance to the Earth core (usually approximately the Earth radius). θ and φ are the latitude and longitude. Positive θ is the northern hemisphere. To simplify the derivatives, the angles are given in radians (where d sin(x)/dx = cos(x), the degree values introduce an additional factor of 360 / 2 pi).

At any location, the tangent directions are (up), (north) and (east) - you can also use indices 1,2,3.

The related metric tensor has only diagonal elements (the squared vector lengths). This is an advantage of the coordinate system and not generally true.

[21]

Now the necessary quantities can be calculated. Examples:

The resulting Christoffel symbols of the second kind then are (organized by the "derivative" index i in a matrix):

These values show how the tangent directions (columns: , , ) change, seen from an outside perspective (e.g. from space), but given in the tangent directions of the actual location (rows: R, θ, φ).

As an example, take the nonzero derivatives by θ in , which corresponds to a movement towards north (positive dθ):

These effects are maybe not apparent during the movement, because they are the adjustments that keep the measurements in the coordinates R, θ, φ. Nevertheless, it can affect distances, physics equations, etc. So if e.g. you need the exact change of a magnetic field pointing approximately "south", it can be necessary to also correct your measurement by the change of the north direction using the Christoffel symbols to get the "true" (tensor) value.

The Christoffel symbols of the first kind show the same change using metric-corrected coordinates, e.g. for derivative by φ:

[21]

See also

Notes

  1. See, for instance, ( Spivak 1999 ) and ( Choquet-Bruhat & DeWitt-Morette 1977 )
  2. Ronald Adler, Maurice Bazin, Menahem Schiffer, Introduction to General Relativity (1965) McGraw-Hill Book Company ISBN   0-07-000423-4 (See section 2.1)
  3. Charles W. Misner, Kip S. Thorne, John Archibald Wheeler, Gravitation (1973) W. H. Freeman ISBN   0-7167-0334-3 (See chapters 8-11)
  4. Misner, Thorne, Wheeler, op. cit. (See chapter 13)
  5. Jurgen Jost, Riemannian Geometry and Geometric Analysis, (2002) Springer-Verlag ISBN   3-540-42627-2
  6. David Bleeker, Gauge Theory and Variational Principles (1991) Addison-Wesely Publishing Company ISBN   0-201-10096-7
  7. 1 2 3 Christoffel, E.B. (1869), "Ueber die Transformation der homogenen Differentialausdrücke zweiten Grades", Journal für die reine und angewandte Mathematik, 70: 46–70
  8. 1 2 3 Chatterjee, U.; Chatterjee, N. (2010). Vector & Tensor Analysis. p. 480.
  9. 1 2 3 "Christoffel Symbol of the Second Kind -- from Wolfram MathWorld". mathworld.wolfram.com. Archived from the original on 2009-01-23.
  10. 1 2 Bishop, R.L.; Goldberg (1968), Tensor Analysis on Manifolds, p. 241
  11. 1 2 Ludvigsen, Malcolm (1999), General Relativity: A Geometrical Approach, p. 88
  12. Chatterjee, U.; Chatterjee, N. (2010). Vector and Tensor Analysis. p. 480.
  13. Struik, D.J. (1961). Lectures on Classical Differential Geometry (first published in 1988 Dover ed.). p. 114.
  14. G. Ricci-Curbastro (1896). "Dei sistemi di congruenze ortogonali in una varietà qualunque". Mem. Acc. Lincei. 2 (5): 276–322.
  15. H. Levy (1925). "Ricci's coefficients of rotation". Bull. Amer. Math. Soc. 31 (3–4): 142–145. doi: 10.1090/s0002-9904-1925-03996-8 .
  16. This is assuming that the connection is symmetric (e.g., the Levi-Civita connection). If the connection has torsion, then only the symmetric part of the Christoffel symbol can be made to vanish.
  17. Einstein, Albert (2005). "The Meaning of Relativity (1956, 5th Edition)". Princeton University Press (2005).
  18. Schrödinger, E. (1950). Space-time structure. Cambridge University Press.
  19. Adler, R., Bazin, M., & Schiffer, M. Introduction to General Relativity (New York, 1965).
  20. David, Kay, Tensor Calculus (1988) McGraw-Hill Book Company ISBN   0-07-033484-6 (See section 11.4)
  21. 1 2 3 Sesslar, Alexander  J. “Published Mathematical Works | Christoffel Symbols and Spherical Coordinates .” 2023 https://sites.google.com/view/published-mathematical-works/home

Related Research Articles

<span class="mw-page-title-main">Divergence</span> Vector operator in vector calculus

In vector calculus, divergence is a vector operator that operates on a vector field, producing a scalar field giving the quantity of the vector field's source at each point. More technically, the divergence represents the volume density of the outward flux of a vector field from an infinitesimal volume around a given point.

<span class="mw-page-title-main">Gradient</span> Multivariate derivative (mathematics)

In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field whose value at a point gives the direction and the rate of fastest increase. The gradient transforms like a vector under change of basis of the space of variables of . If the gradient of a function is non-zero at a point , the direction of the gradient is the direction in which the function increases most quickly from , and the magnitude of the gradient is the rate of increase in that direction, the greatest absolute directional derivative. Further, a point where the gradient is the zero vector is known as a stationary point. The gradient thus plays a fundamental role in optimization theory, where it is used to minimize a function by gradient descent. In coordinate-free terms, the gradient of a function may be defined by:

<span class="mw-page-title-main">Laplace's equation</span> Second-order partial differential equation

In mathematics and physics, Laplace's equation is a second-order partial differential equation named after Pierre-Simon Laplace, who first studied its properties. This is often written as

<span class="mw-page-title-main">Navier–Stokes equations</span> Equations describing the motion of viscous fluid substances

The Navier–Stokes equations are partial differential equations which describe the motion of viscous fluid substances. They were named after French engineer and physicist Claude-Louis Navier and the Irish physicist and mathematician George Gabriel Stokes. They were developed over several decades of progressively building the theories, from 1822 (Navier) to 1842–1850 (Stokes).

In mathematics, the Laplace operator or Laplacian is a differential operator given by the divergence of the gradient of a scalar function on Euclidean space. It is usually denoted by the symbols , (where is the nabla operator), or . In a Cartesian coordinate system, the Laplacian is given by the sum of second partial derivatives of the function with respect to each independent variable. In other coordinate systems, such as cylindrical and spherical coordinates, the Laplacian also has a useful form. Informally, the Laplacian Δf (p) of a function f at a point p measures by how much the average value of f over small spheres or balls centered at p deviates from f (p).

In the mathematical field of differential geometry, a metric tensor is an additional structure on a manifold M that allows defining distances and angles, just as the inner product on a Euclidean space allows defining distances and angles there. More precisely, a metric tensor at a point p of M is a bilinear form defined on the tangent space at p, and a metric field on M consists of a metric tensor at each point p of M that varies smoothly with p.

<span class="mw-page-title-main">Spherical harmonics</span> Special mathematical functions defined on the surface of a sphere

In mathematics and physical science, spherical harmonics are special functions defined on the surface of a sphere. They are often employed in solving partial differential equations in many scientific fields. A list of the spherical harmonics is available in Table of spherical harmonics.

<span class="mw-page-title-main">Four-vector</span> 4-dimensional vector in relativity

In special relativity, a four-vector is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vector space considered as a representation space of the standard representation of the Lorentz group, the representation. It differs from a Euclidean vector in how its magnitude is determined. The transformations that preserve this magnitude are the Lorentz transformations, which include spatial rotations and boosts.

In Riemannian or pseudo-Riemannian geometry, the Levi-Civita connection is the unique affine connection on the tangent bundle of a manifold that preserves the (pseudo-)Riemannian metric and is torsion-free.

In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a differential operator, to be contrasted with the approach given by a principal connection on the frame bundle – see affine connection. In the special case of a manifold isometrically embedded into a higher-dimensional Euclidean space, the covariant derivative can be viewed as the orthogonal projection of the Euclidean directional derivative onto the manifold's tangent space. In this case the Euclidean derivative is broken into two parts, the extrinsic normal component and the intrinsic covariant derivative component.

In rotordynamics, the rigid rotor is a mechanical model of rotating systems. An arbitrary rigid rotor is a 3-dimensional rigid object, such as a top. To orient such an object in space requires three angles, known as Euler angles. A special rigid rotor is the linear rotor requiring only two angles to describe, for example of a diatomic molecule. More general molecules are 3-dimensional, such as water, ammonia, or methane.

<span class="mw-page-title-main">Cartesian tensor</span>

In geometry and linear algebra, a Cartesian tensor uses an orthonormal basis to represent a tensor in a Euclidean space in the form of components. Converting a tensor's components from one such basis to another is done through an orthogonal transformation.

A theoretical motivation for general relativity, including the motivation for the geodesic equation and the Einstein field equation, can be obtained from special relativity by examining the dynamics of particles in circular orbits about the Earth. A key advantage in examining circular orbits is that it is possible to know the solution of the Einstein Field Equation a priori. This provides a means to inform and verify the formalism.

<span class="mw-page-title-main">Mathematical descriptions of the electromagnetic field</span> Formulations of electromagnetism

There are various mathematical descriptions of the electromagnetic field that are used in the study of electromagnetism, one of the four fundamental interactions of nature. In this article, several approaches are discussed, although the equations are in terms of electric and magnetic fields, potentials, and charges with currents, generally speaking.

<span class="mw-page-title-main">Differential geometry of surfaces</span> The mathematics of smooth surfaces

In mathematics, the differential geometry of surfaces deals with the differential geometry of smooth surfaces with various additional structures, most often, a Riemannian metric. Surfaces have been extensively studied from various perspectives: extrinsically, relating to their embedding in Euclidean space and intrinsically, reflecting their properties determined solely by the distance within the surface as measured along curves on the surface. One of the fundamental concepts investigated is the Gaussian curvature, first studied in depth by Carl Friedrich Gauss, who showed that curvature was an intrinsic property of a surface, independent of its isometric embedding in Euclidean space.

The derivatives of scalars, vectors, and second-order tensors with respect to second-order tensors are of considerable use in continuum mechanics. These derivatives are used in the theories of nonlinear elasticity and plasticity, particularly in the design of algorithms for numerical simulations.

In fluid dynamics, the Oseen equations describe the flow of a viscous and incompressible fluid at small Reynolds numbers, as formulated by Carl Wilhelm Oseen in 1910. Oseen flow is an improved description of these flows, as compared to Stokes flow, with the (partial) inclusion of convective acceleration.

Curvilinear coordinates can be formulated in tensor calculus, with important applications in physics and engineering, particularly for describing transportation of physical quantities and deformation of matter in fluid mechanics and continuum mechanics.

Lagrangian field theory is a formalism in classical field theory. It is the field-theoretic analogue of Lagrangian mechanics. Lagrangian mechanics is used to analyze the motion of a system of discrete particles each with a finite number of degrees of freedom. Lagrangian field theory applies to continua and fields, which have an infinite number of degrees of freedom.

References