Connection (mathematics)

Last updated

In geometry, the notion of a connection makes precise the idea of transporting local geometric objects, such as tangent vectors or tensors in the tangent space, along a curve or family of curves in a parallel and consistent manner. There are various kinds of connections in modern geometry, depending on what sort of data one wants to transport. For instance, an affine connection, the most elementary type of connection, gives a means for parallel transport of tangent vectors on a manifold from one point to another along a curve. An affine connection is typically given in the form of a covariant derivative, which gives a means for taking directional derivatives of vector fields, measuring the deviation of a vector field from being parallel in a given direction.

Contents

Connections are of central importance in modern geometry in large part because they allow a comparison between the local geometry at one point and the local geometry at another point. Differential geometry embraces several variations on the connection theme, which fall into two major groups: the infinitesimal and the local theory. The local theory concerns itself primarily with notions of parallel transport and holonomy. The infinitesimal theory concerns itself with the differentiation of geometric data. Thus a covariant derivative is a way of specifying a derivative of a vector field along another vector field on a manifold. A Cartan connection is a way of formulating some aspects of connection theory using differential forms and Lie groups. An Ehresmann connection is a connection in a fibre bundle or a principal bundle by specifying the allowed directions of motion of the field. A Koszul connection is a connection which defines directional derivative for sections of a vector bundle more general than the tangent bundle.

Connections also lead to convenient formulations of geometric invariants, such as the curvature (see also curvature tensor and curvature form), and torsion tensor.

Motivation: the unsuitability of coordinates

Parallel transport (of the black arrow) on a sphere. Blue and red arrows represent parallel transports in different directions but ending at the same lower right point. The fact that they end up pointing in different directions is a result of the curvature of the sphere. Connection-on-sphere.png
Parallel transport (of the black arrow) on a sphere. Blue and red arrows represent parallel transports in different directions but ending at the same lower right point. The fact that they end up pointing in different directions is a result of the curvature of the sphere.

Consider the following problem. Suppose that a tangent vector to the sphere S is given at the north pole, and we are to define a manner of consistently moving this vector to other points of the sphere: a means for parallel transport. Naively, this could be done using a particular coordinate system. However, unless proper care is applied, the parallel transport defined in one system of coordinates will not agree with that of another coordinate system. A more appropriate parallel transportation system exploits the symmetry of the sphere under rotation. Given a vector at the north pole, one can transport this vector along a curve by rotating the sphere in such a way that the north pole moves along the curve without axial rolling. This latter means of parallel transport is the Levi-Civita connection on the sphere. If two different curves are given with the same initial and terminal point, and a vector v is rigidly moved along the first curve by a rotation, the resulting vector at the terminal point will be different from the vector resulting from rigidly moving v along the second curve. This phenomenon reflects the curvature of the sphere. A simple mechanical device that can be used to visualize parallel transport is the south-pointing chariot.

For instance, suppose that S is a sphere given coordinates by the stereographic projection. Regard S as consisting of unit vectors in R3. Then S carries a pair of coordinate patches corresponding to the projections from north pole and south pole. The mappings

cover a neighborhood U0 of the north pole and U1 of the south pole, respectively. Let X, Y, Z be the ambient coordinates in R3. Then φ0 and φ1 have inverses

so that the coordinate transition function is inversion in the circle:

Let us now represent a vector field on S (an assignment of a tangent vector to each point in S) in local coordinates. If P is a point of U0S, then a vector field may be represented by the pushforward of a vector field v0 on R2 by :

 

 

 

 

(1)

where denotes the Jacobian matrix of φ0 (), and v0 = v0(x, y) is a vector field on R2 uniquely determined by v (since the pushforward of a local diffeomorphism at any point is invertible). Furthermore, on the overlap between the coordinate charts U0U1, it is possible to represent the same vector field with respect to the φ1 coordinates:

 

 

 

 

(2)

To relate the components v0 and v1, apply the chain rule to the identity φ1 = φ0 o φ01:

Applying both sides of this matrix equation to the component vector v11−1(P)) and invoking ( 1 ) and ( 2 ) yields

 

 

 

 

(3)

We come now to the main question of defining how to transport a vector field parallelly along a curve. Suppose that P(t) is a curve in S. Naïvely, one may consider a vector field parallel if the coordinate components of the vector field are constant along the curve. However, an immediate ambiguity arises: in which coordinate system should these components be constant?

For instance, suppose that v(P(t)) has constant components in the U1 coordinate system. That is, the functions v1(φ11(P(t))) are constant. However, applying the product rule to ( 3 ) and using the fact that dv1/dt = 0 gives

But is always a non-singular matrix (provided that the curve P(t) is not stationary), so v1 and v0cannot ever be simultaneously constant along the curve.

Resolution

The problem observed above is that the usual directional derivative of vector calculus does not behave well under changes in the coordinate system when applied to the components of vector fields. This makes it quite difficult to describe how to translate vector fields in a parallel manner, if indeed such a notion makes any sense at all. There are two fundamentally different ways of resolving this problem.

The first approach is to examine what is required for a generalization of the directional derivative to "behave well" under coordinate transitions. This is the tactic taken by the covariant derivative approach to connections: good behavior is equated with covariance. Here one considers a modification of the directional derivative by a certain linear operator, whose components are called the Christoffel symbols, which involves no derivatives on the vector field itself. The directional derivative Duv of the components of a vector v in a coordinate system φ in the direction u are replaced by a covariant derivative:

where Γ depends on the coordinate system φ and is bilinear in u and v. In particular, Γ does not involve any derivatives on u or v. In this approach, Γ must transform in a prescribed manner when the coordinate system φ is changed to a different coordinate system. This transformation is not tensorial, since it involves not only the first derivative of the coordinate transition, but also its second derivative. Specifying the transformation law of Γ is not sufficient to determine Γ uniquely. Some other normalization conditions must be imposed, usually depending on the type of geometry under consideration. In Riemannian geometry, the Levi-Civita connection requires compatibility of the Christoffel symbols with the metric (as well as a certain symmetry condition). With these normalizations, the connection is uniquely defined.

The second approach is to use Lie groups to attempt to capture some vestige of symmetry on the space. This is the approach of Cartan connections. The example above using rotations to specify the parallel transport of vectors on the sphere is very much in this vein.

Historical survey of connections

Historically, connections were studied from an infinitesimal perspective in Riemannian geometry. The infinitesimal study of connections began to some extent with Elwin Christoffel. This was later taken up more thoroughly by Gregorio Ricci-Curbastro and Tullio Levi-Civita ( Levi-Civita & Ricci 1900 ) who observed in part that a connection in the infinitesimal sense of Christoffel also allowed for a notion of parallel transport.

The work of Levi-Civita focused exclusively on regarding connections as a kind of differential operator whose parallel displacements were then the solutions of differential equations. As the twentieth century progressed, Élie Cartan developed a new notion of connection. He sought to apply the techniques of Pfaffian systems to the geometries of Felix Klein's Erlangen program. In these investigations, he found that a certain infinitesimal notion of connection (a Cartan connection) could be applied to these geometries and more: his connection concept allowed for the presence of curvature which would otherwise be absent in a classical Klein geometry. (See, for example, ( Cartan 1926 ) and ( Cartan 1983 ).) Furthermore, using the dynamics of Gaston Darboux, Cartan was able to generalize the notion of parallel transport for his class of infinitesimal connections. This established another major thread in the theory of connections: that a connection is a certain kind of differential form.

The two threads in connection theory have persisted through the present day: a connection as a differential operator, and a connection as a differential form. In 1950, Jean-Louis Koszul ( Koszul 1950 ) gave an algebraic framework for regarding a connection as a differential operator by means of the Koszul connection. The Koszul connection was both more general than that of Levi-Civita, and was easier to work with because it finally was able to eliminate (or at least to hide) the awkward Christoffel symbols from the connection formalism. The attendant parallel displacement operations also had natural algebraic interpretations in terms of the connection. Koszul's definition was subsequently adopted by most of the differential geometry community, since it effectively converted the analytic correspondence between covariant differentiation and parallel translation to an algebraic one.

In that same year, Charles Ehresmann ( Ehresmann 1950 ), a student of Cartan's, presented a variation on the connection as a differential form view in the context of principal bundles and, more generally, fibre bundles. Ehresmann connections were, strictly speaking, not a generalization of Cartan connections. Cartan connections were quite rigidly tied to the underlying differential topology of the manifold because of their relationship with Cartan's equivalence method. Ehresmann connections were rather a solid framework for viewing the foundational work of other geometers of the time, such as Shiing-Shen Chern, who had already begun moving away from Cartan connections to study what might be called gauge connections. In Ehresmann's point of view, a connection in a principal bundle consists of a specification of horizontal and vertical vector fields on the total space of the bundle. A parallel translation is then a lifting of a curve from the base to a curve in the principal bundle which is horizontal. This viewpoint has proven especially valuable in the study of holonomy.

Possible approaches

See also

Related Research Articles

<span class="mw-page-title-main">Gradient</span> Multivariate derivative (mathematics)

In vector calculus, the gradient of a scalar-valued differentiable function of several variables is the vector field whose value at a point is the "direction and rate of fastest increase". The gradient transforms like a vector under change of basis of the space of variables of . If the gradient of a function is non-zero at a point , the direction of the gradient is the direction in which the function increases most quickly from , and the magnitude of the gradient is the rate of increase in that direction, the greatest absolute directional derivative. Further, a point where the gradient is the zero vector is known as a stationary point. The gradient thus plays a fundamental role in optimization theory, where it is used to maximize a function by gradient ascent. In coordinate-free terms, the gradient of a function may be defined by:

<span class="mw-page-title-main">Geodesic</span> Straight path on a curved surface or a Riemannian manifold

In geometry, a geodesic is a curve representing in some sense the shortest path (arc) between two points in a surface, or more generally in a Riemannian manifold. The term also has meaning in any differentiable manifold with a connection. It is a generalization of the notion of a "straight line".

In the mathematical field of differential geometry, a metric tensor is an additional structure on a manifold M that allows defining distances and angles, just as the inner product on a Euclidean space allows defining distances and angles there. More precisely, a metric tensor at a point p of M is a bilinear form defined on the tangent space at p, and a metric tensor on M consists of a metric tensor at each point p of M that varies smoothly with p.

In Riemannian or pseudo-Riemannian geometry, the Levi-Civita connection is the unique affine connection on the tangent bundle of a manifold that preserves the (pseudo-)Riemannian metric and is torsion-free.

<span class="mw-page-title-main">Parallel transport</span> Construct in differential geometry

In geometry, parallel transport is a way of transporting geometrical data along smooth curves in a manifold. If the manifold is equipped with an affine connection, then this connection allows one to transport vectors of the manifold along curves so that they stay parallel with respect to the connection.

In differential geometry, the Lie derivative, named after Sophus Lie by Władysław Ślebodziński, evaluates the change of a tensor field, along the flow defined by another vector field. This change is coordinate invariant and therefore the Lie derivative is defined on any differentiable manifold.

In mathematics, conformal geometry is the study of the set of angle-preserving (conformal) transformations on a space.

In mathematics, and especially differential geometry and gauge theory, a connection on a fiber bundle is a device that defines a notion of parallel transport on the bundle; that is, a way to "connect" or identify fibers over nearby points. The most common case is that of a linear connection on a vector bundle, for which the notion of parallel transport must be linear. A linear connection is equivalently specified by a covariant derivative, an operator that differentiates sections of the bundle along tangent directions in the base manifold, in such a way that parallel sections have derivative zero. Linear connections generalize, to arbitrary vector bundles, the Levi-Civita connection on the tangent bundle of a pseudo-Riemannian manifold, which gives a standard way to differentiate vector fields. Nonlinear connections generalize this concept to bundles whose fibers are not necessarily linear.

In the mathematical field of differential geometry, a Cartan connection is a flexible generalization of the notion of an affine connection. It may also be regarded as a specialization of the general concept of a principal connection, in which the geometry of the principal bundle is tied to the geometry of the base manifold using a solder form. Cartan connections describe the geometry of manifolds modelled on homogeneous spaces.

In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a differential operator, to be contrasted with the approach given by a principal connection on the frame bundle – see affine connection. In the special case of a manifold isometrically embedded into a higher-dimensional Euclidean space, the covariant derivative can be viewed as the orthogonal projection of the Euclidean directional derivative onto the manifold's tangent space. In this case the Euclidean derivative is broken into two parts, the extrinsic normal component and the intrinsic covariant derivative component.

<span class="mw-page-title-main">Affine connection</span> Construct allowing differentiation of tangent vector fields of manifolds

In differential geometry, an affine connection is a geometric object on a smooth manifold which connects nearby tangent spaces, so it permits tangent vector fields to be differentiated as if they were functions on the manifold with values in a fixed vector space. Connections are among the simplest methods of defining differentiation of the sections of vector bundles.

In mathematics and physics, the Christoffel symbols are an array of numbers describing a metric connection. The metric connection is a specialization of the affine connection to surfaces or other manifolds endowed with a metric, allowing distances to be measured on that surface. In differential geometry, an affine connection can be defined without reference to a metric, and many additional concepts follow: parallel transport, covariant derivatives, geodesics, etc. also do not require the concept of a metric. However, when a metric is available, these concepts can be directly tied to the "shape" of the manifold itself; that shape is determined by how the tangent space is attached to the cotangent space by the metric tensor. Abstractly, one would say that the manifold has an associated (orthonormal) frame bundle, with each "frame" being a possible choice of a coordinate frame. An invariant metric implies that the structure group of the frame bundle is the orthogonal group O(p, q). As a result, such a manifold is necessarily a (pseudo-)Riemannian manifold. The Christoffel symbols provide a concrete representation of the connection of (pseudo-)Riemannian geometry in terms of coordinates on the manifold. Additional concepts, such as parallel transport, geodesics, etc. can then be expressed in terms of Christoffel symbols.

<span class="mw-page-title-main">Differentiable manifold</span> Manifold upon which it is possible to perform calculus

In mathematics, a differentiable manifold is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One may then apply ideas from calculus while working within the individual charts, since each chart lies within a vector space to which the usual rules of calculus apply. If the charts are suitably compatible, then computations done in one chart are valid in any other differentiable chart.

<span class="mw-page-title-main">Torsion tensor</span> Manner of characterizing a twist or screw of a moving frame around a curve

In differential geometry, the notion of torsion is a manner of characterizing a twist or screw of a moving frame around a curve. The torsion of a curve, as it appears in the Frenet–Serret formulas, for instance, quantifies the twist of a curve about its tangent vector as the curve evolves. In the geometry of surfaces, the geodesic torsion describes how a surface twists about a curve on the surface. The companion notion of curvature measures how moving frames "roll" along a curve "without twisting".

In differential geometry, an Ehresmann connection is a version of the notion of a connection, which makes sense on any smooth fiber bundle. In particular, it does not rely on the possible vector bundle structure of the underlying fiber bundle, but nevertheless, linear connections may be viewed as a special case. Another important special case of Ehresmann connections are principal connections on principal bundles, which are required to be equivariant in the principal Lie group action.

<span class="mw-page-title-main">Mathematical descriptions of the electromagnetic field</span> Formulations of electromagnetism

There are various mathematical descriptions of the electromagnetic field that are used in the study of electromagnetism, one of the four fundamental interactions of nature. In this article, several approaches are discussed, although the equations are in terms of electric and magnetic fields, potentials, and charges with currents, generally speaking.

The tetrad formalism is an approach to general relativity that generalizes the choice of basis for the tangent bundle from a coordinate basis to the less restrictive choice of a local basis, i.e. a locally defined set of four linearly independent vector fields called a tetrad or vierbein. It is a special case of the more general idea of a vielbein formalism, which is set in (pseudo-)Riemannian geometry. This article as currently written makes frequent mention of general relativity; however, almost everything it says is equally applicable to (pseudo-)Riemannian manifolds in general, and even to spin manifolds. Most statements hold simply by substituting arbitrary for . In German, "vier" translates to "four", and "viel" to "many".

<span class="mw-page-title-main">Differential geometry of surfaces</span> The mathematics of smooth surfaces

In mathematics, the differential geometry of surfaces deals with the differential geometry of smooth surfaces with various additional structures, most often, a Riemannian metric. Surfaces have been extensively studied from various perspectives: extrinsically, relating to their embedding in Euclidean space and intrinsically, reflecting their properties determined solely by the distance within the surface as measured along curves on the surface. One of the fundamental concepts investigated is the Gaussian curvature, first studied in depth by Carl Friedrich Gauss, who showed that curvature was an intrinsic property of a surface, independent of its isometric embedding in Euclidean space.

In mathematics, the Riemannian connection on a surface or Riemannian 2-manifold refers to several intrinsic geometric structures discovered by Tullio Levi-Civita, Élie Cartan and Hermann Weyl in the early part of the twentieth century: parallel transport, covariant derivative and connection form. These concepts were put in their current form with principal bundles only in the 1950s. The classical nineteenth century approach to the differential geometry of surfaces, due in large part to Carl Friedrich Gauss, has been reworked in this modern framework, which provides the natural setting for the classical theory of the moving frame as well as the Riemannian geometry of higher-dimensional Riemannian manifolds. This account is intended as an introduction to the theory of connections.

Lagrangian field theory is a formalism in classical field theory. It is the field-theoretic analogue of Lagrangian mechanics. Lagrangian mechanics is used to analyze the motion of a system of discrete particles each with a finite number of degrees of freedom. Lagrangian field theory applies to continua and fields, which have an infinite number of degrees of freedom.

References