Tensor contraction

Last updated January 13, 2024

In multilinear algebra, a tensor contraction is an operation on a tensor that arises from the canonical pairing of a vector space and its dual. In components, it is expressed as a sum of products of scalar components of the tensor(s) caused by applying the summation convention to a pair of dummy indices that are bound to each other in an expression. The contraction of a single mixed tensor occurs when a pair of literal indices (one a subscript, the other a superscript) of the tensor are set equal to each other and summed over. In Einstein notation this summation is built into the notation. The result is another tensor with order reduced by 2.

Abstract formulation

Let V be a vector space over a field k. The core of the contraction operation, and the simplest case, is the canonical pairing of V with its dual vector space V^∗. The pairing is the linear map from the tensor product of these two spaces to the field k:

C:V\otimes V^{*}\rightarrow k

corresponding to the bilinear form

\langle v,f\rangle =f(v)

where f is in V^∗ and v is in V. The map C defines the contraction operation on a tensor of type (1, 1), which is an element of $V\otimes V^{*}$ . Note that the result is a scalar (an element of k). In finite dimensions, using the natural isomorphism between $V\otimes V^{*}$ and the space of linear map from V to V,^[1] one obtains a basis-free definition of the trace.

In general, a tensor of type (m, n) (with m ≥ 1 and n ≥ 1) is an element of the vector space

V\otimes \cdots \otimes V\otimes V^{*}\otimes \cdots \otimes V^{*}

(where there are m factors V and n factors V^∗).^[2]^[3] Applying the canonical pairing to the kth V factor and the lth V^∗ factor, and using the identity on all other factors, defines the (k, l) contraction operation, which is a linear map that yields a tensor of type (m − 1, n − 1).^[2] By analogy with the (1, 1) case, the general contraction operation is sometimes called the trace.

Contraction in index notation

In tensor index notation, the basic contraction of a vector and a dual vector is denoted by

{\tilde {f}}({\vec {v}})=f_{\gamma }v^{\gamma },

which is shorthand for the explicit coordinate summation^[4]

f_{\gamma }v^{\gamma }=f_{1}v^{1}+f_{2}v^{2}+\cdots +f_{n}v^{n}

(where $v i$ are the components of $v$ in a particular basis and $f i$ are the components of $f$ in the corresponding dual basis).

Since a general mixed dyadic tensor is a linear combination of decomposable tensors of the form $f\otimes v$ , the explicit formula for the dyadic case follows: let

\mathbf {T} =T_{j}^{i}\mathbf {e} _{i}\otimes \mathbf {e} ^{j}

be a mixed dyadic tensor. Then its contraction is

T_{j}^{i}\mathbf {e} _{i}\cdot \mathbf {e} ^{j}=T_{j}^{i}\delta _{i}{}^{j}=T_{j}^{j}=T_{1}^{1}+\cdots +T_{n}^{n}

.

A general contraction is denoted by labeling one covariant index and one contravariant index with the same letter, summation over that index being implied by the summation convention. The resulting contracted tensor inherits the remaining indices of the original tensor. For example, contracting a tensor T of type (2,2) on the second and third indices to create a new tensor U of type (1,1) is written as

T^{ab}{}_{bc}=\sum _{b}{T^{ab}{}_{bc}}=T^{a1}{}_{1c}+T^{a2}{}_{2c}+\cdots +T^{an}{}_{nc}=U^{a}{}_{c}.

By contrast, let

\mathbf {T} =\mathbf {e} ^{i}\otimes \mathbf {e} ^{j}

be an unmixed dyadic tensor. This tensor does not contract; if its base vectors are dotted,^{[ clarification needed ]} the result is the contravariant metric tensor,

g^{ij}=\mathbf {e} ^{i}\cdot \mathbf {e} ^{j}

,

whose rank is 2.

Metric contraction

As in the previous example, contraction on a pair of indices that are either both contravariant or both covariant is not possible in general. However, in the presence of an inner product (also known as a metric) g, such contractions are possible. One uses the metric to raise or lower one of the indices, as needed, and then one uses the usual operation of contraction. The combined operation is known as metric contraction .^[5]

Application to tensor fields

Contraction is often applied to tensor fields over spaces (e.g. Euclidean space, manifolds, or schemes ^{[ citation needed ]}). Since contraction is a purely algebraic operation, it can be applied pointwise to a tensor field, e.g. if T is a (1,1) tensor field on Euclidean space, then in any coordinates, its contraction (a scalar field) U at a point x is given by

U(x)=\sum _{i}T_{i}^{i}(x)

Since the role of x is not complicated here, it is often suppressed, and the notation for tensor fields becomes identical to that for purely algebraic tensors.

Over a Riemannian manifold, a metric (field of inner products) is available, and both metric and non-metric contractions are crucial to the theory. For example, the Ricci tensor is a non-metric contraction of the Riemann curvature tensor, and the scalar curvature is the unique metric contraction of the Ricci tensor.

One can also view contraction of a tensor field in the context of modules over an appropriate ring of functions on the manifold^[5] or the context of sheaves of modules over the structure sheaf;^[6] see the discussion at the end of this article.

Tensor divergence

As an application of the contraction of a tensor field, let V be a vector field on a Riemannian manifold (for example, Euclidean space). Let $V^{\alpha }{}_{\beta }$ be the covariant derivative of V (in some choice of coordinates). In the case of Cartesian coordinates in Euclidean space, one can write

V^{\alpha }{}_{\beta }={\partial V^{\alpha } \over \partial x^{\beta }}.

Then changing index β to α causes the pair of indices to become bound to each other, so that the derivative contracts with itself to obtain the following sum:

V^{\alpha }{}_{\alpha }=V^{0}{}_{0}+\cdots +V^{n}{}_{n},

which is the divergence div V. Then

\operatorname {div} V=V^{\alpha }{}_{\alpha }=0

is a continuity equation for V.

In general, one can define various divergence operations on higher-rank tensor fields, as follows. If T is a tensor field with at least one contravariant index, taking the covariant differential and contracting the chosen contravariant index with the new covariant index corresponding to the differential results in a new tensor of rank one lower than that of T.^[5]

Contraction of a pair of tensors

One can generalize the core contraction operation (vector with dual vector) in a slightly different way, by considering a pair of tensors T and U. The tensor product $T\otimes U$ is a new tensor, which, if it has at least one covariant and one contravariant index, can be contracted. The case where T is a vector and U is a dual vector is exactly the core operation introduced first in this article.

In tensor index notation, to contract two tensors with each other, one places them side by side (juxtaposed) as factors of the same term. This implements the tensor product, yielding a composite tensor. Contracting two indices in this composite tensor implements the desired contraction of the two tensors.

For example, matrices can be represented as tensors of type (1,1) with the first index being contravariant and the second index being covariant. Let $\Lambda ^{\alpha }{}_{\beta }$ be the components of one matrix and let $\mathrm {M} ^{\beta }{}_{\gamma }$ be the components of a second matrix. Then their multiplication is given by the following contraction, an example of the contraction of a pair of tensors:

\Lambda ^{\alpha }{}_{\beta }\mathrm {M} ^{\beta }{}_{\gamma }=\mathrm {N} ^{\alpha }{}_{\gamma }

.

Also, the interior product of a vector with a differential form is a special case of the contraction of two tensors with each other.

More general algebraic contexts

Let R be a commutative ring and let M be a finite free module over R. Then contraction operates on the full (mixed) tensor algebra of M in exactly the same way as it does in the case of vector spaces over a field. (The key fact is that the canonical pairing is still perfect in this case.)

More generally, let O_X be a sheaf of commutative rings over a topological space X, e.g. O_X could be the structure sheaf of a complex manifold, analytic space, or scheme. Let M be a locally free sheaf of modules over O_X of finite rank. Then the dual of M is still well-behaved^[6] and contraction operations make sense in this context.

Notes

↑ Let L(V, V) be the space of linear maps from V to V. Then the natural map
$V^{*}\otimes V\rightarrow L(V,V)$
is defined by
$f\otimes v\mapsto g,$
where g(w) = f(w)v. Suppose that V is finite-dimensional. If {v_i} is a basis of V and {fⁱ} is the corresponding dual basis, then $f^{i}\otimes v_{j}$ maps to the transformation whose matrix in this basis has only one nonzero entry, a 1 in the i,j position. This shows that the map is an isomorphism.
1 2 Fulton, William; Harris, Joe (1991). Representation Theory: A First Course. GTM. Vol. 129. New York: Springer. pp. 471–476. ISBN 0-387-97495-4.
↑ Warner, Frank (1993). Foundations of Differentiable Manifolds and Lie Groups. GTM. Vol. 94. New York: Springer. pp. 54–56. ISBN 0-387-90894-3.
↑ In physics (and sometimes in mathematics), indices often start with zero instead of one. In four-dimensional spacetime, indices run from 0 to 3.
1 2 3 O'Neill, Barrett (1983). Semi-Riemannian Geometry with Applications to Relativity. Academic Press. p. 86. ISBN 0-12-526740-1.
1 2 Hartshorne, Robin (1977). Algebraic Geometry. New York: Springer. ISBN 0-387-90244-9.

Related Research Articles

In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects related to a vector space. Tensors may map between different objects such as vectors, scalars, and even other tensors. There are many types of tensors, including scalars and vectors, dual vectors, multilinear maps between vector spaces, and even some operations such as the dot product. Tensors are defined independent of any basis, although they are often referred to by their components in a basis related to a particular coordinate system; those components form an array, which can be thought of as a high-dimensional matrix.

In mathematics, especially the usage of linear algebra in mathematical physics and differential geometry, Einstein notation is a notational convention that implies summation over a set of indexed terms in a formula, thus achieving brevity. As part of mathematics it is a notational subset of Ricci calculus; however, it is often used in physics applications that do not distinguish between tangent and cotangent spaces. It was introduced to physics by Albert Einstein in 1916.

In the mathematical field of differential geometry, a metric tensor is an additional structure on a manifold $M$ that allows defining distances and angles, just as the inner product on a Euclidean space allows defining distances and angles there. More precisely, a metric tensor at a point $p$ of $M$ is a bilinear form defined on the tangent space at $p$ , and a metric tensor on $M$ consists of a metric tensor at each point $p$ of $M$ that varies smoothly with $p$ .

In mathematics, particularly in linear algebra, tensor analysis, and differential geometry, the Levi-Civita symbol or Levi-Civita epsilon represents a collection of numbers; defined from the sign of a permutation of the natural numbers $1, 2, ..., n$ , for some positive integer $n$ . It is named after the Italian mathematician and physicist Tullio Levi-Civita. Other names include the permutation symbol, antisymmetric symbol, or alternating symbol, which refer to its antisymmetric property and definition in terms of permutations.

In tensor analysis, a mixed tensor is a tensor which is neither strictly covariant nor strictly contravariant; at least one of the indices of a mixed tensor will be a subscript (covariant) and at least one of the indices will be a superscript (contravariant).

In physics, especially in multilinear algebra and tensor analysis, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a change of basis. In modern mathematical notation, the role is sometimes swapped.

In special relativity, a four-vector is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vector space considered as a representation space of the standard representation of the Lorentz group, the representation. It differs from a Euclidean vector in how its magnitude is determined. The transformations that preserve this magnitude are the Lorentz transformations, which include spatial rotations and boosts.

In mathematics, the modern component-free approach to the theory of a tensor views a tensor as an abstract object, expressing some definite type of multilinear concept. Their properties can be derived from their definitions, as linear maps or more generally; and the rules for manipulations of tensors arise as an extension of linear algebra to multilinear algebra.

In mathematics and physics, a tensor field assigns a tensor to each point of a mathematical space. Tensor fields are used in differential geometry, algebraic geometry, general relativity, in the analysis of stress and strain in materials, and in numerous applications in the physical sciences. As a tensor is a generalization of a scalar and a vector, a tensor field is a generalization of a scalar field or vector field that assigns, respectively, a scalar or vector to each point of space. If a tensor $A$ is defined on a vector fields set $X(M)$ over a module $M$ , we call $A$ a tensor field on $M$ .

In mathematics, the covariant derivative is a way of specifying a derivative along tangent vectors of a manifold. Alternatively, the covariant derivative is a way of introducing and working with a connection on a manifold by means of a differential operator, to be contrasted with the approach given by a principal connection on the frame bundle – see affine connection. In the special case of a manifold isometrically embedded into a higher-dimensional Euclidean space, the covariant derivative can be viewed as the orthogonal projection of the Euclidean directional derivative onto the manifold's tangent space. In this case the Euclidean derivative is broken into two parts, the extrinsic normal component and the intrinsic covariant derivative component.

Let $be a smooth map between smooth manifolds and . Then there is an associated linear map from the space of 1-forms on to the space of 1-forms on . This linear map is known as the pullback, and is frequently denoted by . More generally, any covariant tensor field - in particular any differential form - on may be pulled back to using .$

In mathematics, and specifically differential geometry, a connection form is a manner of organizing the data of a connection using the language of moving frames and differential forms.

<span class="mw-page-title-main">Curvilinear coordinates</span> Coordinate system whose directions vary in space

In geometry, curvilinear coordinates are a coordinate system for Euclidean space in which the coordinate lines may be curved. These coordinates may be derived from a set of Cartesian coordinates by using a transformation that is locally invertible at each point. This means that one can convert a point given in a Cartesian coordinate system to its curvilinear coordinates and back. The name curvilinear coordinates, coined by the French mathematician Lamé, derives from the fact that the coordinate surfaces of the curvilinear systems are curved.

In differential geometry, a tensor density or relative tensor is a generalization of the tensor field concept. A tensor density transforms as a tensor field when passing from one coordinate system to another, except that it is additionally multiplied or weighted by a power W of the Jacobian determinant of the coordinate transition function or its absolute value. A tensor density with a single index is called a vector density. A distinction is made among (authentic) tensor densities, pseudotensor densities, even tensor densities and odd tensor densities. Sometimes tensor densities with a negative weight W are called tensor capacity. A tensor density can also be regarded as a section of the tensor product of a tensor bundle with a density bundle.

When studying and formulating Albert Einstein's theory of general relativity, various mathematical structures and techniques are utilized. The main tools used in this geometrical theory of gravitation are tensor fields defined on a Lorentzian manifold representing spacetime. This article is a general description of the mathematics of general relativity.

In mathematics—more specifically, in differential geometry—the musical isomorphism is an isomorphism between the tangent bundle $and the cotangent bundle of a pseudo-Riemannian manifold induced by its metric tensor. There are similar isomorphisms on symplectic manifolds. The term musical refers to the use of the symbols (flat) and (sharp).$

In mathematics, a metric connection is a connection in a vector bundle E equipped with a bundle metric; that is, a metric for which the inner product of any two vectors will remain the same when those vectors are parallel transported along any curve. This is equivalent to:

<span class="mw-page-title-main">Calculus of moving surfaces</span> Extension of the classical tensor calculus

The calculus of moving surfaces (CMS) is an extension of the classical tensor calculus to deforming manifolds. Central to the CMS is the Tensorial Time Derivative $whose original definition was put forth by Jacques Hadamard. It plays the role analogous to that of the covariant derivative on differential manifolds in that it produces a tensor when applied to a tensor.$

In mathematics, Ricci calculus constitutes the rules of index notation and manipulation for tensors and tensor fields on a differentiable manifold, with or without a metric tensor or connection. It is also the modern name for what used to be called the absolute differential calculus, developed by Gregorio Ricci-Curbastro in 1887–1896, and subsequently popularized in a paper written with his pupil Tullio Levi-Civita in 1900. Jan Arnoldus Schouten developed the modern notation and formalism for this mathematical framework, and made contributions to the theory, during its applications to general relativity and differential geometry in the early twentieth century.

The optical metric was defined by German theoretical physicist Walter Gordon in 1923 to study the geometrical optics in curved space-time filled with moving dielectric materials.

References

Bishop, Richard L.; Goldberg, Samuel I. (1980). Tensor Analysis on Manifolds . New York: Dover. ISBN 0-486-64039-6.
Menzel, Donald H. (1961). Mathematical Physics. New York: Dover. ISBN 0-486-60056-4.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[natural_iso-1] Let L(V, V) be the space of linear maps from V to V. Then the natural map
$V^{*}\otimes V\rightarrow L(V,V)$
is defined by
$f\otimes v\mapsto g,$
where g(w) = f(w)v. Suppose that V is finite-dimensional. If {v_i} is a basis of V and {fⁱ} is the corresponding dual basis, then $f^{i}\otimes v_{j}$ maps to the transformation whose matrix in this basis has only one nonzero entry, a 1 in the i,j position. This shows that the map is an isomorphism.

[fulton_harris-2] 1 2 Fulton, William; Harris, Joe (1991). Representation Theory: A First Course. GTM. Vol. 129. New York: Springer. pp. 471–476. ISBN 0-387-97495-4.

[warner-3] Warner, Frank (1993). Foundations of Differentiable Manifolds and Lie Groups. GTM. Vol. 94. New York: Springer. pp. 54–56. ISBN 0-387-90894-3.

[physics-4] In physics (and sometimes in mathematics), indices often start with zero instead of one. In four-dimensional spacetime, indices run from 0 to 3.

[o'neill-5] 1 2 3 O'Neill, Barrett (1983). Semi-Riemannian Geometry with Applications to Relativity. Academic Press. p. 86. ISBN 0-12-526740-1.

[hartshorne-6] 1 2 Hartshorne, Robin (1977). Algebraic Geometry. New York: Springer. ISBN 0-387-90244-9.

[1]

[2]

[3]

[4]

[5]

[6]