Minkowski space

Last updated
Hermann Minkowski (1864-1909) found that the theory of special relativity, introduced by his former student Albert Einstein, could be best understood as a four-dimensional space, since known as the Minkowski spacetime. De Raum zeit Minkowski Bild.jpg
Hermann Minkowski (1864–1909) found that the theory of special relativity, introduced by his former student Albert Einstein, could be best understood as a four-dimensional space, since known as the Minkowski spacetime.

In mathematical physics, Minkowski space (or Minkowski spacetime) ( /mɪŋˈkɔːfski,-ˈkɒf-/ [1] ) combines inertial space and time manifolds (x,y) with a non-inertial reference frame of space and time (x',t') into a four-dimensional model relating a position (inertial frame of reference) to the field (physics). A four-vector (x,y,z,t) consisting of coordinate axes such as a Euclidean space plus time may be used with the non-inertial frame to illustrate specifics of motion, but should not be confused with the spacetime model generally. The model helps show how a spacetime interval between any two events is independent of the inertial frame of reference in which they are recorded. Although initially developed by mathematician Hermann Minkowski for Maxwell's equations of electromagnetism, the mathematical structure of Minkowski spacetime was shown to be implied by the postulates of special relativity. [2]


Minkowski space is closely associated with Einstein's theories of special relativity and general relativity and is the most common mathematical structure on which special relativity is formulated. While the individual components in Euclidean space and time may differ due to length contraction and time dilation, in Minkowski spacetime, all frames of reference will agree on the total distance in spacetime between events. [nb 1] Because it treats time differently to how it treats the 3 spatial dimensions, Minkowski space differs from four-dimensional Euclidean space.

In 3-dimensional Euclidean space, the isometry group (the maps preserving the regular Euclidean distance) is the Euclidean group. It is generated by rotations, reflections and translations. When time is appended as a fourth dimension, the further transformations of translations in time and Lorentz boosts are added, and the group of all these transformations is called the Poincaré group. Minkowski's model follows special relativity where motion causes time dilation changing the scale applied to the frame in motion and shifts the phase of light.

Spacetime is equipped with an indefinite non-degenerate bilinear form, variously called the Minkowski metric, [3] the Minkowski norm squared or Minkowski inner product depending on the context. [nb 2] The Minkowski inner product is defined so as to yield the spacetime interval between two events when given their coordinate difference vector as argument. [4] Equipped with this inner product, the mathematical model of spacetime is called Minkowski space. The group of transformations for Minkowski space, preserving the spacetime interval (as opposed to the spatial Euclidean distance) is the Poincaré group.


Complex Minkowski spacetime

In his second relativity paper in 190506, Henri Poincaré showed [5] how, by taking time to be an imaginary fourth spacetime coordinate ict, where c is the speed of light and i is the imaginary unit, Lorentz transformations can be visualized as ordinary rotations of the four dimensional Euclidean sphere. The four-dimensional spacetime can be visualized as a four-dimensional sphere, with each point on the sphere representing an event in spacetime. The Lorentz transformations can then be thought of as rotations of this four-dimensional sphere, where the rotation axis corresponds to the direction of relative motion between the two observers and the rotation angle is related to their relative velocity.

To see this, consider the coordinates of an event in spacetime represented as a four-vector (t, x, y, z). A Lorentz transformation can be represented as a matrix that acts on the four-vector and changes its components. This matrix can be thought of as a rotation matrix in four-dimensional space, which rotates the four-vector about a particular axis.

Poincaré set c = 1 for convenience. Rotations in planes spanned by two space unit vectors appear in coordinate space as well as in physical spacetime as Euclidean rotations, and are interpreted in the ordinary sense. The "rotation" in a plane spanned by a space unit vector and a time unit vector, while formally still a rotation in coordinate space, is a Lorentz boost in physical spacetime with real inertial coordinates. The analogy with Euclidean rotations is only partial since the radius of the sphere is actually imaginary which turns rotations into rotations in hyperbolic space (see hyperbolic rotation).

This idea, which was mentioned only briefly by Poincaré, was elaborated by Minkowski in a paper in German published in 1908 called "The Fundamental Equations for Electromagnetic Processes in Moving Bodies". [6] Minkowski, using this formulation, restated the then-recent theory of relativity of Einstein. In particular, by restating the Maxwell equations as a symmetrical set of equations in the four variables (x, y, z, ict) combined with redefined vector variables for electromagnetic quantities, he was able to show directly and very simply their invariance under Lorentz transformation. He also made other important contributions and used matrix notation for the first time in this context. From his reformulation he concluded that time and space should be treated equally, and so arose his concept of events taking place in a unified four-dimensional spacetime continuum.

Real Minkowski spacetime

In a further development in his 1908 "Space and Time" lecture, [7] Minkowski gave an alternative formulation of this idea that used a real time coordinate instead of an imaginary one, representing the four variables (x, y, z, t) of space and time in coordinate form in a four dimensional real vector space. Points in this space correspond to events in spacetime. In this space, there is a defined light-cone associated with each point, and events not on the light-cone are classified by their relation to the apex as spacelike or timelike. It is principally this view of spacetime that is current nowadays, although the older view involving imaginary time has also influenced special relativity.

In the English translation of Minkowski's paper, the Minkowski metric as defined below is referred to as the line element. The Minkowski inner product of below appears unnamed when referring to orthogonality (which he calls normality) of certain vectors, and the Minkowski norm squared is referred to (somewhat cryptically, perhaps this is translation dependent) as "sum".

Minkowski's principal tool is the Minkowski diagram, and he uses it to define concepts and demonstrate properties of Lorentz transformations (e.g. proper time and length contraction) and to provide geometrical interpretation to the generalization of Newtonian mechanics to relativistic mechanics. For these special topics, see the referenced articles, as the presentation below will be principally confined to the mathematical structure (Minkowski metric and from it derived quantities and the Poincaré group as symmetry group of spacetime) following from the invariance of the spacetime interval on the spacetime manifold as consequences of the postulates of special relativity, not to specific application or derivation of the invariance of the spacetime interval. This structure provides the background setting of all present relativistic theories, barring general relativity for which flat Minkowski spacetime still provides a springboard as curved spacetime is locally Lorentzian.

Minkowski, aware of the fundamental restatement of the theory which he had made, said

The views of space and time which I wish to lay before you have sprung from the soil of experimental physics, and therein lies their strength. They are radical. Henceforth space by itself, and time by itself, are doomed to fade away into mere shadows, and only a kind of union of the two will preserve an independent reality.

Hermann Minkowski, 1908, 1909 [7]

Though Minkowski took an important step for physics, Albert Einstein saw its limitation:

At a time when Minkowski was giving the geometrical interpretation of special relativity by extending the Euclidean three-space to a quasi-Euclidean four-space that included time, Einstein was already aware that this is not valid, because it excludes the phenomenon of gravitation. He was still far from the study of curvilinear coordinates and Riemannian geometry, and the heavy mathematical apparatus entailed. [8]

For further historical information see references Galison (1979), Corry (1997) and Walter (1999).

Causal structure

Subdivision of Minkowski spacetime with respect to an event in four disjoint sets. The light cone, the absolute future, the absolute past, and elsewhere. The terminology is from Sard (1970). World line.svg
Subdivision of Minkowski spacetime with respect to an event in four disjoint sets. The light cone, the absolute future, the absolute past, and elsewhere. The terminology is from Sard (1970).

Where v is velocity, and x, y, and z are Cartesian coordinates in 3-dimensional space, and c is the constant representing the universal speed limit, and t is time, the four-dimensional vector v = (ct, x, y, z) = (ct, r) is classified according to the sign of c2t2r2. A vector is timelike if c2t2 > r2, spacelike if c2t2 < r2, and null or lightlike if c2t2 = r2. This can be expressed in terms of the sign of η(v, v) as well, which depends on the signature. The classification of any vector will be the same in all frames of reference that are related by a Lorentz transformation (but not by a general Poincaré transformation because the origin may then be displaced) because of the invariance of the interval.

The set of all null vectors at an event [nb 3] of Minkowski space constitutes the light cone of that event. Given a timelike vector v, there is a worldline of constant velocity associated with it, represented by a straight line in a Minkowski diagram.

Once a direction of time is chosen, [nb 4] timelike and null vectors can be further decomposed into various classes. For timelike vectors one has

  1. future-directed timelike vectors whose first component is positive, (tip of vector located in absolute future in figure) and
  2. past-directed timelike vectors whose first component is negative (absolute past).

Null vectors fall into three classes:

  1. the zero vector, whose components in any basis are (0, 0, 0, 0) (origin),
  2. future-directed null vectors whose first component is positive (upper light cone), and
  3. past-directed null vectors whose first component is negative (lower light cone).

Together with spacelike vectors there are 6 classes in all.

An orthonormal basis for Minkowski space necessarily consists of one timelike and three spacelike unit vectors. If one wishes to work with non-orthonormal bases it is possible to have other combinations of vectors. For example, one can easily construct a (non-orthonormal) basis consisting entirely of null vectors, called a null basis.

Vector fields are called timelike, spacelike or null if the associated vectors are timelike, spacelike or null at each point where the field is defined.

Properties of time-like vectors

Time-like vectors have special importance in the theory of relativity as they correspond to events which are accessible to the observer at (0, 0, 0, 0) with a speed less than that of light. Of most interest are time-like vectors which are similarly directed i.e. all either in the forward or in the backward cones. Such vectors have several properties not shared by space-like vectors. These arise because both forward and backward cones are convex whereas the space-like region is not convex.

Scalar product

The scalar product of two time-like vectors u1 = (t1, x1, y1, z1) and u2 = (t2, x2, y2, z2) is

Positivity of scalar product: An important property is that the scalar product of two similarly directed time-like vectors is always positive. This can be seen from the reversed Cauchy–Schwarz inequality below. It follows that if the scalar product of two vectors is zero then one of these at least, must be space-like. The scalar product of two space-like vectors can be positive or negative as can be seen by considering the product of two space-like vectors having orthogonal spatial components and times either of different or the same signs.

Using the positivity property of time-like vectors it is easy to verify that a linear sum with positive coefficients of similarly directed time-like vectors is also similarly directed time-like (the sum remains within the light-cone because of convexity).

Norm and reversed Cauchy inequality

The norm of a time-like vector u = (ct, x, y, z) is defined as

The reversed Cauchy inequality is another consequence of the convexity of either light-cone. [9] For two distinct similarly directed time-like vectors u1 and u2 this inequality is

or algebraically,

From this the positivity property of the scalar product can be seen.

The reversed triangle inequality

For two similarly directed time-like vectors u and w, the inequality is [10]

where the equality holds when the vectors are linearly dependent.

The proof uses the algebraic definition with the reversed Cauchy inequality: [11]

The result now follows by taking the square root on both sides.

Mathematical structure

It is assumed below that spacetime is endowed with a coordinate system corresponding to an inertial frame. This provides an origin, which is necessary in order to be able to refer to spacetime as being modeled as a vector space. This is not really physically motivated in that a canonical origin ("central" event in spacetime) should exist. One can get away with less structure, that of an affine space, but this would needlessly complicate the discussion and would not reflect how flat spacetime is normally treated mathematically in modern introductory literature.

For an overview, Minkowski space is a 4-dimensional real vector space equipped with a nondegenerate, symmetric bilinear form on the tangent space at each point in spacetime, here simply called the Minkowski inner product, with metric signature either (+ − − −) or (− + + +). The tangent space at each event is a vector space of the same dimension as spacetime, 4.

Tangent vectors

A pictorial representation of the tangent space at a point, x, on a sphere. This vector space can be thought of as a subspace of R itself. Then vectors in it would be called geometrical tangent vectors. By the same principle, the tangent space at a point in flat spacetime can be thought of as a subspace of spacetime which happens to be all of spacetime. Image Tangent-plane.svg
A pictorial representation of the tangent space at a point, x, on a sphere. This vector space can be thought of as a subspace of R itself. Then vectors in it would be called geometrical tangent vectors. By the same principle, the tangent space at a point in flat spacetime can be thought of as a subspace of spacetime which happens to be all of spacetime.

In practice, one need not be concerned with the tangent spaces. The vector space nature of Minkowski space allows for the canonical identification of vectors in tangent spaces at points (events) with vectors (points, events) in Minkowski space itself. See e.g. Lee (2003 , Proposition 3.8.) or Lee (2012 , Proposition 3.13.) These identifications are routinely done in mathematics. They can be expressed formally in Cartesian coordinates as [12]

with basis vectors in the tangent spaces defined by

Here p and q are any two events and the second basis vector identification is referred to as parallel transport. The first identification is the canonical identification of vectors in the tangent space at any point with vectors in the space itself. The appearance of basis vectors in tangent spaces as first order differential operators is due to this identification. It is motivated by the observation that a geometrical tangent vector can be associated in a one-to-one manner with a directional derivative operator on the set of smooth functions. This is promoted to a definition of tangent vectors in manifolds not necessarily being embedded in Rn. This definition of tangent vectors is not the only possible one as ordinary n-tuples can be used as well.

Definitions of tangent vectors as ordinary vectors

A tangent vector at a point p may be defined, here specialized to Cartesian coordinates in Lorentz frames, as 4 × 1 column vectors v associated to each Lorentz frame related by Lorentz transformation Λ such that the vector v in a frame related to some frame by Λ transforms according to v → Λv. This is the same way in which the coordinates xμ transform. Explicitly,

This definition is equivalent to the definition given above under a canonical isomorphism.

For some purposes it is desirable to identify tangent vectors at a point p with displacement vectors at p, which is, of course, admissible by essentially the same canonical identification. [13] The identifications of vectors referred to above in the mathematical setting can correspondingly be found in a more physical and explicitly geometrical setting in Misner, Thorne & Wheeler (1973). They offer various degree of sophistication (and rigor) depending on which part of the material one chooses to read.

Metric signature

The metric signature refers to which sign the Minkowski inner product yields when given space (spacelike to be specific, defined further down) and time basis vectors (timelike) as arguments. Further discussion about this theoretically inconsequential, but practically necessary, choice for purposes of internal consistency and convenience is deferred to the hide box below.

The choice of metric signature

In general, but with several exceptions, mathematicians and general relativists prefer spacelike vectors to yield a positive sign, (− + + +), while particle physicists tend to prefer timelike vectors to yield a positive sign, (+ − − −). Authors covering several areas of physics, e.g. Steven Weinberg and Landau and Lifshitz ((− + + +) and (+ − − −) respectively) stick to one choice regardless of topic. Arguments for the former convention include "continuity" from the Euclidean case corresponding to the non-relativistic limit c → ∞. Arguments for the latter include that minus signs, otherwise ubiquitous in particle physics, go away. Yet other authors, especially of introductory texts, e.g. Kleppner & Kolenkow (1978), do not choose a signature at all, but instead opt to coordinatize spacetime such that the time coordinate (but not time itself!) is imaginary. This removes the need of the explicit introduction of a metric tensor (which may seem as an extra burden in an introductory course), and one needs not be concerned with covariant vectors and contravariant vectors (or raising and lowering indices) to be described below. The inner product is instead effected by a straightforward extension of the dot product in R3 to R3 × C. This works in the flat spacetime of special relativity, but not in the curved spacetime of general relativity, see Misner, Thorne & Wheeler (1973, Box 2.1, Farewell to ict) (who, by the way use (− + + +)). MTW also argues that it hides the true indefinite nature of the metric and the true nature of Lorentz boosts, which aren't rotations. It also needlessly complicates the use of tools of differential geometry that are otherwise immediately available and useful for geometrical description and calculation even in the flat spacetime of special relativity, e.g. of the electromagnetic field.


Mathematically associated to the bilinear form is a tensor of type (0,2) at each point in spacetime, called the Minkowski metric. [nb 5] The Minkowski metric, the bilinear form, and the Minkowski inner product are all the same object; it is a bilinear function that accepts two (contravariant) vectors and returns a real number. In coordinates, this is the 4×4 matrix representing the bilinear form.

For comparison, in general relativity, a Lorentzian manifold L is likewise equipped with a metric tensor g, which is a nondegenerate symmetric bilinear form on the tangent space TpL at each point p of L. In coordinates, it may be represented by a 4×4 matrix depending on spacetime position. Minkowski space is thus a comparatively simple special case of a Lorentzian manifold. Its metric tensor is in coordinates the same symmetric matrix at every point of M, and its arguments can, per above, be taken as vectors in spacetime itself.

Introducing more terminology (but not more structure), Minkowski space is thus a pseudo-Euclidean space with total dimension n = 4 and signature (3, 1) or (1, 3). Elements of Minkowski space are called events. Minkowski space is often denoted R3,1 or R1,3 to emphasize the chosen signature, or just M. It is perhaps the simplest example of a pseudo-Riemannian manifold.

Then mathematically, the metric is a bilinear form on an abstract four-dimensional real vector space , that is,

where has signature , and signature is a coordinate-invariant property of . The space of bilinear maps forms a vector space which can be identified with , and may be equivalently viewed as an element of this space. By making a choice of orthonormal basis , we can identify with the space . The notation is meant to emphasise the fact that and are not just vector spaces but have added structure. .

An interesting example of non-inertial coordinates for (part of) Minkowski spacetime are the Born coordinates. Another useful set of coordinates are the light-cone coordinates.

Pseudo-Euclidean metrics

The Minkowski inner product is not an inner product, since it is not positive-definite, i.e. the quadratic form η(v, v) need not be positive for nonzero v. The positive-definite condition has been replaced by the weaker condition of non-degeneracy. The bilinear form is said to be indefinite. The Minkowski metric η is the metric tensor of Minkowski space. It is a pseudo-Euclidean metric, or more generally a constant pseudo-Riemannian metric in Cartesian coordinates. As such it is a nondegenerate symmetric bilinear form, a type (0, 2) tensor. It accepts two arguments up, vp, vectors in TpM, pM, the tangent space at p in M. Due to the above-mentioned canonical identification of TpM with M itself, it accepts arguments u, v with both u and v in M.

As a notational convention, vectors v in M, called 4-vectors, are denoted in italics, and not, as is common in the Euclidean setting, with boldface v. The latter is generally reserved for the 3-vector part (to be introduced below) of a 4-vector.

The definition [14]

yields an inner product-like structure on M, previously and also henceforth, called the Minkowski inner product, similar to the Euclidean inner product, but it describes a different geometry. It is also called the relativistic dot product. If the two arguments are the same,

the resulting quantity will be called the Minkowski norm squared. The Minkowski inner product satisfies the following properties.

Linearity in first argument

The first two conditions imply bilinearity. The defining difference between a pseudo-inner product and an inner product proper is that the former is not required to be positive definite, that is, η(u, u) < 0 is allowed.

The most important feature of the inner product and norm squared is that these are quantities unaffected by Lorentz transformations. In fact, it can be taken as the defining property of a Lorentz transformation that it preserves the inner product (i.e. the value of the corresponding bilinear form on two vectors). This approach is taken more generally for all classical groups definable this way in classical group. There, the matrix Φ is identical in the case O(3, 1) (the Lorentz group) to the matrix η to be displayed below.

Two vectors v and w are said to be orthogonal if η(v, w) = 0. For a geometric interpretation of orthogonality in the special case when η(v, v) ≤ 0 and η(w, w) ≥ 0 (or vice versa), see hyperbolic orthogonality.

A vector e is called a unit vector if η(e, e) = ±1. A basis for M consisting of mutually orthogonal unit vectors is called an orthonormal basis. [15]

For a given inertial frame, an orthonormal basis in space, combined with the unit time vector, forms an orthonormal basis in Minkowski space. The number of positive and negative unit vectors in any such basis is a fixed pair of numbers, equal to the signature of the bilinear form associated with the inner product. This is Sylvester's law of inertia.

More terminology (but not more structure): The Minkowski metric is a pseudo-Riemannian metric, more specifically, a Lorentzian metric, even more specifically, the Lorentz metric, reserved for 4-dimensional flat spacetime with the remaining ambiguity only being the signature convention.

Minkowski metric

From the second postulate of special relativity, together with homogeneity of spacetime and isotropy of space, it follows that the spacetime interval between two arbitrary events called 1 and 2 is: [16]

This quantity is not consistently named in the literature. The interval is sometimes referred to as the square root of the interval as defined here. [17] [18]

The invariance of the interval under coordinate transformations between inertial frames follows from the invariance of

provided the transformations are linear. This quadratic form can be used to define a bilinear form

via the polarization identity. This bilinear form can in turn be written as

Where [η] is a matrix associated with η. While possibly confusing, it is common practice to denote [η] with just η. The matrix is read off from the explicit bilinear form as

and the bilinear form

with which this section started by assuming its existence, is now identified.

For definiteness and shorter presentation, the signature (− + + +) is adopted below. This choice (or the other possible choice) has no (known) physical implications. The symmetry group preserving the bilinear form with one choice of signature is isomorphic (under the map given here) with the symmetry group preserving the other choice of signature. This means that both choices are in accord with the two postulates of relativity. Switching between the two conventions is straightforward. If the metric tensor η has been used in a derivation, go back to the earliest point where it was used, substitute η for η, and retrace forward to the desired formula with the desired metric signature.

Standard basis

A standard or orthonormal basis for Minkowski space is a set of four mutually orthogonal vectors {e0, e1, e2, e3} such that

These conditions can be written compactly in the form

Relative to a standard basis, the components of a vector v are written (v0, v1, v2, v3) where the Einstein notation is used to write v = vμeμ. The component v0 is called the timelike component of v while the other three components are called the spatial components. The spatial components of a 4-vector v may be identified with a 3-vector v = (v1, v2, v3).

In terms of components, the Minkowski inner product between two vectors v and w is given by


Here lowering of an index with the metric was used.

There are many possible choices of standard basis obeying the condition Any two such bases are related in some sense by a Lorentz transformation, either by a change-of-basis matrix , a real matrix satisfying

or a linear map on the abstract vector space satisfying, for any pair of vectors

Then if we have two different bases and , we can write or . While it might be tempting to think of and as the same thing, mathematically they are elements of different spaces, and act on the space of standard bases from different sides.

Raising and lowering of indices

Linear functionals (1-forms) a, b and their sum s and vectors u, v, w, in 3d Euclidean space. The number of (1-form) hyperplanes intersected by a vector equals the inner product. 1-form linear functional.svg
Linear functionals (1-forms) α, β and their sum σ and vectors u, v, w, in 3d Euclidean space. The number of (1-form) hyperplanes intersected by a vector equals the inner product.

Technically, a non-degenerate bilinear form provides a map between a vector space and its dual; in this context, the map is between the tangent spaces of M and the cotangent spaces of M. At a point in M, the tangent and cotangent spaces are dual vector spaces (so the dimension of the cotangent space at an event is also 4). Just as an authentic inner product on a vector space with one argument fixed, by Riesz representation theorem, may be expressed as the action of a linear functional on the vector space, the same holds for the Minkowski inner product of Minkowski space. [20]

Thus if vμ are the components of a vector in a tangent space, then ημνvμ = vν are the components of a vector in the cotangent space (a linear functional). Due to the identification of vectors in tangent spaces with vectors in M itself, this is mostly ignored, and vectors with lower indices are referred to as covariant vectors. In this latter interpretation, the covariant vectors are (almost always implicitly) identified with vectors (linear functionals) in the dual of Minkowski space. The ones with upper indices are contravariant vectors. In the same fashion, the inverse of the map from tangent to cotangent spaces, explicitly given by the inverse of η in matrix representation, can be used to define raising of an index. The components of this inverse are denoted ημν. It happens that ημν = ημν. These maps between a vector space and its dual can be denoted η (eta-flat) and η (eta-sharp) by the musical analogy. [21]

Contravariant and covariant vectors are geometrically very different objects. The first can and should be thought of as arrows. A linear functional can be characterized by two objects: its kernel, which is a hyperplane passing through the origin, and its norm. Geometrically thus, covariant vectors should be viewed as a set of hyperplanes, with spacing depending on the norm (bigger = smaller spacing), with one of them (the kernel) passing through the origin. The mathematical term for a covariant vector is 1-covector or 1-form (though the latter is usually reserved for covector fields).

Misner, Thorne & Wheeler (1973) uses a vivid analogy with wave fronts of a de Broglie wave (scaled by a factor of Planck's reduced constant) quantum mechanically associated to a momentum four-vector to illustrate how one could imagine a covariant version of a contravariant vector. The inner product of two contravariant vectors could equally well be thought of as the action of the covariant version of one of them on the contravariant version of the other. The inner product is then how many time the arrow pierces the planes. The mathematical reference, Lee (2003), offers the same geometrical view of these objects (but mentions no piercing).

The electromagnetic field tensor is a differential 2-form, which geometrical description can as well be found in MTW.

One may, of course, ignore geometrical views all together (as is the style in e.g. Weinberg (2002) and Landau & Lifshitz 2002) and proceed algebraically in a purely formal fashion. The time-proven robustness of the formalism itself, sometimes referred to as index gymnastics, ensures that moving vectors around and changing from contravariant to covariant vectors and vice versa (as well as higher order tensors) is mathematically sound. Incorrect expressions tend to reveal themselves quickly.

Coordinate free raising and lowering

Given a bilinear form , the lowered version of a vector can be thought of as the partial evaluation of , that is, there is an associated partial evaluation map

The lowered vector is then the dual map . Note it does not matter which argument is partially evaluated due to symmetry of .

Non-degeneracy is then equivalent to injectivity of the partial evaluation map, or equivalently non-degeneracy tells us the kernel of the map is trivial. In finite dimension, as we have here, and noting that the dimension of a finite dimensional space is equal to the dimension of the dual, this is enough to conclude the partial evaluation map is a linear isomorphism from to . This then allows definition of the inverse partial evaluation map,

which allows us to define the inverse metric

where the two different usages of can be told apart by the argument each is evaluated on. This can then be used to raise indices. If we work in a coordinate basis, we find that the metric is indeed the matrix inverse to

The formalism of the Minkowski metric

The present purpose is to show semi-rigorously how formally one may apply the Minkowski metric to two vectors and obtain a real number, i.e. to display the role of the differentials, and how they disappear in a calculation. The setting is that of smooth manifold theory, and concepts such as convector fields and exterior derivatives are introduced.

A formal approach to the Minkowski metric

A full-blown version of the Minkowski metric in coordinates as a tensor field on spacetime has the appearance

Explanation: The coordinate differentials are 1-form fields. They are defined as the exterior derivative of the coordinate functions xμ. These quantities evaluated at a point p provide a basis for the cotangent space at p. The tensor product (denoted by the symbol ) yields a tensor field of type (0, 2), i.e. the type that expects two contravariant vectors as arguments. On the right hand side, the symmetric product (denoted by the symbol or by juxtaposition) has been taken. The equality holds since, by definition, the Minkowski metric is symmetric. [22] The notation on the far right is also sometimes used for the related, but different, line element. It is not a tensor. For elaboration on the differences and similarities, see Misner, Thorne & Wheeler (1973, Box 3.2 and section 13.2.)

Tangent vectors are, in this formalism, given in terms of a basis of differential operators of the first order,

where p is an event. This operator applied to a function f gives the directional derivative of f at p in the direction of increasing xμ with xν, νμ fixed. They provide a basis for the tangent space at p.

The exterior derivative df of a function f is a covector field, i.e. an assignment of a cotangent vector to each point p, by definition such that

for each vector field X. A vector field is an assignment of a tangent vector to each point p. In coordinates X can be expanded at each point p in the basis given by the ∂/∂xν|p. Applying this with f = xμ, the coordinate function itself, and X = ∂/∂xν, called a coordinate vector field, one obtains

Since this relation holds at each point p, the dxμ|p provide a basis for the cotangent space at each p and the bases dxμ|p and ∂/∂xν|p are dual to each other,

at each p. Furthermore, one has

for general one-forms on a tangent space α, β and general tangent vectors a, b. (This can be taken as a definition, but may also be proved in a more general setting.)

Thus when the metric tensor is fed two vectors fields a, b, both expanded in terms of the basis coordinate vector fields, the result is

where aμ, bν are the component functions of the vector fields. The above equation holds at each point p, and the relation may as well be interpreted as the Minkowski metric at p applied to two tangent vectors at p.

As mentioned, in a vector space, such as that modelling the spacetime of special relativity, tangent vectors can be canonically identified with vectors in the space itself, and vice versa. This means that the tangent spaces at each point are canonically identified with each other and with the vector space itself. This explains how the right hand side of the above equation can be employed directly, without regard to spacetime point the metric is to be evaluated and from where (which tangent space) the vectors come from.

This situation changes in general relativity. There one has

where now ηg(p), i.e., g is still a metric tensor but now depending on spacetime and is a solution of Einstein's field equations. Moreover, a, bmust be tangent vectors at spacetime point p and can no longer be moved around freely.

Chronological and causality relations

Let x, yM. We say that

  1. xchronologically precedesy if yx is future-directed timelike. This relation has the transitive property and so can be written x < y.
  2. xcausally precedesy if yx is future-directed null or future-directed timelike. It gives a partial ordering of spacetime and so can be written xy.

Suppose xM is timelike. Then the simultaneous hyperplane for x is Since this hyperplane varies as x varies, there is a relativity of simultaneity in Minkowski space.


A Lorentzian manifold is a generalization of Minkowski space in two ways. The total number of spacetime dimensions is not restricted to be 4 (2 or more) and a Lorentzian manifold need not be flat, i.e. it allows for curvature.

Complexified Minkowski space

Complexified Minkowski space is defined as Mc = MiM. [23] Its real part is the Minkowski space of four-vectors, such as the four-velocity and the four-momentum, which are independent of the choice of orientation of the space. The imaginary part, on the other hand, may consist of four-pseudovectors, such as angular velocity and magnetic moment, which change their direction with a change of orientation. We introduce a pseudoscalar i which also changes sign with a change of orientation. Thus, elements of Mc are independent of the choice of the orientation.

The inner product-like structure on Mc is defined as uv = η(u,v) for any u,vMc. A relativistic pure spin of an electron or any half spin particle is described by ρ Mc as ρ = u+is, where u is the four-velocity of the particle, satisfying u2 = 1 and s is the 4D spin vector, [24] which is also the Pauli–Lubanski pseudovector satisfying s2 = −1 and us = 0.

Generalized Minkowski space

Minkowski space refers to a mathematical formulation in four dimensions. However, the mathematics can easily be extended or simplified to create an analogous generalized Minkowski space in any number of dimensions. If n ≥ 2, n-dimensional Minkowski space is a vector space of real dimension n on which there is a constant Minkowski metric of signature (n − 1, 1) or (1, n − 1). These generalizations are used in theories where spacetime is assumed to have more or less than 4 dimensions. String theory and M-theory are two examples where n > 4. In string theory, there appears conformal field theories with 1 + 1 spacetime dimensions.

de Sitter space can be formulated as a submanifold of generalized Minkowski space as can the model spaces of hyperbolic geometry (see below).


As a flat spacetime, the three spatial components of Minkowski spacetime always obey the Pythagorean Theorem. Minkowski space is a suitable basis for special relativity, a good description of physical systems over finite distances in systems without significant gravitation. However, in order to take gravity into account, physicists use the theory of general relativity, which is formulated in the mathematics of a non-Euclidean geometry. When this geometry is used as a model of physical space, it is known as curved space .

Even in curved space, Minkowski space is still a good description in an infinitesimal region surrounding any point (barring gravitational singularities). [nb 6] More abstractly, we say that in the presence of gravity spacetime is described by a curved 4-dimensional manifold for which the tangent space to any point is a 4-dimensional Minkowski space. Thus, the structure of Minkowski space is still essential in the description of general relativity.


The meaning of the term geometry for the Minkowski space depends heavily on the context. Minkowski space is not endowed with a Euclidean geometry, and not with any of the generalized Riemannian geometries with intrinsic curvature, those exposed by the model spaces in hyperbolic geometry (negative curvature) and the geometry modeled by the sphere (positive curvature). The reason is the indefiniteness of the Minkowski metric. Minkowski space is, in particular, not a metric space and not a Riemannian manifold with a Riemannian metric. However, Minkowski space contains submanifolds endowed with a Riemannian metric yielding hyperbolic geometry.

Model spaces of hyperbolic geometry of low dimension, say 2 or 3, cannot be isometrically embedded in Euclidean space with one more dimension, i.e. 3 or 4 respectively, with the Euclidean metric g, disallowing easy visualization. [nb 7] [25] By comparison, model spaces with positive curvature are just spheres in Euclidean space of one higher dimension. [26] Hyperbolic spaces can be isometrically embedded in spaces of one more dimension when the embedding space is endowed with the Minkowski metric η.

Define H1(n)
to be the upper sheet (ct > 0) of the hyperboloid

in generalized Minkowski space Mn+1 of spacetime dimension n + 1. This is one of the surfaces of transitivity of the generalized Lorentz group. The induced metric on this submanifold,

the pullback of the Minkowski metric η under inclusion, is a Riemannian metric. With this metric H1(n)
is a Riemannian manifold. It is one of the model spaces of Riemannian geometry, the hyperboloid model of hyperbolic space. It is a space of constant negative curvature −1/R2. [27] The 1 in the upper index refers to an enumeration of the different model spaces of hyperbolic geometry, and the n for its dimension. A 2(2) corresponds to the Poincaré disk model, while 3(n) corresponds to the Poincaré half-space model of dimension n.


In the definition above ι: H1(n)
is the inclusion map and the superscript star denotes the pullback. The present purpose is to describe this and similar operations as a preparation for the actual demonstration that H1(n)
actually is a hyperbolic space.

Hyperbolic stereographic projection

Red circular arc is geodesic in Poincare disk model; it projects to the brown geodesic on the green hyperboloid. HyperboloidProjection.png
Red circular arc is geodesic in Poincaré disk model; it projects to the brown geodesic on the green hyperboloid.

In order to exhibit the metric, it is necessary to pull it back via a suitable parametrization. A parametrization of a submanifold S of M is a map URmM whose range is an open subset of S. If S has the same dimension as M, a parametrization is just the inverse of a coordinate map φ: MURm. The parametrization to be used is the inverse of hyperbolic stereographic projection. This is illustrated in the figure to the left for n = 2. It is instructive to compare to stereographic projection for spheres.

Stereographic projection σ: Hn
and its inverse σ−1: RnHn
are given by

where, for simplicity, τct. The (τ, x) are coordinates on Mn+1 and the u are coordinates on Rn.

Detailed derivation


and let


then it is geometrically clear that the vector

intersects the hyperplane

once in point denoted

One has


By construction of stereographic projection one has

This leads to the system of equations

The first of these is solved for and one obtains for stereographic projection

Next, the inverse must be calculated. Use the same considerations as before, but now with

One gets

but now with depending on The condition for P lying in the hyperboloid is


leading to

With this , one obtains

Pulling back the metric

One has

and the map

The pulled back metric can be obtained by straightforward methods of calculus;

One computes according to the standard rules for computing differentials (though one is really computing the rigorously defined exterior derivatives),

and substitutes the results into the right hand side. This yields

This last equation shows that the metric on the ball is identical to the Riemannian metric h2(n)
in the Poincaré ball model, another standard model of hyperbolic geometry.

See also


  1. This makes spacetime distance an invariant.
  2. Consistent use of the terms "Minkowski inner product", "Minkowski norm" or "Minkowski metric" is intended for the bilinear form here, since it is in widespread use. It is by no means "standard" in the literature, but no standard terminology seems to exist.
  3. Translate the coordinate system so that the event is the new origin.
  4. This corresponds to the time coordinate either increasing or decreasing when proper time for any particle increases. An application of T flips this direction.
  5. For comparison and motivation of terminology, take a Riemannian metric, which provides a positive definite symmetric bilinear form, i. e. an inner product proper at each point on a manifold.
  6. This similarity between flat space and curved space at infinitesimally small distance scales is foundational to the definition of a manifold in general.
  7. There is an isometric embedding into n according to the Nash embedding theorem (Nash (1956)), but the embedding dimension is much higher, n = (m/2)(m + 1)(3m + 11) for a Riemannian manifold of dimension m.


  1. "Minkowski". Random House Webster's Unabridged Dictionary .
  2. Landau & Lifshitz 2002 , p. 4
  3. Lee 1997 , p. 31
  4. Schutz, John W. (1977). Independent Axioms for Minkowski Space–Time (illustrated ed.). CRC Press. pp. 184–185. ISBN   978-0-582-31760-4. Extract of page 184
  5. Poincaré 1905–1906 , pp. 129–176 Wikisource translation: On the Dynamics of the Electron
  6. Minkowski 1907–1908 , pp. 53–111 *Wikisource translation: s:Translation:The Fundamental Equations for Electromagnetic Processes in Moving Bodies.
  7. 1 2 Minkowski 1908–1909 , pp. 75–88 Various English translations on Wikisource: "Space and Time."
  8. Cornelius Lanczos (1972) "Einstein's Path from Special to General Relativity", pages 5–19 of General Relativity: Papers in Honour of J. L. Synge, L. O'Raifeartaigh editor, Clarendon Press, see page 11
  9. See Schutz's proof p 148, also Naber p.48
  10. Schutz p.148, Naber p.49
  11. Schutz p.148
  12. Lee 1997 , p. 15
  13. Lee 2003 , See Lee's discussion on geometric tangent vectors early in chapter 3.
  14. Giulini 2008 pp. 5,6
  15. Gregory L. Naber (2003). The Geometry of Minkowski Spacetime: An Introduction to the Mathematics of the Special Theory of Relativity (illustrated ed.). Courier Corporation. p. 8. ISBN   978-0-486-43235-9. Extract of page 8
  16. Sean M. Carroll (2019). Spacetime and Geometry (illustrated, herdruk ed.). Cambridge University Press. p. 7. ISBN   978-1-108-48839-6.
  17. Sard 1970 , p. 71
  18. Minkowski, Landau & Lifshitz 2002 , p. 4
  19. Misner, Thorne & Wheeler 1973
  20. Lee 2003. One point in Lee's proof of existence of this map needs modification (Lee deals with Riemannian metrics.). Where Lee refers to positive definiteness to show injectivity of the map, one needs instead appeal to non-degeneracy.
  21. Lee 2003 , The tangent-cotangent isomorphism p. 282.
  22. Lee 2003
  23. Y. Friedman, A Physically Meaningful Relativistic Description of the Spin State of an Electron, Symmetry 2021, 13(10), 1853; https://doi.org/10.3390/sym13101853
  24. Jackson, J.D., Classical Electrodynamics, 3rd ed.; John Wiley \& Sons: Hoboken, NJ, USA,1998
  25. Lee 1997 , p. 66
  26. Lee 1997 , p. 33
  27. Lee 1997

Related Research Articles

<span class="mw-page-title-main">Lorentz transformation</span> Family of linear transformations

In physics, the Lorentz transformations are a six-parameter family of linear transformations from a coordinate frame in spacetime to another frame that moves at a constant velocity relative to the former. The respective inverse transformation is then parameterized by the negative of this velocity. The transformations are named after the Dutch physicist Hendrik Lorentz.

<span class="mw-page-title-main">Four-momentum</span> 4D relativistic energy and momentum

In special relativity, four-momentum (also called momentum–energy or momenergy ) is the generalization of the classical three-dimensional momentum to four-dimensional spacetime. Momentum is a vector in three dimensions; similarly four-momentum is a four-vector in spacetime. The contravariant four-momentum of a particle with relativistic energy E and three-momentum p = (px, py, pz) = γmv, where v is the particle's three-velocity and γ the Lorentz factor, is

<span class="mw-page-title-main">Four-vector</span> 4-dimensional vector in relativity

In special relativity, a four-vector is an object with four components, which transform in a specific way under Lorentz transformations. Specifically, a four-vector is an element of a four-dimensional vector space considered as a representation space of the standard representation of the Lorentz group, the representation. It differs from a Euclidean vector in how its magnitude is determined. The transformations that preserve this magnitude are the Lorentz transformations, which include spatial rotations and boosts.

In physics, in particular in special relativity and general relativity, a four-velocity is a four-vector in four-dimensional spacetime that represents the relativistic counterpart of velocity, which is a three-dimensional vector in space.

<span class="mw-page-title-main">Four-current</span> 4D analogue of electric current density

In special and general relativity, the four-current is the four-dimensional analogue of the electric current density. Also known as vector current, it is used in the geometric context of four-dimensional spacetime, rather than three-dimensional space and time separately. Mathematically it is a four-vector, and is Lorentz covariant.

In differential geometry, the four-gradient is the four-vector analogue of the gradient from vector calculus.

In a relativistic theory of physics, a Lorentz scalar is an expression, formed from items of the theory, which evaluates to a scalar, invariant under any Lorentz transformation. A Lorentz scalar may be generated from e.g., the scalar product of vectors, or from contracting tensors of the theory. While the components of vectors and tensors are in general altered under Lorentz transformations, Lorentz scalars remain unchanged.

<span class="mw-page-title-main">Perfect fluid</span> Fluid fully characterized by its density and isotropic pressure

In physics, a perfect fluid is a fluid that can be completely characterized by its rest frame mass density and isotropic pressure p. Real fluids are "sticky" and contain heat. Perfect fluids are idealized models in which these possibilities are neglected. Specifically, perfect fluids have no shear stresses, viscosity, or heat conduction. Quark–gluon plasma is the closest known substance to a perfect fluid.

In general relativity, the metric tensor is the fundamental object of study. It may loosely be thought of as a generalization of the gravitational potential of Newtonian gravitation. The metric captures all the geometric and causal structure of spacetime, being used to define notions such as time, distance, volume, curvature, angle, and separation of the future and the past.

In general relativity, if two objects are set in motion along two initially parallel trajectories, the presence of a tidal gravitational force will cause the trajectories to bend towards or away from each other, producing a relative acceleration between the objects.

<span class="mw-page-title-main">Linearized gravity</span> Linear perturbations to solutions of nonlinear Einstein field equations

In the theory of general relativity, linearized gravity is the application of perturbation theory to the metric tensor that describes the geometry of spacetime. As a consequence, linearized gravity is an effective method for modeling the effects of gravity when the gravitational field is weak. The usage of linearized gravity is integral to the study of gravitational waves and weak-field gravitational lensing.

In differential geometry and mathematical physics, a spin connection is a connection on a spinor bundle. It is induced, in a canonical manner, from the affine connection. It can also be regarded as the gauge field generated by local Lorentz transformations. In some canonical formulations of general relativity, a spin connection is defined on spatial slices and can also be regarded as the gauge field generated by local rotations.

<span class="mw-page-title-main">Theoretical motivation for general relativity</span>

A theoretical motivation for general relativity, including the motivation for the geodesic equation and the Einstein field equation, can be obtained from special relativity by examining the dynamics of particles in circular orbits about the earth. A key advantage in examining circular orbits is that it is possible to know the solution of the Einstein Field Equation a priori. This provides a means to inform and verify the formalism.

<span class="mw-page-title-main">Electromagnetic stress–energy tensor</span>

In relativistic physics, the electromagnetic stress–energy tensor is the contribution to the stress–energy tensor due to the electromagnetic field. The stress–energy tensor describes the flow of energy and momentum in spacetime. The electromagnetic stress–energy tensor contains the negative of the classical Maxwell stress tensor that governs the electromagnetic interactions.

<span class="mw-page-title-main">Covariant formulation of classical electromagnetism</span>

The covariant formulation of classical electromagnetism refers to ways of writing the laws of classical electromagnetism in a form that is manifestly invariant under Lorentz transformations, in the formalism of special relativity using rectilinear inertial coordinate systems. These expressions both make it simple to prove that the laws of classical electromagnetism take the same form in any inertial coordinate system, and also provide a way to translate the fields and forces from one frame to another. However, this is not as general as Maxwell's equations in curved spacetime or non-rectilinear coordinate systems.

<span class="mw-page-title-main">Maxwell's equations in curved spacetime</span> Electromagnetism in general relativity

In physics, Maxwell's equations in curved spacetime govern the dynamics of the electromagnetic field in curved spacetime or where one uses an arbitrary coordinate system. These equations can be viewed as a generalization of the vacuum Maxwell's equations which are normally formulated in the local coordinates of flat spacetime. But because general relativity dictates that the presence of electromagnetic fields induce curvature in spacetime, Maxwell's equations in flat spacetime should be viewed as a convenient approximation.

The tetrad formalism is an approach to general relativity that generalizes the choice of basis for the tangent bundle from a coordinate basis to the less restrictive choice of a local basis, i.e. a locally defined set of four linearly independent vector fields called a tetrad or vierbein. It is a special case of the more general idea of a vielbein formalism, which is set in (pseudo-)Riemannian geometry. This article as currently written makes frequent mention of general relativity; however, almost everything it says is equally applicable to (pseudo-)Riemannian manifolds in general, and even to spin manifolds. Most statements hold simply by substituting arbitrary for . In German, "vier" translates to "four", and "viel" to "many".

<span class="mw-page-title-main">Relativistic Lagrangian mechanics</span> Mathematical formulation of special and general relativity

In theoretical physics, relativistic Lagrangian mechanics is Lagrangian mechanics applied in the context of special relativity and general relativity.

<span class="mw-page-title-main">Dirac equation in curved spacetime</span> Generalization of the Dirac equation

In mathematical physics, the Dirac equation in curved spacetime is a generalization of the Dirac equation from flat spacetime to curved spacetime, a general Lorentzian manifold.

Lagrangian field theory is a formalism in classical field theory. It is the field-theoretic analogue of Lagrangian mechanics. Lagrangian mechanics is used to analyze the motion of a system of discrete particles each with a finite number of degrees of freedom. Lagrangian field theory applies to continua and fields, which have an infinite number of degrees of freedom.


Commons-logo.svg Media related to Minkowski diagrams at Wikimedia Commons