Spinors in three dimensions

Last updated

In mathematics, the spinor concept as specialised to three dimensions can be treated by means of the traditional notions of dot product and cross product. This is part of the detailed algebraic discussion of the rotation group SO(3).

Contents

Formulation

The association of a spinor with a 2×2 complex traceless Hermitian matrix was formulated by Élie Cartan. [1]

In detail, given a vector x = (x1, x2, x3) of real (or complex) numbers, one can associate the complex matrix

In physics, this is often written as a dot product , where is the vector form of Pauli matrices. Matrices of this form have the following properties, which relate them intrinsically to the geometry of 3-space:

The last property can be used to simplify rotational operations. It is an elementary fact from linear algebra that any rotation in 3-space factors as a composition of two reflections. (More generally, any orientation-reversing orthogonal transformation is either a reflection or the product of three reflections.) Thus if R is a rotation which decomposes as the reflection in the plane perpendicular to a unit vector followed by the reflection in the plane perpendicular to , then the matrix represents the rotation of the vector through R.

Having effectively encoded all the rotational linear geometry of 3-space into a set of complex 2×2 matrices, it is natural to ask what role, if any, the 2×1 matrices (i.e., the column vectors) play. Provisionally, a spinor is a column vector

with complex entries ξ1 and ξ2.

The space of spinors is evidently acted upon by complex 2×2 matrices. As shown above, the product of two reflections in a pair of unit vectors defines a 2×2 matrix whose action on euclidean vectors is a rotation. So there is an action of rotations on spinors. However, there is one important caveat: the factorization of a rotation is not unique. Clearly, if is a representation of a rotation, then replacing R by −R will yield the same rotation. In fact, one can easily show that this is the only ambiguity which arises. Thus the action of a rotation on a spinor is always double-valued.

History

There were some precursors to Cartan's work with 2×2 complex matrices: Wolfgang Pauli had used these matrices so intensively that elements of a certain basis of a four-dimensional subspace are called Pauli matrices σi, so that the Hermitian matrix is written as a Pauli vector [2] In the mid 19th century the algebraic operations of this algebra of four complex dimensions were studied as biquaternions.

Michael Stone and Paul Goldbar, in Mathematics for Physics, contest this, saying, "The spin representations were discovered by ´Elie Cartan in 1913, some years before they were needed in physics."

Formulation using isotropic vectors

Spinors can be constructed directly from isotropic vectors in 3-space without using the quaternionic construction. To motivate this introduction of spinors, suppose that X is a matrix representing a vector x in complex 3-space. Suppose further that x is isotropic: i.e.,

Then since the determinant of X is zero there is a proportionality between its rows or columns. Thus the matrix may be written as an outer product of two complex 2-vectors:

This factorization yields an overdetermined system of equations in the coordinates of the vector x:

 

 

 

 

(1)

subject to the constraint

 

 

 

 

(2)

This system admits the solutions

 

 

 

 

(3)

Either choice of sign solves the system ( 1 ). Thus a spinor may be viewed as an isotropic vector, along with a choice of sign. Note that because of the logarithmic branching, it is impossible to choose a sign consistently so that ( 3 ) varies continuously along a full rotation among the coordinates x. In spite of this ambiguity of the representation of a rotation on a spinor, the rotations do act unambiguously by a fractional linear transformation on the ratio ξ1:ξ2 since one choice of sign in the solution ( 3 ) forces the choice of the second sign. In particular, the space of spinors is a projective representation of the orthogonal group.

As a consequence of this point of view, spinors may be regarded as a kind of "square root" of isotropic vectors. Specifically, introducing the matrix

the system ( 1 ) is equivalent to solving X = 2 ξtξC for the undetermined spinor ξ.

A fortiori, if the roles of ξ and x are now reversed, the form Q(ξ) = x defines, for each spinor ξ, a vector x quadratically in the components of ξ. If this quadratic form is polarized, it determines a bilinear vector-valued form on spinors Q(μ, ξ). This bilinear form then transforms tensorially under a reflection or a rotation.

Reality

The above considerations apply equally well whether the original euclidean space under consideration is real or complex. When the space is real, however, spinors possess some additional structure which in turn facilitates a complete description of the representation of the rotation group. Suppose, for simplicity, that the inner product on 3-space has positive-definite signature:

 

 

 

 

(4)

With this convention, real vectors correspond to Hermitian matrices. Furthermore, real rotations preserving the form ( 4 ) correspond (in the double-valued sense) to unitary matrices of determinant one. In modern terms, this presents the special unitary group SU(2) as a double cover of SO(3). As a consequence, the spinor Hermitian product

 

 

 

 

(5)

is preserved by all rotations, and therefore is canonical.

If, however, the signature of the inner product on 3-space is indefinite (i.e., non-degenerate, but also not positive definite), then the foregoing analysis must be adjusted to reflect this. Suppose then that the length form on 3-space is given by:

 

 

 

 

(4)

Then the construction of spinors of the preceding sections proceeds, but with replacing   in all the formulas. With this new convention, the matrix associated to a real vector is itself real:

.

The form ( 5 ) is no longer invariant under a real rotation (or reversal), since the group stabilizing ( 4 ) is now a Lorentz group O(2,1). Instead, the anti-Hermitian form

defines the appropriate notion of inner product for spinors in this metric signature. This form is invariant under transformations in the connected component of the identity of O(2,1).

In either case, the quartic form

is fully invariant under O(3) (or O(2,1), respectively), where Q is the vector-valued bilinear form described in the previous section. The fact that this is a quartic invariant, rather than quadratic, has an important consequence. If one confines attention to the group of special orthogonal transformations, then it is possible unambiguously to take the square root of this form and obtain an identification of spinors with their duals. In the language of representation theory, this implies that there is only one irreducible spin representation of SO(3) (or SO(2,1)) up to isomorphism. If, however, reversals (e.g., reflections in a plane) are also allowed, then it is no longer possible to identify spinors with their duals owing to a change of sign on the application of a reflection. Thus there are two irreducible spin representations of O(3) (or O(2,1)), sometimes called the pin representations.

Reality structures

The differences between these two signatures can be codified by the notion of a reality structure on the space of spinors. Informally, this is a prescription for taking a complex conjugate of a spinor, but in such a way that this may not correspond to the usual conjugate per the components of a spinor. Specifically, a reality structure is specified by a Hermitian 2 × 2 matrix K whose product with itself is the identity matrix: K2 = Id. The conjugate of a spinor with respect to a reality structure K is defined by

The particular form of the inner product on vectors (e.g., ( 4 ) or ( 4 )) determines a reality structure (up to a factor of -1) by requiring

, whenever X is a matrix associated to a real vector.

Thus K = i C is the reality structure in Euclidean signature ( 4 ), and K = Id is that for signature ( 4 ). With a reality structure in hand, one has the following results:

Examples in physics

Spinors of the Pauli spin matrices

Often, the first example of spinors that a student of physics encounters are the 2×1 spinors used in Pauli's theory of electron spin. The Pauli matrices are a vector of three 2×2 matrices that are used as spin operators.

Given a unit vector in 3 dimensions, for example (a, b, c), one takes a dot product with the Pauli spin matrices to obtain a spin matrix for spin in the direction of the unit vector.

The eigenvectors of that spin matrix are the spinors for spin-1/2 oriented in the direction given by the vector.

Example: u = (0.8, -0.6, 0) is a unit vector. Dotting this with the Pauli spin matrices gives the matrix:

The eigenvectors may be found by the usual methods of linear algebra, but a convenient trick is to note that a Pauli spin matrix is an involutory matrix, that is, the square of the above matrix is the identity matrix.

Thus a (matrix) solution to the eigenvector problem with eigenvalues of ±1 is simply 1 ± Su. That is,

One can then choose either of the columns of the eigenvector matrix as the vector solution, provided that the column chosen is not zero. Taking the first column of the above, eigenvector solutions for the two eigenvalues are:

The trick used to find the eigenvectors is related to the concept of ideals, that is, the matrix eigenvectors (1 ± Su)/2 are projection operators or idempotents and therefore each generates an ideal in the Pauli algebra. The same trick works in any Clifford algebra, in particular the Dirac algebra that is discussed below. These projection operators are also seen in density matrix theory where they are examples of pure density matrices.

More generally, the projection operator for spin in the (a, b, c) direction is given by

and any non zero column can be taken as the projection operator. While the two columns appear different, one can use a2 + b2 + c2 = 1 to show that they are multiples (possibly zero) of the same spinor.

General remarks

In atomic physics and quantum mechanics, the property of spin plays a major role. In addition to their other properties all particles possess a non-classical property, i.e., which has no correspondence at all in conventional physics, namely the spin, which is a kind of intrinsic angular momentum. In the position representation, instead of a wavefunction without spin, ψ = ψ(r), one has with spin: ψ = ψ(r, σ), where σ takes the following discrete set of values:

.

The total angular momentum operator, , of a particle corresponds to the sum of the orbital angular momentum (i.e., there only integers are allowed) and the intrinsic part, the spin. One distinguishes bosons (S = 0, ±1, ±2, ...) and fermions (S = ±1/2, ±3/2, ±5/2, ...).

See also

Related Research Articles

<span class="mw-page-title-main">Pauli matrices</span> Matrices important in quantum mechanics and the study of spin

In mathematical physics and mathematics, the Pauli matrices are a set of three 2 × 2 complex matrices which are Hermitian, involutory and unitary. Usually indicated by the Greek letter sigma, they are occasionally denoted by tau when used in connection with isospin symmetries.

<span class="mw-page-title-main">Spinor</span> Non-tensorial representation of the spin group; represents fermions in physics

In geometry and physics, spinors are elements of a complex number-based vector space that can be associated with Euclidean space. A spinor transforms linearly when the Euclidean space is subjected to a slight (infinitesimal) rotation, but unlike geometric vectors and tensors, a spinor transforms to its negative when the space rotates through 360°. It takes a rotation of 720° for a spinor to go back to its original state. This property characterizes spinors: spinors can be viewed as the "square roots" of vectors.

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-12 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity, and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine structure of the hydrogen spectrum in a completely rigorous way.

In mathematics, particularly linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized. This is extremely useful because computations involving a diagonalizable matrix can often be reduced to much simpler computations involving the corresponding diagonal matrix. The concept of diagonalization is relatively straightforward for operators on finite-dimensional vector spaces but requires some modification for operators on infinite-dimensional spaces. In general, the spectral theorem identifies a class of linear operators that can be modeled by multiplication operators, which are as simple as one can hope to find. In more abstract language, the spectral theorem is a statement about commutative C*-algebras. See also spectral theory for a historical perspective.

<span class="mw-page-title-main">Singular value decomposition</span> Matrix decomposition

In linear algebra, the singular value decomposition (SVD) is a factorization of a real or complex matrix. It generalizes the eigendecomposition of a square normal matrix with an orthonormal eigenbasis to any matrix. It is related to the polar decomposition.

In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space under the operation of composition.

In mathematics, a Hermitian matrix is a complex square matrix that is equal to its own conjugate transpose—that is, the element in the i-th row and j-th column is equal to the complex conjugate of the element in the j-th row and i-th column, for all indices i and j:

<span class="mw-page-title-main">Lorentz group</span> Lie group of Lorentz transformations

In physics and mathematics, the Lorentz group is the group of all Lorentz transformations of Minkowski spacetime, the classical and quantum setting for all (non-gravitational) physical phenomena. The Lorentz group is named for the Dutch physicist Hendrik Lorentz.

In quantum field theory, the Dirac spinor is the spinor that describes all known fundamental particles that are fermions, with the possible exception of neutrinos. It appears in the plane-wave solution to the Dirac equation, and is a certain combination of two Weyl spinors, specifically, a bispinor that transforms "spinorially" under the action of the Lorentz group.

<span class="mw-page-title-main">Bloch sphere</span> Geometrical representation of the pure state space of a two-level quantum mechanical system

In quantum mechanics and computing, the Bloch sphere is a geometrical representation of the pure state space of a two-level quantum mechanical system (qubit), named after the physicist Felix Bloch.

In the study of Dirac fields in quantum field theory, Richard Feynman invented the convenient Feynman slash notation. If A is a covariant vector,

The Rayleigh–Ritz method is a direct numerical method of approximating eigenvalues, originated in the context of solving physical boundary value problems and named after Lord Rayleigh and Walther Ritz.

In linear algebra, an eigenvector or characteristic vector of a linear transformation is a nonzero vector that changes at most by a constant factor when that linear transformation is applied to it. The corresponding eigenvalue, often represented by , is the multiplying factor.

In mathematical physics, the gamma matrices, also called the Dirac matrices, are a set of conventional matrices with specific anticommutation relations that ensure they generate a matrix representation of the Clifford algebra It is also possible to define higher-dimensional gamma matrices. When interpreted as the matrices of the action of a set of orthogonal basis vectors for contravariant vectors in Minkowski space, the column vectors on which the matrices act become a space of spinors, on which the Clifford algebra of spacetime acts. This in turn makes it possible to represent infinitesimal spatial rotations and Lorentz boosts. Spinors facilitate spacetime computations in general, and in particular are fundamental to the Dirac equation for relativistic spin particles. Gamma matrices were introduced by Dirac in 1928.

In physics, the Majorana equation is a relativistic wave equation. It is named after the Italian physicist Ettore Majorana, who proposed it in 1937 as a means of describing fermions that are their own antiparticle. Particles corresponding to this equation are termed Majorana particles, although that term now has a more expansive meaning, referring to any fermionic particle that is its own anti-particle.

In mathematics and physics, in particular quantum information, the term generalized Pauli matrices refers to families of matrices which generalize the properties of the Pauli matrices. Here, a few classes of such matrices are summarized.

<span class="mw-page-title-main">Classical group</span>

In mathematics, the classical groups are defined as the special linear groups over the reals R, the complex numbers C and the quaternions H together with special automorphism groups of symmetric or skew-symmetric bilinear forms and Hermitian or skew-Hermitian sesquilinear forms defined on real, complex and quaternionic finite-dimensional vector spaces. Of these, the complex classical Lie groups are four infinite families of Lie groups that together with the exceptional groups exhaust the classification of simple Lie groups. The compact classical groups are compact real forms of the complex classical groups. The finite analogues of the classical groups are the classical groups of Lie type. The term "classical group" was coined by Hermann Weyl, it being the title of his 1939 monograph The Classical Groups.

In the Standard Model, using quantum field theory it is conventional to use the helicity basis to simplify calculations. In this basis, the spin is quantized along the axis in the direction of motion of the particle.

In physics, and specifically in quantum field theory, a bispinor is a mathematical construction that is used to describe some of the fundamental particles of nature, including quarks and electrons. It is a specific embodiment of a spinor, specifically constructed so that it is consistent with the requirements of special relativity. Bispinors transform in a certain "spinorial" fashion under the action of the Lorentz group, which describes the symmetries of Minkowski spacetime. They occur in the relativistic spin-1/2 wave function solutions to the Dirac equation.

<span class="mw-page-title-main">Symmetry in quantum mechanics</span> Properties underlying modern physics

Symmetries in quantum mechanics describe features of spacetime and particles which are unchanged under some transformation, in the context of quantum mechanics, relativistic quantum mechanics and quantum field theory, and with applications in the mathematical formulation of the standard model and condensed matter physics. In general, symmetry in physics, invariance, and conservation laws, are fundamentally important constraints for formulating physical theories and models. In practice, they are powerful methods for solving problems and predicting what can happen. While conservation laws do not always give the answer to the problem directly, they form the correct constraints and the first steps to solving a multitude of problems.

References

  1. 1 2 Cartan, Élie (1981) [1938], The Theory of Spinors, New York: Dover Publications, ISBN   978-0-486-64070-9, MR   0631850
  2. The Pauli vector is a formal device. It may be thought of as an element of M2(ℂ) ⊗ ℝ3, where the tensor product space is endowed with a mapping ⋅: ℝ3 × M2(ℂ) ⊗ ℝ3M2(ℂ).